Design and build the production systems that power the Together Cloud inference and fine-tuning APIs, enabling reliability and performance at scale; Partner with researchers, engineers, product managers, and designers to bring new features and research capabilities to the world; Perform architecture and research work for AI workloads; Analyze and improve efficiency, scalability, and stability of various system resources; Conduct design and code reviews
Requirements:
5+ years experience writing high-performance, well-tested, production quality code; Bachelor’s degree in computer science or equivalent industry experience; Demonstrated experience in building large scale, fault tolerant, distributed systems like storage, search, and computation; Expert level programmer in one or more of Python, Go, Rust, or C/C++; Experience implementing runtime inference services at scale or similar
Text:
ML Engineer, LLM Design and build the production systems that power the Together Cloud inference and fine-tuning APIs, enabling reliability and performance at scale; Partner with researchers, engineers, product managers, and designers to bring new features and research capabilities to the world; Perform architecture and research work for AI workloads; Analyze and improve efficiency, scalability, and stability of various system resources; Conduct design and code reviews 5+ years experience writing high-performance, well-tested, production quality code; Bachelor’s degree in computer science or equivalent industry experience; Demonstrated experience in building large scale, fault tolerant, distributed systems like storage, search, and computation; Expert level programmer in one or more of Python, Go, Rust, or C/C++; Experience implementing runtime inference services at scale or similar
Please click here, if the job didn't load correctly.
Please wait. You are being redirected to the job in 3 seconds.