Summary
Join Together AI as a Systems Research Engineer specialized in Machine Learning Systems to research and build the next generation AI platform. You will design large-scale distributed training systems, a low-latency/high-throughput inference engine, and stay up-to-date with the latest advancements in machine learning systems.
Requirements
- Strong background in machine learning systems, such as distributed learning and efficient inference for large language models and diffusion models
- Knowledge of ML/AI applications and models, especially foundation models such as large language models and diffusion models, how they are constructed and how they are used
- Knowledge of system performance profiling and optimization tools for ML systems
- Excellent problem-solving and analytical skills
- Bachelor's, Master's, or Ph.D. degree in Computer Science, Electrical Engineering, or equivalent practical experience
Responsibilities
- Optimize and fine-tune existing training and inference platform to achieve better performance and scalability
- Collaborate with cross-functional teams to integrate cutting edge research ideas into existing software systems
- Develop your own ideas of optimizing the training and inference platforms and push the frontier of machine learning systems research
- Stay up-to-date with the latest advancements in machine learning systems techniques and apply many of them to the Together platform
Benefits
- Competitive compensation
- Startup equity
- Health insurance
- Flexibility in terms of remote work