Systems Research Engineer

  • $160k-$230k
  • Remote - United States

Remote

Software Development

Mid-level

Summary

Join Together AI as a Systems Research Engineer specializing in machine learning systems to research and build the next-generation AI platform. You will design large-scale distributed training systems and a low-latency, high-throughput inference engine, and stay up to date with the latest advancements in machine learning systems.

Requirements

  • Strong background in machine learning systems, such as distributed learning and efficient inference for large language models and diffusion models
  • Knowledge of ML/AI applications and models, especially foundation models such as large language models and diffusion models, including how they are constructed and used
  • Knowledge of system performance profiling and optimization tools for ML systems
  • Excellent problem-solving and analytical skills
  • Bachelor's, Master's, or Ph.D. degree in Computer Science or Electrical Engineering, or equivalent practical experience

Responsibilities

  • Optimize and fine-tune the existing training and inference platforms to achieve better performance and scalability
  • Collaborate with cross-functional teams to integrate cutting-edge research ideas into existing software systems
  • Develop your own ideas for optimizing the training and inference platforms, and push the frontier of machine learning systems research
  • Stay up to date with the latest advancements in machine learning systems techniques and apply them to the Together platform

Benefits

  • Competitive compensation
  • Startup equity
  • Health insurance
  • Flexible remote work