Summary
CentML is hiring a software engineer for its LLM Inference team to help build state-of-the-art software. Candidates should have strong coding skills in Python and C/C++, a bachelor's degree or equivalent practical experience, and 5+ years of industry experience in software engineering.
Requirements
- Bachelor's degree in Computer Science, Computer Engineering, or a related technical field, or equivalent practical experience
- Strong coding skills in Python and C/C++
- 5+ years of industry experience in software engineering
Responsibilities
- Write safe, scalable, modular, and high-quality C++ and Python code for our core backend software
- Perform benchmarking, profiling, and system-level programming for GPU applications
- Provide code reviews, design docs, and tutorials to facilitate collaboration among the team
- Write and run unit tests and performance tests for different stages of the inference pipeline
Preferred Qualifications
- Solid fundamentals in machine learning and deep learning
- Solid fundamentals in operating systems, computer architecture, and parallel programming
- Research experience in systems or machine learning
- Industry experience building large, enterprise-scale distributed systems
- Experience with training, deploying, or optimizing the inference of LLMs in production is a plus
- Experience with performance modeling, profiling, debugging, and code optimization, or architectural knowledge of CPUs and GPUs, is a plus
Benefits
- An open and inclusive work environment
- Employee stock options
- Best-in-class medical and dental benefits
- Parental leave top-up for 6 months
- Professional development budget
- Flexible vacation time to promote a healthy work-life blend