Job description
Job Title: Sr Manager, AI/ML Infra Engineering
Position Type: Full-Time / Permanent
Location: Remote (Must be based in Bay Area, Chicago, Boston for occasional in-person collaboration)
Salary Range: $250,000 + Bonus + RSU + Comprehensive Benefits
Job Description:
As a Senior Manager of AI/ML Infrastructure Engineering, you will lead the development and scaling of cloud-native AI/ML infrastructure to support both classical ML and Generative AI. This is a high-impact leadership role where you will drive the architecture and implementation of end-to-end model productionalization, from deployment to monitoring and continuous refresh.
This role sits within the AI and Data Engineering organization and partners closely with product, engineering, security, compliance, and business teams to ensure the ML infrastructure is secure, scalable, and production-grade. Your leadership will directly contribute to building a unified and intelligent AI/ML platform during a critical stage of technology transformation.
Responsibilities:
- Lead the design and development of scalable, secure, and resilient AI/ML infrastructure, supporting both traditional ML and GenAI use cases
- Own the architecture and execution of ML model lifecycle management: development, deployment, monitoring, and refresh
- Drive engineering excellence in MLOps, data lakehouse architecture, observability, and platform performance
- Collaborate with Security, Risk, Product, Architecture, and Engineering teams to align infrastructure with business and compliance needs
- Champion cloud-native infrastructure (e.g., Kubernetes, cloud platforms like AWS, GCP, Azure, or Oracle) and DevOps best practices
- Manage and mentor a high-performing engineering team, fostering growth and accountability
- Ensure alignment across technical and business stakeholders and balance near-term delivery with long-term platform goals
Requirements:
- 10+ years of experience in AI/ML infrastructure engineering, including experience with both classical ML and GenAI
- Proven track record of building and scaling ML infrastructure, MLOps, and production ML systems
- Deep expertise in cloud platforms (AWS, Azure, GCP, or Oracle), containerization (Kubernetes), and infrastructure automation
- Strong knowledge of data lakehouse architectures and data pipeline performance tuning
- Experience deploying ML models at scale with emphasis on security, observability, and lifecycle automation
- Demonstrated leadership in managing engineering teams and cross-functional technical projects
- Excellent communication and stakeholder alignment skills
About Us:
Founded in 2009, IntelliPro stands as a global leader in talent acquisition and HR solutions. Our commitment to delivering unparalleled service to clients, fostering employee growth, and building enduring partnerships sets us apart. With a dynamic presence in the USA, China, Canada, Singapore, Philippines, UK, India, Netherlands, and Germany, we continue to lead the way in global talent solutions.
IntelliPro is proud to be an Equal Opportunity Employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. We also ensure that all applicants have access to accommodations throughout the hiring process. Learn more at https://intelliprogroup.com
Compensation:
The compensation offered will depend on various factors, including location, experience, education, and job-related skills. This role includes a competitive base salary, bonus, equity, and a comprehensive benefits package, subject to eligibility.