Software Engineer, Analytics Platform

at OpenAI
๐Ÿ‡บ๐Ÿ‡ธ United States - Remote
๐Ÿ’ป Software Development๐Ÿ”ต Mid-level

Job description

About the Team

The Research Platform Analytics team designs, builds, and operates the critical foundational data and analytics infrastructure that enables research at OpenAI.

Our goal is one, and one only: accelerate the progress of research towards AGI. We do this by owning a variety of observability and analytics systems aimed at providing quality signals about our research, and own the entire lifecycle of it, starting with data production from training workloads, to ingestion, post-processing and end-user analytics products. All of this at large scale.

About the Role

As we scale up with more researchers and engineers joining OpenAI, we seek a pragmatic and passionate engineer with a strong focus on the experience for both engineers and scientists that work in our large data sets.

Our work involves building a generic data processing platform that enables researchers to store, query, and process petabyte-scale datasets efficiently. This includes developing and maintaining large-scale stream and batch data pipelines, ensuring our infrastructure scales to support ML workloads, and making trade-offs to deliver impact quickly. We work across distributed data systems, infrastructure, and observability, ensuring reliability while moving fast.

You will find yourself at home if you are comfortable with work such as scaling Kubernetes services, debugging Kafka consumer lag, diagnosing distributed systems failures, and developing new end-to-end data processing pipelinesโ€”from raw data capture to analytics using Presto, Trino, or Flink. A portion of this role involves hands-on infrastructure work, including deploying and troubleshooting core services.

This role is based in San Francisco, CA or open to being remote within the US. We use a hybrid work model of 3 days in the office per week and offer relocation assistance to new employees.

In this role, you will:

  • Build and maintain large-scale stream and batch processing pipelines (Kafka, Spark, Flink, Trino/Presto).

  • Develop a general-purpose data processing platform for handling massive datasets.

  • Scale applications for ML research, ensuring smooth operation as workloads grow.

  • Ensure the security, integrity, and compliance of data according to industry and company standards.

  • Ensure our analytics and data platforms can scale reliably to the next several orders of magnitude

  • Accelerate company productivity by empowering your fellow engineers, researchers, and teammates with excellent data tooling and systems, providing a best in case experience

  • Bring new features and capabilities to the world by partnering with product engineers, trust & safety and other teams to build the technical foundations

  • Like all other teams, we are responsible for the reliability of the systems we build. This includes an on-call rotation to respond to critical incidents as needed

You might thrive in this role if you have:

  • Proficient in Python and backend development, with experience working in large codebases (monorepos).

  • Experience building and operating large-scale stream and batch processing pipelines (Kafka, Spark, Flink, Presto/Trino).

  • Hands-on experience with Kubernetes, Terraform, and deploying/troubleshooting production systems.

  • Worked on access control, provenance, auditing, and large-scale data movement.

  • Passion for building systems that provide key insights, especially in ML training workflows.

  • Comfortable in a fast-moving environment, making trade-offs to deliver impact quickly.

  • Understanding of data transformations in ML training and inference workflows is a plus.

About OpenAI

OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.

We are an equal opportunity employer and do not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, veteran status, disability or any other legally protected status.

OpenAI Affirmative Action and Equal Employment Opportunity Policy Statement

For US Based Candidates: Pursuant to the San Francisco Fair Chance Ordinance, we will consider qualified applicants with arrest and conviction records.

We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via thisย link.

OpenAI Global Applicant Privacy Policy

At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology.

Share this job:
Please let OpenAI know you found this job on Remote First Jobs ๐Ÿ™

Similar Remote Jobs

Benefits of using Remote First Jobs

Discover Hidden Jobs

Unique jobs you won't find on other job boards.

Advanced Filters

Filter by category, benefits, seniority, and more.

Priority Job Alerts

Get timely alerts for new job openings every day.

Manage Your Job Hunt

Save jobs you like and keep a simple list of your applications.

Search remote, work from home, 100% online jobs

We help you connect with top remote-first companies.

Search jobs

Hiring remote talent? Post a job

Frequently Asked Questions

What makes Remote First Jobs different from other job boards?

Unlike other job boards that only show jobs from companies that pay to post, we actively scan over 20,000 companies to find remote positions. This means you get access to thousands more jobs, including ones from companies that don't typically post on traditional job boards. Our platform is dedicated to fully remote positions, focusing on companies that have adopted remote work as their standard practice.

How often are new jobs added?

New jobs are constantly being added as our system checks company websites every day. We process thousands of jobs daily to ensure you have access to the most up-to-date remote job listings. Our algorithms scan over 20,000 different sources daily, adding jobs to the board the moment they appear.

Can I trust the job listings on Remote First Jobs?

Yes! We verify all job listings and companies to ensure they're legitimate. Our system automatically filters out spam, junk, and fake jobs to ensure you only see real remote opportunities.

Can I suggest companies to be added to your search?

Yes! We're always looking to expand our listings and appreciate suggestions from our community. If you know of companies offering remote positions that should be included in our search, please let us know. We actively work to increase our coverage of remote job opportunities.

How do I apply for jobs?

When you find a job you're interested in, simply click the 'Apply Now' button on the job listing. This will take you directly to the company's application page. We kindly ask you to mention that you found the position through Remote First Jobs when applying, as it helps us grow and improve our service ๐Ÿ™

Apply