Astronomer Logo

Staff Software Engineer Platform Infrastructure

Job Description

Astronomer empowers data teams to bring mission-critical software, analytics, and AI to life and is the company behind Astro, the industry-leading unified DataOps platform powered by Apache Airflow®. Astro accelerates building reliable data products that unlock insights, unleash AI value, and powers data-driven applications. Trusted by more than 800 of the world’s leading enterprises, Astronomer lets businesses do more with their data. To learn more, visit www.astronomer.io.

About this role:

Astronomer’s products run on a complex, multi-cloud platform — and the reliability, scalability, and operational maturity of that platform is what we’re actively investing in. The work ahead on our Platform team isn’t about managing data or curating pipelines; it’s about building the foundational systems that everything else runs on — the kind of systems where the failure modes matter, the latency budgets are real, and getting it wrong has visible consequences for hundreds of enterprise customers.

We’re looking for a Staff+-level engineer who has built production platform systems at scale before — not just consumed them. You’ve designed systems under load, reasoned carefully about failure, and made your colleagues’ lives better by giving them something solid and well-understood to build on top of. You write strategy documents and then write the code that proves them out. You’ve been the person who made the call, and lived with the consequences.

This is a foundational role at a consequential moment: your work will directly shape what Astronomer’s products — Astro, Observe, and our IDE — are capable of over the next several years. This role reports directly to the VP responsible for delivering these platforms reliably.

What you get to do:

Astronomer has a healthy and complex infrastructure estate spanning multiple cloud providers, a mix of managed and self-hosted systems, and an increasingly ambitious set of requirements as our products evolve. We have a clear sense of where we need to go; we need the right person to figure out how to get there and then go build it.

This is very much a technical role — you’ll be just as involved in building these systems as in specifying and designing them. We’re not looking for someone to write strategy documents; we’re looking for someone who writes the strategy and the code, and who has done exactly that before at scale.

  • Blaze a Trail: Own and develop our platform infrastructure strategy, with the sponsorship and responsibility to match. Map out what we need, make the calls, and own the outcomes.

  • Be an Owner: Be directly involved in deciding what we work on and how we work on it. Make promises, and keep them.

  • Do Sensible Things: Make principled build vs. buy assessments and advocate for the right tools for the right job — not the fashionable ones, not the ones already in the estate just because they’re there.

  • Garage Door Open: Create and maintain comprehensive internal documentation and decision records for systems and processes. Participate in architectural forums and make principled, open decisions that the rest of the organisation can learn from and hold us to.

What you bring to the role:

  • Distributed systems depth, grounded in practice. You have a solid working model of how production systems fail — consistency and availability tradeoffs, failure cascades, backpressure, graceful degradation. You can draw the diagram, explain the failure modes at each node, and make a reasoned argument for which ones actually matter in a given context. NALSD thinking is how you naturally approach a new system design.

  • Kubernetes at operator depth. You know what happens inside the scheduler and the control loop when things go wrong, because you’ve been there. You’ve operated clusters under real load, not just deployed workloads onto them.

  • Strong Go proficiency. The platform team writes production Go. You should be fluent: you’ve built and shipped systems in it, and you have opinions about what good Go looks like.

  • Multi-cloud experience, not just multi-cloud exposure. You’ve made considered architectural decisions across AWS, GCP, and/or Azure — not just consumed managed services, but evaluated tradeoffs between them and lived with those decisions in production.

  • Experience defining requirements and driving technology choices across an engineering organisation. You’ve been the person in the room who frames the decision correctly, not just the one who executes it.

  • Strong written and verbal communication. You can write a design doc that changes minds, and a postmortem that makes the organisation smarter. You’ve worked effectively in a globally-distributed team.

Bonus points if you have:

  • Experience with storage primitives at the system level — you’ve reasoned about when to reach for a relational store vs. an object store vs. something else, and you have real opinions informed by real failures.

  • Experience working on a SaaS/PaaS product across multiple cloud providers.

  • Familiarity with Apache Airflow or workflow orchestration systems.

#LI-Hybrid

At Astronomer, we value diversity. We are an equal opportunity employer: we do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

Share this job:
Please let Astronomer know you found this job on Remote First Jobs 🙏

903 similar remote jobs

Explore latest remote opportunities and join a team that values work flexibility.

Remote companies like Astronomer

Explore remote-first companies similar to Astronomer. Discover other top-rated employers that offer flexible schedules and work-from-anywhere options.

Wizeline Logo

Wizeline

1001-5000 www.wizeline.ai

A global technology services provider building digital products and platforms, with a focus on AI-powered solutions.

View company profile →
Keboola Logo

Keboola

Build and automate data workflows and AI pipelines with our orchestration platform.

View company profile →
Dagster Labs Logo

Dagster Labs

An open-source data orchestration platform for building, scheduling, and monitoring AI and data pipelines.

View company profile →
Emi Labs Logo

Emi Labs

A frontline recruitment automation platform that uses AI to accelerate high-volume hiring across LATAM.

View company profile →
Scaler Logo

Scaler

A platform for real estate investors and managers to analyze building data, improve fund performance, and meet sustainability reporting.

View company profile →
Willow Logo

Willow

An AI-driven digital twin platform that optimizes building operations, reduces costs, and enhances sustainability.

View company profile →

Project: Career Search

Rev. 2026.5

[ Remote Jobs ]
Direct Access

We source jobs directly from 21,000+ company career pages. No intermediaries.

01

Discover Hidden Jobs

Unique jobs you won't find on other job boards.

02

Advanced Filters

Filter by category, benefits, seniority, and more.

03

Priority Job Alerts

Get timely alerts for new job openings every day.

04

Manage Your Job Hunt

Save jobs you like and keep a simple list of your applications.

21,000+ SOURCES UPDATED 24/7
Apply