Site Reliability Engineer - Observability

  • $160k-$180k
  • Remote - Worldwide

DevOps

Mid-level

Job description

ABOUT THE ROLE

Second Front Systems’ (2F) Product team is seeking a highly skilled and motivated Senior Site Reliability Engineer to join our Observability team. We are a small team working to accelerate the deployment of emerging technology into national security use cases, and we want technical professionals who are eager to operate on the front lines of an exciting and disruptive mission.

As a Senior SRE for Second Front Systems, you’ll be responsible for deploying, maintaining, and scaling our observability infrastructure across multiple DoD networks. You’ll work with Kubernetes-based platforms and BigBang charts from DoD Platform One, and you’ll build automation that makes our monitoring stack easier to deploy for new customers. You’ll be empowered to collaborate with others to implement infrastructure that delivers unique capabilities for our commercial and government customers, including the Department of Defense.

The Observability team is looking for a strong SRE with deep DevSecOps and Kubernetes experience: someone who has deployed and maintained monitoring infrastructure at scale, with an eye for security in highly regulated environments. Experience with DoD software deployments, Platform One, and single-tenant architectures is highly valued.

We are a fast-growing entrepreneurial team working at the convergence of technology and national security. If this type of effort interests you, come join us!

Note: This position requires U.S. citizenship due to government contract requirements.

What You’ll Do

  • Deploy and maintain our observability stack (Grafana, Mimir, Prometheus) across multiple customer clusters and DoD networks
  • Build Helm chart abstractions and automation to streamline monitoring deployments for new customers
  • Troubleshoot and debug complex Kubernetes issues, networking problems, and monitoring stack failures
  • Configure and maintain BigBang charts and DoD Platform One integrations
  • Design and implement infrastructure automation using tools like Pulumi, ArgoCD, and Flux
  • Work with Istio service mesh and Keycloak for authentication in secure environments
  • Monitor and optimize performance of monitoring infrastructure across multiple environments
  • Collaborate with security teams to ensure compliance with NIST requirements and DoD standards
  • Participate in on-call rotation and incident response for production environments

Skills You’ll Bring to Our Team

  • 5+ years of Site Reliability Engineering or DevOps experience
  • Deep experience with Kubernetes administration, troubleshooting, and scaling
  • Hands-on experience deploying and maintaining observability tools (Prometheus, Grafana, Mimir/Cortex)
  • Strong understanding of Helm charts, GitOps practices, and CNCF tooling
  • Experience with service mesh technologies (Istio preferred)
  • Proven ability to debug complex distributed systems and networking issues
  • Understanding of authentication systems and security in regulated environments
  • Ability to work independently and collaborate with team members in a remote environment

Preferred Qualifications

  • Active security clearance or ability to obtain a Secret-level security clearance
  • Previous experience with DoD software deployments and Platform One
  • Experience with BigBang charts and Iron Bank containers
  • Experience working in national security or highly regulated environments
  • Familiarity with compliance frameworks (NIST, FedRAMP, etc.)
  • Experience with infrastructure as code (Pulumi, Terraform)

Technologies We Use

  • Observability: Grafana stack, Prometheus, custom alerting tools
  • Kubernetes: Helm, ArgoCD, Flux, Tekton, BigBang charts
  • Security: Istio, Keycloak, Kyverno
  • Infrastructure: AWS/GCP/Azure, Pulumi, Git/GitLab
  • Languages: YAML, Bash, Go

$160,000 - $180,000 a year

Perks & Benefits

This role is full time. As a public benefit corporation, we’re a team of purpose-driven trailblazers transforming the future of U.S. national security. We hire the best to do their best and, as such, we are committed to providing the perks and benefits you need to be successful, both inside and outside the workplace.

We offer you:

  • Competitive salary
  • 100% healthcare, vision, and dental coverage
  • 401(k) + 3% company contribution
  • Wellness perks (fitness classes, mental health resources)
  • Equity incentive plan
  • Tech + office supplies stipend
  • Annual professional development stipend
  • Flexible paid time off + federal holidays off
  • Parental leave
  • Work from anywhere
  • Referral bonus

Visit our careers page to learn more.
