Staff Site Reliability Engineer

🇺🇸 United States - Remote
🔧 DevOps🟣 Senior

Job description

Required U.S. Citizenship / No clearance needed / 100% remote within the US / EST Time Zone

Staff Site Reliability Engineer / Cloud SME

Location: 100% remote in the continental US

Type: Long-term contract (3+ years)

Role Summary

As the Staff SRE/Cloud SME, you will be a critical technical leader driving the rearchitecting of our existing monolithic system into a resilient, cloud-native architecture. This role requires deep expertise across multiple cloud platforms (Azure and AWS) and container orchestration (Kubernetes) to ensure the next-generation platform meets the highest standards of scalability, reliability, and security.

Key Responsibilities

Architecture & Transformation Leadership

  • Lead the technical rearchitecting efforts, transforming a large-scale monolithic system into a modern microservices-based, cloud-native application.
  • Collaborate with cross-functional teams (Engineering, Architecture, Product) to define and implement the new system architecture using domain-driven design (DDD) principles.
  • Conduct technology evaluations and provide recommendations for new tools, frameworks, and cloud services to enhance our infrastructure.

Reliability Engineering & Cloud Operations

  • Utilize Kubernetes (K8S) for container orchestration and management, ensuring extreme scalability, reliability, and high availability of the system.
  • Implement robust, highly resilient, and highly available components for the system.
  • Develop and implement comprehensive monitoring, logging, and alerting mechanisms to ensure optimal system performance and availability.
  • Drive the adoption of DevOps principles and practices throughout the software development lifecycle, ensuring seamless integration and continuous deployment processes.

Technical Expertise & Mentorship

  • Stay up-to-date with emerging technologies, frameworks, and industry trends related to systems and cloud computing.
  • Mentor and provide technical guidance to junior team members, fostering a culture of continuous learning and professional growth.

Required Qualifications

  • Cloud Platforms: 7+ years of experience with cloud computing platforms. Strong multi-cloud expertise required with AWS and Azure.
  • Cloud-Native Transformation: 7+ years of experience in rearchitecting large-scale monolithic applications to cloud-native architectures.
  • Container Orchestration: Strong expertise in Kubernetes (K8S) is required, including hands-on experience with both AKS (Azure Kubernetes Service) and EKS (Elastic Kubernetes Service).
  • Networking: Strong experience with Cloud Networking, with the ability to design and resolve complex cloud networking architecture problems.
  • IaC: Expert knowledge of Terraform for infrastructure-as-code deployment and management.
  • Security: Must possess strong knowledge of security best practices for containers and Kubernetes clusters.
  • Education: Bachelor’s or Master’s degree in Computer Science, Software Engineering, or a related field.
  • Bonus Knowledge: Knowledge of load balancing algorithms.

Thanks for applying!

Share this job:
Please let ASCENDING know you found this job on Remote First Jobs 🙏

Benefits of using Remote First Jobs

Discover Hidden Jobs

Unique jobs you won't find on other job boards.

Advanced Filters

Filter by category, benefits, seniority, and more.

Priority Job Alerts

Get timely alerts for new job openings every day.

Manage Your Job Hunt

Save jobs you like and keep a simple list of your applications.

Search remote, work from home, 100% online jobs

We help you connect with top remote-first companies.

Search jobs

Hiring remote talent? Post a job

Apply