Job description
About the role:
We are looking for a DevOps Engineer to design, build, and maintain scalable, secure, and production-grade infrastructure for a high-load SaaS platform.
You will take full ownership of cloud infrastructure, deployment automation, system reliability, and performance at scale.
This is your opportunity to engineer infrastructure the way top-tier SaaS products are built — with automation, observability, cost-efficiency, and high availability.
What you’ll be doing:
- Design and maintain a secure, scalable, and highly available AWS infrastructure
- Implement Infrastructure as Code (IaC) using tools like Terraform or AWS CDK.
- Create and maintain CI/CD pipelines for automatic, zero-downtime deployments (every 2 weeks).
- Monitor and manage infrastructure health, logs, metrics, and set up effective alerting systems.
- Investigate and resolve incidents, perform root cause analysis, and prevent recurrences.
- Continuously optimize infrastructure for performance and cost.
- Collaborate closely with backend/frontend engineers to align infrastructure with software architecture.
Tech Stack:
- Cloud Platform: AWS (VPC, EC2, ELB, WAF, S3, IAM, RDS Aurora, SES, Lambda, ECS, EKS, CloudFront)
- Infrastructure as Code: Terraform, AWS CDK
- Containers & Orchestration: Docker, Kubernetes (EKS)
- CI/CD Tools: GitHub Actions, Bitbucket Pipelines, ArgoCD, Helm, FluxCD
- Monitoring & Observability: AWS CloudWatch, Prometheus, Grafana, Loki, Sentry
- Databases: AWS RDS Aurora (MySQL/PostgreSQL), ClickHouse
- Version Control: Github, Bitbucket
We’re looking for:
- 5+ years of hands-on DevOps experience, with strong focus on AWS infrastructure.
- Proven experience in a SaaS environment is mandatory: you understand the expectations around uptime, multi-tenancy, deployments, and rapid iteration.
- Deep knowledge of AWS services: EC2, EKS, RDS, IAM, S3, Lambda, WAF, etc.
- Experience setting up and maintaining robust CI/CD pipelines (with fast, safe deployments).
- Proficiency with IaC tools like Terraform or AWS CDK.
- Solid skills with Docker, Kubernetes (EKS).
- Understanding of relational databases (especially MySQL/PostgreSQL) and advanced SQL.
- Comfort with monitoring, alerting, and incident response in a production environment.
- Familiarity with scripting/automation in Python or Bash.
- Strong Linux/Unix administration skills.
- English proficiency at least Pre-Intermediate.
Nice to have:
- Experience managing high-load and distributed systems in production.
- Experience working with ClickHouse.
- Exposure to GitOps practices and tools like ArgoCD or FluxCD.
- Enthusiasm for clean code, automation, and operational excellence.
What we offer:
- Lead infrastructure efforts on a next-gen high-load SaaS platform.
- Work in a product-driven environment with bi-weekly releases and full DevOps ownership.
- Build with the latest cloud-native stack (AWS, EKS, IaC, GitOps).
- Collaborate with a senior, passionate engineering team.
- Enjoy full remote flexibility and a product-first culture.