Job Description
Job Title: DevOps Engineer
Position Type: Full-Time, Remote
Working Hours: U.S. Client Business Hours (with flexibility for deployments, incident response, and on-call rotations)
About the Role
Our client is seeking a DevOps Engineer to build, maintain, and optimize scalable infrastructure and deployment pipelines that support fast, secure, and reliable software delivery.
This role requires strong expertise in cloud infrastructure, CI/CD automation, Kubernetes, monitoring, and infrastructure-as-code. The DevOps Engineer will work closely with development and engineering teams to improve deployment reliability, automate operational workflows, strengthen security, and ensure high system availability across environments.
This is an ideal role for someone who enjoys solving infrastructure challenges, optimizing systems at scale, and building highly automated and resilient environments.
Responsibilities
Infrastructure Management
• Provision, configure, and maintain infrastructure across AWS, GCP, or Azure
• Implement Infrastructure-as-Code using Terraform, Pulumi, or CloudFormation
• Configure networking, compute, storage, IAM, and cloud services to support scalability and security
• Optimize infrastructure performance, availability, and cost efficiency
CI/CD & Deployment Automation
• Build and maintain CI/CD pipelines using GitHub Actions, Jenkins, GitLab CI, or CircleCI
• Automate build, test, deployment, and rollback workflows across environments
• Improve deployment reliability and support zero-downtime deployment strategies
• Partner with engineering teams to streamline release cycles and reduce bottlenecks
Containerization & Kubernetes
• Manage Docker-based application environments and Kubernetes clusters
• Deploy and maintain microservices-based architectures
• Monitor cluster health, autoscaling, and workload performance
• Improve container orchestration efficiency and infrastructure stability
Monitoring, Logging & Incident Response
• Implement observability solutions using Prometheus, Grafana, Datadog, New Relic, or similar tools
• Configure centralized logging and alerting systems (ELK Stack, Splunk, CloudWatch, etc.)
• Participate in incident response and on-call rotations when required
• Conduct root cause analysis (RCA) and improve system resiliency post-incident
Security & Compliance
• Apply cloud security best practices including IAM, least privilege access, and encryption standards
• Support compliance initiatives related to SOC 2, HIPAA, PCI-DSS, GDPR, or similar frameworks
• Run vulnerability scans, patch systems proactively, and monitor infrastructure security posture
• Ensure deployment pipelines and infrastructure configurations follow security standards
Documentation & Process Improvement
• Maintain clear documentation for infrastructure, pipelines, runbooks, and deployment workflows
• Identify opportunities for automation, reliability improvements, and operational efficiency
• Collaborate cross-functionally with developers, QA, and engineering leadership to improve DevOps processes
What Makes You a Perfect Fit
• Problem solver who thrives at the intersection of development and operations
• Calm, methodical, and reliable during high-pressure incidents and outages
• Passionate about automation, scalability, reliability, and system optimization
• Strong communicator who collaborates effectively across technical and non-technical teams
• Ownership-driven, with a proactive approach to preventing issues
Required Experience & Skills
• 3+ years of experience in DevOps, Site Reliability Engineering (SRE), or Infrastructure Engineering
• Hands-on experience with at least one major cloud provider (AWS, GCP, or Azure)
• Strong understanding of CI/CD pipelines and deployment automation
• Experience with Docker and Kubernetes in production environments
• Familiarity with Linux systems administration and scripting
• Experience with monitoring and logging platforms
Preferred Experience & Skills
• Expertise with Terraform, Pulumi, or other Infrastructure-as-Code tools
• Experience supporting SaaS, fintech, healthcare, or enterprise-scale applications
• Familiarity with serverless architectures (AWS Lambda, Google Cloud Functions, Azure Functions)
• Experience optimizing infrastructure costs and cloud resource utilization
• Cloud or DevOps certifications such as AWS Certified DevOps Engineer, CKA, CKAD, or equivalent
What Does a Typical Day Look Like?
A DevOps Engineer’s day revolves around ensuring systems are automated, secure, scalable, and reliable. You will:
• Review monitoring dashboards and investigate infrastructure alerts or incidents
• Improve CI/CD pipelines to accelerate deployments and reduce failures
• Provision or modify infrastructure using Terraform, Pulumi, or CloudFormation
• Collaborate with developers to troubleshoot deployments and optimize application performance
• Analyze logs and system metrics to identify reliability or scalability issues
• Update runbooks, workflows, and infrastructure documentation
• Implement automation improvements that reduce manual operational work
In essence: you are the backbone of infrastructure reliability and deployment automation, ensuring systems remain stable, scalable, and efficient while enabling engineering teams to ship faster with confidence.
Key Metrics for Success (KPIs)
• Higher deployment frequency and improved release reliability
• System uptime ≥ 99.9%
• Reduced MTTR (Mean Time to Recovery) for production incidents
• Infrastructure cost optimization and efficiency improvements
• Lower deployment failure and rollback rates
• Positive engineering feedback regarding deployment speed and reliability
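To make the uptime and MTTR targets above concrete, here is a minimal Python sketch of the arithmetic behind them. The function names and the 30-day measurement window are assumptions chosen for illustration; the client may measure these KPIs differently. Under these assumptions, a 99.9% uptime target leaves roughly 43 minutes of allowable downtime per 30 days.

```python
from datetime import timedelta


def downtime_budget(availability_pct: float, period: timedelta) -> timedelta:
    """Maximum downtime allowed in `period` while still meeting the target.

    `availability_pct` is a percentage such as 99.9 (hypothetical helper).
    """
    return period * (1 - availability_pct / 100)


def mean_time_to_recovery(durations: list[timedelta]) -> timedelta:
    """Average incident duration; zero when no incidents were recorded."""
    if not durations:
        return timedelta(0)
    return sum(durations, timedelta(0)) / len(durations)


if __name__ == "__main__":
    # Assumed measurement window: 30 days.
    budget = downtime_budget(99.9, timedelta(days=30))
    print(f"Downtime budget: {budget.total_seconds() / 60:.1f} minutes")

    # Made-up incident durations, for illustration only.
    incidents = [timedelta(minutes=30), timedelta(minutes=90)]
    print(f"MTTR: {mean_time_to_recovery(incidents)}")
```

Comparing total incident time against the downtime budget for the same window shows whether the uptime target was met.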
Interview Process
• Initial Phone Screen
• Video Interview with a Pavago Recruiter
• Technical Assessment (e.g., design a CI/CD pipeline or provision infrastructure using Terraform)
• Client Interview with Engineering / DevOps Leadership
• Offer & Background Verification
#DevOps #Kubernetes #Terraform #AWS #CloudEngineering #Infrastructure #SRE #Docker #CICD #RemoteJobs #DevOpsEngineer #CloudComputing #Automation #TechJobs #PlatformEngineering