Job Description
Description
DVT is one of the top software development companies on the continent, consulting on cutting-edge applications for leading enterprises in South Africa and globally. We are committed to continuously developing our people, with a strong culture of learning, internal speaking, and sponsored technical events across the AWS ecosystem.
We are looking for a Senior Cloud & AWS Support Engineer to join our cloud team on a client-embedded engagement. This is a dual role: alongside building and maintaining cloud infrastructure and automation, you will provide day-to-day AWS operational support — owning incident, alert, and request triage, on-call response, and the operational health of the client’s AWS estate. You sit within the client’s Operations (“keeping the lights on”) team, working closely with their platform and engineering teams to keep production stable while improving it.
Requirements
DUTIES AND RESPONSIBILITIES
AWS Support & Operations
Act as first responder for the client’s AWS environment — triage, diagnose, and resolve incidents and service requests within agreed SLAs
Own alert handling across CloudWatch, GuardDuty, Security Hub, and AWS Health, including the email/notification alerting pipeline, routing, and noise reduction
Participate in the on-call rotation (including cross-timezone cover where client launches require it) and lead structured incident response and post-incident reviews
Manage the operational ticket queue (incident, request, problem, and change), maintaining clear status, ownership, and communication to client stakeholders
Build and maintain runbooks, playbooks, and knowledge-base articles to standardise response and enable faster, repeatable resolution
Drive problem management — identify recurring issues, perform root-cause analysis, and convert findings into permanent fixes and automation
Support patching, backup/restore verification, and routine operational maintenance under tagging- and SCP-based governance
Infrastructure & Automation
Design, implement, and maintain scalable CI/CD pipelines (e.g. GitHub Actions, AWS CodePipeline, GitLab CI) for automated testing, deployment, and provisioning
Manage infrastructure-as-code with Terraform (primary), CloudFormation, and CDK, including remote state, modular refactoring, and multi-environment deployments
Develop ephemeral feature environments for isolated testing (Terraform workspaces / Terragrunt) and automate provisioning, routing (Route 53, ALB/NLB), and teardown
Apply scheduled shutdowns to manage costs; use tagging, Kubernetes taints/tolerations, and tag-driven patching schedules
Design and implement AWS networking — VPC architecture, security groups, NACLs, Transit Gateway, and hybrid connectivity
Deployment & Operations
Collaborate with software teams to integrate and deploy backend services (Java, .NET, Python, Node.js) and containerised applications
Implement blue/green or canary deployment strategies with safe, traceable rollouts and automated rollback
Integrate unit and functional/API testing (pytest, Postman/Newman, LocalStack) into the CI pipeline
Establish conventions for test execution, image tagging, versioning, and reusable CI/CD components
Manage production deployments, change windows, and release coordination across environments
Monitoring, Security & Compliance
Implement comprehensive monitoring, logging, and observability (CloudWatch, X-Ray, third-party APM)
Monitor and optimise system performance, deployment efficiency, resource utilisation, and cloud costs
Maintain security best practices — Well-Architected & OWASP, secrets management (Secrets Manager, Parameter Store), IAM policies, SAST/DAST scanning, and compliance controls
Configure alerting and incident-response workflows and lead post-incident reviews
Ensure compliance with relevant standards (ISO 27001, SOC 2, POPIA / GDPR) per client requirements
Leadership & Client Engagement
Provide technical leadership and mentorship to junior engineers and client development teams
Engage directly with client stakeholders to understand requirements, advise, and present solutions
Participate in architectural reviews, retrospectives, and planning to continuously improve tooling and process
Conduct training and create documentation to embed DevOps and operational best practices
Contribute to pre-sales — solution design, effort estimation, and technical proposals
Required Experience and Skills
7+ years in Cloud DevOps, SRE, or AWS operational support, with strong CI/CD and infrastructure automation experience
Hands-on experience running AWS managed support / operations — incident management, alert triage, on-call, and SLA-bound resolution
Deep AWS proficiency: EC2, ECS/EKS, Lambda, S3, DynamoDB, RDS, VPC, Route 53, CloudFront, IAM (Identity Center), CloudWatch, X-Ray
Expert IaC (Terraform required; CloudFormation beneficial) — remote state, modules, multi-environment
Strong CI/CD platform experience (GitHub Actions, CodePipeline, GitLab CI, or Jenkins)
Advanced scripting in Bash or Python for automation and tooling
Containerisation and orchestration (Docker, Kubernetes/EKS, Helm beneficial)
Familiarity with AWS GuardDuty, Security Hub, and AWS Health alert handling and triage
Experience with ITSM / ticketing workflows and structured incident, problem, and change management
Monitoring/observability tooling (CloudWatch, Prometheus, Grafana, Datadog, or similar)
AWS networking — VPC design, security groups, load balancers, DNS, hybrid connectivity
Secrets management (Secrets Manager, Parameter Store, HashiCorp Vault)
DevSecOps practices — SCA/SAST/DAST, vulnerability and compliance automation
AWS cost optimisation, tagging strategies, and FinOps practices
Excellent communication; proven mentorship; consulting / professional-services delivery across concurrent client engagements
Preferred Qualifications
AWS certifications — Solutions Architect Professional, DevOps Engineer Professional, Security Specialty
AWS Certified SysOps Administrator (operations-focused)
Certified Kubernetes Administrator (CKA) / Application Developer (CKAD)
Multi-cloud (AWS, Azure, GCP) or hybrid cloud architecture experience
Experience in regulated industries (financial services, gaming/lottery, healthcare, government)
GitOps tooling (ArgoCD, Flux); serverless and event-driven design
Disaster recovery, backup strategies, and multi-region deployments
Configuration management (Ansible, Chef, Puppet); public speaking / open-source contributions
Minimum Requirements
Matric (Grade 12) certificate
Bachelor’s degree in Computer Science, IT, Engineering, or related field (or equivalent practical experience)
At least one relevant professional certification (AWS DevOps Engineer, Solutions Architect, SysOps Administrator, CKA, or equivalent)
South African work authorisation or valid work permit
Willingness to participate in an on-call rotation for production AWS support
Ability to travel occasionally for client meetings or site visits