L3 AWS Support Engineer

  • Remote - India

Remote

DevOps

Mid-level

Job description

Job Overview

We are seeking a highly skilled L3 Engineer to serve as the technical authority for Windows and Linux operating systems hosted on AWS virtual machines (VMs), with a focus on advanced cloud architecture, security, and automation. In this critical role, you will architect scalable solutions, oversee OS patching and AWS strategies and resolve complex incidents. As a Senior Engineer, you will drive performance optimization, implement cutting-edge security measures, automate workflows, and mentor technical teams to align infrastructure with business objectives. This position is ideal for a senior expert with deep expertise in AWS, OS management, and modern DevOps practices.

Key Responsibilities

Architectural Oversight:

Design, implement, and optimize cloud architectures for Windows and Linux VMs in AWS, ensuring performance, security, and scalability.

Conduct Well-Architected Framework reviews to assess and enhance system reliability, cost-efficiency, and operational excellence.

Complex Troubleshooting:

Act as the final escalation point for critical incidents involving Windows and Linux VMs, resolving issues in hybrid environments, multi-region setups, and service interdependencies.

Debug and optimize advanced AWS services, including Lambda functions and API Gateway configurations.

System Management and Optimization:

Oversee advanced maintenance of Windows (e.g., AD integration, performance tuning) and Linux (e.g., kernel optimization, service hardening) systems in AWS.

Develop and execute enterprise-level OS patching strategies for Windows (e.g., WSUS, SCCM) and Linux (e.g., custom repositories, automation), ensuring compliance and minimal downtime.

Analyze future resource needs, plan capacity, and resolve performance bottlenecks to maintain optimal system efficiency.

Security and Governance:

Implement advanced security measures, including GuardDuty, AWS WAF, and Security Hub, to protect Windows and Linux environments.

Conduct regular security audits and compliance checks (e.g., SOC2, GDPR) and enforce standards for OS hardening and AWS IAM policies.

Lead root cause analysis (RCA) for security incidents and implement preventative measures.

Automation & DevOps:

Automate infrastructure scaling, patching, and monitoring workflows using scripting (e.g., Bash, Python, PowerShell) and AWS CLI.

Automate provisioning of AWS resources using Terraform or CloudFormation.

Disaster Recovery & High Availability:

Design and test failover strategies and disaster recovery mechanisms for critical Windows and Linux applications in AWS.

Architect high-availability solutions (e.g., multi-AZ setups, failover clusters) to ensure business continuity.

Team Leadership:

Provide technical guidance to L1 and L2 engineers, fostering skill development and operational excellence.

Create knowledge-sharing sessions and maintain best practices documentation for OS management, AWS, and Kubernetes operations.

Cost Management:

Implement resource tagging strategies and leverage cost management tools (e.g., AWS Cost Explorer) to reduce operational expenses while maintaining performance.

Lead cost optimization efforts, such as right-sizing EC2 instances and utilizing reserved or spot instances.

Stakeholder Collaboration:

Partner with customers, architects, and leadership to address complex requirements and lead migration/modernization projects for Windows and Linux systems.

Provide expert guidance on patching schedules, security compliance, and AWS infrastructure roadmaps.

Required Skills and Qualifications

Technical Skills:

Expert-level mastery of Windows operating systems (e.g., Windows Server 2016/2019/2022) and/or Linux distributions (e.g., Ubuntu, CentOS, RHEL), including kernel-level diagnostics.

Deep expertise in OS patching strategies, including enterprise tools (e.g., WSUS, SCCM for Windows; Ansible, Satellite for Linux) and automation frameworks.

Comprehensive understanding of AWS core services (e.g., EC2, S3, VPC, IAM) and advanced features (e.g., Lambda, API Gateway, Systems Manager).

Strong proficiency in AWS automation, scripting (e.g., Bash, Python, PowerShell), and AWS CLI for managing infrastructure and patching.

Extensive experience with AWS CloudFormation and Terraform for infrastructure-as-code.

Advanced knowledge of AWS security best practices, identity and access management (e.g., IAM roles, MFA), and networking (e.g., VPC peering, subnets).

Capacity Planning: Ability to analyze future resource needs and plan capacity for Windows, Linux, and Kubernetes workloads.

Performance Optimization: Expertise in identifying and resolving performance bottlenecks in OS and AWS environments.

Migration and Modernization: Proven ability to lead complex migration and modernization projects for cloud-based systems.

Soft Skills:

Exceptional communication and interpersonal skills to articulate technical solutions to stakeholders and mentor teams.

Superior problem-solving and analytical skills to address enterprise-scale challenges and drive innovation.

Ability to work independently, lead cross-functional initiatives, and foster collaboration in a dynamic environment.

Customer-focused approach with a strategic mindset for aligning technical expertise with business goals.

Experience:

7+ years of experience in IT support, systems engineering, or cloud architecture, with a focus on Windows and Linux OS management, patching, and AWS.

Proven track record of managing enterprise-scale AWS environments and Kubernetes clusters.

Education:

Bachelor’s degree in Computer Science, Information Technology, or a related field (Master’s preferred, or equivalent experience).

Certifications (Preferred):

AWS Certified Solutions Architect – Professional

Microsoft Certified: Azure Administrator Associate (or equivalent Windows Server certification)

Red Hat Certified Engineer (RHCE) or Linux Professional Institute Certification (LPIC-23)

Additional Requirements

Flexibility to provide on-call support for critical incidents and oversee patching or failover operations across global time zones.

Proficiency with collaboration tools (e.g., Microsoft Teams, Slack) and enterprise ticketing systems (e.g., ServiceNow, Jira).

Share this job:
Please let Rackspace Technology know you found this job on Remote First Jobs 🙏

Benefits of using Remote First Jobs

Discover Hidden Jobs

Unique jobs you won't find on other job boards.

Advanced Filters

Filter by category, benefits, seniority, and more.

Priority Job Alerts

Get timely alerts for new job openings every day.

Manage Your Job Hunt

Save jobs you like and keep a simple list of your applications.

Search remote, work from home, 100% online jobs

We help you connect with top remote-first companies.

Search jobs

Hiring remote talent? Post a job

Frequently Asked Questions

What makes Remote First Jobs different from other job boards?

Unlike other job boards that only show jobs from companies that pay to post, we actively scan over 20,000 companies to find remote positions. This means you get access to thousands more jobs, including ones from companies that don't typically post on traditional job boards. Our platform is dedicated to fully remote positions, focusing on companies that have adopted remote work as their standard practice.

How often are new jobs added?

New jobs are constantly being added as our system checks company websites every day. We process thousands of jobs daily to ensure you have access to the most up-to-date remote job listings. Our algorithms scan over 20,000 different sources daily, adding jobs to the board the moment they appear.

Can I trust the job listings on Remote First Jobs?

Yes! We verify all job listings and companies to ensure they're legitimate. Our system automatically filters out spam, junk, and fake jobs to ensure you only see real remote opportunities.

Can I suggest companies to be added to your search?

Yes! We're always looking to expand our listings and appreciate suggestions from our community. If you know of companies offering remote positions that should be included in our search, please let us know. We actively work to increase our coverage of remote job opportunities.

How do I apply for jobs?

When you find a job you're interested in, simply click the 'Apply Now' button on the job listing. This will take you directly to the company's application page. We kindly ask you to mention that you found the position through Remote First Jobs when applying, as it helps us grow and improve our service 🙏

Apply