DevOps and Site Reliability Engineer

  • Remote - Mexico, Costa Rica

Remote

DevOps

Mid-level

Job description

Join Our Team

Oowlish, one of Latin America’s rapidly expanding software development companies, is seeking experienced technology professionals to enhance our diverse and vibrant team.

As a valued member of Oowlish, you will collaborate with premier clients from the United States and Europe, contributing to pioneering digital solutions. Our commitment to creating a nurturing work environment is recognized by our certification as a Great Place to Work, where you will have opportunities for professional development, growth, and a chance to make a significant international impact.

We offer the convenience of remote work, allowing you to craft a work-life balance that suits your personal and professional needs. We’re looking for candidates who are passionate about technology, proficient in English, and excited to engage in remote collaboration for a worldwide presence.

About the Role:

We are seeking a DevOps & Site Reliability Engineer to join a growing AI-focused SaaS startup. In this role, you’ll be responsible for maintaining, optimizing, and scaling the infrastructure that supports our platform, ensuring high availability, performance, and reliability.

You’ll work closely with development and product teams to improve deployment processes, monitor systems, and respond to incidents proactively.

If you are passionate about DevOps culture, automation, and ensuring systems are always running smoothly, this is the perfect opportunity for you!

Key Responsibilities:

  • Deploy and manage web, mobile, and API applications across cloud environments
  • Implement and maintain monitoring and observability tools like NewRelic, Datadog, or Prometheus/Grafana
  • Design and optimize CI/CD pipelines using tools like Azure Pipelines, Jenkins, or CircleCI
  • Manage containerized environments with Docker, Kubernetes, and Helm
  • Build and manage cloud infrastructure on Azure, AWS, or GCP
  • Write automation scripts using Bash and other scripting languages
  • Develop and maintain incident response processes and disaster recovery strategies
  • Collaborate with development, product, and operations teams to improve system reliability and deployment efficiency

Must-Have:

  • 3+ years of experience in a DevOps, Site Reliability Engineering (SRE), or related role
  • Strong hands-on experience with the deployment of web, mobile, and API applications
  • Expertise in monitoring and observability tools (e.g., NewRelic, Datadog, Prometheus/Grafana)
  • Strong experience with CI/CD pipelines and associated tools (Azure Pipelines, Jenkins, CircleCI)
  • Proficiency with Docker, Kubernetes, and Helm
  • Experience working with cloud platforms like Azure, AWS, or GCP
  • Scripting proficiency in Bash
  • Familiarity with incident response and disaster recovery planning

Benefits & Perks:

Home office;

Flexible Hours

Competitive compensation based on experience;

Career plans to allow for extensive growth in the company;

International Projects;

Oowlish English Program (Technical and Conversational);

Oowlish Fitness with Total Pass;

Connecting You (Internet allowance);

Anniversary bonus;

Wedding gift;

Pet adoption incentive;

New baby Oowl bonus;

Back to School bonus;

Streaming Subscription;

PTO Bonus;

Games and Competitions;

You can also apply here:

Website: https://www.oowlish.com/work-with-us/

LinkedIn: https://www.linkedin.com/company/oowlish/jobs/

Instagram: https://www.instagram.com/oowlishtechnology/

#LI-LM1

#LI-CD1

#LI-EA1

#LI-TC1

#LI-ET1

#LI-TT1

#LI-JH1

#LI-DP1

#LI-LS1

#LI-AB1

#LI-KN1

#LI-SR1

#LI-JS1

#LI-FZ1

Share this job:
Please let Oowlish know you found this job on Remote First Jobs 🙏
Apply