Job Description

Join Our Team

Oowlish, one of Latin America’s rapidly expanding software development companies, is seeking experienced technology professionals to enhance our diverse and vibrant team.

As a valued member of Oowlish, you will collaborate with premier clients from the United States and Europe, contributing to pioneering digital solutions. Our commitment to creating a nurturing work environment is recognized by our certification as a Great Place to Work, where you will have opportunities for professional development, growth, and a chance to make a significant international impact.

We offer the convenience of remote work, allowing you to craft a work-life balance that suits your personal and professional needs. We’re looking for candidates who are passionate about technology, proficient in English, and excited to collaborate remotely with teams around the world.

About the Role:

We are seeking a Senior Site Reliability Engineer (SRE) to help build and evolve the reliability practices that support our production systems at scale.

This role goes beyond traditional DevOps and infrastructure management. We are specifically looking for someone with deep experience in Site Reliability Engineering, including ownership of Service Level Objectives (SLOs), incident response processes, observability strategies, and production reliability initiatives.

You will partner with engineering teams to improve availability, performance, monitoring, alerting, and operational excellence while helping establish a strong reliability culture across the organization.

The ideal candidate has experience operating large-scale production environments, leading incident response efforts, and using observability data to proactively improve system health and customer experience.

Responsibilities:

  • Design, implement, and improve Site Reliability Engineering practices across production environments.
  • Define, manage, and continuously improve Service Level Objectives (SLOs), Service Level Indicators (SLIs), and Error Budgets.
  • Lead and participate in incident response and incident command processes.
  • Build and evolve observability strategies, including monitoring, logging, alerting, and distributed tracing.
  • Improve system reliability, availability, scalability, and operational efficiency.
  • Partner with engineering teams to improve application performance and production readiness.
  • Develop automation solutions that reduce operational overhead and improve reliability.
  • Participate in root cause analysis and post-incident reviews.
  • Drive continuous improvement initiatives based on operational insights and incident learnings.
  • Help establish reliability best practices across teams and services.

Requirements:

  • 5+ years of professional experience in Site Reliability Engineering, DevOps, or Production Engineering roles.
  • Strong understanding of Site Reliability Engineering principles and best practices.
  • Experience supporting and operating production systems at scale.
  • Strong knowledge of monitoring, observability, and reliability engineering concepts.
  • Experience working in cloud-based environments.
  • Strong troubleshooting and problem-solving skills.
  • Experience working with distributed systems and modern application architectures.

Must have:

  • Proven Site Reliability Engineering experience.
  • Experience in defining and managing:
    • Service Level Objectives (SLOs)
    • Service Level Indicators (SLIs)
    • Error Budgets
  • Experience leading or actively participating in Incident Command and Incident Response processes.
  • Experience designing and implementing observability strategies.
  • Hands-on experience with:
    • Monitoring
    • Logging
    • Alerting
    • Distributed Tracing
  • Experience improving system reliability, availability, and operational excellence.
  • Experience supporting mission-critical production environments.
  • Experience with cloud platforms (AWS preferred).
  • Strong automation mindset.
  • Experience conducting root cause analysis and postmortems.

Nice to have:

  • Kubernetes experience.
  • Terraform or Infrastructure as Code experience.
  • CI/CD pipeline experience.
  • Experience with containerized environments.
  • Experience with distributed microservices architectures.
  • Experience with performance engineering.
  • Experience mentoring engineers on reliability practices.
  • Multi-cloud experience.
  • Experience working in highly regulated or high-availability environments.

Benefits & Perks:

  • Home office.
  • Competitive compensation based on experience.
  • Career plans to allow for extensive growth in the company.
  • International projects.
  • Oowlish English Program (Technical and Conversational).
  • Oowlish Fitness with Total Pass.
  • Games and competitions.

You can also apply here:

Website: https://www.oowlish.com/work-with-us/

LinkedIn: https://www.linkedin.com/company/oowlish/jobs/

Instagram: https://www.instagram.com/oowlishtechnology/

We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, assessing responses, and identifying potential inconsistencies or verification signals in application materials based on available information. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are made by humans. If you would like more information about how your data is processed, please contact us.
