Staff Site Reliability Engineer

  • Remote - Latin America

Remote

DevOps

Mid-level

Job description

Explore the Nearsure experience!

🌐 Join our close-knit LATAM remote team: Connect through fun activities like coffee breaks, tech talks, and games with your team-mates and management.

🍃 Say goodbye to micromanagement! We champion autonomy, open communication, and respect for diversity as our core values.

⚖️ Your well-being matters: Our People Care team is here from day one to support you with everything from time-off requests to wellness check-ins.

Plus, our Accounts Management team ensures smooth, effective client relationships, so you can focus on what you do best.

Ready to grow with us? 🚀

Here’s what we offer you by joining us!

Competitive USD salary 💲 – We value your skills and contributions!

🌐 100% remote work 🏢 – While you can work from anywhere, you’re always welcome to connect with teammates and grow your network at our coworking spaces across LATAM!

💼 Paid time off – Take the time you need according to your country’s regulations, all while receiving your full salary. Rest, recharge, and come back stronger!

🎉 National Holidays celebrated 🌴 – Take time off to celebrate important events and traditions with loved ones, fully embracing your culture.

😷 Sick leave – Focus on your health without the stress. Take the necessary time to recover and feel better.

💸 Refundable Annual Credit – Spend it on the perks you love to enhance your work-life balance!

🤝 Team-building activities – Join us for coffee breaks, tech talks, and after-work gatherings to bond with your Nearsure family and feel part of our vibrant community.

🥳 Birthday day off 🎂 – Enjoy an extra day off during your birthday week to celebrate in style with friends and family!

About the project

As a Staff Site Reliability Engineer, you will own and optimize OpenTelemetry pipelines, enabling scalable and efficient observability. You’ll build tools that empower teams, support incident response, and drive best practices. Your work ensures a reliable, secure infrastructure and actionable alerting across the organization.

What your day-to-day work will look like

✅ Design, implement, and maintain observability pipelines across the three main signals—logs, metrics, and traces—ensuring standardized, scalable, and efficient data ingestion. Optimize ingestion strategies to balance cost, performance, and usability.

✅ Build self-service automation and tooling that enables development teams to instrument and leverage observability without requiring manual intervention from the SRE team. Drive adoption of best practices while ensuring teams own their telemetry.

✅ Design the processes, playbooks, checklists, and automations that you and other engineers will follow during an incident.

✅ Interact with members from almost all teams across the business to understand their monitoring, alerting, and SLO / SLA requirements and design systems and processes that ensure we meet or exceed these requirements. Influence architectural decisions during initial design stages to ensure resiliency and scale at the outset of software development.

✅ Leverage Infrastructure-as-Code (IaC) to provision and manage monitoring tools, alerting rules, and our observability configurations across OTEL Pipelines.

✅ Design base-level requirements for new and existing services to ensure that all client infrastructure and code are monitored consistently and accurately at a basic level.

✅ Take full ownership of client infrastructure reliability, ensuring adherence to key availability and security KPIs.

This would make you the ideal candidate

✨Bachelor’s Degree in Computer Science, Engineering, or a related field.

✨8+ Years of experience working as a Site Reliability Engineer or in a very similar role, with a focus on observability.

✨5+ Years of experience working with cloud (AWS).

✨5+ Years of experience working with IaC tools (Terraform) and GitOps CI/CD solutions (ArgoCD, GitHub Actions, or similar).

✨4+ Years of experience working with monitoring and logging tools such as Grafana, Prometheus, Loki, New Relic, or Datadog, including experience managing observability pipelines at scale in high-throughput environments.

✨4+ Years of experience working in Kubernetes, including its core components, deployment methodologies, and monitoring best practices.

✨Strong communication skills, both technical and nontechnical, for working with team members and stakeholders.

✨Strong scripting abilities (Python, Go, or similar) for automating observability tasks.

✨Experience integrating incident management platforms (PagerDuty, Jira) with automated alerting workflows.

✨Advanced English Level is required for this role as you will work with US clients. Effective communication in English is essential to deliver the best solutions to our clients and expand your horizons.

What to expect from our hiring process

1. Let’s chat about your experience!

2. Impress our recruiters, and you’ll move on to a technical interview with our top developers.

3. Nail that, and you’ll meet our client - your final step to joining our amazing team!

🎯 At Nearsure, we’re dedicated to solving complex business challenges through cutting-edge technology and we believe in the power of tailored solutions. Whether you are passionate about transforming businesses with Generative AI, building innovative software products, or implementing comprehensive enterprise platform solutions, we invite you to be part of our dynamic team!

We would love to hear from you if you are eager to make an impact and join a collaborative team that values creativity and expertise.

Let’s work together to shape the future of technology!

🧑‍💻 Apply now!

By applying to this position, you authorize Nearsure to collect, store, transfer, and process your personal data in accordance with our Privacy Policy. For more information, please review our Privacy Policy.
