Site Reliability Engineer

Job description

We’re looking for highly motivated, passionate site reliability engineers to join our growing team. At evertz.io, our teams are building services that are used by the biggest names in the exciting broadcast and media industry. Our services are hosted in AWS, with a Serverless First mindset.

As part of this role you will work with our talented teams to help harden our multi-tenant SaaS platform. Using best in class observability tooling, you will be working to debug incidents, while also identifying and implementing improvements to the platform to ensure its continued reliability. Your drive to eliminate toil will see you automating processes and building the tools to do so.

We offer flexible working hours, great benefits, and the freedom to experiment with new technologies and tools to build better products.

Skills and Experience you will bring:

  • At least 3 years of hands-on experience managing critical, high-availability production infrastructure, demonstrating success in maintaining reliability and maximizing application uptime.
  • Proficient in at least one programming language (such as Python, Java, or Rust), with experience designing and building production-quality automation, tools, or software libraries.
  • At least 3 years working with monitoring, log aggregation, and observability platforms such as Datadog, CloudWatch, Honeycomb, Splunk, or New Relic, using data-driven insights to proactively identify and resolve issues.
  • Excellent analytical skills with the ability to understand end-to-end use cases, map system flows, debug complex issues, and anticipate potential failure points.
  • Proven track record translating SLO’s and SLI’s into actionable improvements. Reliability, monitoring, and observability are not just words to you.
  • At least 3 years of experience with cloud technologies, in particular AWS Services and tools such as Cloud Formation, Lambda, DynamoDB, SQS, SNS, EC2, S3, AWS CLI, Boto3.
  • Solid foundation in Linux systems administration, networking, and security.
  • Familiarity with the use and configuration of CI & CD pipelines such as Jenkins & AWS CodePipeline.

Additional skills and experience that will make you standout:

  • Experience architecting and deploying serverless applications in cloud environments.
  • Experience with infrastructure-as-code tools like Terraform or CloudFormation, enabling reproducible and scalable environments.
  • Previous participation in production on-call rotations, with direct involvement in incident management and post-incident reviews.
  • Demonstrated expertise in performance optimization for core AWS services, including Lambda, DynamoDB, API Gateway, SQS, EventBridge, and EC2.
  • Experience supporting and improving systems with frequent, high-velocity deployment cycles.
  • Familiarity with security compliance frameworks (e.g., OWASP, ISO, CSA, PCI), and hands-on experience conducting threat assessments and implementing remediation plans.
  • Background in security practices, including penetration testing, threat modeling, and usage of both open-source and commercial security tools.
  • Experience developing and implementing advanced deployment strategies for web application infrastructures—such as canary, A/B testing, blue/green deployments, or red/line patterns.
  • Hands-on experience with chaos engineering—intentionally testing systems under extreme conditions to improve reliability and fault tolerance.
  • Track record of championing system reliability, continuous improvement, and operational excellence throughout an organization.

Recruitment process:

  • Screening with recruiter (45 min)
  • Technical interview with Hiring Manager (60 min)

Please note, this email address will only respond to requests regarding privacy concerns. This inbox will not respond to job applications, resumes, or questions regarding an application. When you apply to a job on this site, the personal data contained in your application will be collected by Evertz Microsystems Ltd (“Controller”), which is located at 5292 John Lucas Drive, Burlington, Ontario, Canada and can be contacted by emailing [email protected]. Controller’s data protection officer is Nadiera Toolsieram, who can be contacted at [email protected]. Your personal data will be processed for the purposes of managing Controller’s and its’ subsidiaries’ and affiliates’ recruitment related activities, which include setting up and conducting interviews and tests for applicants, evaluating and assessing the results thereto, and as is otherwise needed in the recruitment and hiring processes. Such processing is legally permissible under Art. 6(1)(f) of Regulation (EU) 2016679 (General Data Protection Regulation) as necessary for the purposes of the legitimate interests pursued by the Controller, which are the solicitation, evaluation, and selection of applicants for employment.

A complete privacy policy can be found at https://evertz.com/contact/privacy/

Your personal data will be retained by Controller as long as Controller determines it is necessary to evaluate your application for employment. Under the GDPR, you have the right to request access to your personal data, to request that your personal data be rectified or erased, and to request that processing of your personal data be restricted. You also have to right to data portability. In addition, you may lodge a complaint with an EU supervisory authority.

Share this job:
Please let Evertz know you found this job on Remote First Jobs 🙏

Similar Remote Jobs

Benefits of using Remote First Jobs

Discover Hidden Jobs

Unique jobs you won't find on other job boards.

Advanced Filters

Filter by category, benefits, seniority, and more.

Priority Job Alerts

Get timely alerts for new job openings every day.

Manage Your Job Hunt

Save jobs you like and keep a simple list of your applications.

Search remote, work from home, 100% online jobs

We help you connect with top remote-first companies.

Search jobs

Hiring remote talent? Post a job

Frequently Asked Questions

What makes Remote First Jobs different from other job boards?

Unlike other job boards that only show jobs from companies that pay to post, we actively scan over 20,000 companies to find remote positions. This means you get access to thousands more jobs, including ones from companies that don't typically post on traditional job boards. Our platform is dedicated to fully remote positions, focusing on companies that have adopted remote work as their standard practice.

How often are new jobs added?

New jobs are constantly being added as our system checks company websites every day. We process thousands of jobs daily to ensure you have access to the most up-to-date remote job listings. Our algorithms scan over 20,000 different sources daily, adding jobs to the board the moment they appear.

Can I trust the job listings on Remote First Jobs?

Yes! We verify all job listings and companies to ensure they're legitimate. Our system automatically filters out spam, junk, and fake jobs to ensure you only see real remote opportunities.

Can I suggest companies to be added to your search?

Yes! We're always looking to expand our listings and appreciate suggestions from our community. If you know of companies offering remote positions that should be included in our search, please let us know. We actively work to increase our coverage of remote job opportunities.

How do I apply for jobs?

When you find a job you're interested in, simply click the 'Apply Now' button on the job listing. This will take you directly to the company's application page. We kindly ask you to mention that you found the position through Remote First Jobs when applying, as it helps us grow and improve our service 🙏

Apply