Site Reliability Engineer

  • Remote - Canada

Remote

DevOps

Mid-level

Job description

WELCOME to Aetion!We are one of the country’s leading science-driven technology companies using real-world evidence to provide innovative healthcare solutions. Our Aetion Evidence Platform is used to evaluate the safety, effectiveness and value of medications, delivering better outcomes to patients, medical professionals, and clients. We’ve partnered with top biopharma companies and are backed by leading venture capital firms to help increase our medical research and expand our product line. Aetion is headquartered in the US and has expanded throughout Europe with a Technology Hub in Barcelona.

Aetion and Aetion’s leadership are recipients of several prestigious awards:

  • Parity.org’s 2024 List of Best Companies for Equal Advancement Opportunities
  • Digital Health New York’s 2024 New York Digital Health 100
  • Newsweek’s World’s Best Digital Health Companies of 2024

Come join us!

PERKS of being an A-Teamer:

  • 25 vacations days
  • Daily in-office lunch stipend (and a fully stocked kitchen!)
  • Sabbatical opportunity after five years of employment
  • Commitment to professional development opportunities with access to Skillsoft learning experience platform
  • Employee-led initiatives including annual company-wide innovation day & DEI resource groups
  • Comprehensive private health coverage w/ out-of-network reimbursements options .
  • Peer & company recognition programs
  • Mental Health & Wellness Benefits
  • Monthly educational lunch & learn

Why join Aetion’s Tech Team?

  • You’ll collaborate with other engineering leaders on all matters that impact the Engineering team, including resourcing and building technology/product vision
  • You’ll have the opportunity to coach and mentor colleagues, including code reviews, higher-level software design, and direct management
  • The team works on a technical stack which includes both cloud and on-premise deployments, big-data ingestion and analytics, distributed systems, and algorithmic complexity.

DESCRIPTION:

As Site Reliability Engineer, you will be a critical member of Aetion’s engineering organization. As Aetion’s products continue to scale, we are writing the next chapter in our ability to innovate and operate with efficiency and maturity. You will be an instrumental part of us continuing down this path.

As a member of the Site Reliability Engineering team, you will own Aetion’s infrastructure which is cloud-based, containerized, and managed through Infrastructure as Code (IaC). You will be supporting day to day operations (such as provisioning infrastructure and providing production support) and helping with engineering projects (maturing our infrastructure and automation). The team has great things in place but also has strong ambitions about what else we want to achieve.

RESPONSIBILITIES:

Your duties will include, but are not limited to:

  • Perform delivery and production support tasks, including monitoring, troubleshooting, and resolving infrastructure and application issues to ensure system reliability and uptime.
  • Continually streamline automation and processes to improve operational maturity and efficiency.
  • Provision, configure, and maintain Aetion’s infrastructure with a focus on simplicity, innovation, automation, reliability, scalability, security, cost-effectiveness, and ease of support.
  • Build and maintain Aetion’s development and deployment pipelines, supporting CI/CD and long-term-stable testing and release cycles.
  • Collaborate with cross-functional teams to provide timely and effective production support, ensuring a seamless experience for end-users and internal stakeholders.
  • Develop automation frameworks to support other development teams and reduce manual intervention in operational tasks.
  • Effectively contribute to complex engineering projects while balancing operational responsibilities.

QUALIFICATIONS:

Required:

Education:

  • Bachelor’s Degree in Computer Science, Engineering, or a related field, or equivalent experience.

Experience:

  • Systems Engineering/DevOps/Distributed Systems: Minimum 5+ years of experience in Systems Engineering, DevOps, or developing distributed systems with strong knowledge of cloud architecture, particularly AWS and Kubernetes.
  • Software development: 5+ years of experience in software development with proficiency at least in Python or Java. Proficiency in unix based shell scripting is required. JavaScript and TypeScript are a plus.
  • Infrastructure as Code (IaC) & CI/CD Tools: 3+ years with tools and languages such as Pulumi, Terraform, Ansible, or GitHub Actions (GHA).
  • Cloud Providers: 5+ years with cloud platforms (AWS required, GCP nice to have).
  • Containerized Workloads: 3+ years with Docker and Kubernetes. Experience with orchestration tools such as Concord and Karpenter is a plus.
  • Security Engineering: Experience with proactive threat prevention, incident response, and implementing compliance programs.
  • Databases: Experience working with SQL databases, big data platforms, and supporting big data pipelines.
  • Linux Systems: In-depth experience solving complex issues on Linux systems and/or within the JVM.

Skills:

  • Empathy for end-users and a strong service mindset when supporting day-to-day operations, ensuring a positive experience for stakeholders.
  • Strong understanding of cloud infrastructure design with a focus on security, reliability, and scalability.
  • Detailed knowledge of configuration, implementation, and maintenance of CI/CD pipelines and tooling (e.g., GitHub Actions or Jenkins).
  • Strong English language skills, both written and verbal, with the ability to communicate effectively across teams (e.g., commercial and science/analytics teams).
  • Ability to prioritize, communicate effectively, design for repeatability and scalability, exude ownership, and dig beneath the hood with technology.
  • Flexibility to improve existing systems and innovate on new capabilities.
  • Collaborative, open-minded, and able to quickly grasp complex concepts to contribute to the team’s overall effectiveness.

Preferred:

Experience with:

  • Debugging, tracing, and profiling Java applications.
  • Provisioning and operating SQL databases and big data platforms (Spark).
  • The healthcare or banking industry, or other fields where information security is a concern.
  • Privacy (HIPAA, GDPR) and security (SOC 2, Hitrust) certifications.
  • Google Cloud Platform.
  • Lean and agile ways of working.

About Aetion’s Site Reliability Engineering Team:

  • Presentation at AWS Re:Invent 2018.
  • Architecture Video with AWS.
  • Blog Post on AWS.

Aetion is an Equal Opportunity Employer. Aetion is committed to being an employer of choice, not just a good place to work, but a great and inclusive place to work. To that end, we strive to recruit and maintain a workforce that meaningfully represents the diverse and culturally rich communities that we serve. Qualified applicants will receive consideration for employment without regard to their race, color, religion, national origin, sex, sexual orientation, gender identity, protected veteran status or disabled status or genetic information.

Share this job:
Please let Aetion know you found this job on Remote First Jobs 🙏

Benefits of using Remote First Jobs

Discover Hidden Jobs

Unique jobs you won't find on other job boards.

Advanced Filters

Filter by category, benefits, seniority, and more.

Priority Job Alerts

Get timely alerts for new job openings every day.

Manage Your Job Hunt

Save jobs you like and keep a simple list of your applications.

Search remote, work from home, 100% online jobs

We help you connect with top remote-first companies.

Search jobs

Hiring remote talent? Post a job

Frequently Asked Questions

What makes Remote First Jobs different from other job boards?

Unlike other job boards that only show jobs from companies that pay to post, we actively scan over 20,000 companies to find remote positions. This means you get access to thousands more jobs, including ones from companies that don't typically post on traditional job boards. Our platform is dedicated to fully remote positions, focusing on companies that have adopted remote work as their standard practice.

How often are new jobs added?

New jobs are constantly being added as our system checks company websites every day. We process thousands of jobs daily to ensure you have access to the most up-to-date remote job listings. Our algorithms scan over 20,000 different sources daily, adding jobs to the board the moment they appear.

Can I trust the job listings on Remote First Jobs?

Yes! We verify all job listings and companies to ensure they're legitimate. Our system automatically filters out spam, junk, and fake jobs to ensure you only see real remote opportunities.

Can I suggest companies to be added to your search?

Yes! We're always looking to expand our listings and appreciate suggestions from our community. If you know of companies offering remote positions that should be included in our search, please let us know. We actively work to increase our coverage of remote job opportunities.

How do I apply for jobs?

When you find a job you're interested in, simply click the 'Apply Now' button on the job listing. This will take you directly to the company's application page. We kindly ask you to mention that you found the position through Remote First Jobs when applying, as it helps us grow and improve our service 🙏

Apply