Intersystems Group Logo

Major Incident Lead

Job Description

Overview

We are seeking an experienced Major Incident Lead – Site Reliability to join our Managed Services team. This role is responsible for leading the response to high-severity, customer-impacting incidents across InterSystems’ managed services platforms. Acting as the Incident Commander, the role ensures rapid service restoration, clear and confident stakeholder communication, and disciplined coordination across SRE, engineering, support, cloud, and service delivery teams.

Operating within an SRE-aligned service model , the Major Incident Lead focuses on protecting service reliability through the effective use of service level indicators and service level objectives, prioritizing customer impact reduction over root cause analysis during live incidents. Beyond incident response, the role drives post-incident reviews , turning operational failures into measurable reliability improvements and reduced repeat incidents.

This position is critical to maintaining customer trust, platform resilience, and operational excellence in a 24x7, mission-critical, and highly regulated environment.

Key Responsibilities

  • Lead end-to-end management of P1 and P2 major incidents affecting InterSystems managed services customers; Serve as the single point of coordination for major incidents
  • Deliver timely, accurate and audience-appropriate communications to customers, partners, internal leadership, service delivery, implementation, support and customer success teams
  • Manage executive-level updates during prolonged or high-risk incidents
  • Lead post-incident reviews and root cause analysis (RCA)
  • Ensure corrective and preventative actions are clearly defined, owned and tracked; while identifying trends, systemic risks and recuring issues for automation opportunities and improved tooling
  • Contribute to continuous improvement of incident processes, tooling and runbooks
  • Ensure incidents are managed in line with contractual SLAs and regulatory requirements; maintain accurate incident documentation and audit-ready records
  • Maintain and improve major incident playbooks and escalation paths; participate in on-call rotations and incident simulations
  • Support customer service reviews and contractual discussions when required

Experience/Qualification Required

  • 5+ years of experience leading major incidents in a managed services, service delivery, cloud or enterprise IT environment
  • Strong understanding of SRE principles, availability engineering, operational resilience practices and ITIL Incident, Problem, and Change Management
  • Experience operating in 24x7, mission-critical environments with incident and service management tools (ServiceNow, Jira, PagerDuty)
  • Understanding of services related to cloud-native platforms (AWS, Azure or GCP), observability solutions (Coralogix, Grafana, Prometheus, CloudWatch, Splunk)
  • Strong understanding of data platform operations, high availability and scaling in support of mission-critical systems such as InterSystems IRIS, HealthShare, Intellicare and Trakcare
  • Ability to lead under pressure and make clear, structured decisions; Reliability-first mindset
  • Excellent verbal and written communication skills
  • Strong stakeholder management, including senior leadership and customers
  • Bachelors or Masters degree in Computer Science, Engineering or related technical field

About InterSystems

InterSystems, a creative data technology provider, delivers a unified foundation for next-generation applications for healthcare, finance, manufacturing, and supply chain customers in more than 80 countries. Our data platforms solve interoperability, speed, and scalability problems for large organizations around the globe to unlock the power of data and allow people to perceive data in imaginative ways. Established in 1978, InterSystems is committed to excellence through its 24×7 support for customers and partners around the world. Privately held and headquartered in Boston, Massachusetts, InterSystems has 38 offices in 28 countries worldwide. For more information, please visit InterSystems.com.

Share this job:
Please let Intersystems Group know you found this job on Remote First Jobs 🙏

402 similar remote jobs

Explore latest remote opportunities and join a team that values work flexibility.

Remote companies like Intersystems Group

Find your next opportunity with companies that specialize in Information Technology Solutions, Airport Information Management, Flight Information Displays Systems, and Airport Advertising And Content Management Systems. Explore remote-first companies like Intersystems Group that prioritize flexible work and home-office freedom.

TEECOM Logo

TEECOM

51-200 teecom.com

A technology consulting engineering firm designing and implementing integrated technology systems globally.

24 open positions →
INTEGRITYOne Partners Logo

INTEGRITYOne Partners

Provides business and technology services for federal agencies, specializing in modernization and IT consulting.

2 open positions →
Zia Consulting Logo

Zia Consulting

Business transformation partners

View company profile →
Client Solution Architects Logo

Client Solution Architects

Provides integrated technology and operational support services to defense and federal government organizations.

View company profile →
Talent Right AUS Logo

Talent Right AUS

End-to-end recruitment and HR services for small and medium-sized Australian businesses.

View company profile →
Echo360 Logo

Echo360

A Learning Transformation Platform for education and business, offering video, engagement, and assessment solutions.

3 open positions →

Project: Career Search

Rev. 2026.2

[ Remote Jobs ]
Direct Access

We source jobs directly from 21,000+ company career pages. No intermediaries.

01

Discover Hidden Jobs

Unique jobs you won't find on other job boards.

02

Advanced Filters

Filter by category, benefits, seniority, and more.

03

Priority Job Alerts

Get timely alerts for new job openings every day.

04

Manage Your Job Hunt

Save jobs you like and keep a simple list of your applications.

21,000+ SOURCES UPDATED 24/7
Apply