Principal Cloud Platform Engineer - Core Infra

at Sift
  • Remote - United States

Remote

DevOps

Principal

Job description

The Core Platform team at Sift builds and evolves the foundational systems that power our online services. As a Principal Engineer, you’ll play a critical leadership role in shaping the technical strategy, architecture, and direction of our infrastructure and services platform. You’ll drive initiatives that enhance the availability, reliability, scalability, and performance of our systems—ensuring they are resilient, secure, and aligned with the growing needs of our customers and business.

This role is ideal for a hands-on engineering leader who thrives in solving complex distributed systems challenges, mentors others, and influences both technology and organizational strategy across teams.

What you’ll do

  • Provide technical leadership and vision for Sift’s online infrastructure—ensuring it is highly available, performant, and scalable.

  • Drive architecture and design of immutable, fault-tolerant, multi-region infrastructure and services.

  • Lead the implementation of sophisticated multi-region deployments (e.g., BigTable clusters with regional routing strategies) to meet global customer needs.

  • Solve high-scale, high-throughput problems requiring deep understanding of messaging systems, distributed data stores, and real-time computation.

  • Guide improvements to developer workflows, CI/CD pipelines, and local development environments to streamline efficiency across teams.

  • Architect and build robust internal libraries and platforms for interacting with our core systems—data stores, messaging layers, and infrastructure services.

  • Develop proactive monitoring and self-healing systems to improve the resilience of critical services.

  • Act as a strategic advisor to engineering teams, providing deep technical guidance on data architecture, service optimization, caching strategies, and scalability planning.

  • Participate in and help evolve our on-call strategy, ensuring rapid and effective incident response while reducing long-term operational toil.

  • Mentor and coach senior engineers across teams, driving engineering excellence and knowledge sharing.

Technical Stack: GCP, AWS, Terraform, Kubernetes, Vault, Jenkins, Kafka, Airflow, Snowflake, Spark, Java 11, Python 3, Ruby 2.7, Ruby on Rails

What makes you a strong fit

You are a systems thinker who thrives in tackling technical complexity and aligning infrastructure investments with business needs. You’re passionate about building reliable, high-scale systems, and you’re equally committed to lifting those around you—acting as a multiplier across teams. Your communication skills, engineering rigor, and product-minded approach help you build trust and drive initiatives from conception to execution.

Key qualifications

  • 10+ years of experience in Software Engineering, SRE, or Infrastructure roles, with a demonstrated focus on distributed systems and platform-level challenges.

  • Deep expertise in designing, scaling, and operating cloud-native systems on AWS or GCP.

  • Proven ability to architect infrastructure as code with tools like Terraform or CloudFormation.

  • Advanced programming skills in languages such as Java, Python, or Scala.

  • Extensive experience with messaging systems (e.g., Kafka) and distributed databases (e.g., BigTable, Snowflake).

  • Strong knowledge of containerization and orchestration technologies such as Kubernetes.

  • A track record of reducing operational complexity through automation, observability, and self-healing systems.

  • Experience influencing architectural decisions across multiple engineering teams.

  • Excellent collaboration and communication skills, with a history of mentoring and cross-functional leadership.

Benefits and perks:

  • Competitive total compensation package

  • 401k plan

  • Medical, dental, and vision coverage

  • Wellness reimbursement

  • Education reimbursement

  • Flexible time off

Our interview process

  • Introduction interview: a 30-minute session with a recruiter to discuss your background and the role.

  • Hiring Manager interview: a 60-minute interview with the hiring manager to explore your fit for the position.

  • Virtual onsite loop with the team: a comprehensive session comprising four interviews lasting approximately 4 hours, covering system design, coding abilities, deep dive, and values and behavior-based conversations.

During these sessions, you will have the opportunity to learn about company culture, meet engineers or peers from your team, and discuss distributed system problems. You will have time for interesting questions and gain transparency regarding your future responsibilities and the project.

A little about us

Sift is the AI-powered fraud platform securing digital trust for leading global businesses. Our deep investments in machine learning and user identity, a data network scoring 1 trillion events per year, and a commitment to long-term customer success empower more than 700 customers to grow fearlessly. Brands including DoorDash, Yelp, and Poshmark rely on Sift to unlock growth and deliver seamless consumer experiences. Visit us at sift.com and follow us on LinkedIn.

Share this job:
Please let Sift know you found this job on Remote First Jobs 🙏

Benefits of using Remote First Jobs

Discover Hidden Jobs

Unique jobs you won't find on other job boards.

Advanced Filters

Filter by category, benefits, seniority, and more.

Priority Job Alerts

Get timely alerts for new job openings every day.

Manage Your Job Hunt

Save jobs you like and keep a simple list of your applications.

Search remote, work from home, 100% online jobs

We help you connect with top remote-first companies.

Search jobs

Hiring remote talent? Post a job

Frequently Asked Questions

What makes Remote First Jobs different from other job boards?

Unlike other job boards that only show jobs from companies that pay to post, we actively scan over 20,000 companies to find remote positions. This means you get access to thousands more jobs, including ones from companies that don't typically post on traditional job boards. Our platform is dedicated to fully remote positions, focusing on companies that have adopted remote work as their standard practice.

How often are new jobs added?

New jobs are constantly being added as our system checks company websites every day. We process thousands of jobs daily to ensure you have access to the most up-to-date remote job listings. Our algorithms scan over 20,000 different sources daily, adding jobs to the board the moment they appear.

Can I trust the job listings on Remote First Jobs?

Yes! We verify all job listings and companies to ensure they're legitimate. Our system automatically filters out spam, junk, and fake jobs to ensure you only see real remote opportunities.

Can I suggest companies to be added to your search?

Yes! We're always looking to expand our listings and appreciate suggestions from our community. If you know of companies offering remote positions that should be included in our search, please let us know. We actively work to increase our coverage of remote job opportunities.

How do I apply for jobs?

When you find a job you're interested in, simply click the 'Apply Now' button on the job listing. This will take you directly to the company's application page. We kindly ask you to mention that you found the position through Remote First Jobs when applying, as it helps us grow and improve our service 🙏

Apply