Gather AI Logo

Cloud Engineer Platform & Infrastructure

Job Description

About Us

Are you ready to build the future of supply chain? At Gather AI, we’re not just creating software; we’re pioneering a new era of warehouse intelligence. We’ve developed a groundbreaking, vision-powered platform that uses autonomous drones and existing equipment to capture real-time data, completely digitizing workflows that have historically been manual and error-prone. This means facilities operate smarter, safer, and more efficiently, ultimately redefining “on-time, in full” delivery.

If you’re looking for an opportunity to contribute to truly transformative technology and make a significant impact in a vital industry, Gather AI is the place for you. We’re leading the charge in the rapidly evolving robotics industry, and we invite you to join us in reshaping the global supply chain, one intelligent warehouse at a time.

About the Team

This role sits within the Backend and Platform Engineering organization. You’ll work day-to-day alongside the Fullstack Engineering team, ensuring application services have the cloud infrastructure they need to scale safely and deploy reliably. You’ll also partner closely with the ML Systems Engineering (Ops) team, enabling the infrastructure capabilities required for production ML pipelines, model serving, and data workloads. Cross-functionally, you’ll collaborate with QA, Release Engineering, and Platform and Security stakeholders to ensure cloud environments support stable testing pipelines, access control, secrets management, and operational governance.

About the Role

We are looking for a Cloud Engineer (Platform & Infrastructure) to help mature our cloud operations into a structured and scalable platform. Our foundational infrastructure is already in place and actively supporting production workloads, but many current practices evolved organically during earlier growth stages. Rather than building from scratch, you’ll evolve an existing production environment by introducing stronger operational patterns, improving deployment safety, and ensuring our infrastructure layer reliably supports increasing system scale. This role offers meaningful ownership of the infrastructure backbone supporting a platform that combines real-time application systems with machine learning workloads, and the opportunity to influence how systems are deployed, operated, and scaled as the organization grows.

What You’ll Do

  • Review and rationalize current Azure and AWS environments, identifying configuration drift, security gaps, and operational inconsistencies, and establish clear configuration standards across cloud accounts
  • Introduce repeatable Infrastructure-as-Code patterns to ensure cloud resources are provisioned, versioned, and audited through automated workflows
  • Strengthen CI/CD pipelines for infrastructure and application deployment to reduce manual operations and increase release safety across both application services and ML workloads
  • Establish consistent logging, metrics, and alerting practices across infrastructure and container workloads to improve operational visibility
  • Audit and improve cloud security practices including IAM policies, secrets management, network segmentation, and operational access controls
  • Evaluate current infrastructure architecture and introduce patterns that enable workloads to operate portably across both Azure and AWS environments
  • Improve Kubernetes platform reliability by refining autoscaling policies, workload isolation, and cluster lifecycle management
  • Partner with Fullstack and ML teams to reduce infrastructure friction around environments, networking, and resource provisioning

What You’ll Need

  • Bachelor’s degree in Computer Science, Engineering, or a related field
  • 5+ years of experience operating production cloud infrastructure at scale
  • Deep experience with at least one major cloud provider (Azure or AWS) and working familiarity with the other
  • Hands-on experience with Kubernetes and Docker for running containerized workloads in production environments
  • Proficiency with Terraform or equivalent Infrastructure-as-Code tooling for provisioning and managing cloud infrastructure
  • Experience implementing automated deployment pipelines using tools such as GitHub Actions, GitLab CI, or similar platforms
  • Strong operational mindset with a focus on reliability, automation, and clear technical documentation

Nice to Have

  • Experience with observability tooling such as Prometheus, ELK, OpenTelemetry, or similar logging, metrics, and monitoring systems
  • Familiarity supporting ML infrastructure workloads including pipeline orchestration, model deployment, and scalable inference environments
  • Experience working in logistics, robotics-adjacent platforms, or real-time distributed systems
  • Track record of translating application requirements into secure, reliable, and operationally safe infrastructure architecture
  • Exposure to cloud cost visibility and optimization practices
  • Experience introducing infrastructure governance standards including templates, security baselines, and operational documentation
Share this job:
Please let Gather AI know you found this job on Remote First Jobs 🙏

4886 similar remote jobs

Explore latest remote opportunities and join a team that values work flexibility.

Remote companies like Gather AI

Find your next opportunity with companies that specialize in Artificial Intelligence, Intralogistics, Logistics, and Supply Chain. Explore remote-first companies like Gather AI that prioritize flexible work and home-office freedom.

project44 Logo

project44

501-1000 project44.com

AI-powered platform provides real-time supply chain visibility, automation, and management.

View company profile →
Tomorrow.io Logo

Tomorrow.io

Our space-powered AI resilience platform helps organizations manage weather challenges and opportunities.

View company profile →
ISEE Logo

ISEE

51-200 www.isee.ai

Developing AI-powered autonomous yard trucks for global supply chain logistics

View company profile →
Entefy Logo

Entefy

An enterprise AI software and automation company focused on multisensory AI and digital transformation.

View company profile →
Nomagic Logo

Nomagic

51-200 nomagic.ai

Develops AI-powered pick-and-place robotic systems for e-commerce and warehouse automation.

View company profile →
VIZION Logo

VIZION

Provides API-based solutions for real-time container shipment tracking and global trade intelligence.

View company profile →

Project: Career Search

Rev. 2026.4

[ Remote Jobs ]
Direct Access

We source jobs directly from 21,000+ company career pages. No intermediaries.

01

Discover Hidden Jobs

Unique jobs you won't find on other job boards.

02

Advanced Filters

Filter by category, benefits, seniority, and more.

03

Priority Job Alerts

Get timely alerts for new job openings every day.

04

Manage Your Job Hunt

Save jobs you like and keep a simple list of your applications.

21,000+ SOURCES UPDATED 24/7
Apply