TechBiz Global Logo

Platform Engineer

Job Description

Description

At TechBiz Global, we are providing recruitment service to our TOP clients from our portfolio. We are currently seeking a Platform Engineerto join one of our clients’ teams. If you’re looking for an exciting opportunity to grow in a innovative environment, this could be the perfect fit for you.

As a Platform Engineer, you will play a key role in the infrastructure team, helping to own, operate, and evolve the confidential compute platform that powers a secure, developer-facing AI API.

The role is ideal for a hands-on, infrastructure-focused and security-minded engineer who enjoys working across Kubernetes, cloud infrastructure, on-premise systems, observability, automation, and low-level platform hardening.

Responsibilities:

  • Kubernetes Ownership: Be the Kubernetes maintainer. Own cluster operations end-to-end: upgrades, capacity, networking, RBAC, manifests/Helm charts, ongoing operations and maintenance, with operational and governance guidance from the lead. You make the calls; the lead is the sounding board, not the bottleneck.

  • Cloud Infrastructure: Run and evolve infrastructure on AWS, GCP, Azure, and others, leveraging each cloud’s confidential compute primitives (Nitro Enclaves, Confidential VMs, Confidential Space) to scale the API and to host adjacent, non-API workloads.

  • On-Premise Operations: Operate and extend our on-premise confidential compute footprint: bare-metal provisioning, network topology, hardware lifecycle.

  • Observability: Own observability for the on-premise side of the platform. Design and operate the metrics, logs, and traces pipeline for bare-metal and CVM workloads where standard cloud-native observability falls short. Make the on-prem fleet as legible as a managed cluster.

  • Infrastructure as Code: Maintain provisioning and configuration via Ansible and IaC (Terraform or equivalent): reproducible, code-reviewed, no snowflake servers.

  • Platform Hardening: Harden the platform: OS hardening, network segmentation, secret rotation, and reproducible images.

  • Backend & API Development: Write and maintain backend services and the API gateway in Node.js / TypeScript. Contribute to performance-critical enclave-side code (we use Rust, willingness to learn is fine if you bring strong systems fundamentals).

  • Attestation & Cryptography: Contribute to the attestation pipeline and the cryptographic plumbing that connects clients, enclaves, and our API gateway.

  • Runbook Discipline: Enforce the runbook discipline: Every recurring operation, every incident class, every non-obvious recovery procedure gets a runbook. You write them, you keep them current, you push back when work ships without one. If something breaks at 3am, the next person on call does not guess.

  • Developer Experience: Help define SLOs and engage with developer users: triage SDK issues, reproduce bugs, ship fixes.

Requirements

  • Linux: Strong Linux sysadmin background. You are comfortable on the shell, you understand systemd, networking (iptables/nftables, TLS, routing), file systems, and how to diagnose a misbehaving Linux box without panicking.

  • Kubernetes: Production Kubernetes experience as a maintainer, not a user. You have upgraded clusters, debugged CNI/DNS issues, written manifests/Helm charts, set up RBAC and network policies, and recovered a cluster that didn’t want to be recovered.

  • Bare-Metal / On-Premise: Some bare-metal or on-premise experience. You’ve racked servers, configured IPMI, set up network gear, or worked in a colocation environment, even if it’s not your primary background.

  • Cloud: Proficiency in at least one of AWS, GCP, or Azure. AWS and experience with Nitro Enclaves strongly recommended.

  • IaC: Configuration and IaC: Ansible and Terraform.

  • Node.js / TypeScript: Comfortable building and publishing SDKs, writing servers, debugging async code, dealing with packaging (npm, semver).

  • Observability: Observability fluency, especially for environments where you don’t get a managed control plane for free: Prometheus, Grafana, Loki/Tempo, OpenTelemetry, or equivalents.

  • CI/CD: GitHub Actions, Buildkite, or equivalent.

  • Security Mindset: You think about threat models. You don’t paste secrets into Slack. You read code with “what could go wrong here” running in the background.

Nice to have:

  • Rust: experience writing, reading, and debugging Rust, or strong systems programming background (C, C++, Go) with willingness to learn Rust. Our enclave-side performance-critical code is Rust.

  • Hands-on experience with confidential computing internals: Intel TDX, AMD SEV-SNP, AWS Nitro Enclaves, and/or similar TEE technology.

  • General understanding of applied cryptography.

  • Experience operating LLM inference workloads (vLLM, TensorRT-LLM, GPU scheduling).

  • Familiarity with the OpenAI API surface and the ecosystem of clients/SDKs built around it.

  • Reproducible builds, signed releases (Sigstore/cosign), supply chain security.

  • Open-source contributions you can point to.

Share this job:
Please let TechBiz Global know you found this job on Remote First Jobs 🙏

18059 similar remote jobs

Explore latest remote opportunities and join a team that values work flexibility.

Remote companies like TechBiz Global

Explore remote-first companies similar to TechBiz Global. Discover other top-rated employers that offer flexible schedules and work-from-anywhere options.

Hatch IT Logo

Hatch IT

IT consulting and professional services for software development, cybersecurity, and data solutions.

View company profile →
HappyFunCorp Logo

HappyFunCorp

Product development consultancy designing and building digital products and software

View company profile →
Test Double Logo

Test Double

A software consulting agency providing senior developer and product consultants to client teams.

View company profile →
Intellectsoft Logo

Intellectsoft

Digital transformation and software engineering company serving global organizations and technology startups since 2007.

View company profile →
Thaloz Logo

Thaloz

51-200 thaloz.com

A nearshore tech partner specializing in building and augmenting tech teams with Latin American talent for global clients.

View company profile →
Lightci (Light Consulting) Logo

Lightci (Light Consulting)

51-200 lightci.com

Designs, builds, and launches software, offering engineering recruitment, platform modernization, and AI integration.

View company profile →

Project: Career Search

Rev. 2026.6

[ Remote Jobs ]
Direct Access

We source jobs directly from 21,000+ company career pages. No intermediaries.

01

Discover Hidden Jobs

Unique jobs you won't find on other job boards.

02

Advanced Filters

Filter by category, benefits, seniority, and more.

03

Priority Job Alerts

Get timely alerts for new job openings every day.

04

Manage Your Job Hunt

Save jobs you like and keep a simple list of your applications.

21,000+ SOURCES UPDATED 24/7
Apply