Job Description

Company Description

As Hungary’s most attractive employer in 2025 (according to Randstad’s representative survey), Deutsche Telekom IT Solutions is a subsidiary of the Deutsche Telekom Group. The company provides a wide portfolio of IT and telecommunications services with more than 5300 employees. We have hundreds of large customers, corporations in Germany and in other European countries.

DT-ITS recieved the Best in Educational Cooperation award from HIPA in 2019, acknowledged as the the Most Ethical Multinational Company in 2019. The company continuously develops its four sites in Budapest, Debrecen, Pécs and Szeged and is looking for skilled IT professionals to join its team.

Job Description

General description/ Purpose

NVIDIA and Deutsche Telekom are jointly developing the world’s first industrial AI cloud for European manufacturers. This AI factory in Germany will host 10,000 GPUs across NVIDIA DGX B200 systems and RTX Pro Servers. Deutsche Telekom provides secure, sovereign and fast infrastructure, including data centers, operations, security, and AI solutions.

Role Overview

We are seeking a Platform Engineer to build, automate, and operate the platform services of the Industrial AI Cloud. This role focuses on running and evolving large-scale Kubernetes clusters, container orchestration platforms, CI/CD pipelines, GitOps and associated automation to support AI workloads. Experience with Infrastructure as Code. You’ll be part of the team supporting Kubernetes workloads, ensuring smooth operations and continuous improvement of the platform layer, while collaborating with infrastructure, security, and AI teams.

Key Responsibilities

  • Operate & Evolve Kubernetes Platform: Build, configure, and maintain bare metal hosts and Kubernetes clusters to run GPU/AI workloads.
  • Design & Operate NVIDIA AI related software stack(Slurm, Run AI)
  • Provide customized application support for AI related workloads
  • Container Orchestration & Automation: Manage Helm charts, GitOps workflows, Ansible scripts, possibly Terraform code and automation for deploying services and AI workloads.
  • Operate Kubernetes Workloads: Act as primary contact for all Kubernetes-related topics, including troubleshooting, performance tuning, and scaling.
  • CI/CD & GitOps: Develop and maintain CI/CD pipelines with Jenkins and GitLab; implement GitOps practices for consistent deployments and infrastructure changes. Terraform basics.
  • Monitoring & Observability: Operate and enhance Prometheus and Grafana monitoring stacks for bare metal hosts, Kubernetes and platform services.
  • Container Images & Registries: Build, optimize and secure container images (Docker, Podman); manage registries and versioning, image scanning (Trivy).
  • Object Storage & Persistent Volumes: Integrate and maintain object storage solutions for AI workloads.
  • Run AI & HPC Workloads: Support and operate distributed AI workloads within bare metal hosts and Kubernetes environments.
  • Collaboration with Infrastructure & AI Teams: Coordinate closely with Infrastructure Engineers, data center staff and AI developers to ensure smooth delivery of services.
  • ITIL Processes: Follow incident, problem, and change management workflows; create and maintain operational runbooks. Adhere to ZERO outage guidelines.

What We Offer

  • Work on Europe’s first industrial AI cloud with cutting-edge technologies.
  • Direct collaboration with NVIDIA and Deutsche Telekom experts.
  • Hybrid working model, training opportunities, and career progression.

Qualifications

Required Skills and Qualifications

  • Kubernetes Certified Administrator (CKA) or equivalent experience in production environments. CKS advantage.

  • NVIDIA GPU-Accelerated server platform knowledge

  • Data Engineering, Data Transformation, Data Migration tools knowledge

  • Knowledge of Nvidia AI software stack related to GPU orchestration

  • GPU based Cloud platform software stack knowledge incl. its dependencies on below layers

  • Strong experience with CI/CD tools (Jenkins, GitLab) and GitOps practices.

  • Proficiency with Helm charts and Kubernetes resource management.

  • Scripting/programming in Python or Bash; Infrastructure-as-Code with Terraform and Ansible.

  • Experience with container images (Docker, Podman) and image scanning.

  • Familiarity with object storage systems and persistent volume management.

  • Knowledge of monitoring and observability tools (Prometheus, Grafana).

  • Understanding of running AI/HPC workloads at scale.

  • Strong troubleshooting and operational support skills in mission-critical environments.

Preferred Attributes

  • Ability to automate repetitive operational tasks and build self-service capabilities for developers.
  • Security-conscious attitude in day-to-day operations.
  • Excellent communication and cross-team coordination skills.

Additional Information

You will be working in the European Union to meet our customers’ data security and privacy requirements.

\* Please be informed that our remote working possibility is only available within Hungary due to European taxation regulation.

Share this job:
Please let Deutsche Telekom IT Solutions HU know you found this job on Remote First Jobs 🙏

5822 similar remote jobs

Explore latest remote opportunities and join a team that values work flexibility.

Remote companies like Deutsche Telekom IT Solutions HU

Find your next opportunity with companies that specialize in Serv. Management (sem), It Operations (ito), Telecommunications (tc), and Partner Mng. & Enduser Services (pme). Explore remote-first companies like Deutsche Telekom IT Solutions HU that prioritize flexible work and home-office freedom.

Instrumental Group Logo

Instrumental Group

A digital agency providing marketing, sales, and customer service solutions, specializing in HubSpot implementations.

View company profile →
Twelve Consulting Group Logo

Twelve Consulting Group

Implements EPM platforms and cloud-based business planning and forecasting solutions.

View company profile →
Kixie Logo

Kixie

51-200 kixie.click

An all-in-one calling and texting platform for revenue teams.

View company profile →
VRP Consulting Logo

VRP Consulting

A global full-service Salesforce consulting, development, and outsourcing partner.

View company profile →
Pierce Washington Logo

Pierce Washington

Transforming quote-to-cash processes and delivering Total Commerce solutions for enterprise businesses.

View company profile →
e2open Logo

e2open

1001-5000 www.e2open.com

A connected supply chain software platform for managing, moving, and selling goods and services.

View company profile →

Project: Career Search

Rev. 2026.5

[ Remote Jobs ]
Direct Access

We source jobs directly from 21,000+ company career pages. No intermediaries.

01

Discover Hidden Jobs

Unique jobs you won't find on other job boards.

02

Advanced Filters

Filter by category, benefits, seniority, and more.

03

Priority Job Alerts

Get timely alerts for new job openings every day.

04

Manage Your Job Hunt

Save jobs you like and keep a simple list of your applications.

21,000+ SOURCES UPDATED 24/7
Apply