The Codest Logo

RL Environments Research Engineer

Job Description

Description

🌍 Hello World!

We are The Codest -  International Tech Software Company with tech hubs in Poland delivering global IT solutions and projects. Our core values lie in “Customers and People First” approach that prioritises the needs of our customers and a collaborative environment for our employees, enabling us to deliver exceptional products and services.

Our expertise centers on web development, cloud engineering, DevOps and quality.  After many years of developing our own product - Yieldbird, which was honored as a laureate of the prestigious Top25 Deloitte awards, we arrived at our mission: to help tech companies build impactful product and scale their IT teams through boosting IT delivery performance. Through our extensive experience with product development challenges, we have become experts in building digital products and scaling IT teams.

But our journey does not end here - we want to continue our growth. If you’re goal-driven and looking for new opportunities, join our team! What awaits you is an enriching and collaborative environment that fosters your growth at every step.

We are currently looking for a RL Environments Research Engineers.

Our client builds reinforcement learning environments and training tasks for frontier AI labs. The work is technical, research-adjacent, and hands-on. We’re not looking for web developers or backend engineers who have used LLM APIs.

💡 Key Responsibilities:

  • Design and build MLE/SWE environments and diverse tasks.

  • Target a specified language model and satisfy the required difficulty distribution.

Requirements

✅ The right candidates have:

  • Experience with PyTorch or JAX at the framework level (not just importing a model)

  • Familiarity with RL concepts: reward functions, environment design, training loops, evaluation

  • Ability to read ML papers and implement them. This is a core part of the job. If someone hasn’t reproduced or extended a research result, they’ll struggle here.

  • Production Python skills: Docker, git, clean code, reproducible environments. Notebooks-only people won’t work.

  • Exposure to any of: model training/finetuning, inference optimization, CUDA/Triton kernels, distributed training, model internals (attention, KV caches, tokenizers)

Nice to have but not required:

  • Publications or competitive programming background

  • Experience with MuJoCo, game environments, or simulation frameworks

  • Scientific computing (Rust, C++, numerical methods)

🚫 Profiles that don’t fit:

  • Web/backend engineers whose AI experience is limited to calling LLM APIs, building RAG pipelines, or prompt engineering

  • Data engineers or data scientists who work in notebooks and dashboards

  • DevOps/infra engineers without ML depth

✅ The simplest test: have you ever trained a model from scratch or built something where a model learns from an environment?

📜 Our Promise (what you can expect from us):

  • 34 - 44k PLN (B2B/useme)

  • 100% remote work (but we have offices in Krakow and Warsaw and we’re happy to meet there from time to time 😉)

  • 300 PLN to use on our benefits platform, Worksmile - gift cards, medical services, sports, etc.

  • Our B2B contract contains provisions that allow you to obtain IP BOX support

  • Integration events, education opportunities and much more…

  • A unique opportunity to take your career to the next level - we’re looking for people who want to create an impact. You have ideas, we want to hear them!

Questions, insights? Feel free to reach out to our recruiting team:

[email protected]

In the meantime, feel free to visit our website where you can find key facts about us.

Share this job:
Please let The Codest know you found this job on Remote First Jobs 🙏

38 similar remote jobs

Explore latest remote opportunities and join a team that values work flexibility.

Remote companies like The Codest

Find your next opportunity with companies that specialize in Outsourcing It, Java, Php, and Python. Explore remote-first companies like The Codest that prioritize flexible work and home-office freedom.

Devsu Logo

Devsu

An AI-native technology partner offering software delivery, application modernization, and staff augmentation services.

View company profile →
Test Double Logo

Test Double

A software consulting agency providing senior developer and product consultants to client teams.

View company profile →
Inventive Works, LLC Logo

Inventive Works, LLC

Custom software applications and cloud migration services for businesses of all sizes.

View company profile →
8th Light Logo

8th Light

Designs, develops, and deploys tech solutions, partnering with clients for digital product transformation.

View company profile →
99x Brazil Logo

99x Brazil

51-200 99x.io

Provides expert teams for digital products, technology, and AI solutions to companies globally.

View company profile →
Get Devs Logo

Get Devs

51-200 getdevs.com

Provides IT staff augmentation services, building dedicated offshore teams of software talent in the Philippines.

View company profile →

Project: Career Search

Rev. 2026.5

[ Remote Jobs ]
Direct Access

We source jobs directly from 21,000+ company career pages. No intermediaries.

01

Discover Hidden Jobs

Unique jobs you won't find on other job boards.

02

Advanced Filters

Filter by category, benefits, seniority, and more.

03

Priority Job Alerts

Get timely alerts for new job openings every day.

04

Manage Your Job Hunt

Save jobs you like and keep a simple list of your applications.

21,000+ SOURCES UPDATED 24/7
Apply