Preference Model
Joined in
2026
Website:
https://www.preferencemodel.com/
RL Environments Engineer - Low-Level Engineering and Kernel Inference Optimization
Preference Model
Responds Quickly
Full Remote
·
Worldwide
·
Product
·
5 years of experience
·
English - C1
We're hiring Low-Level Engineers to design and build RL environments that teach LLMs kernel development, hardware optimization, and systems programming. The goal is to create realistic feedback loops where models learn to write high-performance code across GPU and CPU architectures.
This is a remote contractor role with ≥4 hours overlap to PST and advanced English (C1/C2) required.
Requirements
Minimal Qualifications
Strong Python (engineering-quality, not notebook-only)
Production mindset (debugging,...
More
90 views
·
6 applications
·
12d
RL Environments Engineer
to $20000
Preference Model
Responds Quickly
Full Remote
·
Worldwide
·
Product
·
5 years of experience
·
English - C1
We’re hiring RL Environments Engineers to design and build MLE/SWE environments that deliver high-quality, diverse tasks with minimal supervision. You will target a specific language model, meet a defined difficulty distribution, and deliver about one task every 10 hours. This is a remote contractor role with ≥4 hours overlap to PST and advanced English (C1/C2) required.
Responsibilities
Design and build MLE/SWE environments and diverse tasks.
Target a specified language model and satisfy the required...
More
221 views
·
14 applications
·
12d