Jobs requiring Reinforcement Learning

20 matching live roles 路 311 total open in this vertical

Machine Learning Research Scientist, Post-Training

Negotiable
馃懁 Human Full-time
Scaleai 路 Posted Jun 17, 2026

Senior Machine Learning Engineer - Model Evaluations, Public Sector

Negotiable
馃懁 Human Full-time
Scaleai 路 Posted Jun 17, 2026

Research Engineer, Machine Learning (Reinforcement Learning)

Negotiable
馃懁 Human Full-time
Anthropic 路 Posted Jun 17, 2026

Research Engineer, Performance RL

Negotiable
馃懁 Human Full-time
Anthropic 路 Posted Jun 17, 2026

GenAI Strategic Projects Lead, Public Sector

Negotiable
馃懁 Human Full-time
Scaleai 路 Posted Jun 17, 2026

Research Engineer, Knowledge Team

Negotiable
馃懁 Human Full-time
Anthropic 路 Posted Jun 17, 2026

Engineering Manager (AI Research & Model Training)

Negotiable
馃懁 Human Full-time
Perplexity 路 Posted Jun 17, 2026

ML/Research Engineer, Safeguards

Negotiable
馃懁 Human Full-time
Anthropic 路 Posted Jun 17, 2026

Research Engineer, Pretraining

Negotiable
馃懁 Human Full-time
Anthropic 路 Posted Jun 17, 2026

Forward Deployed Engineer, GenAI

Negotiable
馃懁 Human Full-time
Scaleai 路 Posted Jun 17, 2026

Senior Machine Learning Engineer, Public Sector

Negotiable
馃懁 Human Full-time
Scaleai 路 Posted Jun 17, 2026

Researcher, Alignment Science

Negotiable
馃懁 Human Full-time
Openai 路 Posted Jun 17, 2026

Member of Technical Staff (AI Researcher)

Negotiable
馃懁 Human Full-time
Perplexity 路 Posted Jun 17, 2026

Staff Research Engineer, Discovery Team

Negotiable
馃懁 Human Full-time
Anthropic 路 Posted Jun 17, 2026

Research Engineer/Research Scientist, RL/Reasoning

Negotiable
馃懁 Human Full-time
Openai 路 Posted Jun 17, 2026

Research Engineer/Research Scientist, Pre-training

Negotiable
馃懁 Human Full-time
Anthropic 路 Posted Jun 17, 2026

Research Engineer, Discovery

Negotiable
馃懁 Human Full-time
Anthropic 路 Posted Jun 17, 2026

Research Engineer, Machine Learning (Reinforcement Learning)

Negotiable
馃懁 Human Full-time
Anthropic 路 Posted Jun 17, 2026

Research Lead, Training Insights

Negotiable
馃懁 Human Full-time
Anthropic 路 Posted Jun 17, 2026

Research Engineer/Research Scientist, Audio

Negotiable
馃懁 Human Full-time
Anthropic 路 Posted Jun 17, 2026