Keywords: Research Scientist, Science of Post-Training and Reinforcement Learning, Location: London

Research Scientist, Science of Post-Training and Reinforcement Learning

... with RL for sequence models, post-training, preference-based learning, or agentic systems. Experience with modern research ... Snapshot: We are starting a small team aimed at building a real science of post-training for agents. This involves ...

Company: Google DeepMind
Location: London
Posted Date: 05 Mar 2026

Research Scientist Intern - Foundational Research

... academic community. Our focus areas are: LLM Training (Continued Pretraining, Instruction Tuning, Reinforcement Learning ... Interested in training and evaluating large-scale LLMs ( 200B) in a frontier research team focused on AI impact ...

Company: Thomson Reuters
Location: London
Posted Date: 20 Feb 2026