Keywords: Research Engineer, Reward Models Training, Location: USA

Senior Machine Learning Engineer, Agentic

Policy Optimization (PPO), and reward modeling to improve agent performance. Launch and support fine-tuned models... with applied AI/ML teams to translate state-of-the-art research in agentic reasoning, planning, and tool use into reliable...

Posted Date: 21 Nov 2025

Artificial Intelligence/Machine Learning Engineer

, so we provide funds for continuing education. We also offer in-house training and ongoing development through our internal GROW... look good! Work Hard, Play Hard - We reward our employees with generous vacation time, to the tune of up to five weeks off...

Posted Date: 31 Oct 2025

Sr. Manager - Automated Solutions

operations and service training and education. This Managerial position regularly engages in business planning and analysis... for People Management processes including but not limited to selection, training, performance, operational results, cost...

Company: BD
Location: Sparks, MD
Posted Date: 16 Jan 2026

Member of Technical Staff (Applied ML)

About this role As a Machine Learning Research Engineer, you'll drive research that teaches models what great feels... or ML research engineering, especially in post-training/fine-tuning large models (SFT, RLHF, DPO). Experience with LLM...

Company: FitNext Co.
Location: San Francisco, CA
Posted Date: 15 Jan 2026
Salary: $20,000 - $35,000 per year