qualifications Hands-on experience with policy gradient methods such as PPO Experience with multi-agent task planning algorithms... swarm macro-actions in real time. This role sits at the intersection of machine learning and multi-agent decision making...