networking and hardware health systems to deliver end-to-end reliability across servers, switches, and data center infrastructure...About the Team The Frontier Systems team at OpenAI builds, launches, and supports the largest supercomputers in the...
. If you want to build and operate infrastructure for frontier AI workloads, automate systems at petascale, and be part... infrastructure at scale. Background in reliability engineering, distributed systems, or hardware acceleration environments...
. About the Role As a Site Reliability Engineer (SRE) at Mercor, you'll own production reliability across our most critical... systems, partnering directly with infrastructure leadership. You'll play a foundational role in building our SRE function...