-compatible evaluation servers Implementing logic to check agent actions against scenario definitions Creating or extending... have: Experience with Model Context Protocol (MCP) or similar structured agent-server interfaces Knowledge of FastAPI or similar async...