(RDMA, InfiniBand, RoCE, UCX). Strong understanding of GPU architectures and performance trade-offs on AI workloads... your career. THE ROLE: As an AI Performance Engineers you will focus on pushing machine learning workloads to peak hardware...
reliability, scalability, and cost/performance trade-offs end-to-end. The ideal candidate combines strong ML depth with MLOps.... Strong experience with model performance and efficiency optimization: latency/throughput tuning, batching, quantization, ONNX/TensorRT...
, caching, and performance Comfortable integrating SDKs (payments, crypto, AI models, etc.) Write reliable code: unit tests... and OpenAI-compatible endpoints Previous experience building AI SaaS or PaaS platforms Knowledge of GPU resource management...