and scale-out inference. Develop methods and tooling to utilize dynamic resources in service of inference Support... inference Operational experience with atleast one of sglang, or vllm and with kserve, llm-d. Experience running inference...
We are looking for a Principal Machine Learning Engineer to lead the design, development, and operation of production-grade machine learning..., TorchServe, Triton Inference Server/TIS). Workflow Orchestration: (Airflow, Kubeflow, MLflow, Ray, Vertex AI, SageMaker...
your career. THE ROLE: AMD’s Software and Solutions Team is seeking a Principal ML/AI Solutions Engineer to empower customers... and partners in adopting AMD’s AI software stack. This role requires strong technical depth in machine learning frameworks, GPU...
enthusiastic about building the next generation of scalable AI systems. As a Principal Software Engineer on the Dynamo project... Platform: Build the Kubernetes deployment and workload management stack for Dynamo to facilitate inference deployments at scale...
Engineer to optimize AI training and inference workloads and guide the evolution of next-generation AMD Instinct GPU... your career. Principle GPU Performance Engineer - Artificial Intelligence THE ROLE: We are seeking a GPU Performance...
your career. THE ROLE: We are seeking a GPU Performance Engineer to optimize AI training and inference workloads and guide the... evolution of next-generation AMD Instinct GPU architectures. In this role, you will work across the software and hardware stack...