Keywords: Principal Engineer Inference Stack, Location: Santa Clara, CA

Principal Engineer Inference Stack

and scale-out inference. Develop methods and tooling to utilize dynamic resources in service of inference Support... inference Operational experience with at least one of sglang or vllm, and with kserve, llm-d. Experience running inference...

Posted Date: 04 Feb 2026

Principal Machine Learning Engineer (DLP Detection)

We are looking for a Principal Machine Learning Engineer to lead the design, development, and operation of production-grade machine learning..., TorchServe, Triton Inference Server/TIS). Workflow Orchestration: (Airflow, Kubeflow, MLflow, Ray, Vertex AI, SageMaker...

Location: Santa Clara, CA
Posted Date: 07 Feb 2026

Principal ML/AI Solutions Engineer

your career. THE ROLE: AMD’s Software and Solutions Team is seeking a Principal ML/AI Solutions Engineer to empower customers... and partners in adopting AMD’s AI software stack. This role requires strong technical depth in machine learning frameworks, GPU...

Posted Date: 01 Feb 2026

Principal Software Engineer - Dynamo

enthusiastic about building the next generation of scalable AI systems. As a Principal Software Engineer on the Dynamo project... Platform: Build the Kubernetes deployment and workload management stack for Dynamo to facilitate inference deployments at scale...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 02 Jan 2026

Principal GPU Performance Engineer - Artificial Intelligence

Engineer to optimize AI training and inference workloads and guide the evolution of next-generation AMD Instinct GPU... your career. Principal GPU Performance Engineer - Artificial Intelligence THE ROLE: We are seeking a GPU Performance...

Posted Date: 31 Jan 2026

Principal GPU Performance Engineer - Artificial Intelligence

your career. THE ROLE: We are seeking a GPU Performance Engineer to optimize AI training and inference workloads and guide the... evolution of next-generation AMD Instinct GPU architectures. In this role, you will work across the software and hardware stack...

Posted Date: 22 Nov 2025