Keywords: Senior Software Engineer, TensorRT-LLM, Location: Santa Clara, CA

Senior Software Engineer, TensorRT-LLM

We are now looking for a TensorRT-LLM Software Development Engineer! NVIDIA is hiring software engineers for its... TensorRT-LLM team. Academic and commercial groups around the world are using GPUs to power a revolution in deep learning...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 13 Dec 2025

Senior Software Development Engineer, TensorRT-LLM

We are now looking for a TensorRT-LLM Software Development Engineer! NVIDIA is hiring software engineers for its... TensorRT-LLM team. Academic and commercial groups around the world are using GPUs to power a revolution in deep learning...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 02 Nov 2025

Senior Systems Software Engineer, TAO Machine Learning Data Modeling

NVIDIA is hiring a Senior Systems Software Engineer for machine learning data modeling to join the TAO Toolkit ML Data... and familiar with deep learning architectures and tools like NVIDIA TensorRT-LLM, Multimodal-LLM, and Triton Server. NVIDIA...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 11 Oct 2025

Senior GenAI Algorithms Engineer — Model Optimizations for Inference

design to integration—within NVIDIA’s ecosystem (TensorRT Model Optimizer, NeMo/Megatron, TensorRT-LLM) and open-source... stack, e.g., TensorRT Model Optimizer, NeMo/Megatron, and TensorRT-LLM. Deploy optimized models into leading OSS inference...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 23 Sep 2025

Senior GenAI Algorithms Engineer — Post-Training Optimizations

...'s AI software stack, e.g., TensorRT Model Optimizer, NeMo/Megatron, and TensorRT-LLM. Construct and curate large problem specific...—within NVIDIA’s ecosystem (TensorRT Model Optimizer, Megatron-LM, Megatron-Bridge, Nvidia-NeMo, NeMo-AutoModel, TensorRT-LLM...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 18 Sep 2025

Sr Principal Machine Learning Engineer (Prisma AIRS)

security posture from development through runtime. As a Senior Principal Machine Learning Engineer, you will drive research... expertise with modern LLM inference engines (e.g., vLLM, SGLang, TensorRT-LLM) is required. Open-source contributions in these...

Location: Santa Clara, CA
Posted Date: 12 Dec 2025

Principal Machine Learning Platform Engineer (Prisma AIRS)

...) is a significant plus. Demonstrated expertise with modern LLM inference engines (e.g., vLLM, SGLang, TensorRT-LLM) is required. Open... security posture from development through runtime. As a Principal Machine Learning Inference Engineer, you will serve...

Location: Santa Clara, CA
Posted Date: 25 Nov 2025

Principal Machine Learning Platform Engineer (Prisma AIRS)

...) is a significant plus. Demonstrated expertise with modern LLM inference engines (e.g., vLLM, SGLang, TensorRT-LLM) is required. Open... security posture from development through runtime. As a Principal Machine Learning Inference Engineer, you will serve...

Location: Santa Clara, CA
Posted Date: 19 Nov 2025

Principal Machine Learning Platform Engineer (Prisma AIRS)

...) is a significant plus. Demonstrated expertise with modern LLM inference engines (e.g., vLLM, SGLang, TensorRT-LLM) is required. Open... security posture from development through runtime. As a Principal Machine Learning Inference Engineer, you will serve...

Location: Santa Clara, CA
Posted Date: 18 Nov 2025