Keywords: Senior Software Engineer, TensorRT-LLM, Location: Santa Clara, CA

Senior Software Engineer, TensorRT-LLM

We are now looking for a TensorRT-LLM Software Development Engineer! NVIDIA is hiring software engineers for its... TensorRT-LLM team. Academic and commercial groups around the world are using GPUs to power a revolution in deep learning...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 04 Mar 2026

Senior Software Engineer – TensorRT Edge-LLM

-LLM team and help shape the next generation of edge AI for automotive and robotics. We build the software stack... with popular LLM frameworks and libraries such as TensorRT, TensorRT-LLM, vLLM, SGLang, MLC-LLM, or FlashInfer. A track record...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 14 Feb 2026

Principal Software Engineer – Large-Scale LLM Memory and Storage Systems

, this platform enables efficient, resilient deployment of cutting-edge LLM workloads. We are seeking a Principal Systems Engineer... serving engines (such as vLLM, SGLang, TensorRT-LLM), with a focus on KV-cache offload, reuse, and remote sharing...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 24 Dec 2025

Senior Deep Learning Software Engineer, Inference and Model Optimization

and engineering teams alike developing best-in-class AI models. We are now looking for a Senior Deep Learning Software Engineer... opportunities. Continuously innovate on the inference performance to ensure NVIDIA's inference software solutions (TRT, TRT-LLM...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 27 Feb 2026

Senior System Software Engineer - Dynamo-Triton Inference Server

We are looking for a Senior System Software Engineer to work on . NVIDIA is hiring software engineers for its GPU...-accelerated deep learning software team. Academic and commercial groups around the world are using GPUs to power a revolution...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 20 Feb 2026

Senior Deep Learning Software Engineer

We are looking for a Senior Deep Learning Software Engineer to design and build our automated inference and deployment... opportunities. Continuously innovate on the inference performance to ensure NVIDIA's inference software solutions (TRT, TRT-LLM...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 01 Feb 2026

Senior Deep Learning Software Engineer, Inference and Model Optimization

and engineering teams alike developing best-in-class AI models. We are now looking for a Senior Deep Learning Software Engineer... opportunities. Continuously innovate on the inference performance to ensure NVIDIA's inference software solutions (TRT, TRT-LLM...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 23 Jan 2026

Lead Senior Software Engineer, Agentic AI Applications

to do their best work. Come join the team and see how you can make a lasting impact on the world. NVIDIA is seeking a Senior Software.... Experience developing for GPU platforms and familiarity with NVIDIA technologies (e.g., CUDA, TensorRT, Triton, NeMo) and LLM...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 14 Jan 2026

Senior GenAI Algorithms Engineer — Post-Training Optimizations

's AI software stack, e.g., TensorRT Model Optimizer, NeMo/Megatron, and TensorRT-LLM. Construct and curate large problem specific...—within NVIDIA’s ecosystem (TensorRT Model Optimizer, Megatron-LM, Megatron-Bridge, Nvidia-NeMo, NeMo-AutoModel, TensorRT-LLM...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 01 Feb 2026

Senior Deep Learning Algorithm Engineer

We are now looking for a Senior DL Algorithms Engineer! We are seeking a highly skilled Deep Learning Algorithms... and serving frameworks, such as: TensorRT, TensorRT-LLM, vLLM, SGLang. As NVIDIA makes inroads into the Datacenter business...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 15 Jan 2026

Senior Developer Technology Engineer - Windows AI Platform

. Improve Windows LLM & GenAI user experience on NVIDIA RTX by working on feature and performance enhancements of OSS software... LLM and GenAI software. Experience with CUDA and NVIDIA's Nsight GPU profiling and debugging suite. Some travel...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 24 Jan 2026

Principal Machine Learning Engineer (Prisma AIRS)

from development through runtime. As a Senior Principal Machine Learning Engineer, you will drive research on cutting-edge areas...., vLLM, SGLang, TensorRT-LLM) is required. Open-source contributions in these areas are a significant plus. Experience...

Location: Santa Clara, CA
Posted Date: 22 Feb 2026

Principal Machine Learning Platform Engineer (Prisma AIRS)

with modern LLM inference engines (e.g., vLLM, SGLang, TensorRT-LLM) is required. Open-source contributions in these areas... from development through runtime. As a Principal Machine Learning Inference Engineer, you will serve as a technical authority...

Location: Santa Clara, CA
Posted Date: 30 Jan 2026