We are now looking for a TensorRT-LLM Software Development Engineer! NVIDIA is hiring software engineers for its... TensorRT-LLM team. Academic and commercial groups around the world are using GPUs to power a revolution in deep learning...
-LLM team and help shape the next generation of edge AI for automotive and robotics. We build the software stack... with popular LLM frameworks and libraries such as TensorRT, TensorRT-LLM, vLLM, SGLang, MLC-LLM, or FlashInfer. A track record...
, this platform enables efficient, resilient deployment of cutting-edge LLM workloads. We are seeking a Principal Systems Engineer... serving engines (such as vLLM, SGLang, TensorRT-LLM), with a focus on KV-cache offload, reuse, and remote sharing...
and engineering teams alike developing best-in-class AI models. We are now looking for a Senior Deep Learning Software Engineer... opportunities. Continuously innovate on the inference performance to ensure NVIDIA's inference software solutions (TRT, TRT-LLM...
We are looking for a Senior System Software Engineer to work on . NVIDIA is hiring software engineers for its GPU...-accelerated deep learning software team. Academic and commercial groups around the world are using GPUs to power a revolution...
We are looking for a Senior Deep Learning Software Engineer to design and build our automated inference and deployment... opportunities. Continuously innovate on the inference performance to ensure NVIDIA's inference software solutions (TRT, TRT-LLM...
to do their best work. Come join the team and see how you can make a lasting impact on the world. NVIDIA is seeking a Senior Software.... Experience developing for GPU platforms and familiarity with NVIDIA technologies (e.g., CUDA, TensorRT, Triton, NeMo) and LLM...
's AI software stack, e.g., TensorRT Model Optimizer, NeMo/Megatron, and TensorRT-LLM. Construct and curate large problem specific...—within NVIDIA’s ecosystem (TensorRT Model Optimizer, Megatron-LM, Megatron-Bridge, Nvidia-NeMo, NeMo-AutoModel, TensorRT-LLM...
We are now looking for a Senior DL Algorithms Engineer! We are seeking a highly skilled Deep Learning Algorithms... and serving frameworks, such as: TensorRT, TensorRT-LLM, vLLM, SGLang. As NVIDIA makes inroads into the Datacenter business...
. Improve Windows LLM & GenAI user experience on NVIDIA RTX by working on feature and performance enhancements of OSS software... LLM and GenAI software. Experience with CUDA and NVIDIA's Nsight GPU profiling and debugging suite. Some travel...
from development through runtime. As a Senior Principal Machine Learning Engineer, you will drive research on cutting-edge areas...., vLLM, SGLang, TensorRT-LLM) is required. Open-source contributions in these areas are a significant plus. Experience...
with modern LLM inference engines (e.g., vLLM, SGLang, TensorRT-LLM) is required. Open-source contributions in these areas... from development through runtime. As a Principal Machine Learning Inference Engineer, you will serve as a technical authority...