Find your dream job NOW!

Click on Location links to filter by Job Title & Location.
Click on Company links to filter by Company & Location.
For exact match, enclose search terms in "double quotes".

Keywords: Senior Software Development Engineer – LLM Inference Framework, Location: Santa Clara, CA

Page: 1

Senior Software Development Engineer – LLM Inference Framework

your career. THE ROLE: As a senior member of the LLM inference framework team, you will be responsible for building...) and will be upstreamed into open-source inference frameworks such as vLLM and SGLang to make AMD a first-class platform for LLM serving...

Posted Date: 20 Dec 2025

Senior Software Development Engineer - LLM Kernel & Inference Systems

GPUs. THE PERSON: You are a senior systems engineer with deep LLM domain knowledge who enjoys working close to the... with model and framework teams to align LLM architectures with hardware-aware optimizations, improving real-world inference...

Posted Date: 20 Dec 2025

Senior Software Development Engineer – SGLang and Inference Stack

optimization, feature development, and scaling of the SGLang framework across AMD GPU platforms for LLM, multimodal serving and RL..., and enabling RL training and SOTA LLM and Multimodal inference at scale across multi-GPU and multi-node systems...

Posted Date: 11 Feb 2026

Senior Software Development Engineer - SGLang and Inference Stack

optimization, feature development, and scaling of the SGLang LLM framework across AMD GPU platforms. Distributed System... contributions that benefit AMD’s AI software ecosystem. THE PERSON: Skilled engineer with strong technical and analytical...

Posted Date: 20 Dec 2025

Senior AI Inference Compiler Engineer

architectures. Collaborating with members of the deep learning software framework teams and the hardware architecture teams... of deep learning models, algorithms and frameworks, such as PyTorch, XLA etc. Understanding of LLM inference optimizations...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 26 Feb 2026

Senior Software Engineer – TensorRT Edge-LLM

-LLM team and help shape the next generation of edge AI for automotive and robotics. We build the software stack...-art inference framework in modern C++ that extends TensorRT with autoregressive model serving capabilities, including...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 14 Feb 2026

Senior Deep Learning Framework Communications Engineer

, and Inference Engines such as TRT-LLM, vLLM, SGLang Rapid prototyping and development with Python, C++, CUDA or related DSLs... engineer to bring advanced communication technologies into AI stacks, including PyTorch, TRT-LLM, vLLM, SGLang, JAX...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 24 Jan 2026

Senior Software Test Development Engineer - Deep Learning

We are looking for a Software Test development engineer in NVIDIA’s Deep Learning SWQA team. The position is in NVIDIA... experience. Good C/C++ software development or test development experience. Good user/development experiences...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 30 Jan 2026

Senior Deep Learning Algorithm Engineer

We are now looking for a Senior DL Algorithms Engineer! We are seeking a highly skilled Deep Learning Algorithms... inference across diverse GPU platforms. You will collaborate with research scientists, software engineers, and hardware...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 15 Jan 2026

Senior Compiler Engineer - AI

work or research experience in kernel generation, mega kernels, compiler optimizations, synthesis, LLM inference.... Today, we are increasingly known as “the AI computing company”. NVIDIA is hiring world class Software Engineers with AI Compiler experience...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 26 Feb 2026