Find your dream job NOW!

Click on Location links to filter by Job Title & Location.
Click on Company links to filter by Company & Location.
For exact match, enclose search terms in "double quotes".

Keywords: Senior Software Development Engineer - LLM Kernel , Location: Santa Clara, CA

Page: 1

Senior Software Development Engineer - LLM Kernel & Inference Systems

GPUs. THE PERSON: You are a senior systems engineer with deep LLM domain knowledge who enjoys working close to the..., and PyTorch for AMD GPUs, contributing both internally and upstream. * LLM-Aware Kernel Development Design and optimize GPU...

Posted Date: 19 Dec 2025

Senior Software Development Engineer – LLM Inference Framework

, or similar GPU architectures and kernel development Software Engineering Expertise in Python and preferably experience in C/C... your career. THE ROLE: As a senior member of the LLM inference framework team, you will be responsible for building...

Posted Date: 19 Dec 2025

Senior Software Engineer – TensorRT Edge-LLM

-LLM team and help shape the next generation of edge AI for automotive and robotics. We build the software stack... Science, Electrical/Computer Engineering, or a closely related field. 4+ years of relevant software development experience...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 14 Feb 2026

Senior Software Development Engineer – SGLang and Inference Stack

engineer with strong technical and analytical expertise in GPGPU C++, Triton, TileLang or DSL development within Linux... K2.5, etc. GPU Kernel Development: Design, implement, and optimize high-performance GPU kernels using HIP, Triton...

Posted Date: 12 Feb 2026

Senior Software Development Engineer - SGLang and Inference Stack

contributions that benefit AMD’s AI software ecosystem. THE PERSON: Skilled engineer with strong technical and analytical... and tune large-scale training and inference models for optimal performance on AMD hardware. GPU Kernel Development: Design...

Posted Date: 20 Dec 2025

Principal Software Engineer - AI Inference

NVIDIA is the platform for every new AI-powered application. We seek a Principal Software Engineer - AI Inference... to advance open-source LLM serving. This role involves contributing to upstream inference engines like vLLM and SGLang...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 24 Feb 2026

Senior GenAI Algorithms Engineer — Post-Training Optimizations

, including custom kernel development with CUDA and Triton. This role offers a unique opportunity to work at the intersection...'s AI software stack, e.g., TensorRT Model Optimizer, NeMo/Megatron, and TensorRT-LLM. Construct and curate large problem specific...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 01 Feb 2026

Senior Deep Learning Algorithm Engineer

We are now looking for a Senior DL Algorithms Engineer! We are seeking a highly skilled Deep Learning Algorithms... as establishing a data-driven approach to hardware design and system software development. We collaborate with a broad cross section...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 15 Jan 2026

Senior Compiler Engineer - AI

doing technology development on problems of kernel generation and optimizations for computational graphs for next generation... work or research experience in kernel generation, mega kernels, compiler optimizations, synthesis, LLM inference...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 26 Feb 2026

Senior AI Inference Compiler Engineer

. Today, we are increasingly known as “the AI computing company”. We are looking for an AI & Deep Learning Compiler Engineer. NVIDIA is hiring... software engineers for its Deep Learning & AI Compiler (DLC) team. Academic and commercial groups around the world are using...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 26 Feb 2026

Senior Deep Learning Framework Communications Engineer

engineer to bring advanced communication technologies into AI stacks, including PyTorch, TRT-LLM, vLLM, SGLang, JAX..., and Inference Engines such as TRT-LLM, vLLM, SGLang Rapid prototyping and development with Python, C++, CUDA or related DSLs...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 24 Jan 2026

Principal Machine Learning Engineer (Prisma AIRS)

from development through runtime. As a Senior Principal Machine Learning Engineer, you will drive research on cutting-edge areas... with low-level performance optimization, such as custom CUDA kernel development or using Triton Language, is a plus. Experience...

Location: Santa Clara, CA
Posted Date: 22 Feb 2026

Principal Machine Learning Platform Engineer (Prisma AIRS)

from development through runtime. As a Principal Machine Learning Inference Engineer, you will serve as a technical authority... are a significant plus. Experience with low-level performance optimization, such as custom CUDA kernel development or using Triton...

Location: Santa Clara, CA
Posted Date: 29 Jan 2026