Keywords: Senior Software Development Engineer - LLM Kernel & Inference Systems, Location: Santa Clara, CA

Senior Software Development Engineer - LLM Kernel & Inference Systems

GPUs. THE PERSON: You are a senior systems engineer with deep LLM domain knowledge who enjoys working close to the... inference systems (e.g., FasterTransformer), with demonstrated performance tuning. * GPU Kernel Development Proven experience...

Posted Date: 20 Dec 2025

Senior Software Development Engineer – LLM Inference Framework

your career. THE ROLE: As a senior member of the LLM inference framework team, you will be responsible for building.... This role sits at the intersection of inference engines, distributed systems, and GPU runtime and kernel backends. THE PERSON...

Posted Date: 20 Dec 2025

Senior Software Engineer, AI Inference Systems

We are seeking highly skilled and motivated software engineers to join us and build AI inference systems that serve... building and optimizing LLM inference engines (e.g., vLLM, SGLang). Hands-on work with ML compilers and DSLs (e.g., Triton...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 12 Nov 2025

Senior Software Development Engineer - SGLang

and tune large-scale training and inference models for optimal performance on AMD hardware. GPU Kernel Development: Design..., and enabling training and inference at scale across multi-GPU and multi-node systems. You will collaborate across internal GPU...

Posted Date: 20 Dec 2025

Sr Principal Machine Learning Engineer (Prisma AIRS)

security posture from development through runtime. As a Senior Principal Machine Learning Engineer, you will drive research... professional experience in software engineering with a deep focus on MLOps, ML systems, or productionizing machine learning models...

Location: Santa Clara, CA
Posted Date: 17 Jan 2026

Sr Principal Machine Learning Engineer (Prisma AIRS)

security posture from development through runtime. As a Senior Principal Machine Learning Engineer, you will drive research... professional experience in software engineering with a deep focus on MLOps, ML systems, or productionizing machine learning models...

Location: Santa Clara, CA
Posted Date: 16 Jan 2026

Principal Machine Learning Platform Engineer (Prisma AIRS)

security posture from development through runtime. As a Principal Machine Learning Inference Engineer, you will serve... in software engineering with a deep focus on MLOps, ML systems, or productionizing machine learning models at scale. Expert...

Location: Santa Clara, CA
Posted Date: 19 Nov 2025