Find your dream job NOW!

Click on Location links to filter by Job Title & Location.
Click on Company links to filter by Company & Location.
For exact match, enclose search terms in "double quotes".

Keywords: Senior Software Development Engineer - SGLang and Inference Stack, Location: Santa Clara, CA

Page: 1

Senior Software Development Engineer – SGLang and Inference Stack

engineer with strong technical and analytical expertise in GPGPU C++, Triton, TileLang or DSL development within Linux... performance goals. Initiate and help with different level codegen optimizations. Contribute to SGLang Development: Support...

Posted Date: 12 Feb 2026

Senior Software Development Engineer - SGLang and Inference Stack

and align kernel-level optimizations with full-stack performance goals. Contribute to SGLang Development: Support... contributions that benefit AMD’s AI software ecosystem. THE PERSON: Skilled engineer with strong technical and analytical...

Posted Date: 20 Dec 2025

Senior Software Development Engineer – LLM Inference Framework

PREFERRED EXPERIENCE: Inference Stack Knowledge Hands-on understanding of vLLM, SGLang, or similar inference stacks... your career. THE ROLE: As a senior member of the LLM inference framework team, you will be responsible for building...

Posted Date: 20 Dec 2025

Senior Software Development Engineer - LLM Kernel & Inference Systems

. This role is deeply focused on LLM inference stacks, including vLLM, SGLang, and internal inference platforms. You will work... GPUs. THE PERSON: You are a senior systems engineer with deep LLM domain knowledge who enjoys working close to the...

Posted Date: 19 Dec 2025

Senior Software Engineer – TensorRT Edge-LLM

-LLM team and help shape the next generation of edge AI for automotive and robotics. We build the software stack... development for critical transformer components such as attention, GEMM, and MoE. Benchmark, profile, and optimize inference...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 14 Feb 2026

Senior GenAI Algorithms Engineer — Post-Training Optimizations

architecture search, and streamlined deployment strategies with open-sourced inference frameworks. Seeking a Senior Deep Learning... co-design. Your work will span multiple layers of the AI software stack—ranging from algorithm design to integration...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 01 Feb 2026