Job Search Results

Senior Software Development Engineer - LLM Kernel & Inference Systems

GPUs. THE PERSON: You are a senior systems engineer with deep LLM domain knowledge who enjoys working close to the..., and PyTorch for AMD GPUs, contributing both internally and upstream. * LLM-Aware Kernel Development Design and optimize GPU...

Company: Advanced Micro Devices

Location: Santa Clara, CA

Posted Date: 19 Dec 2025

Senior Software Development Engineer – LLM Inference Framework

...or similar GPU architectures and kernel development Software Engineering Expertise in Python and preferably experience in C/C... your career. THE ROLE: As a senior member of the LLM inference framework team, you will be responsible for building...

Company: Advanced Micro Devices

Location: Santa Clara, CA

Posted Date: 19 Dec 2025

Senior Software Engineer – TensorRT Edge-LLM

-LLM team and help shape the next generation of edge AI for automotive and robotics. We build the software stack... Science, Electrical/Computer Engineering, or a closely related field. 4+ years of relevant software development experience...

Company: Nvidia

Location: Santa Clara, CA

Posted Date: 14 Feb 2026

Senior Software Development Engineer – SGLang and Inference Stack

engineer with strong technical and analytical expertise in GPGPU C++, Triton, TileLang or DSL development within Linux... K2.5, etc. GPU Kernel Development: Design, implement, and optimize high-performance GPU kernels using HIP, Triton...

Company: Advanced Micro Devices

Location: Santa Clara, CA

Posted Date: 12 Feb 2026

Senior Software Development Engineer - SGLang and Inference Stack

contributions that benefit AMD’s AI software ecosystem. THE PERSON: Skilled engineer with strong technical and analytical... and tune large-scale training and inference models for optimal performance on AMD hardware. GPU Kernel Development: Design...

Company: Advanced Micro Devices

Location: Santa Clara, CA

Posted Date: 20 Dec 2025

Principal Software Engineer - AI Inference

NVIDIA is the platform for every new AI-powered application. We seek a Principal Software Engineer - AI Inference... to advance open-source LLM serving. This role involves contributing to upstream inference engines like vLLM and SGLang...

Company: Nvidia

Location: Santa Clara, CA

Posted Date: 24 Feb 2026

Senior GenAI Algorithms Engineer — Post-Training Optimizations

...including custom kernel development with CUDA and Triton. This role offers a unique opportunity to work at the intersection...'s AI software stack, e.g., TensorRT Model Optimizer, NeMo/Megatron, and TensorRT-LLM. Construct and curate large problem specific...

Company: Nvidia

Location: Santa Clara, CA

Posted Date: 01 Feb 2026

Senior Deep Learning Algorithm Engineer

We are now looking for a Senior DL Algorithms Engineer! We are seeking a highly skilled Deep Learning Algorithms... as establishing a data-driven approach to hardware design and system software development. We collaborate with a broad cross section...

Company: Nvidia

Location: Santa Clara, CA

Posted Date: 15 Jan 2026

Senior Compiler Engineer - AI

doing technology development on problems of kernel generation and optimizations for computational graphs for next generation... work or research experience in kernel generation, mega kernels, compiler optimizations, synthesis, LLM inference...

Company: Nvidia

Location: Santa Clara, CA

Posted Date: 26 Feb 2026

Senior AI Inference Compiler Engineer

...Today, we are increasingly known as “the AI computing company”. We are looking for an AI & Deep Learning Compiler Engineer. NVIDIA is hiring... software engineers for its Deep Learning & AI Compiler (DLC) team. Academic and commercial groups around the world are using...

Company: Nvidia

Location: Santa Clara, CA

Posted Date: 26 Feb 2026

Senior Deep Learning Framework Communications Engineer

engineer to bring advanced communication technologies into AI stacks, including PyTorch, TRT-LLM, vLLM, SGLang, JAX..., and Inference Engines such as TRT-LLM, vLLM, SGLang Rapid prototyping and development with Python, C++, CUDA or related DSLs...

Company: Nvidia

Location: Santa Clara, CA

Posted Date: 24 Jan 2026

Principal Machine Learning Engineer (Prisma AIRS)

from development through runtime. As a Senior Principal Machine Learning Engineer, you will drive research on cutting-edge areas... with low-level performance optimization, such as custom CUDA kernel development or using Triton Language, is a plus. Experience...

Company: Palo Alto Networks

Location: Santa Clara, CA

Posted Date: 22 Feb 2026

Principal Machine Learning Platform Engineer (Prisma AIRS)

from development through runtime. As a Principal Machine Learning Inference Engineer, you will serve as a technical authority... are a significant plus. Experience with low-level performance optimization, such as custom CUDA kernel development or using Triton...

Company: Palo Alto Networks

Location: Santa Clara, CA

Posted Date: 29 Jan 2026

Keywords: Senior Software Development Engineer - LLM Kernel, Location: Santa Clara, CA