Find your dream job NOW!

Click on Location links to filter by Job Title & Location.
Click on Company links to filter by Company & Location.
For exact match, enclose search terms in "double quotes".

Keywords: Deep Learning Compiler Engineer - CUDA, Location: Shanghai

Page: 1

Deep Learning Compiler Engineer - CUDA

Compiler Architect in our group! The NVIDIA Architecture group is looking for world class architects and engineers... DSL and the core compiler of tile-aware GPU programming model for emerging GPU architectures Continuously innovate...

Company: Nvidia
Location: Shanghai
Posted Date: 15 Jan 2026

Triton AI Compiler Engineer

your career. THE ROLE: A software development engineer on teams building and optimizing Deep Learning applications... advanced compiler technologies to improve deep learning performance. Optimize Deep Learning Pipeline: Enhance the full...

Location: Shanghai
Posted Date: 12 Feb 2026

Software Development Engineer

or strong academic exposure to deep learning systems, understands LLM and multimodal model architectures, and is eager to write... production-quality code that balances functionality, correctness, and performance. KEY RESPONSIBILITIES: Deep Learning & LLM...

Location: Shanghai
Posted Date: 05 Mar 2026

GPU Kernel Development Engineer

advanced compiler technologies to improve deep learning performance. Optimize Deep Learning Pipeline: Enhance the full... & Optimization: Strong experience in designing and optimizing GPU kernels for deep learning on AMD GPUs using HIP, CUDA, and assembly...

Location: Shanghai
Posted Date: 03 Mar 2026

AI Framework Engineer

your career. THE ROLE: As a core member of the team, you will play a pivotal role in optimizing and developing deep learning... frameworks for AMD GPUs. Your experience will be critical in enhancing GPU kernels, deep learning models, and training/inference...

Location: Shanghai
Posted Date: 27 Feb 2026

AI Software Engineer

) systems. Utilize Cutting-Edge Compiler Tech: Leverage advanced compiler technologies to improve deep learning performance... Development & Optimization: Experienced in designing and optimizing GPU kernels for deep learning on AMD GPUs using HIP, CUDA...

Location: Shanghai
Posted Date: 14 Feb 2026

Sr. Software Development Engineer

: Leverage advanced compiler technologies and graph compilers to enhance the full deep learning and inference pipeline... using Triton to develop and optimize deep learning operators. Compiler Knowledge: Understanding or practical experience...

Location: Shanghai
Posted Date: 01 Feb 2026

Software Development Engineer

: Leverage advanced compiler technologies and graph compilers to enhance the full deep learning and inference pipeline... experience using Triton to develop and optimize deep learning operators. Compiler Knowledge: Understanding or practical...

Location: Shanghai
Posted Date: 20 Jan 2026

AI Software Engineer

your career. THE ROLE: As a core member of the team, you will play a pivotal role in optimizing and developing deep learning... frameworks for AMD GPUs. Your experience will be critical in enhancing GPU kernels, deep learning models, and training/inference...

Location: Shanghai
Posted Date: 16 Jan 2026

AI Framework Engineer

) systems. Utilize Cutting-Edge Compiler Tech: Leverage advanced compiler technologies to improve deep learning performance... Development & Optimization: Experienced in designing and optimizing GPU kernels for deep learning on AMD GPUs using HIP, CUDA...

Location: Shanghai
Posted Date: 10 Dec 2025

GPU Kernel Software Engineer

. Develop and optimize GPU kernel based on triton/cuda/hip. Bring up Deep Learning Models on AMD GPU. Collaborate and interact... your career. KEY RESPONSIBILITIES: Develop and optimize deep learning frameworks like VLLM, Megatron, PyTorch, etc. on AMD GPU...

Location: Shanghai
Posted Date: 07 Feb 2026

AI Framework Eng

your career. THE ROLE: As a core member of the team, you will play a pivotal role in optimizing and developing deep learning... frameworks for AMD GPUs. Your experience will be critical in enhancing GPU kernels, deep learning models, and training/inference...

Location: Shanghai
Posted Date: 27 Feb 2026

AI Framework Eng

Compiler Tech: Leverage advanced compiler technologies to improve deep learning performance. Optimize Deep Learning... for deep learning on AMD GPUs using HIP, CUDA, and assembly (ASM). Strong knowledge of AMD architectures (GCN, RDNA) and low...

Location: Shanghai
Posted Date: 10 Dec 2025

AI Framework Eng

your career. THE ROLE: As a core member of the team, you will play a pivotal role in optimizing and developing deep learning... frameworks for AMD GPUs. Your experience will be critical in enhancing GPU kernels, deep learning models, and training/inference...

Location: Shanghai
Posted Date: 10 Dec 2025