Find your dream job NOW!

Click on Location links to filter by Job Title & Location.
Click on Company links to filter by Company & Location.
For exact match, enclose search terms in "double quotes".

Keywords: Deep Learning Compiler Engineer - CUDA, Location: Shanghai

Page: 1

Deep Learning Compiler Engineer - CUDA

Compiler Architect in our group! The NVIDIA Architecture group is looking for world class architects and engineers... DSL and the core compiler of tile-aware GPU programming model for emerging GPU architectures Continuously innovate...

Company: Nvidia
Location: Shanghai
Posted Date: 15 Jan 2026

AI Software Engineer

your career. THE ROLE: As a core member of the team, you will play a pivotal role in optimizing and developing deep learning... frameworks for AMD GPUs. Your experience will be critical in enhancing GPU kernels, deep learning models, and training/inference...

Location: Shanghai
Posted Date: 16 Jan 2026

Software Development Engineer

) systems. Utilize Cutting-Edge Compiler Tech: Leverage advanced compiler technologies to improve deep learning performance... Development & Optimization: Experienced in designing and optimizing GPU kernels for deep learning on AMD GPUs using HIP, CUDA...

Location: Shanghai
Posted Date: 05 Jan 2026

AI Framework Engineer

: Leverage advanced compiler technologies and graph compilers to enhance the full deep learning and inference pipeline... using Triton to develop and optimize deep learning operators. Compiler Knowledge: Understanding or practical experience...

Location: Shanghai
Posted Date: 11 Dec 2025

GPU Kernel Software Engineer

your career. THE ROLE: As a core member of the team, you will play a pivotal role in optimizing and developing deep learning... frameworks for AMD GPUs. Your experience will be critical in enhancing GPU kernels, deep learning models, and training/inference...

Location: Shanghai
Posted Date: 10 Dec 2025

AI Framework Engineer

) systems. Utilize Cutting-Edge Compiler Tech: Leverage advanced compiler technologies to improve deep learning performance... Development & Optimization: Experienced in designing and optimizing GPU kernels for deep learning on AMD GPUs using HIP, CUDA...

Location: Shanghai
Posted Date: 10 Dec 2025

GPU Kernel Software Development Engineer

Compiler Tech: Leverage advanced compiler technologies to improve deep learning performance. Optimize Deep Learning... for deep learning on AMD GPUs using HIP, CUDA, and assembly (ASM). Strong knowledge of AMD architectures (GCN, RDNA) and low...

Location: Shanghai
Posted Date: 15 Nov 2025

Senior System Software Engineer - AI Performance and Efficiency Tools

++ and Python), analytical, and debugging Good understanding of Deep Learning frameworks like PyTorch and TensorFlow, distributed... with NVIDIA GPUs, CUDA Programming and NCCL Motivated self-starter with strong problem-solving skills and customer-facing...

Company: Nvidia
Location: Shanghai
Posted Date: 12 Dec 2025

AI Framework Eng

Compiler Tech: Leverage advanced compiler technologies to improve deep learning performance. Optimize Deep Learning... for deep learning on AMD GPUs using HIP, CUDA, and assembly (ASM). Strong knowledge of AMD architectures (GCN, RDNA) and low...

Location: Shanghai
Posted Date: 10 Dec 2025

AI Framework Eng

Compiler Tech: Leverage advanced compiler technologies to improve deep learning performance. Optimize Deep Learning... for deep learning on AMD GPUs using HIP, CUDA, and assembly (ASM). Strong knowledge of AMD architectures (GCN, RDNA) and low...

Location: Shanghai
Posted Date: 10 Dec 2025

AI Framework Eng

: Leverage advanced compiler technologies and graph compilers to enhance the full deep learning and inference pipeline... using Triton to develop and optimize deep learning operators. Compiler Knowledge: Understanding or practical experience...

Location: Shanghai
Posted Date: 10 Dec 2025

AI Framework Eng

your career. THE ROLE: As a core member of the team, you will play a pivotal role in optimizing and developing deep learning... frameworks for AMD GPUs. Your experience will be critical in enhancing GPU kernels, deep learning models, and training/inference...

Location: Shanghai
Posted Date: 10 Dec 2025