Job Search Results

AI Inference Engineer

Job Description: WHAT YOU DO AT AMD CHANGES EVERYTHING At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a cul...

Company: Advanced Micro Devices

Location: Beijing

Posted Date: 15 Dec 2025

AI Software System Engineer (Kernel/Runtime)

your career. THE ROLE: As an AICE Software System Design Engineer, you will be responsible for the development, debugging..., distributions, compilers, performance optimizations for inference or training, along with strong programming skills in C...

Company: Advanced Micro Devices

Location: Beijing

Posted Date: 06 Mar 2026

Master Principal Cloud Engineer – GPU & AI Infrastructure

Job Category: Pre Sales Job Description: Position Overview As a GPU Specialist Cloud Engineer (CE) within the.... Optimization: Advise customers on right-sizing GPU shapes based on workload requirements (e.g., training vs. inference, FP8 vs...

Company: Oracle

Location: Beijing

Posted Date: 05 Mar 2026

Devtech Compute Engineer

and inference on GPU. You’ll join a team of ML, HPC and Software Engineers and Applied Researchers developing a framework designed...: In your role as Devtech Compute Engineer or CUDA Performance Engineer you will be primarily responsible for the development of performance...

Company: Nvidia

Location: Beijing

Posted Date: 01 Mar 2026

Software Engineer for SPICE (AI)

) and their integration into Cadence’s EDA ecosystem. The engineer will architect intelligent systems that enhance productivity, automate... scalable systems for model training, inference, and integration with existing tools. Collaborate with simulation and analysis...

Company: Cadence Design Systems

Location: Beijing

Posted Date: 07 Feb 2026

Senior Research Engineer - Multimodal & Video Foundation Model

for multimodal language models, integrating text, visual, and audio modalities. Engineer scalable training and inference pipelines... experience working with the full development pipeline from data processing & data loading to training, inference...

Company: Tether Operations

Location: Beijing

Posted Date: 27 Jan 2026

Software Engineer 2 - Processing Unit for Copilot

operations (e.g., FlashAttention, GEMM, LayerNorm) to outperform standard libraries. Inference Engine Architecture: Contribute... to the development of our high-performance inference engine, focusing on graph optimizations, operator fusion, and dynamic...

Company: Microsoft

Location: Beijing

Posted Date: 13 Mar 2026

Senior Software Engineer - Processing Unit for Copilot

operations (e.g., FlashAttention, GEMM, LayerNorm) to outperform standard libraries. Inference Engine Architecture: Contribute... to the development of our high-performance inference engine, focusing on graph optimizations, operator fusion, and dynamic...

Company: Microsoft

Location: Beijing

Posted Date: 13 Mar 2026

Senior Software Engineer - MAI Platform

, and multi-agent orchestration at scale. LLM Serving Infrastructure: Optimize inference stacks including request scheduling, KV...) with hands-on implementation experience. LLM infrastructure foundation: Understanding of Transformer inference mechanics...

Company: Microsoft

Location: Beijing

Posted Date: 13 Mar 2026

Software Engineer 2 - MAI Platform

, and multi-agent orchestration at scale. LLM Serving Infrastructure: Optimize inference stacks including request scheduling, KV...) with hands-on implementation experience. LLM infrastructure foundation: Understanding of Transformer inference mechanics...

Company: Microsoft

Location: Beijing

Posted Date: 12 Mar 2026

Senior Machine Learning Engineer - AI Effects and Editing

robust pipelines for LoRA-based model training, post-training quantization, and inference optimisation. Develop... with LoRA training, model post-processing (quantization, pruning), and on-device inference optimisation. Familiarity with image...

Company: Canva

Location: Beijing

Posted Date: 11 Mar 2026

Senior Software Engineer

solutions encompassing backend AI service APIs, model inference optimization, and frontend interfaces to showcase new... AI models (e.g., diffusion models for image/video, GANs, autoregressive models). Building and optimizing inference pipelines...

Company: Microsoft

Location: Beijing

Posted Date: 11 Mar 2026

Senior Software Engineer

-solving skills in LLM inference optimization, token efficiency, and response tuning. Experience with AI frameworks...

Company: Microsoft

Location: Beijing

Posted Date: 06 Mar 2026

Senior Software Engineer

, or Triton. Optimize model inference and training pipelines for speed, throughput, memory efficiency, and cost across GPU... and architecture design. Familiar with inference optimization, experience in developing popular inference framework such as TensorRT...

Company: Microsoft

Location: Beijing

Posted Date: 05 Mar 2026

AI Product Performance Engineer

Integration: Collaborate with software stack teams to expose optimized kernels within high-level frameworks and inference engines... using OpenAI Triton or other Python-based DSLs for agile kernel development and auto-tuning. Inference Engine Experience...

Company: Advanced Micro Devices

Location: Beijing

Posted Date: 03 Mar 2026

AI Product Performance Engineer

Integration: Collaborate with software stack teams to expose optimized kernels within high-level frameworks and inference engines... using OpenAI Triton or other Python-based DSLs for agile kernel development and auto-tuning. Inference Engine Experience...

Company: Advanced Micro Devices

Location: Beijing

Posted Date: 02 Mar 2026

Generative AI Algorithms Engineer

of multimodal inference and training, such as image generation, 3D, video generation, editing, ViT and other models. Efficient... inference algorithms research and advanced quantization, e.g. batching, KV caching, efficient attentions, long context...

Company: Qualcomm

Location: Beijing

Posted Date: 02 Mar 2026

Senior Developer Technology Engineer

on maximizing training and inference speed while enabling effortless scalability. What You’ll Be Doing: Profile, analyze..., and optimize GPU‑accelerated code to improve training and inference performance for large‑scale recommender systems. Design...

Company: Nvidia

Location: Beijing

Posted Date: 01 Mar 2026

Developer Technology Engineer - AI

, through both library development and direct contribution to the applications. This includes training and inference... of software design, programming techniques, and algorithms. Expert knowledge of LLM training/inference optimization, including...

Company: Nvidia

Location: Beijing

Posted Date: 01 Mar 2026

AI Video Research Engineer Intern

, supervised fine-tuning, post-training, inference, architecture design, or evaluation. Benchmark models against current state...

Company: Tether Operations

Location: Beijing

Posted Date: 19 Feb 2026

Keywords: AI Inference Engineer, Location: Beijing
