Find your dream job NOW!

Click on Location links to filter by Job Title & Location.
Click on Company links to filter by Company & Location.
For exact match, enclose search terms in "double quotes".

Keywords: AI Training Reliability Engineer, Location: Beijing

Page: 1

AI Training Reliability Engineer

your career. Responsibilities Own reliability governance (standards, runbooks, SLIs/SLOs) and deliver KPI improvements... stop-the-world restarts. Establish fault-injection/chaos and regression gates to prevent reliability regressions (GPU/NIC...

Location: Beijing
Posted Date: 01 Feb 2026

AI Training Reliability Engineer

your career. Responsibilities Own reliability governance (standards, runbooks, SLIs/SLOs) and deliver KPI improvements... stop-the-world restarts. Establish fault-injection/chaos and regression gates to prevent reliability regressions (GPU/NIC...

Location: Beijing
Posted Date: 01 Feb 2026

AI Framework Engineer

frameworks for AMD GPUs. Your experience will be critical in enhancing GPU kernels, deep learning models, and training/inference... principles to drive continuous improvement. THE PERSON: Skilled engineer with strong technical and analytical expertise in C...

Location: Beijing
Posted Date: 31 Jan 2026

Electrical Regional Engineer, Colocation Regional Engineering

Regional Engineer, you will drive Electrical/Mechanical Engineering services, construction and implementation of enterprise.... The Colo Regional Engineer is the engineering representative on behalf of the Data Center Field Engineering Team...

Company: Amazon
Location: Beijing
Posted Date: 28 Jan 2026

Mechnical Colo Regional Engineer, Colocation Regional Engineering

Regional Engineer, you will drive Mechanical/Electrical Engineering services, construction and implementation of enterprise.... The Colo Regional Engineer is the engineering representative on behalf of the Data Center Field Engineering Team...

Company: Amazon
Location: Beijing
Posted Date: 28 Jan 2026

Staff Software Engineer – Infinia L4

record of performance, reliability, and scalability, DDN empowers businesses to tackle the most challenging AI and data... impact in the world of AI and data storage. Job Description As a Staff Software Engineer – Infinia L3, you’ll...

Posted Date: 19 Nov 2025

Mechanical Engineer

development and growing an inclusive culture ensures you have the support to thrive. Whether through mentorship, training... while improving productivity, energy security and reliability. With global operations and a comprehensive portfolio of software...

Location: Beijing
Posted Date: 28 Jan 2026

Supplier Quality & Development Engineer

, we revolutionized high-voltage technology and pioneered many world’s firsts to bring safety, reliability and efficiency to power... and escalate issues proactively. 4. You will design and deliver training programs to enhance SQE technical skills and will foster...

Company: Hitachi
Location: Beijing
Posted Date: 18 Jan 2026

Lead Supplier Quality & Development Engineer

, we revolutionized high-voltage technology and pioneered many world’s firsts to bring safety, reliability and efficiency to power... and escalate issues proactively. 4. You will design and deliver training programs to enhance SQE technical skills and will foster...

Company: Hitachi
Location: Beijing
Posted Date: 17 Jan 2026

Supplier Quality & Development Engineer

, we revolutionized high-voltage technology and pioneered many world’s firsts to bring safety, reliability and efficiency to power... and escalate issues proactively. 4. You will design and deliver training programs to enhance SQE technical skills and will foster...

Company: Hitachi
Location: Beijing
Posted Date: 17 Jan 2026

Supplier Quality & Development Engineer

, we revolutionized high-voltage technology and pioneered many world’s firsts to bring safety, reliability and efficiency to power... and escalate issues proactively. 4. You will design and deliver training programs to enhance SQE technical skills and will foster...

Company: Hitachi
Location: Beijing
Posted Date: 17 Jan 2026

AI Agent Engineer

and agent protocols (e.g., MCP/A2A). Improve training reliability (automation, failover, health checks) to keep jobs running... through cluster faults. Optimize distributed training performance (parallelism, comms/storage/operator tuning) to improve GPU...

Location: Beijing
Posted Date: 16 Jan 2026

Advanced Manufacturing Engineer

for performance, reliability, cost and manufacturability Engaging in all phases of new product introduction and process development... requirements for size, precision, reliability and cost Leading high mechanical accuracy process and equipment design Delivering...

Company: GE HealthCare
Location: Beijing
Posted Date: 13 Nov 2025

Senior SLAM and Deep Learning Engineer, Autonomous Vehicles

driving applications, including pre-training and fine-tuning. Design innovative data generation and collection strategies..., ensuring performance, safety, and reliability standards are met. What we need to see: A MS, or PhD, or equivalent...

Company: Nvidia
Location: Beijing
Posted Date: 13 Nov 2025

HSE manager

. As a global leader, we revolutionized high-voltage technology and pioneered many world’s firsts to bring safety, reliability..., also ensure appropriate HSE equipment and tools are available and used. Identifies, plans and initiates training needs, e.g...

Company: Hitachi
Location: Beijing
Posted Date: 24 Dec 2025