Keywords: Systems Reliability Engineer, Location: Santa Clara, CA

Senior Software Engineer, Streaming Performance and Analytics

of gameplay. Just click and play! Visit us at We are looking for a Senior Systems Software engineer to join a team of highly... regressions, and identify root causes and actionable insights. Drive improvements to service reliability, efficiency, and user...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 06 Aug 2025

Senior DGX Cloud AI Infrastructure Software Engineer

and availability of AI systems. As a senior DGX Cloud AI Infrastructure software engineer at NVIDIA, you will have the opportunity... with the necessary resources and scale to foster innovation. We are seeking an AI infrastructure software engineer...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 03 Aug 2025

Senior AI Engineer, Agents and Developer Workflows

and Driverless Cars to cater to their infrastructure and software development workflow needs. As a senior engineer on AI Workflow..., and boost release reliability Experience designing, developing, and deploying AI agents to automate software development...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 30 Jul 2025

Senior Sustaining Hardware Engineer

PCB designs, and signal integrity to deliver state-of-the-art products that drive network performance and reliability... solution, determining if any other systems are susceptible, and implementing measures to prevent the issue from occurring...

Company: Arista Networks
Location: Santa Clara, CA
Posted Date: 18 Oct 2025

AIML - Sr Machine Learning Engineer - Answers, Knowledge & Information

distributed systems engineers. As such, we are looking for candidates with applied machine learning experience and strong software... and implement the low latency and high reliability runtime tech stacks for global search Design and develop data pipeline...

Company: Apple
Location: Santa Clara, CA
Posted Date: 16 Oct 2025

Product Engineer, NPI

-leading flash storage systems to our customers. WHAT YOU'LL DO Serve as the Operations technical lead, providing critical..., and system-level performance (NAND, PCIe, NVMe) to ensure world-class reliability, quality, and cost targets are met. Streamline...

Company: Pure Storage
Location: Santa Clara, CA
Posted Date: 15 Oct 2025

Senior Hardware Design Engineer, Ethernet Switching

PCB designs, and signal integrity to deliver state-of-the-art products that drive network performance and reliability... engineers to bring-up and debug systems Familiarity with signal integrity and power integrity concepts and tools...

Company: Arista Networks
Location: Santa Clara, CA
Posted Date: 12 Oct 2025

Senior Research Engineer - Autonomous Vehicles

. Develop robust monitoring and debugging tools to ensure the reliability and performance of training workflows on large GPU... experience designing and optimizing distributed training systems with frameworks like PyTorch, JAX, or TensorFlow. Deep...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 09 Oct 2025

AI Platform Engineer Intern (Agentic) - Master's Degree

Intern, you will be part of our AI Development Team with a focus on Agentic AI - systems that don’t just answer questions... AI agents, proposing enhancements to improve reliability and effectiveness. Work with data engineers and software developers...

Company: Marvell
Location: Santa Clara, CA
Posted Date: 04 Oct 2025
Salary: $26 - 51 per hour

Software Engineer, AI Infra Innovation

on creating a significant impact on the business. Lead the development of distributed systems at scale, ensuring reliability...-quality, enterprise-grade software. Expertise in systems engineering and a strong background in AI. Proven capability...

Company: Pure Storage
Location: Santa Clara, CA
Posted Date: 03 Oct 2025

Senior Principal DC Optical Engineer - AI Infrastructure

infrastructure build and operations, passive panel and optical interconnects, optical splice enclosures, fiber optic cabling systems.... Perform fiber optic testing and certification to ensure network performance and reliability. Troubleshoot and resolve...

Company: Oracle
Location: Santa Clara, CA
Posted Date: 03 Oct 2025

Senior Principal Optical Engineer - AI Infrastructure

of AI cluster network fabric and GPU compute server systems through a combination of a deep level understanding of optical... teams and develop and execute comprehensive test plans to evaluate link performance and reliability for high availability...

Company: Oracle
Location: Santa Clara, CA
Posted Date: 03 Oct 2025

Mobile Software Development Engineer - Android, Amazon Connect

-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience - 2+ years... Android) along with distributed software applications, tools, systems and services. Translate functional requirements...

Company: Amazon
Location: Santa Clara, CA
Posted Date: 01 Oct 2025

Mercury Pricing POD - Senior Cloud Apps Engineer

, identification of code metrics, system risk analysis, software reliability analysis. ● Develop REST APIs, work on integrations..., AWS Elastic Kubernetes Service, AWS RDS, AWS API Gateway, Rancher Platform for EKS. Familiarity with database systems...

Company: Pure Storage
Location: Santa Clara, CA
Posted Date: 28 Sep 2025

Principal Optical Engineer - AI Infrastructure

of AI cluster network fabric and GPU compute server systems through a combination of a deep level understanding of optical... and execute comprehensive test plans to evaluate link performance and reliability for high availability of AI clusters...

Company: Oracle
Location: Santa Clara, CA
Posted Date: 26 Sep 2025
Salary: $96800 - 223400 per year

Senior Algorithm Engineer, Map-Perception Fusion

Driving Systems: Help develop and refine systems that integrate state-of-the-art perception and mapping technologies for use... automotive industry best practices for safety and reliability. What we need to see: Bachelor’s or Master’s degree in Computer...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 19 Sep 2025

Principal Engineer, Federated Learning

, ease of use, reliability and security. Leverage NVIDIA's cutting-edge hardware and software platforms to enhance... high-performance software systems. 8+ years of architect experience in designing and developing distributed systems. 5...

Company: Nvidia
Location: Santa Clara, CA
Posted Date: 17 Sep 2025

Sr. Software Engineer

Strong experience with distributed systems architecture, broker architecture, and IPC Working knowledge of modern desktop UI frameworks... with Git-based version control systems is required Experience with Agile/Scrum software development processes...

Posted Date: 13 Sep 2025
Salary: $15000 - 20000 per year

Senior Machine Learning Engineer, Customer Engagement Technology

systems. We develop multi-modal, multi-turn, goal-oriented dialog systems that can handle customer issues at Amazon scale... across multiple languages. These systems are designed to adapt to changing company policies and invoke correct APIs to automate...

Company: Amazon
Location: Santa Clara, CA
Posted Date: 07 Sep 2025