cooling for servers or high-performance computing (HPC) is a major plus. A solid grasp of thermal fundamentals, including...-on technical discussions and hardware troubleshooting...
efficiently. Set up and configure HPC clusters to meet specific requirements and workloads. Manage and maintain HPC hardware... Top Skills Required for this role: 1. HPC - High-performance computing 2. AWS cloud services 3. DevOps CI/CD 4...
hardware-software boundary, our engineers craft high-performance kernels for ML functions, ensuring every FLOP counts... in delivering optimal performance for our customers' demanding workloads. We combine deep hardware knowledge with ML expertise...
& storage: Choose and implement an on-prem hardware and data pipeline design or a cloud/S3 alternative with explicit cost... stacks (local GPU nodes or cloud). Skills 5+ years designing and deploying high-throughput storage or HPC pipelines (≥1...
within Cisco, including marketing, system hardware, software, product engineering, and manufacturing. Through this collaboration.... Open-minded, driven, diverse and deeply creative people at Cisco craft the hardware that makes the internet work. Bring...
-based solutions to get the most out of modern GPU hardware for real production workloads. Take on team-specific projects...-performance computing (HPC), machine learning systems, or computer architecture. Strong programming skills in C++ Preferred...
NVIDIA is the world leader in GPU Computing. We are passionate about markets include gaming, automotive, vision, HPC... working with NVIDIA GPU hardware is a strong plus. Good to have solid understanding of virtualization in Linux (KVM, Docker...
your career. THE ROLE: This highly technical role supports large-scale datacenter graphics hardware and software subsystem... across hardware, virtualization software, and networking stacks, communicating complex concepts clearly, and building...
Top Skills: Data Center & AI Cluster Networking · High-performance interconnects - GPU, HPC, AI clusters...-scale DCs inter-connects and fabric for HPC, AI, and GPU computing clusters. · Develop high-performance data center fabric...
, Grafana, Loki) and incident response frameworks. Familiarity with high-performance computing (HPC) or AI/ML training... infrastructure at scale. Background in reliability engineering, distributed systems, or hardware acceleration environments...
, scalability, and portability across diverse architectures—empowering breakthroughs in AI, HPC, and beyond. If you’re passionate... leadership, and measurable impact on AI and HPC workloads. Qualifications: You must possess the below minimum qualifications...
language for practitioners in AI, data science and HPC, through popular frameworks such as NumPy, SciPy, TensorFlow and PyTorch... computing, data analytics, deep learning, and professional graphics, running on hardware ranging from supercomputers to the...
, and maintenance of hardware and software products. Communicate with customers to understand their technical issues and provide timely... hardware. Good knowledge of networking protocols and technologies. Experience designing and implementing IT infrastructure...
to support deep learning and high-performance computing (HPC) workloads in large-scale data centers. We focus on delivering core... software components for the next generation of AI and HPC platforms, benchmarks, and fine-tuning performance. Our work spans...
Platform Residency engineer must: Understand Kubernetes deeply, Support, troubleshoot, and optimize a Kubernetes-driven HPC..., Ansible Cluster Management Open-source data tools: Kafka Cloud Databases: AWS Databases Linux HPC-related tools Core...
solutions. As a HW/SW co-design engineer, you will collaborate with strong architecture, software, and design teams... prototyping, modeling, and analysis of ML/HPC workloads. Through your prototyping, experiments, and analysis, you will provide...
and lead a new team focused on developer productivity within Inference: making every engineer in the org dramatically... friction of working across heterogeneous hardware Establish and drive productivity metrics across the Inference org, creating...
, cluster bring-up, hardware installation, and troubleshooting across compute, network, and GPU environments. The engineer... Job Title: DATA CENTER OPERATIONS ENGINEER Location: San Jose, CA The Data Center Operations Engineer...