, you will join our GPU Software team to design and implement user-mode driver features that enable high-performance compute workloads... of people around the world. Come build with us! Role and Responsibilities As a Software Driver Engineer...
Compute division focuses on building large-scale and highly available cloud infrastructure, which supports both public... cloud products (like VolcEngine ECS service) and the internal products. Compute-US team focuses on the development...
your career. Principal / Senior GPU Software Performance Engineer — Training at Scale THE ROLE: We train large models.../codegen choices for target architectures Scale beyond one GPU: Optimize P2P and collective comms, overlap compute/comm...
for multi-GPU and multi-platform performance. Experience with AI software framework, such as PyTorch, vLLM, SGLang... engineer who can provide technical leadership in the development of various AI frameworks in the AMD ecosystem. You will play...
: Develop and implement the overall QA strategy and frameworks for testing GPU-based software products, spanning various...-functional requirements Analyze and debug complex failure scenarios in GPU software environment, including root cause analysis...
Job Title: Senior RTL Design Engineer – Floating Point Architecture Location: Remote (Anywhere in USA) Full-time...: Salary + Benefits + Bonuses About the Role We are seeking a Senior RTL Design Engineer with strong expertise in floating...
Job Title: Senior RTL Design Engineer - Floating Point Architecture Location: Remote (Anywhere in USA) Full-time...: Salary + Benefits + Bonuses About the Role We are seeking a Senior RTL Design Engineer with strong expertise in floating...
-based GPU compute platform, which complements our Kubernetes-based orchestration layer for GPU and CPU workloads. The...), and resource management for GPU-accelerated workloads. Strong troubleshooting skills across compute, storage, and network layers...
your career. THE PERSON: We are seeking a DevOps / Platform Engineer to join our team building and operating large-scale GPU... compute infrastructure that powers AI and ML workloads. The ideal candidate should be passionate about software engineering...
-on experience supporting compute, GPUs, and AI services on both GCP and Azure. Hands-on GPU Cluster Management: Take a leadership... track record as a Principal or Senior Staff Engineer. Expert-level knowledge of NVIDIA GPU architecture and technologies...
in AI, GPU, video, and IO domains. Collaborate with planning, software and hardware cross-functional teams to develop... in defining and driving architecture for next-generation Adaptive SoCs, with on Processor subsystems, Interconnect, AI, GPU, video...
hardware/software co-optimization by identifying opportunities where architectural features can unlock significant performance... improvements Characterize and optimize memory hierarchy performance, interconnect utilization, and compute resource efficiency...
by DriveNets software. DriveNets Network Cloud-AI solution, based on the same technology, was introduced to the market in 2023... stability, real-time monitoring, logging, and alerting. Administer Linux systems, ranging from powerful GPU enabled servers...
to manage the entire lifecycle of a large scale RAG pipeline . - Architect, implement, and manage a high-performance compute... to automate the deployment and management of the on-premise hardware and software stack. This ensures consistency...
with ROCm software developers, DC GPU HW/FW/ASIC Teams, Field Engineering Teams, OEM/ODM partners, CSPs, and Marketing/Business... your career. THE ROLE: The Senior Manager, DC GPU Advanced Forward Deployment and Systems Engineering is a leadership position...