your career. Principal GPU Performance Engineer - Artificial Intelligence THE ROLE: We are seeking a GPU Performance... Engineer to optimize AI training and inference workloads and guide the evolution of next-generation AMD Instinct GPU...
Engineer to own and revolutionize our engineering velocity and developer experience. This is a high-impact, cross-functional... & Security Performance Optimization: Drive deep technical optimizations at the compiler and architectural levels to improve...
engineer to bring advanced communication technologies into AI stacks, including PyTorch, TRT-LLM, vLLM, SGLang, JAX... (Triton, cuTe) Solid grasp of AI models, parallelisms, and/or compiler technologies (e.g. torch.compile) Experience...
Nvidia is hiring a Senior SOC/IP Methodology Engineer to help design and architect next generation custom SoC/IP... vendors such as Cadence, Synopsys, Mentor (CDC, LP Checks, Genus, First Encounter, Innovus, Design Compiler, Fusion Compiler...
High Performance AI Engineer to build groundbreaking multi-agent systems for the CUDA ecosystem. We build innovative... agentic runtimes and compiler-integrated orchestration that work together with NVIDIA's software stack to provide...
at the intersection of model architecture, GPU kernels, compiler technology, and distributed systems, collaborating closely... GPUs. THE PERSON: You are a senior systems engineer with deep LLM domain knowledge who enjoys working close to the...
software teams and engage with open-source communities to integrate cutting-edge compiler technologies and drive upstream... contributions that benefit AMD’s AI software ecosystem. THE PERSON: Skilled engineer with strong technical and analytical...
: You are a systems-minded ML engineer who thinks in terms of throughput, latency, memory movement, and scheduling, not just model code... with kernel, compiler, and networking teams to close end-to-end performance gaps. You enjoy working in open source and driving...
We are now looking for a Senior Deep Learning Software Engineer, PyTorch-TensorRT Performance! NVIDIA is seeking... an experienced Deep Learning Engineer passionate about analyzing and improving the performance of Torch inference with TensorRT...
designs. As a software engineer, you will craft highly efficient software to automate and facilitate chip design...: Good architecture and RTL design knowledge Strong expertise in modern C++, compiler, build systems, and database...
AI companies in the world? THE ROLE: As a Forward Deployment Software Engineer, you will work closely with our most strategic... business value. This role is a unique blend of customer relationship skills and elite software engineer; you will work side...
your career. THE ROLE: We are seeking a GPU Performance Engineer to optimize AI training and inference workloads and guide the.../RCCL, MPI), and deliver optimizations to maximize scaling efficiency. Collaborate with compiler/runtime teams to improve...
speculative decoding, LoRA, MoE, and KV cache management. Design and implement compiler and runtime optimizations tailored... with compiler infrastructure for large language model inference. Exposure to robotics or embedded AI pipelines, including...
or support Hands-on experience with Cadence Virtuoso or Synopsys Custom Compiler Familiarity with Calibre rule decks...
, we strongly encourage you to apply – Experience on parallelizing compiler development Ability to develop complex C++ code...
learning models to be deployed in Physical AI systems. As part of the role, you will develop compiler technology to allow... to develop the best solution for partners working on our platforms. What you'll be doing: Developing compiler technologies...
with industry-standard EDA tools for physical design, including Cadence Genus and Innovus, and Synopsys Design Compiler, IC Compiler... and Fusion Compiler Working knowledge of static timing analysis tools such as Tempus or PrimeTime and EM/IR-Drop/Crosstalk...
applications, on supercomputers or the cloud Background with tasking or asynchronous runtimes Background on compiler...
challenges. You will have the chance to work on custom and compiler RAM layouts with cutting-edge process technology... Crowd: SRAM digital custom block design experience SRAM compiler experience With competitive salaries and a generous...