engineer with strong technical and analytical expertise in GPGPU C++, Triton, TileLang or DSL development within Linux... performance goals. Initiate and help with different level codegen optimizations. Contribute to SGLang Development: Support...
and align kernel-level optimizations with full-stack performance goals. Contribute to SGLang Development: Support... contributions that benefit AMD’s AI software ecosystem. THE PERSON: Skilled engineer with strong technical and analytical...
PREFERRED EXPERIENCE: Inference Stack Knowledge Hands-on understanding of vLLM, SGLang, or similar inference stacks... your career. THE ROLE: As a senior member of the LLM inference framework team, you will be responsible for building...
. This role is deeply focused on LLM inference stacks, including vLLM, SGLang, and internal inference platforms. You will work... GPUs. THE PERSON: You are a senior systems engineer with deep LLM domain knowledge who enjoys working close to the...
-LLM team and help shape the next generation of edge AI for automotive and robotics. We build the software stack... development for critical transformer components such as attention, GEMM, and MoE. Benchmark, profile, and optimize inference...
architecture search, and streamlined deployment strategies with open-sourced inference frameworks. Seeking a Senior Deep Learning... co-design. Your work will span multiple layers of the AI software stack—ranging from algorithm design to integration...