GPUs. THE PERSON: You are a senior systems engineer with deep LLM domain knowledge who enjoys working close to the... inference systems (e.g., FasterTransformer), with demonstrated performance tuning. * GPU Kernel Development: Proven experience...
your career. THE ROLE: As a senior member of the LLM inference framework team, you will be responsible for building.... This role sits at the intersection of inference engines, distributed systems, and GPU runtime and kernel backends. THE PERSON...
We are seeking highly skilled and motivated software engineers to join us and build AI inference systems that serve... building and optimizing LLM inference engines (e.g., vLLM, SGLang). Hands-on work with ML compilers and DSLs (e.g., Triton...
and tune large-scale training and inference models for optimal performance on AMD hardware. GPU Kernel Development: Design..., and enabling training and inference at scale across multi-GPU and multi-node systems. You will collaborate across internal GPU...
security posture from development through runtime. As a Senior Principal Machine Learning Engineer, you will drive research... professional experience in software engineering with a deep focus on MLOps, ML systems, or productionizing machine learning models...
security posture from development through runtime. As a Principal Machine Learning Inference Engineer, you will serve... in software engineering with a deep focus on MLOps, ML systems, or productionizing machine learning models at scale. Expert...