, you will design, implement, and productionize model optimization algorithms for inference and deployment on NVIDIA’s latest hardware... architecture search, and streamlined deployment strategies with open-sourced inference frameworks. Seeking a Senior Deep Learning...
, and productionize model optimization algorithms for inference and deployment on NVIDIA’s latest hardware platforms. The focus is on ease... architecture search, and streamlined deployment strategies with open-sourced inference frameworks. Seeking a Senior Deep Learning...
training algorithms, and model-parallel paradigms. Performance tuning and optimizations, model training and fine-tuning... Models (LLM) and Multimodal (MM) foundation model pretraining and post-training. Our GenAI Frameworks provide end-to-end...
an ambitious and forward-thinking AI/ML System Performance Engineer to contribute to the development of next-generation inference... optimizations and deliver industry-leading performance. In this role, you will investigate and prototype scalable inference...