scalability. Work includes multiple sub-areas: resource scheduling, task orchestration, model training, model inference, model... optimization algorithms like quantization and pruning. - Practical experience in performance optimization/tuning of deep learning...
like quantization and pruning. - Practical experience in performance optimization/tuning of deep learning model training/inference... scalability. Work includes multiple sub-areas: resource scheduling, task orchestration, model training, model inference, model...
-areas: ML model training and evaluation, model optimization, model inference, model management, dataset management, workflow... training and inference - Strong understanding and engineering experience of cutting-edge LLM research and engineering (e.g...
Optimization: Optimize the performance of model inference, including but not limited to efficient utilization of computing... (such as TensorFlow, PyTorch, DeepSpeed) and their deployment in production environments. - Familiarity with model inference optimization...
and Supply Chain, Customer Experience and Governance and Knowledge Graph. We constantly work on areas such as modeling inference... and performance optimization, model training and deployment, data processing pipeline, features engineering and online services...