, and optimizes for E2E workload and to contribute to external frameworks (e.g., PyTorch, vLLM, SGLang). Implements various... to identify performance bottlenecks and proposes solutions across individual component teams. Optimizes code for various computing...