Interests: GPU Systems & CUDA Kernel Optimization; MLSys (vLLM, FlashInfer, DeepSpeed); TinyML and Efficient ML; AWQ, GPTQ; Megatron-LM (DGX Scale)