Interests

  • GPU Systems & CUDA Kernel Optimization
  • MLSys (vLLM, FlashInfer, DeepSpeed)
  • TinyML and Efficient ML
  • AWQ, GPTQ
  • Megatron-LM (DGX Scale)