Yineng Zhang zhyncs

🔭 I am a software engineer on the Model Performance Team at Baseten. I used to work at Meituan, using Tensorflow, TensorRT for CTR GPU Inference and PyTorch for LLM GPU Inference.
💻 Open Source: Team Member at LMSYS Org, working on SGLang, and a committer for both FlashInfer and LMDeploy.
👀 If you're interested in learning more about my experiences and SGLang, I recommend checking out my talk about SGLang at GPU MODE.
📫 How to reach me: [email protected] or Telegram
📄 Learn more about my work experience: Linkedin

Provide feedback