news

Oct 01, 2024 I’m honored to serve as a reviewer for IEEE Transactions on Mobile Computing (TMC). I’m excited about the opportunity to contribute to the community in this new role!
May 16, 2024 I’m selected as one of the ML and Systems Rising Stars! Thanks to everyone who has supported me along the way! I’ll be attending the workshop at NVIDIA’s headquarters in Santa Clara, CA, on July 15-16.
Mar 21, 2024 Our paper “ServerlessLLM: Locality-Enhanced Serverless Inference for Large Language Models” has been accepted to OSDI 2024. Preprint available on ArXiv. Code will be released soon. Stay tuned! :sparkles: :smile:
Jan 25, 2024 We released two new papers on ArXiv! Check them out: ServerlessLLM and MoE-Infinite