Yao Fu (符 尧)
Ph.D. Student at The University of Edinburgh
1.45, Informatics Forum
Edinburgh, EH8 9AB
Scotland, UK
I am a Ph.D. student in Computer Science at The University of Edinburgh, supervised by Prof. Luo Mai. I received my B.Eng. degree in Computer Science and Technology from Sun Yat-sen University in June 2021, where I was supervised by Prof. Di Wu as a member of the Yat-sen Honor School.
I work at the intersection of machine learning and distributed systems. My goal is to build efficient systems for deploying machine learning models at large scale. My current research focuses on efficient inference of large language models in serverless computing clusters.
news
Mar 21, 2024 | Our paper “ServerlessLLM: Locality-Enhanced Serverless Inference for Large Language Models” has been accepted to OSDI 2024. A preprint is available on arXiv. Code will be released soon. Stay tuned!
Jan 25, 2024 | We released two new papers on arXiv! Check them out: ServerlessLLM and MoE-Infinity.
publications
- ServerlessLLM: Locality-Enhanced Serverless Inference for Large Language Models. OSDI, 2024
- MoE-Infinity: Activation-Aware Expert Offloading for Efficient MoE Serving. arXiv preprint arXiv:2401.14361, 2024
- TorchOpt: An Efficient Library for Differentiable Optimization. JMLR, 2023
- Optimizing the Numbers of Queries and Replies in Convex Federated Learning with Differential Privacy. IEEE Transactions on Dependable and Secure Computing, 2023
- Ekko: A Large-Scale Deep Learning Recommender System with Low-Latency Model Update. OSDI, 2022