UC Berkeley Sky Computing Lab Home Publications
Publications

See my Google Scholar for the latest list.

Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference
Wei-Lin Chiang, Lianmin Zheng, Ying Sheng, Anastasios N. Angelopoulos, Tianle Li, Dacheng Li, Banghua Zhu, Hao Zhang, Michael I. Jordan, Joseph E. Gonzalez, Ion Stoica
ICML'24

Fairness in Serving Large Language Models
Ying Sheng, Shiyi Cao, Dacheng Li, Banghua Zhu, Zhuohan Li, Danyang Zhuo, Joseph E. Gonzalez, Ion Stoica
OSDI'24

S-LoRA: Serving Thousands of Concurrent LoRA Adapters
Ying Sheng*, Shiyi Cao*, Dacheng Li, Coleman Hooper, Nicholas Lee, Shuo Yang, Christopher Chou, Banghua Zhu, Lianmin Zheng, Kurt Keutzer, Joseph E. Gonzalez, Ion Stoica
MLsys'24

LightSeq: sequence level parallelism for distributed training of long context transformers
Dacheng Li*, Rulin Shao*, Anze Xie, Eric P. Xing, Joseph E. Gonzalez, Ion Stoica, Xuezhe Ma, Hao Zhang
Under submission to ICLR'24

How long can opensource llms truly promise on context length
Dacheng Li*, Rulin Shao*, Anze Xie, Ying Sheng, Lianmin Zheng, Joseph E. Gonzalez, Ion Stoica, Xuezhe Ma, and Hao Zhang
NeurIPS 2023 Instruction following workshop

AMP: Automatically Finding Model Parallel Strategies with Heterogeneity Awareness
Dacheng Li , Hongyi Wang, Eric P. Xing, Hao Zhang
36th Conference on Neural Information Processing Systems (NeurIPS 2022)

Dual Contradistinctive Generative AutoEncoder
Gaurav Parmar*, Dacheng Li* , Kwonjoon Lee*, Zhuowen Tu
2021 Conference on Computer Vision and Pattern Recognition (CVPR 2021)

MPCFORMER: FAST, PERFORMANT AND PRIVATE TRANSFORMER INFERENCE WITH MPC
Dacheng Li* , Rulin Shao*, Hongyi Wang*, Han Guo, Eric P. Xing, Hao Zhang
ICLR'23 (Spotlight)

Judging LLM-as-a-judge with MT-Bench and Chatbot Arena
Zheng Lianmin*, Wei-Lin Chiang*, Ying Sheng*, Siyuan Zhuang, Zhanghao Wu, Yonghao Zhuang, Zi Lin, Zhuohan Li, Dacheng Li , Eric. P Xing, Hao Zhang, Joseph E. Gonzalez, Ion Stoica
Thirty-seventh Conference on Neural Information Processing Systems (NeurIPS 23)

Does compressing activations help model parallel training?
Song Bian*, Dacheng Li* , Hongyi Wang, Eric P. Xing, Shivaram Venkataraman
MLSys'24

Awards

A Faster and More Accurate Secure Model Serving Framework on the Cloud
PI: Eric P. Xing
Amazon Research Awards ($80000)