 |
Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference
Wei-Lin Chiang, Lianmin Zheng, Ying Sheng, Anastasios N. Angelopoulos, Tianle Li, Dacheng Li, Banghua Zhu, Hao Zhang, Michael I. Jordan, Joseph E. Gonzalez, Ion Stoica
ICML'24
|
 |
Fairness in Serving Large Language Models
Ying Sheng, Shiyi Cao, Dacheng Li, Banghua Zhu, Zhuohan Li, Danyang Zhuo, Joseph E. Gonzalez, Ion Stoica
OSDI'24
|
 |
S-LoRA: Serving Thousands of Concurrent LoRA Adapters
Ying Sheng*, Shiyi Cao*, Dacheng Li, Coleman Hooper, Nicholas Lee, Shuo Yang, Christopher Chou, Banghua Zhu, Lianmin Zheng, Kurt Keutzer, Joseph E. Gonzalez, Ion Stoica
MLSys'24
|
 |
LightSeq: Sequence Level Parallelism for Distributed Training of Long Context Transformers
Dacheng Li*, Rulin Shao*, Anze Xie, Eric P. Xing, Joseph E. Gonzalez, Ion Stoica, Xuezhe Ma, Hao Zhang
Under submission to ICLR'24
|
 |
How Long Can Open-Source LLMs Truly Promise on Context Length?
Dacheng Li*, Rulin Shao*, Anze Xie, Ying Sheng, Lianmin Zheng, Joseph E. Gonzalez, Ion Stoica, Xuezhe Ma, Hao Zhang
NeurIPS 2023 Instruction following workshop
|
 |
AMP: Automatically Finding Model Parallel Strategies with Heterogeneity Awareness
Dacheng Li, Hongyi Wang, Eric P. Xing, Hao Zhang
36th Conference on Neural Information Processing Systems (NeurIPS 2022)
|
 |
Dual Contradistinctive Generative AutoEncoder
Gaurav Parmar*, Dacheng Li*, Kwonjoon Lee*, Zhuowen Tu
2021 Conference on Computer Vision and Pattern Recognition (CVPR 2021)
|
 |
MPCFormer: Fast, Performant and Private Transformer Inference with MPC
Dacheng Li*, Rulin Shao*, Hongyi Wang*, Han Guo, Eric P. Xing, Hao Zhang
ICLR'23 (Spotlight)
|
 |
Judging LLM-as-a-judge with MT-Bench and Chatbot Arena
Lianmin Zheng*, Wei-Lin Chiang*, Ying Sheng*, Siyuan Zhuang, Zhanghao Wu, Yonghao Zhuang, Zi Lin, Zhuohan Li, Dacheng Li, Eric P. Xing, Hao Zhang, Joseph E. Gonzalez, Ion Stoica
Thirty-seventh Conference on Neural Information Processing Systems (NeurIPS 2023)
|
 |
Does compressing activations help model parallel training?
Song Bian*, Dacheng Li*, Hongyi Wang, Eric P. Xing, Shivaram Venkataraman
MLSys'24
|
|