- BigMac: A Communication-Effcient Mixture-of-Experts Model Structure for Fast Training and Inference
Zewen Jin, Shengnan Wang, Jiaan Zhu, Hongrui Zhan, Youhui Bai, Zhenyu Ming, Cheng Li
In AAAI, March 2025. - XL3M: A Training-free Framework for LLM Length Extension Based on Segment-wise Inference
Shengnan Wang, Youhui Bai, Lin Zhang, Pingyi Zhou, Shixiong Zhao, Gong Zhang, Sen Wang, Renhai Chen, Hua Xu, Hongwei Sun
In arXiv, May 2024. - A Survey on Auto-Parallelism of Neural Networks Training
Peng Liang, Yu Tang, Xiaoda Zhang, Youhui Bai, Teng Su, linbo qiao, Zhiquan Lai, Dongsheng Li
In TPDS, May 2023. - A Generic, High-Performance, Compression-Aware Framework for Data Parallel DNN Training
Hao Wu, Shiyi Wang, Youhui Bai (corresponding author), Cheng Li, Quan Zhou, Jun Yi, Feng Yan, Ruichuan Chen, Yinlong Xu
In TPDS, Virtual, April 2023. - MPress: Democratizing Billion-Scale Model Training on Multi-GPU Servers via Memory-Saving Inter-Operator Parallelism
Quan Zhou, Haiquan Wang, Xiaoyan Yu, Cheng Li, Youhui Bai, Feng Yan, Yinlong Xu
In HPCA, Montreal, QC, Canada, February 2023. - Gradient Compression Supercharged High-Performance Data Parallel DNN Training (code)
Youhui Bai, Cheng Li, Quan Zhou, Jun Yi, Ping Gong, Feng Yan, Ruichuan Chen, Yinlong Xu
In SOSP, Virtual, October 2021. - Efficient Data Loader for Fast Sampling-based GNN Training on Large Graphs
Youhui Bai, Cheng Li, Zhiqi Lin, Yufei Wu, Youshan Miao, Yunxin Liu, Yinlong Xu
In GNNSys Workshop, Virtual, April 2021. - Efficient Data Loader for Fast Sampling-based GNN Training on Large Graphs (code)
Youhui Bai, Cheng Li, Zhiqi Lin, Yufei Wu, Youshan Miao, Yunxin Liu, Yinlong Xu
In TPDS, Virtual, March 2021. - Fast Logging and Recovery Support for Transactional Databases
Youhui Bai, Cheng Li, Yinlong Xu
In SOSP Poster, Shanghai, China, October 2017. - PDS: An I/O-efficient Scaling Scheme for Parity Declustered Data Layout.
Zhipeng Li, Yinlong Xu, Yongkun Li, Chengjin Tian and Youhui Bai
In ICPP, Bristol, UK, August 2017.