avatar

Youhui Bai, 白有辉

Yesterday is a history, tomorrow is a mystery, but today is a gift, that is why it is called "Present".

HangZhou, China

Publications

  1. BigMac: A Communication-Effcient Mixture-of-Experts Model Structure for Fast Training and Inference
    Zewen Jin, Shengnan Wang, Jiaan Zhu, Hongrui Zhan, Youhui Bai, Zhenyu Ming, Cheng Li
    In AAAI, March 2025.
  2. XL3M: A Training-free Framework for LLM Length Extension Based on Segment-wise Inference
    Shengnan Wang, Youhui Bai, Lin Zhang, Pingyi Zhou, Shixiong Zhao, Gong Zhang, Sen Wang, Renhai Chen, Hua Xu, Hongwei Sun
    In arXiv, May 2024.
  3. A Survey on Auto-Parallelism of Neural Networks Training
    Peng Liang, Yu Tang, Xiaoda Zhang, Youhui Bai, Teng Su, linbo qiao, Zhiquan Lai, Dongsheng Li
    In TPDS, May 2023.
  4. A Generic, High-Performance, Compression-Aware Framework for Data Parallel DNN Training
    Hao Wu, Shiyi Wang, Youhui Bai (corresponding author), Cheng Li, Quan Zhou, Jun Yi, Feng Yan, Ruichuan Chen, Yinlong Xu
    In TPDS, Virtual, April 2023.
  5. MPress: Democratizing Billion-Scale Model Training on Multi-GPU Servers via Memory-Saving Inter-Operator Parallelism
    Quan Zhou, Haiquan Wang, Xiaoyan Yu, Cheng Li, Youhui Bai, Feng Yan, Yinlong Xu
    In HPCA, Montreal, QC, Canada, February 2023.
  6. Gradient Compression Supercharged High-Performance Data Parallel DNN Training (code)
    Youhui Bai, Cheng Li, Quan Zhou, Jun Yi, Ping Gong, Feng Yan, Ruichuan Chen, Yinlong Xu
    In SOSP, Virtual, October 2021.
  7. Efficient Data Loader for Fast Sampling-based GNN Training on Large Graphs
    Youhui Bai, Cheng Li, Zhiqi Lin, Yufei Wu, Youshan Miao, Yunxin Liu, Yinlong Xu
    In GNNSys Workshop, Virtual, April 2021.
  8. Efficient Data Loader for Fast Sampling-based GNN Training on Large Graphs (code)
    Youhui Bai, Cheng Li, Zhiqi Lin, Yufei Wu, Youshan Miao, Yunxin Liu, Yinlong Xu
    In TPDS, Virtual, March 2021.
  9. Fast Logging and Recovery Support for Transactional Databases
    Youhui Bai, Cheng Li, Yinlong Xu
    In SOSP Poster, Shanghai, China, October 2017.
  10. PDS: An I/O-efficient Scaling Scheme for Parity Declustered Data Layout.
    Zhipeng Li, Yinlong Xu, Yongkun Li, Chengjin Tian and Youhui Bai
    In ICPP, Bristol, UK, August 2017.