Zhihao Zhang

avatar.jpg

9219 GHC,

4902 Forbes Ave,

Pittsburgh, PA 15213

I am a third-year Ph.D. student at Computer Science Department of Carnegie Mellon University. I’m a member of the CMU Catalyst research group, and fortunate to be advised by Prof. Zhihao Jia.

Prior to joining CMU, I received my Master degree at the Robotics Institute of Carnegie Mellon University and B.Sc in Computer Science at Renmin University of China, where I have been advised Prof. Changliu Liu and Prof. Qin Jin.

Research interests: Efficient System for Machine Learning

selected publications

  1. ICML
    Accelerating retrieval-augmented language model serving with speculation
    Zhihao Zhang, Alan Zhu , Lijie Yang , and 4 more authors
    To appear at ICML 2024, 2024
  2. ASPLOS
    Specinfer: Accelerating generative llm serving with speculative inference and token tree verification
    Xupeng Miao * , Gabriele Oliaro *Zhihao Zhang * , and 7 more authors
    To appear at ASPLOS 2024, 2023
  3. ICLR
    GradSign: Model Performance Inference with Theoretical Insights
    Zhihao Zhang, and Zhihao Jia
    In International Conference on Learning Representations , 2021
  4. NeurIPS
    Communication Bounds for the Distributed Experts Problem
    Zhihao Jia , Qi Pang , Trung Tran , and 3 more authors (in alphabetic order)
    In The Thirty-eighth Annual Conference on Neural Information Processing Systems , 2024