Cited By
View all- Li YWei FZhang CZhang HSalakhutdinov RKolter ZHeller KWeller AOliver NScarlett JBerkenkamp F(2024)EAGLEProceedings of the 41st International Conference on Machine Learning10.5555/3692070.3693232(28935-28948)Online publication date: 21-Jul-2024
- Kim SHooper CGholami ADong ZLi XShen SMahoney MKeutzer KSalakhutdinov RKolter ZHeller KWeller AOliver NScarlett JBerkenkamp F(2024)SqueezeLLMProceedings of the 41st International Conference on Machine Learning10.5555/3692070.3693028(23901-23923)Online publication date: 21-Jul-2024
- Butler BYu SMazaheri AJannesari A(2024)PipeInfer: Accelerating LLM Inference using Asynchronous Pipelined SpeculationProceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis10.1109/SC41406.2024.00046(1-19)Online publication date: 17-Nov-2024
- Show More Cited By