Cited By
- Xu D, Zhang H, Yang L, Liu R, Huang G, Xu M, Liu X, Eeckhout L, Smaragdakis G, Liang K, Sampson A, Kim M, Rossbach C (2025) Fast On-device LLM Inference with NPUs. Proceedings of the 30th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 1, pp. 445-462. DOI: 10.1145/3669940.3707239. Online publication date: 30-Mar-2025
- Xu D, Xu M, Lou C, Zhang L, Huang G, Jin X, Liu X, Tsafrir D, Musuvathi M, Gupta R, Abu-Ghazaleh N (2024) SoCFlow: Efficient and Scalable DNN Training on SoC-Clustered Edge Servers. Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 1, pp. 368-385. DOI: 10.1145/3617232.3624847. Online publication date: 27-Apr-2024
- Xu M, Xu D, Lou C, Zhang L, Huang G, Jin X, Liu X (2024) Efficient, Scalable, and Sustainable DNN Training on SoC-Clustered Edge Servers. IEEE Transactions on Mobile Computing 23:12, pp. 14344-14360. DOI: 10.1109/TMC.2024.3442430. Online publication date: Dec-2024