Cited By
View all- Gritskikh MIsakov AGusarova NDobrenko DTomilov IVatian A(2024)Loss Function Role in Processing Sequences with Heavy-Tailed DistributionsIntelligent Data Engineering and Automated Learning – IDEAL 202410.1007/978-3-031-77731-8_33(361-374)Online publication date: 19-Nov-2024
- Du JJiang JZheng JZhang HHuang DLu Y(2023)Improving Computation and Memory Efficiency for Real-world Transformer Inference on GPUsACM Transactions on Architecture and Code Optimization10.1145/361768920:4(1-22)Online publication date: 26-Oct-2023
- Zhang QLiu YLiu TQian D(2023)CoFB: latency-constrained co-scheduling of flows and batches for deep learning inference service on the CPU–GPU systemThe Journal of Supercomputing10.1007/s11227-023-05183-679:13(14172-14199)Online publication date: 4-Apr-2023
- Show More Cited By