Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleAugust 2024
PRoof: A Comprehensive Hierarchical Profiling Framework for Deep Neural Networks with Roofline Analysis
ICPP '24: Proceedings of the 53rd International Conference on Parallel ProcessingPages 822–832https://doi.org/10.1145/3673038.3673116The increasing diversity of deep neural network (DNN) models and hardware platforms necessitates effective model profiling for high-performance inference deployment. Current DNN profiling tools suffer from either limited optimization insights due to the ...
- research-articleSeptember 2023
Exploiting Subgraph Similarities for Efficient Auto-tuning of Tensor Programs
ICPP '23: Proceedings of the 52nd International Conference on Parallel ProcessingPages 786–796https://doi.org/10.1145/3605573.3605596The requirement for deploying deep learning (DL) models efficiently has boosted the research of DL compilers. Especially, the difficulty of generating optimized tensor programs has driven DL compilers to commonly adopt the auto-tuning approaches. ...
- research-articleDecember 2020
HitAnomaly: Hierarchical Transformers for Anomaly Detection in System Log
IEEE Transactions on Network and Service Management (ITNSM), Volume 17, Issue 4Pages 2064–2076https://doi.org/10.1109/TNSM.2020.3034647Enterprise systems often produce a large volume of logs to record runtime status and events. Anomaly detection from system logs is crucial for service management and system maintenance. Most existing log-based anomaly detection methods use log event <...