Ang Li

Cited by

	All	Since 2019
Citations	4223	3973
h-index	37	36
i10-index	81	80

1400

700

350

1050

20162017201820192020202120222023202438 56 140 204 264 480 640 958 1400

Public access

View all

97 articles

9 articles

available

not available

Based on funding mandates

Co-authors

Tony (Tong) GengAssistant Professor, University of RochesterVerified email at rochester.edu
Samuel SteinPacific Northwest National LaboratoryVerified email at pnnl.gov
Chunshu WuPostdoctoral Research Fellow at University of Rochester, ECE DepartmentVerified email at ur.rochester.edu
Martin HerbordtProfessor, Electrical and Computer Engineering, Boston UniversityVerified email at bu.edu
Shuaiwen Leon SongVP of Research, Together.ai; Ex-Microsoft; Tenured ProfessorVerified email at together.ai
Cheng TanGoogle, Arizona State UniversityVerified email at google.com
Yufei DingUniversity of California, San DiegoVerified email at ucsd.edu
Antonino TumeoPacific Northwest National LaboratoryVerified email at pnnl.gov
Akash KumarFull Professor, Chair of Embedded Systems, Ruhr University BochumVerified email at rub.de
Henk CorporaalProfessor Embedded System Architectures, Eindhoven University of TechnologyVerified email at tue.nl
Qiang GuanKent State UniversityVerified email at cs.kent.edu
Ying MaoAssociate Professor, Fordham UniversityVerified email at cis.fordham.edu
Caiwen DingAssociate Professor, University of Minnesota - Twin CitiesVerified email at umn.edu
Kevin J. BarkerHigh Performance Computing Group Lead, Pacific Northwest National LaboratoryVerified email at pnnl.gov
Jiajia LiNorth Carolina State UniversityVerified email at ncsu.edu
Yanfei LiPostdoc, Pacific Northwest National LaboratoryVerified email at pnnl.gov
Chenxu LiuPacific Northwest National LaboratoryVerified email at pnnl.gov
Yingyan (Celine) LinAssociate Professor, Georgia Institute of TechnologyVerified email at gatech.edu
Hongwu PengPh.D. Student, University of ConnecticutVerified email at uconn.edu
Anqi GuoBoston UniversityVerified email at bu.edu

Ang Li

Pacific Northwest National Laboratory and University of Washington

Verified email at pnnl.gov - Homepage

GPU High Performance Computing Quantum Computing Computer Architecture


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Superneurons: Dynamic GPU memory management for training deep neural networks L Wang, J Ye, Y Zhao, W Wu, A Li, SL Song, Z Xu, T Kraska Proceedings of the 23rd ACM SIGPLAN symposium on principles and practice of …, 2018	319	2018
AWB-GCN: A graph convolutional network accelerator with runtime workload rebalancing T Geng, A Li, R Shi, C Wu, T Wang, Y Li, P Haghi, A Tumeo, S Che, ... 2020 53rd Annual IEEE/ACM International Symposium on Microarchitecture …, 2020	307	2020
Evaluating modern gpu interconnect: Pcie, nvlink, nv-sli, nvswitch and gpudirect A Li, SL Song, J Chen, J Li, X Liu, NR Tallent, KJ Barker IEEE Transactions on Parallel and Distributed Systems 31 (1), 94-110, 2019	288	2019
Qasmbench: A low-level quantum benchmark suite for nisq evaluation and simulation A Li, S Stein, S Krishnamoorthy, J Ang ACM Transactions on Quantum Computing 4 (2), 1-26, 2023	192*	2023
I-GCN: A graph convolutional network accelerator with runtime locality enhancement through islandization T Geng, C Wu, Y Zhang, C Tan, C Xie, H You, M Herbordt, Y Lin, A Li MICRO-54: 54th annual IEEE/ACM international symposium on microarchitecture …, 2021	120	2021
A synchronization-free algorithm for parallel sparse triangular solves W Liu, A Li, J Hogg, IS Duff, B Vinter Euro-Par 2016: Parallel Processing: 22nd International Conference on …, 2016	115	2016
Accelerating transformer-based deep learning models on fpgas using column balanced block pruning H Peng, S Huang, T Geng, A Li, W Jiang, H Liu, S Wang, C Ding 2021 22nd International Symposium on Quality Electronic Design (ISQED), 142-148, 2021	104	2021
Adaptive and transparent cache bypassing for GPUs A Li, GJ van den Braak, A Kumar, H Corporaal Proceedings of the International Conference for High Performance Computing …, 2015	96	2015
Locality-aware CTA clustering for modern GPUs A Li, SL Song, W Liu, X Liu, A Kumar, H Corporaal ACM SIGARCH Computer Architecture News 45 (1), 297-311, 2017	94	2017
Qugan: A quantum state fidelity based generative adversarial network SA Stein, B Baheri, D Chen, Y Mao, Q Guan, A Li, B Fang, S Xu 2021 IEEE International Conference on Quantum Computing and Engineering (QCE …, 2021	92*	2021
Bns-gcn: Efficient full-graph training of graph convolutional networks with partition-parallelism and random boundary node sampling C Wan, Y Li, A Li, NS Kim, Y Lin Proceedings of Machine Learning and Systems 4, 673-693, 2022	81	2022
Tartan: evaluating modern GPU interconnect via a multi-GPU benchmark suite A Li, SL Song, J Chen, X Liu, N Tallent, K Barker 2018 IEEE International Symposium on Workload Characterization (IISWC), 191-202, 2018	69	2018
OpenCGRA: An open-source unified framework for modeling, testing, and evaluating CGRAs C Tan, C Xie, A Li, KJ Barker, A Tumeo 2020 IEEE 38th International Conference on Computer Design (ICCD), 381-388, 2020	67	2020
FPDeep: Scalable acceleration of CNN training on deeply-pipelined FPGA clusters T Wang, T Geng, A Li, X Jin, M Herbordt IEEE Transactions on Computers 69 (8), 1143-1158, 2020	64*	2020
Fine-grained synchronizations and dataflow programming on GPUs A Li, GJ van den Braak, H Corporaal, A Kumar Proceedings of the 29th ACM on International Conference on Supercomputing …, 2015	62	2015
Fast synchronization‐free algorithms for parallel sparse triangular solves with multiple right‐hand sides W Liu, A Li, JD Hogg, IS Duff, B Vinter Concurrency and Computation: Practice and Experience 29 (21), e4244, 2017	61	2017
Gcod: Graph convolutional network acceleration via dedicated algorithm and accelerator co-design H You, T Geng, Y Zhang, A Li, Y Lin 2022 IEEE International Symposium on High-Performance Computer Architecture …, 2022	59	2022
Quclassi: A hybrid deep neural network architecture based on quantum state fidelity SA Stein, B Baheri, D Chen, Y Mao, Q Guan, A Li, S Xu, C Ding Proceedings of Machine Learning and Systems 4, 251-264, 2022	58	2022
Exploring and analyzing the real impact of modern on-package memory on HPC scientific kernels A Li, W Liu, MRB Kristensen, B Vinter, H Wang, K Hou, A Marquez, ... Proceedings of the International Conference for High Performance Computing …, 2017	58	2017
A length adaptive algorithm-hardware co-design of transformer on fpga through sparse attention and dynamic pipelining H Peng, S Huang, S Chen, B Li, T Geng, A Li, W Jiang, W Wen, J Bi, H Liu, ... Proceedings of the 59th ACM/IEEE Design Automation Conference, 1135-1140, 2022	55	2022

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors