default search action
Ang Li 0006
Person information
- affiliation: Pacific Northwest National Laboratory, Richland, WA, USA
Other persons with the same name
- Ang Li — disambiguation page
- Ang Li 0001 — University of Maryland, College Park, MD, USA
- Ang Li 0002 — Duke University, Durham, NC, USA
- Ang Li 0003 — Xi'an Jiaotong University, Faculty of Electronic and Information Engineering, Shaanxi, China (and 3 more)
- Ang Li 0004 — Beijing Forestry University, Department of Psychology, China (and 2 more)
- Ang Li 0005 — University of Maryland, College Park, MD, USA (and 2 more)
- Ang Li 0007 (aka: Ang Leon Li) — University of Queensland, Brisbane, QLD, Australia
- Ang Li 0008 — University of Melbourne, School of Computing and Information Systems, Parkville, Victoria, Australia
- Ang Li 0009 — Florida State University, Department of Computer Science, Tallahassee, FL, USA (and 1 more)
- Ang Li 0010 — Stanford University, CA, USA
- Ang Li 0011 — University of Washington, Seattle, WA, USA (and 1 more)
- Ang Li 0012 — Nanjing University of Posts and Telecommunications, Nanjing, China
- Ang Li 0013 — Arizona State University, Tempe, AZ, USA
- Ang Li 0014 — Shanghai Normal University, Shanghai, China
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j16]Wei Sun, Ang Li, Sander Stuijk, Henk Corporaal:
How Much Can We Gain From Tensor Kernel Fusion on GPUs? IEEE Access 12: 126135-126144 (2024) - [j15]Yuri Alexeev, Maximilian Amsler, Marco Antonio Barroca, Sanzio Bassini, Torey Battelle, Daan Camps, David Casanova, Young Jay Choi, Frederic T. Chong, Charles Chung, Christopher Codella, Antonio D. Córcoles, James Cruise, Alberto Di Meglio, Ivan Duran, Thomas Eckl, Sophia E. Economou, Stephan J. Eidenbenz, Bruce Elmegreen, Clyde Fare, Ismael Faro, Cristina Sanz Fernández, Rodrigo Neumann Barros Ferreira, Keisuke Fuji, Bryce Fuller, Laura Gagliardi, Giulia Galli, Jennifer R. Glick, Isacco Gobbi, Pranav Gokhale, Salvador de la Puente Gonzalez, Johannes Greiner, Bill Gropp, Michele Grossi, Emanuel Gull, Burns Healy, Matthew R. Hermes, Benchen Huang, Travis S. Humble, Nobuyasu Ito, Artur F. Izmaylov, Ali Javadi-Abhari, Douglas M. Jennewein, Shantenu Jha, Liang Jiang, Barbara Jones, Wibe Albert de Jong, Petar Jurcevic, William M. Kirby, Stefan Kister, Masahiro Kitagawa, Joel Klassen, Katherine Klymko, Kwangwon Koh, Masaaki Kondo, Doga Murat Kürkçüoglu, Krzysztof Kurowski, Teodoro Laino, Ryan Landfield, Matthew L. Leininger, Vicente Leyton-Ortega, Ang Li, Meifeng Lin, Junyu Liu, Nicolás Lorente, André Luckow, Simon Martiel, Francisco Martín-Fernández, Margaret Martonosi, Claire Marvinney, Arcesio Castañeda Medina, Dirk Merten, Antonio Mezzacapo, Kristel Michielsen, Abhishek Mitra, Tushar Mittal, Kyungsun Moon, Joel Moore, Sarah Mostame, Mario Motta, Young-Hye Na, Yunseong Nam, Prineha Narang, Yu-ya Ohnishi, Daniele Ottaviani, Matthew Otten, Scott Pakin, Vincent R. Pascuzzi, Edwin Pednault, Tomasz Piontek, Jed Pitera, Patrick Rall, Gokul Subramanian Ravi, Niall Robertson, Matteo A. C. Rossi, Piotr Rydlichowski, Hoon Ryu, Georgy Samsonidze, Mitsuhisa Sato, Nishant Saurabh, Vidushi Sharma, Kunal Sharma, Soyoung Shin, George Slessman, Mathias Steiner, Iskandar Sitdikov, In-Saeng Suh, Eric D. Switzer, Wei Tang, Joel Thompson, Synge Todo, Minh C. Tran, Dimitar Trenev, Christian Trott, Huan-Hsin Tseng, Norm M. Tubman, Esin Tureci, David García Valiñas, Sofia Vallecorsa, Christopher Wever, Konrad Wojciechowski, Xiaodi Wu, Shinjae Yoo, Nobuyuki Yoshioka, Victor Wen-zhe Yu, Seiji Yunoki, Sergiy Zhuk, Dmitry Zubarev:
Quantum-centric supercomputing for materials science: A perspective on challenges and future directions. Future Gener. Comput. Syst. 160: 666-710 (2024) - [j14]Hatem Helal, Jesun Firoz, Jenna A. Bilbrey, Henry Sprueill, Kristina M. Herman, Mario Michael Krell, Tom Murray, Manuel Lopez Roldan, Mike Kraus, Ang Li, Payel Das, Sotiris S. Xantheas, Sutanay Choudhury:
Acceleration of Graph Neural Network-Based Prediction Models in Chemistry via Co-Design Optimization on Intelligence Processing Units. J. Chem. Inf. Model. 64(5): 1568-1580 (2024) - [j13]Chunshu Wu, Chen Yang, Sahan Bandara, Tong Geng, Anqi Guo, Pouya Haghi, Ang Li, Martin C. Herbordt:
FPGA-Accelerated Range-Limited Molecular Dynamics. IEEE Trans. Computers 73(6): 1544-1558 (2024) - [c106]Zheng Wang, Yuke Wang, Jiaqi Deng, Da Zheng, Ang Li, Yufei Ding:
RAP: Resource-aware Automated GPU Sharing for Multi-GPU Recommendation Model Training and Input Preprocessing. ASPLOS (2) 2024: 964-979 - [c105]Meng Wang, Bo Fang, Ang Li, Prashant J. Nair:
Red-QAOA: Efficient Variational Optimization through Circuit Reduction. ASPLOS (2) 2024: 980-998 - [c104]Xinyi Li, Ang Li, Bo Fang, Katarzyna Swirydowicz, Ignacio Laguna, Ganesh Gopalakrishnan:
FTTN: Feature-Targeted Testing for Numerical Properties of NVIDIA & AMD Matrix Accelerators. CCGrid 2024: 39-46 - [c103]Xinyi Li, Ang Li, Bo Fang, Katarzyna Swirydowicz, Ignacio Laguna, Ganesh Gopalakrishnan:
Discovery of Floating-Point Differences Between NVIDIA and AMD GPUs. CCGrid 2024: 663-666 - [c102]Bo Fang, Xinyi Li, Harvey Dam, Cheng Tan, Siva Kumar Sastry Hari, Timothy Tsai, Ignacio Laguna, Dingwen Tao, Ganesh Gopalakrishnan, Prashant J. Nair, Kevin J. Barker, Ang Li:
Understanding Mixed Precision GEMM with MPGemmFI: Insights into Fault Resilience. CLUSTER 2024: 166-178 - [c101]Jinyang Li, Ang Li, Weiwen Jiang:
QUAPPROX: A Framework for Benchmarking the Approximability of Variational Quantum Circuit. ICASSP 2024: 13376-13380 - [c100]Chunshu Wu, Ruibing Song, Chuan Liu, Yunan Yang, Ang Li, Michael C. Huang, Tong Geng:
Extending Power of Nature from Binary to Real-Valued Graph Learning in Real World. ICLR 2024 - [c99]Pouya Haghi, Cheng Tan, Anqi Guo, Chunshu Wu, Dongfang Liu, Ang Li, Anthony Skjellum, Tong Geng, Martin C. Herbordt:
SmartFuse: Reconfigurable Smart Switches to Accelerate Fused Collectives in HPC Applications. ICS 2024: 413-425 - [c98]Ruibing Song, Chunshu Wu, Chuan Liu, Ang Li, Michael C. Huang, Tong Geng:
DS-GL: Advancing Graph Learning via Harnessing Nature's Power within Scalable Dynamical Systems. ISCA 2024: 45-57 - [c97]Keyi Yin, Xiang Fang, Travis S. Humble, Ang Li, Yunong Shi, Yufei Ding:
Surf-Deformer: Mitigating Dynamic Defects on Surface Code via Adaptive Deformation. MICRO 2024: 750-764 - [c96]Pouya Haghi, Chunshu Wu, Zahra Azad, Yanfei Li, Andrew Gui, Yuchen Hao, Ang Li, Tony Tong Geng:
Bridging the Gap Between LLMs and LNS with Dynamic Data Format and Architecture Codesign. MICRO 2024: 1617-1631 - [c95]Zheng Wang, Yuke Wang, Boyuan Feng, Guyue Huang, Dheevatsa Mudigere, Bharath Muthiah, Ang Li, Yufei Ding:
OPER: Optimality-Guided Embedding Table Parallelization for Large-scale Recommendation Model. USENIX ATC 2024: 667-682 - [c94]Hongwu Peng, Caiwen Ding, Tong Geng, Sutanay Choudhury, Kevin J. Barker, Ang Li:
Evaluating Emerging AI/ML Accelerators: IPU, RDU, and NVIDIA/AMD GPUs. ICPE (Companion) 2024: 14-20 - [i52]Zirui Mao, Xinyi Li, Shenyang Hu, Ganesh Gopalakrishnan, Ang Li:
A GPU accelerated mixed-precision Smoothed Particle Hydrodynamics framework with cell-based relative coordinates. CoRR abs/2401.08586 (2024) - [i51]Ryan L'Abbate, Anthony D'Onofrio Jr., Samuel Stein, Samuel Yen-Chi Chen, Ang Li, Pin-Yu Chen, Juntao Chen, Ying Mao:
A Quantum-Classical Collaborative Training Architecture Based on Quantum State Fidelity. CoRR abs/2402.15333 (2024) - [i50]Xinyi Li, Ang Li, Bo Fang, Katarzyna Swirydowicz, Ignacio Laguna, Ganesh Gopalakrishnan:
FTTN: Feature-Targeted Testing for Numerical Properties of NVIDIA & AMD Matrix Accelerators. CoRR abs/2403.00232 (2024) - [i49]Yanfei Li, Juejing Liu, Xiaodong Zhao, Wenjun Liu, Tong Geng, Ang Li, Xin Zhang:
Accurate and Data-Efficient Micro-XRD Phase Identification Using Multi-Task Learning: Application to Hydrothermal Fluids. CoRR abs/2403.10042 (2024) - [i48]Shuwen Kan, Zefan Du, Miguel Palma, Samuel Alexander Stein, Chenxu Liu, Wenqi Wei, Juntao Chen, Ang Li, Ying Mao:
Scalable Circuit Cutting and Scheduling in a Resource-constrained and Distributed Quantum System. CoRR abs/2405.04514 (2024) - [i47]Hexu Zhao, Haoyang Weng, Daohan Lu, Ang Li, Jinyang Li, Aurojit Panda, Saining Xie:
On Scaling Up 3D Gaussian Splatting Training. CoRR abs/2406.18533 (2024) - [i46]Mingkai Chen, Taowen Wang, James Chenhao Liang, Chuan Liu, Chunshu Wu, Qifan Wang, Ying Nian Wu, Michael Huang, Chuang Ren, Ang Li, Tong Geng, Dongfang Liu:
Inertial Confinement Fusion Forecasting via LLMs. CoRR abs/2407.11098 (2024) - [i45]Chuan Liu, Chunshu Wu, Shihui Cao, Mingkai Chen, James Chenhao Liang, Ang Li, Michael Huang, Chuang Ren, Dongfang Liu, Ying Nian Wu, Tong Geng:
Diff-PIC: Revolutionizing Particle-In-Cell Simulation for Advancing Nuclear Fusion with Diffusion Models. CoRR abs/2408.02693 (2024) - [i44]Zirui Mao, Shenyang Hu, Ang Li:
A GPU accelerated mixed-precision Finite Difference informed Random Walker (FDiRW) solver for strongly inhomogeneous diffusion problems. CoRR abs/2408.11376 (2024) - [i43]Yuhang Liang, Xinyi Li, Jie Ren, Ang Li, Bo Fang, Jieyang Chen:
Light-Weight Fault Tolerant Attention for Large Language Model Training. CoRR abs/2410.11720 (2024) - 2023
- [j12]Jou-An Chen, Hsin-Hsuan Sung, Xipeng Shen, Nathan R. Tallent, Kevin J. Barker, Ang Li:
Accelerating matrix-centric graph processing on GPUs through bit-level optimizations. J. Parallel Distributed Comput. 177: 53-67 (2023) - [j11]Ying Mao, Vaishali Sharma, Wenjia Zheng, Long Cheng, Qiang Guan, Ang Li:
Elastic Resource Management for Deep Learning Applications in a Container Cluster. IEEE Trans. Cloud Comput. 11(2): 2204-2216 (2023) - [j10]Wei Sun, Ang Li, Tong Geng, Sander Stuijk, Henk Corporaal:
Dissecting Tensor Cores via Microbenchmarks: Latency, Throughput and Numeric Behaviors. IEEE Trans. Parallel Distributed Syst. 34(1): 246-261 (2023) - [c93]Zhenyu Pan, Anshujit Sharma, Jerry Yao-Chieh Hu, Zhuo Liu, Ang Li, Han Liu, Michael C. Huang, Tong Geng:
Ising-Traffic: Using Ising Machine Learning to Predict Traffic Congestion under Uncertainty. AAAI 2023: 9354-9363 - [c92]Anthony D'Onofrio Jr., Amir Hossain, Lesther Santana, Naseem Machlovi, Samuel Stein, Jinwei Liu, Ang Li, Ying Mao:
Distributed Quantum Learning with co-Management in a Multi-tenant Quantum System. IEEE Big Data 2023: 221-228 - [c91]Zhuo Liu, Yunan Yang, Zhenyu Pan, Anshujit Sharma, Amit Hasan, Caiwen Ding, Ang Li, Michael C. Huang, Tong Geng:
Ising-CF: A Pathbreaking Collaborative Filtering Method Through Efficient Ising Machine Learning. DAC 2023: 1-6 - [c90]Yixuan Luo, Cheng Tan, Nicolas Bohm Agostini, Ang Li, Antonino Tumeo, Nirav Dave, Tong Geng:
ML-CGRA: An Integrated Compilation Framework to Enable Efficient Machine Learning Acceleration on CGRAs. DAC 2023: 1-6 - [c89]Yan-Hao Chen, Yuwei Jin, Fei Hua, Ari B. Hayes, Ang Li, Yunong Shi, Eddy Z. Zhang:
A Pulse Generation Framework with Augmented Program-aware Basis Gates and Criticality Analysis. HPCA 2023: 773-786 - [c88]Xinyi Li, Ignacio Laguna, Bo Fang, Katarzyna Swirydowicz, Ang Li, Ganesh Gopalakrishnan:
Design and Evaluation of GPU-FPX: A Low-Overhead tool for Floating-Point Exception Detection in NVIDIA GPUs. HPDC 2023: 59-71 - [c87]Hongwu Peng, Shaoyi Huang, Tong Zhou, Yukui Luo, Chenghong Wang, Zigeng Wang, Jiahui Zhao, Xi Xie, Ang Li, Tony Geng, Kaleel Mahmood, Wujie Wen, Xiaolin Xu, Caiwen Ding:
AutoReP: Automatic ReLU Replacement for Fast Private Network Inference. ICCV 2023: 5155-5165 - [c86]Jou-An Chen, Hsin-Hsuan Sung, Xipeng Shen, Sutanay Choudhury, Ang Li:
BitGNN: Unleashing the Performance Potential of Binary Graph Neural Networks on GPUs. ICS 2023: 264-276 - [c85]Anqi Guo, Yuchen Hao, Chunshu Wu, Pouya Haghi, Zhenyu Pan, Min Si, Dingwen Tao, Ang Li, Martin C. Herbordt, Tong Geng:
Software-Hardware Co-design of Heterogeneous SmartNIC System for Recommendation Models Inference and Training. ICS 2023: 336-347 - [c84]Pouya Haghi, William Krska, Cheng Tan, Tong Geng, Po-Hao Chen, Connor Greenwood, Anqi Guo, Thomas M. Hines, Chunshu Wu, Ang Li, Anthony Skjellum, Martin C. Herbordt:
FLASH: FPGA-Accelerated Smart Switches with GCN Case Study. ICS 2023: 450-462 - [c83]Samuel Alexander Stein, Nathan Wiebe, Yufei Ding, James A. Ang, Ang Li:
Q-BEEP: Quantum Bayesian Error Mitigation Employing Poisson Modeling over the Hamming Spectrum. ISCA 2023: 8:1-8:13 - [c82]Anbang Wu, Yufei Ding, Ang Li:
QuComm: Optimizing Collective Communication for Distributed Quantum Computing. MICRO 2023: 479-493 - [c81]Samuel Alexander Stein, Sara Sussman, Teague Tomesh, Charles Guinn, Esin Tureci, Sophia Fuhui Lin, Wei Tang, James A. Ang, Srivatsan Chakram, Ang Li, Margaret Martonosi, Fred Chong, Andrew A. Houck, Isaac L. Chuang, Michael Austin DeMarco:
HetArch: Heterogeneous Microarchitectures for Superconducting Quantum Systems. MICRO 2023: 539-554 - [c80]Yuke Wang, Boyuan Feng, Zheng Wang, Tong Geng, Kevin J. Barker, Ang Li, Yufei Ding:
MGG: Accelerating Graph Neural Networks with Fine-Grained Intra-Kernel Communication-Computation Pipelining on Multi-GPU Platforms. OSDI 2023: 779-795 - [c79]Jinyang Li, Zhepeng Wang, Zhirui Hu, Prasanna Date, Ang Li, Weiwen Jiang:
A Novel Spatial-Temporal Variational Quantum Circuit to Enable Deep Learning on NISQ Devices. QCE 2023: 272-282 - [c78]Tommy Nguyen, Yue Shi, Samuel Alexander Stein, Tim Stavenger, Marvin Warner, Martin Roetteler, Torsten Hoefler, Ang Li:
A Reference Implementation for a Quantum Message Passing Interface. QCE 2023: 292-293 - [c77]Chunshu Wu, Tong Geng, Anqi Guo, Sahan Bandara, Pouya Haghi, Chuan Liu, Ang Li, Martin C. Herbordt:
FASDA: An FPGA-Aided, Scalable and Distributed Accelerator for Range-Limited Molecular Dynamics. SC 2023: 98:1-98:14 - [c76]Yue Shi, Tommy Nguyen, Samuel Alexander Stein, Tim Stavenger, Marvin Warner, Martin Roetteler, Torsten Hoefler, Ang Li:
A Reference Implementation for a Quantum Message Passing Interface. SC Workshops 2023: 1420-1425 - [c75]Boyuan Zhang, Bo Fang, Qiang Guan, Ang Li, Dingwen Tao:
MEMQSim: Highly Memory-Efficient and Modularized Quantum State-Vector Simulation. SC Workshops 2023: 1452-1453 - [c74]Meng Wang, Fei Hua, Chenxu Liu, Nicholas P. Bauman, Karol Kowalski, Daniel Claudino, Travis S. Humble, Prashant J. Nair, Ang Li:
Enabling Scalable VQE Simulation on Leading HPC Systems. SC Workshops 2023: 1460-1467 - [c73]Fei Hua, Meng Wang, Gushu Li, Bo Peng, Chenxu Liu, Muqing Zheng, Samuel Alexander Stein, Yufei Ding, Eddy Z. Zhang, Travis S. Humble, Ang Li:
QASMTrans: A QASM Quantum Transpiler Framework for NISQ Devices. SC Workshops 2023: 1468-1477 - [i42]Xiaodong Zhao, YiXuan Luo, Juejing Liu, Wenjun Liu, Kevin M. Rosso, Xiaofeng Guo, Tong Geng, Ang Li, Xin Zhang:
Machine Learning Automated Approach for Enormous Synchrotron X-Ray Diffraction Data Interpretation. CoRR abs/2303.10881 (2023) - [i41]Jou-An Chen, Hsin-Hsuan Sung, Xipeng Shen, Sutanay Choudhury, Ang Li:
BitGNN: Unleashing the Performance Potential of Binary Graph Neural Networks on GPUs. CoRR abs/2305.02522 (2023) - [i40]Samuel Alexander Stein, Sara Sussman, Teague Tomesh, Charles Guinn, Esin Tureci, Sophia Fuhui Lin, Wei Tang, James A. Ang, Srivatsan Chakram, Ang Li, Margaret Martonosi, Frederic T. Chong, Andrew A. Houck, Isaac L. Chuang, Michael Austin DeMarco:
Microarchitectures for Heterogeneous Superconducting Quantum Computers. CoRR abs/2305.03243 (2023) - [i39]Jinyang Li, Zhepeng Wang, Zhirui Hu, Prasanna Date, Ang Li, Weiwen Jiang:
A Novel Spatial-Temporal Variational Quantum Circuit to Enable Deep Learning on NISQ Devices. CoRR abs/2307.09771 (2023) - [i38]Hongwu Peng, Shaoyi Huang, Tong Zhou, Yukui Luo, Chenghong Wang, Zigeng Wang, Jiahui Zhao, Xi Xie, Ang Li, Tony Geng, Kaleel Mahmood, Wujie Wen, Xiaolin Xu, Caiwen Ding:
AutoReP: Automatic ReLU Replacement for Fast Private Network Inference. CoRR abs/2308.10134 (2023) - [i37]Boyuan Zhang, Bo Fang, Qiang Guan, Ang Li, Dingwen Tao:
MEMQSim: Highly Memory-Efficient and Modularized Quantum State-Vector Simulation. CoRR abs/2309.16979 (2023) - [i36]Hongwu Peng, Caiwen Ding, Tong Geng, Sutanay Choudhury, Kevin J. Barker, Ang Li:
Evaluating Emerging AI/ML Accelerators: IPU, RDU, and NVIDIA/AMD GPUs. CoRR abs/2311.04417 (2023) - [i35]Bo Fang, Xinyi Li, Harvey Dam, Cheng Tan, Siva Kumar Sastry Hari, Timothy Tsai, Ignacio Laguna, Dingwen Tao, Ganesh Gopalakrishnan, Prashant J. Nair, Kevin J. Barker, Ang Li:
MPGemmFI: A Fault Injection Technique for Mixed Precision GEMM in ML Applications. CoRR abs/2311.05782 (2023) - [i34]Anthony D'Onofrio Jr., Amir Hossain, Lesther Santana, Naseem Machlovi, Samuel Stein, Jinwei Liu, Ang Li, Ying Mao:
Distributed Quantum Learning with co-Management in a Multi-tenant Quantum System. CoRR abs/2312.08158 (2023) - 2022
- [c72]Bo Fang, M. Yusuf Özkaya, Ang Li, Ümit V. Çatalyürek, Sriram Krishnamoorthy:
Efficient Hierarchical State Vector Simulation of Quantum Circuits via Acyclic Graph Partitioning. CLUSTER 2022: 289-300 - [c71]Hongwu Peng, Shaoyi Huang, Shiyang Chen, Bingbing Li, Tong Geng, Ang Li, Weiwen Jiang, Wujie Wen, Jinbo Bi, Hang Liu, Caiwen Ding:
A length adaptive algorithm-hardware co-design of transformer on FPGA through sparse attention and dynamic pipelining. DAC 2022: 1135-1140 - [c70]Anqi Guo, Tong Geng, Yongan Zhang, Pouya Haghi, Chunshu Wu, Cheng Tan, Yingyan Lin, Ang Li, Martin C. Herbordt:
FCsN: A FPGA-Centric SmartNIC Framework for Neural Networks. FCCM 2022: 1-2 - [c69]Anqi Guo, Tong Geng, Yongan Zhang, Pouya Haghi, Chunshu Wu, Cheng Tan, Yingyan Lin, Ang Li, Martin C. Herbordt:
A Framework for Neural Network Inference on FPGA-Centric SmartNICs. FPL 2022: 1-8 - [c68]Chengming Zhang, Tong Geng, Anqi Guo, Jiannan Tian, Martin C. Herbordt, Ang Li, Dingwen Tao:
H-GCN: A Graph Convolutional Network Accelerator on Versal ACAP Architecture. FPL 2022: 200-208 - [c67]Bo Fang, Siva Kumar Sastry Hari, Timothy Tsai, Xinyi Li, Ganesh Gopalakrishnan, Ignacio Laguna, Kevin J. Barker, Ang Li:
Towards Precision-Aware Fault Tolerance Approaches for Mixed-Precision Applications. FTXS@SC 2022: 47-52 - [c66]Antonino Tumeo, Nicolas Bohm Agostini, Serena Curzel, Ankur Limaye, Cheng Tan, Vinay Amatya, Marco Minutoli, Vito Giovanni Castellana, Ang Li, Joseph B. Manzano:
SO(DA)2: End-to-end Generation of Specialized Reconfigurable Architectures (Invited Talk). PARMA-DITAM@HiPEAC 2022: 1:1-1:15 - [c65]Cheng Tan, Nicolas Bohm Agostini, Tong Geng, Chenhao Xie, Jiajia Li, Ang Li, Kevin J. Barker, Antonino Tumeo:
DRIPS: Dynamic Rebalancing of Pipelined Streaming Applications on CGRAs. HPCA 2022: 304-316 - [c64]Haoran You, Tong Geng, Yongan Zhang, Ang Li, Yingyan Lin:
GCoD: Graph Convolutional Network Acceleration via Dedicated Algorithm and Accelerator Co-Design. HPCA 2022: 460-474 - [c63]Cheng Tan, Thierry Tambe, Jeff Jun Zhang, Bo Fang, Tong Geng, Gu-Yeon Wei, David Brooks, Antonino Tumeo, Ganesh Gopalakrishnan, Ang Li:
ASAP: automatic synthesis of area-efficient and precision-aware CGRAs. ICS 2022: 4:1-4:13 - [c62]Chengming Zhang, Sian Jin, Tong Geng, Jiannan Tian, Ang Li, Dingwen Tao:
CEAZ: accelerating parallel I/O via hardware-algorithm co-designed adaptive lossy compression. ICS 2022: 12:1-12:13 - [c61]Samuel Alexander Stein, Ying Mao, James A. Ang, Ang Li:
QuCNN: A Quantum Convolutional Neural Network with Entanglement Based Backpropagation. SEC 2022: 368-374 - [c60]Betis Baheri, Qiang Guan, Vipin Chaudhary, Ang Li:
Quantum Noise in the Flow of Time: A Temporal Study of the Noise in Quantum Computers. IOLTS 2022: 1-5 - [c59]Betis Baheri, Jacob Tronge, Bo Fang, Ang Li, Vipin Chaudhary, Qiang Guan:
MARS: Malleable Actor-Critic Reinforcement Learning Scheduler. IPCCC 2022: 217-226 - [c58]Jou-An Chen, Hsin-Hsuan Sung, Xipeng Shen, Nathan R. Tallent, Kevin J. Barker, Ang Li:
Bit-GraphBLAS: Bit-Level Optimizations of Matrix-Centric Graph Processing on GPU. IPDPS 2022: 515-525 - [c57]Samuel Alexander Stein, Nathan Wiebe, James A. Ang, Ang Li:
Improving Variational Quantum Algorithms performance through Weighted Quantum Ensembles. IPDPS Workshops 2022: 616-617 - [c56]Samuel Alexander Stein, Nathan Wiebe, James A. Ang, Ang Li:
Benchmarking Quantum Processor Performance through Quantum Distance Metrics Over An Algorithm Suite. IPDPS Workshops 2022: 618-624 - [c55]Samuel Alexander Stein, Nathan Wiebe, Yufei Ding, Bo Peng, Karol Kowalski, Nathan A. Baker, James A. Ang, Ang Li:
EQC: ensembled quantum computing for variational quantum algorithms. ISCA 2022: 59-71 - [c54]Samuel Alexander Stein, Betis Baheri, Daniel Chen, Ying Mao, Qiang Guan, Ang Li, Shuai Xu, Caiwen Ding:
QuClassi: A Hybrid Deep Neural Network Architecture based on Quantum State Fidelity. MLSys 2022 - [c53]Cheng Wan, Youjie Li, Ang Li, Nam Sung Kim, Yingyan Lin:
BNS-GCN: Efficient Full-Graph Training of Graph Convolutional Networks with Partition-Parallelism and Random Boundary Node Sampling. MLSys 2022 - [i33]Jou-An Chen, Hsin-Hsuan Sung, Nathan R. Tallent, Kevin J. Barker, Xipeng Shen, Ang Li:
Bit-GraphBLAS: Bit-Level Optimizations of Matrix-Centric Graph Processing on GPU. CoRR abs/2201.08560 (2022) - [i32]Tong Geng, Chunshu Wu, Yongan Zhang, Cheng Tan, Chenhao Xie, Haoran You, Martin C. Herbordt, Yingyan Lin, Ang Li:
I-GCN: A Graph Convolutional Network Accelerator with Runtime Locality Enhancement through Islandization. CoRR abs/2203.03606 (2022) - [i31]Cheng Wan, Youjie Li, Ang Li, Nam Sung Kim, Yingyan Lin:
BNS-GCN: Efficient Full-Graph Training of Graph Convolutional Networks with Partition-Parallelism and Random Boundary Node Sampling. CoRR abs/2203.10983 (2022) - [i30]Bo Fang, M. Yusuf Özkaya, Ang Li, Ümit V. Çatalyürek, Sriram Krishnamoorthy:
Efficient Hierarchical State Vector Simulation of Quantum Circuits via Acyclic Graph Partitioning. CoRR abs/2205.06973 (2022) - [i29]Wei Sun, Ang Li, Tong Geng, Sander Stuijk, Henk Corporaal:
Dissecting Tensor Cores via Microbenchmarks: Latency, Throughput and Numerical Behaviors. CoRR abs/2206.02874 (2022) - [i28]Yanfei Li, Tong Geng, Samuel Alexander Stein, Ang Li, Huimin Yu:
GAAF: Searching Activation Functions for Binary Neural Networks through Genetic Algorithm. CoRR abs/2206.03291 (2022) - [i27]Yuke Wang, Boyuan Feng, Zheng Wang, Tong Geng, Ang Li, Yufei Ding:
GMI-DRL: Empowering Multi-GPU Deep Reinforcement Learning with GPU Spatial Multiplexing. CoRR abs/2206.08482 (2022) - [i26]Chengming Zhang, Tong Geng, Anqi Guo, Jiannan Tian, Martin C. Herbordt, Ang Li, Dingwen Tao:
H-GCN: A Graph Convolutional Network Accelerator on Versal ACAP Architecture. CoRR abs/2206.13734 (2022) - [i25]Fei Hua, Yuwei Jin, Ang Li, Yan-Hao Chen, Chi Zhang, Ari B. Hayes, Hang Gao, Eddy Z. Zhang:
A Synergistic Compilation Workflow for Tackling Crosstalk in Quantum Machines. CoRR abs/2207.05751 (2022) - [i24]Hongwu Peng, Shaoyi Huang, Shiyang Chen, Bingbing Li, Tong Geng, Ang Li, Weiwen Jiang, Wujie Wen, Jinbo Bi, Hang Liu, Caiwen Ding:
A Length Adaptive Algorithm-Hardware Co-design of Transformer on FPGA Through Sparse Attention and Dynamic Pipelining. CoRR abs/2208.03646 (2022) - [i23]Anbang Wu, Yufei Ding, Ang Li:
CollComm: Enabling Efficient Collective Quantum Communication Based on EPR buffering. CoRR abs/2208.06724 (2022) - [i22]Yuke Wang, Boyuan Feng, Zheng Wang, Tong Geng, Kevin J. Barker, Ang Li, Yufei Ding:
Empowering GNNs with Fine-grained Communication-Computation Pipelining on Multi-GPU Platforms. CoRR abs/2209.06800 (2022) - [i21]Jieyang Chen, Chenhao Xie, Jesun Sahariar Firoz, Jiajia Li, Shuaiwen Leon Song, Kevin J. Barker, Mark Raugas, Ang Li:
MSREP: A Fast yet Light Sparse Matrix Framework for Multi-GPU Systems. CoRR abs/2209.07552 (2022) - [i20]Samuel Alexander Stein, Ying Mao, James A. Ang, Ang Li:
QuCNN : A Quantum Convolutional Neural Network with Entanglement Based Backpropagation. CoRR abs/2210.05443 (2022) - [i19]Hatem Helal, Jesun Firoz, Jenna A. Bilbrey, Mario Michael Krell, Tom Murray, Ang Li, Sotiris S. Xantheas, Sutanay Choudhury:
Extreme Acceleration of Graph Neural Network-based Prediction Models for Quantum Chemistry. CoRR abs/2211.13853 (2022) - 2021
- [j9]Yanfei Li, Tong Geng, Ang Li, Huimin Yu:
BCNN: Binary complex neural network. Microprocess. Microsystems 87: 104359 (2021) - [j8]Tong Geng, Ang Li, Tianqi Wang, Chunshu Wu, Yanfei Li, Runbin Shi, Wei Wu, Martin C. Herbordt:
O3BNN-R: An Out-of-Order Architecture for High-Performance and Regularized BNN Inference. IEEE Trans. Parallel Distributed Syst. 32(1): 199-213 (2021) - [j7]Ang Li, Simon Su:
Accelerating Binarized Neural Networks via Bit-Tensor-Cores in Turing GPUs. IEEE Trans. Parallel Distributed Syst. 32(7): 1878-1891 (2021) - [j6]Cheng Tan, Chenhao Xie, Tong Geng, Andres Marquez, Antonino Tumeo, Kevin J. Barker, Ang Li:
ARENA: Asynchronous Reconfigurable Accelerator Ring to Enable Data-Centric Parallel Computing. IEEE Trans. Parallel Distributed Syst. 32(12): 2880-2892 (2021) - [c52]Hongwu Peng, Shanglin Zhou, Scott Weitze, Jiaxin Li, Sahidul Islam, Tong Geng, Ang Li, Wei Zhang, Minghu Song, Mimi Xie, Hang Liu, Caiwen Ding:
Binary Complex Neural Network Acceleration on FPGA : (Invited Paper). ASAP 2021: 85-92 - [c51]Cheng Tan, Nicolas Bohm Agostini, Jeff Zhang, Marco Minutoli, Vito Giovanni Castellana, Chenhao Xie, Tong Geng, Ang Li, Kevin J. Barker, Antonino Tumeo:
OpenCGRA: Democratizing Coarse-Grained Reconfigurable Arrays. ASAP 2021: 149-155 - [c50]Ganesh Gopalakrishnan, Ignacio Laguna, Ang Li, Pavel Panchekha, Cindy Rubio-González, Zachary Tatlock:
Guarding Numerics Amidst Rising Heterogeneity. Correctness@SC 2021: 9-15 - [c49]Cheng Tan, Chenhao Xie, Ang Li, Kevin J. Barker, Antonino Tumeo:
AURORA: Automated Refinement of Coarse-Grained Reconfigurable Accelerators. DATE 2021: 1388-1393 - [c48]Betis Baheri, Daniel Chen, Bo Fang, Samuel Alexander Stein, Vipin Chaudhary, Ying Mao, Shuai Xu, Ang Li, Qiang Guan:
TQEA: Temporal Quantum Error Analysis. DSN (Supplements) 2021: 65-67 - [c47]Tong Geng, Chunshu Wu, Cheng Tan, Chenhao Xie, Anqi Guo, Pouya Haghi, Sarah Yuan He, Jiajia Li, Martin C. Herbordt, Ang Li:
A Survey: Handling Irregularities in Neural Network Acceleration with FPGAs. HPEC 2021: 1-8 - [c46]Daniel Manu, Yi Sheng, Junhuan Yang, Jieren Deng, Tong Geng, Ang Li, Caiwen Ding, Weiwen Jiang, Lei Yang:
FL-DISCO: Federated Generative Adversarial Network for Graph-based Molecule Drug Discovery: Special Session Paper. ICCAD 2021: 1-7 - [c45]Hongwu Peng, Shiyang Chen, Zhepeng Wang, Junhuan Yang, Scott A. Weitze, Tong Geng, Ang Li, Jinbo Bi, Minghu Song, Weiwen Jiang, Hang Liu, Caiwen Ding:
Optimizing FPGA-based Accelerator Design for Large-Scale Molecular Similarity Search (Special Session Paper). ICCAD 2021: 1-7 - [c44]Yongan Zhang, Haoran You, Yonggan Fu, Tong Geng, Ang Li, Yingyan Lin:
G-CoS: GNN-Accelerator Co-Search Towards Both Better Accuracy and Efficiency. ICCAD 2021: 1-9 - [c43]Cheng Tan, Tong Geng, Chenhao Xie, Nicolas Bohm Agostini, Jiajia Li, Ang Li, Kevin J. Barker, Antonino Tumeo:
DynPaC: Coarse-Grained, Dynamic, and Partially Reconfigurable Array for Streaming Applications. ICCD 2021: 33-40 - [c42]Chenhao Xie, Jieyang Chen, Jesun Firoz, Jiajia Li, Shuaiwen Leon Song, Kevin J. Barker, Mark Raugas, Ang Li:
Fast and Scalable Sparse Triangular Solver for Multi-GPU Based HPC Architectures. ICPP 2021: 53:1-53:11 - [c41]Samuel Alexander Stein, Ryan L'Abbate, Wenrui Mu, Yue Liu, Betis Baheri, Ying Mao, Qiang Guan, Ang Li, Bo Fang:
A Hybrid System for Learning Classical Data in Quantum States. IPCCC 2021: 1-7 - [c40]Hongwu Peng, Shaoyi Huang, Tong Geng, Ang Li, Weiwen Jiang, Hang Liu, Shusen Wang, Caiwen Ding:
Accelerating Transformer-based Deep Learning Models on FPGAs using Column Balanced Block Pruning. ISQED 2021: 142-148 - [c39]Tong Geng, Chunshu Wu, Yongan Zhang, Cheng Tan, Chenhao Xie, Haoran You, Martin C. Herbordt, Yingyan Lin, Ang Li:
I-GCN: A Graph Convolutional Network Accelerator with Runtime Locality Enhancement through Islandization. MICRO 2021: 1051-1063 - [c38]Samuel Alexander Stein, Betis Baheri, Daniel Chen, Ying Mao, Qiang Guan, Ang Li, Bo Fang, Shuai Xu:
QuGAN: A Quantum State Fidelity based Generative Adversarial Network. QCE 2021: 71-81 - [c37]Boyuan Feng, Yuke Wang, Tong Geng, Ang Li, Yufei Ding:
APNN-TC: accelerating arbitrary precision neural networks on ampere GPU tensor cores. SC 2021: 37 - [c36]Ang Li, Bo Fang, Christopher E. Granade, Guen Prawiroatmodjo, Bettina Heim, Martin Roetteler, Sriram Krishnamoorthy:
SV-sim: scalable PGAS-based state vector simulation of quantum circuits. SC 2021: 97 - [i18]Yanfei Li, Tong Geng, Ang Li, Huimin Yu:
BCNN: Binary Complex Neural Network. CoRR abs/2104.10044 (2021) - [i17]Boyuan Feng, Yuke Wang, Tong Geng, Ang Li, Yufei Ding:
APNN-TC: Accelerating Arbitrary Precision Neural Networks on Ampere GPU Tensor Cores. CoRR abs/2106.12169 (2021) - [i16]Chengming Zhang, Sian Jin, Tong Geng, Jiannan Tian, Ang Li, Dingwen Tao:
CEAZ: Accelerating Parallel I/O via Hardware-Algorithm Co-Design of Efficient and Adaptive Lossy Compression. CoRR abs/2106.13306 (2021) - [i15]Hongwu Peng, Shanglin Zhou, Scott Weitze, Jiaxin Li, Sahidul Islam, Tong Geng, Ang Li, Wei Zhang, Minghu Song, Mimi Xie, Hang Liu, Caiwen Ding:
Binary Complex Neural Network Acceleration on FPGA. CoRR abs/2108.04811 (2021) - [i14]Hongwu Peng, Shiyang Chen, Zhepeng Wang, Junhuan Yang, Scott A. Weitze, Tong Geng, Ang Li, Jinbo Bi, Minghu Song, Weiwen Jiang, Hang Liu, Caiwen Ding:
Optimizing FPGA-based Accelerator Design for Large-Scale Molecular Similarity Search. CoRR abs/2109.06355 (2021) - [i13]Yongan Zhang, Haoran You, Yonggan Fu, Tong Geng, Ang Li, Yingyan Lin:
G-CoS: GNN-Accelerator Co-Search Towards Both Better Accuracy and Efficiency. CoRR abs/2109.08983 (2021) - [i12]Haoran You, Tong Geng, Yongan Zhang, Ang Li, Yingyan Lin:
GCoD: Graph Convolutional Network Acceleration via Dedicated Algorithm and Accelerator Co-Design. CoRR abs/2112.11594 (2021) - 2020
- [j5]Tianqi Wang, Tong Geng, Ang Li, Xi Jin, Martin C. Herbordt:
FPDeep: Scalable Acceleration of CNN Training on Deeply-Pipelined FPGA Clusters. IEEE Trans. Computers 69(8): 1143-1158 (2020) - [j4]Ang Li, Shuaiwen Leon Song, Jieyang Chen, Jiajia Li, Xu Liu, Nathan R. Tallent, Kevin J. Barker:
Evaluating Modern GPU Interconnect: PCIe, NVLink, NV-SLI, NVSwitch and GPUDirect. IEEE Trans. Parallel Distributed Syst. 31(1): 94-110 (2020) - [c35]Pengfei Zou, Ang Li, Kevin J. Barker, Rong Ge:
Indicator-Directed Dynamic Power Management for Iterative Workloads on GPU-Accelerated Systems. CCGRID 2020: 559-568 - [c34]Jesun Sahariar Firoz, Ang Li, Jiajia Li, Kevin J. Barker:
On the Feasibility of Using Reduced-Precision Tensor Core Operations for Graph Analytics. HPEC 2020: 1-7 - [c33]Tong Geng, Chunshu Wu, Cheng Tan, Bo Fang, Ang Li, Martin C. Herbordt:
CQNN: a CGRA-based QNN Framework. HPEC 2020: 1-7 - [c32]Cheng Tan, Chenhao Xie, Ang Li, Kevin J. Barker, Antonino Tumeo:
OpenCGRA: An Open-Source Unified Framework for Modeling, Testing, and Evaluating CGRAs. ICCD 2020: 381-388 - [c31]Pengfei Zou, Ang Li, Kevin J. Barker, Rong Ge:
Detecting Anomalous Computation with RNNs on GPU-Accelerated HPC Machines. ICPP 2020: 52:1-52:11 - [c30]Runbin Shi, Peiyan Dong, Tong Geng, Yuhao Ding, Xiaolong Ma, Hayden Kwok-Hay So, Martin C. Herbordt, Ang Li, Yanzhi Wang:
CSB-RNN: a faster-than-realtime RNN acceleration framework with compressed structured blocks. ICS 2020: 24:1-24:12 - [c29]Jiajia Li, Mahesh Lakshminarasimhan, Xiaolong Wu, Ang Li, Catherine Olschanowsky, Kevin J. Barker:
A Sparse Tensor Benchmark Suite for CPUs and GPUs. IISWC 2020: 193-204 - [c28]Tong Geng, Ang Li, Runbin Shi, Chunshu Wu, Tianqi Wang, Yanfei Li, Pouya Haghi, Antonino Tumeo, Shuai Che, Steven K. Reinhardt, Martin C. Herbordt:
AWB-GCN: A Graph Convolutional Network Accelerator with Runtime Workload Rebalancing. MICRO 2020: 922-936 - [c27]Jiajia Li, Mahesh Lakshminarasimhan, Xiaolong Wu, Ang Li, Catherine Olschanowsky, Kevin J. Barker:
A parallel sparse tensor benchmark suite on CPUs and GPUs. PPoPP 2020: 403-404 - [c26]Ang Li, Omer Subasi, Xiu Yang, Sriram Krishnamoorthy:
Density matrix quantum circuit simulation via the BSP machine on modern GPU clusters. SC 2020: 13 - [i11]Jiajia Li, Mahesh Lakshminarasimhan, Xiaolong Wu, Ang Li, Catherine Olschanowsky, Kevin J. Barker:
A Parallel Sparse Tensor Benchmark Suite on CPUs and GPUs. CoRR abs/2001.00660 (2020) - [i10]Runbin Shi, Peiyan Dong, Tong Geng, Yuhao Ding, Xiaolong Ma, Hayden Kwok-Hay So, Martin C. Herbordt, Ang Li, Yanzhi Wang:
CSB-RNN: A Faster-than-Realtime RNN Acceleration Framework with Compressed Structured Blocks. CoRR abs/2005.05758 (2020) - [i9]Ang Li, Simon Su:
Accelerating Binarized Neural Networks via Bit-Tensor-Cores in Turing GPUs. CoRR abs/2006.16578 (2020) - [i8]Cheng Tan, Chenhao Xie, Andres Marquez, Antonino Tumeo, Kevin J. Barker, Ang Li:
ARENA: Asynchronous Reconfigurable Accelerator Ring to Enable Data-Centric Parallel Computing. CoRR abs/2011.04931 (2020) - [i7]Chenhao Xie, Jieyang Chen, Jesun Sahariar Firoz, Jiajia Li, Shuaiwen Leon Song, Kevin J. Barker, Mark Raugas, Ang Li:
Fast and Scalable Sparse Triangular Solver for Multi-GPU Based HPC Architectures. CoRR abs/2012.06959 (2020)
2010 – 2019
- 2019
- [j3]Jiajia Li, Yuchen Ma, Xiaolong Wu, Ang Li, Kevin J. Barker:
PASTA: a parallel sparse tensor algorithm benchmark suite. CCF Trans. High Perform. Comput. 1(2): 111-130 (2019) - [c25]Tong Geng, Tianqi Wang, Chunshu Wu, Chen Yang, Shuaiwen Leon Song, Ang Li, Martin C. Herbordt:
LP-BNN: Ultra-low-Latency BNN Inference with Layer Parallelism. ASAP 2019: 9-16 - [c24]Chenhao Xie, Xingyao Zhang, Ang Li, Xin Fu, Shuaiwen Song:
PIM-VR: Erasing Motion Anomalies In Highly-Interactive Virtual Reality World with Customized Memory Cube. HPCA 2019: 609-622 - [c23]Tong Geng, Tianqi Wang, Chunshu Wu, Chen Yang, Wei Wu, Ang Li, Martin C. Herbordt:
O3BNN: an out-of-order architecture for high-performance binarized neural network inference with fine-grained pruning. ICS 2019: 461-472 - [c22]Pengfei Zou, Ang Li, Kevin J. Barker, Rong Ge:
Fingerprinting Anomalous Computation with RNN for GPU-accelerated HPC Machines. IISWC 2019: 253-256 - [c21]Ang Li, Tong Geng, Tianqi Wang, Martin C. Herbordt, Shuaiwen Leon Song, Kevin J. Barker:
BSTC: a novel binarized-soft-tensor-core design for accelerating bit-based approximated neural nets. SC 2019: 38:1-38:30 - [i6]Tong Geng, Tianqi Wang, Ang Li, Xi Jin, Martin C. Herbordt:
A Scalable Framework for Acceleration of CNN Training on Deeply-Pipelined FPGA Clusters with Weight and Workload Balancing. CoRR abs/1901.01007 (2019) - [i5]Jiajia Li, Yuchen Ma, Xiaolong Wu, Ang Li, Kevin J. Barker:
PASTA: A Parallel Sparse Tensor Algorithm Benchmark Suite. CoRR abs/1902.03317 (2019) - [i4]Ang Li, Shuaiwen Leon Song, Jieyang Chen, Jiajia Li, Xu Liu, Nathan R. Tallent, Kevin J. Barker:
Evaluating Modern GPU Interconnect: PCIe, NVLink, NV-SLI, NVSwitch and GPUDirect. CoRR abs/1903.04611 (2019) - [i3]Tong Geng, Ang Li, Tianqi Wang, Chunshu Wu, Yanfei Li, Antonino Tumeo, Martin C. Herbordt:
UWB-GCN: Hardware Acceleration of Graph-Convolution-Network through Runtime Workload Rebalancing. CoRR abs/1908.10834 (2019) - 2018
- [c20]Du Shen, Shuaiwen Leon Song, Ang Li, Xu Liu:
CUDAAdvisor: LLVM-based runtime profiling for modern GPUs. CGO 2018: 214-227 - [c19]Ang Li, Weifeng Liu, Linnan Wang, Kevin J. Barker, Shuaiwen Leon Song:
Warp-Consolidation: A Novel Execution Model for GPUs. ICS 2018: 53-64 - [c18]Ang Li, Shuaiwen Leon Song, Jieyang Chen, Xu Liu, Nathan R. Tallent, Kevin J. Barker:
Tartan: Evaluating Modern GPU Interconnect via a Multi-GPU Benchmark Suite. IISWC 2018: 191-202 - [c17]Shuaiwen Leon Song, Natalie J. Bates, Ang Li:
Introduction to HPPAC 2018. IPDPS Workshops 2018: 674 - [c16]Linnan Wang, Jinmian Ye, Yiyang Zhao, Wei Wu, Ang Li, Shuaiwen Leon Song, Zenglin Xu, Tim Kraska:
Superneurons: dynamic GPU memory management for training deep neural networks. PPoPP 2018: 41-53 - [i2]Linnan Wang, Jinmian Ye, Yiyang Zhao, Wei Wu, Ang Li, Shuaiwen Leon Song, Zenglin Xu, Tim Kraska:
SuperNeurons: Dynamic GPU Memory Management for Training Deep Neural Networks. CoRR abs/1801.04380 (2018) - 2017
- [j2]Weifeng Liu, Ang Li, Jonathan D. Hogg, Iain S. Duff, Brian Vinter:
Fast synchronization-free algorithms for parallel sparse triangular solves with multiple right-hand sides. Concurr. Comput. Pract. Exp. 29(21) (2017) - [c15]Wenfeng Zhao, Ang Li, Yi Wang, Yajun Ha:
Analysis and design of energy-efficient data-dependent SRAM. ASICON 2017: 912-915 - [c14]Ang Li, Shuaiwen Leon Song, Weifeng Liu, Xu Liu, Akash Kumar, Henk Corporaal:
Locality-Aware CTA Clustering for Modern GPUs. ASPLOS 2017: 297-311 - [c13]Ang Li, Wenfeng Zhao, Shuaiwen Leon Song:
BVF: enabling significant on-chip power savings via bit-value-favor for throughput processors. MICRO 2017: 532-545 - [c12]Ang Li, Weifeng Liu, Mads Ruben Burgdorff Kristensen, Brian Vinter, Hao Wang, Kaixi Hou, Andres Marquez, Shuaiwen Leon Song:
Exploring and analyzing the real impact of modern on-package memory on HPC scientific kernels. SC 2017: 26 - 2016
- [c11]Ang Li, Shuaiwen Leon Song, Akash Kumar, Eddy Z. Zhang, Daniel G. Chavarría-Miranda, Henk Corporaal:
Critical points based register-concurrency autotuning for GPUs. DATE 2016: 1273-1278 - [c10]Weifeng Liu, Ang Li, Jonathan D. Hogg, Iain S. Duff, Brian Vinter:
A Synchronization-Free Algorithm for Parallel Sparse Triangular Solves. Euro-Par 2016: 617-630 - [c9]Ang Li, Shuaiwen Leon Song, Mark Wijtvliet, Akash Kumar, Henk Corporaal:
SFU-Driven Transparent Approximation Acceleration on GPUs. ICS 2016: 15:1-15:14 - [c8]Ang Li, Shuaiwen Leon Song, Eric Brugel, Akash Kumar, Daniel G. Chavarría-Miranda, Henk Corporaal:
X: A Comprehensive Analytic Model for Parallel Machines. IPDPS 2016: 242-252 - 2015
- [j1]Ang Li, Akash Kumar, Yajun Ha, Henk Corporaal:
Correlation ratio based volume image registration on GPUs. Microprocess. Microsystems 39(8): 998-1011 (2015) - [c7]Mohammad Shihabul Haque, Ang Li, Akash Kumar, Qingsong Wei:
Accelerating non-volatile/hybrid processor cache design space exploration for application specific embedded systems. ASP-DAC 2015: 435-440 - [c6]Runbin Shi, Zheng Xu, Zhihao Sun, Maurice Peemen, Ang Li, Henk Corporaal, Di Wu:
A Locality Aware Convolutional Neural Networks Accelerator. DSD 2015: 591-598 - [c5]Ang Li, Y. C. Tay, Akash Kumar, Henk Corporaal:
Transit: A Visual Analytical Model for Multithreaded Machines. HPDC 2015: 101-106 - [c4]Ang Li, Gert-Jan van den Braak, Henk Corporaal, Akash Kumar:
Fine-Grained Synchronizations and Dataflow Programming on GPUs. ICS 2015: 109-118 - [c3]Ang Li, Gert-Jan van den Braak, Akash Kumar, Henk Corporaal:
Adaptive and transparent cache bypassing for GPUs. SC 2015: 17:1-17:12 - [i1]Mohammad Shihabul Haque, Ang Li, Akash Kumar, Qingsong Wei:
Accelerating Non-volatile/Hybrid Processor Cache Design Space Exploration for Application Specific Embedded Systems. CoRR abs/1506.03193 (2015) - 2014
- [c2]Ang Li, Akash Kumar:
Accelerating Volume Image Registration through Correlation Ratio Based Methods on GPUs. DSD 2014: 82-89 - [c1]Qiang Wu, Yajun Ha, Akash Kumar, Shaobo Luo, Ang Li, Shihab Mohamed:
A heterogeneous platform with GPU and FPGA for power efficient high performance computing. ISIC 2014: 220-223
Coauthor Index
aka: Tony Tong Geng
aka: Shuaiwen Leon Song
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-12-26 00:47 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint