Ang Li
Title
Cited by
Year
Superneurons: Dynamic GPU memory management for training deep neural networks
L Wang, J Ye, Y Zhao, W Wu, A Li, SL Song, Z Xu, T Kraska
Proceedings of the 23rd ACM SIGPLAN symposium on principles and practice of …, 2018
Cited by: 105; Year: 2018
Adaptive and transparent cache bypassing for GPUs
A Li, GJ van den Braak, A Kumar, H Corporaal
Proceedings of the International Conference for High Performance Computing …, 2015
Cited by: 77; Year: 2015
A synchronization-free algorithm for parallel sparse triangular solves
W Liu, A Li, J Hogg, IS Duff, B Vinter
European Conference on Parallel Processing, 617-630, 2016
Cited by: 73; Year: 2016
Locality-aware CTA clustering for modern GPUs
A Li, SL Song, W Liu, X Liu, A Kumar, H Corporaal
ACM SIGARCH Computer Architecture News 45 (1), 297-311, 2017
Cited by: 55; Year: 2017
Evaluating modern GPU interconnect: PCIe, NVLink, NV-SLI, NVSwitch and GPUDirect
A Li, SL Song, J Chen, J Li, X Liu, NR Tallent, KJ Barker
IEEE Transactions on Parallel and Distributed Systems 31 (1), 94-110, 2019
Cited by: 49; Year: 2019
Exploring and analyzing the real impact of modern on-package memory on HPC scientific kernels
A Li, W Liu, MRB Kristensen, B Vinter, H Wang, K Hou, A Marquez, ...
Proceedings of the International Conference for High Performance Computing …, 2017
Cited by: 46; Year: 2017
Fine-grained synchronizations and dataflow programming on GPUs
A Li, GJ van den Braak, H Corporaal, A Kumar
Proceedings of the 29th ACM on International Conference on Supercomputing …, 2015
Cited by: 44; Year: 2015
Fast synchronization‐free algorithms for parallel sparse triangular solves with multiple right‐hand sides
W Liu, A Li, JD Hogg, IS Duff, B Vinter
Concurrency and Computation: Practice and Experience 29 (21), e4244, 2017
Cited by: 37; Year: 2017
Tartan: evaluating modern GPU interconnect via a multi-GPU benchmark suite
A Li, SL Song, J Chen, X Liu, N Tallent, K Barker
2018 IEEE International Symposium on Workload Characterization (IISWC), 191-202, 2018
Cited by: 28; Year: 2018
SFU-driven transparent approximation acceleration on GPUs
A Li, SL Song, M Wijtvliet, A Kumar, H Corporaal
Proceedings of the 2016 International Conference on Supercomputing, 1-14, 2016
Cited by: 27; Year: 2016
CUDAAdvisor: LLVM-based runtime profiling for modern GPUs
D Shen, SL Song, A Li, X Liu
Proceedings of the 2018 International Symposium on Code Generation and …, 2018
Cited by: 24; Year: 2018
A heterogeneous platform with GPU and FPGA for power efficient high performance computing
Q Wu, Y Ha, A Kumar, S Luo, A Li, S Mohamed
2014 International Symposium on Integrated Circuits (ISIC), 220-223, 2014
Cited by: 21; Year: 2014
Critical points based register-concurrency autotuning for GPUs
A Li, SL Song, A Kumar, EZ Zhang, D Chavarría-Miranda, H Corporaal
2016 Design, Automation & Test in Europe Conference & Exhibition (DATE …, 2016
Cited by: 19; Year: 2016
X: A comprehensive analytic model for parallel machines
A Li, SL Song, E Brugel, A Kumar, D Chavarria-Miranda, H Corporaal
2016 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2016
Cited by: 17; Year: 2016
LP-BNN: Ultra-low-latency BNN inference with layer parallelism
T Geng, T Wang, C Wu, C Yang, SL Song, A Li, M Herbordt
2019 IEEE 30th International Conference on Application-specific Systems …, 2019
Cited by: 16; Year: 2019
AWB-GCN: A graph convolutional network accelerator with runtime workload rebalancing
T Geng, A Li, R Shi, C Wu, T Wang, Y Li, P Haghi, A Tumeo, S Che, ...
2020 53rd Annual IEEE/ACM International Symposium on Microarchitecture …, 2020
Cited by: 15; Year: 2020
Transit: A visual analytical model for multithreaded machines
A Li, YC Tay, A Kumar, H Corporaal
Proceedings of the 24th International Symposium on High-Performance Parallel …, 2015
Cited by: 15; Year: 2015
O3BNN: An out-of-order architecture for high-performance binarized neural network inference with fine-grained pruning
T Geng, T Wang, C Wu, C Yang, W Wu, A Li, MC Herbordt
Proceedings of the ACM International Conference on Supercomputing, 461-472, 2019
Cited by: 14; Year: 2019
Warp-consolidation: A novel execution model for GPUs
A Li, W Liu, L Wang, K Barker, SL Song
Proceedings of the 2018 International Conference on Supercomputing, 53-64, 2018
Cited by: 14; Year: 2018
BSTC: A novel binarized-soft-tensor-core design for accelerating bit-based approximated neural nets
A Li, T Geng, T Wang, M Herbordt, SL Song, K Barker
Proceedings of the International Conference for High Performance Computing …, 2019
Cited by: 13; Year: 2019