Evaluating Modern GPU Interconnect: PCIe, NVLink, NV-SLI, NVSwitch and GPUDirect A Li, SL Song, J Chen, J Li, X Liu, N Tallent, K Barker IEEE Transactions on Parallel and Distributed Systems, 2019 | 44 | 2019 |
Towards practical algorithm based fault tolerance in dense linear algebra P Wu, Q Guan, N DeBardeleben, S Blanchard, D Tao, X Liang, J Chen, ... Proceedings of the 25th ACM International Symposium on High-Performance …, 2016 | 32 | 2016 |
Tartan: evaluating modern GPU interconnect via a multi-GPU benchmark suite A Li, SL Song, J Chen, X Liu, N Tallent, K Barker 2018 IEEE International Symposium on Workload Characterization (IISWC), 191-202, 2018 | 25 | 2018 |
Online algorithm-based fault tolerance for Cholesky decomposition on heterogeneous systems with GPUs J Chen, X Liang, Z Chen 2016 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2016 | 25 | 2016 |
Silent data corruption resilient two-sided matrix factorizations P Wu, N DeBardeleben, Q Guan, S Blanchard, J Chen, D Tao, X Liang, ... Proceedings of the 22nd ACM SIGPLAN Symposium on Principles and Practice of …, 2017 | 23 | 2017 |
Correcting soft errors online in fast Fourier transform X Liang, J Chen, D Tao, S Li, P Wu, H Li, K Ouyang, Y Liu, F Song, ... Proceedings of the International Conference for High Performance Computing …, 2017 | 21 | 2017 |
GPU-ABFT: Optimizing algorithm-based fault tolerance for heterogeneous systems with GPUs J Chen, S Li, Z Chen 2016 IEEE International Conference on Networking, Architecture and Storage …, 2016 | 12 | 2016 |
TSM2: Optimizing Tall-and-Skinny Matrix-Matrix Multiplication on GPUs J Chen, N Xiong, X Liang, D Tao, S Li, K Ouyang, K Zhao, ... ACM International Conference on Supercomputing, 2019 | 11 | 2019 |
GreenLA: green linear algebra software for GPU-accelerated heterogeneous computing J Chen, L Tan, P Wu, D Tao, H Li, X Liang, S Li, R Ge, L Bhuyan, Z Chen SC'16: Proceedings of the International Conference for High Performance …, 2016 | 9 | 2016 |
Fault Tolerant One-sided Matrix Decompositions on Heterogeneous Systems with GPUs J Chen, H Li, S Li, X Liang, P Wu, D Tao, K Ouyang, Y Liu, K Zhao, ... High Performance Computing, Networking, Storage and Analysis, SC18 …, 2018 | 8 | 2018 |
Algorithm-based fault tolerance for convolutional neural networks K Zhao, S Di, S Li, X Liang, Y Zhai, J Chen, K Ouyang, F Cappello, ... IEEE Transactions on Parallel and Distributed Systems, 2020 | 5 | 2020 |
Beeflow: A workflow management system for in situ processing across HPC and cloud systems J Chen, Q Guan, Z Zhang, X Liang, L Vernon, A McPherson, LT Lo, ... 2018 IEEE 38th International Conference on Distributed Computing Systems …, 2018 | 4 | 2018 |
Cholesky factorization on heterogeneous CPU and GPU systems J Chen, Z Chen 2015 Ninth International Conference on Frontier of Computer Science and …, 2015 | 4 | 2015 |
Ft-isort: Efficient fault tolerance for introsort S Li, H Li, X Liang, J Chen, E Giem, K Ouyang, K Zhao, S Di, F Cappello, ... Proceedings of the International Conference for High Performance Computing …, 2019 | 3 | 2019 |
Towards Predicting the Impact of Roll-Forward Failure Recovery for HPC Applications B Fang, J Chen, M Ripeanu, S Krishnamoorthy 2019 49th Annual IEEE/IFIP International Conference on Dependable Systems …, 2019 | 2 | 2019 |
Build and execution environment (BEE): an encapsulated environment enabling HPC applications running everywhere J Chen, Q Guan, X Liang, P Bryant, P Grubel, A McPherson, LT Lo, ... 2018 IEEE International Conference on Big Data (Big Data), 1737-1746, 2018 | 2 | 2018 |
Optimizing multi-grid based reduction for efficient scientific data management X Liang, B Whitney, J Chen, L Wan, Q Liu, D Tao, J Kress, D Pugmire, ... arXiv preprint arXiv:2010.05872, 2020 | 1 | 2020 |
FTRANS: energy-efficient acceleration of transformers using FPGA B Li, S Pandey, H Fang, Y Lyv, J Li, J Chen, M Xie, L Wan, H Liu, C Ding Proceedings of the ACM/IEEE International Symposium on Low Power Electronics …, 2020 | 1 | 2020 |
Estimating Lossy Compressibility of Scientific Data Using Deep Neural Networks Z Qin, J Wang, Q Liu, J Chen, D Pugmire, N Podhorszki, S Klasky IEEE Letters of the Computer Society 3 (1), 5-8, 2020 | 1 | 2020 |
Extending the Publish/Subscribe Abstraction for High-Performance I/O and Data Management at Extreme Scale J Logan, M Ainsworth, C Atkins, J Chen, J Choi, J Gu, J Kress, ... Data Engineering, 35, 2020 | 1 | 2020 |