Follow
Kaixi Hou
Title
Cited by
Cited by
Year
Fast segmented sort on gpus
K Hou, W Liu, H Wang, W Feng
Proceedings of the International Conference on Supercomputing, 1-10, 2017
692017
Exploring and analyzing the real impact of modern on-package memory on HPC scientific kernels
A Li, W Liu, MRB Kristensen, B Vinter, H Wang, K Hou, A Marquez, ...
Proceedings of the International Conference for High Performance Computing …, 2017
572017
Sep-graph: finding shortest execution paths for graph processing under a hybrid framework on GPU
H Wang, L Geng, R Lee, K Hou, Y Zhang, X Zhang
Proceedings of the 24th Symposium on Principles and Practice of Parallel …, 2019
552019
Parallel Transposition of Sparse Data Structures
H Wang, W Liu, K Hou, W Feng
Proceedings of the 30th ACM International Conference on Supercomputing, 2016
512016
Auto-tuning strategies for parallelizing sparse matrix-vector (spmv) multiplication on multi-and many-core processors
K Hou, W Feng, S Che
2017 IEEE International Parallel and Distributed Processing Symposium …, 2017
472017
Aspas: A framework for automatic simdization of parallel sorting on x86-based many-core processors
K Hou, H Wang, W Feng
Proceedings of the 29th ACM on International Conference on Supercomputing …, 2015
422015
AAlign: A SIMD Framework for Pairwise Sequence Alignment on x86-based Multi-and Many-core Processors
K Hou, H Wang, W Feng
Proceedings of the 2016 IEEE International Parallel and Distributed …, 2016
312016
Gpu-unicache: Automatic code generation of spatial blocking for stencils on gpus
K Hou, H Wang, W Feng
Proceedings of the computing frontiers conference, 107-116, 2017
272017
Highly efficient compensation-based parallelism for wavefront loops on gpus
K Hou, H Wang, W Feng, JS Vetter, S Lee
2018 IEEE International parallel and distributed processing symposium (IPDPS …, 2018
222018
A framework for the automatic vectorization of parallel sort on x86-based processors
K Hou, H Wang, W Feng
IEEE Transactions on Parallel and Distributed Systems 29 (5), 958-972, 2018
202018
Delivering parallel programmability to the masses via the intel mic ecosystem: A case study
K Hou, H Wang, W Feng
2014 43rd International Conference on Parallel Processing Workshops, 273-282, 2014
202014
Robotomata: A Framework for Approximate Pattern Matching of Big Data on an Automata Processor
X Yu, K Hou, H Wang, W Feng
Big Data (Big Data), 2017 IEEE International Conference on, 2018
112018
The research of Levenberg-Marquardt algorithm in curve fittings on multiple GPUs
L Zhang, Y Zhao, K Hou
2011IEEE 10th International Conference on Trust, Security and Privacy in …, 2011
92011
pDindel: Accelerating indel detection on a multicore CPU architecture with SIMD
D Zhang, H Wang, K Hou, J Zhang, W Feng
2015 IEEE 5th International Conference on Computational Advances in Bio and …, 2015
82015
Segmented merge: A new primitive for parallel sparse matrix computations
H Ji, S Lu, K Hou, H Wang, Z Jin, W Liu, B Vinter
International Journal of Parallel Programming 49, 732-744, 2021
42021
A framework for fast and fair evaluation of automata processing hardware
X Yu, K Hou, H Wang, W Feng
2017 IEEE International Symposium on Workload Characterization (IISWC), 120-121, 2017
42017
Exploring performance portability for accelerators via high-level parallel patterns
K Hou
Virginia Tech, 2018
32018
Performance evaluation of the three-dimensional finite-difference time-domain (FDTD) method on Fermi architecture GPUs
K Hou, Y Zhao, J Huang, L Zhang
Algorithms and Architectures for Parallel Processing: 11th International …, 2011
32011
FDM-Seismology in OpenCL
K Hou
Personal Copy, 0
3
A Feasibility Study for MPI over HDFS
W Feng, D Zhang, J Zhang, K Hou, S Pumma, H Wang
2020 IEEE High Performance Extreme Computing Conference (HPEC), 1-7, 2020
2020
The system can't perform the operation now. Try again later.
Articles 1–20