Mark Gates
Cited by
Cited by
Accelerating numerical dense linear algebra calculations with GPUs
J Dongarra, M Gates, A Haidar, J Kurzak, P Luszczek, S Tomov, ...
Numerical computations with GPUs, 3-28, 2014
Towards high performance digital volume correlation
M Gates, J Lambros, MT Heath
Experimental Mechanics 51 (4), 491-507, 2011
Parallel programming models for dense linear algebra on heterogeneous systems
J Dongarra, M Abalenkovs, A Abdelfattah, M Gates, A Haidar, J Kurzak, ...
Supercomputing frontiers and innovations 2 (4), 67-86, 2015
Accelerating collaborative filtering using concepts from high performance computing
M Gates, H Anzt, J Kurzak, J Dongarra
2015 IEEE International Conference on Big Data (Big Data), 667-676, 2015
Implementation and tuning of batched Cholesky factorization and solve for NVIDIA GPUs
J Kurzak, H Anzt, M Gates, J Dongarra
IEEE Transactions on Parallel and Distributed Systems 27 (7), 2036-2048, 2015
Hpc programming on intel many-integrated-core hardware with magma port to xeon phi
J Dongarra, M Gates, A Haidar, Y Jia, K Kabir, P Luszczek, S Tomov
Scientific Programming 2015, 2015
High-performance hybrid CPU and GPU parallel algorithm for digital volume correlation
M Gates, MT Heath, J Lambros
The International Journal of High Performance Computing Applications 29 (1 …, 2015
clMAGMA: High performance dense linear algebra with OpenCL
C Cao, J Dongarra, P Du, M Gates, P Luszczek, S Tomov
Proceedings of the International Workshop on OpenCL 2013 & 2014, 1-9, 2014
The singular value decomposition: Anatomy of optimizing an algorithm for extreme scale
J Dongarra, M Gates, A Haidar, J Kurzak, P Luszczek, S Tomov, ...
SIAM review 60 (4), 808-865, 2018
With extreme computing, the rules have changed
J Dongarra, S Tomov, P Luszczek, J Kurzak, M Gates, I Yamazaki, H Anzt, ...
Computing in Science & Engineering 19 (3), 52-62, 2017
Preconditioned Krylov solvers on GPUs
H Anzt, M Gates, J Dongarra, M Kreutzer, G Wellein, M Köhler
Parallel Computing 68, 32-44, 2017
A survey of recent developments in parallel implementations of Gaussian elimination
S Donfack, J Dongarra, M Faverge, M Gates, J Kurzak, P Luszczek, ...
Concurrency and Computation: Practice and Experience 27 (5), 1292-1309, 2015
Slate: Design of a modern distributed and accelerated linear algebra library
M Gates, J Kurzak, A Charara, A YarKhan, J Dongarra
Proceedings of the International Conference for High Performance Computing …, 2019
A proposed API for batched basic linear algebra subprograms
J Dongarra, I Duff, M Gates, A Haidar, S Hammarling, NJ Higham, J Hogg, ...
Manchester Institute for Mathematical Sciences, University of Manchester, 2016
Subset refinement for digital volume correlation: numerical and experimental applications
M Gates, J Gonzalez, J Lambros, MT Heath
Experimental Mechanics 55 (1), 245-259, 2015
Portable HPC programming on Intel many-integrated-core hardware with MAGMA port to Xeon Phi
J Dongarra, M Gates, A Haidar, Y Jia, K Kabir, P Luszczek, S Tomov
International Conference on Parallel Processing and Applied Mathematics, 571-581, 2013
Heterogeneous streaming
CJ Newburn, G Bansal, M Wood, L Crivelli, J Planas, A Duran, P Souza, ...
2016 IEEE International Parallel and Distributed Processing Symposium …, 2016
Block-asynchronous multigrid smoothers for GPU-accelerated systems
H Anzt, S Tomov, M Gates, J Dongarra, V Heuveline
Procedia Computer Science 9, 7-16, 2012
A survey of numerical methods utilizing mixed precision arithmetic
A Abdelfattah, H Anzt, EG Boman, E Carson, T Cojean, J Dongarra, ...
arXiv preprint arXiv:2007.06674, 2020
C++ api for blas and lapack
M Gates, P Luszczek, A Abdelfattah, J Kurzak, J Dongarra, K Arturov, ...
SLATE Working Notes, 2017
The system can't perform the operation now. Try again later.
Articles 1–20