Kokkos 3: Programming model extensions for the exascale era CR Trott, D Lebrun-Grandié, D Arndt, J Ciesko, V Dang, N Ellingwood, ... IEEE Transactions on Parallel and Distributed Systems 33 (4), 805-817, 2021 | 214 | 2021 |
TERAFLUX: Harnessing dataflow in next generation teradevices R Giorgi, RM Badia, F Bodin, A Cohen, P Evripidou, P Faraboschi, ... Microprocessors and Microsystems 38 (8), 976-990, 2014 | 90 | 2014 |
An empirical roofline methodology for quantitatively assessing performance portability C Yang, R Gayatri, T Kurth, P Basu, Z Ronaghi, A Adetokunbo, B Friesen, ... 2018 IEEE/ACM International Workshop on Performance, Portability and …, 2018 | 46 | 2018 |
A case study for performance portability using OpenMP 4.5 R Gayatri, C Yang, T Kurth, J Deslippe Accelerator Programming Using Directives: 5th International Workshop, WACCPD …, 2019 | 40 | 2019 |
Billion atom molecular dynamics simulations of carbon at extreme conditions and experimental time and length scales K Nguyen-Cong, JT Willman, SG Moore, AB Belonoshko, R Gayatri, ... Proceedings of the International Conference for High Performance Computing …, 2021 | 30 | 2021 |
A novel multi-level integrated roofline model approach for performance characterization T Koskela, Z Matveev, C Yang, A Adedoyin, R Belenov, P Thierry, Z Zhao, ... High Performance Computing: 33rd International Conference, ISC High …, 2018 | 22 | 2018 |
Experiences in porting mini‐applications to OpenACC and OpenMP on heterogeneous systems VG Vergara Larrea, RD Budiardja, R Gayatri, C Daley, O Hernandez, ... Concurrency and Computation: Practice and Experience 32 (20), e5780, 2020 | 18 | 2020 |
Case study of using Kokkos and SYCL as performance-portable frameworks for Milc-Dslash benchmark on NVIDIA, AMD and Intel GPUs AS Dufek, R Gayatri, N Mehta, D Doerfler, B Cook, Y Ghadar, C DeTar 2021 International Workshop on Performance, Portability and Productivity in …, 2021 | 11 | 2021 |
Rapid exploration of optimization strategies on advanced architectures using testsnap and lammps R Gayatri, S Moore, E Weinberg, N Lubbers, S Anderson, J Deslippe, ... arXiv preprint arXiv:2011.12875, 2020 | 11 | 2020 |
Loop level speculation in a task based programming model R Gayatri, RM Badia, E Aygaude 20th Annual International Conference on High Performance Computing, 39-48, 2013 | 8 | 2013 |
Comparing managed memory and ats with and without prefetching on nvidia volta gpus R Gayatri, K Gott, J Deslippe 2019 IEEE/ACM Performance Modeling, Benchmarking and Simulation of High …, 2019 | 7 | 2019 |
Evaluating performance portability of OpenMP for SNAP on NVIDIA, Intel, and AMD GPUs using the roofline methodology NA Mehta, R Gayatri, Y Ghadar, C Knight, J Deslippe Accelerator Programming Using Directives: 7th International Workshop, WACCPD …, 2021 | 6 | 2021 |
Kokkos 3: Programming Model Extensions for the Exascale Era, IEEE T. Parall. Distr., 33, 805–817 CR Trott, D Lebrun-Grandié, D Arndt, J Ciesko, V Dang, N Ellingwood, ... | 5 | 2022 |
Transactional access to shared memory in StarSs, a task based programming model R Gayatri, RM Badia, E Ayguade, M Luján, I Watson Euro-Par 2012 Parallel Processing: 18th International Conference, Euro-Par …, 2012 | 5 | 2012 |
Scaling and performance portability of the particle-in-cell scheme for plasma physics applications through mini-apps targeting exascale architectures S Muralikrishnan, M Frey, A Vinciguerra, M Ligotino, AJ Cerfon, ... arXiv preprint arXiv:2205.11052, 2022 | 3 | 2022 |
Non-recurring engineering (NRE) best practices: a case study with the NERSC/NVIDIA OpenMP contract CS Daley, A Southwell, R Gayatri, S Biersdorfff, C Toepfer, G Özen, ... Proceedings of the International Conference for High Performance Computing …, 2021 | 2 | 2021 |
The Kokkos OpenMPTarget Backend: Implementation and Lessons Learned R Gayatri, SL Olivier, CR Trott, J Doerfert, J Ciesko, D Lebrun-Grandie International Workshop on OpenMP, 99-113, 2023 | 1 | 2023 |
A Methodology for Evaluating Tightly-integrated and Disaggregated Accelerated Architectures T Groves, C Daley, R Gayatri, HA Nam, N Ding, L Oliker, NJ Wright, ... 2022 IEEE/ACM International Workshop on Performance Modeling, Benchmarking …, 2022 | 1 | 2022 |
Increasing parallelism through speculation in a task-based programming model R Gayatri Universitat Politècnica de Catalunya (UPC), 2015 | 1 | 2015 |
ALPINE: A set of performance portable plasma physics particle-in-cell mini-apps for exascale computing. S Muralikrishnan, M Frey, A Vinciguerra, M Ligotino, AJ Cerfon, ... CoRR, 2022 | | 2022 |