Offloading support for OpenMP in Clang and LLVM
SF Antao, A Bataev, AC Jacob, GT Bercea, AE Eichenberger, G Rokos, ...
2016 Third Workshop on the LLVM Compiler Infrastructure in HPC (LLVM-HPC), 1-11, 2016
Integrating GPU support for OpenMP offloading directives into Clang
C Bertolli, SF Antao, GT Bercea, AC Jacob, AE Eichenberger, T Chen, ...
Proceedings of the Second Workshop on the LLVM Compiler Infrastructure in …, 2015
Performance analysis of OpenMP on a GPU using a CORAL proxy application
GT Bercea, C Bertolli, SF Antao, AC Jacob, AE Eichenberger, T Chen, ...
Proceedings of the 6th International Workshop on Performance Modeling …, 2015
A fast and scalable graph coloring algorithm for multi-core and many-core architectures
G Rokos, G Gorman, PHJ Kelly
Euro-Par 2015: Parallel Processing: 21st International Conference on …, 2015
Hybrid OpenMP/MPI anisotropic mesh smoothing
GJ Gorman, J Southern, PE Farrell, MD Piggott, G Rokos, PHJ Kelly
Procedia Computer Science 9, 1513-1522, 2012
Performance analysis and optimization of Clang's OpenMP 4.5 GPU support
M Martineau, S McIntosh-Smith, C Bertolli, AC Jacob, SF Antao, ...
2016 7th International Workshop on Performance Modeling, Benchmarking and …, 2016
Efficient fork-join on GPUs through warp specialization
AC Jacob, AE Eichenberger, H Sung, SF Antão, GT Bercea, C Bertolli, ...
2017 IEEE 24th International Conference on High Performance Computing (HiPC …, 2017
Thread-parallel anisotropic mesh adaptation
GJ Gorman, G Rokos, J Southern, PHJ Kelly
New challenges in grid generation and adaptivity for scientific computing …, 2015
Thread parallelism for highly irregular computation in anisotropic mesh adaptation
G Rokos, GJ Gorman, KE Jensen, PHJ Kelly
arXiv preprint arXiv:1505.04694, 2015
A thread-parallel algorithm for anisotropic mesh adaptation
G Rokos, GJ Gorman, J Southern, PHJ Kelly
arXiv preprint arXiv:1308.2480, 2013
Pragmatic–parallel anisotropic adaptive mesh toolkit
G Rokos, G Gorman
Facing the Multicore-Challenge III: Aspects of New Paradigms and …, 2013
Implementing implicit OpenMP data sharing on GPUs
GT Bercea, C Bertolli, AC Jacob, A Eichenberger, A Bataev, G Rokos, ...
Proceedings of the Fourth Workshop on the LLVM Compiler Infrastructure in …, 2017
Accelerating Optimisation-Based Anisotropic Mesh Adaptation using nVIDIA’s CUDA Architecture
G Rokos
MSc thesis, Imperial College London, 2010
Towards performance portable GPU programming with RAJA
A Jacob, SF Antao, H Sung, AE Eichenberger, C Bertolli, GT Bercea, ...
Workshop on Portability Among HPC Architectures for Scientific Applications, 2015
Accelerating anisotropic mesh adaptivity on nVIDIA’s CUDA using texture interpolation
G Rokos, G Gorman, PHJ Kelly
Euro-Par 2011 Parallel Processing: 17th International Conference, Euro-Par …, 2011
Solving the advection PDE on the Cell Broadband Engine
G Rokos, G Peteinatos, G Kouveli, G Goumas, K Kourtis, N Koziris
2010 IEEE International Symposium on Parallel & Distributed Processing …, 2010
Scalable multithreaded algorithms for mutable irregular data with application to anisotropic mesh adaptivity
G Rokos
Imperial College London, 2014
An Interrupt-Driven Work-Sharing For-Loop Scheduler
G Rokos, GJ Gorman, PHJ Kelly
arXiv preprint arXiv:1505.04134, 2015
Offloading Support for OpenMP in Clang and LLVM
C Bertolli, AEE Bercea, G Rokos, M Martineau, T Jin, G Ozen, Z Sura, ...
