Fault prediction under the microscope: A closer look into HPC systems | A Gainaru, F Cappello, M Snir, W Kramer | Proceedings of the International Conference on High Performance Computing …, 2012 | 151 | 2012 |
Modeling and tolerating heterogeneous failures in large parallel systems | E Heien, D Kondo, A Gainaru, D LaPine, B Kramer, F Cappello | High Performance Computing, Networking, Storage and Analysis (SC), 2011 …, 2011 | 109 | 2011 |
Taming of the shrew: Modeling the normal and faulty behaviour of large-scale HPC systems | A Gainaru, F Cappello, W Kramer | 2012 IEEE 26th International Parallel and Distributed Processing Symposium …, 2012 | 91 | 2012 |
Scheduling the I/O of HPC applications under congestion | A Gainaru, G Aupy, A Benoit, F Cappello, Y Robert, M Snir | 2015 IEEE International Parallel and Distributed Processing Symposium, 1013-1022, 2015 | 74 | 2015 |
Improving the computing efficiency of HPC systems using a combination of proactive and preventive checkpointing | MS Bouguerra, A Gainaru, LB Gomez, F Cappello, S Matsuoka, ... | Parallel & Distributed Processing (IPDPS), 2013 IEEE 27th International …, 2013 | 66 | 2013 |
A Realistic Mobility Model Based on Social Networks for the Simulation of VANETs | A Gainaru, C Dobre, V Cristea | Vehicular Technology Conference, 2009. VTC Spring 2009. IEEE 69th, 1-5, 2009 | 52 | 2009 |
Failure prediction for HPC systems and applications: Current situation and open issues | A Gainaru, F Cappello, M Snir, W Kramer | The International Journal of High Performance Computing Applications 27 (3 …, 2013 | 47 | 2013 |
Event log mining tool for large scale HPC systems | A Gainaru, F Cappello, S Trausan-Matu, B Kramer | European Conference on Parallel Processing, 52-64, 2011 | 47 | 2011 |
Adaptive event prediction strategy with dynamic time window for large-scale HPC systems | A Gainaru, F Cappello, J Fullop, S Trausan-Matu, W Kramer | Managing Large-scale Systems via the Analysis of System Logs and the …, 2011 | 43 | 2011 |
Reducing waste in extreme scale systems through introspective analysis | L Bautista-Gomez, A Gainaru, S Perarnau, D Tiwari, S Gupta, ... | 2016 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2016 | 36 | 2016 |
Periodic I/O scheduling for super-computers | G Aupy, A Gainaru, V Le Fèvre | International Workshop on Performance Modeling, Benchmarking and Simulation …, 2017 | 16 | 2017 |
Mapping data mining algorithms on a GPU architecture: a study | A Gainaru, E Slusanschi, S Trausan-Matu | International Symposium on Methodologies for Intelligent Systems, 102-112, 2011 | 15 | 2011 |
Errors and Faults | A Gainaru, F Cappello | Fault-Tolerance Techniques for High-Performance Computing, 89-144, 2015 | 12 | 2015 |
Scheduling the I/O of HPC applications under congestion | A Gainaru, G Aupy, A Benoit, F Cappello, Y Robert, M Snir | LIP; INRIA, 2014 | 12 | 2014 |
Reservation Strategies for Stochastic Jobs | G Aupy, A Gainaru, V Honoré, P Raghavan, Y Robert, H Sun | 2019 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2019 | 10 | 2019 |
Scheduling Parallel Tasks under Multiple Resources: List Scheduling vs. Pack Scheduling | H Sun, R Elghazi, A Gainaru, G Aupy, P Raghavan | 2018 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2018 | 9 | 2018 |
Speculative scheduling for stochastic HPC applications | A Gainaru, GP Aupy, H Sun, P Raghavan | Proceedings of the 48th International Conference on Parallel Processing, 1-10, 2019 | 7 | 2019 |
Navigating the Blue Waters: Online Failure Prediction in the Petascale Era | A Gainaru, MS Bouguerra, F Cappello, M Snir, W Kramer | Argonne National Laboratory Technical Report, ANL/MCS-P5219-1014, 2014 | 7 | 2014 |
Real time analysis and event prediction engine | J Fullop, A Gainaru, J Plutchak | Proceedings of the Cray User Group meeting, 2012 | 7 | 2012 |
On-the-fly scheduling versus reservation-based scheduling for unpredictable workflows | A Gainaru, H Sun, G Aupy, Y Huo, BA Landman, P Raghavan | The International Journal of High Performance Computing Applications …, 2019 | 6 | 2019 |