Yehia Arafa
Qualcomm R&D, PhD in Computer Engineering
Verified email at qualcomm.com
Title
Cited by
Year
Verified instruction-level energy consumption measurement for NVIDIA GPUs
Y Arafa, A ElWazir, A ElKanishy, Y Aly, A Elsayed, AH Badawy, ...
Proceedings of the 17th ACM International Conference on Computing Frontiers …, 2020
39 2020
Low Overhead Instruction Latency Characterization for NVIDIA GPGPUs
Y Arafa, AHA Badawy, G Chennupati, N Santhi, S Eidenbenz
2019 IEEE High Performance Extreme Computing Conference (HPEC), 1-8, 2019
38* 2019
PPT-GPU: Scalable GPU performance modeling
Y Arafa, AHA Badawy, G Chennupati, N Santhi, S Eidenbenz
IEEE Computer Architecture Letters 18 (1), 55-58, 2019
38 2019
Hybrid, scalable, trace-driven performance modeling of GPGPUs
Y Arafa, AH Badawy, A ElWazir, A Barai, A Eker, G Chennupati, N Santhi, ...
Proceedings of the International Conference for High Performance Computing …, 2021
20 2021
Fast, accurate, and scalable memory modeling of GPGPUs using reuse profiles
Y Arafa, AH Badawy, G Chennupati, A Barai, N Santhi, S Eidenbenz
Proceedings of the 34th ACM International Conference on Supercomputing, 1-12, 2020
20 2020
GPUs cache performance estimation using reuse distance analysis
Y Arafa, G Chennupati, A Barai, AHA Badawy, N Santhi, S Eidenbenz
2019 IEEE 38th International Performance Computing and Communications …, 2019
11 2019
Demystifying the NVIDIA Ampere architecture through microbenchmarking and instruction-level analysis
H Abdelkhalik, Y Arafa, N Santhi, AHA Badawy
2022 IEEE High Performance Extreme Computing Conference (HPEC), 1-8, 2022
10 2022
Fault tolerance performance evaluation of large-scale distributed storage systems HDFS and Ceph case study
Y Arafa, A Barai, M Zheng, AHA Badawy
2018 IEEE High Performance Extreme Computing Conference (HPEC), 1-7, 2018
10 2018
PPT-SASMM: Scalable analytical shared memory model: Predicting the performance of multicore caches from a single-threaded execution trace
A Barai, G Chennupati, N Santhi, AH Badawy, Y Arafa, S Eidenbenz
Proceedings of the International Symposium on Memory Systems, 341-351, 2020
8 2020
Efficient intra-rack resource disaggregation for HPC using co-packaged DWDM photonics
G Michelogiannakis, Y Arafa, B Cook, LY Dai, AHH Badawy, M Glick, ...
2023 IEEE International Conference on Cluster Computing (CLUSTER), 158-172, 2023
7 2023
Load-aware dynamic time synchronization in parallel discrete event simulation
A Eker, Y Arafa, AHA Badawy, N Santhi, S Eidenbenz, D Ponomarev
Proceedings of the 2021 ACM SIGSIM Conference on Principles of Advanced …, 2021
7 2021
PPT-Multicore: performance prediction of OpenMP applications using reuse profiles and analytical modeling
A Barai, Y Arafa, AH Badawy, G Chennupati, N Santhi, S Eidenbenz
The Journal of Supercomputing, 1-32, 2022
6 2022
PPT-GPU: Performance prediction toolkit for GPUs identifying the impact of caches
Y Arafa, AHA Badawy, G Chennupati, N Santhi, S Eidenbenz
Proceedings of the International Symposium on Memory Systems, 301-302, 2018
5 2018
Evaluating the fault tolerance performance of HDFS and Ceph
Y Arafa, A Barai, M Zheng, AHA Badawy
Proceedings of the Practice and Experience on Advanced Research Computing, 1-3, 2018
3 2018
NVIDIA GPGPUs Instructions Energy Consumption
Y Arafa, A ElWazir, A Elkanishy, Y Aly, A Elsayed, AH Badawy, ...
2020 IEEE International Symposium on Performance Analysis of Systems and …, 2020
1 2020
PPT-SASMM: Scalable analytical shared memory model
A Barai, G Chennupati, N Santhi, AHA Badawy, Y Arafa, SJ Eidenbenz
Press of the 6th International Symposium on Memory Systems (MEMSYS). ACM …, 2020
1 2020
BB-ML: Basic Block Performance Prediction using Machine Learning Techniques
H Abdelkhalik, S Aktar, Y Arafa, A Barai, G Chennupati, N Santhi, ...
2023 IEEE 29th International Conference on Parallel and Distributed Systems …, 2023
2023
Modeling and Characterizing Shared and Local Memories of the Ampere GPUs
H Abdelkhalik, Y Arafa, N Santhi, N Prajapati, AHA Badawy
Proceedings of the International Symposium on Memory Systems, 1-3, 2023
2023
BB-ML: Basic Block Performance Prediction using Machine Learning Techniques
S Aktar, H Abdelkhalik, NH Turja, Y Arafa, A Barai, N Panda, ...
arXiv preprint arXiv:2202.07798, 2022
2022
PPT-Multicore: performance prediction of OpenMP applications using reuse profiles and analytical modeling
A Barai, Y Arafa, AH Badawy, G Chennupati, N Santhi, S Eidenbenz
Journal of Supercomputing 78 (LA-UR-21-22749), 2021
2021