Shuai Che
Rodinia: A benchmark suite for heterogeneous computing
S Che, M Boyer, J Meng, D Tarjan, JW Sheaffer, SH Lee, K Skadron
2009 IEEE International Symposium on Workload Characterization (IISWC), 2009
A performance study of general-purpose applications on graphics processors using CUDA
S Che, M Boyer, J Meng, D Tarjan, JW Sheaffer, K Skadron
Journal of Parallel and Distributed Computing (JPDC) 68 (10), 1370-1380, 2008
Accelerating compute-intensive applications with GPUs and FPGAs
S Che, J Li, JW Sheaffer, K Skadron, J Lach
2008 IEEE Symposium on Application Specific Processors (SASP), 2008
A characterization of the Rodinia benchmark suite with comparison to contemporary CMP workloads
S Che, JW Sheaffer, M Boyer, LG Szafaryn, L Wang, K Skadron
2010 IEEE International Symposium on Workload Characterization (IISWC), 2010
Pannotia: Understanding Irregular GPGPU Graph Applications
S Che, BM Beckmann, SK Reinhardt, K Skadron
2013 IEEE International Symposium on Workload Characterization (IISWC), 2013
Load balancing in a changing world: dealing with heterogeneity and performance variability
M Boyer, K Skadron, S Che, N Jayasena
The 10th Conference on Computing Frontiers, 2013
AWB-GCN: A graph convolutional network accelerator with runtime workload rebalancing
T Geng, A Li, R Shi, C Wu, T Wang, Y Li, P Haghi, A Tumeo, S Che, ...
2020 53rd Annual IEEE/ACM International Symposium on Microarchitecture …, 2020
Dymaxion: Optimizing Memory Access Patterns for Heterogeneous Systems
S Che, JW Sheaffer, K Skadron
2011 International Conference for High Performance Computing, Networking …, 2011
QuickRelease: A throughput-oriented approach to release consistency on GPUs
BA Hechtman, S Che, DR Hower, Y Tian, BM Beckmann, MD Hill, ...
2014 IEEE 20th International Symposium on High Performance Computer …, 2014
SPEC ACCEL: A standard application suite for measuring hardware accelerator performance
G Juckeland, W Brantley, S Chandrasekaran, B Chapman, S Che, ...
International Workshop on Performance Modeling, Benchmarking and Simulation …, 2014
Using Cycle Stacks to Understand Scaling Bottlenecks in Multi-Threaded Workloads
W Heirman, TE Carlson, S Che, K Skadron, L Eeckhout
2011 IEEE International Symposium on Workload Characterization (IISWC), 2011
Synchronization Using Remote-Scope Promotion
MS Orr, S Che, A Yilmazer, BM Beckmann, MD Hill, DA Wood
International Conference on Architectural Support for Programming Languages …, 2015
Auto-tuning strategies for parallelizing sparse matrix-vector (SpMV) multiplication on multi- and many-core processors
K Hou, W Feng, S Che
2017 IEEE International Parallel and Distributed Processing Symposium …, 2017
GasCL: A Vertex-Centric Graph Model for GPUs
S Che
IEEE High Performance Extreme Computing Conference (HPEC), 2014
Toward more efficient NoC arbitration: A deep reinforcement learning approach
J Yin, Y Eckert, S Che, M Oskin, GH Loh
Proc. IEEE 1st Int. Workshop AI-assisted Des. Architecture 128, 2018
Offloading Execution of an Application by a Network Connected Device
S Che
US Patent App. 15/174,624, 2017
BenchFriend: Correlating the performance of GPU benchmarks
S Che, K Skadron
International Journal of High Performance Computing Applications, 2013
Implementing directed acyclic graphs with the heterogeneous system architecture
S Puthoor, AM Aji, S Che, M Daga, W Wu, BM Beckmann, G Rodgers
Proceedings of the 9th Annual Workshop on General Purpose Processing using …, 2016
System and method for repurposing dead cache blocks
GH Loh, DR Hower, S Che
US Patent 9,990,289, 2018
BelRed: Constructing GPGPU Graph Applications with Software Building Blocks
S Che, BM Beckmann, SK Reinhardt
IEEE High Performance Extreme Computing Conference (HPEC), 2014