Follow
Aravind Sukumaran Rajam
Title
Cited by
Cited by
Year
Adaptive sparse tiling for sparse matrix multiplication
C Hong, A Sukumaran-Rajam, I Nisa, K Singh, P Sadayappan
Proceedings of the 24th Symposium on Principles and Practice of Parallel …, 2019
752019
Register optimizations for stencils on GPUs
PS Rawat, F Rastello, A Sukumaran-Rajam, LN Pouchet, A Rountev, ...
Proceedings of the 23rd ACM SIGPLAN Symposium on Principles and Practice of …, 2018
492018
A code generator for high-performance tensor contractions on GPUs
J Kim, A Sukumaran-Rajam, V Thumma, S Krishnamoorthy, A Panyala, ...
Proceedings of the 2019 IEEE/ACM International Symposium on Code Generation …, 2019
412019
Domain-specific optimization and generation of high-performance GPU code for stencil computations
PS Rawat, M Vaidya, A Sukumaran-Rajam, M Ravishankar, V Grover, ...
Proceedings of the IEEE 106 (11), 1902-1920, 2018
362018
Load-Balanced Sparse MTTKRP on GPUs
I Nisa, J Li, A Sukumaran-Rajam, R Vuduc, P Sadayappan
arXiv preprint arXiv:1904.03329, 2019
332019
MultiGraph: Efficient graph processing on GPUs
C Hong, A Sukumaran-Rajam, J Kim, P Sadayappan
2017 26th International Conference on Parallel Architectures and Compilation …, 2017
322017
The polyhedral model of nonlinear loops
A Sukumaran-Rajam, P Clauss
ACM Transactions on Architecture and Code Optimization (TACO) 12 (4), 1-27, 2015
322015
Efficient sparse-matrix multi-vector product on gpus
C Hong, A Sukumaran-Rajam, B Bandyopadhyay, J Kim, SE Kurt, I Nisa, ...
Proceedings of the 27th International Symposium on High-Performance Parallel …, 2018
302018
Effective machine learning based format selection and performance modeling for SpMV on GPUs
I Nisa, C Siegel, AS Rajam, A Vishnu, P Sadayappan
2018 IEEE International Parallel and Distributed Processing Symposium …, 2018
252018
Parallel ccd++ on gpu for matrix factorization
I Nisa, A Sukumaran-Rajam, R Kunchum, P Sadayappan
Proceedings of the General Purpose GPUs, 73-83, 2017
232017
Sampled dense matrix multiplication for high-performance machine learning
I Nisa, A Sukumaran-Rajam, SE Kurt, C Hong, P Sadayappan
2018 IEEE 25th International Conference on High Performance Computing (HiPC …, 2018
222018
Analytical cache modeling and tilesize optimization for tensor contractions
R Li, A Sukumaran-Rajam, R Veras, TM Low, F Rastello, A Rountev, ...
Proceedings of the International Conference for High Performance Computing …, 2019
202019
An efficient mixed-mode representation of sparse tensors
I Nisa, J Li, A Sukumaran-Rajam, PS Rawat, S Krishnamoorthy, ...
Proceedings of the International Conference for High Performance Computing …, 2019
192019
Speculative program parallelization with scalable and decentralized runtime verification
A Sukumaran-Rajam, JM Martinez Caamaño, W Wolff, A Jimborean, ...
International Conference on Runtime Verification, 124-139, 2014
192014
Israt Nisa, Shivani Sabhlok, Ümit V. Çatalyürek, Srinivasan Parthasarathy, and P. Sadayappan. 2018. Efficient Sparse-matrix Multi-vector Product on GPUs
C Hong, A Sukumaran-Rajam, B Bandyopadhyay, J Kim, SE Kurt
Proceedings of the 27th International Symposium on High-Performance Parallel …, 2018
182018
APOLLO: Automatic speculative POLyhedral Loop Optimizer
JMM Caamaño, A Sukumaran-Rajam, A Baloian, M Selva, P Clauss
IMPACT 2017-7th International Workshop on Polyhedral Compilation Techniques, 8, 2017
172017
On Optimizing Complex Stencils on GPUs
PS Rawat, M Vaidya, A Sukumaran-Rajam, A Rountev, LN Pouchet, ...
17*
Analytical characterization and design space exploration for optimization of CNNs
R Li, Y Xu, A Sukumaran-Rajam, A Rountev, P Sadayappan
Proceedings of the 26th ACM International Conference on Architectural …, 2021
152021
Optimizing tensor contractions in ccsd (t) for efficient execution on gpus
J Kim, A Sukumaran-Rajam, C Hong, A Panyala, RK Srivastava, ...
Proceedings of the 2018 International Conference on Supercomputing, 96-106, 2018
142018
On improving performance of sparse matrix-matrix multiplication on GPUs
R Kunchum, A Chaudhry, A Sukumaran-Rajam, Q Niu, I Nisa, ...
Proceedings of the International Conference on Supercomputing, 14, 2017
132017
The system can't perform the operation now. Try again later.
Articles 1–20