|Conservation cores: reducing the energy of mature computations|
G Venkatesh, J Sampson, N Goulding, S Garcia, V Bryksin, ...
ACM SIGARCH Computer Architecture News 38 (1), 205-218, 2010
|Mixed precision training|
P Micikevicius, S Narang, J Alben, G Diamos, E Elsen, D Garcia, ...
arXiv preprint arXiv:1710.03740, 2017
|The GreenDroid mobile application processor: An architecture for silicon's dark future|
N Goulding-Hotta, J Sampson, G Venkatesh, S Garcia, J Auricchio, ...
IEEE Micro 31 (2), 86-95, 2011
|QsCores: Trading dark silicon for scalable energy efficiency with quasi-specific cores|
G Venkatesh, J Sampson, N Goulding-Hotta, SK Venkata, MB Taylor, ...
Proceedings of the 44th Annual IEEE/ACM International Symposium on …, 2011
|Unbounded page-based transactional memory|
W Chuang, S Narayanasamy, G Venkatesh, J Sampson, ...
ACM SIGPLAN Notices 41 (11), 347-358, 2006
|Runnemede: An architecture for ubiquitous high-performance computing|
NP Carter, A Agrawal, S Borkar, R Cledat, H David, D Dunning, J Fryman, ...
High Performance Computer Architecture (HPCA2013), 2013 IEEE 19th …, 2013
|Accelerating Deep Convolutional Network via Low Precision and Sparsity|
G Venkatesh, E Nurvitadhi, D Marr
|Efficient complex operators for irregular codes|
J Sampson, G Venkatesh, N Goulding-Hotta, S Garcia, S Swanson, ...
High Performance Computer Architecture (HPCA), 2011 IEEE 17th International …, 2011
|GreenDroid: A mobile application processor for a future of dark silicon|
N Goulding, J Sampson, G Venkatesh, S Garcia, J Auricchio, J Babb, ...
Hot Chips 22, 2010
|Hardware accelerator for analytics of sparse data|
E Nurvitadhi, A Mishra, Y Wang, G Venkatesh, D Marr
2016 Design, Automation & Test in Europe Conference & Exhibition (DATE …, 2016
|An Evaluation of Selective Depipelining for FPGA-based Energy-Reducing Irregular Code Coprocessors|
J Sampson, M Arora, N Goulding-Hotta, G Venkatesh, J Babb, V Bhatt, ...
Field Programmable Logic and Applications (FPL), 2011 International …, 2011
|Reducing the energy cost of irregular code bases in soft processor systems|
M Arora, J Sampson, N Goulding-Hotta, J Babb, G Venkatesh, MB Taylor, ...
Field-Programmable Custom Computing Machines (FCCM), 2011 IEEE 19th Annual …, 2011
|Exploiting a computation reuse cache to reduce energy in network processors|
B Li, G Venkatesh, B Calder, R Gupta
High Performance Embedded Architectures and Compilers, 251-265, 2005
|Efficient sparse array handling in a processor|
G Venkatesh, TC Zhang, DT Marr
US Patent App. 14/747,182, 2016
|Scaling the Utilization Wall: The Case for Massively Heterogeneous Multiprocessors|
I Ahn, N Goudling, J Sampson, G Venkatesh, MB Taylor, SJ Swanson
Department of Computer Science and Engineering, University of California …, 2009
|APE: accelerator processor extensions to optimize data-compute co-location|
Proceedings of the ACM SIGPLAN Workshop on Memory Systems Performance and …, 2013
|Configurable energy-efficient co-processors to scale the utilization wall|
|Quasi-ASICs: Trading Area for Energy by Exploiting Similarity in Synthesized Cores for Irregular Code|
G Venkatesh, J Sampson, N Goulding, SJ Swanson, MB Taylor
Department of Computer Science and Engineering, University of California …, 2011
|Page-Based Transactional Memory to Provide Fast Virtual Transactions|
W Chuang, S Narayanasamy, G Pokam, J Sampson, M Van Biesbrouck, ...