Bharat Kaul

Cited by

	All	Since 2019
Citations	2483	2075
h-index	19	17
i10-index	28	23

Citations per year: 2013: 7, 2014: 12, 2015: 19, 2016: 67, 2017: 79, 2018: 186, 2019: 231, 2020: 350, 2021: 474, 2022: 480, 2023: 425, 2024: 111

Public access

4 articles

1 article

available

not available

Based on funding mandates

Co-authors

Dipankar Das - Intel Parallel Computing Labs, Intel Labs - Verified email at intel.com
Sasikanth Avancha - Verified email at intel.com
Pradeep Dubey - Intel Corporation - Verified email at intel.com
Dheevatsa Mudigere - Distinguished Engineer, NVIDIA - Verified email at nvidia.com
Naveen Mellempudi - Fellow, Advanced Micro Devices - Verified email at amd.com
Alexander Heinecke - Senior Principal Engineer at Intel Labs - Verified email at intel.com
Abhisek Kundu - Research Scientist, Intel Parallel Computing Labs, India - Verified email at intel.com
Anand Raghunathan - Professor of Electrical and Computer Engineering, Purdue University - Verified email at purdue.edu
Sudarshan Srinivasan - Intel - Verified email at intel.com
Srinivas Sridharan, Phd - Distinguished Engineer, NVIDIA - Verified email at nvidia.com
Ashish Ranjan - Research Staff Member, IBM T.J. Watson Research Center - Verified email at ibm.com
Kunal Banerjee - Data Science Foundation, Walmart - Verified email at walmartlabs.com
Evangelos Georganas - Intel Labs, Parallel Computing Lab - Verified email at intel.com
Swagath Venkataramani - Research Staff Member, IBM T.J. Watson Research Center / Purdue Univ. - Verified email at ibm.com
Subarno Banerjee - University of Michigan - Verified email at umich.edu
Mikhail Smelyanskiy - Facebook - Verified email at intel.com
Eric Qin - Georgia Tech - Verified email at gatech.edu
Jongsoo Park - Research Scientist, Facebook - Verified email at fb.com
Nataraj Jammalamadaka - Phd Scholar - Verified email at research.iiit.ac.in
Balaraman Ravindran - Professor of Computer Science, Indian Institute of Technology Madras - Verified email at cse.iitm.ac.in

Bharat Kaul

Intel Labs

Verified email at intel.com

Parallel Programming, Deep Learning, HPC, Computational Biology, CPU and GPU architecture


Title	Authors	Venue	Cited by	Year
Sigma: A sparse and irregular gemm accelerator with flexible interconnects for dnn training	E Qin, A Samajdar, H Kwon, V Nadella, S Srinivasan, D Das, B Kaul, ...	2020 IEEE International Symposium on High Performance Computer Architecture …, 2020	375	2020
A study of BFLOAT16 for deep learning training	D Kalamkar, D Mudigere, N Mellempudi, D Das, K Banerjee, S Avancha, ...	arXiv preprint arXiv:1905.12322, 2019	295	2019
Scaledeep: A scalable compute architecture for learning and evaluating deep networks	S Venkataramani, A Ranjan, S Banerjee, D Das, S Avancha, ...	Proceedings of the 44th Annual International Symposium on Computer …, 2017	265	2017
Out-of-distribution detection using an ensemble of self supervised leave-out classifiers	A Vyas, N Jammalamadaka, X Zhu, D Das, B Kaul, TL Willke	Proceedings of the European conference on computer vision (ECCV), 550-564, 2018	259	2018
Distributed deep learning using synchronous stochastic gradient descent	D Das, S Avancha, D Mudigere, K Vaidynathan, S Sridharan, D Kalamkar, ...	arXiv preprint arXiv:1602.06709, 2016	207	2016
Method and apparatus to manage network addresses	B Kaul, N Tulpule, M Zhu, P Krishnaswamy	US Patent App. 10/651,929, 2005	203	2005
Mixed precision training of convolutional neural networks using integer operations	D Das, N Mellempudi, D Mudigere, D Kalamkar, S Avancha, K Banerjee, ...	arXiv preprint arXiv:1802.00930, 2018	187	2018
Ternary neural networks with fine-grained quantization	N Mellempudi, A Kundu, D Mudigere, D Das, B Kaul, P Dubey	arXiv preprint arXiv:1705.01462, 2017	128	2017
Mixed precision training with 8-bit floating point	N Mellempudi, S Srinivasan, D Das, B Kaul	arXiv preprint arXiv:1905.12334, 2019	69	2019
Apparatuses, methods, and systems for neural networks	S Venkataramani, D Das, A Ranjan, S Banerjee, S Avancha, ...	US Patent App. 16/317,497, 2019	53	2019
Data structure and movement for lattice-based simulations	AG Shet, SH Sorathiya, S Krithivasan, AM Deshpande, B Kaul, ...	Physical Review E 88 (1), 013314, 2013	50	2013
X-mann: A crossbar based architecture for memory augmented neural networks	A Ranjan, S Jain, JR Stevens, D Das, B Kaul, A Raghunathan	Proceedings of the 56th Annual Design Automation Conference 2019, 1-6, 2019	41	2019
On scale-out deep learning training for cloud and hpc	S Sridharan, K Vaidyanathan, D Kalamkar, D Das, ME Smorkalov, ...	arXiv preprint arXiv:1801.08030, 2018	34	2018
Mixed low-precision deep learning inference using dynamic fixed point	N Mellempudi, A Kundu, D Das, D Mudigere, B Kaul	arXiv preprint arXiv:1701.08978, 2017	28	2017
Manna: An accelerator for memory-augmented neural networks	JR Stevens, A Ranjan, D Das, B Kaul, A Raghunathan	Proceedings of the 52nd Annual IEEE/ACM International Symposium on …, 2019	27	2019
Rail: Risk-averse imitation learning	A Santara, A Naik, B Ravindran, D Das, D Mudigere, S Avancha, B Kaul	arXiv preprint arXiv:1707.06658, 2017	23	2017
Madras: Multi agent driving simulator	A Santara, S Rudra, SA Buridi, M Kaushik, A Naik, B Kaul, B Ravindran	Journal of Artificial Intelligence Research 70, 1517-1555, 2021	22	2021
Polydl: Polyhedral optimizations for creation of high-performance dl primitives	S Tavarageri, A Heinecke, S Avancha, B Kaul, G Goyal, R Upadrasta	ACM Transactions on Architecture and Code Optimization (TACO) 18 (1), 1-27, 2021	21	2021
Exploring shared-memory optimizations for an unstructured mesh CFD application on modern parallel systems	D Mudigere, S Sridharan, A Deshpande, J Park, A Heinecke, ...	2015 IEEE International Parallel and Distributed Processing Symposium, 723-732, 2015	21	2015
On vectorization for lattice based simulations	AG Shet, K Siddharth, SH Sorathiya, AM Deshpande, SD Sherlekar, ...	International Journal of Modern Physics C 24 (12), 1340011, 2013	18	2013
