Prashanth L.A.

Cited by

	All	Since 2019
Citations	2267	1505
h-index	18	17
i10-index	31	28

380

190

285

201120122013201420152016201720182019202020212022202320249 15 50 64 81 99 76 127 180 207 272 292 380 173

Public access

View all

19 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Shalabh BhatnagarProfessor in the Department of Computer Science and Automation, Indian Institute of ScienceVerified email at iisc.ac.in
Michael C. FuUniversity of MarylandVerified email at umd.edu
Mohammad GhavamzadehAmazonVerified email at amazon.com
Krishna JagannathanProfessor, Department of Electrical Engineering, IIT MadrasVerified email at ee.iitm.ac.in
H L PrasadChairman and CTO at Astrome TechnologiesVerified email at csa.iisc.ernet.in
Rémi MunosGoogle DeepMindVerified email at inria.fr
Ravi Kumar KollaIIT MadrasVerified email at ee.iitm.ac.in
Csaba SzepesvariDeepMind & University of AlbertaVerified email at cs.ualberta.ca
Sanjay P. BhatTata Consultancy Services LimitedVerified email at tcs.com
Cheng JiePinterest LLC, University of Maryland, College Park, Walmart Global TechVerified email at pinterest.com
Nirmit DesaiIBM ResearchVerified email at us.ibm.com
Nirav BhavsarM.S. Scholar in the Department of Computer Science and Engineering, Indian Institute of TechnologyVerified email at cse.iitm.ac.in
Nithia VijayanResearch Fellow, School of Computing, National University of SingaporeVerified email at comp.nus.edu.sg
Aditya GopalanIndian Institute of Science, BangaloreVerified email at iisc.ac.in
Doina PrecupDeepMind and McGill UniversityVerified email at cs.mcgill.ca
gargi dasguptaIBM Research LabVerified email at in.ibm.com
Gandharv PatilMcGill University, MilaVerified email at mail.mcgill.ca
Dheeraj NagarajResearch Scientist, GoogleVerified email at google.com
Steven I. MarcusProfessor of Electrical and Computer Engineering, University of MarylandVerified email at umd.edu
Andras GyorgyDeepMindVerified email at google.com

Prashanth L.A.

Associate Professor, Department of Computer Science and Engg., IIT Madras

Verified email at cse.iitm.ac.in - Homepage

Reinforcement learning simulation optimization multi-armed bandits


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Stochastic Recursive Algorithms for Optimization: Simultaneous Perturbation Methods S Bhatnagar, HL Prasad, LA Prashanth Springer 434, 302, 2013	425*	2013
Reinforcement Learning With Function Approximation for Traffic Signal Control P LA, S Bhatnagar Intelligent Transportation Systems, IEEE Transactions on, 1-10, 2011	386	2011
Actor-critic algorithms for risk-sensitive MDPs P La, M Ghavamzadeh Advances in neural information processing systems 26, 2013	309	2013
Reinforcement learning with average cost for adaptive control of traffic lights at intersections LA Prashanth, S Bhatnagar 2011 14th International IEEE Conference on Intelligent Transportation …, 2011	90	2011
Cumulative prospect theory meets reinforcement learning: Prediction and control LA Prashanth, C Jie, M Fu, S Marcus, C Szepesvári International Conference on Machine Learning, 1406-1415, 2016	88	2016
Variance-Constrained Actor-Critic Algorithms for Discounted and Average Reward MDPs LA Prashanth, M Ghavamzadeh arXiv preprint arXiv:1403.6530, 2014	82	2014
Policy gradients for CVaR-constrained MDPs LA Prashanth International Conference on Algorithmic Learning Theory, 155-169, 2014	72	2014
Two-timescale algorithms for learning Nash equilibria in general-sum stochastic games HL Prasad, P LA, S Bhatnagar Proceedings of the 2015 International Conference on Autonomous Agents and …, 2015	71	2015
Concentration of risk measures: A Wasserstein distance approach SP Bhat, P LA Advances in neural information processing systems 32, 2019	56	2019
Concentration bounds for empirical conditional value-at-risk: The unbounded case RK Kolla, LA Prashanth, SP Bhat, K Jagannathan Operations Research Letters 47 (1), 16-20, 2019	56	2019
Threshold tuning using stochastic optimization for graded signal control LA Prashanth, S Bhatnagar IEEE Transactions on Vehicular Technology 61 (9), 3865-3880, 2012	54	2012
Stochastic optimization in a cumulative prospect theory framework C Jie, LA Prashanth, M Fu, S Marcus, C Szepesvári IEEE Transactions on Automatic Control 63 (9), 2867-2882, 2018	52	2018
On TD (0) with function approximation: Concentration bounds and a centered variant with exponential convergence N Korda, P La International conference on machine learning, 626-634, 2015	52	2015
Concentration bounds for CVaR estimation: The cases of light-tailed and heavy-tailed distributions LA Prashanth, K Jagannathan, RK Kolla Proceedings of the 37th International Conference on Machine Learning, 5577-5586, 2020	51	2020
Adaptive system optimization using random directions stochastic approximation LA Prashanth, S Bhatnagar, M Fu, S Marcus IEEE Transactions on Automatic Control 62 (5), 2223-2238, 2017	37	2017
Risk-sensitive reinforcement learning: A constrained optimization viewpoint LA Prashanth, M Fu arXiv 2018, 2018	35	2018
Risk-sensitive reinforcement learning via policy gradient search LA Prashanth, MC Fu Foundations and Trends® in Machine Learning 15 (5), 537-693, 2022	29	2022
Analysis of stochastic approximation for efficient least squares regression and LSTD LA Prashanth, N Korda, R Munos arXiv preprint arXiv:1306.2557, 2013	26*	2013
(Bandit) Convex Optimization with Biased Noisy Gradient Oracles X Hu, LA Prashanth, A György, C Szepesvári International Conference on Artificial Intelligence and Statistics (AISTATS …, 2016	18	2016
Simultaneous perturbation Newton algorithms for simulation optimization S Bhatnagar, LA Prashanth Journal of Optimization Theory and Applications 164, 621-643, 2015	18	2015

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors