Title | Authors | Venue | Cited by | Year |
Derivative-free methods for policy optimization: Guarantees for linear quadratic systems | D Malik, A Pananjady, K Bhatia, K Khamaru, PL Bartlett, MJ Wainwright | Journal of Machine Learning Research 21 (21), 1-51, 2020 | 215 | 2020 |
Convergence guarantees for a class of non-convex and non-smooth optimization problems | K Khamaru, MJ Wainwright | Journal of Machine Learning Research 20 (154), 1-52, 2019 | 75 | 2019 |
Singularity, Misspecification, and the Convergence Rate of EM | R Dwivedi*, N Ho*, K Khamaru*, MI Jordan, MJ Wainwright, B Yu | arXiv preprint arXiv:1810.00828, 2018 | 69 | 2018 |
Is temporal difference learning optimal? An instance-dependent analysis | K Khamaru, A Pananjady, F Ruan, MJ Wainwright, MI Jordan | SIAM Journal on Mathematics of Data Science 3 (4), 1013-1040, 2021 | 68 | 2021 |
Sharp analysis of expectation-maximization for weakly identifiable models | R Dwivedi, N Ho, K Khamaru, M Wainwright, M Jordan, B Yu | International Conference on Artificial Intelligence and Statistics, 1866-1876, 2020 | 28 | 2020 |
Instance-optimality in optimal value estimation: Adaptivity via variance-reduced Q-learning | K Khamaru, E Xia, MJ Wainwright, MI Jordan | arXiv preprint arXiv:2106.14352, 2021 | 23 | 2021 |
Near-optimal inference in adaptive linear regression | K Khamaru, Y Deshpande, T Lattimore, L Mackey, MJ Wainwright | arXiv preprint arXiv:2107.02266, 2021 | 22 | 2021 |
Computation of the maximum likelihood estimator in low-rank factor analysis | K Khamaru, R Mazumder | Mathematical Programming 176, 279-310, 2019 | 21 | 2019 |
A peak synchronization measure for multiple signals | R Biswas, K Khamaru, KK Majumdar | IEEE Transactions on Signal Processing 62 (17), 4390-4398, 2014 | 20 | 2014 |
Theoretical guarantees for EM under misspecified Gaussian mixture models | R Dwivedi*, N Ho*, K Khamaru*, MJ Wainwright, ... | Advances in Neural Information Processing Systems, 9704-9712, 2018 | 18 | 2018 |
Instability, computational efficiency and statistical accuracy | N Ho, K Khamaru, R Dwivedi, MJ Wainwright, MI Jordan, B Yu | arXiv preprint arXiv:2005.11411, 2020 | 16 | 2020 |
Optimal variance-reduced stochastic approximation in Banach spaces | W Mou, K Khamaru, MJ Wainwright, PL Bartlett, MI Jordan | arXiv preprint arXiv:2201.08518, 2022 | 13 | 2022 |
Challenges with EM in application to weakly identifiable mixture models | R Dwivedi*, N Ho*, K Khamaru*, MJ Wainwright, MI Jordan, B Yu | arXiv preprint arXiv:1902.00194, 2019 | 10 | 2019 |
Instance-dependent confidence and early stopping for reinforcement learning | E Xia, K Khamaru, MJ Wainwright, MI Jordan | Journal of Machine Learning Research 24 (392), 1-43, 2023 | 6 | 2023 |
Adaptive linear estimating equations | M Ying, K Khamaru, CH Zhang | Advances in Neural Information Processing Systems 36, 2024 | 3 | 2024 |
Semi-parametric inference based on adaptively collected data | L Lin, K Khamaru, MJ Wainwright | arXiv preprint arXiv:2303.02534, 2023 | 3 | 2023 |
Instance-dependent confidence and early stopping for reinforcement learning | K Khamaru, E Xia, MJ Wainwright, MI Jordan | arXiv preprint arXiv:2201.08536, 2022 | 3 | 2022 |
Informativeness of Weighted Conformal Prediction | M Ying, W Guo, K Khamaru, Y Hung | arXiv preprint arXiv:2405.06479, 2024 | 2 | 2024 |
Instance-optimality in optimal value estimation: Adaptivity via variance-reduced Q-learning | E Xia, K Khamaru, MJ Wainwright, MI Jordan | IEEE Transactions on Information Theory, 2024 | 1 | 2024 |
Statistical limits of adaptive linear models: low-dimensional estimation and inference | L Lin, M Ying, S Ghosh, K Khamaru, CH Zhang | Advances in Neural Information Processing Systems 36, 2024 | 1 | 2024 |