Follow
Yi Su
Title
Cited by
Cited by
Year
Doubly robust off-policy evaluation with shrinkage
Y Su, M Dimakopoulou, A Krishnamurthy, M Dudík
International Conference on Machine Learning, 2020, 2019
842019
Cab: Continuous adaptive blending for policy evaluation and learning
Y Su, L Wang, M Santacatterina, T Joachims
International Conference on Machine Learning, 6005-6014, 2019
762019
Off-policy bandits with deficient support
N Sachdeva, Y Su, T Joachims
Proceedings of the 26th ACM SIGKDD International Conference on Knowledge …, 2020
652020
Offline rl for natural language generation with implicit language q learning
C Snell, I Kostrikov, Y Su, M Yang, S Levine
arXiv preprint arXiv:2206.11871, 2022
562022
Online adaptation to label distribution shift
R Wu, C Guo, Y Su, KQ Weinberger
Advances in Neural Information Processing Systems 34, 11340-11351, 2021
462021
Adaptive Estimator Selection for Off-Policy Evaluation
Y Su, P Srinath, A Krishnamurthy
International Conference on Machine Learning, 2020, 2020
352020
Optimizing Rankings for Recommendation in Matching Markets
Y Su, M Bayoumi, T Joachims
Proceedings of the ACM Web Conference 2022, 328-338, 2022
182022
Recommendations as treatments
T Joachims, B London, Y Su, A Swaminathan, L Wang
AI Magazine 42 (3), 19-30, 2021
182021
Context-Aware Language Modeling for Goal-Oriented Dialogue Systems
C Snell, S Yang, J Fu, Y Su, S Levine
NAACL, 2022, 2022
172022
Data-driven offline decision-making via invariant representation learning
H Qi, Y Su, A Kumar, S Levine
Advances in Neural Information Processing Systems 35, 13226-13237, 2022
102022
Learning from logged bandit feedback of multiple loggers
Y Su, A Agarwal, T Joachims
ICML Workshop on Machine Learning for Causal Inference, Counterfactual …, 2018
32018
Unified off-policy learning to rank: a reinforcement learning perspective
Z Zhang, Y Su, H Yuan, Y Wu, R Balasubramanian, Q Wu, H Wang, ...
Advances in Neural Information Processing Systems 36, 2024
12024
Value of exploration: Measurements, findings and algorithms
Y Su, X Wang, EY Le, L Liu, Y Li, H Lu, B Lipshitz, S Badam, L Heldt, S Bi, ...
arXiv preprint arXiv:2305.07764, 2023
12023
Long-Term Value of Exploration: Measurements, Findings and Algorithms
Y Su, X Wang, EY Le, L Liu, Y Li, H Lu, B Lipshitz, S Badam, L Heldt, S Bi, ...
Proceedings of the 17th ACM International Conference on Web Search and Data …, 2024
2024
Online Feature Updates Improve Online (Generalized) Label Shift Adaptation
R Wu, S Datta, Y Su, D Baby, YX Wang, KQ Weinberger
arXiv preprint arXiv:2402.03545, 2024
2024
System for effective use of data for personalization
M Dudik, A Krishnamurthy, M Dimakopoulou, Y Su
US Patent App. 18/368,400, 2024
2024
System for effective use of data for personalization
M Dudik, A Krishnamurthy, M Dimakopoulou, Y Su
US Patent 11,798,029, 2023
2023
Nonlinear Bandits Exploration for Recommendations
Y Su, M Chen
Proceedings of the 17th ACM Conference on Recommender Systems, 1054-1057, 2023
2023
2nd Workshop on Online and Adaptive Recommender Systems (OARS)
X Cui, V Dave, Y Su, K Al-Jadda, S Kumar, J McAuley, T Ye, K Aryafar, ...
Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and …, 2022
2022
Off-Policy Evaluation and Learning for Interactive Systems
Y Su
Cornell University, 2021
2021
The system can't perform the operation now. Try again later.
Articles 1–20