Importance Resampling for Off-policy Prediction M Schlegel, W Chung, D Graves, J Qian, M White Advances in Neural Information Processing Systems, 1797-1807, 2019 | 11 | 2019 |
General value function networks M Schlegel, A Jacobsen, Z Abbas, A Patterson, A White, M White Journal of Artificial Intelligence Research 70, 497-543, 2021 | 8 | 2021 |
Meta-descent for online, continual prediction A Jacobsen, M Schlegel, C Linke, T Degris, A White, M White Proceedings of the AAAI Conference on Artificial Intelligence 33 (01), 3943-3950, 2019 | 8 | 2019 |
Context-dependent upper-confidence bounds for directed exploration R Kumaraswamy, M Schlegel, A White, M White arXiv preprint arXiv:1811.06629, 2018 | 6 | 2018 |
Stable predictive representations with general value functions for continual learning M Schlegel, A White, M White Continual Learning and Deep Networks workshop at the Neural Information …, 2017 | 2 | 2017 |
Adapting kernel representations online using submodular maximization M Schlegel, Y Pan, J Chen, M White International Conference on Machine Learning, 3037-3046, 2017 | 1 | 2017 |
Meta-descent for Online, Continual Prediction Download PDF A Jacobsen, M Schlegel, C Linke, T Degris, A White, M White | | |
General Value Function Networks Download PDF M Schlegel, A Jacobsen, M Zaheer, A Patterson, A White, M White | | |