Andrew Critch
UC Berkeley, Department of Electrical Engineering and Computer Sciences
Verified email at eecs.berkeley.edu
Title
Cited by
Year
Aligning AI with shared human values
D Hendrycks, C Burns, S Basart, A Critch, J Li, D Song, J Steinhardt
arXiv preprint arXiv:2008.02275, 2020
Cited by 521 · 2020
Emergent complexity and zero-shot transfer via unsupervised environment design
M Dennis, N Jaques, E Vinitsky, A Bayen, S Russell, A Critch, S Levine
Advances in neural information processing systems 33, 13049-13061, 2020
Cited by 264 · 2020
Alignment for advanced machine learning systems
J Taylor, E Yudkowsky, P LaVictoire, A Critch
Ethics of artificial intelligence, 342-382, 2016
Cited by 154 · 2016
Optimal policies tend to seek power
AM Turner, L Smith, R Shah, A Critch, P Tadepalli
arXiv preprint arXiv:1912.01683, 2019
Cited by 84 · 2019
AI research considerations for human existential safety (ARCHES)
A Critch, D Krueger
arXiv preprint arXiv:2006.04948, 2020
Cited by 70 · 2020
A note on the proportionality between some consistency indices in the AHP
M Brunelli, A Critch, M Fedrizzi
Applied Mathematics and Computation 219 (14), 7901-7906, 2013
Cited by 58 · 2013
The MAGICAL benchmark for robust imitation
S Toyer, R Shah, A Critch, S Russell
Advances in Neural Information Processing Systems 33, 18284-18295, 2020
Cited by 54 · 2020
Logical induction
S Garrabrant, T Benson-Tilsen, A Critch, N Soares, J Taylor
arXiv preprint arXiv:1609.03543, 2016
Cited by 54 · 2016
Algebraic geometry of matrix product states
A Critch, J Morton
SIGMA. Symmetry, Integrability and Geometry: Methods and Applications 10, 095, 2014
Cited by 46 · 2014
TASRA: a taxonomy and analysis of societal-scale risks from AI
A Critch, S Russell
arXiv preprint arXiv:2306.06924, 2023
Cited by 40 · 2023
Clusterability in neural networks
D Filan, S Casper, S Hod, C Wild, A Critch, S Russell
arXiv preprint arXiv:2103.03386, 2021
Cited by 37 · 2021
Human irrationality: both bad and good for reward inference
L Chan, A Critch, A Dragan
arXiv preprint arXiv:2111.06956, 2021
Cited by 28 · 2021
A parametric, resource-bounded generalization of Löb’s theorem, and a robust cooperation criterion for open-source game theory
A Critch
The Journal of Symbolic Logic 84 (4), 1368-1381, 2019
Cited by 24 · 2019
Pruned neural networks are surprisingly modular
D Filan, S Hod, C Wild, A Critch, S Russell
arXiv preprint arXiv:2003.04881, 2020
Cited by 20 · 2020
Graphical clusterability and local specialization in deep neural networks
S Casper, S Hod, D Filan, C Wild, A Critch, S Russell
ICLR 2022 Workshop on PAIR^2Struct: Privacy …, 2022
Cited by 15 · 2022
Negotiable reinforcement learning for Pareto optimal sequential decision-making
N Desai, A Critch, SJ Russell
Advances in Neural Information Processing Systems 31, 2018
Cited by 15 · 2018
Toward negotiable reinforcement learning: shifting priorities in Pareto optimal sequential decision-making
A Critch
arXiv preprint arXiv:1701.01302, 2017
Cited by 15 · 2017
For learning in symmetric teams, local optima are global Nash equilibria
S Emmons, C Oesterheld, A Critch, V Conitzer, S Russell
International Conference on Machine Learning, 5924-5943, 2022
Cited by 12 · 2022
Detecting modularity in deep neural networks
S Hod, S Casper, D Filan, C Wild, A Critch, S Russell
Cited by 12 · 2021
Algebraic geometry of hidden Markov and related models
AJ Critch
University of California, Berkeley, 2013
Cited by 12 · 2013