Follow
Kyle O’Brien
Kyle O’Brien
Microsoft, EleutherAI Community
Verified email at microsoft.com - Homepage
Title
Cited by
Cited by
Year
Pythia: A suite for analyzing large language models across training and scaling
S Biderman, H Schoelkopf, QG Anthony, H Bradley, K O’Brien, E Hallahan, ...
ICML 2023, 2397-2430, 2023
9732023
Composable interventions for language models
A Kolbeinsson, K O'Brien, T Huang, S Gao, S Liu, JR Schwarz, A Vaidya, ...
ICLR 2025, 2024
42024
Recite, reconstruct, recollect: Memorization in LMs as a multifaceted phenomenon
US Prashanth*, A Deng*, K O'Brien*, J SV*, MA Khan, J Borkar, ...
ICLR 2025, 2024
32024
Improving Black-box Robustness with In-Context Rewriting
K O'Brien, N Ng, I Puri, J Mendez, H Palangi, Y Kim, M Ghassemi, ...
TMLR, 2024
22024
Steering language model refusal with sparse autoencoders
K O'Brien, D Majercak, X Fernandes, R Edgar, J Chen, H Nori, ...
arXiv preprint arXiv:2411.11296, 2024
12024
The system can't perform the operation now. Try again later.
Articles 1–5