Pythia: A suite for analyzing large language models across training and scaling S Biderman, H Schoelkopf, QG Anthony, H Bradley, K O’Brien, E Hallahan, ... International Conference on Machine Learning, 2397-2430, 2023 | 1093 | 2023 |
Gpt-neox-20b: An open-source autoregressive language model S Black, S Biderman, E Hallahan, Q Anthony, L Gao, L Golding, H He, ... arXiv preprint arXiv:2204.06745, 2022 | 856 | 2022 |
Emergent and predictable memorization in large language models S Biderman, U Prashanth, L Sutawika, H Schoelkopf, Q Anthony, ... Advances in Neural Information Processing Systems 36, 28072-28090, 2023 | 164 | 2023 |
Roentgen: vision-language foundation model for chest x-ray generation P Chambon, C Bluethgen, JB Delbrouck, R Van der Sluijs, M Połacin, ... arXiv preprint arXiv:2211.12737, 2022 | 131 | 2022 |
Gpt-neox-20b: An open-source autoregressive language model, 2022 S Black, S Biderman, E Hallahan, Q Anthony, L Gao, L Golding, H He, ... URL https://arxiv. org/abs/2204.06745, 2022 | 42 | 2022 |
A vision–language foundation model for the generation of realistic chest x-ray images C Bluethgen, P Chambon, JB Delbrouck, R van der Sluijs, M Połacin, ... Nature Biomedical Engineering, 1-13, 2024 | 22 | 2024 |
Gpt-neox-20b: An open-source autoregressive language model. arXiv 2022 S Black, S Biderman, E Hallahan, Q Anthony, L Gao, L Golding, H He, ... arXiv preprint arXiv:2204.06745, 0 | 19 | |
GPT-NeoX-20B: An open-source autoregressive language model. ArXiv S Black, S Biderman, E Hallahan, Q Anthony, L Gao, L Golding, H He, ... arXiv preprint arXiv:2204.06745, 2022 | 18 | 2022 |