Gemini: a family of highly capable multimodal models G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ... arXiv preprint arXiv:2312.11805, 2023 | 568 | 2023 |
Model evaluation for extreme risks T Shevlane, S Farquhar, B Garfinkel, M Phuong, J Whittlestone, J Leung, ... arXiv preprint arXiv:2305.15324, 2023 | 99* | 2023 |
Structured access: an emerging paradigm for safe AI deployment T Shevlane arXiv preprint arXiv:2201.05159, 2022 | 48 | 2022 |
The Offense-Defense Balance of Scientific Knowledge: Does Publishing AI Research Reduce Misuse? T Shevlane, A Dafoe Proceedings of the AAAI/ACM Conference on AI, Ethics, and Society, 173-179, 2020 | 41* | 2020 |
AI policy levers: A review of the US Government’s tools to shape AI research, development, and deployment SC Fischer, J Leung, M Anderljung, C O’Keefe, S Torges, SM Khan, ... Retrieved June 1, 2022, 2021 | 8 | 2021 |
A guide to writing the NeurIPS impact statement C Ashurst, M Anderljung, C Prunkl, J Leike, Y Gal, T Shevlane, A Dafoe Centre for the Governance of AI. URL: https://perma.cc/B5R8-2B9V, 2020 | 8 | 2020 |
Contact tracing apps can help stop coronavirus. But they can hurt privacy T Shevlane, B Garfinkel, A Dafoe The Washington Post, 2020 | 6 | 2020 |
Evaluating Frontier Models for Dangerous Capabilities M Phuong, M Aitchison, E Catt, S Cogan, A Kaskasoli, V Krakovna, ... arXiv preprint arXiv:2403.13793, 2024 | 3* | 2024 |
A Mechanism-Based Approach to Mitigating Harms from Persuasive Generative AI S El-Sayed, C Akbulut, A McCroskery, G Keeling, Z Kenton, Z Jalan, ... arXiv preprint arXiv:2404.15058, 2024 | 2 | 2024 |
The Artefacts of Intelligence: Governing Scientists' Contribution to AI Proliferation T Shevlane University of Oxford, 2022 | 2 | 2022 |