Sebastian Gehrmann
Head of Responsible AI, CTO Office, Bloomberg LP
Verified email at bloomberg.net
Title
Cited by
Year
PaLM: Scaling language modeling with pathways
A Chowdhery, S Narang, J Devlin, M Bosma, G Mishra, A Roberts, ...
arXiv preprint arXiv:2204.02311, 2022
Cited by: 5112, Year: 2022
BLOOM: A 176B-parameter open-access multilingual language model
T Le Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow, R Castagné, ...
Cited by: 1622, Year: 2023
PaLM 2 technical report
R Anil, AM Dai, O Firat, M Johnson, D Lepikhin, A Passos, S Shakeri, ...
arXiv preprint arXiv:2305.10403, 2023
Cited by: 1427, Year: 2023
Beyond the imitation game: Quantifying and extrapolating the capabilities of language models
A Srivastava, A Rastogi, A Rao, AAM Shoeb, A Abid, A Fisch, AR Brown, ...
arXiv preprint arXiv:2206.04615, 2022
Cited by: 1170, Year: 2022
Bottom-up abstractive summarization
S Gehrmann, Y Deng, AM Rush
EMNLP 2018, 2018
Cited by: 850, Year: 2018
BloombergGPT: A large language model for finance
S Wu, O Irsoy, S Lu, V Dabravolski, M Dredze, S Gehrmann, P Kambadur, ...
arXiv preprint arXiv:2303.17564, 2023
Cited by: 779, Year: 2023
Challenging big-bench tasks and whether chain-of-thought can solve them
M Suzgun, N Scales, N Schärli, S Gehrmann, Y Tay, HW Chung, ...
ACL Findings 2023, 2022
Cited by: 555, Year: 2022
LSTMVis: A tool for visual analysis of hidden state dynamics in recurrent neural networks
H Strobelt*, S Gehrmann*, H Pfister, AM Rush
IEEE transactions on visualization and computer graphics 24 (1), 667-676, 2017
Cited by: 555, Year: 2017
GLTR: Statistical detection and visualization of generated text
S Gehrmann*, H Strobelt*, AM Rush
ACL Demo 2019, 2019
Cited by: 525, Year: 2019
Investigating gender bias in language models using causal mediation analysis
J Vig*, S Gehrmann*, Y Belinkov*, S Qian, D Nevo, Y Singer, S Shieber
NeurIPS 2020 33, 12388-12401, 2020
Cited by: 494*, Year: 2020
ToTTo: A controlled table-to-text generation dataset
AP Parikh, X Wang, S Gehrmann, M Faruqui, B Dhingra, D Yang, D Das
EMNLP 2020, 2020
Cited by: 362, Year: 2020
Accelerated antimicrobial discovery via deep generative models and molecular dynamics simulations
P Das, T Sercu, K Wadhawan, I Padhi, S Gehrmann, F Cipcigan, ...
Nature Biomedical Engineering 5 (6), 613-623, 2021
Cited by: 301, Year: 2021
Comparing deep learning and concept extraction based methods for patient phenotyping from clinical narratives
S Gehrmann, F Dernoncourt, Y Li, ET Carlson, JT Wu, J Welt, J Foote Jr, ...
PloS one 13 (2), e0192360, 2018
Cited by: 295*, Year: 2018
Seq2Seq-Vis: A visual debugging tool for sequence-to-sequence models
H Strobelt*, S Gehrmann*, M Behrisch, A Perer, H Pfister, AM Rush
IEEE transactions on visualization and computer graphics 25 (1), 353-363, 2018
Cited by: 279, Year: 2018
The language interpretability tool: Extensible, interactive visualizations and analysis for NLP models
I Tenney, J Wexler, J Bastings, T Bolukbasi, A Coenen, S Gehrmann, ...
ACL Demo 2020, 2020
Cited by: 212, Year: 2020
exBERT: A visual analysis tool to explore learned representations in transformers models
B Hoover, H Strobelt, S Gehrmann
EMNLP Demo 2019, 2019
Cited by: 193, Year: 2019
The GEM benchmark: Natural language generation, its evaluation and metrics
S Gehrmann, T Adewumi, K Aggarwal, PS Ammanamanchi, ...
GEM Workshop at ACL 2021, 2021
Cited by: 150, Year: 2021
Repairing the cracked foundation: A survey of obstacles in evaluation practices for generated text
S Gehrmann, E Clark, T Sellam
JAIR, 2022
Cited by: 149, Year: 2022
PaLM: Scaling language modeling with pathways. arXiv 2022
A Chowdhery, S Narang, J Devlin, M Bosma, G Mishra, A Roberts, ...
arXiv preprint arXiv:2204.02311 10, 2022
Cited by: 115, Year: 2022
Causal analysis of syntactic agreement mechanisms in neural language models
M Finlayson, A Mueller, S Gehrmann, S Shieber, T Linzen, Y Belinkov
ACL 2021, 2021
Cited by: 91, Year: 2021