QuickUMLS: a fast, unsupervised approach for medical concept extraction L Soldaini, N Goharian Medical Information Retrieval (MedIR) Workshop at SIGIR 2016, 2016 | 267 | 2016 |
SMHD: a large-scale resource for exploring online language usage for multiple mental health conditions A Cohan, B Desmet, A Yates, L Soldaini, S MacAvaney, N Goharian arXiv preprint arXiv:1806.05258, 2018 | 185 | 2018 |
OLMo: Accelerating the science of language models D Groeneveld, I Beltagy, P Walsh, A Bhagia, R Kinney, O Tafjord, AH Jha, ... 🏆 Best Paper Award 🏆 ACL 2024, 2024 | 174* | 2024 |
Dolma: An open corpus of three trillion tokens for language model pretraining research L Soldaini, R Kinney, A Bhagia, D Schwenk, D Atkinson, R Authur, ... 🏆 Best Paper Award 🏆 ACL 2024, 2024 | 133* | 2024 |
Don’t parse, generate! A sequence to sequence architecture for task-oriented semantic parsing S Rongali, L Soldaini, E Monti, W Hamza Proceedings of The Web Conference 2020, 2962-2968, 2020 | 119 | 2020 |
The semantic scholar open data platform R Kinney, C Anastasiades, R Authur, I Beltagy, J Bragg, A Buraczynski, ... arXiv preprint arXiv:2301.10140, 2023 | 102 | 2023 |
What's In My Big Data? Y Elazar, A Bhagia, I Magnusson, A Ravichander, D Schwenk, A Suhr, ... arXiv preprint arXiv:2310.20707, 2023 | 67 | 2023 |
Enhancing web search in the medical domain via query clarification L Soldaini, A Yates, E Yom-Tov, O Frieder, N Goharian Information Retrieval Journal 19, 149-173, 2016 | 59 | 2016 |
The cascade transformer: an application for efficient answer sentence selection L Soldaini, A Moschitti arXiv preprint arXiv:2005.02534, 2020 | 57 | 2020 |
RSDD-Time: Temporal annotation of self-reported mental health diagnoses S MacAvaney, B Desmet, A Cohan, L Soldaini, A Yates, A Zirikly, ... arXiv preprint arXiv:1806.07916, 2018 | 51 | 2018 |
Retrieving medical literature for clinical decision support L Soldaini, A Cohan, A Yates, N Goharian, O Frieder Advances in Information Retrieval: 37th European Conference on IR Research …, 2015 | 47 | 2015 |
One-shot labeling for automatic relevance estimation S MacAvaney, L Soldaini Proceedings of the 46th International ACM SIGIR Conference on Research and …, 2023 | 45 | 2023 |
DataComp-LM: In search of the next generation of training sets for language models J Li, A Fang, G Smyrnis, M Ivgi, M Jordan, S Gadre, H Bansal, E Guha, ... arXiv preprint arXiv:2406.11794, 2024 | 40* | 2024 |
Scim: Intelligent skimming support for scientific papers R Fok, H Kambhamettu, L Soldaini, J Bragg, K Lo, M Hearst, A Head, ... Proceedings of the 28th International Conference on Intelligent User …, 2023 | 34 | 2023 |
Teaching a new dog old tricks: Resurrecting multilingual retrieval using zero-shot learning S MacAvaney, L Soldaini, N Goharian Advances in Information Retrieval: 42nd European Conference on IR Research …, 2020 | 34 | 2020 |
Learning to rank for consumer health search: a semantic approach L Soldaini, N Goharian Advances in Information Retrieval: 39th European Conference on IR Research …, 2017 | 33 | 2017 |
Answer generation for retrieval-based question answering systems CC Hsu, E Lind, L Soldaini, A Moschitti arXiv preprint arXiv:2106.00955, 2021 | 28 | 2021 |
Matching Citation Text and Cited Spans in Biomedical Literature: a Search-Oriented Approach A Cohan, L Soldaini, N Goharian North American Chapter of the Association for Computational Linguistics …, 2015 | 27 | 2015 |
Queer In AI: A Case Study in Community-Led Participatory AI OQ In AI, A Ovalle, A Subramonian, A Singh, C Voelcker, DJ Sutherland, ... 🏆 Best Paper Award 🏆 FAccT 2023, 2023 | 26* | 2023 |
peS2o (Pretraining Efficiently on S2ORC) Dataset L Soldaini, K Lo Allen Institute for AI, Tech. Rep, 2023 | 26 | 2023 |