Chain of thought prompting elicits reasoning in large language models J Wei, X Wang, D Schuurmans, M Bosma, E Chi, Q Le, D Zhou NeurIPS 2022, 2022 | 10375* | 2022 |
PaLM: Scaling Language Modeling with Pathways A Chowdhery, S Narang, J Devlin, M Bosma, G Mishra, A Roberts, ... Journal of Machine Learning Research, 2023 | 5225 | 2023 |
Finetuned language models are zero-shot learners J Wei, M Bosma, VY Zhao, K Guu, AW Yu, B Lester, N Du, AM Dai, QV Le ICLR 2022, 2021 | 3202 | 2021 |
Emergent abilities of large language models J Wei, Y Tay, R Bommasani, C Raffel, B Zoph, S Borgeaud, D Yogatama, ... Transactions on Machine Learning Research, 2022b, 2022 | 3044* | 2022 |
Lamda: Language models for dialog applications R Thoppilan, D De Freitas, J Hall, N Shazeer, A Kulshreshtha, HT Cheng, ... arXiv preprint arXiv:2201.08239, 2022 | 1653 | 2022 |
Program synthesis with large language models J Austin, A Odena, M Nye, M Bosma, H Michalewski, D Dohan, E Jiang, ... arXiv preprint arXiv:2108.07732, 2021 | 1290 | 2021 |
Beyond the imitation game: Quantifying and extrapolating the capabilities of language models A Srivastava, A Rastogi, A Rao, AAM Shoeb, A Abid, A Fisch, AR Brown, ... arXiv preprint arXiv:2206.04615, 2022 | 1186 | 2022 |
GLaM: Efficient scaling of language models with mixture-of-experts N Du, Y Huang, AM Dai, S Tong, D Lepikhin, Y Xu, M Krikun, Y Zhou, ... International Conference on Machine Learning, 5547-5569, 2022 | 709* | 2022 |
Show your work: Scratchpads for intermediate computation with language models M Nye, AJ Andreassen, G Gur-Ari, H Michalewski, J Austin, D Bieber, ... ICLR 2022 Workshop DL4C, 2021 | 550 | 2021 |
Scaling up models and data with t5x and seqio A Roberts, HW Chung, G Mishra, A Levskaya, J Bradbury, D Andor, ... Journal of Machine Learning Research 24 (377), 1-8, 2023 | 155 | 2023 |
Emergent abilities of large language models. arXiv 2022 J Wei, Y Tay, R Bommasani, C Raffel, B Zoph, S Borgeaud, D Yogatama, ... arXiv preprint arXiv:2206.07682, 2023 | 60 | 2023 |
Program synthesis with large language models. CoRR abs/2108.07732 (2021) J Austin, A Odena, MI Nye, M Bosma, H Michalewski, D Dohan, E Jiang, ... arXiv preprint arXiv:2108.07732, 2021 | 55 | 2021 |
A framework for unsupervised spam detection in social networking sites M Bosma, E Meij, W Weerkamp European Conference on Information Retrieval, 364-375, 2012 | 53 | 2012 |
Ichter, b J Wei, X Wang, D Schuurmans, M Bosma Xia, F., et al.(2022b).“Chain-of-thought prompting elicits reasoning in …, 0 | 16 | |
Performing machine learning tasks using instruction-tuned neural networks JW Wei, MP Bosma, Y Zhao, K Gu, QV Le US Patent App. 17/561,581, 2023 | 6 | 2023 |
Inflection-1 Inflection-AI https://inflection.ai/assets/Inflection-1.pdf, 2023 | 6* | 2023 |
System and method for automatically selecting images to accompany text M Heyward, M Bosma, S Brotherton, C DePue III, MEG Contreras, ... US Patent 9,075,812, 2015 | 6 | 2015 |
Deterministic training of machine learning models G Mishra, AJ Roberts, MP Bosma, NM Shazeer US Patent 12,014,276, 2024 | | 2024 |
Prompting Machine-Learned Models Using Chains of Thought JW Wei, D Zhou, DE Schuurmans, QV Le, MP Bosma, EHH Chi, ... US Patent App. 17/881,746, 2023 | | 2023 |
Inflection-2 Inflection-AI https://inflection.ai/inflection-2, 2023 | | 2023 |