Vedanuj Goswami

Cited by

	All	Since 2019
Citations	13109	13087
h-index	18	18
i10-index	22	22

10000

5000

2500

7500

20192020202120222023202444 136 336 626 2621 9262

Public access

View all

1 article

0 articles

available

not available

Based on funding mandates

Co-authors

Amanpreet SinghContextual AIVerified email at contextual.ai
Devi ParikhPreviously: FAIR and GenAI @ Meta. Georgia TechVerified email at gatech.edu
Marcus RohrbachProfessor for Multimodal Reliable AI, TU Darmstadt, GermanyVerified email at tu-darmstadt.de
Douwe KielaContextual AI, Stanford UniversityVerified email at stanford.edu
Xinlei ChenFAIR, MetaVerified email at meta.com
Ronghang HuResearch Scientist, AI at MetaVerified email at meta.com
Jiasen LuResearch Scientist, AppleVerified email at apple.com
Stefan LeeAssistant Professor, Oregon State UniversityVerified email at oregonstate.edu
Angela FanMeta AI Research, FAIRVerified email at fb.com
Shruti BhosaleFacebook AI ResearchVerified email at fb.com
C. Lawrence ZitnickFAIR (Meta)Verified email at fb.com
Songwei GeUniversity of Maryland, College ParkVerified email at cs.umd.edu
Maha ElbayadResearch scientist, Meta AIVerified email at fb.com
Fernando De la TorreResearch Associate Professor, Carnegie Mellon UniversityVerified email at cs.cmu.edu
Albert PumarolaMeta GenAIVerified email at fb.com

Vedanuj Goswami

Llama Team, Research Engineer, Meta AI

Verified email at meta.com

Natural Language Processing Computer Vision Machine Learning Artificial Intelligence


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Llama 2: Open foundation and fine-tuned chat models H Touvron, L Martin, K Stone, P Albert, A Almahairi, Y Babaei, ... arXiv preprint arXiv:2307.09288, 2023	9256	2023
No language left behind: Scaling human-centered machine translation MR Costa-jussà, J Cross, O Çelebi, M Elbayad, K Heafield, K Heffernan, ... arXiv preprint arXiv:2207.04672, 2022	661	2022
Flava: A foundational language and vision alignment model A Singh, R Hu, V Goswami*, G Couairon, W Galuba, M Rohrbach, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022	628	2022
The hateful memes challenge: Detecting hate speech in multimodal memes D Kiela, H Firooz, A Mohan, V Goswami, A Singh, P Ringshia, ... Advances in neural information processing systems 33, 2611-2624, 2020	571	2020
The llama 3 herd of models A Dubey, A Jauhri, A Pandey, A Kadian, A Al-Dahle, A Letman, A Mathur, ... arXiv preprint arXiv:2407.21783, 2024	543	2024
12-in-1: Multi-task vision and language representation learning J Lu, V Goswami, M Rohrbach, D Parikh, S Lee Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020	538	2020
MMF: A multimodal framework for vision and language research A Singh, V Goswami, V Natarajan, Y Jiang, X Chen, M Shah, M Rohrbach, ... URL: https://github. com/facebookresearch/mmf, 0	368*
Only time can tell: Discovering temporal data for temporal modeling L Sevilla-Lara, S Zha, Z Yan, V Goswami, M Feiszli, L Torresani Proceedings of the IEEE/CVF winter conference on applications of computer …, 2021	82	2021
Creative sketch generation S Ge, V Goswami, CL Zitnick, D Parikh arXiv preprint arXiv:2011.10039, 2020	79	2020
The hateful memes challenge: Competition report D Kiela, H Firooz, A Mohan, V Goswami, A Singh, CA Fitzpatrick, P Bull, ... NeurIPS 2020 Competition and Demonstration Track, 344-360, 2021	65	2021
Human-adversarial visual question answering S Sheng, A Singh, V Goswami, J Magana, T Thrush, W Galuba, D Parikh, ... Advances in Neural Information Processing Systems 34, 20346-20359, 2021	62	2021
Are we pretraining it right? digging deeper into visio-linguistic pretraining A Singh, V Goswami, D Parikh arXiv preprint arXiv:2004.08744, 2020	48	2020
Movie: Revisiting modulated convolutions for visual counting and beyond DK Nguyen, V Goswami, X Chen arXiv preprint arXiv:2004.11883, 2020	33	2020
Speechmatrix: A large-scale mined corpus of multilingual speech-to-speech translations PA Duquenne, H Gong, N Dong, J Du, A Lee, V Goswani, C Wang, J Pino, ... arXiv preprint arXiv:2211.04508, 2022	29	2022
Muavic: A multilingual audio-visual corpus for robust speech recognition and robust speech-to-text translation M Anwar, B Shi, V Goswami, WN Hsu, J Pino, C Wang arXiv preprint arXiv:2303.00628, 2023	26	2023
Tricks for training sparse translation models D Dua, S Bhosale, V Goswami, J Cross, M Lewis, A Fan arXiv preprint arXiv:2110.08246, 2021	22	2021
Revisiting machine translation for cross-lingual classification M Artetxe, V Goswami, S Bhosale, A Fan, L Zettlemoyer arXiv preprint arXiv:2305.14240, 2023	19	2023
Small data, big impact: Leveraging minimal data for effective machine translation J Maillard, C Gao, E Kalbassi, KR Sadagopan, V Goswami, P Koehn, ... Proceedings of the 61st Annual Meeting of the Association for Computational …, 2023	18	2023
Knowledge extraction and annotation for cross-domain textual case-based reasoning in biologically inspired design S Rugaber, S Bhati, V Goswami, E Spiliopoulou, S Azad, S Koushik, ... Case-Based Reasoning Research and Development: 24th International Conference …, 2016	16	2016
Causes and cures for interference in multilingual translation U Shaham, M Elbayad, V Goswami, O Levy, S Bhosale arXiv preprint arXiv:2212.07530, 2022	13	2022

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors