Vedanuj Goswami
Vedanuj Goswami
Research Engineer, Meta AI
Verified email at
Cited by
Cited by
Llama 2: Open foundation and fine-tuned chat models
H Touvron, L Martin, K Stone, P Albert, A Almahairi, Y Babaei, ...
arXiv preprint arXiv:2307.09288, 2023
Flava: A foundational language and vision alignment model
A Singh*, R Hu*, V Goswami*, G Couairon, W Galuba, M Rohrbach, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
No language left behind: Scaling human-centered machine translation
MR Costa-jussą, J Cross, O Ēelebi, M Elbayad, K Heafield, K Heffernan, ...
arXiv preprint arXiv:2207.04672, 2022
12-in-1: Multi-task vision and language representation learning
J Lu*, V Goswami*, M Rohrbach, D Parikh, S Lee
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020
The hateful memes challenge: Detecting hate speech in multimodal memes
D Kiela, H Firooz, A Mohan, V Goswami, A Singh, P Ringshia, ...
Advances in neural information processing systems 33, 2611-2624, 2020
MMF: A multimodal framework for vision and language research
A Singh, V Goswami, V Natarajan, Y Jiang, X Chen, M Shah, M Rohrbach, ...
URL: https://github. com/facebookresearch/mmf, 0
Only time can tell: Discovering temporal data for temporal modeling
L Sevilla-Lara, S Zha, Z Yan, V Goswami, M Feiszli, L Torresani
Proceedings of the IEEE/CVF winter conference on applications of computer …, 2021
Creative sketch generation
S Ge, V Goswami, CL Zitnick, D Parikh
arXiv preprint arXiv:2011.10039, 2020
The hateful memes challenge: Competition report
D Kiela, H Firooz, A Mohan, V Goswami, A Singh, CA Fitzpatrick, P Bull, ...
NeurIPS 2020 Competition and Demonstration Track, 344-360, 2021
Human-adversarial visual question answering
S Sheng, A Singh, V Goswami, J Magana, T Thrush, W Galuba, D Parikh, ...
Advances in Neural Information Processing Systems 34, 20346-20359, 2021
Are we pretraining it right? digging deeper into visio-linguistic pretraining
A Singh, V Goswami, D Parikh
arXiv preprint arXiv:2004.08744, 2020
Movie: Revisiting modulated convolutions for visual counting and beyond
DK Nguyen, V Goswami, X Chen
arXiv preprint arXiv:2004.11883, 2020
Speechmatrix: A large-scale mined corpus of multilingual speech-to-speech translations
PA Duquenne, H Gong, N Dong, J Du, A Lee, V Goswani, C Wang, J Pino, ...
arXiv preprint arXiv:2211.04508, 2022
Tricks for training sparse translation models
D Dua, S Bhosale, V Goswami, J Cross, M Lewis, A Fan
arXiv preprint arXiv:2110.08246, 2021
Muavic: A multilingual audio-visual corpus for robust speech recognition and robust speech-to-text translation
M Anwar, B Shi, V Goswami, WN Hsu, J Pino, C Wang
arXiv preprint arXiv:2303.00628, 2023
Knowledge extraction and annotation for cross-domain textual case-based reasoning in biologically inspired design
S Rugaber, S Bhati, V Goswami, E Spiliopoulou, S Azad, S Koushik, ...
Case-Based Reasoning Research and Development: 24th International Conference …, 2016
Revisiting machine translation for cross-lingual classification
M Artetxe, V Goswami, S Bhosale, A Fan, L Zettlemoyer
arXiv preprint arXiv:2305.14240, 2023
Causes and cures for interference in multilingual translation
U Shaham, M Elbayad, V Goswami, O Levy, S Bhosale
arXiv preprint arXiv:2212.07530, 2022
Unsupervised image-to-video clothing transfer
A Pumarola, V Goswami, F Vicente, F De la Torre, F Moreno-Noguer
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2019
Small data, big impact: Leveraging minimal data for effective machine translation
J Maillard, C Gao, E Kalbassi, KR Sadagopan, V Goswami, P Koehn, ...
Proceedings of the 61st Annual Meeting of the Association for Computational …, 2023
The system can't perform the operation now. Try again later.
Articles 1–20