Barun Patra
Barun Patra
Senior Applied Scientist, Microsoft
Verified email at
Cited by
Cited by
Language is not all you need: Aligning perception with language models
S Huang, L Dong, W Wang, Y Hao, S Singhal, S Ma, T Lv, L Cui, ...
Advances in Neural Information Processing Systems 36, 2024
Bilingual lexicon induction with semi-supervision in non-isometric embedding spaces
B Patra, JRA Moniz, S Garg, MR Gormley, G Neubig
arXiv preprint arXiv:1908.06625, 2019
A length-extrapolatable transformer
Y Sun, L Dong, B Patra, S Ma, S Huang, A Benhaim, V Chaudhary, ...
arXiv preprint arXiv:2212.10554, 2022
On the representation collapse of sparse mixture of experts
Z Chi, L Dong, S Huang, D Dai, S Ma, B Patra, S Singhal, P Bajaj, X Song, ...
Advances in Neural Information Processing Systems 35, 34600-34613, 2022
A survey of community question answering
B Patra
arXiv preprint arXiv:1705.04009, 2017
Foundation transformers
H Wang, S Ma, S Huang, L Dong, W Wang, Z Peng, Y Wu, P Bajaj, ...
arXiv preprint arXiv:2210.06423, 2022
Constrained BERT BiLSTM CRF for understanding multi-sentence entity-seeking questions
D Contractor, B Patra, P Singla
Natural Language Engineering 27 (1), 65-87, 2021
Invariant language modeling
M Peyrard, SS Ghotra, M Josifoski, V Agarwal, B Patra, D Carignan, ...
arXiv preprint arXiv:2110.08413, 2021
TorchScale: Transformers at scale
S Ma, H Wang, S Huang, W Wang, Z Chi, L Dong, A Benhaim, B Patra, ...
arXiv preprint arXiv:2211.13184, 2022
Beyond english-centric bitexts for better multilingual language representation learning
B Patra, S Singhal, S Huang, Z Chi, L Dong, F Wei, V Chaudhary, X Song
arXiv preprint arXiv:2210.14867, 2022
Phi-3 technical report: A highly capable language model locally on your phone
M Abdin, SA Jacobs, AA Awan, J Aneja, A Awadallah, H Awadalla, ...
arXiv preprint arXiv:2404.14219, 2024
On efficiently acquiring annotations for multilingual models
JRA Moniz, B Patra, MR Gormley
arXiv preprint arXiv:2204.01016, 2022
ScopeIt: Scoping task relevant sentences in documents
B Patra, V Suryanarayanan, C Fufa, P Bhattacharya, CC Lee
Proceedings of the 28th International Conference on Computational …, 2020
MAGNETO: a foundation transformer
H Wang, S Ma, S Huang, L Dong, W Wang, Z Peng, Y Wu, P Bajaj, ...
International Conference on Machine Learning, 36077-36092, 2023
Artificial intelligence for identifying relevant content related to specific tasks
P Bhattacharya, B Patra, CY Lee, V Suryanarayanan, CF Fufa
US Patent 11,354,500, 2022
Weakly supervised attention networks for entity recognition
B Patra, JRA Moniz
Proceedings of the 2019 Conference on Empirical Methods in Natural Language …, 2019
A glitch in the Matrix? Locating and detecting language model grounding with Fakepedia
G Monea, M Peyrard, M Josifoski, V Chaudhary, J Eisner, E Kıcıman, ...
arXiv preprint arXiv:2312.02073, 2023
Everything you need to know about multilingual LLMs: Towards fair, performant and reliable models for languages of the world
S Sitaram, M Choudhury, B Patra, V Chaudhary, K Ahuja, K Bali
Proceedings of the 61st Annual Meeting of the Association for Computational …, 2023
The SUMEval 2022 shared task on performance prediction of multilingual pre-trained language models
K Ahuja, A Anastasopoulos, B Patra, G Neubig, M Choudhury, ...
Proceedings of the First Workshop on Scaling Up Multilingual Evaluation, 1-7, 2022
To schedule or not to schedule: extracting task specific temporal entities and associated negation constraints
B Patra, C Fufa, P Bhattacharya, C Lee
arXiv preprint arXiv:2012.02594, 2020
The system can't perform the operation now. Try again later.
Articles 1–20