Arsha Nagrani

Получаване на мой собствен потребителски профил

Позовавания

	Всички	От 2019
Позовавания	10230	10059
h-индекс	32	32
i10-индекс	46	46

3300

1650

825

2475

2018201920202021202220232024135 480 1234 1629 2413 3207 1072

Публичен достъп

Преглед на всички

20 статии

0 статии

налични

неналични

Въз основа на изисквания при финансирането

Съавтори

Andrew ZissermanUniversity of OxfordПотвърден имейл адрес: robots.ox.ac.uk
Joon Son ChungKAISTПотвърден имейл адрес: kaist.ac.kr
Cordelia SchmidResearch director INRIA Потвърден имейл адрес: inria.fr
Chen SunAssistant Professor, Brown UniversityПотвърден имейл адрес: brown.edu
Andrea VedaldiUniversity of OxfordПотвърден имейл адрес: robots.ox.ac.uk
Dima DamenProfessor, University of Bristol and Google DeepMindПотвърден имейл адрес: bristol.ac.uk
Evangelos KazakosCzech Technical University in PragueПотвърден имейл адрес: cvut.cz
Rahul SukthankarGoogle ResearchПотвърден имейл адрес: google.com
Samuel AlbanieAssistant Professor, University of CambridgeПотвърден имейл адрес: cam.ac.uk

Следене

Arsha Nagrani

Research Scientist, Google

Потвърден имейл адрес: google.com - Начална страница

Machine learning Computer Vision Speech Technology Deep Learning


Заглавие Сортиране по цитати Сортиране по година Сортиране по заглавие	Позовавания Позовавания	Година
Voxceleb: a large-scale speaker identification dataset A Nagrani, JS Chung, A Zisserman arXiv preprint arXiv:1706.08612, 2017	2380	2017
Voxceleb2: Deep speaker recognition JS Chung, A Nagrani, A Zisserman arXiv preprint arXiv:1806.05622, 2018	2218	2018
Frozen in time: A joint video and image encoder for end-to-end retrieval M Bain, A Nagrani, G Varol, A Zisserman Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021	705	2021
Voxceleb: Large-scale speaker verification in the wild A Nagrani, JS Chung, W Xie, A Zisserman Computer Speech & Language 60, 101027, 2020	620	2020
Attention bottlenecks for multimodal fusion A Nagrani, S Yang, A Arnab, A Jansen, C Schmid, C Sun Advances in neural information processing systems 34, 14200-14213, 2021	442	2021
Use what you have: Video retrieval using representations from collaborative experts Y Liu, S Albanie, A Nagrani, A Zisserman arXiv preprint arXiv:1907.13487, 2019	400	2019
Utterance-level aggregation for speaker recognition in the wild W Xie, A Nagrani, JS Chung, A Zisserman ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	386	2019
Epic-fusion: Audio-visual temporal binding for egocentric action recognition E Kazakos, A Nagrani, A Zisserman, D Damen Proceedings of the IEEE/CVF international conference on computer vision …, 2019	351	2019
Emotion recognition in speech using cross-modal transfer in the wild S Albanie, A Nagrani, A Vedaldi, A Zisserman Proceedings of the 26th ACM international conference on Multimedia, 292-301, 2018	297	2018
Seeing voices and hearing faces: Cross-modal biometric matching A Nagrani, S Albanie, A Zisserman Proceedings of the IEEE conference on computer vision and pattern …, 2018	227	2018
Chimpanzee face recognition from videos in the wild using deep learning D Schofield, A Nagrani, A Zisserman, M Hayashi, T Matsuzawa, D Biro, ... Science advances 5 (9), eaaw0736, 2019	184	2019
Localizing visual sounds the hard way H Chen, W Xie, T Afouras, A Nagrani, A Vedaldi, A Zisserman Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2021	150	2021
Learnable pins: Cross-modal embeddings for person identity A Nagrani, S Albanie, A Zisserman Proceedings of the European Conference on Computer Vision (ECCV), 71-88, 2018	136	2018
Spot the conversation: speaker diarisation in the wild JS Chung, J Huh, A Nagrani, T Afouras, A Zisserman arXiv preprint arXiv:2007.01216, 2020	134	2020
End-to-end generative pretraining for multimodal video captioning PH Seo, A Nagrani, A Arnab, C Schmid Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022	132	2022
Cough against covid: Evidence of covid-19 signature in cough sounds P Bagad, A Dalmia, J Doshi, A Nagrani, P Bhamare, A Mahale, S Rane, ... arXiv preprint arXiv:2009.08790, 2020	128	2020
Disentangled speech embeddings using cross-modal self-supervision A Nagrani, JS Chung, S Albanie, A Zisserman ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	99	2020
Vid2seq: Large-scale pretraining of a visual language model for dense video captioning A Yang, A Nagrani, PH Seo, A Miech, J Pont-Tuset, I Laptev, J Sivic, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	91	2023
Pali-x: On scaling up a multilingual vision and language model X Chen, J Djolonga, P Padlewski, B Mustafa, S Changpinyo, J Wu, ... arXiv preprint arXiv:2305.18565, 2023	80	2023
Voxsrc 2020: The second voxceleb speaker recognition challenge A Nagrani, JS Chung, J Huh, A Brown, E Coto, W Xie, M McLaren, ... arXiv preprint arXiv:2012.06867, 2020	79	2020

Системата не може да изпълни операцията сега. Опитайте отново по-късно.

Статии 1–20

Позовавания годишно

Дублирани описания

Обединени библиографски описания

Добавяне на съавториСъавтори

Следене

Позовавания

Съавтори