David Harwath

Citations

	All	Since 2019
Citations	2230	1947
h-index	24	23
i10-index	36	32

Citations per year
Year	Citations
2013	15
2014	19
2015	30
2016	39
2017	72
2018	100
2019	180
2020	245
2021	322
2022	382
2023	595
2024	218

Public access

2 articles

1 article

available

not available

Based on funding mandates

Co-authors

James Glass: MIT Computer Science and Artificial Intelligence Laboratory. Verified email at mit.edu
Rogerio Feris: Research Manager, MIT-IBM Watson AI Lab. Verified email at us.ibm.com
Hilde Kuehne: University of Bonn, MIT-IBM Watson Lab. Verified email at uni-bonn.de
Puyuan Peng: PhD student, The University of Texas at Austin. Verified email at utexas.edu
Samuel Thomas: IBM Research AI. Verified email at us.ibm.com
Andrew Rouditchenko: PhD Student at MIT CSAIL. Verified email at mit.edu
Antonio Torralba: Professor of Computer Science, MIT. Verified email at csail.mit.edu
Brian Kingsbury: Distinguished Research Staff Member and Manager, IBM T. J. Watson Research Center, Yorktown Heights. Verified email at us.ibm.com
Angie Boggust: Massachusetts Institute of Technology. Verified email at mit.edu
Michael Alan Picheny: NYU - Courant CS and CDS. Verified email at nyu.edu
Wei-Ning Hsu: Facebook AI Research (FAIR). Verified email at csail.mit.edu
Rameswar Panda: Research Scientist, MIT-IBM Watson AI Lab. Verified email at ibm.com
Brian Chen: Columbia University. Verified email at columbia.edu
Nina Shvetsova: University of Bonn. Verified email at uni-frankfurt.de
Alexander H. Liu: Massachusetts Institute of Technology. Verified email at mit.edu
Layne Berry: PhD Student, University of Texas at Austin. Verified email at utexas.edu
Galen Chuang: UC Berkeley. Verified email at berkeley.edu
Adrià Recasens: Research Scientist, DeepMind. Verified email at google.com
Dídac Surís: PhD student, Columbia University. Verified email at columbia.edu
Kunio Kashino: Fellow, NTT Corporation. Verified email at ieee.org

David Harwath

The University of Texas at Austin

Verified email at utexas.edu

Speech and Language Processing, Computer Vision, Natural Language Processing, Artificial Intelligence, Machine Learning


Title	Cited by	Year
Unsupervised learning of spoken language with visual context. D Harwath, A Torralba, J Glass. Advances in Neural Information Processing Systems 29, 2016	282	2016
Jointly discovering visual objects and spoken words from raw sensory input. D Harwath, A Recasens, D Surís, G Chuang, A Torralba, J Glass. Proceedings of the European Conference on Computer Vision (ECCV), 649-665, 2018	222	2018
Deep multimodal semantic embeddings for speech and images. D Harwath, J Glass. 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU …, 2015	174	2015
AVLnet: Learning audio-visual language representations from instructional videos. A Rouditchenko, A Boggust, D Harwath, B Chen, D Joshi, S Thomas, ... arXiv preprint arXiv:2006.09199, 2020	132	2020
Everything at once - multi-modal fusion transformer for video retrieval. N Shvetsova, B Chen, A Rouditchenko, S Thomas, B Kingsbury, RS Feris, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022	122	2022
Learning word-like units from joint audio-visual analysis. D Harwath, JR Glass. arXiv preprint arXiv:1701.07481, 2017	122	2017
A summary of the 2012 JHU CLSP workshop on zero resource speech technologies and models of early language acquisition. A Jansen, E Dupoux, S Goldwater, M Johnson, S Khudanpur, K Church, ... 2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013	119	2013
Learning hierarchical discrete linguistic units from visually-grounded speech. D Harwath, WN Hsu, J Glass. arXiv preprint arXiv:1911.09602, 2019	93	2019
MAE-AST: Masked autoencoding audio spectrogram transformer. A Baade, P Peng, D Harwath. arXiv preprint arXiv:2203.16691, 2022	72	2022
Contrastive audio-visual masked autoencoder. Y Gong, A Rouditchenko, AH Liu, D Harwath, L Karlinsky, H Kuehne, ... arXiv preprint arXiv:2210.07839, 2022	69	2022
Multimodal clustering networks for self-supervised learning from unlabeled videos. B Chen, A Rouditchenko, K Duarte, H Kuehne, S Thomas, A Boggust, ... Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021	69	2021
Vision as an interlingua: Learning multilingual semantic embeddings of untranscribed speech. D Harwath, G Chuang, J Glass. 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018	65	2018
Text-free image-to-speech synthesis using learned segmental units. WN Hsu, D Harwath, C Song, J Glass. arXiv preprint arXiv:2012.15454, 2020	63	2020
Spoken moments: Learning joint audio-visual representations from video descriptions. M Monfort, SY Jin, A Liu, D Harwath, R Feris, J Glass, A Oliva. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021	52	2021
Towards visually grounded sub-word speech unit discovery. D Harwath, J Glass. ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	42	2019
Look, Listen, and Decode: Multimodal Speech Recognition with Images. F Sun, D Harwath, J Glass. IEEE Workshop on Spoken Language Technology, 2016	33	2016
Learning modality-invariant representations for speech and images. K Leidal, D Harwath, J Glass. 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2017	32	2017
Zero resource spoken audio corpus analysis. DF Harwath, TJ Hazen, JR Glass. 2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013	32	2013
Why is Winoground hard? Investigating failures in visuolinguistic compositionality. A Diwan, L Berry, E Choi, D Harwath, K Mahowald. arXiv preprint arXiv:2211.00768, 2022	31	2022
Word discovery in visually grounded, self-supervised speech models. P Peng, D Harwath. arXiv preprint arXiv:2203.15081, 2022	31	2022

Articles 1–20
