Soroosh Mariooryad
Google DeepMind
Verified email at google.com
Title
Cited by
Year
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
G Team, P Georgiev, VI Lei, R Burnell, L Bai, A Gulati, G Tanzer, ...
arXiv preprint arXiv:2403.05530, 2024
Cited by 720, year 2024
Correcting time-continuous emotional labels by modeling the reaction lag of evaluators
S Mariooryad, C Busso
IEEE Transactions on Affective Computing 6 (2), 97-108, 2014
Cited by 137, year 2014
Location-relative attention mechanisms for robust long-form speech synthesis
E Battenberg, RJ Skerry-Ryan, S Mariooryad, D Stanton, D Kao, ...
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
Cited by 132, year 2020
Wave-Tacotron: Spectrogram-free end-to-end text-to-speech synthesis
RJ Weiss, RJ Skerry-Ryan, E Battenberg, S Mariooryad, DP Kingma
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Cited by 124, year 2021
Exploring cross-modality affective reactions for audiovisual emotion recognition
S Mariooryad, C Busso
IEEE Transactions on Affective Computing 4 (2), 183-196, 2013
Cited by 82, year 2013
Analysis and compensation of the reaction lag of evaluators in continuous emotional annotations
S Mariooryad, C Busso
2013 Humaine Association Conference on Affective Computing and Intelligent …, 2013
Cited by 81, year 2013
Compensating for speaker or lexical variabilities in speech for emotion recognition
S Mariooryad, C Busso
Speech Communication 57, 1-12, 2014
Cited by 77, year 2014
Iterative feature normalization scheme for automatic emotion detection from speech
C Busso, S Mariooryad, A Metallinou, S Narayanan
IEEE Transactions on Affective Computing 4 (4), 386-397, 2013
Cited by 77, year 2013
Building a naturalistic emotional speech corpus by retrieving expressive behaviors from existing speech corpora
S Mariooryad, R Lotfian, C Busso
Fifteenth Annual Conference of the International Speech Communication …, 2014
Cited by 66, year 2014
Generating human-like behaviors using joint, speech-driven models for conversational agents
S Mariooryad, C Busso
IEEE Transactions on Audio, Speech, and Language Processing 20 (8), 2329-2340, 2012
Cited by 66, year 2012
Semi-supervised generative modeling for controllable speech synthesis
R Habib, S Mariooryad, M Shannon, E Battenberg, RJ Skerry-Ryan, ...
arXiv preprint arXiv:1910.01709, 2019
Cited by 61, year 2019
Effective use of variational embedding capacity in expressive end-to-end speech synthesis
E Battenberg, S Mariooryad, D Stanton, RJ Skerry-Ryan, M Shannon, ...
arXiv preprint arXiv:1906.03402, 2019
Cited by 58, year 2019
Audiovisual corpus to analyze whisper speech
T Tran, S Mariooryad, C Busso
2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013
Cited by 36, year 2013
Speaker generation
D Stanton, M Shannon, S Mariooryad, RJ Skerry-Ryan, E Battenberg, ...
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by 34, year 2022
Facial expression recognition in the presence of speech using blind lexical compensation
S Mariooryad, C Busso
IEEE Transactions on Affective Computing 7 (4), 346-359, 2015
Cited by 33, year 2015
Touchless user interface navigation using gestures
RL Carceroni, PR Sanketi, S Shah, D Ozkan, S Mariooryad, SMS Tarzjani, ...
US Patent 9,804,679, 2017
Cited by 31, year 2017
The cost of dichotomizing continuous labels for binary classification problems: Deriving a Bayesian-optimal classifier
S Mariooryad, C Busso
IEEE Transactions on Affective Computing 8 (1), 119-130, 2015
Cited by 29, year 2015
Feature and model level compensation of lexical content for facial emotion recognition
S Mariooryad, C Busso
2013 10th IEEE International Conference and Workshops on Automatic Face and …, 2013
Cited by 23, year 2013
Spoken Question Answering and Speech Continuation Using Spectrogram-Powered LLM
E Nachmani, A Levkovitch, R Hirsch, J Salazar, C Asawaroengchai, ...
arXiv preprint arXiv:2305.15255, 2023
Cited by 21, year 2023
Variational embedding capacity in expressive end-to-end speech synthesis
ED Battenberg, D Stanton, RJW Skerry-Ryan, S Mariooryad, DT Kao, ...
US Patent 11,222,621, 2022
Cited by 21, year 2022