The Kaldi speech recognition toolkit D Povey, A Ghoshal, G Boulianne, L Burget, O Glembek, N Goel, ... IEEE 2011 workshop on automatic speech recognition and understanding, 2011 | 7701 | 2011 |
Machine learning approach to RF transmitter identification K Youssef, L Bouchard, K Haigh, J Silovsky, B Thapa, C Vander Valk IEEE Journal of Radio Frequency Identification 2 (4), 197-205, 2018 | 166 | 2018 |
Challenges in speech processing of Slavic languages (case studies in speech recognition of Czech and Slovak) J Nouza, J Zdansky, P Cerva, J Silovsky Development of Multimodal Interfaces: Active Listening and Synchrony: Second …, 2010 | 44 | 2010 |
Speaker diarization of broadcast streams using two-stage clustering based on i-vectors and cosine distance scoring J Silovsky, J Prazak 2012 IEEE International Conference on Acoustics, Speech and Signal …, 2012 | 35 | 2012 |
Speaker-adaptive speech recognition using speaker diarization for improved transcription of large spoken archives P Cerva, J Silovsky, J Zdansky, J Nouza, L Seps Speech Communication 55 (10), 1033-1046, 2013 | 27 | 2013 |
Enhancement of emotion detection in spoken dialogue systems by combining several information sources R López-Cózar, J Silovsky, M Kroul Speech Communication 53 (9-10), 1210-1228, 2011 | 25 | 2011 |
Making Czech historical radio archive accessible and searchable for wide public J Nouza, K Blavka, P Cerva, J Zdansky, J Silovsky, M Bohac, J Prazak Journal of Multimedia 7 (2), 159, 2012 | 24 | 2012 |
Speaker diarization using PLDA-based speaker clustering J Prazak, J Silovsky Proceedings of the 6th IEEE international conference on intelligent data …, 2011 | 22 | 2011 |
Sage: The new BBN speech processing platform R Hsiao, R Meermeier, T Ng, Z Huang, M Jordan, E Kan, T Alumäe, ... submission to Interspeech, 2016 | 21 | 2016 |
Incorporation of the ASR output in speaker segmentation and clustering within the task of speaker diarization of broadcast streams J Silovsky, J Zdansky, J Nouza, P Cerva, J Prazak Multimedia Signal Processing (MMSP), 2012 IEEE 14th International Workshop …, 2012 | 20 | 2012 |
Fast keyword spotting in telephone speech J Nouza, J Silovsky Radioengineering 18 (4), 665-670, 2009 | 20 | 2009 |
Voice technology to enable sophisticated access to historical audio archive of the Czech radio J Nouza, K Blavka, M Bohac, P Cerva, J Zdansky, J Silovsky, J Prazak Multimedia for Cultural Heritage: First International Workshop, MM4CH 2011 …, 2012 | 19 | 2012 |
Speech-To-Text Technology to Transcribe and Disclose 100,000+ Hours of Bilingual Documents from Historical Czech and Czechoslovak Radio Archive J Nouza, P Cerva, J Zdansky, K Blavka, M Bohac, J Silovsky, J Chaloupka, ... Fifteenth Annual Conference of the International Speech Communication …, 2014 | 18 | 2014 |
Czech-to-Slovak adapted broadcast news transcription system J Nouza, J Silovsky, J Zdansky, P Cerva, M Kroul, J Chaloupka Ninth Annual Conference of the International Speech Communication Association, 2008 | 17 | 2008 |
Speech, speaker and speaker's gender identification in automatically processed broadcast stream J Silovský, J Nouza Radioengineering, 2006 | 16 | 2006 |
Real-time lecture transcription using ASR for Czech hearing impaired or deaf students P Cerva, J Silovsky, J Zdansky, J Nouza, J Malek Thirteenth Annual Conference of the International Speech Communication …, 2012 | 15 | 2012 |
Improving language identification for multilingual speakers A Titus, J Silovsky, N Chen, R Hsiao, M Young, A Ghoshal ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 14 | 2020 |
PLDA-based clustering for speaker diarization of broadcast streams J Silovsky, J Prazak, P Cerva, J Zdansky, J Nouza Twelfth Annual Conference of the International Speech Communication Association, 2011 | 14 | 2011 |
Variable attention masking for configurable transformer transducer speech recognition P Swietojanski, S Braun, D Can, TF Da Silva, A Ghoshal, T Hori, R Hsiao, ... ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 13 | 2023 |
Adapting lexical and language models for transcription of highly spontaneous spoken Czech J Nouza, J Silovský Text, Speech and Dialogue, 377-384, 2010 | 13 | 2010 |