SALMONN: Towards Generic Hearing Abilities for Large Language Models C Tang, W Yu, G Sun, X Chen, T Tan, W Li, L Lu, Z Ma, C Zhang The Twelfth International Conference on Learning Representations, 2024 | 22 | 2024 |
MT4SSL: Boosting Self-Supervised Speech Representation Learning by Integrating Multiple Targets Z Ma, Z Zhen, C Tang, Y Wang, X Chen Proc. Interspeech 2023, 2022 | 18 | 2022 |
Connecting Speech Encoder and Large Language Model for ASR W Yu, C Tang, G Sun, X Chen, T Tan, W Li, L Lu, Z Ma, C Zhang ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 6 | 2024 |
Front-End Adapter: Adapting Front-End Input of Speech based Self-Supervised Learning for Speech Recognition X Chen, Z Ma, C Tang, Y Wang, Z Zheng ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 3 | 2023 |
Fine-grained Audio-Visual Joint Representations for Multimodal Large Language Models G Sun, W Yu, C Tang, X Chen, T Tan, W Li, L Lu, Z Ma, C Zhang arXiv preprint arXiv:2310.05863, 2023 | 1 | 2023 |
Exploring Effective Fusion Algorithms for Speech Based Self-Supervised Learning Models C Tang, Y Wang, X Chen, WQ Zhang National Conference on Man-Machine Speech Communication, 2022, 2022 | 1 | 2022 |
Exploring Effective Distillation of Self-Supervised Speech Models for Automatic Speech Recognition Y Wang, C Tang, Z Ma, Z Zheng, X Chen, WQ Zhang 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 2022 | 1 | 2022 |
Extending Large Language Models for Speech and Audio Captioning C Tang, W Yu, G Sun, X Chen, T Tan, W Li, L Lu, Z Ma, C Zhang ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | | 2024 |