Kai Zhen
Alexa Speech
Verified email at amazon.com
Title / Cited by / Year
Cascaded cross-module residual learning towards lightweight end-to-end speech coding
K Zhen, J Sung, MS Lee, S Beack, M Kim
arXiv preprint arXiv:1906.07769, 2019
41 citations, 2019
Psychoacoustic calibration of loss functions for efficient end-to-end neural audio coding
K Zhen, MS Lee, J Sung, S Beack, M Kim
IEEE Signal Processing Letters 27, 2159-2163, 2020
25 citations, 2020
Efficient and scalable neural residual waveform coding with collaborative quantization
K Zhen, MS Lee, J Sung, S Beack, M Kim
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
17 citations, 2020
Scalable and efficient neural speech coding: A hybrid design
K Zhen, J Sung, MS Lee, S Beack, M Kim
IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 12-25, 2021
15* citations, 2021
Sparsification via compressed sensing for automatic speech recognition
K Zhen, HD Nguyen, FJ Chang, A Mouchtaris, A Rastrow
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
13 citations, 2021
Audio signal encoding method and apparatus and audio signal decoding method and apparatus using psychoacoustic-based weighted error function
J Sung, M Kim, A Sivaraman, K Zhen
US Patent 11,416,742, 2022
12 citations, 2022
Sub-8-bit quantization aware training for 8-bit neural network accelerator with on-device speech recognition
K Zhen, HD Nguyen, R Chinta, N Susanj, A Mouchtaris, T Afzal, ...
arXiv preprint arXiv:2206.15408, 2022
12* citations, 2022
Source-aware neural speech coding for noisy speech compression
H Yang, K Zhen, S Beack, M Kim
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
11 citations, 2021
On psychoacoustically weighted cost functions towards resource-efficient deep neural networks for speech denoising
K Zhen, A Sivaraman, J Sung, M Kim
arXiv preprint arXiv:1801.09774, 2018
10 citations, 2018
A dual-staged context aggregation method towards efficient end-to-end speech enhancement
K Zhen, MS Lee, M Kim
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
9* citations, 2020
Conmer: Streaming Conformer without self-attention for interactive voice assistants
M Radfar, P Lyskawa, B Trujillo, Y Xie, K Zhen, J Heymann, D Filimonov, ...
4 citations, 2023
A functional flavor of service composition
L Bao, Q Li, K Zhen, W Xiang, P Chen
2011 Eighth International Conference on Fuzzy Systems and Knowledge …, 2011
2 citations, 2011
Residual coding method of linear prediction coding coefficient based on collaborative quantization, and computing device for performing the method
M Kim, K Zhen, MS Lee, SK Beack, J Sung, TJ Lee, JS Choi
US Patent 11,488,613, 2022
1 citation, 2022
Audio signal encoding method and audio signal decoding method, and encoder and decoder performing the same
MS Lee, J Sung, M Kim, K Zhen
US Patent 11,276,413, 2022
1 citation, 2022
Hybrid supervised-unsupervised image topic visualization with convolutional neural network and LDA. arXiv
K Zhen, M Birla, D Crandall, B Zhang, J Qiu
1 citation, 2017
Max-margin transducer loss: Improving sequence-discriminative training using a large-margin learning strategy
RV Swaminathan, G Strimel, A Rastrow, H Mallidi, K Zhen, HD Nguyen, ...
2024
Apparatus and method for speech processing using a densely connected hybrid neural network
M Kim, MS Lee, SK Beack, J Sung, TJ Lee, JS Choi, K Zhen
US Patent 11,837,220, 2023
2023
Method and apparatus for processing audio signal
MS Lee, SK Beack, J Sung, TJ Lee, JS Choi, M Kim, K Zhen
US Patent 11,790,926, 2023
2023
Neural Waveform Coding: Scalability, Efficiency and Psychoacoustic Calibration
K Zhen
Indiana University, 2021
2021
A Hybrid Supervised-unsupervised Method on Image Topic Visualization with Convolutional Neural Network and LDA
K Zhen, M Birla, D Crandall, B Zhang, J Qiu
arXiv preprint arXiv:1703.05243, 2017
2017