SPViT: Enabling Faster Vision Transformers via Latency-Aware Soft Token Pruning Z Kong, P Dong, X Ma, X Meng, W Niu, M Sun, X Shen, G Yuan, B Ren, ... European Conference on Computer Vision (ECCV), 620-640, 2022 | 105 | 2022 |
Efficient transformer-based large scale language representations using hardware-friendly block structured pruning B Li*, Z Kong*, T Zhang, J Li, Z Li, H Liu, C Ding Findings of the Association for Computational Linguistics: EMNLP 2020, 2020 | 52* | 2020 |
Mest: Accurate and fast memory-economic sparse training framework on the edge G Yuan, X Ma, W Niu, Z Li, Z Kong, N Liu, Y Gong, Z Zhan, C He, Q Jin, ... Advances in Neural Information Processing Systems 34, 20838-20850, 2021 | 40 | 2021 |
Zhenglun Kong, Ning Liu, Yifan Gong, Zheng Zhan, Chaoyang He, Qing Jin, et al. Mest: Accurate and fast memory-economic sparse training framework on the edge G Yuan, X Ma, W Niu, Z Li Advances in Neural Information Processing Systems 34 (20838-20850), 2, 2021 | 36 | 2021 |
Accelerating framework of transformer by hardware design and model compression co-optimization P Qi, EHM Sha, Q Zhuge, H Peng, S Huang, Z Kong, Y Song, B Li 2021 IEEE/ACM International Conference On Computer Aided Design (ICCAD), 1-9, 2021 | 33 | 2021 |
Improving dnn fault tolerance using weight pruning and differential crossbar mapping for reram-based edge ai G Yuan, Z Liao, X Ma, Y Cai, Z Kong, X Shen, J Fu, Z Li, C Zhang, H Peng, ... 2021 22nd International Symposium on Quality Electronic Design (ISQED), 135-141, 2021 | 31 | 2021 |
Automatic tissue image segmentation based on image processing and deep learning Z Kong, T Li, J Luo, S Xu Journal of healthcare engineering 2019, 2019 | 29 | 2019 |
NPAS: A Compiler-aware Framework of Unified Network Pruning and Architecture Search for Beyond Real-Time Mobile Acceleration Z Li, G Yuan, W Niu, P Zhao, Y Li, Y Cai, X Shen, Z Zhan, Z Kong, Q Jin, ... Conference on Computer Vision and Pattern Recognition (CVPR) Oral, 14255-14266, 2021 | 27 | 2021 |
HeatViT: Hardware-Efficient Adaptive Token Pruning for Vision Transformers P Dong, M Sun, A Lu, Y Xie, K Liu, Z Kong, X Meng, Z Li, X Lin, Z Fang, ... International Symposium on High-Performance Computer Architecture (HPCA), 2022 | 21* | 2022 |
A Compression-Compilation Framework for On-mobile Real-time BERT Applications W Niu*, Z Kong*, G Yuan, W Jiang, J Guan, C Ding, P Zhao, S Liu, B Ren, ... International Joint Conference on Artificial Intelligence (IJCAI), 2021 | 21* | 2021 |
SS-Auto: A single-shot, automatic structured weight pruning framework of DNNs with ultra-high efficiency Z Li, Y Gong, X Ma, S Liu, M Sun, Z Zhan, Z Kong, G Yuan, Y Wang arXiv preprint arXiv:2001.08839, 2020 | 18 | 2020 |
HMC-TRAN A Tensor-core Inspired Hierarchical Model Compression for Transformer-based DNNs on GPU S Huang, S Chen, H Peng, D Manu, Z Kong, G Yuan, L Yang, S Wang, ... Proceedings of the 2021 on Great Lakes Symposium on VLSI, 169-174, 2021 | 17* | 2021 |
You Need Multiple Exiting: Dynamic Early Exiting for Accelerating Unified Vision Language Model S Tang, Y Wang, Z Kong, T Zhang, Y Li, C Ding, Y Wang, Y Liang, D Xu Conference on Computer Vision and Pattern Recognition (CVPR), 2022 | 15* | 2022 |
Peeling the Onion: Hierarchical Reduction of Data Redundancy for Efficient Vision Transformer Training Z Kong, H Ma, G Yuan, M Sun, Y Xie, P Dong, X Meng, X Shen, H Tang, ... AAAI Conference on Artificial Intelligence (AAAI) Oral, 2023 | 10 | 2023 |
Layer Freezing & Data Sieving: Missing Pieces of a Generic Framework for Sparse Training G Yuan, Y Li, S Li, Z Kong, S Tulyakov, X Tang, Y Wang, J Ren Advances in Neural Information Processing Systems (NeurIPS), 2022 | 9* | 2022 |
Automatic tissue image segmentation based on image processing and deep learning Z Kong, J Luo, S Xu, T Li Neural Imaging and Sensing 2018 10481, 79-85, 2018 | 9 | 2018 |
Data level lottery ticket hypothesis for vision transformers X Shen, Z Kong, M Qin, P Dong, G Yuan, X Meng, H Tang, X Ma, Y Wang International Joint Conference on Artificial Intelligence (IJCAI) Oral, 2023 | 7* | 2023 |
Zhenglun Kong, Geng Yuan, and Yanzhi Wang. 2020. SS-Auto: A single-shot, automatic structured weight pruning framework of DNNs with ultra-high efficiency Z Li, Y Gong, X Ma, S Liu, M Sun, Z Zhan arXiv preprint arXiv:2001.08839, 2020 | 5 | 2020 |
Automatical and accurate segmentation of cerebral tissues in fMRI dataset with combination of image processing and deep learning Z Kong, J Luo, S Xu, T Li Optics and Biophotonics in Low-Resource Settings IV 10485, 24-30, 2018 | 5 | 2018 |
Zhenglun Kong, Qing Jin, Zhiyu Chen, Sijia Liu, Kaiyuan Yang, Bin Ren, Yanzhi Wang, and Xue Lin. Npas: A compiler-aware framework of unified network pruning and architecture … Z Li, G Yuan, W Niu, P Zhao, Y Li, Y Cai, X Shen, Z Zhan Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021 | 4 | 2021 |