Chen Zhang
Title
Cited by
Year
Optimizing FPGA-based accelerator design for deep convolutional neural networks
C Zhang, P Li, G Sun, Y Guan, B Xiao, J Cong
Proceedings of the 2015 ACM/SIGDA international symposium on field …, 2015
2347 2015
Caffeine: towards uniformed representation and acceleration for deep convolutional neural networks
C Zhang, Z Fang, P Zhou, P Pan, J Cong
Proceedings of the 35th International Conference on Computer-Aided Design, 1-8, 2016
663* 2016
Energy-efficient CNN implementation on a deeply pipelined FPGA cluster
C Zhang, D Wu, J Sun, G Sun, G Luo, J Cong
Proceedings of the 2016 International Symposium on Low Power Electronics and …, 2016
266 2016
An efficient design and implementation of LSM-tree based key-value store on open-channel SSD
P Wang, G Sun, S Jiang, J Ouyang, S Lin, C Zhang, J Cong
Proceedings of the Ninth European Conference on Computer Systems, 1-14, 2014
248 2014
Efficient and effective sparse LSTM on FPGA with bank-balanced sparsity
S Cao, C Zhang, Z Yao, W Xiao, L Nie, D Zhan, Y Liu, M Wu, L Zhang
Proceedings of the 2019 ACM/SIGDA International Symposium on Field …, 2019
186 2019
Memory partitioning for multidimensional arrays in high-level synthesis
Y Wang, P Li, P Zhang, C Zhang, J Cong
Proceedings of the 50th Annual Design Automation Conference, 1-8, 2013
120 2013
Balanced sparsity for efficient DNN inference on GPU
Z Yao, S Cao, W Xiao, C Zhang, L Nie
Proceedings of the AAAI conference on artificial intelligence 33 (01), 5676-5683, 2019
115 2019
SeerNet: Predicting convolutional neural network feature-map sparsity through low-bit quantization
S Cao, L Ma, W Xiao, C Zhang, Y Liu, L Zhang, L Nie, Z Yang
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019
79 2019
LadaBERT: Lightweight adaptation of BERT through hybrid model compression
Y Mao, Y Wang, C Wu, C Zhang, Y Wang, Y Yang, Q Zhang, Y Tong, J Bai
arXiv preprint arXiv:2004.04124, 2020
57 2020
Dual-side sparse tensor core
Y Wang, C Zhang, Z Xie, C Guo, Y Liu, J Leng
2021 ACM/IEEE 48th Annual International Symposium on Computer Architecture …, 2021
53 2021
SQuant: On-the-fly data-free quantization via diagonal Hessian approximation
C Guo, Y Qiu, J Leng, X Gao, C Zhang, Y Liu, F Yang, Y Zhu, M Guo
arXiv preprint arXiv:2202.07471, 2022
47 2022
Best-effort FPGA programming: A few steps can go a long way
J Cong, Z Fang, Y Hao, P Wei, CH Yu, C Zhang, P Zhou
arXiv preprint arXiv:1807.01340, 2018
33 2018
Scylla: QoE-aware continuous mobile vision with FPGA-based dynamic deep neural network reconfiguration
S Jiang, Z Ma, X Zeng, C Xu, M Zhang, C Zhang, Y Liu
IEEE INFOCOM 2020-IEEE Conference on Computer Communications, 1369-1378, 2020
26 2020
Live video analytics with FPGA-based smart cameras
S Wang, C Zhang, Y Shu, Y Liu
Proceedings of the 2019 Workshop on Hot Topics in Video Analytics and …, 2019
22 2019
OliVe: Accelerating large language models via hardware-friendly outlier-victim pair quantization
C Guo, J Tang, W Hu, J Leng, C Zhang, F Yang, Y Liu, M Guo, Y Zhu
Proceedings of the 50th Annual International Symposium on Computer …, 2023
20 2023
ANT: Exploiting adaptive numerical data type for low-bit deep neural network quantization
C Guo, C Zhang, J Leng, Z Liu, F Yang, Y Liu, M Guo, Y Zhu
2022 55th IEEE/ACM International Symposium on Microarchitecture (MICRO …, 2022
20 2022
Using data compression for optimizing FPGA-based convolutional neural network accelerators
Y Guan, N Xu, C Zhang, Z Yuan, J Cong
International workshop on advanced parallel processing technologies, 14-26, 2017
10 2017
Boosting mobile CNN inference through semantic memory
Y Li, C Zhang, S Han, LL Zhang, B Yin, Y Liu, M Xu
Proceedings of the 29th ACM International Conference on Multimedia, 2362-2371, 2021
9 2021
Nesting forward automatic differentiation for memory-efficient deep neural network training
C Guo, Y Qiu, J Leng, C Zhang, Y Cao, Q Zhang, Y Liu, F Yang, M Guo
2022 IEEE 40th International Conference on Computer Design (ICCD), 738-745, 2022
6 2022
Infinite-LLM: Efficient LLM Service for Long Context with DistAttention and Distributed KVCache
B Lin, T Peng, C Zhang, M Sun, L Li, H Zhao, W Xiao, Q Xu, X Qiu, S Li, ...
arXiv preprint arXiv:2401.02669, 2024
2 2024