Follow
Yang You
Title
Cited by
Cited by
Year
Large batch optimization for deep learning: Training bert in 76 minutes
Y You, J Li, S Reddi, J Hseu, S Kumar, S Bhojanapalli, X Song, J Demmel, ...
arXiv preprint arXiv:1904.00962, 2019
4932019
Large batch training of convolutional networks
Y You, I Gitman, B Ginsburg
arXiv preprint arXiv:1708.03888, 2017
4792017
Imagenet training in minutes
Y You, Z Zhang, CJ Hsieh, J Demmel, K Keutzer
Proceedings of the 47th International Conference on Parallel Processing, 1-10, 2018
3842018
Scaling sgd batch size to 32k for imagenet training
Y You, I Gitman, B Ginsburg
arXiv preprint arXiv:1708.03888 6 (12), 6, 2017
3112017
Reducing BERT pre-training time from 3 days to 76 minutes
Y You, J Li, J Hseu, X Song, J Demmel, CJ Hsieh
arXiv preprint arXiv:1904.00962, 2019
812019
Scaling deep learning on gpu and knights landing clusters
Y You, A Buluç, J Demmel
Proceedings of the International Conference for High Performance Computing …, 2017
792017
Large-batch training for LSTM and beyond
Y You, J Hseu, C Ying, J Demmel, K Keutzer, CJ Hsieh
Proceedings of the International Conference for High Performance Computing …, 2019
782019
100-epoch imagenet training with alexnet in 24 minutes
Y You, Z Zhang, CJ Hsieh, J Demmel, K Keutzer
arXiv preprint arXiv:1709.05011, 2017
652017
Mic-svm: Designing a highly efficient support vector machine for advanced modern multi-core and many-core architectures
Y You, SL Song, H Fu, A Marquez, MM Dehnavi, K Barker, KW Cameron, ...
2014 IEEE 28th International Parallel and Distributed Processing Symposium …, 2014
492014
Asynchronous parallel greedy coordinate descent
Y You, X Lian, J Liu, HF Yu, IS Dhillon, J Demmel, CJ Hsieh
Advances in Neural Information Processing Systems, 4682-4690, 2016
482016
CA-SVM: Communication-avoiding support vector machines on distributed systems
Y You, J Demmel, K Czechowski, L Song, R Vuduc
2015 IEEE International Parallel and Distributed Processing Symposium, 847-859, 2015
412015
Scaling support vector machines on modern HPC platforms
Y You, H Fu, SL Song, A Randles, D Kerbyson, A Marquez, G Yang, ...
Journal of Parallel and Distributed Computing 76, 16-31, 2015
362015
Fast deep neural network training on distributed systems and cloud TPUs
Y You, Z Zhang, CJ Hsieh, J Demmel, K Keutzer
IEEE Transactions on Parallel and Distributed Systems 30 (11), 2449-2462, 2019
352019
Imagenet training in 24 minutes
Y You, Z Zhang, J Demmel, K Keutzer, CJ Hsieh
arXiv preprint arXiv:1709.05011, 2017
332017
PGAP-X: extension on pan-genome analysis pipeline
Y Zhao, C Sun, D Zhao, Y Zhang, Y You, X Jia, J Yang, L Wang, J Wang, ...
BMC genomics 19 (1), 115-124, 2018
262018
Designing a heuristic cross-architecture combination for breadth-first search
Y You, D Bader, MM Dehnavi
2014 43rd International Conference on Parallel Processing, 70-79, 2014
232014
Crafting better contrastive views for siamese representation learning
X Peng, K Wang, Z Zhu, M Wang, Y You
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
222022
Accurate, fast and scalable kernel ridge regression on parallel and distributed systems
Y You, J Demmel, CJ Hsieh, R Vuduc
Proceedings of the 2018 International Conference on Supercomputing, 307-317, 2018
202018
Design and implementation of a communication-optimal classifier for distributed kernel support vector machines
Y You, J Demmel, K Czechowski, L Song, R Vuduc
IEEE Transactions on Parallel and Distributed Systems 28 (4), 974-988, 2016
172016
Cafe: Learning to condense dataset by aligning features
K Wang, B Zhao, X Peng, Z Zhu, S Yang, S Wang, G Huang, H Bilen, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
162022
The system can't perform the operation now. Try again later.
Articles 1–20