Scissorhands: Exploiting the persistence of importance hypothesis for llm kv cache compression at test time Z Liu, A Desai, F Liao, W Wang, V Xie, Z Xu, A Kyrillidis, A Shrivastava
Advances in Neural Information Processing Systems 36, 2024
120 2024 On the convergence of shallow neural network training with randomly masked neurons F Liao, A Kyrillidis
arXiv preprint arXiv:2112.02668, 2021
20 2021 GIST: Distributed training for large-scale graph convolutional networks CR Wolfe, J Yang, F Liao, A Chowdhury, C Dun, A Bayer, S Segarra, ...
Journal of Applied and Computational Topology, 1-53, 2023
14 2023 LOFT: Finding lottery tickets through filter-wise training Q Wang, C Dun, F Liao, C Jermaine, A Kyrillidis
International Conference on Artificial Intelligence and Statistics, 6498-6526, 2023
3 2023 Provable Accelerated Convergence of Nesterov’s Momentum for Deep ReLU Neural Networks F Liao, A Kyrillidis
International Conference on Algorithmic Learning Theory, 732-784, 2024
2 2024 How much pre-training is enough to discover a good subnetwork? CR Wolfe, F Liao, Q Wang, JL Kim, A Kyrillidis
arXiv preprint arXiv:2108.00259, 2021
2 2021 Strong Lottery Ticket Hypothesis with –perturbation Z Xiong, F Liao, A Kyrillidis
International Conference on Artificial Intelligence and Statistics, 6879-6902, 2023
1 2023 On the Error-Propagation of Inexact Deflation for Principal Component Analysis F Liao, JL Kim, C Barnum, A Kyrillidis
arXiv preprint arXiv:2310.04283, 2023
2023 Accelerated Convergence of Nesterov's Momentum for Deep Neural Networks under Partial Strong Convexity F Liao, A Kyrillidis
arXiv preprint arXiv:2306.08109, 2023
2023 On the Error-Propagation of Inexact Hotelling's Deflation for Principal Component Analysis F Liao, JL Kim, C Barnum, A Kyrillidis
Forty-first International Conference on Machine Learning, 0
Strong Lottery Ticket Hypothesis with –perturbation F Liao, Z Xiong, A Kyrillidis
OPT 2022: Optimization for Machine Learning (NeurIPS 2022 Workshop), 0