Следене
Jiancong Xiao
Заглавие
Позовавания
Позовавания
Година
Stability analysis and generalization bounds of adversarial training
J Xiao, Y Fan, R Sun, J Wang, ZQ Luo
NeurIPS 2022 Spotlight, 2022
382022
Adversarial rademacher complexity of deep neural networks
J Xiao, Y Fan, R Sun, ZQ Luo
arXiv preprint arXiv:2211.14966, 2022
172022
Understanding Adversarial Robustness Against On-manifold Adversarial Examples
J Xiao, L Yang, Y Fan, J Wang, ZQ Luo
Pattern Recognition 159, 111071, 2022
142022
PAC-bayesian spectrally-normalized bounds for adversarially robust generalization
J Xiao, R Sun, ZQ Luo
NeurIPS 2023, 2023
13*2023
Improving Adversarial Training for Multiple Perturbations through the Lens of Uniform Stability
J Xiao, Z Qin, Y Fan, B Wu, J Wang, ZQ Luo
ICML 2023 AdvML-Frontiers Workshop, 2023
10*2023
Uniformly Stable Algorithms for Adversarial Training and Beyond
J Xiao, J Zhang, ZQ Luo, A Ozdaglar
ICML 2024, 2024
8*2024
On the Algorithmic Bias of Aligning Large Language Models with RLHF: Preference Collapse and Matching Regularization
J Xiao, Z Li, X Xie, E Getzen, C Fang, Q Long, WJ Su
arXiv preprint arXiv:2405.16455, 2024
72024
Entropic Distribution Matching in Supervised Fine-tuning of LLMs: Less Overfitting and Better Diversity
Z Li, C Chen, T Xu, Z Qin, J Xiao, R Sun, ZQ Luo
ICLR 2025, 2024
4*2024
Fine-Tuning Linear Layers Only Is a Simple yet Effective Way for Task Arithmetic
R Jin, B Hou, J Xiao, W Su, L Shen
ICLR 2025, 2024
3*2024
Bridging the Gap: Rademacher Complexity in Robust and Standard Generalization
J Xiao, R Sun, Q Long, WJ Su
COLT 2024, 2024
3*2024
Magnetic Preference Optimization: Achieving Last-iterate Convergence for Language Models Alignment
M Wang, C Ma, Q Chen, L Meng, Y Han, J Xiao, Z Zhang, J Huo, WJ Su, ...
ICLR 2025, 2024
2024
Understanding Adversarially Robust Generalization: A Learning Theory Perspective
J Xiao
The Chinese University of Hong Kong, Shenzhen, 2023
2023
Системата не може да изпълни операцията сега. Опитайте отново по-късно.
Статии 1–12