A two-branch neural network for non-homogeneous dehazing via ensemble learning Y Yu, H Liu, M Fu, J Chen, X Wang, K Wang
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2021
98 2021 NTIRE 2021 nonhomogeneous dehazing challenge report CO Ancuti, C Ancuti, FA Vasluianu, R Timofte
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021
86 2021 Mementos: A comprehensive benchmark for multimodal large language model reasoning over image sequences X Wang, Y Zhou, X Liu, H Lu, Y Xu, F He, J Yoon, T Lu, F Liu, G Bertasius, ...
ACL 2024, 2024
65 2024 : Temporal Latent Action-Driven Contrastive Loss for Visual Reinforcement LearningR Zheng, X Wang, Y Sun, S Ma, J Zhao, H Xu, H Daumé III, F Huang
Advances in Neural Information Processing Systems 36, 2024
42 2024 Drm: Mastering visual reinforcement learning through dormant ratio minimization G Xu, R Zheng, Y Liang, X Wang, Z Yuan, T Ji, Y Luo, X Liu, J Yuan, ...
ICLR 2024, 2023
26 2023 Calibrated self-rewarding vision language models Y Zhou, Z Fan, D Cheng, S Yang, Z Chen, C Cui, X Wang, Y Li, L Zhang, ...
NeurIPS 2024, 2024
24 2024 Enhancing Visual-Language Modality Alignment in Large Vision Language Models via Self-Improvement X Wang, J Chen, Z Wang, Y Zhou, Y Zhou, H Yao, T Zhou, T Goldstein, ...
Findings of NAACL 2025, 2024
23 2024 Transfer RL across Observation Feature Spaces via Model-Based Regularization Y Sun, R Zheng, X Wang, A Cohen, F Huang
ICLR 2022, 2022
21 2022 Live in the moment: Learning dynamics model adapted to evolving policy X Wang, W Wongkamjan, R Jia, F Huang
International Conference on Machine Learning, 36470-36493, 2023
17 2023 LLaVA-Critic: Learning to Evaluate Multimodal Models T Xiong, X Wang, D Guo, Q Ye, H Fan, Q Gu, H Huang, C Li
arXiv preprint arXiv:2410.02712, 2024
15 2024 Is model ensemble necessary? model-based rl via a single model with lipschitz regularized value function R Zheng, X Wang, H Xu, F Huang
ICLR 2023, 2023
15 2023 Emojis Decoded: Leveraging ChatGPT for Enhanced Understanding in Social Media Communications Y Zhou, P Xu, X Wang, X Lu, G Gao, W Ai
arXiv preprint arXiv:2402.01681, 2024
7 2024 COPlanner: Plan to roll out conservatively but to explore optimistically for model-based rl X Wang, R Zheng, Y Sun, R Jia, W Wongkamjan, H Xu, F Huang
ICLR 2024, 2023
6 2023 Equal Long-term Benefit Rate: Adapting Static Fairness Notions to Sequential Decision Making Y Xu, C Deng, Y Sun, R Zheng, X Wang, J Zhao, F Huang
ICML 2024, 2023
6 * 2023 Premier-TACO: Pretraining Multitask Representation via Temporal Action-Driven Contrastive Loss R Zheng, Y Liang, X Wang, S Ma, H Daumé III, H Xu, J Langford, ...
ICML 2024, 2024
5 * 2024 Multi-Stage Balanced Distillation: Addressing Long-Tail Challenges in Sequence-Level Knowledge Distillation Y Zhou, J Zhu, P Xu, X Liu, X Wang, D Koutra, W Ai, F Huang
Findings of EMNLP 2024, 2024
3 2024 Scaling Inference-Time Search with Vision Value Model for Improved Visual Comprehension W Xiyao, Y Zhengyuan, L Linjie, L Hongjin, X Yuancheng, LCC Lin, ...
arXiv preprint arXiv:2412.03704, 2024
2 2024 Towards Self-Improvement of LLMs via MCTS: Leveraging Stepwise Knowledge with Curriculum Preference Learning X Wang, L Song, Y Tian, D Yu, B Peng, H Mi, F Huang, D Yu
arXiv preprint arXiv:2410.06508, 2024
1 2024 World Models with Hints of Large Language Models for Goal Achieving Z Liu, Z Huan, X Wang, J Lyu, J Tao, X Li, F Huang, H Xu
NAACL 2025, 2024
1 2024