Zun Wang
Verified email at anu.edu.au
Title
Cited by
Year
InternVideo: General video foundation models via generative and discriminative learning
Y Wang, K Li, Y Li, Y He, B Huang, Z Zhao, H Zhang, J Xu, Y Liu, Z Wang, ...
arXiv preprint arXiv:2212.03191, 2022
Cited by 203, 2022
MVBench: A comprehensive multi-modal video understanding benchmark
K Li, Y Wang, Y He, Y Li, Y Wang, Y Liu, Z Wang, J Xu, G Chen, P Luo, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
Cited by 52, 2024
Bridging the Gap Between Learning in Discrete and Continuous Environments for Vision-and-Language Navigation
Y Hong*, Z Wang*, Q Wu, S Gould
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
Cited by 46, 2022
InternVideo-Ego4D: A pack of champion solutions to Ego4D challenges
G Chen, S Xing, Z Chen, Y Wang, K Li, Y Li, Y Liu, J Wang, YD Zheng, ...
arXiv preprint arXiv:2211.09529, 2022
Cited by 38, 2022
Scaling Data Generation in Vision-and-Language Navigation
Z Wang, J Li, Y Hong, Y Wang, Q Wu, M Bansal, S Gould, H Tan, Y Qiao
ICCV 2023, 2023
Cited by 26, 2023
ETPNav: Evolving topological planning for vision-language navigation in continuous environments
D An, H Wang, W Wang, Z Wang, Y Huang, K He, L Wang
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024
Cited by 21, 2024
InternVideo2: Scaling video foundation models for multimodal video understanding
Y Wang, K Li, X Li, J Yu, Y He, G Chen, B Pei, R Zheng, J Xu, Z Wang, ...
arXiv preprint arXiv:2403.15377, 2024
Cited by 19, 2024
1st Place Solutions for RxR-Habitat Vision-and-Language Navigation Competition (CVPR 2022)
D An*, Z Wang*, Y Li, Y Wang, Y Hong, Y Huang, L Wang, J Shao
arXiv preprint arXiv:2206.11610, 2022
Cited by 9, 2022