Следене
Xizhou Zhu
Xizhou Zhu
Потвърден имейл адрес: tsinghua.edu.cn
Заглавие
Позовавания
Позовавания
Година
Deformable detr: Deformable transformers for end-to-end object detection
X Zhu, W Su, L Lu, B Li, X Wang, J Dai
arXiv preprint arXiv:2010.04159, 2020
39562020
Deformable convnets v2: More deformable, better results
X Zhu, H Hu, S Lin, J Dai
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2019
19102019
Vl-bert: Pre-training of generic visual-linguistic representations
W Su, X Zhu, Y Cao, B Li, L Lu, F Wei, J Dai
arXiv preprint arXiv:1908.08530, 2019
16172019
Deep feature flow for video recognition
X Zhu, Y Xiong, J Dai, L Yuan, Y Wei
Proceedings of the IEEE conference on computer vision and pattern …, 2017
7612017
Flow-guided feature aggregation for video object detection
X Zhu, Y Wang, J Dai, L Yuan, Y Wei
Proceedings of the IEEE international conference on computer vision, 408-417, 2017
7372017
An empirical study of spatial attention mechanisms in deep networks
X Zhu, D Cheng, Z Zhang, S Lin, J Dai
Proceedings of the IEEE/CVF international conference on computer vision …, 2019
4122019
Internimage: Exploring large-scale vision foundation models with deformable convolutions
W Wang, J Dai, Z Chen, Z Huang, Z Li, X Zhu, X Hu, T Lu, L Lu, H Li, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
3172023
Towards high performance video object detection
X Zhu, J Dai, L Yuan, Y Wei
Proceedings of the IEEE conference on computer vision and pattern …, 2018
3002018
Planning-oriented autonomous driving
Y Hu, J Yang, L Chen, K Li, C Sima, X Zhu, S Chai, S Du, T Lin, W Wang, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
1752023
Visionllm: Large language model is also an open-ended decoder for vision-centric tasks
W Wang, Z Chen, X Chen, J Wu, X Zhu, G Zeng, P Luo, T Lu, J Zhou, ...
Advances in Neural Information Processing Systems 36, 2024
1322024
BEVFormer v2: Adapting modern image backbones to bird's-eye-view recognition via perspective supervision
C Yang, Y Chen, H Tian, C Tao, X Zhu, Z Zhang, G Huang, H Li, Y Qiao, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
1022023
Uni-perceiver: Pre-training unified architecture for generic perception for zero-shot and few-shot tasks
X Zhu, J Zhu, H Li, X Wu, H Li, X Wang, J Dai
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
982022
Spatially adaptive inference with stochastic feature sampling and interpolation
Z Xie, Z Zhang, X Zhu, G Huang, S Lin
Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020
912020
An uncertainty-aware approach for exploratory microblog retrieval
M Liu, S Liu, X Zhu, Q Liao, F Wei, S Pan
IEEE transactions on visualization and computer graphics 22 (1), 250-259, 2015
762015
Deformable kernels: Adapting effective receptive fields for object deformation
H Gao, X Zhu, S Lin, J Dai
arXiv preprint arXiv:1910.02940, 2019
672019
Delving into the devils of bird's-eye-view perception: A review, evaluation and recipe
H Li, C Sima, J Dai, W Wang, L Lu, H Wang, J Zeng, Z Li, J Yang, H Deng, ...
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023
602023
Integrated object detection and tracking with tracklet-conditioned detection
Z Zhang, D Cheng, X Zhu, S Lin, J Dai
arXiv preprint arXiv:1811.11167, 2018
552018
Weijie Su, Chenyu Yang, Gao Huang, Bin Li, Lewei Lu, Xiaogang Wang, et al. Ghost in the minecraft: Generally capable agents for open-world enviroments via large language models …
X Zhu, Y Chen, H Tian, C Tao
arXiv preprint arXiv:2305.17144 2 (3), 5, 2023
512023
Siamese image modeling for self-supervised vision representation learning
C Tao, X Zhu, W Su, G Huang, B Li, J Zhou, Y Qiao, X Wang, J Dai
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
502023
Exploring the equivalence of siamese self-supervised learning via a unified gradient framework
C Tao, H Wang, X Zhu, J Dong, S Song, G Huang, J Dai
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
492022
Системата не може да изпълни операцията сега. Опитайте отново по-късно.
Статии 1–20