MMDetection: Open mmlab detection toolbox and benchmark K Chen, J Wang, J Pang, Y Cao, Y Xiong, X Li, S Sun, W Feng, Z Liu, J Xu, ... arXiv preprint arXiv:1906.07155, 2019 | 3368 | 2019 |
Hybrid task cascade for instance segmentation K Chen, J Pang, J Wang, Y Xiong, X Li, S Sun, W Feng, Z Liu, J Shi, ... Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2019 | 1595 | 2019 |
Region proposal by guided anchoring J Wang, K Chen, S Yang, C Change Loy, D Lin Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2019 | 764 | 2019 |
CARAFE: Content-Aware ReAssembly of FEatures J Wang, K Chen, R Xu, Z Liu, CC Loy, D Lin Proceedings of the IEEE International Conference on Computer Vision, 2019 | 707 | 2019 |
Mmbench: Is your multi-modal model an all-around player? Y Liu, H Duan, Y Zhang, B Li, S Zhang, W Zhao, Y Yuan, J Wang, C He, ... European Conference on Computer Vision, 216-233, 2025 | 610 | 2025 |
Sharegpt4v: Improving large multi-modal models with better captions L Chen, J Li, X Dong, P Zhang, C He, J Wang, F Zhao, D Lin European Conference on Computer Vision, 370-387, 2025 | 377 | 2025 |
Lavt: Language-aware vision transformer for referring image segmentation Z Yang, J Wang, Y Tang, K Chen, H Zhao, PHS Torr Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 308 | 2022 |
Seesaw Loss for Long-Tailed Instance Segmentation J Wang, W Zhang, Y Zang, Y Cao, J Pang, T Gong, K Chen, Z Liu, CC Loy, ... Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021 | 293 | 2021 |
How far are we to gpt-4v? closing the gap to commercial multimodal models with open-source suites Z Chen, W Wang, H Tian, S Ye, Z Gao, E Cui, W Tong, K Hu, J Luo, Z Ma, ... arXiv preprint arXiv:2404.16821, 2024 | 250 | 2024 |
Internlm-xcomposer2: Mastering free-form text-image composition and comprehension in vision-language large model X Dong, P Zhang, Y Zang, Y Cao, B Wang, L Ouyang, X Wei, S Zhang, ... arXiv preprint arXiv:2401.16420, 2024 | 172 | 2024 |
Side-aware boundary localization for more precise object detection J Wang, W Zhang, Y Cao, K Chen, J Pang, T Gong, J Shi, CC Loy, D Lin Proceedings of the European Conference on Computer Vision (ECCV), 2020 | 171 | 2020 |
Internlm-xcomposer: A vision-language large model for advanced text-image comprehension and composition P Zhang, X Dong, B Wang, Y Cao, C Xu, L Ouyang, Z Zhao, H Duan, ... arXiv preprint arXiv:2309.15112, 2023 | 164 | 2023 |
Internlm2 technical report Z Cai, M Cao, H Chen, K Chen, K Chen, X Chen, X Chen, Z Chen, Z Chen, ... arXiv preprint arXiv:2403.17297, 2024 | 163 | 2024 |
Omniobject3d: Large-vocabulary 3d object dataset for realistic perception, reconstruction and generation T Wu, J Zhang, X Fu, Y Wang, J Ren, L Pan, W Wu, L Yang, J Wang, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 160 | 2023 |
Pyskl: Towards good practices for skeleton action recognition H Duan, J Wang, K Chen, D Lin Proceedings of the 30th ACM International Conference on Multimedia, 7351-7354, 2022 | 140 | 2022 |
Optimizing video object detection via a scale-time lattice K Chen, J Wang, S Yang, X Zhang, Y Xiong, CC Loy, D Lin Proceedings of the IEEE conference on computer vision and pattern …, 2018 | 138 | 2018 |
Dense distinct query for end-to-end object detection S Zhang, X Wang, J Wang, J Pang, C Lyu, W Zhang, P Luo, K Chen Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023 | 131 | 2023 |
Are We on the Right Way for Evaluating Large Vision-Language Models? L Chen, J Li, X Dong, P Zhang, Y Zang, Z Chen, H Duan, J Wang, Y Qiao, ... arXiv preprint arXiv:2403.20330, 2024 | 110 | 2024 |
Opera: Alleviating hallucination in multi-modal large language models via over-trust penalty and retrospection-allocation Q Huang, X Dong, P Zhang, B Wang, C He, J Wang, D Lin, W Zhang, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 105 | 2024 |
Few-shot object detection via association and discrimination Y Cao, J Wang, Y Jin, T Wu, K Chen, Z Liu, D Lin Advances in neural information processing systems 34, 16570-16581, 2021 | 102 | 2021 |