Bei Liu

Citations

	All	Since 2019
Citations	2063	2060
h-index	19	19
i10-index	22	22

Citations per year
Year	2019	2020	2021	2022	2023	2024
Citations	20	67	203	436	682	641

Public access (based on funding mandates)

Available	10 articles
Not available	0 articles

Co-authors

Name	Affiliation	Verified email
Jianlong Fu	Microsoft Research	microsoft.com
Zhaoyang Zeng	International Digital Economy Academy	idea.edu.cn
Hongwei Xue	University of Science and Technology of China	mail.ustc.edu.cn
Ruihua Song	Renmin University of China	ruc.edu.cn
Huan Yang	Rhymes.AI	fastmail.com
Jiebo Luo, Fellow of NAI, ACM, AAAI,...	Albert Arendt Hopeman Professor of Engineering, University of Rochester	cs.rochester.edu
Makoto P. Kato	University of Tsukuba	acm.org
Masatoshi Yoshikawa	Osaka Seikei University	osaka-seikei.ac.jp
Katsumi Tanaka	Kyoto University	fukuchiyama.ac.jp

Bei Liu

Microsoft Research

Verified email at microsoft.com

multimodal learning


Title	Citations	Year
Pixel-bert: Aligning image pixels with text by deep multi-modal transformers. Z Huang, Z Zeng, B Liu, D Fu, J Fu. arXiv preprint arXiv:2004.00849, 2020	444	2020
Seeing out of the box: End-to-end pre-training for vision-language representation learning. Z Huang, Z Zeng, Y Huang, B Liu, D Fu, J Fu. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021	285	2021
Wsod2: Learning bottom-up and top-down objectness distillation for weakly-supervised object detection. Z Zeng, B Liu, J Fu, H Chao, L Zhang. Proceedings of the IEEE/CVF international conference on computer vision …, 2019	173	2019
Advancing high-resolution video-language representation with large-scale video transcriptions. H Xue, T Hang, Y Zeng, Y Sun, B Liu, H Yang, J Fu, B Guo. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022	150	2022
Clip-vip: Adapting pre-trained image-text model to video-language alignment. H Xue, Y Sun, B Liu, J Fu, R Song, H Li, J Luo. The Eleventh International Conference on Learning Representations, 2023	140*	2023
Mm-diffusion: Learning multi-modal diffusion models for joint audio and video generation. L Ruan, Y Ma, H Yang, H He, B Liu, J Fu, NJ Yuan, Q Jin, B Guo. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	129	2023
M3p: Learning universal representations via multitask multilingual multimodal pre-training. M Ni, H Huang, L Su, E Cui, T Bharti, L Wang, D Zhang, N Duan. Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2021	110	2021
Probing inter-modality: Visual parsing with self-attention for vision-and-language pre-training. H Xue, Y Huang, B Liu, H Peng, J Fu, H Li, J Luo. Advances in Neural Information Processing Systems 34, 4514-4528, 2021	88	2021
Beyond narrative description: Generating poetry from images by multi-adversarial training. B Liu, J Fu, MP Kato, M Yoshikawa. Proceedings of the 26th ACM international conference on Multimedia, 783-791, 2018	87	2018
Long-form video-language pre-training with multimodal temporal contrastive learning. Y Sun, H Xue, R Song, B Liu, H Yang, J Fu. Advances in neural information processing systems 35, 38032-38045, 2022	62	2022
Unifying multimodal transformer for bi-directional image and text generation. Y Huang, H Xue, B Liu, Y Lu. Proceedings of the 29th ACM International Conference on Multimedia, 1138-1147, 2021	57	2021
Searching the search space of vision transformer. M Chen, K Wu, B Ni, H Peng, B Liu, J Fu, H Chao, H Ling. Advances in Neural Information Processing Systems 34, 8714-8726, 2021	51	2021
Aesthetic-aware image style transfer. Z Hu, J Jia, B Liu, Y Bu, J Fu. Proceedings of the 28th ACM International Conference on Multimedia, 3320-3329, 2020	37	2020
Smp challenge: An overview of social media prediction challenge 2019. B Wu, WH Cheng, P Liu, B Liu, Z Zeng, J Luo. Proceedings of the 27th ACM International Conference on Multimedia, 2667-2671, 2019	36	2019
Reference-based defect detection network. Z Zeng, B Liu, J Fu, H Chao. IEEE Transactions on Image Processing 30, 6637-6647, 2021	35	2021
Neural storyboard artist: Visualizing stories with coherent image sequences. S Chen, B Liu, J Fu, R Song, Q Jin, P Lin, X Qi, C Wang, J Zhou. Proceedings of the 27th ACM International Conference on Multimedia, 2236-2244, 2019	33	2019
Pave the way to grasp anything: Transferring foundation models for universal pick-place robots. J Yang, W Tan, C Jin, B Liu, J Fu, R Song, L Wang. arXiv preprint arXiv:2306.05716, 2023	24	2023
Emotion reinforced visual storytelling. N Li, B Liu, Z Han, YS Liu, J Fu. Proceedings of the 2019 on International Conference on Multimedia Retrieval …, 2019	24	2019
Alphablock: Embodied finetuning for vision-language reasoning in robot manipulation. C Jin, W Tan, J Yang, B Liu, R Song, L Wang, J Fu. arXiv preprint arXiv:2305.18898, 2023	21	2023
Activitynet 2019 task 3: Exploring contexts for dense captioning events in videos. S Chen, Y Song, Y Zhao, Q Jin, Z Zeng, B Liu, J Fu, A Hauptmann. arXiv preprint arXiv:1907.05092, 2019	12	2019
