Следене
Lijuan Wang
Lijuan Wang
Microsoft GenAI
Потвърден имейл адрес: microsoft.com - Начална страница
Заглавие
Позовавания
Позовавания
Година
Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks
JG Xiujun Li, Xi Yin, Chunyuan Li, Pengchuan Zhang, Xiaowei Hu, Lei Zhang ...
European Conference on Computer Vision (ECCV), 2020
1677*2020
Large Scale Incremental Learning
YF Yue Wu, Yinpeng Chen, Lijuan Wang, Yuancheng Ye, Zicheng Liu, Yandong Guo
The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019
1101*2019
VinVL: Making Visual Representations Matter in Vision-Language Models
P Zhang, X Li, X Hu, J Yang, L Zhang, L Wang, Y Choi, J Gao
CVPR2021, 2021
923*2021
Florence: A new foundation model for computer vision
L Yuan, D Chen, YL Chen, N Codella, X Dai, J Gao, H Hu, X Huang, B Li, ...
arXiv preprint arXiv:2111.11432, 2021
6072021
Grounded language-image pre-training
LH Li, P Zhang, H Zhang, J Yang, C Li, Y Zhong, L Wang, L Yuan, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
5712022
Rethinking Classification and Localization for Object Detection
YF Yue Wu, Yinpeng Chen, Lu Yuan, Zicheng Liu, Lijuan Wang, Hongzhi Li
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020
5602020
End-to-End Human Pose and Mesh Reconstruction with Transformers
K Lin, L Wang, Z Liu
CVPR2021, 2020
5312020
End-to-end semi-supervised object detection with soft teacher
M Xu, Z Zhang, H Hu, J Wang, L Wang, F Wei, X Bai, Z Liu
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021
3782021
Real-time Animation for an Expressive Avatar
N Xu, L Wang, FKP Soong, X Liang, Q Luo, YQ Xu, X Zou
US Patent App. 12/950,801, 2012
3452012
Refining of segmental boundaries in speech waveforms using contextual-dependent models
Y Zhao, M Chu, JL Zhou, L Wang
US Patent 7,496,512, 2009
3392009
Git: A generative image-to-text transformer for vision and language
J Wang, Z Yang, X Hu, L Li, K Lin, Z Gan, Z Liu, C Liu, L Wang
arXiv preprint arXiv:2205.14100, 2022
3032022
Handwriting-based user interface for correction of speech recognition errors
L Wang, FKP Soong
US Patent App. 12/042,344, 2009
2802009
Unnatural prosody detection in speech synthesis
Y Zhao, FKP Soong, M Chu, L Wang
US Patent 8,583,438, 2013
2632013
An empirical study of training end-to-end vision-and-language transformers
ZY Dou, Y Xu, Z Gan, J Wang, S Wang, L Wang, C Zhu, P Zhang, L Yuan, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
2622022
An empirical study of gpt-3 for few-shot knowledge-based vqa
Z Yang, Z Gan, J Wang, X Hu, Y Lu, Z Liu, L Wang
Proceedings of the AAAI Conference on Artificial Intelligence 36 (3), 3081-3089, 2022
2532022
Mesh graphormer
K Lin, L Wang, Z Liu
Proceedings of the IEEE/CVF international conference on computer vision …, 2021
2352021
Scaling up vision-language pre-training for image captioning
X Hu, Z Gan, J Wang, Z Yang, Z Liu, Y Lu, L Wang
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022
1972022
Speech and text driven HMM-based body animation synthesis
L Wang, L Ma, FKP Soong
US Patent 8,224,652, 2012
1832012
Mm-react: Prompting chatgpt for multimodal reasoning and action
Z Yang, L Li, J Wang, K Lin, E Azarnasab, F Ahmed, Z Liu, C Liu, M Zeng, ...
arXiv preprint arXiv:2303.11381, 2023
1692023
Violet: End-to-end video-language transformers with masked visual-token modeling
TJ Fu, L Li, Z Gan, K Lin, WY Wang, L Wang, Z Liu
arXiv preprint arXiv:2111.12681, 2021
1632021
Системата не може да изпълни операцията сега. Опитайте отново по-късно.
Статии 1–20