I am currently an Associate Professor at the PALM lab, Department of Computer Science, Southeast University (SEU), China. I received my B.S. degree from Nanjing University of Posts and Telecommunications (NUPT), my M.S. degree from Southeast University (SEU), supervised by Prof. Xin Geng, and my Ph.D. from Nanyang Technological University (NTU), supervised by Prof. Jianfei Cai and Prof. Hanwang Zhang.
I have broad interests in AI, especially machine learning and deep learning. Recently, I have focused on multi-modal in-context learning and the learngene framework. Over the past few years and in the coming ones, my work centers on the following topics:
How to Configure Good In-Context Sequence for Visual Question Answering
Li Li, Jiawei Peng, Huiyi Chen, Chongyang Gao, Xu Yang
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. CVPR 2024.
[Web]
ICD-LM: Configuring Vision-Language In-Context Demonstrations by Language Modeling
Yingzhe Peng, Xu Yang, Haoxuan Ma, Shuo Xu, Chi Zhang, Yucheng Han, Hanwang Zhang
Computer Vision and Pattern Recognition. arXiv.
[Web]
Manipulating the Label Space for In-Context Classification
Haokun Chen, Xu Yang, Yuhang Huang, Zihan Wu, Jing Wang, Xin Geng
Computer Vision and Pattern Recognition. arXiv.
[Web]
Exploring Diverse In-Context Configurations for Image Captioning
Xu Yang, Yongliang Wu, Mingzhuo Yang, Haokun Chen, Xin Geng
Annual Conference on Neural Information Processing Systems. NeurIPS 2023.
[Web]
Transforming Visual Scene Graphs to Image Captions
Xu Yang, Jiawei Peng, Zihua Wang, Haiyang Xu, Qinghao Ye, Chenliang Li, Ming Yan, Fei Huang, Zhangzikang Li, Yu Zhang
Association for Computational Linguistics. ACL 2023.
[Web]
Learning Trajectory-Word Alignments for Video-Language Tasks
Xu Yang, Zhangzikang Li, Haiyang Xu, Hanwang Zhang, Qinghao Ye, Chenliang Li, Ming Yan, Yu Zhang, Fei Huang, Songfang Huang
International Conference on Computer Vision. ICCV 2023.
[Web]
Learning to Collocate Visual-Linguistic Neural Modules for Image Captioning
Xu Yang, Hanwang Zhang, Chongyang Gao, Jianfei Cai
International Journal of Computer Vision, 1-19. IJCV 2023.
[Web]
Show, Deconfound and Tell: Image Captioning With Causal Inference
Bing Liu, Dong Wang, Xu Yang, Yong Zhou, Rui Yao, Zhiwen Shao, Jiaqi Zhao
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. CVPR 2022.
[Web]
EMScore: Evaluating Video Captioning via Coarse-Grained and Fine-Grained Embedding Matching
Yaya Shi, Xu Yang, Haiyang Xu, Chunfeng Yuan, Bing Li, Weiming Hu, Zheng-Jun Zha
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. CVPR 2022.
[Web]
Image captioning with transformer and knowledge graph
Yu Zhang, Xinyu Shi, Siya Mi, Xu Yang
Pattern Recognition Letters 143, 43-49. PRL.
[Web]
Deconfounded image captioning: A causal retrospect
Xu Yang, Hanwang Zhang, Jianfei Cai
IEEE Transactions on Pattern Analysis and Machine Intelligence. TPAMI.
[Web]
Auto-encoding and Distilling Scene Graphs for Image Captioning
Xu Yang, Hanwang Zhang, Jianfei Cai
IEEE Transactions on Pattern Analysis and Machine Intelligence. TPAMI.
[Web]
Auto-Parsing Network for Image Captioning and Visual Question Answering
Xu Yang, Chongyang Gao, Hanwang Zhang, Jianfei Cai
IEEE International Conference on Computer Vision. ICCV 2021.
[PDF]
Causal attention for vision-language tasks
Xu Yang, Hanwang Zhang, Guojun Qi, Jianfei Cai
Conference on Computer Vision and Pattern Recognition. CVPR 2021.
[PDF]
Hierarchical Scene Graph Encoder-Decoder for Image Paragraph Captioning
Xu Yang, Chongyang Gao, Hanwang Zhang, Jianfei Cai
ACM International Conference on Multimedia. ACMMM 2020.
[Web]
Learning to collocate neural modules for image captioning
Xu Yang, Hanwang Zhang, Jianfei Cai
IEEE International Conference on Computer Vision. ICCV 2019.
[PDF]
Auto-encoding scene graphs for image captioning
Xu Yang, Kaihua Tang, Hanwang Zhang, Jianfei Cai
Conference on Computer Vision and Pattern Recognition. CVPR 2019.
[PDF] Oral Presentation
Shuffle-then-assemble: learning object-agnostic visual relationship features
Xu Yang, Hanwang Zhang, Jianfei Cai
European Conference on Computer Vision. ECCV 2018.
[PDF]
Sparsity Conditional Energy Label Distribution Learning for Age Estimation
Xu Yang, Xin Geng, Deyu Zhou
International Joint Conference on Artificial Intelligence. IJCAI 2016.
[PDF]
Deep label distribution learning for apparent age estimation
Xu Yang, Bin-Bin Gao, Chao Xing, Zeng-Wei Huo, Xiu-Shen Wei, Ying Zhou, Jianxin Wu, Xin Geng
IEEE International Conference on Computer Vision Workshops. ICCVW 2015.
[PDF]
In my free time, I usually read, swim, and run. My reading interests are broad and include Computer Science, Philosophy, History, Politics, Literature, and Detective Fiction.
Some recommended books: