跳转至

Xiao Xu @ HIT-SCIR

ICON

I am Xiao Xu, a third-year Ph.D. student from Harbin Institute of Technology. Fortunately, I am advised by Prof. Wanxiang Che.

I am a member of Language Analysis Group at Research Center for Social Computing and Information Retrieval (HIT-SCIR).

Love music, singing, animation, and all good things in my life.

The Sun Also Rises.


More   Resume   Github   Google Scholar   Semantic Scholar
Contact   Email   WeChat   Twitter   Zhihu

Research Interests

  • 2020 - 2021: Task-oriented Dialogue Systems, Natural Language Processing.
  • 2022 - Now:  Vision-Language Learning, Multimodal Large Language Models.

Publications

ManagerTower: Aggregating the Insights of Uni-Modal Experts for Vision-Language Representation Learning

Xiao Xu, Bei Li, Chenfei Wu, Shao-Yen Tseng, Anahita Bhiwandiwalla, Shachar Rosenman, Vasudev Lal, Wanxiang Che, Nan Duan.

ACL 2023 (Oral) | Association for Computational Linguistics

Paper | Arxiv | Code | Model | Slides | Video(EN) | Video(CN) | Blog(CN) | Tweet(EN)

BridgeTower: Building Bridges Between Encoders in Vision-Language Representation Learning

Xiao Xu, Chenfei Wu, Shachar Rosenman, Vasudev Lal, Wanxiang Che, Nan Duan.

AAAI 2023 (Oral) | Association for the Advancement of Artificial Intelligence

Paper | Arxiv | Code | Model | Slides | Video(EN) | Video(CN) | Blog(CN) | Tweet(EN)

Integration into 🤗-Transformers ( Model | Code | Doc | Blog(EN) | Blog(CN))

Demos ( Image-Text Matching | Video Frame Retrieval)

Text is no more Enough! A Benchmark for Profile-based Spoken Language Understanding

Xiao Xu*, Libo Qin*, Kaiji Chen, Guoxing Wu, Linlin Li, Wanxiang Che.

AAAI 2022 (Oral) | Association for the Advancement of Artificial Intelligence

Paper | Arxiv | Code | Blog(CN)

Semantic-Guided Image Augmentation with Pre-trained Models

Bohan Li, Xiao Xu, Xinghao Wang, Yutai Hou, Yunlong Feng, Feng Wang, Xuanliang Zhang, Qingfu Zhu, Wanxiang Che.

AAAI 2024 | Association for the Advancement of Artificial Intelligence

Arxiv

Modularized Pre-training for End-to-end Task-oriented Dialogue

Libo Qin, Xiao Xu, Lehan Wang, Yue Zhang, Wanxiang Che.

TASLP 2023 | IEEE/ACM Transactions on Audio, Speech, and Language Processing

Paper | Code

AGIF: An Adaptive Graph-Interactive Framework for Joint Multiple Intent Detection and Slot Filling

Libo Qin, Xiao Xu, Wanxiang Che, Ting Liu.

EMNLP 2020 (Findings) | Association for Computational Linguistics

Paper | Arxiv | Code | Blog(CN)

Dynamic Fusion Network for Multi-Domain End-to-end Task-Oriented Dialog

Libo Qin, Xiao Xu, Wanxiang Che, Yue Zhang, Ting Liu.

ACL 2020 | Association for Computational Linguistics

Paper | Arxiv | Code | Blog(CN)

V-DPO: Mitigating Hallucination in Large Vision Language Models via Vision-Guided Direct Preference Optimization

Yuxi Xie, Guanzhen Li, Xiao Xu, Min-Yen Kan

EMNLP 2024 (Findings) | Conference on Empirical Methods in Natural Language Processing

Pro-HAN: A Heterogeneous Graph Attention Network for Profile-Based Spoken Language Understanding

Dechuan Teng, Chunlin Lu, Xiao Xu, Wanxiang Che, Libo Qin.

ICASSP 2024 | IEEE International Conference on Acoustics, Speech and Signal Processing

Paper | Arxiv | Code

OpenSLU: A Unified, Modularized, and Extensible Toolkit for Spoken Language Understanding

Libo Qin, Qiguang Chen, Xiao Xu, Yunlong Feng, Wanxiang Che.

ACL 2023 (Demo) | Association for Computational Linguistics

Paper | Arxiv | Code

A Preliminary Evaluation of ChatGPT for Zero-shot Dialogue Understanding

Wenbo Pan, Qiguang Chen, Xiao Xu, Wanxiang Che, Libo Qin.

Arxiv | Preprint

Arxiv

A Two-Stage Framework with Self-Supervised Distillation For Cross-Domain Text Classification

Yunlong Feng, Bohan Li, Libo Qin, Xiao Xu, Wanxiang Che.

LREC-COLING | The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation

Paper | Arxiv

GL-GIN: Fast and Accurate Non-Autoregressive Model for Joint Multiple Intent Detection and Slot Filling

Libo Qin, Fuxuan Wei, Tianbao Xie, Xiao Xu, Wanxiang Che, Ting Liu.

ACL 2021 (Oral) | Association for Computational Linguistics

Paper | Arxiv | Code

COrAL: Order-Agnostic Language Modeling for Efficient Iterative Refinement

Yuxi Xie, Anirudh Goyal, Xiaobao Wu, Xunjian Yin, Xiao Xu, Min-Yen Kan, Liangming Pan, William Yang Wang

Arxiv | Preprint

Arxiv

In-Context Transfer Learning: Demonstration Synthesis by Transferring Similar Tasks

Dingzirui Wang, Xuanliang Zhang, Qiguang Chen, Longxu Dou, Xiao Xu, Rongyu Cao, Yingwei Ma, Qingfu Zhu, Wanxiang Che, Binhua Li, Fei Huang, Yongbin Li

Arxiv | Preprint

Arxiv

Self-Constructed Context Decompilation with Fined-grained Alignment Enhancement

Yunlong Feng, Yang Xu, Dechuan Teng, Honglin Mu, Xiao Xu, Libo Qin, Wanxiang Che, Qingfu Zhu

EMNLP 2024 (Findings) | Conference on Empirical Methods in Natural Language Processing

Arxiv

M\(^3\)CoT: A Novel Benchmark for Multi-Domain Multi-step Multi-modal Chain-of-Thought

Qiguang Chen, Libo Qin, Jin Zhang, Zhi Chen, Xiao Xu, Wanxiang Che

ACL 2024 | Association for Computational Linguistics

Arxiv | Code | Dataset | Website

Don't be Contradicted with Anything! CI-ToD: Towards Benchmarking Consistency for Task-oriented Dialogue System

Libo Qin, Tianbao Xie, Shijue Huang, Qiguang Chen, Xiao Xu, Wanxiang Che.

EMNLP 2021 (Poster) | Association for Computational Linguistics

Paper | Arxiv | Code

IPGAN: Generating Informative Item Pairs by Adversarial Sampling

Guibing Guo, Huan Zhou, Bowei Chen, Zhirong Liu, Xiao Xu, Xu Chen, Zhenhua Dong, and Xiuqiang He.

TNNLS 2022 | IEEE Transactions on Neural Networks and Learning Systems

Paper

Services

  • Conference Reviewer: ACL 2023, AAAI 2023, EMNLP 2022 & 2023, NLPCC 2022 & 2023.
  • Community: MLNLP-2022 Outstanding Organizer.

Awards

  • National Scholarship for Ph.D., 2023.
  • China Scholarship Council (CSC) Scholarship, 2023.
  • Stars of Tomorrow Internship Award of Microsoft Research Asia, 2023.
  • National Scholarship for Encouragement, 2019.
  • Mathematical Contest In Modeling - Meritorious Winner, 2019.
  • National Scholarship for B.E., 2018.
  • National Scholarship for B.E., 2017.
  • Top Ten Campus Singers of Northeastern University (Hunnan), 2017.
  • Top Ten Campus Singers of Northeastern University (Hunnan), 2016.

Experiences

MSRA

Microsoft Research Asia, Beijing, China

Education

HIT

Harbin Institute of Technology (HIT), Harbin, China


NUS

National University of Singapore (NUS), Singapore


NEU

Northeastern University (NEU), Shenyang, China

  • B.S. in Software Engineering
  • 2016.09 - 2020.06