Publications
Preprints
- Pseudo-triplet Guided Few-shot Composed Image Retrieval
Bohan Hou, Haoqiang Lin, Haokun Wen, Meng Liu, and Xuemeng Song.
ArXiv preprint. [Paper]
2024
Simple but Effective Raw-Data Level Multimodal Fusion for Composed Image Retrieval
Haokun Wen, Xuemeng Song, Xiaolin Chen, Yinwei Wei, Liqiang Nie, and Tat-Seng Chua.
In ACM SIGIR 2024 (full paper). [Paper] [Code] [Slides] [BibTex]Self-Training Boosted Multi-Factor Matching Network for Composed Image Retrieval
Haokun Wen, Xuemeng Song, Jianhua Yin, Jianlong Wu, Weili Guan, and Liqiang Nie.
IEEE TPAMI, 2024. [Paper] [Code] [BibTex]Fine-Grained Textual Inversion Network for Zero-Shot Composed Image Retrieval
Haoqiang Lin, Haokun Wen, Xuemeng Song, Meng Liu, Yupeng Hu, and Liqiang Nie.
In ACM SIGIR 2024 (full paper). [Paper] [Code] [BibTex]Differential-Perceptive and Retrieval-Augmented MLLM for Change Captioning
Xian Zhang, Haokun Wen, Jianlong Wu, Pengda Qin, Hui Xue, and Liqiang Nie.
In ACM MM 2024 (full paper). [Paper] [Code]Interactive Garment Recommendation with User in the Loop
Federico Becattini, Xiaolin Chen, Andrea Puccia, Haokun Wen, Xuemeng Song, Liqiang Nie, and Alberto Del Bimbo.
ACM ToMM 2024. [Paper]
2023
Target-Guided Composed Image Retrieval
Haokun Wen, Xian Zhang, Xuemeng Song, Yinwei Wei, and Liqiang Nie.
In ACM MM 2023 (full paper). [Paper] [Code] [Slides] [BibTex]Finetuning Language Models for Multimodal Question Answering
Xin Zhang$^1$, Wen Xie$^1$, Ziqi Dai$^1$, Jun Rao, Haokun Wen, Xuan Luo, Meishan Zhang, and Min Zhang.
In ACM MM 2023 (grand challenge). [Paper] [BibTex]
Ranked 1st in both Chinese and English tracks of the VTQA 2023.Egocentric Early Action Prediction via Multimodal Transformer-Based Dual Action Prediction
Weili Guan, Xuemeng Song, Kejie Wang, Haokun Wen, Hongda Ni, Yaowei Wang, and Xiaojun Chang.
IEEE TCSVT, 2023. [Paper] [Code] [BibTex]
2022
Personalized Fashion Compatibility Modeling via Metapath-guided Heterogeneous Graph Learning
Weili Guan, Fangkai Jiao, Xuemeng Song, Haokun Wen, Chung-Hsing Yeh, and Xiaojun Chang.
In ACM SIGIR 2022 (full paper). [Paper] [Code] [BibTex]Partially Supervised Compatibility Modeling
Weili Guan, Haokun Wen, Xuemeng Song, Chun Wang, Chung-Hsing Yeh, Xiaojun Chang, and Liqiang Nie.
IEEE TIP, 2022. [Paper] [Code] [BibTex]
2021
Comprehensive Linguistic-Visual Composition Network for Image Retrieval
Haokun Wen, Xuemeng Song, Xin Yang, Yibing Zhan, and Liqiang Nie.
In ACM SIGIR 2021 (full paper). [Paper] [Code] [BibTex]Multimodal Compatibility Modeling via Exploring the Consistent and Complementary Correlations
Weili Guan, Haokun Wen, Xuemeng Song, Chung-Hsing Yeh, Xiaojun Chang, and Liqiang Nie.
In ACM MM 2021 (full paper). [Paper] [Code] [BibTex]Attribute-wise Explainable Fashion Compatibility Modeling
Xin Yang, Xuemeng Song, Fuli Feng, Haokun Wen, Ling-Yu Duan, and Liqiang Nie.
ACM ToMM, 2021. [Paper] [Code] [BibTex]
2020