Publications
Preprints
- FineCIR: Explicit Parsing of Fine-Grained Modification Semantics for Composed Image Retrieval
Zixu Li, Zhiheng Fu, Yupeng Hu, Zhiwei Chen, Haokun Wen, and Liqiang Nie.
ArXiv preprint. [Paper] - A Comprehensive Survey on Composed Image Retrieval
Xuemeng Song, Haoqiang Lin, Haokun Wen, Bohan Hou, Mingzhu Xu, and Liqiang Nie.
ArXiv preprint. [Paper] - Pseudo-triplet Guided Few-shot Composed Image Retrieval
Bohan Hou, Haoqiang Lin, Haokun Wen, Meng Liu, and Xuemeng Song.
ArXiv preprint. [Paper]
2025
- ENCODER: Entity Mining and Modification Relation Binding for Composed Image Retrieval
Zixu Li, Zhiwei Chen, Haokun Wen, Zhiheng Fu, Yupeng Hu, and Weili Guan.
In AAAI 2025.
2024
Simple but Effective Raw-Data Level Multimodal Fusion for Composed Image Retrieval
Haokun Wen, Xuemeng Song, Xiaolin Chen, Yinwei Wei, Liqiang Nie, and Tat-Seng Chua.
In ACM SIGIR 2024 (full paper). [Paper] [Code] [Slides] [BibTex]Self-Training Boosted Multi-Factor Matching Network for Composed Image Retrieval
Haokun Wen, Xuemeng Song, Jianhua Yin, Jianlong Wu, Weili Guan, and Liqiang Nie.
IEEE TPAMI, 2024. [Paper] [Code] [BibTex]Fine-Grained Textual Inversion Network for Zero-Shot Composed Image Retrieval
Haoqiang Lin, Haokun Wen, Xuemeng Song, Meng Liu, Yupeng Hu, and Liqiang Nie.
In ACM SIGIR 2024 (full paper). [Paper] [Code] [BibTex]Differential-Perceptive and Retrieval-Augmented MLLM for Change Captioning
Xian Zhang, Haokun Wen, Jianlong Wu, Pengda Qin, Hui Xue, and Liqiang Nie.
In ACM MM 2024 (full paper). [Paper] [Code]Interactive Garment Recommendation with User in the Loop
Federico Becattini, Xiaolin Chen, Andrea Puccia, Haokun Wen, Xuemeng Song, Liqiang Nie, and Alberto Del Bimbo.
ACM ToMM 2024. [Paper]
2023
Target-Guided Composed Image Retrieval
Haokun Wen, Xian Zhang, Xuemeng Song, Yinwei Wei, and Liqiang Nie.
In ACM MM 2023 (full paper). [Paper] [Code] [Slides] [BibTex]Finetuning Language Models for Multimodal Question Answering
Xin Zhang$^1$, Wen Xie$^1$, Ziqi Dai$^1$, Jun Rao, Haokun Wen, Xuan Luo, Meishan Zhang, and Min Zhang.
In ACM MM 2023 (grand challenge). [Paper] [BibTex]
Ranked 1st in both Chinese and English tracks of the VTQA 2023.Egocentric Early Action Prediction via Multimodal Transformer-Based Dual Action Prediction
Weili Guan, Xuemeng Song, Kejie Wang, Haokun Wen, Hongda Ni, Yaowei Wang, and Xiaojun Chang.
IEEE TCSVT, 2023. [Paper] [Code] [BibTex]
2022
Personalized Fashion Compatibility Modeling via Metapath-guided Heterogeneous Graph Learning
Weili Guan, Fangkai Jiao, Xuemeng Song, Haokun Wen, Chung-Hsing Yeh, and Xiaojun Chang.
In ACM SIGIR 2022 (full paper). [Paper] [Code] [BibTex]Partially Supervised Compatibility Modeling
Weili Guan, Haokun Wen, Xuemeng Song, Chun Wang, Chung-Hsing Yeh, Xiaojun Chang, and Liqiang Nie.
IEEE TIP, 2022. [Paper] [Code] [BibTex]
2021
Comprehensive Linguistic-Visual Composition Network for Image Retrieval
Haokun Wen, Xuemeng Song, Xin Yang, Yibing Zhan, and Liqiang Nie.
In ACM SIGIR 2021 (full paper). [Paper] [Code] [BibTex]Multimodal Compatibility Modeling via Exploring the Consistent and Complementary Correlations
Weili Guan, Haokun Wen, Xuemeng Song, Chung-Hsing Yeh, Xiaojun Chang, and Liqiang Nie.
In ACM MM 2021 (full paper). [Paper] [Code] [BibTex]Attribute-wise Explainable Fashion Compatibility Modeling
Xin Yang, Xuemeng Song, Fuli Feng, Haokun Wen, Ling-Yu Duan, and Liqiang Nie.
ACM ToMM, 2021. [Paper] [Code] [BibTex]
2020