Wenrui Li received the B.S. degree from the School of Information and Software Engineering, University of Electronic Science and Technology of China (UESTC), Chengdu, China, in 2021. He is currently pursuing the Ph.D. degree supervised by Prof. Xiaopeng Fan with the School of Computer Science, Harbin Institute of Technology (HIT), Harbin, China. He also studied as visiting student in Pengcheng Laboratory in 2022, supervised by Prof. Yonghong Tian. He has authored or co-authored more than 20 technical papers in refereed international journals and conferences. His research interests include multimedia search, joint source-channel coding, spiking neural networks and Embodied AI. He also serves as a reviewer for IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, IEEE TRANSACTIONS ON MULTIMEDIA, NeurIPS, ECCV, AAAI, and ACM MM.

I am now actively looking for academic opportunities in 2025, If you are interested, please do not hesitate to drop me an email. Thank you!

🔥 News

2025.08: 🎉🎉 One collaborative paper have been accepted by IEEE Transactions on Audio, Speech and Language Processing as corresponding author, congratulation to Zhe Yang!
2025.07: 🎉🎉 One collaborative paper have been accepted by ACM MM 2025 (CCF-A) as corresponding author, congratulation to Runlin and Yipu!
2025.05: 🎉🎉 One paper (Multi-Timescale Motion-Decoupled Spiking Transformer for Audio-Visual Zero-Shot Learning ) has been accepted by IEEE TCSVT as first author! My first paper of 2025 sure took its sweet time to show up!
2025.02: 🎉🎉 One collaborative paper have been accepted by Neurocomputing, congratulation to Zhe Yang!
2024.12: 🎉🎉 One collaborative paper have been accepted by ICASSP (CCF-B), congratulation to Yuchuan!
2024.12: 🎉🎉 One collaborative paper have been accepted by AAAI (CCF-A), congratulation to Jisheng!
2024.12: 🎉🎉 Two papers have been accepted by AAAI (CCF-A) as first author!
2024.12: 🎉🎉 I have received the Bydauto Scholarship! Rank (1/69)!
2024.12: 🎉🎉 I have supported by the National Natural Science Foundation of China, 国家自然科学基金青年学生基础研究项目(博士研究生): 2D/3D Image and Graphics Alignment, Fusion, and Perception-Based Collaborative Interaction System from 2025.01-2027.12!
2024.08: 🎉🎉 I have supported by the Fundamental Research Funds for the Central Universities (哈尔滨工业大学点子基金): Trusted multimodal alignment fusion and transmission from 2024.08-2025.08!
2024.07: 🎉🎉 Two collaborative papers have been accepted by ACM MM (CCF-A), congratulation to Haonan!
2024.06: 🎉🎉 One paper (Spiking Tucker Fusion Transformer for Audio-Visual Zero-Shot Learning) has been accepted by IEEE TIP as first author!
2024.05: 🎉🎉 One collaborative paper (DV-Hop Localization Based On Distance Estimation Using Multi-Node and Hop Loss in IoT) has been accepted by IEEE IoTJ, congratulation to Penghong!
2024.04: 🎉🎉 One paper (Multi-layer Probabilistic Association Reasoning Network for Image-Text Retrieval) has been accepted by IEEE TCSVT as first author!
2024.03: 🎉🎉 One paper (SMILE: spiking multi-modal interactive label-guided enhancement network for emotion recognition) has been accepted by IEEE ICME (CCF-B), congratulation to Ming Guo!
2024.02: 🎉🎉 One paper (Multi-Scale Spiking Pyramid Wireless Communication Framework for Food Recognition) has been accepted by IEEE TMM as first author!
2023.10: 🎉🎉 I have received the 2023 China National Scholarship for doctoral students (top 1.5%)! (23/10/2023)
2023.08: 🎉🎉 Two papers have been accepted by ACM MM (CCF-A) as first author!

📝 Publications (#Corresponding Author,*equal contribution)

First Autor or Corresponding Author
1. Wenrui Li, Penghong Wang, Ruiqin xiong and Xiaopeng Fan^#. “Spiking Tucker Fusion Transformer for Audio-Visual Zero-Shot Learning” IEEE Transactions on Image Processing. (IEEE TIP)
2. Wenrui Li, Ruiqin Xiong and Xiaopeng Fan^#. “ Multi-layer Probabilistic Association Reasoning Network for Image-Text Retrieval” in IEEE Transactions on Circuits and Systems for Video Technology (IEEE TCSVT)
3. Wenrui Li, Jiahui Li, Mengyao Ma, Xiaopeng Hong and Xiaopeng Fan^#.”Multi-Scale Spiking Pyramid Wireless Communication Framework for Food Recognition,” in IEEE Transactions on Multimedia (IEEE TMM), 2024.
4. Wenrui Li, Zhengyu Ma, LiangJian Deng, Xiaopeng Fan^# and Yonghong Tian.”Neuron-Based Spiking Transmission and Reasoning NetworkFor Robust Image-Text Retrieval,” in IEEE Transactions on Circuits and Systems for Video Technology (IEEE TCSVT), 2022, doi: 10.1109/TCSVT.2022.3233042.
5. Wenrui Li, Penghong Wang, Xingtao Wang, Wangmeng Zuo, Xiaopeng Fan^# and Yonghong Tian. “Multi-Timescale Motion-Decoupled Spiking Transformer for Audio-Visual Zero-Shot Learning ” IEEE Transactions on Circuits and Systems for Video Technology (IEEE TCSVT), 2025.
6. Wenrui Li, Zhe Yang, Wei Han, Hengyu Man, Xingtao Wang^# and Xiaopeng Fan. “Hyperbolic-constraint Point Cloud Reconstruction from Single RGB-D Images”, in AAAI, 2025.
7. Wenrui Li, Wei Han, Yandu Chen, Yeyu Chai, Yidan Lu, Xingtao Wang^# and Xiaopeng Fan. “Riemann-based Multi-scale Attention Reasoning Network for Text-3D Retrieval”, in AAAI, 2025.
8. Wenrui Li, Zhengyu Ma^#, LiangJian Deng, Penghong Wang, Jinqiao Shi and Xiaopeng Fan^#. “Reservoir Computing Transformer for Image Text Retrieval,” in ACM International Conference on Multimedia (ACM MM), Ottawa, Canada, 2023.
9. Wenrui Li, XiLe Zhao, Zhengyu Ma^#, Xingtao Wang, Xiaopeng Fan^# and Yonghong Tian. “Motion Decoupled Spiking Transformer for Audio Visual Zero-Shot Learning,” in ACM International Conference on Multimedia (ACM MM), Ottawa, Canada, 2023.
10. RunLin Yu, Yipu Gong, Wenrui Li^#, Aiwen Sun, Mengren Zheng, “Discrepancy-Aware Attention Network for Enhanced Audio-Visual Zero-Shot Learning”, in ACM International Conference on Multimedia (ACM MM), 2025.
11. Zhe Yang, Wenrui Li^# and Guanghui Cheng^#. “SHMamba: Structured Hyperbolic State Space Model for Audio-Visual Question Answering”, IEEE Transactions on Audio, Speech and Language Processing (T-ASL), 2025.
12. Wenrui Li, Zhengyu Ma, Jinqiao Shi^# and Xiaopeng Fan. “The style transformer with common knowledge optimization for image text retrieval,” in IEEE Signal Processing Letter (SPL), 2023.
13. Wenrui Li, Zhengyu Ma, LiangJian Deng and Xiaopeng Fan^#. “Modality Fusion Spiking Transformer Network for AudioVisual Zero Shot Learning,” in IEEE International Conference on Multimedia and Expo (ICME), Oral, Brisbane, Australia, 2023.
14. Wenrui Li and Xiaopeng Fan^#. “Image-Text Alignment and Retrieval Using Light-Weight Transformer”, IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022.
15. Zhe Yang, Wenrui Li^#, Jingxiu Hou and Guanghui Cheng^#. Multi-Modal Spiking Tensor Regression Network for Audio-Visual Zero-Shot Learning, Neurocomputing, 2025.
16. Jinyu Guo, Yuejia Li, Guanghui Cheng^# and Wenrui Li^#. Based-CLIP early fusion transformer for image caption. Signal, Image and Video Processing, 19, 112 (2025). https://doi.org/10.1007/s11760-024-03721-0.
17. Wenrui Li, Jifei Miao and Guanghui Cheng^#. “A Jacobi-Like Algorithm for the General Joint Diagonalization Problem with Its Application to Blind Source Separation,” 12th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics, 2019.
Collaborative Paper
1. Jisheng Chu, Wenrui Li, Xingtao Wang^#, Ning Kanglin, Yidan Lu and Xiaopeng Fan. “Digging into Intrinsic Contextual Information for High-fidelity 3D Point Cloud Completion”, in AAAI, 2025.
2. Haonan Zheng, Xinyang Deng^#, Wen Jiang, and Wenrui Li, “A Unified Understanding of Adversarial Vulnerability Regarding Unimodal Models and Vision-Language Pre-training Models” in ACM International Conference on Multimedia, 2024.
3. Haonan Zheng, Wen Jiang^#, Xinyang Deng and Wenrui Li, “Sample-agnostic Adversarial Perturbation for Vision-Language Pre-training Models”, in ACM International Conference on Multimedia, 2024.
4. P. Wang, X. Wang, W. Li, X. Fan^# and D. Zhao. 2024. “DV-Hop Localization Based On Distance Estimation Using Multi-Node and Hop Loss in IoT” in IEEE Internet of Things Journal (IEEE IoTJ),doi: 10.1109/ JIOT.2024.3404492.
5. M. Guo, W. Li, C. Wang, Y. Ge and C. Wang^#. 2024. “SMILE: Spiking Multi-modal Interactive Label-Guided Enhancement Network for Emotion Recognition,” 2024, in IEEE International Conference on Multimedia and Expo.
6. Yuchuan Feng, Jihang Jiang, Jie Ren, Ruotong Li^#, Wenrui Li and Xiaopeng Fan. “Text-Guided Editable 3D City Scene Generation,” 2025, in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
7. Jifei Miao, Guanghui Cheng^#, Wenrui Li and Gong Zhang. “Non-orthogonal approximate joint diagonalization of non-Hermitian matrices in the least-squares sense,” Neurocomputing, 2019.
8. Jifei Miao, Guanghui Cheng^#, Wenrui Li and Eric Moreau. “A unitary joint diagonalization algorithm for nonsymmetric higher‐order tensors based on Givens‐like rotations,” Numerical Linear Algebra with Applications, 2020.
9. Guanghui Cheng^#, Jifei Miao, Wenrui Li, “Two Jacobi-like algorithms for the general joint diagonalization problem with applications to blind source separation,” Chinese Journal of Electronics, doi: 10.23919/cje.2019.00.102, 2022.

🎖 Selected Honors and Awards

2024 Bydauto Scholarship for doctoral students (1/69)
2023 China National Scholarship for doctoral students (top 1.5%), Ministry of Education of the People’s Republic of China.
Outstanding Communist Party of China in Harbin Institute of Technology, Faculty of Computing, Harbin Institute of Technology.
Outstanding Student of Faculty of Computing, Harbin Institute of Technology.
2021 Outstanding Graduates of University of Electronic Science and Technology of China.
2018-2021 Outstanding Student Scholarship (top 20%) of University of Electronic Science and Technology of China.
2021 Meritorious Winner (winning ratio 7.09%) in Mathematical Contest In Modeling, the COMAP of American.

📖 Educations

2021.09 - 2025.12 (expected), Harbin institute of technology, Doctor of Philosophy.
2017.09 - 2021.06, University of Electronic Science and Technology of China, Bachelor.

📕 Projects

Project Leader: 2D/3D Image and Graphics Alignment, Fusion, and Perception-Based Collaborative Interaction System supported by National Natural Science Foundation of China, 2025.01-2027.12, 国家自然科学基金青年学生基础研究项目(博士研究生).
Project Leader: Trusted multimodal alignment fusion and transmission supported by Fundamental Research Funds for the Central Universities, 2024.08-2025.08, 哈尔滨工业大学点子基金.

💻 Internships