Wenrui Li received the Ph.D. degree in Computer Science and Technology from Harbin Institute of Technology (HIT) in Dec. 2025 under the supervision of Prof. Xiaopeng Fan, and the B.S. degree in Digital Signal Processing from the University of Electronic Science and Technology of China (UESTC) in 2021. He is a postdoctoral researcher at the School of Astronautics, HIT, working with Prof. Ligang Wu. He was a visiting student at Peng Cheng Laboratory (2022–2023) and the Peking University Shenzhen Graduate School (2023–2024), supervised by Prof. Yonghong Tian. His research interests include multimodal joint learning and retrieval, embodied AI, semantic communications and joint source–channel coding, spiking neural networks, and AI for Science. He has published 30+ refereed papers, including first-author or corresponding-author papers in IEEE TPAMI, TIP, TCSVT, TMM, and TASL, as well as top venues such as ACM MM and AAAI. He leads multiple research projects, including an NSFC Youth Student Basic Research Program (Doctoral Student) project on 2D/3D alignment and perception–interaction systems, and has also participated in national key R&D projects and industry–university collaborations. He has received several honors and scholarships, such as the HIT “Top Ten Outstanding Graduate Students” award, the Baosteel Scholarship, National Scholarships for Ph.D. students, and the BYD Scholarship. He serves as a reviewer for IEEE TIP, IEEE TCSVT, IEEE TMM, The Journal of Supercomputing, and conferences including CVPR, NeurIPS, AAAI, and ACM MM.

Self-motivated students are welcome to join [Prof. Fan's](http://homepage.hit.edu.cn/xiaopengfan) research group. Please feel free to contact me!

🔥 News

  • 2025.12:  🎉🎉 I have successfully passed my PhD thesis defense, and my dissertation was awarded the Harbin Institute of Technology Outstanding Doctoral Dissertation Award. (哈尔滨工业大学优秀博士学位论文)
  • 2025.12:  🎉🎉 As a project leader among the first cohort of “Chuang Class” students, I participated in and co-launched Harbin Institute of Technology’s inaugural “Innovation-Driven Talent Support Program” special class.
  • 2025.12:  🎉🎉 Our project “Fairness & Openness: Leading a New Era of Full-Scenario Smart Examination Environments” (公平·开放:引领全场景智慧考场新时代) won the National Grand Prize (Special Prize) at the 10th National Youth AI Innovation & Entrepreneurship Competition! (第十届全国青年人工智能创新创业大会全国特等奖)
  • 2025.11:  🎉🎉 One paper has been accepted by IEEE TPAMI (CCF-A) as first author! Congratulations to Wei!
  • 2025.11:  🎉🎉 One paper has been accepted by AAAI (CCF-A) as co-first author, and was selected as Oral! Congratulations to Yidan!
  • 2025.11:  🎉🎉 Two collaborative papers have been accepted by AAAI (CCF-A), congratulations to Zhitao and Han!
  • 2025.10:  🎉🎉 I was selected as one of the “Top 10 Outstanding Graduate Talents of Harbin Institute of Technology” (哈工大十佳英才)!
  • 2025.10:  🎉🎉 I have received the 2025 China National Scholarship for doctoral students!
  • 2025.10:  🎉🎉 I have received the Baosteel Outstanding Student Scholarship (宝钢优秀学生奖学金; only six recipients university-wide)!
  • 2025.09:  🎉🎉 My project “Lingjing Construction”, a Digital Twin 3D Virtual Content Industrialization Project (“灵境构筑”数字孪生三维虚拟内容产业化项目), has been approved and funded under the Entrepreneurship-Driven Innovative Talent Support Program (“创业驱动的创新人才托举工程”专项计划) of the Harbin Institute of Technology Suzhou Research Institute.
  • 2025.08:  🎉🎉 One paper has been accepted by IEEE Transactions on Image Processing as first author!
  • 2025.08:  🎉🎉 One collaborative paper has been accepted by IEEE Transactions on Audio, Speech and Language Processing as corresponding author. Congratulations to Zhe Yang!
  • 2025.07:  🎉🎉 One collaborative paper has been accepted by ACM MM 2025 (CCF-A) as corresponding author. Congratulations to Runlin and Yipu!
  • 2025.05:  🎉🎉 One paper (Multi-Timescale Motion-Decoupled Spiking Transformer for Audio-Visual Zero-Shot Learning) has been accepted by IEEE TCSVT as first author! My first paper of 2025 sure took its sweet time to show up!
  • 2025.02:  🎉🎉 One collaborative paper has been accepted by Neurocomputing. Congratulations to Zhe Yang!
  • 2024.12:  🎉🎉 One collaborative paper has been accepted by ICASSP (CCF-B). Congratulations to Yuchuan!
  • 2024.12:  🎉🎉 One collaborative paper has been accepted by AAAI (CCF-A). Congratulations to Jisheng!
  • 2024.12:  🎉🎉 Two papers have been accepted by AAAI (CCF-A) as first author!
  • 2024.12:  🎉🎉 I have received the Bydauto Scholarship (rank 1/69)!
  • 2024.12:  🎉🎉 My project “2D/3D Image and Graphics Alignment, Fusion, and Perception-Based Collaborative Interaction System” has been funded by the National Natural Science Foundation of China (国家自然科学基金青年学生基础研究项目(博士研究生)) for 2025.01-2027.12!
  • 2024.08:  🎉🎉 My project “Trusted multimodal alignment fusion and transmission” has been funded by the Fundamental Research Funds for the Central Universities (哈尔滨工业大学点子基金) for 2024.08-2025.08!
  • 2024.07:  🎉🎉 Two collaborative papers have been accepted by ACM MM (CCF-A). Congratulations to Haonan!
  • 2024.06:  🎉🎉 One paper (Spiking Tucker Fusion Transformer for Audio-Visual Zero-Shot Learning) has been accepted by IEEE TIP as first author!
  • 2024.05:  🎉🎉 One collaborative paper (DV-Hop Localization Based On Distance Estimation Using Multi-Node and Hop Loss in IoT) has been accepted by IEEE IoTJ. Congratulations to Penghong!
  • 2024.04:  🎉🎉 One paper (Multi-layer Probabilistic Association Reasoning Network for Image-Text Retrieval) has been accepted by IEEE TCSVT as first author!
  • 2024.03:  🎉🎉 One paper (SMILE: Spiking Multi-modal Interactive Label-Guided Enhancement Network for Emotion Recognition) has been accepted by IEEE ICME (CCF-B). Congratulations to Ming Guo!
  • 2024.02:  🎉🎉 One paper (Multi-Scale Spiking Pyramid Wireless Communication Framework for Food Recognition) has been accepted by IEEE TMM as first author!
  • 2023.10:  🎉🎉 I have received the 2023 China National Scholarship for doctoral students (top 1.5%)! (23/10/2023)
  • 2023.08:  🎉🎉 Two papers have been accepted by ACM MM (CCF-A) as first author!

📝 Publications (# corresponding author, * equal contribution)

  • First Author or Corresponding Author
    1. Wenrui Li, Wei Han, Hengyu Man, Wangmeng Zuo, Xiaopeng Fan# and Yonghong Tian. “Language-Guided Graph Representation Learning for Video Summarization”, IEEE Transactions on Pattern Analysis and Machine Intelligence. (IEEE TPAMI)
    2. Wenrui Li, Penghong Wang, Ruiqin Xiong and Xiaopeng Fan#. “Spiking Tucker Fusion Transformer for Audio-Visual Zero-Shot Learning”, IEEE Transactions on Image Processing. (IEEE TIP)
    3. Wenrui Li, Wei Han, Liang-Jian Deng, Ruiqin Xiong and Xiaopeng Fan#. “Spiking Variational Graph Representation Inference for Video Summarization” in IEEE Transactions on Image Processing. (IEEE TIP)
    4. Wenrui Li, Ruiqin Xiong and Xiaopeng Fan#. “Multi-layer Probabilistic Association Reasoning Network for Image-Text Retrieval” in IEEE Transactions on Circuits and Systems for Video Technology (IEEE TCSVT)
    5. Wenrui Li, Jiahui Li, Mengyao Ma, Xiaopeng Hong and Xiaopeng Fan#. “Multi-Scale Spiking Pyramid Wireless Communication Framework for Food Recognition,” in IEEE Transactions on Multimedia (IEEE TMM), 2024.
    6. Wenrui Li, Zhengyu Ma, LiangJian Deng, Xiaopeng Fan# and Yonghong Tian. “Neuron-Based Spiking Transmission and Reasoning Network for Robust Image-Text Retrieval,” in IEEE Transactions on Circuits and Systems for Video Technology (IEEE TCSVT), 2022, doi: 10.1109/TCSVT.2022.3233042.
    7. Wenrui Li, Penghong Wang, Xingtao Wang, Wangmeng Zuo, Xiaopeng Fan# and Yonghong Tian. “Multi-Timescale Motion-Decoupled Spiking Transformer for Audio-Visual Zero-Shot Learning”, IEEE Transactions on Circuits and Systems for Video Technology (IEEE TCSVT), 2025.
    8. Wenrui Li*, Yidan Lu*, Yeyu Chai, Rui Zhao, Hengyu Man, Xiaopeng Fan#. “Hyperbolic Hierarchical Alignment Reasoning Network for Text-3D Retrieval”, in AAAI, Oral 2026.
    9. Wenrui Li, Zhe Yang, Wei Han, Hengyu Man, Xingtao Wang# and Xiaopeng Fan. “Hyperbolic-constraint Point Cloud Reconstruction from Single RGB-D Images”, in AAAI, 2025.
    10. Wenrui Li, Wei Han, Yandu Chen, Yeyu Chai, Yidan Lu, Xingtao Wang# and Xiaopeng Fan. “Riemann-based Multi-scale Attention Reasoning Network for Text-3D Retrieval”, in AAAI, 2025.
    11. Wenrui Li, Zhengyu Ma#, LiangJian Deng, Penghong Wang, Jinqiao Shi and Xiaopeng Fan#. “Reservoir Computing Transformer for Image Text Retrieval,” in ACM International Conference on Multimedia (ACM MM), Ottawa, Canada, 2023.
    12. Wenrui Li, XiLe Zhao, Zhengyu Ma#, Xingtao Wang, Xiaopeng Fan# and Yonghong Tian. “Motion Decoupled Spiking Transformer for Audio Visual Zero-Shot Learning,” in ACM International Conference on Multimedia (ACM MM), Ottawa, Canada, 2023.
    13. RunLin Yu, Yipu Gong, Wenrui Li#, Aiwen Sun, Mengren Zheng, “Discrepancy-Aware Attention Network for Enhanced Audio-Visual Zero-Shot Learning”, in ACM International Conference on Multimedia (ACM MM), 2025.
    14. Zhe Yang, Wenrui Li# and Guanghui Cheng#. “SHMamba: Structured Hyperbolic State Space Model for Audio-Visual Question Answering”, IEEE Transactions on Audio, Speech and Language Processing (T-ASL), 2025.
    15. Wenrui Li, Zhengyu Ma, Jinqiao Shi# and Xiaopeng Fan. “The style transformer with common knowledge optimization for image text retrieval,” in IEEE Signal Processing Letters (SPL), 2023.
    16. Wenrui Li, Zhengyu Ma, LiangJian Deng and Xiaopeng Fan#. “Modality Fusion Spiking Transformer Network for Audio-Visual Zero Shot Learning,” in IEEE International Conference on Multimedia and Expo (ICME), Oral, Brisbane, Australia, 2023.
    17. Wenrui Li and Xiaopeng Fan#. “Image-Text Alignment and Retrieval Using Light-Weight Transformer”, IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022.
    18. Zhe Yang, Wenrui Li#, Jingxiu Hou and Guanghui Cheng#. “Multi-Modal Spiking Tensor Regression Network for Audio-Visual Zero-Shot Learning”, Neurocomputing, 2025.
    19. Jinyu Guo, Yuejia Li, Guanghui Cheng# and Wenrui Li#. “Based-CLIP early fusion transformer for image caption”, Signal, Image and Video Processing, 19, 112 (2025). https://doi.org/10.1007/s11760-024-03721-0.
    20. Xingtao Wang, Kaixin Wu, Jinyu Zhang, Yuxuan Wang and Wenrui Li*. “PanoExtend: An omnidirectional image super-resolution method based on spherical expansion”, in ACM MM Asia Workshop, 2026.
    21. Wenrui Li, Jifei Miao and Guanghui Cheng#. “A Jacobi-Like Algorithm for the General Joint Diagonalization Problem with Its Application to Blind Source Separation,” 12th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics, 2019.
  • Collaborative Paper
    1. Jisheng Chu, Wenrui Li, Xingtao Wang#, Ning Kanglin, Yidan Lu and Xiaopeng Fan. “Digging into Intrinsic Contextual Information for High-fidelity 3D Point Cloud Completion”, in AAAI, 2025.
    2. Zhitao Wang, Hengyu Man#, Wenrui Li, Xingtao Wang, Xiaopeng Fan, Debin Zhao. “T-GVC: Trajectory-Guided Generative Video Coding at Ultra-Low Bitrates”, in AAAI, 2026.
    3. Han Liu, Hengyu Man#, Xingtao Wang, Wenrui Li, Debin Zhao. “MRT: Learning Compact Representations with Mixed RWKV-Transformer for Extreme Image Compression”, in AAAI, 2026.
    4. Haonan Zheng, Xinyang Deng#, Wen Jiang, and Wenrui Li, “A Unified Understanding of Adversarial Vulnerability Regarding Unimodal Models and Vision-Language Pre-training Models” in ACM International Conference on Multimedia, 2024.
    5. Haonan Zheng, Wen Jiang#, Xinyang Deng and Wenrui Li, “Sample-agnostic Adversarial Perturbation for Vision-Language Pre-training Models”, in ACM International Conference on Multimedia, 2024.
    6. P. Wang, X. Wang, W. Li, X. Fan# and D. Zhao. “DV-Hop Localization Based On Distance Estimation Using Multi-Node and Hop Loss in IoT” in IEEE Internet of Things Journal (IEEE IoTJ), 2024, doi: 10.1109/JIOT.2024.3404492.
    7. Rui Zhao, Jiyuan Zhang, Yanchen Dong, Wenrui Li and Yajing Zheng#, “Spike Camera Image Reconstruction Based on an Efficient Spiking Transformer”, in ACM MM Asia Workshop, 2026.
    8. M. Guo, W. Li, C. Wang, Y. Ge and C. Wang#. “SMILE: Spiking Multi-modal Interactive Label-Guided Enhancement Network for Emotion Recognition,” in IEEE International Conference on Multimedia and Expo (ICME), 2024.
    9. Yuchuan Feng, Jihang Jiang, Jie Ren, Ruotong Li#, Wenrui Li and Xiaopeng Fan. “Text-Guided Editable 3D City Scene Generation,” in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2025.
    10. Jifei Miao, Guanghui Cheng#, Wenrui Li and Gong Zhang. “Non-orthogonal approximate joint diagonalization of non-Hermitian matrices in the least-squares sense,” Neurocomputing, 2019.
    11. Jifei Miao, Guanghui Cheng#, Wenrui Li and Eric Moreau. “A unitary joint diagonalization algorithm for nonsymmetric higher‐order tensors based on Givens‐like rotations,” Numerical Linear Algebra with Applications, 2020.

🎖 Selected Honors and Awards

  • “Top 10 Outstanding Graduate Talents of Harbin Institute of Technology” (哈工大十佳英才)
  • National Grand Prize (Special Prize) of 10th National Youth AI Innovation & Entrepreneurship Competition (第十届全国青年人工智能创新创业大会全国特等奖)
  • 2025 China National Scholarship for doctoral students (top 1.5%), Ministry of Education of the People’s Republic of China.
  • 2024 Baosteel Outstanding Student Scholarship (宝钢优秀学生奖学金; only six recipients university-wide)
  • 2024 Bydauto Scholarship for doctoral students (1/69)
  • 2023 China National Scholarship for doctoral students (top 1.5%), Ministry of Education of the People’s Republic of China.
  • Outstanding Communist Party Member, Faculty of Computing, Harbin Institute of Technology.
  • Outstanding Student of Faculty of Computing, Harbin Institute of Technology.
  • 2021 Outstanding Graduate of the University of Electronic Science and Technology of China.
  • 2018-2021 Outstanding Student Scholarship (top 20%) of University of Electronic Science and Technology of China.
  • 2021 Meritorious Winner (winning ratio 7.09%) in the Mathematical Contest in Modeling (MCM), organized by COMAP, USA.

📖 Education

  • 2021.09 - 2025.12, Harbin Institute of Technology, Doctor of Philosophy.
  • 2017.09 - 2021.06, University of Electronic Science and Technology of China, Bachelor of Science.

📕 Projects

  • Project Leader: 2D/3D Image and Graphics Alignment, Fusion, and Perception-Based Collaborative Interaction System supported by National Natural Science Foundation of China, 2025.01-2027.12, 国家自然科学基金青年学生基础研究项目(博士研究生).
  • Project Leader: “Lingjing Construction”, a Digital Twin 3D Virtual Content Industrialization Project (“灵境构筑”数字孪生三维虚拟内容产业化项目), supported by the Entrepreneurship-Driven Innovative Talent Support Program (“创业驱动的创新人才托举工程”专项计划) of the Harbin Institute of Technology Suzhou Research Institute.
  • Project Leader: Trusted multimodal alignment fusion and transmission supported by Fundamental Research Funds for the Central Universities, 2024.08-2025.08, 哈尔滨工业大学点子基金.

💻 Internships

  • 2023.08 - 2024.06, Peking University Shenzhen Graduate School, Visiting Student, supervised by Prof. Yonghong Tian.
  • 2022.03 - 2023.03, Peng Cheng Laboratory, Visiting Student, supervised by Prof. Yonghong Tian.
