Baolong Liu

Dr. Baolong Liu | 刘宝龙

Associate Professor

School of Computer Science and Technology, Zhejiang Gongshang University

📍No. 149, Jiaogong Road, Xihu District, Hangzhou City, Zhejiang Province, China, 310012

📧liubaolongx [at] gmail [dot] com

I received my PhD in Computer Science and Technology from Zhejiang University in 2018. Afterward, I joined Alibaba Group as an algorithm expert specializing in artificial intelligence. Since 2021, I have been working at the School of Computer Science and Technology at Zhejiang Gongshang University. My research focuses on human-computer interaction, intelligent transportation, embodied AI, and large vision models (LVMs). I currently lead projects funded by the National Natural Science Foundation of China and the Zhejiang Provincial Natural Science Foundation. I have published numerous papers at major international conferences, including ICCV, AAAI, SIGIR, and ACM MM. I am a committee member of the Visualization and Cognitive Computing Committee under the China Graphics Society and serve as a reviewer for journals and conferences such as IEEE TNNLS and ACM MM. With extensive experience in technology application and product development, I also maintain strong cooperative relationships with various companies.

What's New

January 2025

One paper accepted to SIGIR 2025.

December 2024

One paper accepted to AAAI 2025.

Research Interests

Human-Computer Interaction
Intelligent Waterway Transportation
Embodied AI
Large Vision Models (LVMs)

Selected Publications

^# denotes the corresponding author

B. Liu, R. Huang, X. Pan, C. Li, J. Sun, J. Dong, and X. Wang, "Advancing Ship Re-Identification in the Wild: The ShipReID-2400 Benchmark Dataset and D2InterNet Baseline Method," in Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2025. Full Text • Code & Dataset
B. Liu, R. Yang, R. Huang, W. Xu, X. Pan, C. Li, B. Wang, X. Wang, and J. Dong, "Towards Ship License Plate Recognition in the Wild: A Large Benchmark and Strong Baseline," in Proceedings of the 39th Annual AAAI Conference on Artificial Intelligence, 2025. Full Text • Code & Dataset
B. Liu, T. Zheng, P. Zheng, D. Liu, X. Qu, J. Gao, J. Dong, and X. Wang, "Lite-MKD: A multi-modal knowledge distillation framework for lightweight few-shot action recognition," in Proceedings of the 31st ACM International Conference on Multimedia, 2023: 7283-7294. Full Text • Code
J. Dong, M. Zhang, Z. Zhang, X. Chen, D. Liu, X. Qu, X. Wang, and B. Liu^#, "Dual learning with dynamic knowledge distillation for partially relevant video retrieval," in Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023: 11302-11312. Full Text • Code
J. Dong, X. Peng, Z. Ma, D. Liu, X. Qu, X. Yang, J. Zhu, and B. Liu^#, "From Region to Patch: Attribute-Aware Foreground-Background Contrastive Learning for Fine-Grained Fashion Retrieval," in Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023: 1273-1282. Full Text • Code
B. Liu, Q. Zheng, Y. Wang, M. Zhang, J. Dong, and X. Wang, "FeatInter: exploring fine-grained object features for video-text retrieval," Neurocomputing, 2022, 496: 178-191. Full Text
Q. Zheng, J. Dong, X. Qu, X. Yang, Y. Wang, P. Zhou, B. Liu^#, and X. Wang, "Progressive localization networks for language-based moment localization," ACM Transactions on Multimedia Computing, Communications and Applications, 2023, 19(2): 1-21. Full Text
B. Liu, J. Sheng, J. Dun, S. Zhang, Z. Hong, and X. Ye, "Locating Various Ship License Numbers in the Wild: An Effective Approach," IEEE Intelligent Transportation Systems Magazine, 2017, 9(4): 102-117. Full Text
B. Liu, S. Zhou, J. Dong, M. Xie, S. Zhou, T. Zheng, S. Zhang, X. Ye, and X. Wang, "Research Progress in Skeleton-Based Human Action Recognition," Journal of Computer-Aided Design & Computer Graphics, 2023, 35(9): 1299-1322. Full Text

Research Funding

Research on Few-shot Video Action Recognition Based on Vision-Language Multimodal Learning, No. 62402438, 2025.01-2027.12, National Natural Science Foundation Youth Project, RMB 300,000, Principal Investigator
Research on Open Environment Text Recognition for Intelligent Waterway Transportation, No. LQ24F020005, 2024.01-2026.12, Zhejiang Provincial Natural Science Foundation Youth Project, RMB 100,000, Principal Investigator
Key Technologies Research on Sensitive Multimedia Data Detection in Public Safety Events, No. 2021DSJSYS001, 2022.01-2023.12, Open Research Topic at the Ministry of Public Security's Key Laboratory of Big Data Architecture for Police Information Application, RMB 50,000, Principal Investigator
Research on Large-scale Video Clip Retrieval for Natural Language Queries, No. QRK23014, 2024.01-2026.12, Provincial University Basic Scientific Research Fund Project, RMB 50,000, Principal Investigator
Research on Several Key Issues of Complex Ship License Plate Detection and Recognition in Natural Scenes, No. 1300XJ2321063, 2021.10-2024.10, Introduction Talent Research Start-up Fund, RMB 100,000, Principal Investigator
Research and Development of Supporting Technologies for Viewing and Performing Spaces in Mixed Reality Environments, No. 2018YFB1404100, 2019.07-2022.06, National Key R&D Program Project, RMB 14 million, Academic Backbone
Cross-modal Intelligent Retrieval and Generation Platform and Applications Driven by Data and Knowledge, No. LQ19F020002, 2024.01-2026.12, Zhejiang Province "Pioneer" Key R&D Program Project, RMB 4 million, Academic Backbone
Key Technologies Research and Application of Ultra HD Digital Content Intelligent Generation and Copyright Protection, No. 2023C01212, 2023.01-2025.12, Zhejiang Province "Leading Goose" Tackling R&D Program Project, RMB 800,000, Academic Backbone
Research on Cross-modal Video Retrieval Under Resource Constraints, No. 62472385, 2025.01-2028.12, National Natural Science Foundation General Project, RMB 500,000, Technical Director
Smart Digital Recognition Algorithm System Development, 2021.11-2023.12, Enterprise Commissioned Project, RMB 800,000, Principal Investigator
Smart Document Processing Algorithm System Development, 2023.10-2024.12, Enterprise Commissioned Project, RMB 300,000, Principal Investigator

Patents

Gesture Control Method, Device, Electronic Device, and Computer-readable Medium, No. CN114153308B. (Granted, First Inventor)
Road Scene Image Processing Method, Device, Electronic Device, and Storage Medium, No. CN112991241A. (Granted, First Inventor)
Image Processing Method, Device, and Electronic Device, No. CN112784084B. (Granted, First Inventor)
Road Scene Image Processing Method, Device, and Electronic Device, No. CN112991510B. (Granted, First Inventor)
Control Method, Device, and Electronic Device for Electronic Devices, No. CN114138104A. (Under Examination, First Inventor)
Small Target Ship License Plate Detection Method, System, and Device for Low-light Environments, No. CN119251819A. (Under Examination, First Inventor)
Complex Ship License Plate Recognition Method, System, and Device Based on Mask Reconstruction and Semantic Enhancement Learning, No. CN119251820A. (Under Examination, First Inventor)

Teaching

Linux System and Programming (Spring 2024)

Undergraduate-level course covering Linux fundamentals