Profile
Dr. Baolong Liu | 刘宝龙
Associate Professor
School of Computer Science and Technology, Zhejiang Gongshang University
📍 No. 149, Jiaogong Road, Xihu District, Hangzhou City, Zhejiang Province, China, 310012
📧 liubaolongx [at] gmail [dot] com

I received my PhD in Computer Science and Technology from Zhejiang University in 2018 and then joined Alibaba Group as an algorithm expert specializing in artificial intelligence. Since 2021, I have worked at the School of Computer Science and Technology, Zhejiang Gongshang University. My research focuses on human-computer interaction, intelligent transportation, embodied intelligence, and large language models (LLMs). I currently lead projects funded by the National Natural Science Foundation of China and the Zhejiang Provincial Natural Science Foundation, and I have published numerous papers at major international conferences, including ICCV, AAAI, SIGIR, and ACM MM. I am a committee member of the Visualization and Cognitive Computing Committee under the China Graphics Society and serve as a reviewer for journals and conferences such as IEEE TNNLS and ACM MM. I also have extensive experience in technology application and product development and maintain close collaborations with a number of companies.

What's New

January 2025

New paper submitted to SIGIR 2025.

December 2024

New paper accepted to AAAI 2025.

Research Interests

  • Human-Computer Interaction
  • Intelligent Waterway Transportation

Selected Publications

# denotes the corresponding author

  • B. Liu, R. Yang, R. Huang, W. Xu, X. Pan, C. Li, B. Wang, X. Wang, and J. Dong, "Towards Ship License Plate Recognition in the Wild: A Large Benchmark and Strong Baseline," in Proceedings of the 39th Annual AAAI Conference on Artificial Intelligence, 2025.
  • B. Liu, T. Zheng, P. Zheng, D. Liu, X. Qu, J. Gao, J. Dong, and X. Wang, "Lite-MKD: A multi-modal knowledge distillation framework for lightweight few-shot action recognition," in Proceedings of the 31st ACM International Conference on Multimedia, 2023: 7283-7294. Full Text | Code
  • J. Dong, M. Zhang, Z. Zhang, X. Chen, D. Liu, X. Qu, X. Wang, and B. Liu#, "Dual learning with dynamic knowledge distillation for partially relevant video retrieval," in Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023: 11302-11312. Full Text | Code
  • J. Dong, X. Peng, Z. Ma, D. Liu, X. Qu, X. Yang, J. Zhu, and B. Liu#, "From Region to Patch: Attribute-Aware Foreground-Background Contrastive Learning for Fine-Grained Fashion Retrieval," in Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023: 1273-1282. Full Text | Code
  • B. Liu, Q. Zheng, Y. Wang, M. Zhang, J. Dong, and X. Wang, "FeatInter: exploring fine-grained object features for video-text retrieval," Neurocomputing, 2022, 496: 178-191. Full Text
  • Q. Zheng, J. Dong, X. Qu, X. Yang, Y. Wang, P. Zhou, B. Liu#, and X. Wang, "Progressive localization networks for language-based moment localization," ACM Transactions on Multimedia Computing, Communications and Applications, 2023, 19(2): 1-21. Full Text
  • B. Liu, J. Sheng, J. Dun, S. Zhang, Z. Hong, and X. Ye, "Locating Various Ship License Numbers in the Wild: An Effective Approach," IEEE Intelligent Transportation Systems Magazine, 2017, 9(4): 102-117. Full Text
  • B. Liu, S. Zhou, J. Dong, M. Xie, S. Zhou, T. Zheng, S. Zhang, X. Ye, and X. Wang, "Research Progress in Skeleton-Based Human Action Recognition," Journal of Computer-Aided Design & Computer Graphics, 2023, 35(9): 1299-1322. Full Text

Research Funding

  • Research on Few-shot Video Action Recognition Based on Vision-Language Multimodal Learning, No. 62402438, 2025.01-2027.12, National Natural Science Foundation Youth Project, RMB 300,000, Principal Investigator
  • Research on Open Environment Text Recognition for Intelligent Waterway Transportation, No. LQ24F020005, 2024.01-2026.12, Zhejiang Provincial Natural Science Foundation Youth Project, RMB 100,000, Principal Investigator
  • Key Technologies Research on Sensitive Multimedia Data Detection in Public Safety Events, No. 2021DSJSYS001, 2022.01-2023.12, Open Research Topic at the Ministry of Public Security's Key Laboratory of Big Data Architecture for Police Information Application, RMB 50,000, Principal Investigator
  • Research on Large-scale Video Clip Retrieval for Natural Language Queries, No. QRK23014, 2024.01-2026.12, Provincial University Basic Scientific Research Fund Project, RMB 50,000, Principal Investigator
  • Research on Several Key Issues of Complex Ship License Plate Detection and Recognition in Natural Scenes, No. 1300XJ2321063, 2021.10-2024.10, Introduction Talent Research Start-up Fund, RMB 100,000, Principal Investigator
  • Research and Development of Supporting Technologies for Viewing and Performing Spaces in Mixed Reality Environments, No. 2018YFB1404100, 2019.07-2022.06, National Key R&D Program Project, RMB 14 million, Academic Backbone
  • Cross-modal Intelligent Retrieval and Generation Platform and Applications Driven by Data and Knowledge, No. LQ19F020002, 2024.01-2026.12, Zhejiang Province "Sharpshooter" Key R&D Program Project, RMB 4 million, Academic Backbone
  • Key Technologies Research and Application of Ultra HD Digital Content Intelligent Generation and Copyright Protection, No. 2023C01212, 2023.01-2025.12, Zhejiang Province "Leading Goose" Tackling R&D Program Project, RMB 800,000, Academic Backbone
  • Research on Cross-modal Video Retrieval Under Resource Constraints, No. 62472385, 2025.01-2028.12, National Natural Science Foundation General Project, RMB 500,000, Technical Director
  • Smart Digital Recognition Algorithm System Development, 2021.11-2023.12, Enterprise Commissioned Project, RMB 800,000, Principal Investigator
  • Smart Document Processing Algorithm System Development, 2023.10-2024.12, Enterprise Commissioned Project, RMB 300,000, Principal Investigator

Patents

  • Gesture Control Method, Device, Electronic Device, and Computer-readable Medium, No. CN114153308B. (Granted, First Inventor)
  • Road Scene Image Processing Method, Device, Electronic Device, and Storage Medium, No. CN112991241A. (Granted, First Inventor)
  • Image Processing Method, Device, and Electronic Device, No. CN112784084B. (Granted, First Inventor)
  • Road Scene Image Processing Method, Device, and Electronic Device, No. CN112991510B. (Granted, First Inventor)
  • Control Method, Device, and Electronic Device for Electronic Devices, No. CN114138104A. (Under Examination, First Inventor)
  • Small Target Ship License Plate Detection Method, System, and Device for Low-light Environments, No. CN119251819A. (Under Examination, First Inventor)
  • Complex Ship License Plate Recognition Method, System, and Device Based on Mask Reconstruction and Semantic Enhancement Learning, No. CN119251820A. (Under Examination, First Inventor)

Teaching

Linux System and Programming (Spring 2024)

Undergraduate-level course covering Linux fundamentals.