๐ Publications
๐ Publications
Acceptance

Global Instance Tracking: Locating Target More Like Humans
Shiyu Hu, X. Zhao, L. Huang, K. Huang
IEEE Transactions on Pattern Analysis and Machine Intelligence (CCF-A Journal)
๐ Visual Object Tracking ๐ Large-scale Benchmark Construction ๐ Intelligent Evaluation Technology
๐ Paper ๐ bibTex ๐ PDF ๐ชง Poster ๐ Platform ๐ง Toolkit ๐พ Dataset

SOTVerse: A User-defined Task Space of Single Object Tracking
Shiyu Hu, X. Zhao, K. Huang
International Journal of Computer Vision (CCF-A Journal)
๐ Visual Object Tracking ๐ Dynamic Open Environment Construction ๐ 3E Paradigm
๐ Paper ๐ bibTex ๐ PDF ๐ Platform

BioDrone: A Bionic Drone-based Single Object Tracking Benchmark for Robust Vision
X. Zhao, Shiyu Huโ๏ธ, Y. Wang, J. Zhang, Y. Hu, R. Liu, H. Lin, Y. Li, R. Li, K. Liu, J. Li
International Journal of Computer Vision (CCF-A Journal)
๐ Visual Object Tracking ๐ Drone-based Tracking ๐ Visual Robustness
๐ Paper ๐ Platform ๐ bibTex ๐ PDF ๐ง Toolkit ๐พ Dataset

A Multi-modal Global Instance Tracking Benchmark (MGIT): Better Locating Target in Complex Spatio-temporal and causal Relationship
Shiyu Hu, D. Zhang, M. Wu, X. Feng, X. Li, X. Zhao, K. Huang
the 37th Conference on Neural Information Processing Systems (CCF-A Conference, Poster)
๐ Visual Language Tracking ๐ Long Video Understanding and Reasoning ๐ Hierarchical Semantic Information Annotation
๐ Paper ๐ bibTex ๐ PDF ๐ชง Poster ๐น Slides ๐ Platform ๐ง Toolkit ๐พ Dataset

Visual Intelligence Evaluation Techniques for Single Object Tracking: A Survey (ๅ็ฎๆ ่ท่ธชไธญ็่ง่งๆบ่ฝ่ฏไผฐๆๆฏ็ปผ่ฟฐ)
Shiyu Hu, X. Zhao, K. Huang
Journal of Images and Graphics (ใไธญๅฝๅพ่ฑกๅพๅฝขๅญฆๆฅใ, CCF-B Chinese Journal)
๐ Visual Object Tracking ๐ Intelligent Evaluation Technique ๐ AI4Science
๐ Paper ๐ PDF


MemVLT: Vision-Language Tracking with Adaptive Memory-based Prompts
X. Feng, X. Li, Shiyu Hu, D. Zhang, M. Wu, J. Zhang, X. Chen, K. Huang
the 38th Conference on Neural Information Processing Systems (CCF-A Conference, Poster)
๐ Visual Language Tracking ๐ Human-like Memory Modeling ๐ Adaptive Prompts

Enhancing Vision-Language Tracking by Effectively Converting Textual Cues into Visual Cues
X. Feng, D. Zhang, Shiyu Hu, X. Li, M. Wu, J. Zhang, X. Chen, K. Huang
the 50th IEEE International Conference on Acoustics, Speech, and Signal Processing (CCF-B Conference, Poster)
๐ Visual Language Tracking ๐ Multi-modal Learning ๐ Grounding Model

Finger in Camera Speaks Everything: Unconstrained Air-Writing for Real-World
M. Wu, K. Huang, Y. Cai, Shiyu Hu, Y. Zhao, W. Wang
IEEE Transactions on Circuits and Systems for Video Technology (CCF-B Journal)
๐ Air-writing Technique ๐ Benchmark Construction ๐ Human-machine Interaction
๐ Paper ๐ bibTex ๐ PDF ๐ง Toolkit

Diverse Text Generation for Visual Language Tracking Based on LLM
X. Li, X. Feng, Shiyu Hu, M. Wu, D. Zhang, J. Zhang, K. Huang
the 3rd Workshop on Vision Datasets Understanding and DataCV Challenge in CVPR 2024 (Workshop in CCF-A Conference, Oral, Best Paper Honorable Mention)
๐ Visual Language Tracking ๐ Large Language Model ๐ Evaluation Technique
๐ Paper ๐ bibTex ๐ PDF ๐ชง Poster ๐น Slides ๐ Platform ๐ง Toolkit ๐พ Dataset ๐ Award

Robust Single-particle Cryo-EM Image Denoising and Restoration
J. Zhang, T. Zhao, Shiyu Hu, X. Zhao
the 49th IEEE International Conference on Acoustics, Speech, and Signal Processing (CCF-B Conference, Poster)
๐ Medical Image Processing ๐ AI4Science ๐ Diffusion Model
๐ Paper ๐ bibTex ๐ PDF

VS-LLM: Visual-Semantic Depression Assessment based on LLM for Drawing Projection Test
M. Wu, Y. Kang, X. Li, Shiyu Hu, X. Chen, Y. kang, W. Wang, K. Huang
the 7th Chinese Conference on Pattern Recognition and Computer Vision (CCF-C Conference)
๐ Psychological Assessment System ๐ Gamified Assessment ๐ AI4Science
๐ Paper ๐ bibTex ๐ PDF

A Review of Intelligent Psychological Assessment Based on Interactive Environment (ๅบไบไบคไบ็ฏๅข็ๆบ่ฝๅๅฟ็ๆต่ฏ)
K. Huang, Y. Kang, C. Yan, Shiyu Hu, L. Wang, T. Tao, W. Gao
Chinese Mental Health Journal (ใไธญๅฝๅฟ็ๅซ็ๆๅฟใ, CSSCI Journal, Top Psychological Journal in China)
๐ Psychological Assessment System ๐ Gamified Assessment ๐ AI4Science

A Hierarchical Theme Recognition Model for Sandplay Therapy
X. Feng, Shiyu Hu, X. Chen, K. Huang
the 6th Chinese Conference on Pattern Recognition and Computer Vision (CCF-C Conference, Poster)
๐ Psychological Assessment System ๐ Gamified Assessment ๐ AI4Science
๐ Paper ๐ bibTex ๐ PDF ๐ Supplementary ๐ชง Poster

Rethinking Similar Object Interference in Single Object Tracking
Y. Wang, Shiyu Hu, X. Zhao
the 7th International Conference on Computer Science and Artificial Intelligence (EI Conference, Oral)
๐ Visual Object Tracking ๐ Similar Object Interference ๐ Data Mining
๐ Paper ๐ bibTex ๐ PDF

Revisiting Instance Search: A New Benchmark Using Cycle Self-training
Y. Zhang, C. Liu, W. Chen, X. Xu, F. Wang, H. Li, Shiyu Hu, X. Zhao
Neurocomputing (CCF-C Journal)
๐ Video Instance Search ๐ Benchmark Construction ๐ Data Mining
๐ Paper ๐ bibTex ๐ PDF ๐ Project

Visual Turing: The Next Development of Computer Vision in The View of Human-computer Gaming (่ง่งๅพ็ต๏ผไปไบบๆบๅฏนๆ็่ฎก็ฎๆบ่ง่งไธไธๆญฅๅๅฑ)
K. Huang, X. Zhao, Q. Li, Shiyu Hu
Journal of Graphics (ใๅพๅญฆๅญฆๆฅใ, CCF-C Chinese Journal)
๐ Visual Object Tracking ๐ Intelligent Evaluation Technique ๐ AI4Science
๐ Paper ๐ bibTex ๐ PDF
Preprint

Can LVLMs Describe Videos like Humans? A Five-in-One Video Annotations Benchmark for Better Human-Machine Comparison
Shiyu Hu*, X. Li*, X. Li, J. Zhang, Y. Wang, X. Zhao, K. Cheong (*Equal Contributions)
Submitted to a CAAI-A conference, under review
๐ Large Vision-Language Models ๐ Evaluation Technique ๐ Visual Turing
๐ Paper ๐ bibTex ๐ PDF ๐ Project

Students Rather Than Experts: A New AI for Education Pipeline to Model More Human-like and Personalised Early Adolescences
Y. Ma*, Shiyu Hu*, X. Li, Y. Wang, S. Liu, K. Cheong (*Equal Contributions)
Submitted to a CAAI-A conference, under review
๐ AI4Education ๐ LLMs ๐ LLM-based Agent
๐ Paper ๐ bibTex ๐ PDF ๐ Project

DTVLT: A Multi-modal Diverse Text Benchmark for Visual Language Tracking Based on LLM
X. Li, Shiyu Hu, X. Feng, D. Zhang, M. Wu, J. Zhang, K. Huang
Submitted to a CAAI-A conference, under review
๐ Visual Language Tracking ๐ Large Language Model ๐ Evaluation Technique
๐ Paper ๐ bibTex ๐ PDF ๐ Project

Visual Language Tracking with Multi-modal Interaction: A Robust Benchmark
X. Li, Shiyu Hu, X. Feng, D. Zhang, M. Wu, J. Zhang, K. Huang
Submitted to a workshop in CCF-A conference, under review
๐ Visual Language Tracking ๐ Multi-modal Interaction ๐ Evaluation Technology
๐ Paper ๐ bibTex ๐ PDF ๐ Project