Harold Haodong Chen

haroldchen328 [at] gmail.com | WeChat: haroldchen04

Biography

I am a PhD student at The Hong Kong University of Science and Technology, supervised by Prof. Ying-Cong Chen and Prof. Qifeng Chen.

We are always looking for self-motivated undergraduate, master's, and MPhil students for research projects in Video/Image Generation and LLM/MLLM Reasoning. Feel free to contact me via email or WeChat if you are interested in working with me.

Selected Preprints

DVD: Deterministic Video Depth Estimation with Generative Priors [arXiv]
Show, Don't Tell: Morphing Latent Reasoning into Image Generation [arXiv]
Go with Your Gut: Scaling Confidence for Autoregressive Image Generation [arXiv]

Selected Publications

TiViBench: Benchmarking Think-in-Video Reasoning for Video Generative Models CVPR 2026 [Paper]
Hierarchical Fine-grained Preference Optimization for Physically Plausible Video Generation NeurIPS 2025 [Paper]
FineQuest: Adaptive Knowledge-Assisted Sports Video Understanding via Agent-of-Thoughts Reasoning MM 2025 (Oral) [Paper]
VistaDPO: Video Hierarchical Spatial-Temporal Direct Preference Optimization for Large Video Models ICML 2025 [Paper]
SeFAR: Semi-supervised Fine-grained Action Recognition with Temporal Perturbation and Learning Stabilization AAAI 2025 [Paper]
FineCLIPER: Multi-modal Fine-grained CLIP for Dynamic Facial Expression Recognition with AdaptERs MM 2024 [Paper]

Services