
Biography
I'm a PhD student at The Hong Kong University of Science and Technology (HKUST). I received my B.E. degree, with the Outstanding Graduate Award, from Northwestern Polytechnical University (NPU) in 2025, where I worked closely with Prof. Dian Shao. I'm fortunate to work closely with Prof. Ying-Cong Chen, Prof. Harry Yang, Prof. Qifeng Chen, and Prof. Ser-Nam Lim.
I am always open to all forms of research collaboration. Feel free to contact me if you are interested in working with me! My research interests include Video Generation & Understanding, Agentic Systems, and Embodied AI.
News
Experience
Visiting Student | HKUST
Research Intern | Everlyn AI
Research Intern | HKGAI
Selected Preprints
Temporal Regularization Makes Your Video Generator Stronger
LightGen: Efficient Image Generation through Knowledge Distillation and Direct Preference Optimization
GaussianVTON: 3D Human Virtual Try-ON via Multi-Stage Gaussian Splatting Editing with Image Prompting
Selected Publications
Hierarchical Fine-grained Preference Optimization for Physically Plausible Video Generation
FineQuest: Adaptive Knowledge-Assisted Sports Video Understanding via Agent-of-Thoughts Reasoning
VistaDPO: Video Hierarchical Spatial-Temporal Direct Preference Optimization for Large Video Models
FinePhys: Fine-grained Human Action Generation by Explicitly Incorporating Physical Laws for Effective Skeletal Guidance
SeFAR: Semi-supervised Fine-grained Action Recognition with Temporal Perturbation and Learning Stabilization
FineCLIPER: Multi-modal Fine-grained CLIP for Dynamic Facial Expression Recognition with AdaptERs
UrbanCLIP: Learning Text-enhanced Urban Region Profiling with Contrastive Language-Image Pretraining from the Web
Awards and Honors
2025 | |
2025 | |
2025 | |
2023-2024 | |
2023-2024 | |
2023 |
Services
European Conference on Computer Vision (ECCV), 2024
ACM International Conference on Multimedia (MM), 2024
Neural Information Processing Systems (NeurIPS), 2024-2025
International Conference on Machine Learning (ICML), 2025
International Conference on Computer Vision (ICCV), 2025
International Conference on Learning Representations (ICLR), 2025-2026
Computer Vision and Pattern Recognition (CVPR), 2025-2026
Artificial Intelligence and Statistics (AISTATS), 2025-2026
AAAI Conference on Artificial Intelligence (AAAI), 2026
Pattern Recognition, ACM TOMM