I am a second-year Ph.D. student with Visual Intelligence Lab at Nanyang Technological University (NTU), supervised by Prof. Shijian Lu. Prior to joining NTU, I obtained my B.S. degree in Computing Science from University of Alberta. I also work closely with Dr. Lidong Bing at MiroMind.ai and Dr. Song Bai when he was at ByteDance. My research broadly explores video-centric multimodal intelligence, spanning controllable generation, temporal reasoning, agentic tool use, and long-term memory.
Feel free to reach out to me for collaborations, questions, or just to chat!
🔥 Exciting News
- 2025.08 - One paper was accepted by EMNLP 2025.
- 2025.06 - Two papers were accepted by ICCV 2025.
- 2025.05 - Two papers were accepted by ACL 2025.
- 2025.04 - I joined MiroMind.ai as an AI Scientist Intern.
- 2023.11 - I joined ByteDance as an AI Research Intern.
- 2023.09 - One paper was accepted by NeurIPS 2023.
- 2023.01 - I joined Visual Intelligence Lab at NTU.
📝 Selected Publications (Full List)
Preprint 

ToDRE: Visual Token Pruning via Diversity and Task Awareness for Efficient Large Vision-Language Models
Duo Li*, Zuhao Yang*, Shijian Lu
Preprint 2025
paper / bibtex
Duo Li*, Zuhao Yang*, Shijian Lu
Preprint 2025
paper / bibtex
ICCV 

TimeExpert: An Expert-Guided Video LLM for Video Temporal Grounding
Zuhao Yang, Yingchen Yu, Yunqing Zhao, Shijian Lu, Song Bai
ICCV 2025
paper / bibtex / webpage
Zuhao Yang, Yingchen Yu, Yunqing Zhao, Shijian Lu, Song Bai
ICCV 2025
paper / bibtex / webpage
ICCV 

Versatile Transition Generation with Image-to-Video Diffusion
Zuhao Yang, Jiahui Zhang, Yingchen Yu, Shijian Lu, Song Bai
ICCV 2025
paper / bibtex / webpage
Zuhao Yang, Jiahui Zhang, Yingchen Yu, Shijian Lu, Song Bai
ICCV 2025
paper / bibtex / webpage
ACL 

QAEval: Mixture of Evaluators for Question‑Answering Task Evaluation
Tan Yue, Rui Mao, Xuzhao Shi, Shuo Zhan, Zuhao Yang, Dongyan Zhao
ACL 2025
paper / bibtex / code
Tan Yue, Rui Mao, Xuzhao Shi, Shuo Zhan, Zuhao Yang, Dongyan Zhao
ACL 2025
paper / bibtex / code
NeurIPS 

FACE: Evaluating Natural Language Generation with Fourier Analysis of Cross‑Entropy
Zuhao Yang*, Yingfang Yuan*, Yang Xu*, Shuo Zhan, Huajun Bai, Kefan Chen
NeurIPS 2023
paper / bibtex / code
Zuhao Yang*, Yingfang Yuan*, Yang Xu*, Shuo Zhan, Huajun Bai, Kefan Chen
NeurIPS 2023
paper / bibtex / code
📖 Educational Background
- 2024.01 - Present: Doctor of Philosophy, College of Computing and Data Science, Nanyang Technological University
- 2022.08 - 2024.01: Master in Artificial Intelligence, College of Computing and Data Science, Nanyang Technological University
- 2017.09 - 2021.06: Bachelor in Computing Science, Department of Computing Science, University of Alberta
🧑⚖️ Working Experiences
- 2025.04 - Present: AI Scientist Intern, Shanda AI Research Institute & MiroMind.ai, Singapore
- 2023.11 - 2025.03: AI Research Intern, ByteDance Inc. & TikTok, Singapore
- 2021.05 - 2022.06: NLP Algorithm Engineer, TMI Robotics Technology, Shanghai
💻 Academic Services
Conference Reviewer
- CVPR 24/25, ECCV 24, ACMMM 24, NeurIPS 24/25, ICLR 25, AISTATS 25/26, ICML 25, ICCV 25
Journal Reviewer
- IEEE TPAMI, Pattern Recognition, Journal of Electronic Imaging
Workshop PC Member
- SyntaGen: Harnessing Generative Models for Synthetic Visual Datasets (CVPR 24/25)
- Neural Rendering Intelligence (CVPR 24)
- Agents That Help or Hinder? Rethinking Agentic AI in Real-World Workflows (AAAI 26)
Teaching Assistant
- AI6121 - Computer Vision, NTU, 2025 Fall
🏆 Patent & Awards
- Method, Device, and Medium for Video Temporal Grounding with Mixture-of-Experts, US Patent, 2025
- Method, Device, and Medium for Generating Transition Videos with Diffusion Model, SG Patent, 2024
- Method, Device, and Medium for Automatic Question-Answering, CN Patent, 2022
- Outstanding Graduate, University of Alberta, 2021
- Dean’s Honor Roll Award, University of Alberta, 2018 - 2020
- International Student Scholarship, University of Alberta, 2017 - 2019