• Homepage
  • Publications
  • Blogs
  • Talks
  • Publications
  • Blogs
  • Talks
Zuhao Yang (杨祖豪)

Zuhao Yang (杨祖豪)

Ph.D. Student at Nanyang Technological University

  • Singapore
  • Email
  • Twitter
  • LinkedIn
  • Github
  • Google Scholar

  • LongVT: Incentivizing “Thinking with Long Videos” via Native Tool Calling
    Zuhao Yang*, Sudong Wang*, Kaichen Zhang*, Keming Wu, Sicong Leng, Yifan Zhang, Bo Li, Chengwei Qin, Shijian Lu, Xingxuan Li, Lidong Bing
    Preprint 2025
    [paper] [bibtex] [code]

  • OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe
    Kaichen Zhang*, Keming Wu*, Zuhao Yang, Bo Li, Kairui Hu, Bin Wang, Ziwei Liu, Xingxuan Li, Lidong Bing
    Preprint 2025
    [paper] [bibtex] [code]

  • A Comprehensive Study on Visual Token Redundancy for Discrete Diffusion-based Multimodal Large Language Models
    Duo Li*, Zuhao Yang*, Xiaoqin Zhang, Ling Shao, Shijian Lu
    Preprint 2025
    [paper] [bibtex]

  • ToDRE: Effective Visual Token Pruning via Token Diversity and Task Relevance
    Duo Li*, Zuhao Yang*, Xiaoqin Zhang, Ling Shao, Shijian Lu
    Preprint 2025
    [paper] [bibtex]

  • SVAgent: Storyline-guided Long Video Understanding via Cross-modal Multi-agent Collaboration
    Zhongyu Yang, Zuhao Yang, Shuo Zhan, Tan Yue, Wei Pang, Yingfang Yuan
    Under Review

  • ReChar: Revitalising Characters with Structure-Preserved and User-Specified Aesthetic Enhancements
    Zhongyu Yang, Junhao Song, Zhang Luo, Zuhao Yang, Yang Xu, Jingfen Lan, Yonghan Zhang, Wei Pang, Siyang Song, Yingfang Yuan
    SIGGRAPH Asia 2025
    [paper] [bibtex] [code]

  • Evaluating Text Generation Quality Using Spectral Distances of Surprisal
    Zhichen Liu, Yongyuan Li, Yang Xu, Yu Wang, Yingfang Yuan, Zuhao Yang
    EMNLP 2025
    [paper] [bibtex] [code]

  • TimeExpert: An Expert-Guided Video LLM for Video Temporal Grounding
    Zuhao Yang, Yingchen Yu, Yunqing Zhao, Shijian Lu, Song Bai
    ICCV 2025
    [paper] [bibtex] [webpage]

  • Versatile Transition Generation with Image-to-Video Diffusion
    Zuhao Yang, Jiahui Zhang, Yingchen Yu, Shijian Lu, Song Bai
    ICCV 2025
    [paper] [bibtex] [webpage]

  • QAEval: Mixture of Evaluators for Question-Answering Task Evaluation
    Tan Yue, Rui Mao, Xuzhao Shi, Shuo Zhan, Zuhao Yang, Dongyan Zhao
    ACL 2025
    [paper] [bibtex] [code]

  • AI-Generated Images as Data Sources: The Dawn of Synthetic Era
    Zuhao Yang, Fangneng Zhan, Kunhao Liu, Muyu Xu, Xiaoqin Zhang, Ling Shao, Shijian Lu
    Preprint 2023
    [paper] [bibtex] [webpage]

  • FACE: Evaluating Natural Language Generation with Fourier Analysis of Cross-Entropy
    Zuhao Yang*, Yingfang Yuan*, Yang Xu*, Shuo Zhan, Huajun Bai, Kefan Chen
    NeurIPS 2023
    [paper] [bibtex] [code]

  • PaCaNet: A Study on CycleGAN with Transfer Learning for Diversifying Fused Chinese Painting and Calligraphy
    Zuhao Yang*, Huajun Bai*, Zhang Luo, Yang Xu, Wei Pang, Yue Wang, Yisheng Yuan, Yeqi Hu, Yingfang Yuan
    Preprint 2023
    [paper] [bibtex]

Sitemap
  • Follow:
  • GitHub
  • Feed
© 2026 Zuhao Yang. Powered by Jekyll & AcademicPages, a fork of Minimal Mistakes.