Qihang Rao (饶启杭)

About me

I am a senior undergraduate student in the i-Vision Group at the Department of Automation, Tsinghua University, under the guidance of Professor Jiwen Lu.

Research

My research interests span across computer vision and deep learning theory. Currently, I am focusing on the intersection of these two fields, particularly in the area of VQ-VAEs and diffusion models.

Interest

Outside of research, I have a passion for sports and am particularly skilled in track and field as well as basketball. I also enjoy traveling, embracing the unexpected encounters along the journey, and capturing these moments with my camera.

Email  /  CV  /  Github

profile photo

Preprints

* indicates equal contribution
Quantize-then-Rectify: Efficient VQ-VAE Training
Borui Zhang, Qihang Rao*, Wenzhao Zheng, Jie Zhou, Jiwen Lu
arXiv, 2025
[Paper] / [Code] / [Project Page] / [HF Demo] / [中文解读]

This study presents ReVQ, a framework that enables efficient VQ-VAE training by leveraging pre-trained VAEs, compressing ImageNet images into 512 tokens while maintaining competitive reconstruction quality (rFID=1.06). ReVQ reduces training costs by over two orders of magnitude, completing full training on a single NVIDIA 4090 within 22 hours.

Honors and Awards

  • 2024 National Scholarship (highest scholarship given by the government of China)
  • 2023 Tsinghua University Comprehensive Excellence Scholarship (scholarship for juniors in Tsinghua)

© Qihang Rao | Last updated: Jul. 14, 2025

Website template from Jon Barron