Chenyang Qi

I am a fourth-year Ph.D. student at the Hong Kong University of Science and Technology (HKUST), supervised by Prof. Qifeng Chen.

In 2020, I received my Bachelor's Degree in Automation, Zhejiang University with a National Scholarship from Chu Kochen Honors College .

From June 2022 to December 2022, I worked as a research intern at MSRA, supervised by Bo Zhang , Dong Chen, and Fang Wen.

After that, I was fortunate to work at Tencent AI Lab with Xiaodong Cun , Yong Zhang, Xintao Wang and Ying Shan on project FateZero about video diffusion model

I was a student researcher at Google Research mentored by Zhengzhong Tu, Mauricio Delbracio, Hossein Talebi.

Now I am working at Adobe with Taesung Park and Jimei Yang on video editing and generation.

我正在寻找2024年秋季入职的全职岗位, 欢迎邮件联系

I will graduate at 2024 Fall. Drop me an Email if your are recruting!

Email  /  CV  /  Google Scholar  /  Github

profile photo

  • News (July 2023): FateZero is accepted by ICCV 2023 as an Oral presentation.
  • News (July 2023): I gave a talk about FateZero at CVPR 2023 workshop.
  • News (Mar 2023): Excited to release the first zero-shot video editing framework FateZero using text-to-image diffusion models!
  • News (Feb 2023): Two papers are accepted by CVPR 2023! More details are coming.
  • News (June 2022): One paper is accepted to ACM Multimedia 2022.
  • News (Mar 2022): One paper is accepted to CVPR 2022.

Research

Currently, I am working on image / video generation and editing using AIGC techniques!
Hover your mouse over the image box below to view more results.
Left: input; Right: output

FateZero: Fusing Attentions for Zero-shot Text-based Video Editing
Chenyang Qi, Xiaodong Cun , Yong Zhang,
Chenyang Lei, Xintao Wang , Ying Shan, Qifeng Chen
ICCV Oral, 2023
arxiv / code / project page

Editing your video via pretrained Stable Diffusion model without training.
(e.g., Replace the jeep with a posche car; Add Van Gogh style to the sunflower)

Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts
Yue Ma*, Yingqing He*, Hongfa Wang, Andong Wang, Chenyang Qi, Chengfei Cai, Xiu Li, Zhifeng Li, Heung-Yeung Shum, Wei Liu, Qifeng Chen,
arXiv, 2024
Project page / arXiv / Github

TIP: Text-Driven Image Processing with Semantic and Restoration Instructions

Preprint, 2023
project page

AnimateZero: Video Diffusion Models are Zero-Shot Image Animators
Jiwen Yu, Xiaodong Cun , Chenyang Qi , Yong Zhang, Xintao Wang, Ying Shan, Jian Zhang

ArXiv, 2023
MagicStick🪄: Controllable Video Editing via Control Handle Transformations
Yue Ma, Xiaodong Cun, Yingqing He, Chenyang Qi, Xintao Wang, Ying Shan, Xiu Li , Qifeng Chen

ArXiv, 2023
Inserting Anybody in Diffusion Models via Celeb Basis
Ge Yuan, Xiaodong Cun, Yong Zhang, Maomao Li, Chenyang Qi, Xintao Wang, Ying Shan, Huicheng Zheng
NeurIPS, 2023
project page / code

Face identity customization in diffusion model

Real-time 6K Image Rescaling with Rate-distortion Optimization
Chenyang Qi,* Xin Yang*, Ka Leong Cheng, Ying-Cong Chen, Qifeng Chen
CVPR, 2023
arxiv / code

Image upscaling with learnable frequency-domain quantization to achieve 6K real-time speed and best rate-distortion.

MetaPortrait: Identity-Preserving Talking Head Generation with Fast Personalized Adaptation
Bowen Zhang*, Chenyang Qi*, Pan Zhang, Bo Zhang,
HsiangTao Wu, Dong Chen, Qifeng Chen, Yong Wang, Fang Wen
CVPR, 2023
arxiv / code / project page

Identity-preserving talking head generation utilizing dense landmarks and
spatial-temporal enhancement with GAN priors.
(e.g., Make Marilyn Monroe speak as the motions of another person in the driving video)

Real-time Streaming Video Denoising with Bidirectional Buffers
Chenyang Qi*, Junming Chen*, Xin Yang, Qifeng Chen
ACM Multimedia, 2022
arxiv / code / project page

An extremely efficient (700X speedup) buffer-based framework for online video denoising.

Shape from Polarization for Complex Scenes in the Wild
Chenyang Lei*, Chenyang Qi*, Jiaxin Xie*, Na Fan, Vladlen Koltun , Qifeng Chen
CVPR, 2022
arxiv / code / project page

Scene-level normal estimation from a single polarization image using physics-based priors.

Experience
Student Researcher, Google Research, Mountain View
July, 2023 - December, 2023
Research Intern, Tencent AI Lab, Shenzhen
Jan, 2023 - June, 2023
Research Intern, Microsoft Research Asia, Beijing
June, 2022 - December, 2022
Services
  • Reviewer: CVPR, ICCV, NeurIPS, ICLR, IJCV, ACM MM
Honors and Awards
  • National Scholarship, 2018 (Chu Kochen Honors College, Zhejiang University)

Thanks Dr. Jon Barron for sharing the source code of his personal page.