Publications

Papers by MIPG members (Starting 2021)

2026

[Conference]

Generative Neural Video Compression via Video Diffusion Prior
Qi Mao, Hao Cheng, Tinghan Yang, Libiao Jin, Siwei Ma
The IEEE/CVF Conference on Computer Vision and Pattern Recognition 2026 (CVPR 2026)
[Paper] [Code]

2025

[Journal]

StarVid: Enhancing Semantic Alignment in Video Diffusion Models via Spatial and SynTactic Guided Attention Refocusing
Yuanhang Li, Qi Mao, Lan Chen, Zhen Fang, Lei Tian, Xinyan Xiao, Libiao Jin, Hua Wu
IEEE Transactions on Multimedia (TMM)
[Paper] [Code]

[Conference]

UniVid: Unifying Vision Tasks with Pre-trained Video Generation Models
Lan Chen, Yuchao Gu, Qi Mao
The IEEE/CVF Winter Conference on Applications of Computer Vision 2026 (WACV 2026)
[Paper] [Code]

2024

[Conference]

Unifying Generation and Compression: Ultra-low bitrate Image Coding Via Multi-stage Transformer
Naifu Xue, Qi Mao, Zijian Wang, Yuan Zhang, Siwei Ma
IEEE International Conference on Multimedia and Expo (ICME)
[Paper] [Code]

Beyond Aligned Target Face: StyleGAN-Based Face-Swapping via Inverted Identity Learning
Yuanhang Li, Qi Mao, Libiao Jin
IEEE Internrnational Confeference on Multimedia andExpo Workrkshops (ICMEW)
[Paper] [Code]

MAG-Edit: Localized Image Editing in Complex Scenarios via Mask-Based Attention-Adjusted Guidance
Qi Mao, Lan Chen, Yuchao Gu, Zhen Fang, Mike Zheng Shou
The 32nd ACM International Conference on Multimedia (ACM MM)
[Paper] [Code]

Extreme Image Compression using Fine-tuned VQGANs
Qi Mao, Tinghan Yang, Yinuo Zhang, Zijian Wang, Meng Wang, Shiqi Wang, Siwei Ma
2024 Data Compression Conference (DCC)
[Paper] [Code]

2023

[Journal]

Enhancing Style-Guided Image-to-Image Translation via Self-Supervised Metric Learning
Qi Mao, Siwei Ma
IEEE Transactions on Multimedia (TMM)
[Paper] [Code]

2022

[Journal]

Continuous and Diverse Image-to-Image Translation via Signed Attribute Vectors
Qi Mao, Hung-Yu Tseng, Hsin-Ying Lee, Jia-Bin Huang, Siwei Ma, Ming-Hsuan Yang
Internrnational Journrnal of ComputerVision (IJCV)
[Paper] [Code]

2021

[Journal]

Scalable Face Image Coding via StyleGAN Prior: Towards Compression for Human-Machine Collaborative Vision
Qi Mao, Chongyu Wang, Meng Wang, Shiqi Wang, Ruijie Chen, Libiao Jin, Siwei Ma
IEEE Transcations on Image Processing (TIP)
[Paper] [Code]