Zhengyi Wang (王征翊)
I'm Zhengyi Wang, a PhD student in machine learning at Tsinghua University since 2021. I'm advised by Prof. Jun Zhu and Prof. Hang Su. I also work closely with Prof. Chongxuan Li.
Previously, I interned at Qwen, where I was a core contributor to Qwen-Image, a large-scale multimodal generative model for image generation. I also interned at NVIDIA, working with Sanja Fidler. I graduated from Tsinghua University in 2021 with a bachelor's degree.
My research interests focus on multi-modal generative models for images, 3D and robotics.
Email / Google Scholar / Twitter / GitHub
Qwen-Image Technical Report
Qwen Team (core contributor)
arXiv
project page / arXiv / Code
ShapeLLM-Omni: A Native Multimodal LLM for 3D Generation and Understanding
Junliang Ye*, Zhengyi Wang*, Ruowen Zhao*, Shenghao Xie, Jun Zhu
NeurIPS, 2025 (Spotlight)
project page / arXiv / Code
DeepMesh: Auto-Regressive Artist-mesh Creation with Reinforcement Learning
Ruowen Zhao*, Junliang Ye*, Zhengyi Wang*, Guangce Liu, Yiwen Chen, Yikai Wang, Jun Zhu
ICCV, 2025
project page / arXiv / Code
LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models
Zhengyi Wang, Jonathan Lorraine, Yikai Wang, Hang Su, Jun Zhu, Sanja Fidler, Xiaohui Zeng
arXiv
project page / arXiv / Code / Model / Online Demo / Blender Addon
RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation
Songming Liu*, Lingxuan Wu*, Bangguo Li, Hengkai Tan, Huayu Chen, Zhengyi Wang, Ke Xu, Hang Su, Jun Zhu
ICLR, 2025
project page / arXiv / Code / Model / Data
CRM: Single Image to 3D Textured Mesh with Convolutional Reconstruction Model
Zhengyi Wang, Yikai Wang, Yifei Chen, Chendong Xiang, Shuo Chen, Dajiang Yu, Chongxuan Li, Hang Su, Jun Zhu
ECCV, 2024
project page / arXiv / Code / Pretrained Models
V3D: Video Diffusion Models are Effective 3D Generators
Zilong Chen, Yikai Wang, Feng Wang, Zhengyi Wang, Huaping Liu
TPAMI
project page / arXiv / Code
DreamReward: Aligning Human Preference in Text-to-3D Generation
Junliang Ye, Fangfu Liu, Qixiu Li, Zhengyi Wang, Yikai Wang, Xinzhou Wang, Yueqi Duan, Jun Zhu
ECCV, 2024
project page / arXiv / Code
MicroDreamer: Zero-shot 3D Generation in ∼20 Seconds by Score-based Iterative Reconstruction
Luxi Chen*, Zhengyi Wang*, Zihan Zhou, Tingting Gao, Hang Su, Jun Zhu, Chongxuan Li
TPAMI
arXiv / Code
Score Regularized Policy Optimization through Diffusion Behavior
Huayu Chen, Cheng Lu, Zhengyi Wang, Hang Su, Jun Zhu
ICLR, 2024
arXiv / Code / Poster
ProlificDreamer: High-Fidelity and Diverse Text-to-3D Generation with Variational Score Distillation
Zhengyi Wang*, Cheng Lu*, Yikai Wang, Fan Bao, Chongxuan Li, Hang Su, Jun Zhu
NeurIPS, 2023 (Spotlight)
project page / Slides / arXiv / Code
GNOT: A General Neural Operator Transformer for Operator Learning
Zhongkai Hao, Zhengyi Wang, Hang Su, Chengyang Ying, Yinpeng Dong, Songming Liu, Ze Cheng, Jian Song, Jun Zhu
ICML, 2023
arXiv / GitHub
Cluster Attack: Query-based Adversarial Attacks on Graphs with Graph-Dependent Priors
Zhengyi Wang, Zhongkai Hao, Ziqiao Wang, Hang Su, Jun Zhu
IJCAI, 2022 (Long Oral, acceptance rate ~3.8%)
arXiv / GitHub
Services
Reviewer: NeurIPS 2023/2024/2025, ICLR 2024/2025, CVPR 2024/2025, ICML 2024, ECCV 2024, ICCV 2024, SIGGRAPH Asia 2024, AAAI 2025