I am a Ph.D. student in Electrical Engineering at The University of Texas at Austin, advised by Prof. Zhangyang Wang in the VITA group. Previously, I was a senior algorithm engineer at Alibaba Cloud, where I worked with Prof. Ping Tan and Siyu Zhu.
CV / Email / Google Scholar
Synthesizing photo-realistic images has long been one of the most essential goals in computer vision.
NeRF Augmentations: NeRF often yields inconsistent and visually non-smooth geometric results due to the generalization gap between seen and unseen views. We propose to blend worst-case perturbations, injected at three distinct levels of the NeRF training pipeline with physical grounding. In our Aug-NeRF paper, we show that this effectively boosts NeRF's accuracy in both novel view synthesis (up to 1.5 dB PSNR gain) and underlying geometry reconstruction.
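The inner worst-case search behind such augmentations can be approximated with a few steps of projected gradient ascent. Below is a minimal, framework-agnostic sketch; the `worst_case_perturbation` helper and the quadratic toy loss are illustrative assumptions, not our actual training code:

```python
import numpy as np

def worst_case_perturbation(x, grad_fn, eps=0.1, steps=5, alpha=0.04):
    """PGD-style inner maximization: find a delta within an L-inf ball of
    radius eps that (approximately) maximizes the loss at x + delta."""
    delta = np.zeros_like(x)
    for _ in range(steps):
        g = grad_fn(x + delta)               # gradient of the loss w.r.t. input
        delta = delta + alpha * np.sign(g)   # ascend the loss
        delta = np.clip(delta, -eps, eps)    # project back into the eps-ball
    return delta

# Toy example: loss(x) = ||x - t||^2, so grad = 2 * (x - t)
t = np.array([1.0, -1.0])
grad_fn = lambda x: 2.0 * (x - t)
x = np.array([0.0, 0.0])
delta = worst_case_perturbation(x, grad_fn, eps=0.1)
# delta pushes x away from t, i.e. in the loss-increasing direction
```

In training, the model is then optimized on the perturbed inputs, which smooths the learned field around the observed samples.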
Single-view NeRF: NeRF is impeded by its stringent requirement of dense views captured from multiple well-calibrated cameras, whereas it can be challenging or even infeasible to collect sufficiently dense coverage of a scene. In our SinNeRF paper, we push the sparse-view setting to the extreme by training a neural radiance field on only one view with depth information. By generating pseudo labels from the available single view, the learned radiance field produces more satisfying synthesized results on novel views.
NeRF Stylization: In addition to improving synthesized image quality, in our INS paper we conduct a pilot study on training stylized implicit representations (e.g., SIREN, NeRF, SDF). We propose to decouple the ordinary implicit function into a style implicit module and a content implicit module, which separately encode the representations from the style image and the input scene. An amalgamation module then aggregates this information and synthesizes the stylized output. Consequently, we can synthesize faithful stylizations for SIREN, NeRF, and SDF. In addition, we can interpolate between different styles and generate images with new, mixed styles.
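As a toy illustration of this decoupled design, the sketch below uses random-weight NumPy MLPs standing in for trained implicit modules; all module names and sizes here are hypothetical, chosen only to show the content/style/amalgamation data flow:

```python
import numpy as np

rng = np.random.default_rng(0)

def mlp(sizes):
    """Random-weight MLP (a toy stand-in for a trained implicit module)."""
    return [(rng.standard_normal((m, n)) * 0.1, np.zeros(n))
            for m, n in zip(sizes[:-1], sizes[1:])]

def forward(layers, x):
    for i, (W, b) in enumerate(layers):
        x = x @ W + b
        if i < len(layers) - 1:
            x = np.maximum(x, 0.0)   # ReLU on hidden layers
    return x

content_net = mlp([3, 64, 64])    # scene coordinates -> content features
style_net = mlp([16, 64, 64])     # style code -> style features
amalgam_net = mlp([128, 64, 3])   # fused features -> RGB

coords = rng.standard_normal((1024, 3))   # sampled 3D query points
style_code = rng.standard_normal((16,))   # e.g., derived from a style image

c_feat = forward(content_net, coords)                           # (1024, 64)
s_feat = np.broadcast_to(forward(style_net, style_code), c_feat.shape)
rgb = forward(amalgam_net, np.concatenate([c_feat, s_feat], axis=1))
# rgb: one stylized color per queried point, shape (1024, 3)
```

Because the style code enters only through the style module, swapping or interpolating style codes changes the stylization without retraining the content module.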
CAD symbol spotting can be used in the architecture, engineering, and construction (AEC) industries to accelerate 3D modeling from CAD drawings.
We release the first large-scale real-world dataset of over 15,000 CAD drawings with line-grained annotations (35 classes), covering various types of buildings.
We introduce the new task of Panoptic Symbol Spotting, a relaxation of the traditional symbol spotting problem: it spots and parses both countable object instances (windows, doors, tables, etc.) and uncountable stuff (walls, railings, etc.) from CAD drawings.
Moreover, we adopt Panoptic Quality (PQ) as the evaluation metric for panoptic symbol spotting results.
To tackle the newly proposed problem, we first present a CNN-GCN method in our ICCV 2021 paper, which unifies a GCN head and a detection head for semantic and instance symbol spotting, respectively.
More recently, in our CVPR 2022 paper we present a transformer-based framework named CADTransformer, which painlessly adapts existing vision transformer (ViT) backbones to the panoptic symbol spotting task. It boosts PQ from 0.595 (the CNN-GCN baseline) to a new state of the art of 0.685.
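For reference, PQ combines segmentation quality (the average IoU of matched pairs) and recognition quality (an F1-style detection score). A minimal sketch of the standard formula follows; the `panoptic_quality` helper and the toy numbers are illustrative, not our evaluation code:

```python
def panoptic_quality(match_ious, num_pred, num_gt, iou_thresh=0.5):
    """match_ious: IoU values of candidate (prediction, ground-truth) pairs,
    at most one candidate per prediction and per ground truth. A pair is a
    true positive when IoU > iou_thresh (0.5 guarantees a unique matching)."""
    tp_ious = [iou for iou in match_ious if iou > iou_thresh]
    tp = len(tp_ious)
    fp = num_pred - tp                    # unmatched predictions
    fn = num_gt - tp                      # unmatched ground-truth symbols
    if tp + fp + fn == 0:
        return 0.0
    sq = sum(tp_ious) / tp if tp else 0.0  # segmentation quality
    rq = tp / (tp + 0.5 * fp + 0.5 * fn)   # recognition quality
    return sq * rq                         # PQ = SQ * RQ

# Toy example: 3 predictions, 3 ground truths, two confident matches
pq = panoptic_quality([0.9, 0.8, 0.3], num_pred=3, num_gt=3)
```

Here SQ = (0.9 + 0.8) / 2 = 0.85 and RQ = 2 / (2 + 0.5 + 0.5) = 2/3, so PQ ≈ 0.567.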
To tackle the high computational cost of existing cost-volume-based deep MVS and stereo matching methods, we propose a memory- and run-time-efficient cost volume formulation built upon a standard feature pyramid, encoding geometry and context at gradually finer scales. Within our new design, we narrow the depth (or disparity) range of each stage using the depth (or disparity) map predicted at the previous stage, recovering the output in a coarse-to-fine manner.
By applying the cascade cost volume to the representative MVSNet, we obtain a 23.1% improvement on the DTU dataset, with 50.6% and 74.2% reductions in GPU memory and run time, respectively. It also ranks first among all learning-based methods on the Tanks and Temples benchmark.
In addition, we adapt GwcNet with our proposed cost volume design, and its accuracy ranking rises from 29th to 17th with a 37.0% memory reduction on the KITTI 2015 test set. See our CVPR 2020 paper for more details.
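The per-stage range narrowing can be sketched as follows. This is a simplified illustration: the function name, hypothesis count, and interval are assumptions, and the real cascade also shrinks the interval at each finer stage:

```python
import numpy as np

def next_stage_hypotheses(depth_prev, num_hyp, interval):
    """Build a narrowed per-pixel set of depth (or disparity) hypotheses
    centered on the map predicted by the previous, coarser stage."""
    offsets = (np.arange(num_hyp) - num_hyp / 2) * interval  # symmetric offsets
    # depth_prev: (H, W)  ->  hypotheses: (num_hyp, H, W)
    return depth_prev[None, :, :] + offsets[:, None, None]

# Stage 1 sweeps a wide range; stage 2 samples a narrow band around its output
coarse_depth = np.full((4, 4), 5.0)   # toy coarse-stage prediction
hyps = next_stage_hypotheses(coarse_depth, num_hyp=8, interval=0.1)
# hyps covers only [4.6, 5.3] per pixel instead of the full depth range
```

Because later stages warp features against far fewer, tightly spaced hypotheses, the cost volume shrinks accordingly, which is the source of the memory and run-time savings.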
Before 2019, I also worked on low-level computer vision tasks (e.g., compressed sensing MRI and single-image deraining) using deep neural networks. See IPMI 2019, ACM MM 2019, ECCV 2018, AAAI 2018, TIP 2019, and two MRI 2019 papers for details.
I'm interested in developing neural radiance fields, efficient 3D models, graph neural networks for vector graphics and 3D data, and low-level computer vision.
Research Intern
05/2022--08/2022
Senior Algorithm Engineer
07/2019--08/2021