Fig. 1: Overview of GVGEN. Our framework comprises two stages. In the data pre-processing phase, we fit GaussianVolumes and extract coarse geometry Gaussian Distance Field (GDF) as training data. For the generation stage, we first generate GDF via a diffusion model, and then send it into a 3D U-Net to predict attributes of GaussianVolumes.
Visual comparisons among different GaussianVolume resolution settings and original 3D Gaussian Splatting (3DGS).
@article{he2024gvgen,
title={GVGEN: Text-to-3D Generation with Volumetric Representation},
author={He, Xianglong and Chen, Junyi and Peng, Sida and Huang, Di and Li, Yangguang and Huang, Xiaoshui and Yuan, Chun and Ouyang, Wanli and He, Tong},
journal={arXiv preprint arXiv:2403.12957},
year={2024}
}