🐦Chirpy3D: Continuous Part Latents for Creative 3D Bird Generation 🐦
- Kam Woh Ng1
- Jing Yang2
- Jia Wei Sii3
- Jiankang Deng4
- Chee Seng Chan3
- Yi-Zhe Song1
- Tao Xiang1
- Xiatian Zhu1
- University of Surrey1
- University of Cambridge2
- Universiti Malaya3
- Imperial College London4

Abstract
In this paper, we push the boundaries of fine-grained 3D generation into truly creative territory. Current methods either lack intricate details or simply mimic existing objects -- we enable both. By lifting 2D fine-grained understanding into 3D through multi-view diffusion and modeling part latents as continuous distributions, we unlock the ability to generate entirely new, yet plausible parts through interpolation and sampling. A self-supervised feature consistency loss further ensures stable generation of these unseen parts. The result is the first system capable of creating novel 3D objects with species-specific details that transcend existing examples. While we demonstrate our approach on birds, the underlying framework extends beyond things that can chirp!
Methodology

Overall architecture of our
Multiview Generation
Not only we can generate multiview images of existing classes, we can also generate hybrid version of them (randomly interpolated for each part).














The following multiview images are generated by randomly sampled part latents.















3D Birds
3D generation usingthreestudio
with random sampled part latents.
some examples from CUB200.
Citation
@misc{ng2024chirpy3d,
title={Chirpy3D: Continuous Part Latents for Creative 3D Bird Generation},
author={Kam Woh Ng and Jing Yang and Jia Wei Sii and Jian Kang Deng and Chee Seng Chan and Yi-Zhe Song and Tao Xiang and Xiatian Zhu},
year={2025},
eprint={2501.04144},
archivePrefix={arXiv},
primaryClass={cs.CV}
}