D-NPC: Dynamic Neural Point Clouds for Non-Rigid View Synthesis from Monocular Video

dc.contributor.author: Kappel, Moritz (en_US)
dc.contributor.author: Hahlbohm, Florian (en_US)
dc.contributor.author: Scholz, Timon (en_US)
dc.contributor.author: Castillo, Susana (en_US)
dc.contributor.author: Theobalt, Christian (en_US)
dc.contributor.author: Eisemann, Martin (en_US)
dc.contributor.author: Golyanik, Vladislav (en_US)
dc.contributor.author: Magnor, Marcus (en_US)
dc.contributor.editor: Bousseau, Adrien (en_US)
dc.contributor.editor: Day, Angela (en_US)
dc.date.accessioned: 2025-05-09T09:12:40Z
dc.date.available: 2025-05-09T09:12:40Z
dc.date.issued: 2025
dc.description.abstract: Dynamic reconstruction and spatiotemporal novel-view synthesis of non-rigidly deforming scenes have recently gained increased attention. While existing work achieves impressive quality and performance on multi-view or teleporting-camera setups, most methods fail to efficiently and faithfully recover motion and appearance from casual monocular captures. This paper contributes to the field by introducing a new method for dynamic novel-view synthesis from monocular video, such as casual smartphone captures. Our approach represents the scene as a dynamic neural point cloud, an implicit time-conditioned point distribution that encodes local geometry and appearance in separate hash-encoded neural feature grids for static and dynamic regions. By sampling a discrete point cloud from our model, we can efficiently render high-quality novel views using a fast differentiable rasterizer and a neural rendering network. Similar to recent work, we leverage advances in neural scene analysis by incorporating data-driven priors such as monocular depth estimation and object segmentation to resolve the motion and depth ambiguities inherent to monocular captures. Beyond guiding the optimization process, we show that these priors can be exploited to explicitly initialize our scene representation, drastically improving optimization speed and final image quality. As evidenced by our experimental evaluation, our dynamic point cloud model not only enables fast optimization and real-time frame rates for interactive applications, but also achieves competitive image quality on monocular benchmark sequences. Our code and data are available online at https://moritzkappel.github.io/projects/dnpc/. (en_US)
dc.description.number: 2
dc.description.sectionheaders: Fix it in Post: Image and Video Synthesis and Analysis
dc.description.seriesinformation: Computer Graphics Forum
dc.description.volume: 44
dc.identifier.doi: 10.1111/cgf.70038
dc.identifier.issn: 1467-8659
dc.identifier.pages: 13 pages
dc.identifier.uri: https://doi.org/10.1111/cgf.70038
dc.identifier.uri: https://diglib.eg.org/handle/10.1111/cgf70038
dc.publisher: The Eurographics Association and John Wiley & Sons Ltd. (en_US)
dc.rights: Attribution 4.0 International License
dc.rights.uri: https://creativecommons.org/licenses/by/4.0/
dc.subject: CCS Concepts: Computing methodologies → Image-based rendering; Point-based models; Reconstruction; Rasterization
dc.subject: Computing methodologies → Image-based rendering
dc.subject: Point-based models
dc.subject: Reconstruction
dc.subject: Rasterization
dc.title: D-NPC: Dynamic Neural Point Clouds for Non-Rigid View Synthesis from Monocular Video (en_US)
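
For readers skimming the record, the core idea in the abstract (separate hash-encoded neural feature fields for static and dynamic scene content, queried at a point position, plus time for the dynamic part, and decoded to per-point opacity and appearance ahead of differentiable rasterization) can be summarized in a short sketch. The following is a minimal, hypothetical PyTorch illustration, not the authors' released implementation (that is available via the project page above); plain MLPs stand in for the multiresolution hash encodings, and all class and variable names are illustrative assumptions.

    import torch
    import torch.nn as nn

    class DynamicNeuralPointCloud(nn.Module):
        """Toy time-conditioned point distribution with separate
        static and dynamic feature fields (hypothetical sketch)."""

        def __init__(self, feat_dim: int = 32):
            super().__init__()
            # Placeholders for the hash-encoded feature grids from the
            # abstract; a real implementation would use multiresolution
            # hash encodings rather than plain MLPs.
            self.static_grid = nn.Sequential(
                nn.Linear(3, 64), nn.ReLU(), nn.Linear(64, feat_dim))
            self.dynamic_grid = nn.Sequential(  # input: (x, y, z, t)
                nn.Linear(4, 64), nn.ReLU(), nn.Linear(64, feat_dim))
            # Small head decoding features to opacity (1) + RGB (3).
            self.decoder = nn.Sequential(
                nn.Linear(feat_dim, 64), nn.ReLU(), nn.Linear(64, 4))

        def forward(self, xyz, t, dynamic_mask):
            # xyz: (N, 3) sampled point positions
            # t: (N, 1) normalized timestamps
            # dynamic_mask: (N, 1), 1 = dynamic region, e.g. from an
            # object-segmentation prior as mentioned in the abstract
            f_static = self.static_grid(xyz)
            f_dynamic = self.dynamic_grid(torch.cat([xyz, t], dim=-1))
            feats = torch.where(dynamic_mask.bool(), f_dynamic, f_static)
            out = self.decoder(feats)
            opacity, rgb = out[..., :1].sigmoid(), out[..., 1:].sigmoid()
            return opacity, rgb

    # Example query of the implicit distribution:
    model = DynamicNeuralPointCloud()
    xyz = torch.rand(1024, 3)
    t = torch.full((1024, 1), 0.5)
    mask = (torch.rand(1024, 1) > 0.7).float()
    opacity, rgb = model(xyz, t, mask)

In the pipeline the abstract describes, a discrete point cloud sampled from such a distribution would then be splatted by a fast differentiable rasterizer and refined by a neural rendering network, while the data-driven priors (monocular depth estimation, object segmentation) supply depth supervision, the static/dynamic separation, and an explicit initialization of the representation.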
Files
Original bundle (showing 5 of 8)
Name: cgf70038.pdf | Size: 54.12 MB | Format: Adobe Portable Document Format
Name: comparisons.mp4 | Size: 45.42 MB | Format: Video MP4
Name: gui_live_capture.zip | Size: 54.19 MB | Format: ZIP archive
Name: paper1095_2.mp4 | Size: 122.17 MB | Format: Video MP4
Name: readme.txt | Size: 1.69 KB | Format: Plain Text