Text-Guided Interactive Scene Synthesis with Scene Prior Guidance

dc.contributor.author: Fang, Shaoheng [en_US]
dc.contributor.author: Yang, Haitao [en_US]
dc.contributor.author: Mooney, Raymond [en_US]
dc.contributor.author: Huang, Qixing [en_US]
dc.contributor.editor: Bousseau, Adrien [en_US]
dc.contributor.editor: Day, Angela [en_US]
dc.date.accessioned: 2025-05-09T09:12:43Z
dc.date.available: 2025-05-09T09:12:43Z
dc.date.issued: 2025
dc.description.abstract: 3D scene synthesis using natural language instructions has become a popular direction in computer graphics, with significant progress made by data-driven generative models recently. However, previous methods have mainly focused on one-time scene generation, lacking the interactive capability to generate, update, or correct scenes according to user instructions. To overcome this limitation, this paper focuses on text-guided interactive scene synthesis. First, we introduce the SceneMod dataset, which comprises 168k paired scenes with textual descriptions of the modifications. To support the interactive scene synthesis task, we propose a two-stage diffusion generative model that integrates scene-prior guidance into the denoising process to explicitly enforce physical constraints and foster more realistic scenes. Experimental results demonstrate that our approach outperforms baseline methods in text-guided scene synthesis tasks. Our system expands the scope of data-driven scene synthesis tasks and provides a novel, more flexible tool for users and designers in 3D scene generation. Code and dataset are available at https://github.com/bshfang/SceneMod. [en_US]
dc.description.number: 2
dc.description.sectionheaders: Shape It Til You Make It: Programs for 3D Synthesis
dc.description.seriesinformation: Computer Graphics Forum
dc.description.volume: 44
dc.identifier.doi: 10.1111/cgf.70039
dc.identifier.issn: 1467-8659
dc.identifier.pages: 12 pages
dc.identifier.uri: https://doi.org/10.1111/cgf.70039
dc.identifier.uri: https://diglib.eg.org/handle/10.1111/cgf70039
dc.publisher: The Eurographics Association and John Wiley & Sons Ltd. [en_US]
dc.rights: Attribution 4.0 International License
dc.rights.uri: https://creativecommons.org/licenses/by/4.0/
dc.subject: CCS Concepts: Computing methodologies → Computer graphics; Natural language processing; Computer systems organization → Neural networks
dc.subject: Computing methodologies → Computer graphics
dc.subject: Natural language processing
dc.subject: Computer systems organization → Neural networks
dc.title: Text-Guided Interactive Scene Synthesis with Scene Prior Guidance [en_US]
Files
Original bundle
Name: cgf70039.pdf
Size: 10.06 MB
Format: Adobe Portable Document Format
Name: paper1086_1.pdf
Size: 4.93 MB
Format: Adobe Portable Document Format