4-LEGS: 4D Language Embedded Gaussian Splatting

dc.contributor.authorFiebelman, Galen_US
dc.contributor.authorCohen, Tamiren_US
dc.contributor.authorMorgenstern, Ayelleten_US
dc.contributor.authorHedman, Peteren_US
dc.contributor.authorAverbuch-Elor, Hadaren_US
dc.contributor.editorBousseau, Adrienen_US
dc.contributor.editorDay, Angelaen_US
dc.date.accessioned2025-05-09T09:16:41Z
dc.date.available2025-05-09T09:16:41Z
dc.date.issued2025
dc.description.abstractThe emergence of neural representations has revolutionized our means for digitally viewing a wide range of 3D scenes, enabling the synthesis of photorealistic images rendered from novel views. Recently, several techniques have been proposed for connecting these low-level representations with the high-level semantics understanding embodied within the scene. These methods elevate the rich semantic understanding from 2D imagery to 3D representations, distilling high-dimensional spatial features onto 3D space. In our work, we are interested in connecting language with a dynamic modeling of the world. We show how to lift spatio-temporal features to a 4D representation based on 3D Gaussian Splatting. This enables an interactive interface where the user can spatiotemporally localize events in the video from text prompts. We demonstrate our system on public 3D video datasets of people and animals performing various actions.en_US
dc.description.number2
dc.description.sectionheadersSplat-tacular Radiance Fields
dc.description.seriesinformationComputer Graphics Forum
dc.description.volume44
dc.identifier.doi10.1111/cgf.70085
dc.identifier.issn1467-8659
dc.identifier.pages13 pages
dc.identifier.urihttps://doi.org/10.1111/cgf.70085
dc.identifier.urihttps://diglib.eg.org/handle/10.1111/cgf70085
dc.publisherThe Eurographics Association and John Wiley & Sons Ltd.en_US
dc.rightsAttribution 4.0 International License
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/
dc.subjectCCS Concepts: Computing methodologies → 3D imaging; Rendering; Activity recognition and understanding
dc.subjectComputing methodologies → 3D imaging
dc.subjectRendering
dc.subjectActivity recognition and understanding
dc.title4-LEGS: 4D Language Embedded Gaussian Splattingen_US
Files
Original bundle
Now showing 1 - 2 of 2
Loading...
Thumbnail Image
Name:
cgf70085.pdf
Size:
22.19 MB
Format:
Adobe Portable Document Format
Loading...
Thumbnail Image
Name:
paper1017_1.pdf
Size:
23.63 MB
Format:
Adobe Portable Document Format