Label Name is Mantra: Unifying Point Cloud Segmentation across Heterogeneous Datasets
dc.contributor.author | Liang, Yixun | en_US |
dc.contributor.author | He, Hao | en_US |
dc.contributor.author | Xiao, Shishi | en_US |
dc.contributor.author | Lu, Hao | en_US |
dc.contributor.author | Chen, Yingcong | en_US |
dc.contributor.editor | Chen, Renjie | en_US |
dc.contributor.editor | Ritschel, Tobias | en_US |
dc.contributor.editor | Whiting, Emily | en_US |
dc.date.accessioned | 2024-10-13T18:03:15Z | |
dc.date.available | 2024-10-13T18:03:15Z | |
dc.date.issued | 2024 | |
dc.description.abstract | Point cloud segmentation is a fundamental task in 3D vision that serves a wide range of applications. Despite recent advancements, its practical usability is still limited by the availability of training data. The prevalent methodologies cannot optimally exploit multiple datasets due to the inconsistency of labels across datasets. In this work, we introduce a robust method that accommodates learning from diverse datasets with variant label sets. We leverage a pre-trained language model to map discrete labels into a continuous latent space using their semantic names. This harmonizes labels across datasets, facilitating concurrent training. Contrarily, when classifying points within the continuous 3D space via their linguistic tokens, our model exhibits superior generalizability compared to extant methods with fixed decoder structures. Further, our approach assimilates prompt learning to alleviate data shifts across sources. Comprehensive evaluations attest that our model markedly surpasses current benchmarks. | en_US |
dc.description.sectionheaders | Point Cloud Processing and Analysis I | |
dc.description.seriesinformation | Pacific Graphics Conference Papers and Posters | |
dc.identifier.doi | 10.2312/pg.20241277 | |
dc.identifier.isbn | 978-3-03868-250-9 | |
dc.identifier.pages | 9 pages | |
dc.identifier.uri | https://doi.org/10.2312/pg.20241277 | |
dc.identifier.uri | https://diglib.eg.org/handle/10.2312/pg20241277 | |
dc.publisher | The Eurographics Association | en_US |
dc.rights | Attribution 4.0 International License | |
dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ | |
dc.subject | CCS Concepts: Computing methodologies → Scene understanding; 3D imaging | |
dc.subject | Computing methodologies → Scene understanding | |
dc.subject | 3D imaging | |
dc.title | Label Name is Mantra: Unifying Point Cloud Segmentation across Heterogeneous Datasets | en_US |