PVP-SSD: Point-Voxel Fusion with Partitioned Point Cloud Sampling for Anchor-Free Single-Stage Small 3D Object Detection
dc.contributor.author | Wu, Xinlin | en_US |
dc.contributor.author | Tian, Yibin | en_US |
dc.contributor.author | Pan, Yin | en_US |
dc.contributor.author | Zhang, Zhiyuan | en_US |
dc.contributor.author | Wu, Xuesong | en_US |
dc.contributor.author | Wang, Ruisheng | en_US |
dc.contributor.author | Zeng, Zhi | en_US |
dc.contributor.editor | Chen, Renjie | en_US |
dc.contributor.editor | Ritschel, Tobias | en_US |
dc.contributor.editor | Whiting, Emily | en_US |
dc.date.accessioned | 2024-10-13T18:03:25Z | |
dc.date.available | 2024-10-13T18:03:25Z | |
dc.date.issued | 2024 | |
dc.description.abstract | Single-stage object detection from 3D point clouds in autonomous driving faces significant challenges, particularly in accurately detecting small objects. To address this issue, we propose a novel method called Point-Voxel dual-branch feature extraction with Partitioned point cloud sampling for anchor-free Single-Stage Detection of 3D objects (PVP-SSD). The network comprises two branches: a point branch and a voxel branch. In the point branch, a partitioned point cloud sampling strategy leverages axial features to divide the point cloud. Then, it assigns different sampling weights to various segments to enhance the sampling accuracy. Additionally, a local feature enhancement module explicitly calculates the correlation between key points and query points, improving the extraction of local features. In the voxel branch, we use 3D sparse convolution to extract instance structural features efficiently. The point-voxel dual-branch fusion dynamically integrates instance features extracted from both branches using a self-attention mechanism, which contains not only the category information of the detected object but also the spatial dimensions and heading angle. Consequently, PVP-SSD achieves a certain balance between preserving detailed information and maintaining structural integrity. Experimental results on the KITTI and ONCE datasets demonstrate that PVP-SSD excels in multi-category small 3D object detection. | en_US |
dc.description.sectionheaders | Point Cloud Processing and Analysis I | |
dc.description.seriesinformation | Pacific Graphics Conference Papers and Posters | |
dc.identifier.doi | 10.2312/pg.20241279 | |
dc.identifier.isbn | 978-3-03868-250-9 | |
dc.identifier.pages | 12 pages | |
dc.identifier.uri | https://doi.org/10.2312/pg.20241279 | |
dc.identifier.uri | https://diglib.eg.org/handle/10.2312/pg20241279 | |
dc.publisher | The Eurographics Association | en_US |
dc.rights | Attribution 4.0 International License | |
dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ | |
dc.subject | Keywords: 3D object detection, autonomous driving, partitioned point cloud, self-attention mechanism CCS Concepts: Computing methodologies −→Computer vision; Model development and analysis | |
dc.subject | 3D object detection | |
dc.subject | autonomous driving | |
dc.subject | partitioned point cloud | |
dc.subject | self | |
dc.subject | attention mechanism CCS Concepts | |
dc.subject | Computing methodologies −→Computer vision | |
dc.subject | Model development and analysis | |
dc.title | PVP-SSD: Point-Voxel Fusion with Partitioned Point Cloud Sampling for Anchor-Free Single-Stage Small 3D Object Detection | en_US |
Files
Original bundle
1 - 1 of 1