PVP-SSD: Point-Voxel Fusion with Partitioned Point Cloud Sampling for Anchor-Free Single-Stage Small 3D Object Detection

Wu, Xinlin; Tian, Yibin; Pan, Yin; Zhang, Zhiyuan; Wu, Xuesong; Wang, Ruisheng; Zeng, Zhi

PVP-SSD: Point-Voxel Fusion with Partitioned Point Cloud Sampling for Anchor-Free Single-Stage Small 3D Object Detection

dc.contributor.author	Wu, Xinlin	en_US
dc.contributor.author	Tian, Yibin	en_US
dc.contributor.author	Pan, Yin	en_US
dc.contributor.author	Zhang, Zhiyuan	en_US
dc.contributor.author	Wu, Xuesong	en_US
dc.contributor.author	Wang, Ruisheng	en_US
dc.contributor.author	Zeng, Zhi	en_US
dc.contributor.editor	Chen, Renjie	en_US
dc.contributor.editor	Ritschel, Tobias	en_US
dc.contributor.editor	Whiting, Emily	en_US
dc.date.accessioned	2024-10-13T18:03:25Z
dc.date.available	2024-10-13T18:03:25Z
dc.date.issued	2024
dc.description.abstract	Single-stage object detection from 3D point clouds in autonomous driving faces significant challenges, particularly in accurately detecting small objects. To address this issue, we propose a novel method called Point-Voxel dual-branch feature extraction with Partitioned point cloud sampling for anchor-free Single-Stage Detection of 3D objects (PVP-SSD). The network comprises two branches: a point branch and a voxel branch. In the point branch, a partitioned point cloud sampling strategy leverages axial features to divide the point cloud. Then, it assigns different sampling weights to various segments to enhance the sampling accuracy. Additionally, a local feature enhancement module explicitly calculates the correlation between key points and query points, improving the extraction of local features. In the voxel branch, we use 3D sparse convolution to extract instance structural features efficiently. The point-voxel dual-branch fusion dynamically integrates instance features extracted from both branches using a self-attention mechanism, which contains not only the category information of the detected object but also the spatial dimensions and heading angle. Consequently, PVP-SSD achieves a certain balance between preserving detailed information and maintaining structural integrity. Experimental results on the KITTI and ONCE datasets demonstrate that PVP-SSD excels in multi-category small 3D object detection.	en_US
dc.description.sectionheaders	Point Cloud Processing and Analysis I
dc.description.seriesinformation	Pacific Graphics Conference Papers and Posters
dc.identifier.doi	10.2312/pg.20241279
dc.identifier.isbn	978-3-03868-250-9
dc.identifier.pages	12 pages
dc.identifier.uri	https://doi.org/10.2312/pg.20241279
dc.identifier.uri	https://diglib.eg.org/handle/10.2312/pg20241279
dc.publisher	The Eurographics Association	en_US
dc.rights	Attribution 4.0 International License
dc.rights.uri	https://creativecommons.org/licenses/by/4.0/
dc.subject	Keywords: 3D object detection, autonomous driving, partitioned point cloud, self-attention mechanism CCS Concepts: Computing methodologies −→Computer vision; Model development and analysis
dc.subject	3D object detection
dc.subject	autonomous driving
dc.subject	partitioned point cloud
dc.subject	self
dc.subject	attention mechanism CCS Concepts
dc.subject	Computing methodologies −→Computer vision
dc.subject	Model development and analysis
dc.title	PVP-SSD: Point-Voxel Fusion with Partitioned Point Cloud Sampling for Anchor-Free Single-Stage Small 3D Object Detection	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: pg20241279.pdf
Size:: 20.73 MB
Format:: Adobe Portable Document Format

Download

Collections

PG2024 Conference Papers and Posters