Optoelectronics Letters, Volume. 20, Issue 8, 483(2024)
PointNetV3: feature extraction with position encoding
Feature extraction of point clouds is a fundamental component of three-dimensional (3D) vision tasks. While existing feature extraction networks primarily focus on enhancing the geometric perception abilities of networks and overlook the crucial role played by coordinates. For instance, though two airplane wings share the same shape, it demands distinctfeature representations due to their differing positions. In this paper, we introduce a novel module called position aware module (PAM) to leverage the coordinate features of points for positional encoding, and integrating this encoding into the feature extraction network to provide essential positional context. Furthermore, we embed PAM into the PointNet++ framework, and design a novel feature extraction network, named PointNetV3. To validate the effectivenessof PointNetV3, we conducted comprehensive experiments including classification, object tracking and object detection on point cloud. The results of remarkable improvement in three tasks demonstrate the exceptional performance achieved by PointNetV3 in point cloud processing.
Get Citation
Copy Citation Text
WANG Jun, WANG Xuefei, ZHOU Boxiong, and GUO Dongyan. PointNetV3: feature extraction with position encoding[J]. Optoelectronics Letters, 2024, 20(8): 483
Received: Aug. 26, 2023
Accepted: Apr. 4, 2024
Published Online: Aug. 23, 2024
The Author Email: Dongyan and GUO (guodongyan@zjut.edu.cn)