Figure 1.
The overall architecture of our network. Our network mainly consists of feature extraction, PGT module, feature interaction with AIS module, key point selection module, overlapping mask prediction module and corresponding search module. The inputs are source point clouds X and target point clouds Y of dimension , and the network loops over the obtained features N times to refine the registration results after extracting the keypoints. The source point clouds X and target point clouds Y undergo feature extraction to encode features, followed by PGT module to enhance feature perception and encode relative position information to obtain feature , . Then similar feature information is captured by feature interaction with the AIS module and significant features and points that are in the overlap region are selected by scores. Finally, the transformation matrix T is obtained by the corresponding search module and the overlap mask , are optimized by Contrastive Learning. N indicates the number of iterations.
Figure 1.
The overall architecture of our network. Our network mainly consists of feature extraction, PGT module, feature interaction with AIS module, key point selection module, overlapping mask prediction module and corresponding search module. The inputs are source point clouds X and target point clouds Y of dimension , and the network loops over the obtained features N times to refine the registration results after extracting the keypoints. The source point clouds X and target point clouds Y undergo feature extraction to encode features, followed by PGT module to enhance feature perception and encode relative position information to obtain feature , . Then similar feature information is captured by feature interaction with the AIS module and significant features and points that are in the overlap region are selected by scores. Finally, the transformation matrix T is obtained by the corresponding search module and the overlap mask , are optimized by Contrastive Learning. N indicates the number of iterations.
Figure 2.
Our feature extraction module. We use adaptive graph convolution to extract the point cloud features, fusing the obtained multilevel features and obtaining the global features, followed by feature fusion of the extracted PPF geometrical features with the global features and the multilevel graph features to obtain 512-dimensional features.
Figure 2.
Our feature extraction module. We use adaptive graph convolution to extract the point cloud features, fusing the obtained multilevel features and obtaining the global features, followed by feature fusion of the extracted PPF geometrical features with the global features and the multilevel graph features to obtain 512-dimensional features.
Figure 3.
Our PGT module. We use fully connected layer FC, Sigmoid and Relu activation functions to build the position encoding module, we use concat operation to stitch features with position information and position enhancement by Geotransformer. Finally, we superimpose the original features to highlight saliency.
Figure 3.
Our PGT module. We use fully connected layer FC, Sigmoid and Relu activation functions to build the position encoding module, we use concat operation to stitch features with position information and position enhancement by Geotransformer. Finally, we superimpose the original features to highlight saliency.
Figure 4.
Our keypoints selection module. Uses MLP to extract significance scores and selects corresponding features and keypoints based on the top TOP-K scores.
Figure 4.
Our keypoints selection module. Uses MLP to extract significance scores and selects corresponding features and keypoints based on the top TOP-K scores.
Figure 5.
Corredpondences search module.
Figure 5.
Corredpondences search module.
Figure 6.
Visualization of the registration of the same category in ModelNet40. Red represents the source point cloud, blue represents the target point cloud, and green represents the source point cloud after registration. The settings for subsequent visualization results remain consistent. (a) plant, (b) vase, (c) night-stand, (d) plant.
Figure 6.
Visualization of the registration of the same category in ModelNet40. Red represents the source point cloud, blue represents the target point cloud, and green represents the source point cloud after registration. The settings for subsequent visualization results remain consistent. (a) plant, (b) vase, (c) night-stand, (d) plant.
Figure 7.
Visualization of the registration of the unseen category in ModelNet40. (a) monitor, (b) range-hood, (c) glass-box, (d) night-stand.
Figure 7.
Visualization of the registration of the unseen category in ModelNet40. (a) monitor, (b) range-hood, (c) glass-box, (d) night-stand.
Figure 8.
Visualization results for Gaussian noise in ModelNet40. (a) door, (b) table, (c) mantel, (d) bookshelf.
Figure 8.
Visualization results for Gaussian noise in ModelNet40. (a) door, (b) table, (c) mantel, (d) bookshelf.
Figure 9.
Visualization results for Gaussian noise with low overlap in ModelNet40. (a) night-stand, (b) laptop, (c) vase, (d) table.
Figure 9.
Visualization results for Gaussian noise with low overlap in ModelNet40. (a) night-stand, (b) laptop, (c) vase, (d) table.
Figure 10.
Our method achieves the best registration accuracy in terms of registration accuracy, and although the method DCP slightly outperforms us in terms of speed, our registration accuracy far exceeds it.
Figure 10.
Our method achieves the best registration accuracy in terms of registration accuracy, and although the method DCP slightly outperforms us in terms of speed, our registration accuracy far exceeds it.
Figure 11.
Registration results of our algorithm at different degrees of overlap. It can be seen that our method still maintains a good registration accuracy when the overlap degree decreases sharply.
Figure 11.
Registration results of our algorithm at different degrees of overlap. It can be seen that our method still maintains a good registration accuracy when the overlap degree decreases sharply.
Figure 12.
Regarding the comparison of our method with the recent CMIGNet method for different overlap ratios, the solid line corresponds to the rotation error and the dashed line corresponds to the translation error, which is much lower than the CMIGNet error.
Figure 12.
Regarding the comparison of our method with the recent CMIGNet method for different overlap ratios, the solid line corresponds to the rotation error and the dashed line corresponds to the translation error, which is much lower than the CMIGNet error.
Figure 13.
The results of our registration under different noise levels, we can see that our registration still maintains good accuracy when the noise is stacked sequentially.
Figure 13.
The results of our registration under different noise levels, we can see that our registration still maintains good accuracy when the noise is stacked sequentially.
Figure 14.
Our results on the real scene registration result. The color representation is consistent with previous experiments. Subfigures (a–d) represent the alignment results of the point clouds acquired at different viewing angles.
Figure 14.
Our results on the real scene registration result. The color representation is consistent with previous experiments. Subfigures (a–d) represent the alignment results of the point clouds acquired at different viewing angles.
Figure 15.
Further ablation of our approach regarding the combined AIS module and the feature interaction module FI, histograms mention their effects.
Figure 15.
Further ablation of our approach regarding the combined AIS module and the feature interaction module FI, histograms mention their effects.
Table 1.
The registration results of the same category in ModelNet40.
Table 1.
The registration results of the same category in ModelNet40.
Method | | | | |
---|
DCP [16] | 7.158 | 4.259 | 0.0328 | 0.0276 |
PointNetLK [26] | 16.214 | 7.319 | 0.0418 | 0.0312 |
IDAM [18] | 2.876 | 0.758 | 0.0284 | 0.0167 |
RPMNet [19] | 0.971 | 0.526 | 0.0108 | 0.0072 |
ROPNet [20] | 1.323 | 0.688 | 0.0153 | 0.0108 |
OMNet [27] | 1.231 | 0.672 | 0.0145 | 0.0094 |
FINet [40] | 1.132 | 0.598 | 0.0132 | 0.0089 |
CMIGNet [23] | 0.875 | 0.452 | 0.0051 | 0.0042 |
Ours | 0.467 | 0.309 | 0.0039 | 0.0024 |
Table 2.
Registration results of unseen categories in ModelNet40.
Table 2.
Registration results of unseen categories in ModelNet40.
Method | | | | |
---|
DCP [16] | 9.452 | 7.025 | 0.0358 | 0.0211 |
PointNetLK [26] | 23.228 | 9.842 | 0.0615 | 0.0289 |
IDAM [18] | 3.279 | 0.518 | 0.0213 | 0.0072 |
RPMNet [19] | 1.823 | 0.724 | 0.0142 | 0.0044 |
ROPNet [20] | 1.802 | 0.679 | 0.0141 | 0.0042 |
OMNet [27] | 2.985 | 1.085 | 0.0164 | 0.0062 |
FINet [40] | 2.458 | 0.854 | 0.0112 | 0.0055 |
CMIGNet [23] | 0.892 | 0.486 | 0.0049 | 0.0035 |
Ours | 0.576 | 0.358 | 0.0044 | 0.0028 |
Table 3.
ModelNet40 registration results under Gaussian noise.
Table 3.
ModelNet40 registration results under Gaussian noise.
Method | | | | |
---|
DCP [16] | 7.215 | 4.258 | 0.0268 | 0.0195 |
PointNetLK [26] | 19.253 | 9.352 | 0.0585 | 0.0475 |
IDAM [18] | 4.125 | 1.803 | 0.0255 | 0.0152 |
RPMNet [19] | 2.533 | 1.177 | 0.0234 | 0.0105 |
ROPNet [20] | 2.445 | 1.109 | 0.0221 | 0.0098 |
OMNet [27] | 2.412 | 0.988 | 0.0175 | 0.0092 |
FINet [40] | 1.813 | 0.962 | 0.0136 | 0.0087 |
CMIGNet [23] | 1.512 | 0.679 | 0.0063 | 0.0042 |
Ours | 1.082 | 0.601 | 0.0072 | 0.0038 |
Table 4.
Registration results for Gaussian noise with low overlap in ModelNet40.
Table 4.
Registration results for Gaussian noise with low overlap in ModelNet40.
Method | | | | |
---|
DCP [16] | 9.825 | 6.803 | 0.0986 | 0.0759 |
PointNetLK [26] | 36.223 | 23.375 | 0.2711 | 0.2019 |
IDAM [18] | 9.712 | 5.876 | 0.1223 | 0.0523 |
RPMNet [19] | 2.697 | 1.003 | 0.0309 | 0.0119 |
ROPNet [20] | 2.166 | 1.072 | 0.0200 | 0.0098 |
OMNet [27] | 4.876 | 3.654 | 0.0632 | 0.0403 |
FINet [40] | 4.956 | 2.942 | 0.0507 | 0.0301 |
CMIGNet [23] | 4.423 | 2.312 | 0.0287 | 0.0156 |
Ours | 1.406 | 0.696 | 0.0068 | 0.0036 |
Table 5.
Testing of point clouds with different overlap rates with Gaussian noise.
Table 5.
Testing of point clouds with different overlap rates with Gaussian noise.
Overlap Ratio | | | | |
---|
70% | 1.406 | 0.696 | 0.0068 | 0.0036 |
65% | 1.678 | 0.797 | 0.0119 | 0.0051 |
60% | 1.811 | 0.851 | 0.0118 | 0.0055 |
55% | 2.083 | 0.937 | 0.0154 | 0.0064 |
50% | 2.178 | 1.013 | 0.0168 | 0.0071 |
45% | 2.391 | 1.144 | 0.0175 | 0.0078 |
40% | 2.667 | 1.191 | 0.0197 | 0.0083 |
35% | 2.908 | 1.264 | 0.0206 | 0.0088 |
Table 6.
Testing of point clouds with different levels of Gaussian noise.
Table 6.
Testing of point clouds with different levels of Gaussian noise.
Gaussian Noise | | | | |
---|
0.01 | 1.406 | 0.696 | 0.0068 | 0.0036 |
0.02 | 1.631 | 0.939 | 0.0078 | 0.0049 |
0.03 | 2.056 | 1.118 | 0.0089 | 0.0052 |
0.04 | 2.213 | 1.246 | 0.0099 | 0.0063 |
0.05 | 2.375 | 1.406 | 0.0104 | 0.0071 |
Table 7.
Registration results in 7scenes Dataset.
Table 7.
Registration results in 7scenes Dataset.
Method | | | | |
---|
DCP [16] | 6.712 | 4.175 | 0.1976 | 0.0171 |
PointNetLK [26] | 4.051 | 2.902 | 0.0322 | 0.0091 |
IDAM [18] | 8.591 | 5.874 | 0.0328 | 0.0232 |
RPMNet [19] | 1.084 | 0.603 | 0.0071 | 0.0037 |
ROPNet [20] | 1.391 | 0.735 | 0.0074 | 0.0044 |
OMNet [27] | 1.448 | 0.835 | 0.0077 | 0.0046 |
FINet [40] | 1.781 | 0.902 | 0.0093 | 0.0052 |
CMIGNet [23] | 0.803 | 0.481 | 0.0031 | 0.0017 |
Ours | 0.189 | 0.101 | 0.0017 | 0.0011 |
Table 8.
Ablation Studies on Different Module.
Table 8.
Ablation Studies on Different Module.
PA | PGT | FAS | OM | | | | |
---|
- | - | - | - | 4.243 | 1.941 | 0.0276 | 0.0159 |
✓ | - | - | - | 3.894 | 1.577 | 0.0199 | 0.0076 |
✓ | ✓ | - | - | 1.996 | 0.801 | 0.0107 | 0.0052 |
✓ | ✓ | ✓ | - | 1.635 | 0.729 | 0.0096 | 0.0042 |
✓ | ✓ | ✓ | ✓ | 1.406 | 0.696 | 0.0068 | 0.0036 |