Three-Filters-to-Normal+: Revisiting Discontinuity Discrimination in Depth-to-Normal Translation
Jingwei Yang
Bohuan Xue
Yi Feng
Deming Wang
Qijun Chen
Rui Fan
[Paper]
[GitHub]
The code can be found in this repository.

Abstract

This article introduces three-filters-to-normal+ (3F2N+), an extension of our previous work three-filtersto- normal (3F2N), with a specific focus on incorporating discontinuity discrimination capability into surface normal estimators (SNEs). 3F2N+ achieves this capability by utilizing a novel discontinuity discrimination module (DDM), which combines depth curvature minimization and correlation coefficient maximization through conditional random fields (CRFs). To evaluate the robustness of SNEs on noisy data, we create a large-scale synthetic surface normal (SSN) dataset containing 20 scenarios (ten indoor scenarios and ten outdoor scenarios with and without random Gaussian noise added to depth images). Extensive experiments demonstrate that 3F2N+ achieves greater performance than all other geometry-based surface normal estimators, with average angular errors of 7.85◦, 8.95◦, 9.25◦, and 11.98◦ on the clean-indoor, clean-outdoor, noisy-indoor, and noisy-outdoor datasets, respectively. We conduct three additional experiments to demonstrate the effectiveness of incorporating our proposed 3F2N+ into downstream robot perception tasks, including freespace detection, 6D object pose estimation, and point cloud completion. Our source code and datasets are publicly available at https://mias.group/3F2Nplus/.

Paper and Supplementary Material

J. Yang, B. Xue, Y. Feng, D. Wang, Q. Chen, R. Fan.
Three-Filters-to-Normal+: Revisiting Discontinuity Discrimination in Depth-to-Normal Translation
In TASE, 2024.
(hosted on IEEEXplore)


[Bibtex]


Acknowledgements

This work was supported in part by the National Key Research and Development Program of China under Grant 2020AAA0108100, in part by the National Natural Science Foundation of China under Grant 62233013, in part by the Science and Technology Commission of Shanghai Municipal under Grant 22511104500, in part by the Fundamental Research Funds for Central Universities, and in part by the Xiaomi Young Talents Program (Corresponding author: Rui Fan).