Color and depth image registration algorithm based on multi-vector-fields constraints

Li, Xiaolin; Li, Daoqing; Peng, Li; Zhou, Huabing; Chen, Deng; Zhang, Yanduo; Xie, Liang

doi:10.1007/s11042-018-7048-4

Color and depth image registration algorithm based on multi-vector-fields constraints

Published: 03 January 2019

Volume 78, pages 24301–24319, (2019)
Cite this article

Multimedia Tools and Applications Aims and scope Submit manuscript

Xiaolin Li¹,
Daoqing Li¹,
Li Peng¹,
Huabing Zhou ORCID: orcid.org/0000-0001-5007-7303¹,
Deng Chen¹,
Yanduo Zhang¹ &
…
Liang Xie²

477 Accesses
Explore all metrics

Abstract

Image registration, which aim to establish a reliable feature relationship between images, is a critical problem in the field of image processing. In order to enhance the accuracy of color and depth image registration, this paper proposes an novel image registration algorithm based on multi-vector-fields constraints. We first initialize the edge information features of color and depth images, and establish putative correspondences based on edge information. Consider the correlation between the images, establish the functional relationships of the multi-vector-fields constraints based on the relationships. In the reproducing nuclear Hilbert space (RKHS), this constraint is added to the probability model, and the model parameters are optimized using the EM algorithm. Finally, the probability of corresponding edge points of the image is obtained. In order to further improve registration accuracy, this paper will change the input from one pair to two pairs and let the feature transformation relationship between images be iteratively evaluated using the parameter model. Taking publicly available RGB-D images as experimental subjects, results show that for single object image registration, the algorithm image registration accuracy in this paper is improved by about 5% compared with SC, ICP, and CPD algorithms. In addition, artificial noise was used to test the proposed algorithm’s anti-noise ability, results show that the proposed algorithm has superior anti-noise ability relative to SC, ICP and CPD algorithms.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Robust feature matching via Gaussian field criterion for remote sensing image registration

Article 02 April 2018

A non-parametric depth modification model for registration between color and depth images

Article 11 June 2018

Registration of Visible and Infrared Images Based on Gradient Information

Article 25 May 2017

References

Belongie S, Malik J, Puzicha J (2002) Shape matching and object recognition using shape contexts. PAMI 24(24):509–522
Article Google Scholar
Chum O, Matas J (2005) Matching with PROSAC – progressive sample consensus. Proc IEEE Conf Comput Vis Pattern Recog 1:220–226
Google Scholar
Essmaeel K, Gallo L, Damiani E, Pietro G, Dipanda A (2015) Comparison of methods for filtering for kinect depth data. Multimed Tools Appl 74(17):7331–7354
Article Google Scholar
Fischler MA, Bolles RC (1981) Random sample consensus: a paradigm for model fitting with application to image analysis and automated cartography. Commun ACM 24(6):381–395
Article MathSciNet Google Scholar
Hill DL, Batchelor PG, Holden M, Hawkes DJ (2001) Medical image registration. Phys Med Biol 46(3):R1–45
Article Google Scholar
Jackson BP, Goshtasby AA (2010) Registering aerial video images using the projective constraint [J]. IEEE Trans Image Process 19(3):795–804
Article MathSciNet MATH Google Scholar
Jean-Louis F, Van DDA (2000) The PX-EM algorithm for fast stable fitting of Henderson’s mixed model[J]. Gen Select Evol 32(2):143–163
Article Google Scholar
Jing P, Su Y, Nie L, et al (2018) A framework of joint low-rank and sparse regression for image memorability prediction[J]. IEEE Trans Circ Syst Vid Technol PP (99):1–1
Google Scholar
Jing P, Su Y, Nie L, et al (2018) Low-rank multi-view embedding learning for micro-video popularity prediction[J]. IEEE Trans Knowl Data Eng PP(99):1519–1532
Article Google Scholar
Jorge-Peas A, Bov H, Sanen K et al (2017) 3D full-field quantification of cell-induced large deformations in fibrillar biomaterials by combining non-rigid image registration with label-free second harmonic generation[J]. Biomaterials 136:86–97
Article Google Scholar
Lowe DG (2004) Distinctive image features from scale invariant keypoints[J]. Int J Comput Vis 60(2):91–110
Article Google Scholar
Ma J, Zhao J, Tian J, et al (2013) Regularized vector field learning with sparse approximation for mismatch removal. Pattern Recognit 46(12):3519–3532
Article MATH Google Scholar
Ma J, Zhao J, Tian J, Yuille AL, Tu Z (2014) Robust point matching via vector field consensus. IEEE Trans Image Process 23(4):1706–1721
Article MathSciNet MATH Google Scholar
Ma J, Zhao J, Ma Y, Tian J (2015) Non-rigid visible and infrared face registration via regularized Gaussian fields criterion. Pattern Recogn 48(3):772–784
Article Google Scholar
Ma J, Qiu W, Zhao J, et al (2015) Robust L2E estimation of transformation for non-rigid registration. IEEE Trans Signal Process 63(5):1115–1129
Article MathSciNet MATH Google Scholar
Ma J, Zhou H, Zhao J, et al (2015) Robust feature matching for remote sensing image registration via locally linear transforming[J]. IEEE Trans Geosci Remote Sens 53(12):6469–6481
Article Google Scholar
Ma J, Zhao J, Yuille AL (2016) Non-rigid point set registration by preserving global and local structures. IEEE Trans Image Process 25(1):53–64
Article MathSciNet MATH Google Scholar
Ma J, Chen C, Li C, et al (2016) Infrared and visible image fusion via gradient transfer and total variation minimization. Inf Fusion 31:100–109
Article Google Scholar
Ma J, Jiang J, Liu C, Li Y (2017) Feature guided Gaussian mixture model with semi-supervised EM and local geometric constraint for retinal image registration. Inf Sci 417:128–142
Article MathSciNet Google Scholar
Ma J, Ma Y, Li C (2019) Infrared and visible image fusion methods and applications: a survey. Inf Fusion 45:153–178
Article Google Scholar
Ma J, Yu W, Liang P, et al (2019) FusionGAN: a generative adversarial network for infrared and visible image fusion. Inf Fusion 48:11–26
Article Google Scholar
Torr PHS, Zisserman A (2000) MLESAC: a new robust estimator with application to estimating image geometry. Comput Vis Image Under-stand 78(1):138–156
Article Google Scholar
Xie L, Shen J, Han J et al (2017) Dynamic multi-view hashing for online image retrieval[C]. In: Twenty-sixth international joint conference on artificial intelligence, pp 3133–3139
Zhou H, Ma J, Yang C, et al (2016) Nonrigid feature matching for remote sensing images via probabilistic inference with global and local regularizations[J]. IEEE Geosc Rem Sens Lett 13(3):374–378
Google Scholar
Zhou H, Kuang Y, Yu Z, Ren S, Zhang Y, Lu T, Ma J (2018) Image deformation with vector-field interpolation based on MRLS-TPS. IEEE Access 7(1)
Zhu L, Shen J, Xie L, et al (2017) Unsupervised visual hashing with semantic assistant for content-based image retrieval[J]. IEEE Trans Knowl Data Eng 29(2):472–486
Article Google Scholar
Zitová B, Flusser J (2003) Image registration methods: a survey. Image Vis Comput 21(11):977–1000
Article Google Scholar

Download references

Author information

Authors and Affiliations

Hubei Key Laboratory of Intelligent Robot, Wuhan Institute of Technology, Wuhan, China
Xiaolin Li, Daoqing Li, Li Peng, Huabing Zhou, Deng Chen & Yanduo Zhang
Department of Mathematics, Wuhan University of Techonology, Wuhan, China
Liang Xie

Authors

Xiaolin Li
View author publications
You can also search for this author inPubMed Google Scholar
Daoqing Li
View author publications
You can also search for this author inPubMed Google Scholar
Li Peng
View author publications
You can also search for this author inPubMed Google Scholar
Huabing Zhou
View author publications
You can also search for this author inPubMed Google Scholar
Deng Chen
View author publications
You can also search for this author inPubMed Google Scholar
Yanduo Zhang
View author publications
You can also search for this author inPubMed Google Scholar
Liang Xie
View author publications
You can also search for this author inPubMed Google Scholar

Corresponding author

Correspondence to Huabing Zhou.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This work was supported in part by the National Natural Science Foundation of China under Grants 41501505 and 61771353, Natural Science Foundation of China Open University (G16F2505Q, G16F3702Z), 13th Five Year National Key R&D Project in China “The Hybrid Cloud-based Ubiquitous Location Awareness and Information Overlay Protoco” 2017YFB0503701, the Open Fund of the Key Laboratory of Intelligent Robotics of Hubei Province (HBIR201601) and Research Center of Artificial Intelligence, Yunnan Open University(No.2018JS373) Scientific Research Project of Education Department of Hubei Province(No. 20181508).

Appendix

1.1 A.1 Theoretical explanation of convergence

I have consulted a lot of relevant data, trying to prove that the algorithm is convergent theoretically. After many attempts, I realized that the core of the problem is to prove whether the objective function is a convex function.The main functions of the support computation model are:

$$ Q\left( f \right) = - \frac{1}{{2{\sigma^{2}}}} \sum \limits_{n = 1}^{N} {p_{n}}{\left\| {{y_{n}} - M({x_{n}})} \right\|^{2}} + \langle E^{1}C^{1},E^{3}C^{3} \rangle + \langle E^{2}C^{2},E^{4}C^{4} \rangle $$

(13)

Where f denotes the mapping relationship between four images, Q (f) denotes the parametric model about f, p_n denotes the posterior probability of the mixed model of the n th pair of registration points, and ∥y_n − M(x_n)∥² denotes the difference between the n th pair of initial input and the n th pair of target points after transformation,σ represents the standard deviation of Gauss probability model and belongs to the parameter. 〈f_k, f_k+ 2〉 is the inner product of functions in RHKS space. (that is the formula 10). If we can prove that the$- \frac {1}{{2{\sigma ^{2}}}} \sum \limits _{n = 1}^{N} {p_{n}}{\left \| {{y_{n}} - M({x_{n}})} \right \|^{2}} $ is convex, then we can solve the problem of convergence proof along the way of EM convergence proof.At the same time, because 〈f_k, f_k+ 2〉, under the conditions of this paper, can be converted to Gaussian radial basis function values, and negative exponential radial basis function in the independent variable greater than zero, the value of the function tends to zero.

Therefore, the key point is to prove that p_n∥y_n − M(x_n)∥² is a concave function (When multiplied by $- \frac {1}{{2{\sigma ^{2}}}}$, it becomes a convex function).Here is our proof:

First, for the polynomial ${\sum }_{n = 1}^{\infty }p_{n}\left \Vert Y_{n}-M(x_{n})\right \Vert ^{2}$, it can be regarded as an infinite series, where 0 <= p_n <= 1, 0 <= ∥Y_n − M(x_n)∥² <= distmax, (distmax represents the maximum distance between two points in the picture). Further, the inequality can be obtained:0 <= p_n ∥Y_n − M(x_n)∥²N₀ <= distmaxN₀ < L < 1,N₀ is a positive integer greater than zero, L is a constant satisfying the inequality. Obviously, N₀ exists.According to the root discriminant method, we know that series ${\sum }_{n = 1}^{\infty }p_{n}\left \Vert Y_{n}-M(x_{n})\right \Vert ^{2}$ must converge. Then, according to the Lazystatistician rule, we get the equation:

$$ {\sum}_{n = 1}^{N} p_{n}\left\Vert Y_{n}-M(x_{n})\right\Vert^{2}=E(\left\Vert Y_{n}-M(x_{n})\right\Vert^{2}); $$

(14)

where E represents expectation.

In functional analysis, for discrete variables, the two norm of a matrix is equal to the sum of the transformation of functions, that is: $\left \Vert Y_{n}-M(x_{n})\right \Vert ^{2}\Rightarrow \sum \limits _{n = 1}^{N}D(x_{n})^{2}$,where D(x₍n)) is a function equivalent expression of ∥Y_n − M(x_n)∥².

After that ,according to Hilbert kernel space property, any function in space can be represented by a base, $D(x_{n})={\sum }_{i = 1}^{N}D_{i}\sqrt {\lambda _{i}}\psi _{i},D_{i}$,where D(x_n) is the transformation of the i th component, λ_i is the corresponding constant of the i th transformation, and ψ_i is the kernel function. And the kernel function in this paper is ψ_i = exp(−β ∥x − x_i∥²), (β > 0),it’s a concave function,and because concave function adds or concave function, so the D(x_n) is a concave function, plus the minus sign of $- \frac {1}{{2{\sigma ^{2}}}}$, $- \frac {1}{{2{\sigma ^{2}}}} \sum \limits _{n = 1}^{N} {p_{n}}{\left \| {{y_{n}} - M({x_{n}})} \right \|^{2}} $ has the function property of convex function.

According to Jensen inequality, E(D(x_n)) >= D(E(x_n)),if and only if x₁ = x₂ = x₃ = ... = x_n obtains an equal sign. Therefore,

$-\sum \limits _{n = 1}^{N} {p_{n}}{\left \| {{y_{n}} - M({x_{n}})} \right \|^{2}}>=\sum \limits _{n = 1}^{N}p_{n}Y_{n}+\sum \limits _{n = 1}^{N}p_{n}M(x_{n})$.In the M-step of EM algorithm, the parameter maximization ensures that the lower bound $\sum \limits _{n = 1}^{N}p_{n}Y_{n}+\sum \limits _{n = 1}^{N}p_{n}M(x_{n})$ is greater than or equal to that before the iteration, so we can gradually approximate the convergence value of the objective function by raising the lower bound $\sum \limits _{n = 1}^{N}p_{n}Y_{n}+\sum \limits _{n = 1}^{N}p_{n}M(x_{n})$ iteratively.

Therefore, the objective function is convergent in theory.

1.2 A.2 Notation table

Table 3 is a supplement to the symbols of the formula, including the collection of main symbols.

Table 3 A notation table of main symbols

Full size table

Rights and permissions

Reprints and permissions

About this article

Cite this article

Li, X., Li, D., Peng, L. et al. Color and depth image registration algorithm based on multi-vector-fields constraints. Multimed Tools Appl 78, 24301–24319 (2019). https://doi.org/10.1007/s11042-018-7048-4

Download citation

Received: 26 June 2018
Revised: 14 November 2018
Accepted: 07 December 2018
Published: 03 January 2019
Issue Date: 15 September 2019
DOI: https://doi.org/10.1007/s11042-018-7048-4

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Color and depth image registration algorithm based on multi-vector-fields constraints

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Robust feature matching via Gaussian field criterion for remote sensing image registration

A non-parametric depth modification model for registration between color and depth images

Registration of Visible and Infrared Images Based on Gradient Information

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher’s note

Appendix

Appendix

1.1 A.1 Theoretical explanation of convergence

1.2 A.2 Notation table

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now