Skip to main content
Log in

Color and depth image registration algorithm based on multi-vector-fields constraints

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Image registration, which aim to establish a reliable feature relationship between images, is a critical problem in the field of image processing. In order to enhance the accuracy of color and depth image registration, this paper proposes an novel image registration algorithm based on multi-vector-fields constraints. We first initialize the edge information features of color and depth images, and establish putative correspondences based on edge information. Consider the correlation between the images, establish the functional relationships of the multi-vector-fields constraints based on the relationships. In the reproducing nuclear Hilbert space (RKHS), this constraint is added to the probability model, and the model parameters are optimized using the EM algorithm. Finally, the probability of corresponding edge points of the image is obtained. In order to further improve registration accuracy, this paper will change the input from one pair to two pairs and let the feature transformation relationship between images be iteratively evaluated using the parameter model. Taking publicly available RGB-D images as experimental subjects, results show that for single object image registration, the algorithm image registration accuracy in this paper is improved by about 5% compared with SC, ICP, and CPD algorithms. In addition, artificial noise was used to test the proposed algorithm’s anti-noise ability, results show that the proposed algorithm has superior anti-noise ability relative to SC, ICP and CPD algorithms.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4

Similar content being viewed by others

References

  1. Belongie S, Malik J, Puzicha J (2002) Shape matching and object recognition using shape contexts. PAMI 24(24):509–522

    Article  Google Scholar 

  2. Chum O, Matas J (2005) Matching with PROSAC – progressive sample consensus. Proc IEEE Conf Comput Vis Pattern Recog 1:220–226

    Google Scholar 

  3. Essmaeel K, Gallo L, Damiani E, Pietro G, Dipanda A (2015) Comparison of methods for filtering for kinect depth data. Multimed Tools Appl 74(17):7331–7354

    Article  Google Scholar 

  4. Fischler MA, Bolles RC (1981) Random sample consensus: a paradigm for model fitting with application to image analysis and automated cartography. Commun ACM 24(6):381–395

    Article  MathSciNet  Google Scholar 

  5. Hill DL, Batchelor PG, Holden M, Hawkes DJ (2001) Medical image registration. Phys Med Biol 46(3):R1–45

    Article  Google Scholar 

  6. Jackson BP, Goshtasby AA (2010) Registering aerial video images using the projective constraint [J]. IEEE Trans Image Process 19(3):795–804

    Article  MathSciNet  MATH  Google Scholar 

  7. Jean-Louis F, Van DDA (2000) The PX-EM algorithm for fast stable fitting of Henderson’s mixed model[J]. Gen Select Evol 32(2):143–163

    Article  Google Scholar 

  8. Jing P, Su Y, Nie L, et al (2018) A framework of joint low-rank and sparse regression for image memorability prediction[J]. IEEE Trans Circ Syst Vid Technol PP (99):1–1

    Google Scholar 

  9. Jing P, Su Y, Nie L, et al (2018) Low-rank multi-view embedding learning for micro-video popularity prediction[J]. IEEE Trans Knowl Data Eng PP(99):1519–1532

    Article  Google Scholar 

  10. Jorge-Peas A, Bov H, Sanen K et al (2017) 3D full-field quantification of cell-induced large deformations in fibrillar biomaterials by combining non-rigid image registration with label-free second harmonic generation[J]. Biomaterials 136:86–97

    Article  Google Scholar 

  11. Lowe DG (2004) Distinctive image features from scale invariant keypoints[J]. Int J Comput Vis 60(2):91–110

    Article  Google Scholar 

  12. Ma J, Zhao J, Tian J, et al (2013) Regularized vector field learning with sparse approximation for mismatch removal. Pattern Recognit 46(12):3519–3532

    Article  MATH  Google Scholar 

  13. Ma J, Zhao J, Tian J, Yuille AL, Tu Z (2014) Robust point matching via vector field consensus. IEEE Trans Image Process 23(4):1706–1721

    Article  MathSciNet  MATH  Google Scholar 

  14. Ma J, Zhao J, Ma Y, Tian J (2015) Non-rigid visible and infrared face registration via regularized Gaussian fields criterion. Pattern Recogn 48(3):772–784

    Article  Google Scholar 

  15. Ma J, Qiu W, Zhao J, et al (2015) Robust L2E estimation of transformation for non-rigid registration. IEEE Trans Signal Process 63(5):1115–1129

    Article  MathSciNet  MATH  Google Scholar 

  16. Ma J, Zhou H, Zhao J, et al (2015) Robust feature matching for remote sensing image registration via locally linear transforming[J]. IEEE Trans Geosci Remote Sens 53(12):6469–6481

    Article  Google Scholar 

  17. Ma J, Zhao J, Yuille AL (2016) Non-rigid point set registration by preserving global and local structures. IEEE Trans Image Process 25(1):53–64

    Article  MathSciNet  MATH  Google Scholar 

  18. Ma J, Chen C, Li C, et al (2016) Infrared and visible image fusion via gradient transfer and total variation minimization. Inf Fusion 31:100–109

    Article  Google Scholar 

  19. Ma J, Jiang J, Liu C, Li Y (2017) Feature guided Gaussian mixture model with semi-supervised EM and local geometric constraint for retinal image registration. Inf Sci 417:128–142

    Article  MathSciNet  Google Scholar 

  20. Ma J, Ma Y, Li C (2019) Infrared and visible image fusion methods and applications: a survey. Inf Fusion 45:153–178

    Article  Google Scholar 

  21. Ma J, Yu W, Liang P, et al (2019) FusionGAN: a generative adversarial network for infrared and visible image fusion. Inf Fusion 48:11–26

    Article  Google Scholar 

  22. Torr PHS, Zisserman A (2000) MLESAC: a new robust estimator with application to estimating image geometry. Comput Vis Image Under-stand 78(1):138–156

    Article  Google Scholar 

  23. Xie L, Shen J, Han J et al (2017) Dynamic multi-view hashing for online image retrieval[C]. In: Twenty-sixth international joint conference on artificial intelligence, pp 3133–3139

  24. Zhou H, Ma J, Yang C, et al (2016) Nonrigid feature matching for remote sensing images via probabilistic inference with global and local regularizations[J]. IEEE Geosc Rem Sens Lett 13(3):374–378

    Google Scholar 

  25. Zhou H, Kuang Y, Yu Z, Ren S, Zhang Y, Lu T, Ma J (2018) Image deformation with vector-field interpolation based on MRLS-TPS. IEEE Access 7(1)

  26. Zhu L, Shen J, Xie L, et al (2017) Unsupervised visual hashing with semantic assistant for content-based image retrieval[J]. IEEE Trans Knowl Data Eng 29(2):472–486

    Article  Google Scholar 

  27. Zitová B, Flusser J (2003) Image registration methods: a survey. Image Vis Comput 21(11):977–1000

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Huabing Zhou.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This work was supported in part by the National Natural Science Foundation of China under Grants 41501505 and 61771353, Natural Science Foundation of China Open University (G16F2505Q, G16F3702Z), 13th Five Year National Key R&D Project in China “The Hybrid Cloud-based Ubiquitous Location Awareness and Information Overlay Protoco” 2017YFB0503701, the Open Fund of the Key Laboratory of Intelligent Robotics of Hubei Province (HBIR201601) and Research Center of Artificial Intelligence, Yunnan Open University(No.2018JS373) Scientific Research Project of Education Department of Hubei Province(No. 20181508).

Appendix

Appendix

1.1 A.1 Theoretical explanation of convergence

I have consulted a lot of relevant data, trying to prove that the algorithm is convergent theoretically. After many attempts, I realized that the core of the problem is to prove whether the objective function is a convex function.The main functions of the support computation model are:

$$ Q\left( f \right) = - \frac{1}{{2{\sigma^{2}}}} \sum \limits_{n = 1}^{N} {p_{n}}{\left\| {{y_{n}} - M({x_{n}})} \right\|^{2}} + \langle E^{1}C^{1},E^{3}C^{3} \rangle + \langle E^{2}C^{2},E^{4}C^{4} \rangle $$
(13)

Where f denotes the mapping relationship between four images, Q (f) denotes the parametric model about f, pn denotes the posterior probability of the mixed model of the n th pair of registration points, and ∥ynM(xn)∥2 denotes the difference between the n th pair of initial input and the n th pair of target points after transformation,σ represents the standard deviation of Gauss probability model and belongs to the parameter. 〈fk, fk+ 2〉 is the inner product of functions in RHKS space. (that is the formula 10). If we can prove that the\(- \frac {1}{{2{\sigma ^{2}}}} \sum \limits _{n = 1}^{N} {p_{n}}{\left \| {{y_{n}} - M({x_{n}})} \right \|^{2}} \) is convex, then we can solve the problem of convergence proof along the way of EM convergence proof.At the same time, because 〈fk, fk+ 2〉, under the conditions of this paper, can be converted to Gaussian radial basis function values, and negative exponential radial basis function in the independent variable greater than zero, the value of the function tends to zero.

Therefore, the key point is to prove that pnynM(xn)∥2 is a concave function (When multiplied by \(- \frac {1}{{2{\sigma ^{2}}}}\), it becomes a convex function).Here is our proof:

First, for the polynomial \({\sum }_{n = 1}^{\infty }p_{n}\left \Vert Y_{n}-M(x_{n})\right \Vert ^{2}\), it can be regarded as an infinite series, where 0 <= pn <= 1, 0 <= ∥YnM(xn)∥2 <= distmax, (distmax represents the maximum distance between two points in the picture). Further, the inequality can be obtained:0 <= pnYnM(xn)∥2N0 <= distmaxN0 < L < 1,N0 is a positive integer greater than zero, L is a constant satisfying the inequality. Obviously, N0 exists.According to the root discriminant method, we know that series \({\sum }_{n = 1}^{\infty }p_{n}\left \Vert Y_{n}-M(x_{n})\right \Vert ^{2}\) must converge. Then, according to the Lazystatistician rule, we get the equation:

$$ {\sum}_{n = 1}^{N} p_{n}\left\Vert Y_{n}-M(x_{n})\right\Vert^{2}=E(\left\Vert Y_{n}-M(x_{n})\right\Vert^{2}); $$
(14)

where E represents expectation.

In functional analysis, for discrete variables, the two norm of a matrix is equal to the sum of the transformation of functions, that is: \(\left \Vert Y_{n}-M(x_{n})\right \Vert ^{2}\Rightarrow \sum \limits _{n = 1}^{N}D(x_{n})^{2}\),where D(x(n)) is a function equivalent expression of ∥YnM(xn)∥2.

After that ,according to Hilbert kernel space property, any function in space can be represented by a base, \(D(x_{n})={\sum }_{i = 1}^{N}D_{i}\sqrt {\lambda _{i}}\psi _{i},D_{i}\),where D(xn) is the transformation of the i th component, λi is the corresponding constant of the i th transformation, and ψi is the kernel function. And the kernel function in this paper is ψi = exp(−βxxi2), (β > 0),it’s a concave function,and because concave function adds or concave function, so the D(xn) is a concave function, plus the minus sign of \(- \frac {1}{{2{\sigma ^{2}}}}\), \(- \frac {1}{{2{\sigma ^{2}}}} \sum \limits _{n = 1}^{N} {p_{n}}{\left \| {{y_{n}} - M({x_{n}})} \right \|^{2}} \) has the function property of convex function.

According to Jensen inequality, E(D(xn)) >= D(E(xn)),if and only if x1 = x2 = x3 = ... = xn obtains an equal sign. Therefore,

\(-\sum \limits _{n = 1}^{N} {p_{n}}{\left \| {{y_{n}} - M({x_{n}})} \right \|^{2}}>=\sum \limits _{n = 1}^{N}p_{n}Y_{n}+\sum \limits _{n = 1}^{N}p_{n}M(x_{n})\).In the M-step of EM algorithm, the parameter maximization ensures that the lower bound \(\sum \limits _{n = 1}^{N}p_{n}Y_{n}+\sum \limits _{n = 1}^{N}p_{n}M(x_{n})\) is greater than or equal to that before the iteration, so we can gradually approximate the convergence value of the objective function by raising the lower bound \(\sum \limits _{n = 1}^{N}p_{n}Y_{n}+\sum \limits _{n = 1}^{N}p_{n}M(x_{n})\) iteratively.

Therefore, the objective function is convergent in theory.

1.2 A.2 Notation table

Table 3 is a supplement to the symbols of the formula, including the collection of main symbols.

Table 3 A notation table of main symbols

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Li, X., Li, D., Peng, L. et al. Color and depth image registration algorithm based on multi-vector-fields constraints. Multimed Tools Appl 78, 24301–24319 (2019). https://doi.org/10.1007/s11042-018-7048-4

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-018-7048-4

Keywords

Navigation