Skip to main content
Log in

An effective few-shot learning approach via location-dependent partial differential equation

  • Regular Paper
  • Published:
Knowledge and Information Systems Aims and scope Submit manuscript

Abstract

Recently, learning-based partial differential equation (L-PDE) has achieved success in few-shot learning area, while its feature weighting mechanism and recognition stability require further improvement. To address these issues, we propose a novel model called “location-dependent PDE” (LD-PDE) based on Navier–Stokes equation and rotational invariants in this paper. To our best knowledge, LD-PDE is the first application of the Navier–Stokes equation to achieve image recognition as a high-level vision task. Specifically, we formulate the feature variation with respect to each time step as a linear combination of rotational invariants in LD-PDE. Meanwhile, we design location-dependent mechanism to adaptively weight each invariant in an attention-based approach, which provides hierarchical discrimination in the spatial domain. Once the ultimate feature is learned, we measure the model error with the cross-entropy loss and update the parameters by the coordinate descent algorithm. As a verification, experimental results on face recognition datasets show that LD-PDE method outperforms the state-of-the-art approaches with few training samples. Moreover, compared to L-PDE, LD-PDE achieves a much more stable recognition with low sensitivity to its hyper-parameters.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

Notes

  1. The model was downloaded from the website http://www.robots.ox.ac.uk/~vgg/software/vgg_face/.

References

  1. Batchelor G (1967) An introduction to fluid dynamics. Cambridge University Press, Cambridge

    MATH  Google Scholar 

  2. Behmardi D, Nayeri E (2008) Introduction of fréchet and gâteaux derivative. Appl Math Sci 2(20):975–980

    MathSciNet  MATH  Google Scholar 

  3. Chan TF, Esedoglu S (2005) Aspects of total variation regularized \(L^1\) function approximation. SIAM J Appl Math 65(5):1817–1837

    Article  MathSciNet  MATH  Google Scholar 

  4. Chen K, Liu L (2009) “Best k”: critical clustering structures in categorical datasets. Knowl Inf Syst 20(1):1–33

    Article  Google Scholar 

  5. Chorowski JK, Bahdanau D, Serdyuk D, Cho K, Bengio Y (2015) Attention-based models for speech recognition. In: Proceedings of the 28th international conference on neural information processing systems, vol 1, pp 577–585

  6. Fan F, Zhao WX, Wen JR, Xu G, Chang EY (2017) Mining collective knowledge: inferring functional labels from online review for business. Knowl Inf Syst 53(3):723–747

    Article  Google Scholar 

  7. Fang C, Zhao Z, Zhou P, Lin Z (2017) Feature learning via partial differential equation with applications to face recognition. Pattern Recognit 69:14–25

    Article  Google Scholar 

  8. Fletcher R (1976) Conjugate gradient methods for indefinite systems. Numerical analysis. Springer, Berlin, pp 73–89

    Chapter  Google Scholar 

  9. Ganan S, McClure D (1985) Bayesian image analysis: an application to single photon emission tomography. Am Stat Assoc 20:12–18

    Google Scholar 

  10. Garcia V, Bruna J (2017) Few-shot learning with graph neural networks. arXiv preprint arXiv:1711.04043

  11. Geng M, Shang S, Ding B, Wang H, Zhang P (2019) Unsupervised learning-based depth estimation-aided visual slam approach. Circuits Syst Signal Process 1–28

  12. Georghiades AS, Belhumeur PN, Kriegman DJ (2001) From few to many: illumination cone models for face recognition under variable lighting and pose. IEEE Trans Pattern Anal Mach Intell 23(6):643–660

    Article  Google Scholar 

  13. Hinton GE, Salakhutdinov RR (2009) Replicated softmax: an undirected topic model. In: Proceedings of the 22nd international conference on neural information processing systems, pp 1607–1614

  14. Horng MH, Liou RJ (2011) Multilevel minimum cross entropy threshold selection based on the firefly algorithm. Expert Syst Appl 38(12):14805–14811

    Article  Google Scholar 

  15. Ishii H (1989) A boundary value problem of the dirichlet type for hamilton–jacobi equations. Annali della Scuola Normale Superiore di Pisa-Classe di Scienze 16(1):105–135

    MathSciNet  MATH  Google Scholar 

  16. Jang E, Gu S, Poole B (2016) Categorical reparameterization with gumbel-softmax. arXiv preprint arXiv:1611.01144

  17. Jiang L, Li C (2019) Two improved attribute weighting schemes for value difference metric. Knowl Inf Syst 60(2):949–970

    Article  Google Scholar 

  18. Jiang Z, Lin Z, Davis LS (2013) Label consistent K-SVD: learning a discriminative dictionary for recognition. IEEE Trans Pattern Anal Mach Intell 35(11):2651–2664

    Article  Google Scholar 

  19. Le QV, Ngiam J, Coates A, Lahiri A, Prochnow B, Ng AY (2011) On optimization methods for deep learning. In: Proceedings of the 28th international conference on international conference on machine learning. Omnipress, pp 265–272

  20. Leroux S, Bohez S, De Coninck E, Verbelen T, Vankeirsbilck B, Simoens P, Dhoedt B (2017) The cascading neural network: building the internet of smart things. Knowl Inf Syst 52(3):791–814

    Article  Google Scholar 

  21. Li D, Wen G, Hou Z, Huan E, Hu Y, Li H (2019) Rtcrelief-f: an effective clustering and ordering-based ensemble pruning algorithm for facial expression recognition. Knowl Inf Syst 59(1–32):219–250

    Article  Google Scholar 

  22. Liu R, Lin Z, Zhang W, Su Z (2010) Learning PDES for image restoration via optimal control. In: European conference on computer vision. Springer, Berlin, pp 115–128

  23. Liu R, Lin Z, Zhang W, Tang K, Su Z (2013) Toward designing intelligent pdes for computer vision: an optimal control approach. Image Vis Comput 31(1):43–56

    Article  Google Scholar 

  24. Liu R, Cao J, Lin Z, Shan S (2014) Adaptive partial differential equation learning for visual saliency detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 3866–3873

  25. Mairal J, Bach F, Ponce J (2012) Task-driven dictionary learning. IEEE Trans Pattern Anal Mach Intell 34(4):791–804

    Article  Google Scholar 

  26. Martinez AM (1998) The AR face database. CVC Technical Report 24

  27. Mitchell AR, Griffiths DF (1980) The finite difference method in partial differential equations. Wiley, New York

    MATH  Google Scholar 

  28. Osher S, Rudin LI (1990) Feature-oriented image enhancement using shock filters. SIAM J Numer Anal 27(4):919–940

    Article  MATH  Google Scholar 

  29. Parkhi OM, Vedaldi A, Zisserman A et al (2015) Deep face recognition. BMVC 1(3):6

    Google Scholar 

  30. Peterson LE (2009) \(K\)-nearest neighbor. Scholarpedia 4(2):1883

    Article  Google Scholar 

  31. Rudin LI, Osher S, Fatemi E (1992) Nonlinear total variation based noise removal algorithms. Phys D Nonlinear Phenom 60(1–4):259–268

    Article  MathSciNet  MATH  Google Scholar 

  32. Sapiro G (2006) Geometric partial differential equations and image analysis. Cambridge University Press, Cambridge

    MATH  Google Scholar 

  33. Sayeed F, Hanmandlu M (2017) Properties of information sets and information processing with an application to face recognition. Knowl Inf Syst 52(2):485–507

    Article  Google Scholar 

  34. Sim T, Baker S, Bsat M (2002) The CMU pose, illumination, and expression (PIE) database. In: Proceedings of fifth IEEE international conference on automatic face and gesture recognition. IEEE, pp 53–58

  35. Snell J, Swersky K, Zemel R (2017) Prototypical networks for few-shot learning. In: Proceedings of the 28th international conference on neural information processing systems, pp 4077–4087

  36. Strong D, Chan T (2003) Edge-preserving and scale-dependent properties of total variation regularization. Inverse Probl 19(6):S165

    Article  MathSciNet  MATH  Google Scholar 

  37. Tai XC, Borok S, Hahn J (2009) Image denoising using TV-stokes equation with an orientation-matching minimization. In: International conference on scale space and variational methods in computer vision. Springer, pp 490–501

  38. Tao D, Li X, Hu W, Maybank S, Wu X (2005) Supervised tensor learning. In: Fifth IEEE international conference on data mining. IEEE, p 8

  39. Temam R (2001) Navier–Stokes equations: theory and numerical analysis, vol 343. American Mathematical Society, New York

    MATH  Google Scholar 

  40. Triantafillou E, Zemel R, Urtasun R (2017) Few-shot learning through an information retrieval lens. In: Proceedings of the 31st international conference on neural information processing systems, pp 2255–2265

  41. Vilalta R, Drissi Y (2002) A perspective view and survey of meta-learning. Artif Intell Rev 18(2):77–95

    Article  Google Scholar 

  42. Vincent OR, Folorunso O (2009) A descriptive algorithm for sobel image edge detection. In: Proceedings of informing science and IT education conference (InSITE), vol 40. Informing Science Institute, California, pp 97–107

  43. Vinyals O, Blundell C, Lillicrap T, Wierstra D et al (2016) Matching networks for one shot learning. In: Proceedings of the 30th international conference on neural information processing systems, pp 3630–3638

  44. Wang Y, Huang M, Zhao L et al (2016) Attention-based LSTM for aspect-level sentiment classification. In: Proceedings of the 2016 conference on empirical methods in natural language processing, pp 606–615

  45. Wang YCF, Wei CP, Chen CF (2012) Low-rank matrix recovery with structural incoherence for robust face recognition. In: IEEE Conference on computer vision and pattern recognition. IEEE, pp 2618–2625

  46. Wardetzky M, Mathur S, Kälberer F, Grinspun E (2007) Discrete Laplace operators: no free lunch. In: Symposium on geometry processing. Aire-la-Ville, Switzerland, pp 33–37

  47. Wright J, Yang AY, Ganesh A, Sastry SS, Ma Y (2009) Robust face recognition via sparse representation. IEEE Trans Pattern Anal Mach Intell 31(2):210–227

    Article  Google Scholar 

  48. Wright SJ (2015) Coordinate descent algorithms. Math Program 151(1):3–34

    Article  MathSciNet  MATH  Google Scholar 

  49. Yao Y, Liu Y, Yu Y, Xu H, Lv W, Li Z, Chen X (2013) K-SVM: an effective svm algorithm based on \(k\)-means clustering. JCP 8(10):2632–2639

    Google Scholar 

  50. Zhang Y, Jiang Z, Davis LS (2013) Learning structured low-rank representations for image classification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 676–683

  51. Zhao W, Chellappa R, Phillips PJ, Rosenfeld A (2003) Face recognition: a literature survey. ACM Comput Surv (CSUR) 35(4):399–458

    Article  Google Scholar 

  52. Zhao W, Liu H, Dai W, Ma J (2016) An entropy-based clustering ensemble method to support resource allocation in business process management. Knowl Inf Syst 48(2):305–330

    Article  Google Scholar 

  53. Zhou P, Lin Z, Zhang C (2016) Integrated low-rank-based discriminative feature learning for recognition. IEEE Trans Neural Netw Learn Syst 27(5):1080–1093

    Article  MathSciNet  Google Scholar 

  54. Zineddin B, Wang Z, Liu X (2011) Cellular neural networks, the Navier–Stokes equation, and microarray image reconstruction. IEEE Trans Image Process 20(11):3296–3301

    Article  MathSciNet  MATH  Google Scholar 

Download references

Acknowledgements

This work was supported by the National Natural Science Foundation of China (Grant Nos. 91648204 and 11701566).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Zhenyu Zhao.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Wang, H., Zhao, Z. & Tang, Y. An effective few-shot learning approach via location-dependent partial differential equation. Knowl Inf Syst 62, 1881–1901 (2020). https://doi.org/10.1007/s10115-019-01400-y

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10115-019-01400-y

Keywords

Navigation