Skip to main content
Log in

Nuclear norm regularized convolutional Max Pos@Top machine

  • Original Article
  • Published:
Neural Computing and Applications Aims and scope Submit manuscript

Abstract

In this paper, we propose a novel classification model for the multiple instance data, which aims to maximize the number of positive instances ranked before the top-ranked negative instances. This method belongs to a recently emerged performance, named as Pos@Top. Our proposed classification model has a convolutional structure that is composed by four layers, i.e., the convolutional layer, the activation layer, the max-pooling layer and the full connection layer. In this paper, we propose an algorithm to learn the convolutional filters and the full connection weights to maximize the Pos@Top measure over the training set. Also, we try to minimize the rank of the filter matrix to explore the low-dimensional space of the instances in conjunction with the classification results. The rank minimization is conducted by the nuclear norm minimization of the filter matrix. In addition, we develop an iterative algorithm to solve the corresponding problem. We test our method on several benchmark datasets. The experimental results show the superiority of our method compared with other state-of-the-art Pos@Top maximization methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1

Similar content being viewed by others

References

  1. Abadi M, Agarwal A, Barham P, Brevdo E, Chen Z, Citro C, Corrado GS, Davis A, Dean J, Devin M et al (2015) Tensorflow: large-scale machine learning on heterogeneous systems, 2015. Software available from tensorflow.org 1

  2. Agarwal S (2011) The infinite push: a new support vector ranking algorithm that directly optimizes accuracy at the absolute top of the list. In: Proceedings of the 11th SIAM international conference on data mining, SDM 2011, pp 839–850

  3. Al Madi, NS, Khan JI (2016) Measuring learning performance and cognitive activity during multimodal comprehension. In: 2016 7th international conference on information and communication systems (ICICS). IEEE, pp 50–55

  4. Bewley A, Upcroft B (2016) From imagenet to mining: adapting visual object detection with minimal supervision. Springer Tracts Adv Robot 113:501–514

    Article  Google Scholar 

  5. Boyd S, Cortes C, Mohri M, Radovanovic A (2012) Accuracy at the top. Adv Neural Inf Process Syst 2:953–961

    Google Scholar 

  6. Chen S, Wang H, Xu F, Jin YQ (2016) Target classification using the deep convolutional networks for SAR images. IEEE Trans Geosci Remote Sens 54(8):4806–4817

    Article  Google Scholar 

  7. Ding M, Fan G (2013) Multi-layer joint gait-pose manifold for human motion modeling. In: 2013 10th IEEE international conference and workshops on automatic face and gesture recognition (FG), pp 1–8

  8. Ding M, Fan G (2015) Generalized sum of Gaussians for real-time human pose tracking from a single depth sensor. In: 2015 IEEE winter conference on applications of computer vision, pp 47–54

  9. Ding M, Fan G (2015) Multilayer joint gait-pose manifolds for human gait motion modeling. IEEE Trans Cybern 45(11):2413–2424

    Article  Google Scholar 

  10. Ding M, Fan G (2016) Articulated and generalized Gaussian kernel correlation for human pose estimation. IEEE Trans Image Process 25(2):776–789

    Article  MathSciNet  Google Scholar 

  11. Fan J, Liang RZ (2016) Stochastic learning of multi-instance dictionary for earth mover’s distance based histogram comparison. Neural Comput Appl 1–11. arXiv:1609.00817v1

  12. Goadrich M, Oliphant L, Shavlik J (2006) Gleaner: creating ensembles of first-order clauses to improve recall–precision curves. Mach Learn 64(1–3):231–261

    Article  MATH  Google Scholar 

  13. Gong C, Tao D, Maybank SJ, Liu W, Kang G, Yang J (2016) Multi-modal curriculum learning for semi-supervised image classification. IEEE Trans Image Process 25(7):3249–3260

    Article  MathSciNet  Google Scholar 

  14. Griffin G, Holub A, Perona P (2007) Caltech-256 object category dataset. Technical report 7694, California Institute of Technology

  15. Halligan S, Altman DG, Mallett S (2015) Disadvantages of using the area under the receiver operating characteristic curve to assess imaging tests: a discussion and proposal for an alternative approach. Eur Radiol 25(4):932–939

    Article  Google Scholar 

  16. Hammami N, Bedda M, Farah N (2012) Spoken Arabic digits recognition using MFCC based on GMM. In: 2012 IEEE conference on sustainable utilization and development in engineering and technology (STUDENT). IEEE, pp 160–163

  17. Harris G, Panangadan A, Prasanna VK (2015) Learning of performance measures from crowd-sourced data with application to ranking of investments. In: Pacific–Asia conference on knowledge discovery and data mining. Springer, pp 538–549

  18. He K, Zhang X, Ren S, Sun J (2016) Delving deep into rectifiers: surpassing human-level performance on imagenet classification. In: Proceedings of the IEEE international conference on computer vision, 11–18 December 2015, pp 1026–1034

  19. Hendrickx I, Kim SN, Kozareva Z, Nakov P, Ó Séaghdha, D, Padó, S, Pennacchiotti, M, Romano, L, Szpakowicz, S (2009) Semeval-2010 task 8: multi-way classification of semantic relations between pairs of nominals. In: Proceedings of the workshop on semantic evaluations: recent achievements and future directions. Association for Computational Linguistics, pp 94–99

  20. Hentschel C, Wiradarma T, Sack H (2015) If we did not have imagenet: comparison of fisher encodings and convolutional neural networks on limited training data. Lecture notes in computer science (including subseries lecture notes in artificial intelligence and lecture notes in bioinformatics), vol 9475. Springer, Switzerland, pp 400–409

  21. Jain S, Kashyap R, Kuo TT, Bhargava S, Lin G, Hsu CN (2016) Weakly supervised learning of biomedical information extraction from curated data. BMC Bioinform 17(1):1

    Article  Google Scholar 

  22. Joachims T (2005) A support vector method for multivariate performance measures. In: Proceedings of the 22nd international conference on machine learning. ACM, pp 377–384

  23. Li N, Jin R, Zhou ZH (2014) Top rank optimization in linear time. Adv Neural Inf Process Syst 2:1502–1510

    Google Scholar 

  24. Liang RZ, Shi L, Wang H, Meng J, Wang JJY, Sun Q, Gu, Y (2016) Optimizing top precision performance measure of content-based image retrieval by learning similarity function. In: 2016 23st International conference on pattern recognition (ICPR). IEEE

  25. Liang RZ, Xie W, Li W, Wang H, Wang JJY, Taylor L (2016) A novel transfer learning method based on common space mapping and weighted domain matching. In: 2016 IEEE 28th international conference on tools with artificial intelligence (ICTAI)

  26. Lin F, Wang J, Zhang N, Xiahou J, McDonald N (2016) Multi-kernel learning for multivariate performance measures optimization. Neural Comput Appl 1–13. arXiv:1508.06264v1

  27. Lu S, Lu H, Kolarik WJ (2001) Multivariate performance reliability prediction in real-time. Reliab Eng Syst Saf 72(1):39–45

    Article  Google Scholar 

  28. Madsen ME, Konge L, Nørgaard LN, Tabor A, Ringsted C, Klemmensen Å, Ottesen B, Tolsgaard MG (2014) Assessment of performance measures and learning curves for use of a virtual-reality ultrasound simulator in transvaginal ultrasound examination. Ultrasound Obstet Gynecol 44(6):693–699

    Article  Google Scholar 

  29. Mao Q, Tsang IWH (2013) A feature selection method for multivariate performance measures. IEEE Trans Pattern Anal Mach Intell 35(9):2051–2063

    Article  Google Scholar 

  30. Martinel N, Piciarelli C, Micheloni C (2016) A supervised extreme learning committee for food recognition. Comput Vis Image Underst 148:67–86

    Article  Google Scholar 

  31. Meyen A, Sooriyarachchi M (2016) Simulation study of a novel method for comparing more than two independent receiver operating characteristic (ROC) curves based on the area under the curves (AUCS). J Natl Sci Found Sri Lanka 43(4):357–367

    Article  Google Scholar 

  32. Patel M, Agius S, Wilkinson J, Patel L, Baker P (2016) Value of supervised learning events in predicting doctors in difficulty. Med Educ 50(7):746–756

    Article  Google Scholar 

  33. Russakovsky O, Deng J, Su H, Krause J, Satheesh S, Ma S, Huang Z, Karpathy A, Khosla A, Bernstein M, Berg A, Fei-Fei L (2015) Imagenet large scale visual recognition challenge. Int J Comput Vis 115(3):211–252

    Article  MathSciNet  Google Scholar 

  34. Shih SM, Wu WH, Hsieh HN (2016) A non-inferiority test for diagnostic accuracy in the absence of the golden standard test based on the paired partial areas under receiver operating characteristic curves. J Appl Stat 43(3):550–562

    Article  MathSciNet  Google Scholar 

  35. Xu S, Xu L, Zhan Z, Ye K, Han K, Born F (2014) Method and system for resilient and adaptive detection of malicious websites. US Patent WO2013184653 A1

  36. Sofotasios PC, Fikadu MK, Ho-Van K, Valkama M, Karagiannidis GK (2014) The area under a receiver operating characteristic curve over enriched multipath fading conditions. In: 2014 IEEE global communications conference. IEEE, pp 3490–3495

  37. Sun F, Guo J, Lan Y, Xu J, Cheng X (2015) Learning word representations by jointly modeling syntagmatic and paradigmatic relations. In: Proceedings of the 53rd annual meeting of the association for computational linguistics and the 7th international joint conference on natural language processing, Beijing, China, 26–31 July 2015, pp 136–145

  38. Takeda A, Kanamori T (2014) Using financial risk measures for analyzing generalization performance of machine learning models. Neural Netw 57:29–38

    Article  MATH  Google Scholar 

  39. Tang B, Liu X, Lei J, Song M, Tao D, Sun S, Dong F (2016) Deepchart: combining deep convolutional networks and deep belief networks in chart classification. Signal Process 124:156–161

    Article  Google Scholar 

  40. Wang CY, Peng DY, Xu L, Yi XS (2007) Gradual gray-watermark embedding algorithm in the wavelet domain. J Comput Appl 6:025

    Google Scholar 

  41. Wang J, Wang H, Zhou Y, McDonald N (2015) Multiple kernel multivariate performance learning using cutting plane algorithm. In: 2015 IEEE international conference on systems, man, and cybernetics (SMC). IEEE, pp 1870–1875

  42. Wang JJY, Tsang IWH, Gao X (2016) Optimizing multivariate performance measures from multi-view data. In: Thirtieth AAAI conference on artificial intelligence, pp 2152—2158

  43. Wang L, Scott K, Xu L, Clausi D (2016) Sea ice concentration estimation during melt from dual-pol SAR scenes using deep convolutional neural networks: a case study. IEEE Trans Geosci Remote Sens 54(8):4524–4533

    Article  Google Scholar 

  44. Xu L, Zhan Z, Xu S, Ye K (2013) Cross-layer detection of malicious websites. In: Proceedings of the third ACM conference on data and application security and privacy. ACM, pp 141–152

  45. Xu L, Zhan Z, Xu S, Ye K (2014) An evasion and counter-evasion study in malicious websites detection. In: 2014 IEEE conference on communications and network security (CNS). IEEE, pp 265–273

  46. Yang W, Jin L, Tao D, Xie Z, Feng Z (2016) Dropsample: a new training method to enhance deep convolutional neural networks for large-scale unconstrained handwritten chinese character recognition. Pattern Recogn 58:190–203

    Article  Google Scholar 

  47. Zahedi M, Sorkhi A (2013) Improving text classification performance using PCA and recall–precision criteria. Arab J Sci Eng 38(8):2095–2102

    Article  Google Scholar 

  48. Zhang P, Su W (2012) Statistical inference on recall, precision and average precision under random selection. In: Proceedings—2012 9th international conference on fuzzy systems and knowledge discovery, FSKD 2012, pp 1348–1352

  49. Zokaei N, Burnett Heyes S, Gorgoraptis N, Budhdeo S, Husain M (2015) Working memory recall precision is a more sensitive index than span. J Neuropsychol 9(2):319–329

    Article  Google Scholar 

Download references

Acknowledgements

The Foundation of modern educational technology research of Jiangsu Province (No. 2015-R-42631) and The University Natural Science Foundation of Jiangsu Province (14KJD520003).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Qinfeng Li.

Ethics declarations

Conflict of interest

The authors declare that they have no conflicts of interests.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Li, Q., Zhou, X., Gu, A. et al. Nuclear norm regularized convolutional Max Pos@Top machine. Neural Comput & Applic 30, 463–472 (2018). https://doi.org/10.1007/s00521-016-2680-2

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00521-016-2680-2

Keywords

Navigation