Nuclear norm regularized convolutional Max Pos@Top machine

Li, Qinfeng; Zhou, Xiaofeng; Gu, Aihua; Li, Zonghua; Liang, Ru-Ze

doi:10.1007/s00521-016-2680-2

Nuclear norm regularized convolutional Max Pos@Top machine

Original Article
Published: 18 November 2016

Volume 30, pages 463–472, (2018)
Cite this article

Neural Computing and Applications Aims and scope Submit manuscript

Qinfeng Li^1,2,
Xiaofeng Zhou¹,
Aihua Gu¹,
Zonghua Li¹ &
…
Ru-Ze Liang³

453 Accesses
24 Citations
1 Altmetric
Explore all metrics

Abstract

In this paper, we propose a novel classification model for the multiple instance data, which aims to maximize the number of positive instances ranked before the top-ranked negative instances. This method belongs to a recently emerged performance, named as Pos@Top. Our proposed classification model has a convolutional structure that is composed by four layers, i.e., the convolutional layer, the activation layer, the max-pooling layer and the full connection layer. In this paper, we propose an algorithm to learn the convolutional filters and the full connection weights to maximize the Pos@Top measure over the training set. Also, we try to minimize the rank of the filter matrix to explore the low-dimensional space of the instances in conjunction with the classification results. The rank minimization is conducted by the nuclear norm minimization of the filter matrix. In addition, we develop an iterative algorithm to solve the corresponding problem. We test our method on several benchmark datasets. The experimental results show the superiority of our method compared with other state-of-the-art Pos@Top maximization methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

SSD: Single Shot MultiBox Detector

Microsoft COCO: Common Objects in Context

ImageNet Large Scale Visual Recognition Challenge

Article 11 April 2015

References

Abadi M, Agarwal A, Barham P, Brevdo E, Chen Z, Citro C, Corrado GS, Davis A, Dean J, Devin M et al (2015) Tensorflow: large-scale machine learning on heterogeneous systems, 2015. Software available from tensorflow.org 1
Agarwal S (2011) The infinite push: a new support vector ranking algorithm that directly optimizes accuracy at the absolute top of the list. In: Proceedings of the 11th SIAM international conference on data mining, SDM 2011, pp 839–850
Al Madi, NS, Khan JI (2016) Measuring learning performance and cognitive activity during multimodal comprehension. In: 2016 7th international conference on information and communication systems (ICICS). IEEE, pp 50–55
Bewley A, Upcroft B (2016) From imagenet to mining: adapting visual object detection with minimal supervision. Springer Tracts Adv Robot 113:501–514
Article Google Scholar
Boyd S, Cortes C, Mohri M, Radovanovic A (2012) Accuracy at the top. Adv Neural Inf Process Syst 2:953–961
Google Scholar
Chen S, Wang H, Xu F, Jin YQ (2016) Target classification using the deep convolutional networks for SAR images. IEEE Trans Geosci Remote Sens 54(8):4806–4817
Article Google Scholar
Ding M, Fan G (2013) Multi-layer joint gait-pose manifold for human motion modeling. In: 2013 10th IEEE international conference and workshops on automatic face and gesture recognition (FG), pp 1–8
Ding M, Fan G (2015) Generalized sum of Gaussians for real-time human pose tracking from a single depth sensor. In: 2015 IEEE winter conference on applications of computer vision, pp 47–54
Ding M, Fan G (2015) Multilayer joint gait-pose manifolds for human gait motion modeling. IEEE Trans Cybern 45(11):2413–2424
Article Google Scholar
Ding M, Fan G (2016) Articulated and generalized Gaussian kernel correlation for human pose estimation. IEEE Trans Image Process 25(2):776–789
Article MathSciNet Google Scholar
Fan J, Liang RZ (2016) Stochastic learning of multi-instance dictionary for earth mover’s distance based histogram comparison. Neural Comput Appl 1–11. arXiv:1609.00817v1
Goadrich M, Oliphant L, Shavlik J (2006) Gleaner: creating ensembles of first-order clauses to improve recall–precision curves. Mach Learn 64(1–3):231–261
Article MATH Google Scholar
Gong C, Tao D, Maybank SJ, Liu W, Kang G, Yang J (2016) Multi-modal curriculum learning for semi-supervised image classification. IEEE Trans Image Process 25(7):3249–3260
Article MathSciNet Google Scholar
Griffin G, Holub A, Perona P (2007) Caltech-256 object category dataset. Technical report 7694, California Institute of Technology
Halligan S, Altman DG, Mallett S (2015) Disadvantages of using the area under the receiver operating characteristic curve to assess imaging tests: a discussion and proposal for an alternative approach. Eur Radiol 25(4):932–939
Article Google Scholar
Hammami N, Bedda M, Farah N (2012) Spoken Arabic digits recognition using MFCC based on GMM. In: 2012 IEEE conference on sustainable utilization and development in engineering and technology (STUDENT). IEEE, pp 160–163
Harris G, Panangadan A, Prasanna VK (2015) Learning of performance measures from crowd-sourced data with application to ranking of investments. In: Pacific–Asia conference on knowledge discovery and data mining. Springer, pp 538–549
He K, Zhang X, Ren S, Sun J (2016) Delving deep into rectifiers: surpassing human-level performance on imagenet classification. In: Proceedings of the IEEE international conference on computer vision, 11–18 December 2015, pp 1026–1034
Hendrickx I, Kim SN, Kozareva Z, Nakov P, Ó Séaghdha, D, Padó, S, Pennacchiotti, M, Romano, L, Szpakowicz, S (2009) Semeval-2010 task 8: multi-way classification of semantic relations between pairs of nominals. In: Proceedings of the workshop on semantic evaluations: recent achievements and future directions. Association for Computational Linguistics, pp 94–99
Hentschel C, Wiradarma T, Sack H (2015) If we did not have imagenet: comparison of fisher encodings and convolutional neural networks on limited training data. Lecture notes in computer science (including subseries lecture notes in artificial intelligence and lecture notes in bioinformatics), vol 9475. Springer, Switzerland, pp 400–409
Jain S, Kashyap R, Kuo TT, Bhargava S, Lin G, Hsu CN (2016) Weakly supervised learning of biomedical information extraction from curated data. BMC Bioinform 17(1):1
Article Google Scholar
Joachims T (2005) A support vector method for multivariate performance measures. In: Proceedings of the 22nd international conference on machine learning. ACM, pp 377–384
Li N, Jin R, Zhou ZH (2014) Top rank optimization in linear time. Adv Neural Inf Process Syst 2:1502–1510
Google Scholar
Liang RZ, Shi L, Wang H, Meng J, Wang JJY, Sun Q, Gu, Y (2016) Optimizing top precision performance measure of content-based image retrieval by learning similarity function. In: 2016 23st International conference on pattern recognition (ICPR). IEEE
Liang RZ, Xie W, Li W, Wang H, Wang JJY, Taylor L (2016) A novel transfer learning method based on common space mapping and weighted domain matching. In: 2016 IEEE 28th international conference on tools with artificial intelligence (ICTAI)
Lin F, Wang J, Zhang N, Xiahou J, McDonald N (2016) Multi-kernel learning for multivariate performance measures optimization. Neural Comput Appl 1–13. arXiv:1508.06264v1
Lu S, Lu H, Kolarik WJ (2001) Multivariate performance reliability prediction in real-time. Reliab Eng Syst Saf 72(1):39–45
Article Google Scholar
Madsen ME, Konge L, Nørgaard LN, Tabor A, Ringsted C, Klemmensen Å, Ottesen B, Tolsgaard MG (2014) Assessment of performance measures and learning curves for use of a virtual-reality ultrasound simulator in transvaginal ultrasound examination. Ultrasound Obstet Gynecol 44(6):693–699
Article Google Scholar
Mao Q, Tsang IWH (2013) A feature selection method for multivariate performance measures. IEEE Trans Pattern Anal Mach Intell 35(9):2051–2063
Article Google Scholar
Martinel N, Piciarelli C, Micheloni C (2016) A supervised extreme learning committee for food recognition. Comput Vis Image Underst 148:67–86
Article Google Scholar
Meyen A, Sooriyarachchi M (2016) Simulation study of a novel method for comparing more than two independent receiver operating characteristic (ROC) curves based on the area under the curves (AUCS). J Natl Sci Found Sri Lanka 43(4):357–367
Article Google Scholar
Patel M, Agius S, Wilkinson J, Patel L, Baker P (2016) Value of supervised learning events in predicting doctors in difficulty. Med Educ 50(7):746–756
Article Google Scholar
Russakovsky O, Deng J, Su H, Krause J, Satheesh S, Ma S, Huang Z, Karpathy A, Khosla A, Bernstein M, Berg A, Fei-Fei L (2015) Imagenet large scale visual recognition challenge. Int J Comput Vis 115(3):211–252
Article MathSciNet Google Scholar
Shih SM, Wu WH, Hsieh HN (2016) A non-inferiority test for diagnostic accuracy in the absence of the golden standard test based on the paired partial areas under receiver operating characteristic curves. J Appl Stat 43(3):550–562
Article MathSciNet Google Scholar
Xu S, Xu L, Zhan Z, Ye K, Han K, Born F (2014) Method and system for resilient and adaptive detection of malicious websites. US Patent WO2013184653 A1
Sofotasios PC, Fikadu MK, Ho-Van K, Valkama M, Karagiannidis GK (2014) The area under a receiver operating characteristic curve over enriched multipath fading conditions. In: 2014 IEEE global communications conference. IEEE, pp 3490–3495
Sun F, Guo J, Lan Y, Xu J, Cheng X (2015) Learning word representations by jointly modeling syntagmatic and paradigmatic relations. In: Proceedings of the 53rd annual meeting of the association for computational linguistics and the 7th international joint conference on natural language processing, Beijing, China, 26–31 July 2015, pp 136–145
Takeda A, Kanamori T (2014) Using financial risk measures for analyzing generalization performance of machine learning models. Neural Netw 57:29–38
Article MATH Google Scholar
Tang B, Liu X, Lei J, Song M, Tao D, Sun S, Dong F (2016) Deepchart: combining deep convolutional networks and deep belief networks in chart classification. Signal Process 124:156–161
Article Google Scholar
Wang CY, Peng DY, Xu L, Yi XS (2007) Gradual gray-watermark embedding algorithm in the wavelet domain. J Comput Appl 6:025
Google Scholar
Wang J, Wang H, Zhou Y, McDonald N (2015) Multiple kernel multivariate performance learning using cutting plane algorithm. In: 2015 IEEE international conference on systems, man, and cybernetics (SMC). IEEE, pp 1870–1875
Wang JJY, Tsang IWH, Gao X (2016) Optimizing multivariate performance measures from multi-view data. In: Thirtieth AAAI conference on artificial intelligence, pp 2152—2158
Wang L, Scott K, Xu L, Clausi D (2016) Sea ice concentration estimation during melt from dual-pol SAR scenes using deep convolutional neural networks: a case study. IEEE Trans Geosci Remote Sens 54(8):4524–4533
Article Google Scholar
Xu L, Zhan Z, Xu S, Ye K (2013) Cross-layer detection of malicious websites. In: Proceedings of the third ACM conference on data and application security and privacy. ACM, pp 141–152
Xu L, Zhan Z, Xu S, Ye K (2014) An evasion and counter-evasion study in malicious websites detection. In: 2014 IEEE conference on communications and network security (CNS). IEEE, pp 265–273
Yang W, Jin L, Tao D, Xie Z, Feng Z (2016) Dropsample: a new training method to enhance deep convolutional neural networks for large-scale unconstrained handwritten chinese character recognition. Pattern Recogn 58:190–203
Article Google Scholar
Zahedi M, Sorkhi A (2013) Improving text classification performance using PCA and recall–precision criteria. Arab J Sci Eng 38(8):2095–2102
Article Google Scholar
Zhang P, Su W (2012) Statistical inference on recall, precision and average precision under random selection. In: Proceedings—2012 9th international conference on fuzzy systems and knowledge discovery, FSKD 2012, pp 1348–1352
Zokaei N, Burnett Heyes S, Gorgoraptis N, Budhdeo S, Husain M (2015) Working memory recall precision is a more sensitive index than span. J Neuropsychol 9(2):319–329
Article Google Scholar

Download references

Acknowledgements

The Foundation of modern educational technology research of Jiangsu Province (No. 2015-R-42631) and The University Natural Science Foundation of Jiangsu Province (14KJD520003).

Author information

Authors and Affiliations

College of Computer and Information, Hohai University, Nanjing, China
Qinfeng Li, Xiaofeng Zhou, Aihua Gu & Zonghua Li
Department of Basic, Jinling Institute of Technology, Nanjing, China
Qinfeng Li
King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
Ru-Ze Liang

Authors

Qinfeng Li
View author publications
You can also search for this author in PubMed Google Scholar
Xiaofeng Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Aihua Gu
View author publications
You can also search for this author in PubMed Google Scholar
Zonghua Li
View author publications
You can also search for this author in PubMed Google Scholar
Ru-Ze Liang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Qinfeng Li.

Ethics declarations

Conflict of interest

The authors declare that they have no conflicts of interests.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Li, Q., Zhou, X., Gu, A. et al. Nuclear norm regularized convolutional Max Pos@Top machine. Neural Comput & Applic 30, 463–472 (2018). https://doi.org/10.1007/s00521-016-2680-2

Download citation

Received: 03 August 2016
Accepted: 25 October 2016
Published: 18 November 2016
Issue Date: July 2018
DOI: https://doi.org/10.1007/s00521-016-2680-2

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Nuclear norm regularized convolutional Max Pos@Top machine

Abstract

Access this article

Similar content being viewed by others

SSD: Single Shot MultiBox Detector

Microsoft COCO: Common Objects in Context

ImageNet Large Scale Visual Recognition Challenge

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Nuclear norm regularized convolutional Max Pos@Top machine

Abstract

Access this article

Similar content being viewed by others

SSD: Single Shot MultiBox Detector

Microsoft COCO: Common Objects in Context

ImageNet Large Scale Visual Recognition Challenge

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation