Confidence Transformation for Combining Classifiers

Liu, Cheng-Lin; Hao, Hongwei; Sako, Hiroshi

doi:10.1007/s10044-003-0199-5

Confidence Transformation for Combining Classifiers

Original Article
Published: 10 March 2004

Volume 7, pages 2–17, (2004)
Cite this article

Pattern Analysis and Applications Aims and scope Submit manuscript

Cheng-Lin Liu¹,
Hongwei Hao² &
Hiroshi Sako¹

273 Accesses
18 Citations
Explore all metrics

Abstract

This paper investigates a number of confidence transformation methods for measurement-level combination of classifiers. Each confidence transformation method is the combination of a scaling function and an activation function. The activation functions correspond to different types of confidences: likelihood (exponential), log-likelihood, sigmoid, and the evidence combination of sigmoid measures. The sigmoid and evidence measures serve as approximates to class probabilities. The scaling functions are derived by Gaussian density modeling, logistic regression with variable inputs, etc. We test the confidence transformation methods in handwritten digit recognition by combining variable sets of classifiers: neural classifiers only, distance classifiers only, strong classifiers, and mixed strong/weak classifiers. The results show that confidence transformation is efficient to improve the combination performance in all the settings. The normalization of class probabilities to unity of sum is shown to be detrimental to the combination performance. Comparing the scaling functions, the Gaussian method and the logistic regression perform well in most cases. Regarding the confidence types, the sigmoid and evidence measures perform well in most cases, and the evidence measure generally outperforms the sigmoid measure. We also show that the confidence transformation methods are highly robust to the validation sample size in parameter estimation.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Multi-step Training of a Generalized Linear Classifier

Article 29 September 2018

Multiple classifiers fusion and CNN feature extraction for handwritten digits recognition

Article Open access 22 February 2019

Combination of Linear Classifiers Using Score Function – Analysis of Possible Combination Strategies

References

Mandler E, Schürman J. Combining the classification results of independent classifiers based on the Dempster-Shafer theory of evidence. In: Gelsema ES, Kanal LN (eds). Pattern Recognition and Artificial Intelligence. Elsevier, 1988, pp.381–393.
Xu L, Krzyzak A, Suen CY. Methods of combining multiple classifiers and their applications to handwriting recognition. IEEE Trans. System, Man, and Cybernetics 1992; 22(3): 418–435.
Google Scholar
Ho TK, Hull J, Srihari SN. Decision combination in multiple classifier systems. IEEE Trans. Pattern Analysis and Machine Intelligence 1994; 16(1): 66–75.
Google Scholar
Kittler J, Hatef M, Duin RPW, Matas J. On combining classifiers. IEEE Trans. Pattern Analysis and Machine Intelligence 1998; 20(3): 226–239.
Google Scholar
Suen CY, Lam L. Multiple classifier combination methodologies for different output levels. In: Kittler J, Roli F (eds). Multiple Classifier Systems, LNCS 1857. Springer, 2000, pp.52–66.
Rahman AF, Fairhurst MC. A novel confidence-based framework for multiple expert decision fusion. In: Carter N, Nixon MS (eds). Proc. 9th British Machine Vision Conference, 1998.
Bengio S, Marcel C, Marcel S, Mariethoz J. Confidence measures for multimodal identity identification. Information Fusion 2002; 3(4): 267–276.
Google Scholar
Duin RPW. The combining classifiers: to train or not to train. In: Proc. 16th International Conference on Pattern Recognition, Vol.2. Quebec, Canada, 2002, pp.765–770.
Liu CL, Nakagawa M. Precise candidate selection for large character set recognition by confidence evaluation. IEEE Trans. Pattern Analysis and Machine Intelligence 2000; 22(6): 636–642.
Google Scholar
Ruck DW, Rogers SK, Kabrisky M, Oxley ME, Suter BW. The multilayer perceptron as an approximation to a Bayes optimal discriminant function. IEEE Trans. Neural Networks 1990; 1(4): 296–298.
Google Scholar
Richard MD, Lippmann RP. Neural network classifiers estimate Bayesian a posteriori probabilities. Neural Computation 1991; 4:461–483.
Google Scholar
Duda RO, Hart PE, Stork DG, Pattern Classification, 2nd edition. Wiley-Interscience, 2001.
Cordella LP, Foggia P, Sansone C, Tortorella F, Vento M. Reliability parameters to improve combination strategies in multi-expert systems. Pattern Analysis and Applications 1999; 2(3): 205–214.
Google Scholar
Atukorale AS, Suganthan PN. Combining classifiers based on confidence values. In Proc. 5th International Conference on Document Analysis and Recognition. Bangalore, India, 1999, pp.37–40.
Lin X, Ding X, Chen M, Zhang R, Wu Y. Adaptive confidence transform based classifier combination for Chinese character recognition. Pattern Recognition Letters 1998; 19:975–988.
Google Scholar
Denker JS, LeCun Y. Transforming neural-net output levels to probability distribution. In: Lippmann RP, Moody JE, Touretzky DS (eds). Advances in Neural Information Processing 3. Morgan Kauffman, 1991, pp.853–859.
Hoekstra A, Tholen SA, Duin RPW. Estimating the reliability of neural network classification. In Proc. International Conference on Artificial Neural Networks. Bochum, Germany, 1996, pp.53–58.
Duin RPW, Tax DMJ. Classifier conditional posterior probabilities. In: Amin A, Dori D, Pudil P, Fremman H (eds). Advances in Pattern Recognition, LNCS 1451. Springer, 1998, pp.611–619.
Gillick L, Ito Y, Young J. A probabilistic approach to confidence estimation and evaluation. In Proc. International Conference on Acoustics, Speech, and Signal Processing, vol.2. Munich, Germany, 1997, pp.879–882.
Platt J. Probabilistic outputs for support vector machines and comparisons to regularized likelihood methods, In: Smola AJ, Bartlett P, Scholkopf D, Schuurmanns D (eds). Advances in Large Margin Classifiers. MIT Press, 1999.
Gorski N. Practical combination of multiple classifiers, In: Downton AC, Impedovo S (eds), Progress of Handwriting Recognition. World Scientific, 1997.
Wei W, Leen TK, Barnard E. A fast histogram-based postprocessor that improves posterior probability estimates. Neural Computation 1999; 11(5): 1235–1248.
Google Scholar
Schürmann J, Pattern Classification: A United View of Statistical and Neural Approaches. Wiley-Interscience, 1996.
Hao H, Liu CL, Sako H. Confidence evaluation for combining diverse classifiers. In Proc. 7th International Conference on Document Analysis and Recognition. Edinburgh, Scotland, 2003, pp.760–764.
Hashem S. Optimal linear combinations of neural networks. Neural Networks 1997; 10(4): 599–614.
Google Scholar
Ueda N. Optimal linear combination of neural networks for improving classification performance. IEEE Trans. Pattern Analysis and Machine Intelligence 2000; 22(2): 207–215.
Google Scholar
Lee DS, Srihari SN. A theory of classifier combination: the neural network approach. In Proc. 3rd International Conference on Document Analysis and Recognition. Montreal, 1995, pp.42–45.
Google Scholar
Duin RPW, Tax DMJ. Experiments with classifier combining rules. In: Kittler J, Roli F (eds). Multiple Classifier Systems, LNCS 1857. Springer, 2000, pp.16-29.
Kuncheva LI, Bezdek JC, Duin RPW. Decision templates for multiple classifier fusion: an experimental comparison. Pattern Recognition 2001; 34(2): 299–314.
Google Scholar
Shafer G, A Mathematical Theory of Evidence. Princeton Univ. Press, 1976.
Barnett JA. Computational methods for a mathematical theory of evidence. In Proc. 7th International Joint Conference on Artificial Intelligence. Vancouver, Canada, 1981, pp.868–875.
Rogova G. Combining the results of several neural network classifiers. Neural Networks 1994; 7(5): 777–781.
Google Scholar
Tomai CI, Srihari SN. Combination of type III digit recognizers using the Dempster-Shafer theory of edivence. In Prof. 7th International Conference on Document Analysis and Recognition. Edinburgh, 2003, pp.854–858.
Jain AK, Prabhakar S, Chen S. Combining multiple matches for a high security fingerprint verification system. Pattern Recognition Letters 1999; 20(11–13): 1371–1379.
Google Scholar
Wu L, Oviatt SL, Cohen PR. From members to teams to committee—a robust approach to gestural and multimodal recognition. IEEE Trans. Neural Networks 2002; 13(4): 972–982.
Google Scholar
Bridle JS. Probabilistic interpretation of feedforward classification network outputs, with relationships to statistical pattern recognition. In: Fogelman-Soulie, Herault (eds). Neurocomputing: Algorithms, Architectures and Applications. Springer, 1990, pp.227–236.
Robbins H, Monro S. A stochastic approximation method. Annals of Mathematical Statistics 1951; 22:400–407.
Google Scholar
Liu CL, Sako H, Fujisawa H. Performance evaluation of pattern classifiers for handwritten character recognition. Int. J. Document Analysis and Recognition 2002; 4(3): 191–204.
Google Scholar
Liu CL, Nakashima K, Sako H, Fujisawa H. Handwritten digit recognition: benchmarking of state-of-the-art techniques. Pattern Recognition 2003; 36(10): 2271–2285.
Google Scholar
Liu CL, Nakashima K, Sako H, Fujisawa H. Handwritten digit recognition: investigation of normalization and feature extraction techniques. Pattern Recognition 2003; 37(2):265–279
Google Scholar
Hamanaka M, Yamada K, Tsukumo J. Normalization-cooperated feature extraction method for handprinted Kanji character recognition. In Proc. 3rd International Workshop on Frontiers of Handwriting Recognition. Buffalo, NY, 1993, pp.343-348.
Liu CL, Liu YJ, Dai RW. Preprocessing and statistical/structural feature extraction for handwritten numeral recognition. In: Downton AC, Impedovo S (eds). Progress of Handwriting Recognition. World Scientific, 1997, pp.161-168.
Liu CL, Koga M, Sako H, Fujisawa H. Aspect ratio adaptive normalization for handwritten character recognition. In: Tan T, Shi Y, Gao W (eds). Advances in Multimodal Interfaces—ICMI2000, LNCS 1948. Springer, 2000, pp.418–425.
Bishop CM, Neural Networks for Pattern Recognition. Claderon Press, Oxford, 1995.
Kreßel U, Schürmann J. Pattern classification techniques based on function approximation. In: Bunke H, Wang PSP (eds). Handbook of Character Recognition and Document Image Analysis. World Scientific, 1997, pp.49–78.
Liu CL, Nakagawa M. Evaluation of prototype learning algorithms for nearest neighbor classifier in application to handwritten character recognition. Pattern Recognition 2001; 34(3): 601–615.
Google Scholar
Liu CL, Sako H, Fujisawa H. Learning quadratic discriminant function for handwritten character recognition. In Proc. 16th International Conference on Pattern Recognition, vol.4. Quebec, Canada, 2002, pp.44–47.
Grother PJ, NIST special database 19: handprinted forms and characters database. Technical report and CDROM, 1995.

Download references

Acknowledgements

The work of Hongwei Hao was done when he was working at the Hitachi Central Research Laboratory. The authors would thank Kazuki Nakashima and Ryuji Mine for providing the datasets.

Author information

Authors and Affiliations

Central Research Laboratory, Hitachi, Ltd. 1-280 Higashi-koigakubo, Kokubunji-shi, Tokyo 185-8601, Japan
Cheng-Lin Liu & Hiroshi Sako
Department of Computer Science, University of Science and Technology Beijing, Beijing 100083, P.R. China
Hongwei Hao

Authors

Cheng-Lin Liu
View author publications
You can also search for this author in PubMed Google Scholar
Hongwei Hao
View author publications
You can also search for this author in PubMed Google Scholar
Hiroshi Sako
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Cheng-Lin Liu.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Liu, CL., Hao, H. & Sako, H. Confidence Transformation for Combining Classifiers. Pattern Anal Applic 7, 2–17 (2004). https://doi.org/10.1007/s10044-003-0199-5

Download citation

Received: 23 May 2003
Accepted: 24 October 2003
Published: 10 March 2004
Issue Date: April 2004
DOI: https://doi.org/10.1007/s10044-003-0199-5

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Confidence Transformation for Combining Classifiers

Abstract

Access this article

Similar content being viewed by others

Multi-step Training of a Generalized Linear Classifier

Multiple classifiers fusion and CNN feature extraction for handwritten digits recognition

Combination of Linear Classifiers Using Score Function – Analysis of Possible Combination Strategies

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Confidence Transformation for Combining Classifiers

Abstract

Access this article

Similar content being viewed by others

Multi-step Training of a Generalized Linear Classifier

Multiple classifiers fusion and CNN feature extraction for handwritten digits recognition

Combination of Linear Classifiers Using Score Function – Analysis of Possible Combination Strategies

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation