Abstract
In a previous large-scale empirical evaluation of conformal regression approaches, random forests using out-of-bag instances for calibration, together with a k-nearest-neighbor-based nonconformity measure, were shown to obtain state-of-the-art performance with respect to efficiency, i.e., the average size of the prediction regions. However, the nearest-neighbor procedure not only requires that all training data be retained alongside the underlying model, but also incurs a significant computational overhead during both training and testing. In this study, a more straightforward nonconformity measure is investigated, where the difficulty estimate employed for normalization is based on the variance of the predictions made by the trees in the forest. A large-scale empirical evaluation is presented, showing that both the nearest-neighbor-based and the variance-based measures significantly outperform a standard (non-normalized) nonconformity measure, while no significant difference in efficiency between the two normalized approaches is observed. Moreover, the evaluation shows that the variance-based measure achieves state-of-the-art performance at a computational cost several orders of magnitude lower than that of the nearest-neighbor-based measure.
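The variance-based measure described above can be sketched as follows. This is an illustrative approximation only, not the authors' exact setup: the paper calibrates on out-of-bag predictions, whereas this sketch uses a separate calibration split for simplicity, and the smoothing constant `beta` is a hypothetical choice. The idea is to normalize the absolute residual on each calibration instance by the standard deviation of the individual tree predictions, so that prediction intervals become wider for instances the forest finds difficult.

```python
# Sketch: inductive conformal regression with a variance-based
# normalized nonconformity measure (assumed setup, see lead-in).
import numpy as np
from sklearn.datasets import make_regression
from sklearn.ensemble import RandomForestRegressor
from sklearn.model_selection import train_test_split

X, y = make_regression(n_samples=600, n_features=8, noise=10.0, random_state=0)
X_train, X_rest, y_train, y_rest = train_test_split(X, y, test_size=0.5, random_state=0)
X_cal, X_test, y_cal, y_test = train_test_split(X_rest, y_rest, test_size=0.5, random_state=0)

forest = RandomForestRegressor(n_estimators=100, random_state=0).fit(X_train, y_train)

def tree_stats(model, X):
    """Mean and standard deviation of the individual tree predictions."""
    per_tree = np.stack([tree.predict(X) for tree in model.estimators_])
    return per_tree.mean(axis=0), per_tree.std(axis=0)

beta = 1.0  # hypothetical smoothing term; avoids division by near-zero variance
mu_cal, sd_cal = tree_stats(forest, X_cal)
alphas = np.abs(y_cal - mu_cal) / (sd_cal + beta)  # normalized nonconformity scores

significance = 0.1  # target 90% prediction intervals
# Standard inductive conformal quantile: the ceil((1-eps)(n+1))-th smallest score.
k = int(np.ceil((1 - significance) * (len(alphas) + 1))) - 1
alpha_s = np.sort(alphas)[min(k, len(alphas) - 1)]

mu_test, sd_test = tree_stats(forest, X_test)
half_width = alpha_s * (sd_test + beta)  # difficulty-scaled interval half-widths
lower, upper = mu_test - half_width, mu_test + half_width
coverage = np.mean((y_test >= lower) & (y_test <= upper))
```

Note that, unlike the nearest-neighbor-based measure, computing `sd_test` reuses the per-tree predictions that the forest produces anyway, which is what makes the variance-based approach so much cheaper at test time.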
Notes
- 1.
- 2. The Julia implementation can be obtained from the first author upon request.
Acknowledgments
This work was supported by the Swedish Foundation for Strategic Research through the project High-Performance Data Mining for Drug Effect Detection (IIS11-0053), the Vinnova program for Strategic Vehicle Research and Innovation (FFI)-Transport Efficiency, and the Knowledge Foundation through the project Data Analytics for Research and Development (20150185).
Copyright information
© 2016 Springer International Publishing Switzerland
Cite this paper
Boström, H., Linusson, H., Löfström, T., Johansson, U. (2016). Evaluation of a Variance-Based Nonconformity Measure for Regression Forests. In: Gammerman, A., Luo, Z., Vega, J., Vovk, V. (eds) Conformal and Probabilistic Prediction with Applications. COPA 2016. Lecture Notes in Computer Science(), vol 9653. Springer, Cham. https://doi.org/10.1007/978-3-319-33395-3_6
DOI: https://doi.org/10.1007/978-3-319-33395-3_6
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-33394-6
Online ISBN: 978-3-319-33395-3