Hypothesis Testing with Classifier Systems for Rule-Based Risk Prediction

Baronti, Flavio; Starita, Antonina

doi:10.1007/978-3-540-71783-6_3

Hypothesis Testing with Classifier Systems for Rule-Based Risk Prediction

Flavio Baronti¹ &
Antonina Starita¹

Conference paper

1349 Accesses
4 Citations

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 4447))

Abstract

Analysis of medical datasets has some specific requirements not always fulfilled by standard Machine Learning methods. In particular, heterogeneous and missing data must be tolerated, the results should be easily interpretable. Moreover, with genetic data, often the combination of two or more attributes leads to non-linear effects not detectable for each attribute on its own. We present a new ML algorithm, HCS, taking inspiration from learning classifier systems, decision trees and statistical hypothesis testing. We show the results of applying this algorithm to a well-known benchmark dataset, and to HNSCC, a dataset studying the connection between smoke and genetic patterns to the development of oral cancer.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Holland, J.H.: Adaptation. In: Rosen, R., Snell, F.M. (eds.) Progress in theoretical biology 4, Plenum, New York (1976)
Google Scholar
Passaro, A., Baronti, F., Maggini, V.: Exploring relationships between genotype and oral cancer development through xcs. In: Rothlauf, F. (ed.) GECCO 2005, Workshop Proceedings, Washington DC, USA, June 25-26, pp. 147–151 (2005)
Google Scholar
Itti, L., Baldi, P.: Bayesian surprise attracts human attention. In: Advances in Neural Information Processing Systems, NIPS*2005, vol. 19, pp. 1–8. MIT Press, Cambridge (2006)
Google Scholar
Goldberg, D.: Genetic Algorithms in Search, Optimization and Machine Learning. Addison-Wesley, Reading (1989)
MATH Google Scholar
Quinlan, J.R.: C4.5: programs for machine learning. Morgan Kaufmann, San Francisco (1993)
Google Scholar
Wilson, S.W.: Classifier fitness based on accuracy. Ev. Comp. 3(2) (1995)
Google Scholar
Holmes, J.H., Sager, J.A., Bilker, W.B.: Methods for covering missing data in XCS. In: Keijzer, M. (ed.) Late Breaking Papers at GECCO 2004, Seattle, WA (2004)
Google Scholar
Butz, M.V., Sastry, K., Goldberg, D.E.: Strong, stable, and reliable fitness pressure in XCS due to tournament selection. Genetic Programming and Evolvable Machines 6(1), 53–77 (2005)
Article Google Scholar
Shaffer, J.P.: Multiple hypothesis testing. Annual Review of Psychology 46, 561–584 (1995)
Article Google Scholar
Perneger, T.V.: What’s wrong with bonferroni adjustments. British Medical Journal 316, 1236–1238 (1998)
Google Scholar
Newman, D.J., Hettich, S., Blake, C.L., Merz, C.J.: UCI repository of machine learning databases (1998), http://www.ics.uci.edu/~mlearn/MLRepository.html
Brier, G.W.: Verification of forecasts expressed in terms of probability. Monthly weather review 78(1), 1–3 (1950)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Dipartimento di Informatica, Università di Pisa, Largo B. Pontecorvo, 3—56127 Pisa, Italy
Flavio Baronti & Antonina Starita

Authors

Flavio Baronti
View author publications
You can also search for this author in PubMed Google Scholar
Antonina Starita
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Elena Marchiori Jason H. Moore Jagath C. Rajapakse

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Baronti, F., Starita, A. (2007). Hypothesis Testing with Classifier Systems for Rule-Based Risk Prediction. In: Marchiori, E., Moore, J.H., Rajapakse, J.C. (eds) Evolutionary Computation,Machine Learning and Data Mining in Bioinformatics. EvoBIO 2007. Lecture Notes in Computer Science, vol 4447. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-71783-6_3

Download citation

DOI: https://doi.org/10.1007/978-3-540-71783-6_3
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-71782-9
Online ISBN: 978-3-540-71783-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics