Abstract
Knowledge Discovery in Databases is concerned with the development of methods and techniques for making sense of data. Its aim is to model the shapes of distributions and to discover patterns. During the knowledge acquisition process choices are made. Uncertainty and ambiguity hinder the process and “poor” choices cannot be avoided. Uncertainty corresponds to situations in which the choices are unclear and/or their consequences difficult to measure. Ambiguity arises from the lack of context, there is not sufficient information to assure the success of the choice, thus causing confusion. And decision making is hampered by perceptions of uncertainty and ambiguity.
Medical diagnosis/prognosis is a complex decision making process. In this paper we present a comparative study of model ambiguity on breast cancer predictions. Automatic classification of breast cancer on mammograms using two models, logistic regression and artificial neural networks are considered. The models were trained and tested to separate malignant and benign tumors for two different scenarios. Ambiguity in the prediction was studied for the different models.
The results show that a measure of uncertainty is practical to explain observable phenomena, such as medical data. Since model ambiguity can rarely be avoided, ordering alternative models by their degree of ambiguity is crucial in medical decision making processes. Furthermore, the levels of uncertainty and ambiguity are relevant in the knowledge representation process and open up new possibilities for richer Data Mining tasks.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Hand, D.J.: Principles of data mining. Drug Saf. 30(7), 621–622 (2007)
Bishop, C.M.: Pattern Recognition and Machine Learning (Information Science and Statistics). Springer, Berlin (2006)
Pearl, J.: Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference. Morgan Kaufmann Publishers Inc., San Francisco (1988)
Hastie, T., Tibshirani, R., Friedman, J.: The Elements of Statistical Learning: Data Mining, Inference and Prediction, 2nd edn. Springer, Heidelberg (2009)
Ward, J.S., Barker, A.: Undefined by data: a survey of big data definitions. CoRR, 1309.5821 (2013)
Anderson, C.: The end of theory: the data deluge makes the scientific method obsolete. Wired (2008). http://www.wired.com/science/discoveries/magazine/16-07/pb_theory. Accessed 12 May 2019
Elter, M., et al.: The prediction of breast cancer biopsy outcomes using two CAD approaches that both emphasize an intelligible decision process. Med. Phys. 34(11), 4164–4172 (2007)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Miró-Julià, M., Ruiz-Miró, M.J., García Mosquera, I. (2020). Knowledge Discovery: From Uncertainty to Ambiguity and Back. In: Moreno-Díaz, R., Pichler, F., Quesada-Arencibia, A. (eds) Computer Aided Systems Theory – EUROCAST 2019. EUROCAST 2019. Lecture Notes in Computer Science(), vol 12013. Springer, Cham. https://doi.org/10.1007/978-3-030-45093-9_3
Download citation
DOI: https://doi.org/10.1007/978-3-030-45093-9_3
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-45092-2
Online ISBN: 978-3-030-45093-9
eBook Packages: Computer ScienceComputer Science (R0)