Abstract
The extraction of hidden predictive information from large databases is possible with data mining. Anemia is the most common disorder of the blood. Anemia can be classified in a variety of ways, based on the morphology of RBCs, etiology, etc . In this paper we present an analysis of the prediction and classification of anemia in patients using data mining techniques. The dataset constructed from complete blood count test data from various hospitals. We have worked out with classification method C4.5 decision tree algorithm and Support vector machine which are implemented as J48 and SMO(sequential minimal optimization) in Weka. Several experiments are conducted using these algorithms. The decision ree for classification of anemia is generated which gives best possible classification of anemia based on CBC reports along with severity of anemia. We have observed that C4.5 algorithm has best performance with highest accuracy.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Razali, A.M., Ali, S.: Generating Treatment Plan in Medicine: A Data Mining Approach. American Journal of Applied Sciences 6(2), 345–351 (2009), ISSN 1546- 9239 © 2009 Science Publications
Schmaier, A.H., Petruzzelli, L.M.: Hematology for the medical student. Lippincott Wiliams and Wilkins 25
Bernadette, F.R., Doig, K.: Hematology: Clinical features & applications, 3rd edn., pp. 227–230
Ed Uthman’s homepage; Anemia Pathophysiologic Consequences, Classification, and Clinical Investigation (2009), http://web2.airmail.net/uthman/anemia/anemia.html
Fischbach, F.T.: A manual of laboratory & diagnostic tests, 6th edn. (2008)
Han, J., Kamber, M.: Data Mining Concepts and Techniques, 2nd edn. (2001)
Hematology complete blood count, http://www.meddean.luc.edu/lumen/MedEd/MEDICINE/medclerk/2004_05/level1/CBCAnemia/cbc_f.htm
Witten, I.H., Frank, E.: Data Mining: Practical machine learning tools and techniques, 2nd edn. Morgan Kaufmann, San Fransisco (2005)
Quinlan, J.R.: C4.5: Programs for Machine Learning. Morgan Kaufmann, San Mateo (2003)
Cadez, L.V., MacLaren, C.E., Smyth, P., McLachlan, G.J.: Hierarchical model for screening Iron Deficiency Anemia. Technical report no 99-14, Department of Information and Computer Science, University of California, Irvine
Wood, M.E., Philips, G.K.: Hematology/oncology secrets, 3rd edn., pp. 20–21
Medical Technology; RBC indices and anemia classification, http://www.irvingcrowley.com/cls/anemia.htm
Practical Utilization of the Complete Blood Count. Joseph M. Flynn, D.O.,MPH, FACP. Division Hematology-Oncology. THE Ohio State University, Columbus, OH (April 2008), sciocountrymedicalsociety.org/documents/CBC_Flynn.PPT
Ravel, R.: Clinical laboratory medicine: Clinical application of laboratory data, 6th edn., pp. 13–14 (1993)
Weka: Data Mining Software in Java, http://www.cs.waikato.ac.nz/ml/weka/
Beck, W.S.: Hematology, 5th edn., pp. 604–613
Cortes, C., Vapnik, V.: Support-vector networks. Machine Learning 20, 273–297 (1995)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Sanap, S.A., Nagori, M., Kshirsagar, V. (2011). Classification of Anemia Using Data Mining Techniques. In: Panigrahi, B.K., Suganthan, P.N., Das, S., Satapathy, S.C. (eds) Swarm, Evolutionary, and Memetic Computing. SEMCCO 2011. Lecture Notes in Computer Science, vol 7077. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-27242-4_14
Download citation
DOI: https://doi.org/10.1007/978-3-642-27242-4_14
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-27241-7
Online ISBN: 978-3-642-27242-4
eBook Packages: Computer ScienceComputer Science (R0)