Abstract
Diabetes is a disorder of the metabolism where the amount of glucose in the blood is too high because the body cannot produce or properly use insulin. In order to achieve more effective diabetes clinic management, data mining techniques have been applied to a patient database. In an attempt to improve the efficiency of data mining algorithms, a feature selection technique ReliefF is used with the data, which can rank the important attributes affecting Type 2 diabetes control. After selecting suitable attributes, classification techniques are applied to the data to predict how well the patients are controlling their condition. Preliminary results have been confirmed by the clinician and this provides optimism that data mining can be used to generate prediction models.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Diamond, HICOM Technology (2002), http://www.hicom.co.uk/news.asp?NewsID=8
IDF–International Diabetes Federation, Diabetes Atlas, 2nd edn. (2003)
American Association, Medical Guidelines for the Management of Diabetes Mellitus. Endocrine Practice 8 (suppl. 1), 40–82 (2002)
Diabetes Control and Complications Trial Research Group: The effect of intensive treat ment of diabetes on the development and progression of long-term complications in insulin-dependent diabetes mellitus. N. Engl. J. Med. 329, 977–986 (1993)
UKPDS. Intensive blood-glucose control with sulphonylureas orinsulin compared with conventional treatment and risk ofcomplications in patients with type 2 diabetes. Lancet. 352, 837–853 (1998)
UKPDS. Effect of intensive blood-glucose control with metformin on complications in overweight patients with type 2 diabetes. Lancet. 352, 854–865 (1998)
American Diabetes Association, About us American Diabetes Association (2004), http://www.diabetes.org/aboutus.jsp?WTLPromo=HEADER_aboutus&vms=142585600057
Strattpm, I.M., Adler, A.I., Neil, H.A.W.: Association of Glycaemia with Macrovascular and Microvascular complications of Type 2 Diabetes. Br. Med. J. 321, 405–412 (2000)
Lavrac, N.: Selected Techniques for Data Mining in Medicine. AI Med. 16(1), 3–23 (2002)
Breault, J., Goodall, C., Fos, P.: Data mining a diabetic data warehouse. Artif. Intell. Med. 26(1-2), 37 (2002)
Miyaki, K., Takei, I., Watanabe, K., Nakashima, H., Watanabe, K., Omae, K.: Novel statistical classification model of Type 2 diabetes mellitus patients for trailor-made prevention using data mining algorithm. J. Epidemiol 12(3), 243–248 (2002)
SPSS Inc., Target the right people more effectively— AnswerTree, http://www.aspiresoftwareintl.com/html/spss_answer_tree.html
Rohlfing, C.L., Wiedmeyer, H.M., Little, R.R., Jack, D.E., Tennill, A., Goldstein, D.E.: Defining the Relationship between Plasma Glucose and HbA1c, Analysis of glucose profiles and HbA1c in the Diabetes Control and Complications Trial. Diabetes Care 25, 275–278 (2002)
Richards, G., Rayward-Smith, V.J., Sonksen, P.H., Carey, S., Weng, C.: Mining for indicators of early mortality in a database of clinical records. Artif. Intell. Med. 22(3), 215–231 (2001)
Stepaniuk, J.: Rough set based data mining in diabetes mellitus data table. In: Proceedings of the Sixth European Congress on Intelligent Techniques and Soft Computing (EUFIT 1998), Aachen, Germany, September 7-10, vol. 2, pp. 980–984 (1998)
Diabetes UK, Diabetes in Northern Ireland (March 2004), http://www.diabetes.org.uk/n.ireland/nireland.htm
Kononenko, I.: Estimating attributes: Analysis and extensions of Relief. In: Proceeding of the Seventh European Conference on Machine Learning, pp. 171–182. Springer, Heidelberg (1994)
Hall, M.A., Holmes, G.: Benchmarking Attribute Selection Techniques for Discrete Class Data Mining. IEEE Transactions on Knowledge and Data Engineering, IEEE 15(3), 1437–1447 (2003)
Demsar, J., Zupan, B., Aoki, N., Wall, M.J., Granchi, T.H., Beck, J.R.: Feature Mining and Predictive Model Construction from Severe Trauma Patient’s Data. Int. J. Med. Inf. 63, 41–50 (2001)
Molina, L., Belanche, L., Nebot, A.: Feature Selection Algorithms: A Survey and Experimental Evaluation. In: Proceeding of IEEE International Conference on Data Mining. IEEE, pp. 306–313 (2002)
Perner, P.: Improving the Accuracy of Decision Tree Induction by Feature Pre-Selection. Applied Artificial Intelligence 15(8), 747–760 (2001)
Witten, I.H., Frank, E.: Data Mining: Practical Machine Learning Tools and Techniques with Java Implementations. Morgan Kaufmann, San Francisco (1999)
Turney, P.: Theoretical Analysis of Cross-Validation Error and Voting in Instance-Based Learning. J. Experimental and Theoretical Artificial Intelligence 6, 361–391 (1994)
Colombet, I., Ruelland, A., Chatellier, G., et al.: Models to predict cardiovascular risk: comparison of CART, Multilayer perceptron and logistic regression. In: Proc AMIA Symp, pp. 156–160 (2000)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Huang, Y., McCullagh, P., Black, N., Harper, R. (2004). Feature Selection and Classification Model Construction on Type 2 Diabetic Patient’s Data. In: Perner, P. (eds) Advances in Data Mining. ICDM 2004. Lecture Notes in Computer Science(), vol 3275. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30185-1_17
Download citation
DOI: https://doi.org/10.1007/978-3-540-30185-1_17
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-24054-9
Online ISBN: 978-3-540-30185-1
eBook Packages: Computer ScienceComputer Science (R0)