Feature Selection and Classification Model Construction on Type 2 Diabetic Patient’s Data

Huang, Yue; McCullagh, Paul; Black, Norman; Harper, Roy

doi:10.1007/978-3-540-30185-1_17

Yue Huang¹⁹,
Paul McCullagh¹⁹,
Norman Black¹⁹ &
…
Roy Harper²⁰

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3275))

Included in the following conference series:

Industrial Conference on Data Mining

829 Accesses
4 Citations

Abstract

Diabetes is a disorder of the metabolism where the amount of glucose in the blood is too high because the body cannot produce or properly use insulin. In order to achieve more effective diabetes clinic management, data mining techniques have been applied to a patient database. In an attempt to improve the efficiency of data mining algorithms, a feature selection technique ReliefF is used with the data, which can rank the important attributes affecting Type 2 diabetes control. After selecting suitable attributes, classification techniques are applied to the data to predict how well the patients are controlling their condition. Preliminary results have been confirmed by the clinician and this provides optimism that data mining can be used to generate prediction models.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Diamond, HICOM Technology (2002), http://www.hicom.co.uk/news.asp?NewsID=8
IDF–International Diabetes Federation, Diabetes Atlas, 2nd edn. (2003)
Google Scholar
American Association, Medical Guidelines for the Management of Diabetes Mellitus. Endocrine Practice 8 (suppl. 1), 40–82 (2002)
Google Scholar
Diabetes Control and Complications Trial Research Group: The effect of intensive treat ment of diabetes on the development and progression of long-term complications in insulin-dependent diabetes mellitus. N. Engl. J. Med. 329, 977–986 (1993)
Google Scholar
UKPDS. Intensive blood-glucose control with sulphonylureas orinsulin compared with conventional treatment and risk ofcomplications in patients with type 2 diabetes. Lancet. 352, 837–853 (1998)
Google Scholar
UKPDS. Effect of intensive blood-glucose control with metformin on complications in overweight patients with type 2 diabetes. Lancet. 352, 854–865 (1998)
Google Scholar
American Diabetes Association, About us American Diabetes Association (2004), http://www.diabetes.org/aboutus.jsp?WTLPromo=HEADER_aboutus&vms=142585600057
Strattpm, I.M., Adler, A.I., Neil, H.A.W.: Association of Glycaemia with Macrovascular and Microvascular complications of Type 2 Diabetes. Br. Med. J. 321, 405–412 (2000)
Article Google Scholar
Lavrac, N.: Selected Techniques for Data Mining in Medicine. AI Med. 16(1), 3–23 (2002)
MathSciNet Google Scholar
Breault, J., Goodall, C., Fos, P.: Data mining a diabetic data warehouse. Artif. Intell. Med. 26(1-2), 37 (2002)
Article Google Scholar
Miyaki, K., Takei, I., Watanabe, K., Nakashima, H., Watanabe, K., Omae, K.: Novel statistical classification model of Type 2 diabetes mellitus patients for trailor-made prevention using data mining algorithm. J. Epidemiol 12(3), 243–248 (2002)
Article Google Scholar
SPSS Inc., Target the right people more effectively— AnswerTree, http://www.aspiresoftwareintl.com/html/spss_answer_tree.html
Rohlfing, C.L., Wiedmeyer, H.M., Little, R.R., Jack, D.E., Tennill, A., Goldstein, D.E.: Defining the Relationship between Plasma Glucose and HbA1c, Analysis of glucose profiles and HbA1c in the Diabetes Control and Complications Trial. Diabetes Care 25, 275–278 (2002)
Article Google Scholar
Richards, G., Rayward-Smith, V.J., Sonksen, P.H., Carey, S., Weng, C.: Mining for indicators of early mortality in a database of clinical records. Artif. Intell. Med. 22(3), 215–231 (2001)
Article Google Scholar
Stepaniuk, J.: Rough set based data mining in diabetes mellitus data table. In: Proceedings of the Sixth European Congress on Intelligent Techniques and Soft Computing (EUFIT 1998), Aachen, Germany, September 7-10, vol. 2, pp. 980–984 (1998)
Google Scholar
Diabetes UK, Diabetes in Northern Ireland (March 2004), http://www.diabetes.org.uk/n.ireland/nireland.htm
Kononenko, I.: Estimating attributes: Analysis and extensions of Relief. In: Proceeding of the Seventh European Conference on Machine Learning, pp. 171–182. Springer, Heidelberg (1994)
Google Scholar
Hall, M.A., Holmes, G.: Benchmarking Attribute Selection Techniques for Discrete Class Data Mining. IEEE Transactions on Knowledge and Data Engineering, IEEE 15(3), 1437–1447 (2003)
Article Google Scholar
Demsar, J., Zupan, B., Aoki, N., Wall, M.J., Granchi, T.H., Beck, J.R.: Feature Mining and Predictive Model Construction from Severe Trauma Patient’s Data. Int. J. Med. Inf. 63, 41–50 (2001)
Article Google Scholar
Molina, L., Belanche, L., Nebot, A.: Feature Selection Algorithms: A Survey and Experimental Evaluation. In: Proceeding of IEEE International Conference on Data Mining. IEEE, pp. 306–313 (2002)
Google Scholar
Perner, P.: Improving the Accuracy of Decision Tree Induction by Feature Pre-Selection. Applied Artificial Intelligence 15(8), 747–760 (2001)
Article Google Scholar
Witten, I.H., Frank, E.: Data Mining: Practical Machine Learning Tools and Techniques with Java Implementations. Morgan Kaufmann, San Francisco (1999)
Google Scholar
Turney, P.: Theoretical Analysis of Cross-Validation Error and Voting in Instance-Based Learning. J. Experimental and Theoretical Artificial Intelligence 6, 361–391 (1994)
Article MATH Google Scholar
Colombet, I., Ruelland, A., Chatellier, G., et al.: Models to predict cardiovascular risk: comparison of CART, Multilayer perceptron and logistic regression. In: Proc AMIA Symp, pp. 156–160 (2000)
Google Scholar

Download references

Author information

Authors and Affiliations

School of Computing and Mathematics, Faculty of Engineering, University of Ulster, Jordanstown, BT37 0QB, Northern Ireland, UK
Yue Huang, Paul McCullagh & Norman Black
The Ulster Hospital, Dundonald, Belfast, BT16 0RH, Northern Ireland, UK
Roy Harper

Authors

Yue Huang
View author publications
You can also search for this author in PubMed Google Scholar
Paul McCullagh
View author publications
You can also search for this author in PubMed Google Scholar
Norman Black
View author publications
You can also search for this author in PubMed Google Scholar
Roy Harper
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Institute of Computer Vision and applied Computer Sciences, IBaI, Germany
Petra Perner

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Huang, Y., McCullagh, P., Black, N., Harper, R. (2004). Feature Selection and Classification Model Construction on Type 2 Diabetic Patient’s Data. In: Perner, P. (eds) Advances in Data Mining. ICDM 2004. Lecture Notes in Computer Science(), vol 3275. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30185-1_17

Download citation

DOI: https://doi.org/10.1007/978-3-540-30185-1_17
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-24054-9
Online ISBN: 978-3-540-30185-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics