Skip to main content

Feature Selection and Classification Model Construction on Type 2 Diabetic Patient’s Data

  • Conference paper
Advances in Data Mining (ICDM 2004)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3275))

Included in the following conference series:

Abstract

Diabetes is a disorder of the metabolism where the amount of glucose in the blood is too high because the body cannot produce or properly use insulin. In order to achieve more effective diabetes clinic management, data mining techniques have been applied to a patient database. In an attempt to improve the efficiency of data mining algorithms, a feature selection technique ReliefF is used with the data, which can rank the important attributes affecting Type 2 diabetes control. After selecting suitable attributes, classification techniques are applied to the data to predict how well the patients are controlling their condition. Preliminary results have been confirmed by the clinician and this provides optimism that data mining can be used to generate prediction models.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Diamond, HICOM Technology (2002), http://www.hicom.co.uk/news.asp?NewsID=8

  2. IDF–International Diabetes Federation, Diabetes Atlas, 2nd edn. (2003)

    Google Scholar 

  3. American Association, Medical Guidelines for the Management of Diabetes Mellitus. Endocrine Practice 8 (suppl. 1), 40–82 (2002)

    Google Scholar 

  4. Diabetes Control and Complications Trial Research Group: The effect of intensive treat ment of diabetes on the development and progression of long-term complications in insulin-dependent diabetes mellitus. N. Engl. J. Med. 329, 977–986 (1993)

    Google Scholar 

  5. UKPDS. Intensive blood-glucose control with sulphonylureas orinsulin compared with conventional treatment and risk ofcomplications in patients with type 2 diabetes. Lancet. 352, 837–853 (1998)

    Google Scholar 

  6. UKPDS. Effect of intensive blood-glucose control with metformin on complications in overweight patients with type 2 diabetes. Lancet. 352, 854–865 (1998)

    Google Scholar 

  7. American Diabetes Association, About us American Diabetes Association (2004), http://www.diabetes.org/aboutus.jsp?WTLPromo=HEADER_aboutus&vms=142585600057

  8. Strattpm, I.M., Adler, A.I., Neil, H.A.W.: Association of Glycaemia with Macrovascular and Microvascular complications of Type 2 Diabetes. Br. Med. J. 321, 405–412 (2000)

    Article  Google Scholar 

  9. Lavrac, N.: Selected Techniques for Data Mining in Medicine. AI Med. 16(1), 3–23 (2002)

    MathSciNet  Google Scholar 

  10. Breault, J., Goodall, C., Fos, P.: Data mining a diabetic data warehouse. Artif. Intell. Med. 26(1-2), 37 (2002)

    Article  Google Scholar 

  11. Miyaki, K., Takei, I., Watanabe, K., Nakashima, H., Watanabe, K., Omae, K.: Novel statistical classification model of Type 2 diabetes mellitus patients for trailor-made prevention using data mining algorithm. J. Epidemiol 12(3), 243–248 (2002)

    Article  Google Scholar 

  12. SPSS Inc., Target the right people more effectively— AnswerTree, http://www.aspiresoftwareintl.com/html/spss_answer_tree.html

  13. Rohlfing, C.L., Wiedmeyer, H.M., Little, R.R., Jack, D.E., Tennill, A., Goldstein, D.E.: Defining the Relationship between Plasma Glucose and HbA1c, Analysis of glucose profiles and HbA1c in the Diabetes Control and Complications Trial. Diabetes Care 25, 275–278 (2002)

    Article  Google Scholar 

  14. Richards, G., Rayward-Smith, V.J., Sonksen, P.H., Carey, S., Weng, C.: Mining for indicators of early mortality in a database of clinical records. Artif. Intell. Med. 22(3), 215–231 (2001)

    Article  Google Scholar 

  15. Stepaniuk, J.: Rough set based data mining in diabetes mellitus data table. In: Proceedings of the Sixth European Congress on Intelligent Techniques and Soft Computing (EUFIT 1998), Aachen, Germany, September 7-10, vol. 2, pp. 980–984 (1998)

    Google Scholar 

  16. Diabetes UK, Diabetes in Northern Ireland (March 2004), http://www.diabetes.org.uk/n.ireland/nireland.htm

  17. Kononenko, I.: Estimating attributes: Analysis and extensions of Relief. In: Proceeding of the Seventh European Conference on Machine Learning, pp. 171–182. Springer, Heidelberg (1994)

    Google Scholar 

  18. Hall, M.A., Holmes, G.: Benchmarking Attribute Selection Techniques for Discrete Class Data Mining. IEEE Transactions on Knowledge and Data Engineering, IEEE 15(3), 1437–1447 (2003)

    Article  Google Scholar 

  19. Demsar, J., Zupan, B., Aoki, N., Wall, M.J., Granchi, T.H., Beck, J.R.: Feature Mining and Predictive Model Construction from Severe Trauma Patient’s Data. Int. J. Med. Inf. 63, 41–50 (2001)

    Article  Google Scholar 

  20. Molina, L., Belanche, L., Nebot, A.: Feature Selection Algorithms: A Survey and Experimental Evaluation. In: Proceeding of IEEE International Conference on Data Mining. IEEE, pp. 306–313 (2002)

    Google Scholar 

  21. Perner, P.: Improving the Accuracy of Decision Tree Induction by Feature Pre-Selection. Applied Artificial Intelligence 15(8), 747–760 (2001)

    Article  Google Scholar 

  22. Witten, I.H., Frank, E.: Data Mining: Practical Machine Learning Tools and Techniques with Java Implementations. Morgan Kaufmann, San Francisco (1999)

    Google Scholar 

  23. Turney, P.: Theoretical Analysis of Cross-Validation Error and Voting in Instance-Based Learning. J. Experimental and Theoretical Artificial Intelligence 6, 361–391 (1994)

    Article  MATH  Google Scholar 

  24. Colombet, I., Ruelland, A., Chatellier, G., et al.: Models to predict cardiovascular risk: comparison of CART, Multilayer perceptron and logistic regression. In: Proc AMIA Symp, pp. 156–160 (2000)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Huang, Y., McCullagh, P., Black, N., Harper, R. (2004). Feature Selection and Classification Model Construction on Type 2 Diabetic Patient’s Data. In: Perner, P. (eds) Advances in Data Mining. ICDM 2004. Lecture Notes in Computer Science(), vol 3275. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30185-1_17

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-30185-1_17

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-24054-9

  • Online ISBN: 978-3-540-30185-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics