Skip to main content

Genetic Mining of DNA Sequence Structures for Effective Classification of the Risk Types of Human Papillomavirus (HPV)

  • Conference paper
Neural Information Processing (ICONIP 2004)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 3316))

Included in the following conference series:

Abstract

Human papillomavirus (HPV) is considered to be the most common sexually transmitted disease and the infection of HPV is known as the major factor for cervical cancer. There are more than 100 types in HPV and each HPV has two risk types, low and high. In particular, high risk type HPV is known to the most important factors in medical judgment. Thus, the classifying the risk type of HPV is very important to the treat of cervical cancer. In this paper, we present a machine learning approach to mine the structure of HPV DNA sequence for effective classification of the HPV risk types. We learn the most informative subsequence segment sets and its weights with genetic algorithm to classify the risk types of each HPV. To resolve the problem of computational complexity of genetic algorithm we use distributed intelligent data engineering platform based on active grid concept called “IDEA@Home.” The proposed genetic mining method, with the described platform, shows about 85.6% classification accuracy with relatively fast mining speed.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Bosch, F.X., Manos, M.M., Muñoz, N., Sherman, M., Jansen, A.M., Peto, J., Schiffman, M.H., Moreno, V., Kurman, R., Shah, K.V.: Prevalence of human papillomavirus in cervical cancer: A worldwide perspective. J. Natl. Cancer Inst. 87(11), 796–802 (1995)

    Article  Google Scholar 

  2. Furumoto, H., Irahara, M.: Human Papillomavirus (HPV) and Cervical Cancer. Journal of Medical Investigation 49, 124–133 (2002)

    Google Scholar 

  3. Schiffman, M., Bauer, H., Hoover, R., Glass, A., Cadell, D., Rush, B., Scott, D., Sherman, M., Kurman, R., Wacholder, S.: Epidemiologic evidence showing that Human Papillomavirus infection causes most cervical intraepithelial neoplasis. Journal of the National Cancer Institute 85, 958–964 (1993)

    Article  Google Scholar 

  4. Park, S.-B., Hwang, S.-H., Zhang, B.-T.: Classification of the risk types of human papillomavirus by decision trees. In: Proceedings of the 4th International Conference on Intelligent Data Engineering and Automated Learning, pp. 540–544 (2003)

    Google Scholar 

  5. Janicek, M.F., Averette, H.E.: Cervical cancer: Prevention, diagnosis, and therapeutics. Cancer Journals for Clinicians 51, 92–114 (2001)

    Article  Google Scholar 

  6. Burk, R.D., Ho, G.Y., Beardsley, L., Lempa, M., Peters, M., Bierman, R.: Sexual behavior and partner characteristics are the predominant risk factors for genital human papillomavirus infection in young women. J. Infect. Dis. 174(4), 679–689 (1996)

    Article  Google Scholar 

  7. Muñoz, N., Bosch, F.X., Sanjosé, S., Herrero, R., Castellsagué, X., Shah, K.V., Snijders, P.J.F., Meijer, C.J.L.M.: Epidemiologic classification of human papillomavirus types associated with cervical cancer. The New England Journal of Medicine 348(6), 518–527 (2003)

    Article  Google Scholar 

  8. Eom, J.-H., Zhang, B.-T.: IDEA@home: The flexible active grid computing platform based on P2P and network segmentation. Technical Report BI-04-01, School of Computer Sci. & Eng., Seoul National Univ., Seoul, Korea (February 2004)

    Google Scholar 

  9. Richards, W.G.: Virtual screening using grid computing: the screensaver project. Nature Reviews Drug Discovery 1, 551–555 (2002)

    Article  Google Scholar 

  10. Davies, E.K., Glick, M., Harrison, K.N., Richards, W.G.: Pattern recognition and massively distributed computing. Journal of Computational Chemistry 23(16), 1544–1550 (2002)

    Article  Google Scholar 

  11. Bäck, T.: Evolutionary algorithms in theory and practice. Oxford University Press, Oxford (1996)

    MATH  Google Scholar 

  12. Kim, S., Zhang, B.-T.: Genetic mining of HTML structures for effective web-document retrieval. Applied Intelligence 18, 243–256 (2003)

    Article  MathSciNet  Google Scholar 

  13. The HPV sequence database in Los Alamos laboratory, http://hpv-web.lanl.gov/stdgen/virus/hpv/index.html

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Eom, JH., Park, SB., Zhang, BT. (2004). Genetic Mining of DNA Sequence Structures for Effective Classification of the Risk Types of Human Papillomavirus (HPV). In: Pal, N.R., Kasabov, N., Mudi, R.K., Pal, S., Parui, S.K. (eds) Neural Information Processing. ICONIP 2004. Lecture Notes in Computer Science, vol 3316. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30499-9_208

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-30499-9_208

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-23931-4

  • Online ISBN: 978-3-540-30499-9

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics