Abstract
Human Papillomavirus (HPV) is known as the main cause of cervical cancer and classified to low- or high-risk type by its malignant potential. Detection of high-risk HPVs is critical to understand the mechanisms and recognize potential patients in medical judgments. In this paper, we present a simple kernel approach to classify HPV risk types from E6 protein sequences. Our method uses support vector machines combined with gap-spectrum kernels. The gap-spectrum kernel is introduced to compute the similarity between amino acids pairs with a fixed distance, which can be useful for the helical structure of proteins. In the experiments, the proposed method is compared with a mismatch kernel approach in accuracy and F1-score, and the predictions for unknown types are presented.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Janicek, M.F., Averette, H.E.: Cervical Cancer: Prevention, Diagnosis, and Therapeutics. Cancer Journals for Clinicians 51, 92–114 (2001)
Watson, J.D., Laskowski, R.A., Thornton, J.M.: Predicting Protein Function from Sequence and Structural Data. Current Opinion in Structural Biology 15, 275–284 (2005)
Eom, J.H., Park, S.B., Zhang, B.T.: Genetic Mining of DNA Sequence Structures for Effective Classification of the Risk Types of Human Papillomavirus (HPV). In: Proceedings of the 11th International Conference on Neural Information Processing, pp. 1334–1343 (2004)
Joung, J.G., Oh, S.J., Zhang, B.T.: Prediction of the Risk Types of Human Papillomaviruses by Support Vector Machines. In: Proceedings of the 8th Pacific Rim International Conference on Artificial Intelligence, pp. 723–731 (2004)
Park, S.B., Hwang, S., Zhang, B.-T.: Mining the Risk Types of Human Papillomavirus (HPV) by AdaCost. In: Proceedings of the 14th International Conference on Database and Expert Systems Applications, pp. 403–412 (2003)
Leslie, C., Kuang, R.: Fast String Kernels using Inexact Matching for Protein Sequences. Journal of Machine Learning Research 5, 1435–1455 (2004)
Leslie, C., Eskin, E., Noble, W.S.: The Spectrum Kernel: A String Kernel for SVM Protein Classification. In: Proceedings of the Pacific Symposium on Biocomputing, pp. 564–575 (2002)
Leslie, C., Eskin, E., et al.: Mismatch String Kernels for Discriminative Protein Classification. Bioinformatics 20(4), 467–476 (2004)
Pillai, M., Lakshmi, S., et al.: High-Risk Human Papillomavirus Infection and E6 Protein Expression in Lesions of the Uterine Cervix. Pathobiology 66, 240–246 (1998)
Muñoz, N., Bosch, F.X., et al.: Epidemiologic Classification of Human Papillomavirus Types Associated with Cervical Cancer. New England Journal of Medicine 348, 518–527 (2003)
Longuet, M., Beaudenon, S., Orth, G.: Two Novel Genital Human Papillomavirus (HPV) Types, HPV68 and HPV70, Related to the Potentially Oncogenic HPV39. Journal of Clinical Microbiology 34, 738–744 (1996)
Meyer, T., Arndt, R., et al.: Association of Rare Human Papillomavirus Types with Genital Premalignant and Malignant Lesions. Journal of Infectious Diseases 178, 252–255 (1998)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kim, S., Eom, JH. (2006). Prediction of the Human Papillomavirus Risk Types Using Gap-Spectrum Kernels. In: Wang, J., Yi, Z., Zurada, J.M., Lu, BL., Yin, H. (eds) Advances in Neural Networks - ISNN 2006. ISNN 2006. Lecture Notes in Computer Science, vol 3973. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11760191_104
Download citation
DOI: https://doi.org/10.1007/11760191_104
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-34482-7
Online ISBN: 978-3-540-34483-4
eBook Packages: Computer ScienceComputer Science (R0)