skip to main content
10.1145/3456529.3456538acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiccdaConference Proceedingsconference-collections
research-article

Clustering of Functionally Related Genes Using Machine Learning Techniques

Published:13 July 2021Publication History

ABSTRACT

The clustering of functionally related genes has been an important task for biologists. With the recent progress of machine learning technology, researchers now have more powerful weapons to identify the structures within a large amount of DNA sequencing data. That allows the research on genes to be conducted in an efficient and scalable way. This paper studies the clustering of functionally related genes and their impact on the development and prognosis of lung cancer with machine learning technologies. The patient data derived from 218 patients are analyzed. We focus on two extreme cases, one case includes patients who survived less than 1 year, and the other case includes patients who survived longer than 5 years. We will investigate how different clustering methods can assist in the visualization of the DNA sequence data of such patients, and how such methods can help us identify the underlying patterns of the DNA sequence data.

References

  1. Siegel, Rebecca L., Kimberly D. Miller, and Ahmedin Jemal. "Cancer statistics, 2019." CA: a cancer journal for clinicians 69.1 (2019): 7-34.Google ScholarGoogle Scholar
  2. Sperduto, Paul W., T. Jonathan Yang, Kathryn Beal, Hubert Pan, Paul D. Brown, Ananta Bangdiwala, Ryan Shanley "Estimating survival in patients with lung cancer and brain metastases: an update of the graded prognostic assessment for lung cancer using molecular markers (Lung-molGPA)." JAMA oncology 3, no. 6 (2017): 827-831.Google ScholarGoogle ScholarCross RefCross Ref
  3. Cancer Genome Atlas Research Network. "Comprehensive molecular profiling of lung adenocarcinoma." Nature 511, no. 7511 (2014): 543-550.Google ScholarGoogle ScholarCross RefCross Ref
  4. VanderLaan, Paul A., Deepa Rangachari, Susan M. Mockus, Vanessa Spotlow, Honey V. Reddi, Joan Malcolm, Mark S. Huberman, Loren J. Joseph, Susumu S. Kobayashi, and Daniel B. Costa. "Mutations in TP53, PIK3CA, PTEN and other genes in EGFR mutated lung cancers: Correlation with clinical outcomes." Lung Cancer 106 (2017): 17-21.Google ScholarGoogle ScholarCross RefCross Ref
  5. Mann, Amandeep Kaur, and Navneet Kaur. "Review paper on clustering techniques." Global Journal of Computer Science and Technology (2013).Google ScholarGoogle Scholar
  6. Cancer Genome Atlas Research Network. "Comprehensive molecular profiling of lung adenocarcinoma." Nature 511, no. 7511 (2014): 543.Google ScholarGoogle ScholarCross RefCross Ref
  7. Pozo, Christie L. Pratt, Mary Ann A. Morgan, and Jhanelle E. Gray. "Survivorship issues for patients with lung cancer." Cancer Control 21, no. 1 (2014): 40-50.Google ScholarGoogle ScholarCross RefCross Ref
  8. Wattenberg, Martin, Fernanda Viégas, and Ian Johnson. "How to use t-SNE effectively." Distill 1, no. 10 (2016): e2.Google ScholarGoogle Scholar
  9. McInnes, Leland, John Healy, and James Melville. "Umap: Uniform manifold approximation and projection for dimension reduction." arXiv preprint arXiv:1802.03426 (2018).Google ScholarGoogle Scholar
  10. Ghosh, Soumi, and Sanjay Kumar Dubey. "Comparative analysis of k-means and fuzzy c-means algorithms." International Journal of Advanced Computer Science and Applications 4, no. 4 (2013).Google ScholarGoogle Scholar
  11. Murtagh, Fionn, and Pedro Contreras. "Algorithms for hierarchical clustering: an overview." Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery 2, no. 1 (2012): 86-97.Google ScholarGoogle ScholarCross RefCross Ref
  12. Borg, Ingwer. Applied multidimensional scaling and unfolding. Springer, 2018.Google ScholarGoogle Scholar

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in
  • Published in

    cover image ACM Other conferences
    ICCDA '21: Proceedings of the 2021 5th International Conference on Compute and Data Analysis
    February 2021
    194 pages
    ISBN:9781450389112
    DOI:10.1145/3456529

    Copyright © 2021 ACM

    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    • Published: 13 July 2021

    Permissions

    Request permissions about this article.

    Request Permissions

    Check for updates

    Qualifiers

    • research-article
    • Research
    • Refereed limited
  • Article Metrics

    • Downloads (Last 12 months)9
    • Downloads (Last 6 weeks)0

    Other Metrics

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format .

View HTML Format