skip to main content
10.1145/3571532.3571541acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicbbsConference Proceedingsconference-collections
research-article

Application of machine learning to Isoniazid resistance analysis

Published:06 February 2023Publication History

ABSTRACT

Correct and timely detection of Mycobacterium tuberculosis (MTB) resistance against existing tuberculosis (TB) drugs is essential for the limit of TB amplification. The objectives of the projects are (1) to develop classification models that help isoniazid-resistant TB diagnosis, (2) to find the best performed classification algorithm, and (3) to rank the gene mutations according to feature importance. The python sklearn and matplotlib packages were frequently utilized throughout the research for data curation, classification model development, and feature importance ranking. Additionally, area under the curve (AUC), precision, sensitivity, specificity, F1 score, and correct classification rate measured for model performances, and Gini importance calculated feature importance. Gradient boosting found to overperform other classification models with the highest accuracy mean 0f 0.852, and its overfitting error exposed the need for dimensionality reduction prior to model training. Gene 625 and 331 were the most significant features in this project, and this suggested the potential of machine learning (ML) to find new resistance makers. The results confirmed the application of ML in clinical settings for quicker and better prediction of drug resistance based on large genome sequencing data. With future studies focusing on less studied and second-line TB drugs, classification models could decrease mortality and prevent the amplification of existing antibiotic resistance by allowing early diagnosis and treatment.

CCS CONCEPTS • Computing methodologies∼Machine learning∼Learning paradigms∼Supervised learning∼Supervised learning by classification

References

  1. World Health Organization, “Global Tuberculosis Report 2020”, Oct. 2021Google ScholarGoogle Scholar
  2. Centers for Disease Control and Prevention CDC, “Drug-Resistant TB,” Jan. 2017, https://www.cdc.gov/tb/topic/drtb/default.htmGoogle ScholarGoogle Scholar
  3. Y. L. Xie , “Evaluation of a Rapid Molecular Drug-Susceptibility Test for Tuberculosis,” New England Journal of Medicine, vol. 377, no. 11, pp. 1043–1054, Sep. 2017, doi:10.1056/nejmoa1614915.Google ScholarGoogle ScholarCross RefCross Ref
  4. A. Sharma, E. Machado, K. V. B. Lima, P. N. Suffys, and E. C. Conceic, “Tuberculosis drug resistance profiling based on machine learning: A literature review”, The Brazilian Journal of Infectious Diseases, v. 26, n. 1, Jan. 2022, 102332 doi: 10.1016/j.bjid.2022.102332Google ScholarGoogle ScholarCross RefCross Ref
  5. D. V. Volokhov, V. E. Chizhikov, S. Denkin, and Y. Zhang, “Molecular detection of drug-resistant Mycobacterium tuberculosis with a scanning-frame oligonucleotide microarray,” Methods in Molecular Biology (Clifton, N.J.), vol. 465, pp. 395–417, June 2010, doi: 10.1007/978-1-59745-207-6_26.Google ScholarGoogle ScholarCross RefCross Ref
  6. W. Deelder, , “Machine Learning Predicts Accurately Mycobacterium tuberculosis Drug Resistance From Whole Genome Sequencing Data,” frontiers in Genetic, Sep.2019, https://doi.org/10.3389/fgene.2019.00922Google ScholarGoogle ScholarCross RefCross Ref
  7. Y. Yang, , “Machine learning for classifying tuberculosis drug-resistance from DNA sequencing data”, Bioinformatics, v. 34, n. 10, pp. 1666–1671, May 2018, doi: 10.1093/bioinformatics/btx801Google ScholarGoogle ScholarCross RefCross Ref
  8. M. L. Chen , “Deep learning predicts tuberculosis drug resistance status from genome sequencing data,” Mar. 2018, doi: 10.1101/275628. *Google ScholarGoogle ScholarCross RefCross Ref
  9. M. R. Farhat , “Genetic Determinants of Drug Resistance inMycobacterium tuberculosis and Their Diagnostic Value,” American Journal of Respiratory and Critical Care Medicine, vol. 194, no. 5, pp. 621–630, Sep. 2016, doi: 10.1164/rccm.201510-2091oc.Google ScholarGoogle ScholarCross RefCross Ref
  10. S.Kouchaki, , “Application of machine learning techniques to tuberculosis drug resistance analysis, ” Oxford, Nov. 2018, pp. 2276-2282, doi: 10.1093/bioinformatics/bty949Google ScholarGoogle Scholar
  11. S. Kouchaki , “Multi-Label Random Forest Model for Tuberculosis Drug Resistance Classification and Mutation Ranking,” Frontiers in Microbiology, vol. 11, Apr. 2020, doi: 10.3389/fmicb.2020.00667.Google ScholarGoogle Scholar

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in
  • Published in

    cover image ACM Other conferences
    ICBBS '22: Proceedings of the 2022 11th International Conference on Bioinformatics and Biomedical Science
    October 2022
    146 pages
    ISBN:9781450396929
    DOI:10.1145/3571532

    Copyright © 2022 ACM

    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    • Published: 6 February 2023

    Permissions

    Request permissions about this article.

    Request Permissions

    Check for updates

    Qualifiers

    • research-article
    • Research
    • Refereed limited
  • Article Metrics

    • Downloads (Last 12 months)17
    • Downloads (Last 6 weeks)2

    Other Metrics

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format .

View HTML Format