skip to main content
10.1145/3512452.3512453acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiccbbConference Proceedingsconference-collections
research-article

Predicting Anatomical Therapeutic Chemical Drug Classes from 17 molecules’ Properties of Drugs by Multi-Label Binary Relevance Approach with MLSMOTE

Authors Info & Claims
Published:01 April 2022Publication History

ABSTRACT

Anatomical Therapeutic Chemical (ATC) classes prediction is one of the prominent activities in the costly and tedious pipeline of drug discovery where machine learning plays an important role by minimizing the cost and time of prediction. Most of the existing research have been done to predict ATC classes from the chemical-chemical association, side-effects, target proteins, gene expressions, chemical structures, drug targets, and textual information of drugs. However, the capability of 17 molecules’ properties have not yet been explored to predict drug ATC classes. The current work proposes a methodology for predicting the drug ATC classes using the 17 molecules’ properties. ATC classes prediction is a multi-label classification task and therefore, a binary relevance strategy has been employed to solve this issue with four basic machine learning classifiers, namely K-Nearest Neighbour (KNN), Extra Tree Classifier (ETC), Random Forest (RF), and Decision Tree (DT). The common problem of multi-label datasets is class imbalance which is addressed using the MLSMOTE (Multi-Label Synthetic Minority Over-Sampling Technique). The proposed methodology exhibits promising results, and it achieved the accuracy ranging from 96.90% to 98.06%, which indicates that 17 molecules’ properties are good enough in efficient prediction of ATC classes.

References

  1. A breakthrough milestone in AI-powered drug discovery reached linking biology and chemistry with AI. url: https://insilico.com/blog/pcc (visited on 07/12/2021).Google ScholarGoogle Scholar
  2. Andrew P Bradley. “The use of the area under the ROC curve in the evaluation of machine learning algorithms”. In: Pattern recognition 30.7 (1997), pp. 1145–1159.Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Francisco Charte “MLSMOTE: Approaching imbalanced multilabel learning through synthetic instance generation”. In: Knowledge-Based Systems 89 (2015), pp. 385–397.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Fan-Shu Chen and Zhen-Ran Jiang. “Prediction of drug's Anatomical Therapeutic Chemical (ATC) code by integrating drug–domain network”. In: Journal of biomedical informatics 58 (2015), pp. 80–88.Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Lei Chen, Tao Liu, and Xian Zhao. “Inferring anatomical therapeutic chemical (ATC) class of drugs using shortest path and random walk with restart algorithms”. In: Biochimica et Biophysica Acta (BBA)-Molecular Basis of Disease 1864.6 (2018), pp. 2228–2240.Google ScholarGoogle ScholarCross RefCross Ref
  6. Lei Chen “Predicting anatomical therapeutic chemical (ATC) classification of drugs by integrating chemical-chemical interactions and similarities”. In: PloS one 7.4 (2012), e35254.Google ScholarGoogle ScholarCross RefCross Ref
  7. etc.ensemble.ExtraTreesClassifier. url: https://scikit-learn.org/stable/modules/generated/sklearn.ensemble.ExtraTreesClassif ier.html#sklearn.ensemble.ExtraTreesClassif ier (visited on08/27/2021).Google ScholarGoogle Scholar
  8. Harsha Gurulingappa “Concept-based semi-automatic classification of drugs”. In: Journal of chemical information and modeling 49.8 (2009), pp. 1986–1992.Google ScholarGoogle ScholarCross RefCross Ref
  9. Jeonghee Jo, Hyun-Soo Choi, and Sungroh Yoon. “Prediction of Drug Classes with a Deep Neu- ral Network using Drug Targets and Chemical Structure Data”. In: 2019 IEEE International Conference on Bioinformatics and Biomedicine (BIBM). IEEE. 2019, pp. 664–667.Google ScholarGoogle ScholarCross RefCross Ref
  10. Michael Kuhn “A side effect resource to capture phenotypic effects of drugs”. In: Molecular systems biology 6.1 (2010), p. 343.Google ScholarGoogle ScholarCross RefCross Ref
  11. Haiyan Liang “Recognizing novel chemicals/drugs for anatomical therapeutic chemical classes with a heat diffusion algorithm”. In: Biochimica et Biophysica Acta (BBA)-Molecular Basis of Disease 1866.11 (2020), p. 165910.Google ScholarGoogle ScholarCross RefCross Ref
  12. Zhongyang Liu “Similarity-based prediction for anatomical therapeutic chemical clas- sification of drugs by integrating multiple data sources”. In: Bioinformatics 31.11 (2015), pp. 1788–1795.Google ScholarGoogle ScholarCross RefCross Ref
  13. Gjorgji Madjarov “An extensive experimental comparison of methods for multi-label learning”. In: Pattern recognition 45.9 (2012), pp. 3084–3104.Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Ken McGarry and Ennock Assamoha. “Data integration with self-organising neural network reveals chemical structure and therapeutic effects of drug ATC codes”. In: UK Workshop on Computational Intel ligence. Springer. 2017, pp. 63–74.Google ScholarGoogle Scholar
  15. Loris Nanni, Sheryl Brahnam, and Alessandra Lumini. “Ensemble of Deep Learning Approaches for ATC Classification”. In: Smart Intel ligent Computing and Applications. Springer, 2020, pp. 117–125.Google ScholarGoogle ScholarCross RefCross Ref
  16. Janette Nickel “SuperPred: update on drug classification and target prediction”. In: Nucleic acids research 42.W1 (2014), W26–W31.Google ScholarGoogle ScholarCross RefCross Ref
  17. Thomas Olson and Rahul Singh. “Predicting anatomic therapeutic chemical classification codes using tiered learning”. In: BMC bioinformatics 18.8 (2017), pp. 1–13.Google ScholarGoogle Scholar
  18. Yayuan Peng “Drug repositioning by prediction of drug's anatomical therapeutic chemical code via network-based inference approaches”. In: Briefings in bioinformatics 22.2 (2021), pp. 2058–2072.Google ScholarGoogle ScholarCross RefCross Ref
  19. Purvi Prajapati, Amit Thakkar, and Amit Ganatra. “A survey and current research challenges in multi-label classification methods”. In: International Journal of Soft Computing and Engineering (IJSCE) 2.1 (2012), pp. 248–252.Google ScholarGoogle Scholar
  20. PubChem. url: https://pubchem.ncbi.nlm.nih.gov/ (visited on 09/29/2021).Google ScholarGoogle Scholar
  21. Jesse Read “Classifier chains for multi-label classification”. In: Machine learning 85.3 (2011), pp. 333–359.Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. sklearn.ensemble.RandomForestClassifier. url: https://scikit-learn.org/stable/modules/generated/sklearn.ensemble.RandomForestClassif ier.html#sklearn.ensemble.RandomForestClass if ier (visited on 08/27/2021).Google ScholarGoogle Scholar
  23. sklearn.neighbors.KNeighborsClassifier. url: https://scikit-learn.org/stable/modules/generated/sklearn.neighbors.KNeighborsClassif ier.html#sklearn.neighbors.KNeighborsClassif ier (visited on 08/27/2021).Google ScholarGoogle Scholar
  24. sklearn.tree.DecisionTreeClassifier. url: https://scikit- learn.org/stable/modules/generated/sklearn.tree.DecisionTreeClassif ier.html#sklearn.tree.DecisionTreeClassif ier (visited on 08/27/2021).Google ScholarGoogle Scholar
  25. Pankaj Vaidya, Ankit Gupta, and Varun Jaiswal. “Machine learning based prediction of anatomical therapeutic chemical (ATC) class of drug like molecule”. In: 2018 International Conference on Recent Innovations in Electrical, Electronics & Communication Engineering (ICRIEECE). IEEE. 2018, pp. 1045–1048.Google ScholarGoogle ScholarCross RefCross Ref
  26. Jessica Vamathevan “Applications of machine learning in drug discovery and develop- ment”. In: Nature Reviews Drug Discovery 18.6 (2019), pp. 463–477.Google ScholarGoogle ScholarCross RefCross Ref
  27. Chi-Shiang Wang “Detecting potential adverse drug reactions using a deep neural network model”. In: Journal of medical Internet research 21.2 (2019), e11016.Google ScholarGoogle ScholarCross RefCross Ref
  28. Xiangeng Wang “ATC-NLSP: prediction of the classes of anatomical therapeutic chemicals using a network-based label space partition method”. In: Frontiers in pharmacology 10 (2019), p. 971.Google ScholarGoogle ScholarCross RefCross Ref
  29. Yong-Cui Wang “Network predicting drug's anatomical therapeutic chemical code”. In: Bioinformatics 29.10 (2013), pp. 1317–1324.Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Min-Ling Zhang “Binary relevance for multi-label learning: an overview”. In: Frontiers of Computer Science 12.2 (2018), pp. 191–202.Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. Haochen Zhao “A novel approach based on deep residual learning to predict drug's anatomical therapeutic chemical code”. In: 2020 IEEE International Conference on Bioinfor- matics and Biomedicine (BIBM). IEEE. 2020, pp. 921–926.Google ScholarGoogle ScholarCross RefCross Ref
  32. Jian-Peng Zhou, Lei Chen, and Zi-Han Guo. “iATC-NRAKEL: an efficient multi-label classifier for recognizing anatomical therapeutic chemical classes of drugs”. In: Bioinformatics 36.5 (2020), pp. 1391–1396.Google ScholarGoogle ScholarCross RefCross Ref

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in
  • Published in

    cover image ACM Other conferences
    ICCBB '21: Proceedings of the 2021 5th International Conference on Computational Biology and Bioinformatics
    December 2021
    102 pages
    ISBN:9781450387446
    DOI:10.1145/3512452

    Copyright © 2021 ACM

    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    • Published: 1 April 2022

    Permissions

    Request permissions about this article.

    Request Permissions

    Check for updates

    Qualifiers

    • research-article
    • Research
    • Refereed limited

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format .

View HTML Format