skip to main content
10.1145/3500931.3501018acmotherconferencesArticle/Chapter ViewAbstractPublication PagesisaimsConference Proceedingsconference-collections
research-article

Prediction of anti-breast cancer compound activity based on Gradient Boosting Decision Tree ensemble learning

Authors Info & Claims
Published:22 December 2021Publication History

ABSTRACT

The number of breast cancer cases worldwide has risen sharply in the past 20 years. In 2000, the total number of cases was 1.05 million. By 2018, the number had increased to 2.09 million, an increase of 99.05 percent, with an average annual growth rate of more than 5 percent. Breast cancer has gradually become the most common cancer for women, so the development of anti-breast cancer drugs has become a hot topic in the current medical field. In this paper, a series of compound descriptors and their biological activity data were collected for ERα, a target associated with breast cancer, and compound activity was predicted by establishing a compound activity prediction model. According to the compound descriptors, more important features were selected. The Quantitative structure-activity Relationship (QSAR) model of compounds was constructed, and then the GBDT algorithm was used to predict the model. Finally, through comparative analysis of results, the method was fast and accurate. It plays a key role in drug research.

References

  1. Kunal, R. Thomas Leonard, J. (2005) Classical QSAR Modeling of Anti-HIV 2, 3-Diaryl-l, 3-thiazolidin-4-ones[J]. QSAR & combinatorial science, 24(5):579--592.Google ScholarGoogle Scholar
  2. Devillers, J. Flatin, J. (2000) A General QSAR Model for Predicting the Acute Toxicity of Pesticides toOncorhynchus mykiss[J]. SAR and QSAR in Environmental Research, 11(1):25--43.Google ScholarGoogle ScholarCross RefCross Ref
  3. Hong Wei, L. Zhi Liang, L. (2003) Structural Characterization of Pyrones through MEDV and Anti-HIV Activity Prediction by QSAR[J]. Chinese Journal of Organic Chemistry, 23(12):1370--1374.Google ScholarGoogle Scholar
  4. Li Ping, Z. Yong Hua, Z. Shu Shen, L. (2002) Structural Characterization of Cyclic Ureas through MEDV and Anti-HIV Activity Prediction by QSAR[J].Acta Chimica Sinica, 60(9):1688--1693.Google ScholarGoogle Scholar
  5. Bing, H. (2009) Prediction of blood-brain barrier penetrating drugs using Supporting Vector Machine[J].Computers and Applied Chemistry, 26(2):188--190.Google ScholarGoogle Scholar
  6. Jia, R., Yang, L., Li, Y., & Xin, Z. (2021). Gestures recognition of sEMG signal based on Random Forest. 2021 IEEE 16th Conference on Industrial Electronics and Applications (ICIEA), Industrial Electronics and Applications (ICIEA), 2021 IEEE 16th Conference On, 1673--1678.Google ScholarGoogle ScholarCross RefCross Ref
  7. MORAES, D. et al. Influence of Sample Size in Land Cover Classification Accuracy Using Random Forest and Sentinel-2 Data in Portugal. 2021 IEEE International Geoscience and Remote Sensing Symposium IGARSS, Geoscience and Remote Sensing Symposium IGARSS, 2021 IEEE International, [s. l.], p. 4232--4235, 2021.Google ScholarGoogle Scholar
  8. WARD, M. et al. Data Balanced Bagging Ensemble of Convolutional- LSTM Neural Networks for Time Series Data Classification with an Imbalanced Dataset. 2021 IEEE International Symposium on Circuits and Systems (ISCAS), [s. l.], p. 1--5, 2021.Google ScholarGoogle Scholar
  9. Yun Fan, Y. Fang, L. Shan, Z. (2020) Ensemble Learning Based on GBDT and CNN for AdoptabilityPrediction[J]. Computers, materials & continua, 65(2):1361--1372.Google ScholarGoogle Scholar
  10. Ghasemi, J. Saaidpour, S. Brown S. (2007) QSPR study for estimation of acidity constants of some aromatic acids derivatives using multiple linear regression (MLR) analysis[J]. Journal of Molecular Structure. Theochem: Applications of Theoretical Chemistry to Organic, Inorganic and Biological Problems, 805(1/3):27--32.Google ScholarGoogle Scholar
  11. Dang Qun, C. (2005) Analysis and Making Good Fitting Degree Test for Logistic Curve Regression Equation[J].Application of Statistics and Management, 24(1):112--115.Google ScholarGoogle Scholar
  12. Timothy, D. Barry, B. Janet, D. (1994) Evaluation of time‐series data sets using the Pearson product‐moment correlation coefficient[J].Medicine and Science in Sports and Exercise, 1994, 26(7):919--928.Google ScholarGoogle Scholar

Index Terms

  1. Prediction of anti-breast cancer compound activity based on Gradient Boosting Decision Tree ensemble learning

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Other conferences
          ISAIMS '21: Proceedings of the 2nd International Symposium on Artificial Intelligence for Medicine Sciences
          October 2021
          593 pages
          ISBN:9781450395588
          DOI:10.1145/3500931

          Copyright © 2021 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 22 December 2021

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article
          • Research
          • Refereed limited

          Acceptance Rates

          Overall Acceptance Rate53of112submissions,47%
        • Article Metrics

          • Downloads (Last 12 months)9
          • Downloads (Last 6 weeks)1

          Other Metrics

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader