skip to main content
10.1145/3234664.3234674acmotherconferencesArticle/Chapter ViewAbstractPublication PageshpcctConference Proceedingsconference-collections
research-article

Microarray Data Analysis and Model Construction Based on Oversampling Approach and Decision Tree

Published: 22 June 2018 Publication History

Abstract

Primary skin cancer can be divided into basal cell carcinoma(BCC), squamous cell carcinoma and melanoma three types. Especially, BCC incidence is as high as 80% with 10% global incidence increasing each year. Although BCC has low metastatic and lower mortality than the other two types, this malignant tumor has a very high incidence. Therefore places a great burden on global healthcare expenditures. Imiquimod and Resiquimod are therapies for BCC. Imiquimod has been shown to be highly effective in the treatment of localized BCC. Therefore, the purpose of this study is to analyze the microarray of therapeutic drugs for BCC using machine learning algorithm and constructing prediction model. In this study, microarray raw data with giving Imiquimod and Resiquimod to BCC, which contained 30,968 sets of gene expression. There are two parts of this study: First, the three gene selection algorithms were employed, such as linear regression analysis, information gain and gain ratio, to screen out the biomaker. Second, we use decision tree algorithms: C4.5 to construct the prediction model and discuss classification accuracy.

References

[1]
S. Franceschi, F. Levi, L. Randimbison, and C. La Vecchia, "Site distribution of different types of skin cancer: new aetiological clues," International journal of cancer, vol. 67, no. 1, pp. 24--28, 1996.
[2]
E. L. Smits, P. Ponsaerts, Z. N. Berneman, and V. F. Van Tendeloo, "The use of TLR7 and TLR8 ligands for the enhancement of cancer immunotherapy," The oncologist, vol. 13, no. 8, pp. 859--875, 2008.
[3]
J. F. Gerster, K. J. Lindstrom, R. L. Miller, M. A. Tomai, W. Birmachu, S. N. Bomersine, S. J. Gibson, L. M. Imbertson, J. R. Jacobson, and R. T. Knafla, "Synthesis and structure-activity-relationships of 1 H-imidazo {4, 5-c} quinolines that induce interferon production," Journal of medicinal chemistry, vol. 48, no. 10, pp. 3481--3491, 2005.
[4]
Tomai MA, Gibson SJ, Imbertson LM, et al. Immunomodulating and antiviral activities of the imidazochinolines S-28643. Antivir Res 1995;28:253--64. First paper on resiquimod activities in animal models and cell cultures.
[5]
Tomai MA, Gibson SJ, Imbertson LM, et al. Immunomodulating and antiviral activities of the imidazochinolines S-28643. Antivir Res 1995;28:253--64. First paper on resiquimod activities in animal models and cell cultures.
[6]
T. Meyer, C. Surber, L. E. French, and E. Stockfleth, "Resiquimod, a topical drug for viral skin lesions and skin cancer," Expert opinion on investigational drugs, vol. 22, no. 1, pp. 149--159, 2013.
[7]
L. Morán-Fernández, V. Bolón-Canedo, and A. Alonso-Betanzos, "Data complexity measures for analyzing the effect of SMOTE over microarrays," in European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning, 2016.
[8]
R. Blagus and L. Lusa, "Evaluation of smote for high-dimensional class-imbalanced microarray data," in Machine learning and applications (icmla), 2012 11th international conference on, 2012, vol. 2, pp. 89--94: IEEE.
[9]
M.-F. Tsai and S.-S. Yu, "Data Mining for Bioinformatics: Design with Oversampling and Performance Evaluation," Journal of Medical and Biological Engineering, vol. 35, no. 6, pp. 775--782, 2015.
[10]
N. V. Chawla, K. W. Bowyer, L. O. Hall, and W. P. Kegelmeyer, "SMOTE: synthetic minority over-sampling technique," Journal of artificial intelligence study, vol. 16, pp. 321--357, 2002.
[11]
J. Lee Rodgers and W. A. Nicewander, "Thirteen ways to look at the correlation coefficient," The American Statistician, vol. 42, no. 1, pp. 59--66, 1988.
[12]
R. E. Shiffler, "Maximum Z scores and outliers," The American Statistician, vol. 42, no. 1, pp. 79--80, 1988.
[13]
J. R. Quinlan, "Improved use of continuous features in C4. 5," Journal of artificial intelligence study, vol. 4, pp. 77--90, 1996.

Cited By

View all
  • (2023)Optimal Decision Tree for Early Detection of Bipolar Disorder based on Crowdsourced Symptoms2023 Eighth International Conference on Informatics and Computing (ICIC)10.1109/ICIC60109.2023.10382060(1-6)Online publication date: 8-Dec-2023

Index Terms

  1. Microarray Data Analysis and Model Construction Based on Oversampling Approach and Decision Tree

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    HPCCT '18: Proceedings of the 2018 2nd High Performance Computing and Cluster Technologies Conference
    June 2018
    126 pages
    ISBN:9781450364850
    DOI:10.1145/3234664
    © 2018 Association for Computing Machinery. ACM acknowledges that this contribution was authored or co-authored by an employee, contractor or affiliate of a national government. As such, the Government retains a nonexclusive, royalty-free right to publish or reproduce this article, or to allow others to do so, for Government purposes only.

    In-Cooperation

    • Shanghai Jiao Tong University: Shanghai Jiao Tong University
    • Xi'an Jiaotong-Liverpool University: Xi'an Jiaotong-Liverpool University
    • Chinese Academy of Sciences

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 22 June 2018

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. BCC
    2. biomarker
    3. imiquimod
    4. machine learning
    5. microarray
    6. resiquimod

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    HPCCT 2018

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)7
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 22 Jan 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2023)Optimal Decision Tree for Early Detection of Bipolar Disorder based on Crowdsourced Symptoms2023 Eighth International Conference on Informatics and Computing (ICIC)10.1109/ICIC60109.2023.10382060(1-6)Online publication date: 8-Dec-2023

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media