Abstract
with breast cancer, scientists have been trying to find the most effective solutions and treatments. Moreover, studies on genes through machine learning were conducted. By identifying the factor that influences breast cancer the most, this knowledge can be used to prevent or treat patients with breast cancer appropriately. Furthermore, the result of this experiment extends onto the debate of nature versus nurture. If the result concludes that Only Gene or Only Mutation has a stronger effect on tumors, then it weighs nature more in this debate. Likewise, if Only Others is the dominant factor, then this emphasizes the nurture more in this debate. Gathered data was processed and went through eight different machine learning algorithms to predict the tumor size and stage. The ‘Others’ was concluded as the most influential factor for the tumor. Among the ‘Others’, the type of breast surgery and the number of chemotherapy received were identified with the highest correlation with tumor size and stage. In conclusion, this solidifies the nurture’s stance on the debate. The data on the external effects and the usage of a developed machine learning model can be adopted to improve the experiment because they would increase the accuracy of the result.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
National Cancer Institute: What is cancer? 5 May 2021 https://www.cancer.gov/about-cancer/understanding/what-is-cancer
WHO|World Health Organization: Cancer, 3 March 2021. https://www.who.int/news-room/fact-sheets/detail/cancer
National Breast Cancer Foundation: 19 September 2019. Other Types. https://www.nationalbreastcancer.org/other-types-of-breast-cancer. Accessed 13 Dec. 2022
Centers for Disease Control and Prevention: What are the symptoms of breast cancer? 14 September 2020. https://www.cdc.gov/cancer/breast/basic_info/symptoms.htm
Breastcancer.org: Researchers identify 110 genes associated with breast cancer, 20 December 2018 https://www.breastcancer.org/research-news/110-genes-associated-with-breast-cancer
IBM Cloud Education: What is machine learning? IBM - United States, 15 July 2020. https://www.ibm.com/cloud/learn/machine-learning
Montesinos-López, O.A., et al.: A review of deep learning applications for genomic selection. BMC Genomics 22(1) (2021). https://doi.org/10.1186/s12864-020-07319-x
Jethanandani, M.: Machine learning and genetics. 23andMe Education Program, 8 August 2018. https://education.23andme.com/machine-learning-and-genetics/
Max-Planck-Gesellschaft: 165 new cancer genes identified with the help of machine learning. ScienceDaily, 12 April 2021. www.sciencedaily.com/releases/2021/04/210412142730.htm
Sung, H., Ferlay, J., Siegel, R.L., Laversanne, M., Soerjomataram, I., Jemal, A., Bray, F.: Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA: A Cancer J. Clinicians 71(3), 209–249 (2021). https://doi.org/10.3322/caac.21660
Frangioni, J.V.: New technologies for human cancer imaging. PubMed Central (PMC), 20 August 2008. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2654310/
M. (2018, October 31). Early detection of breast cancer information. MyVMC. https://www.myvmc.com/investigations/early-detection-of-breast-cancer/
Urda, D., Montes-Torres, J., Moreno, F., Franco, L., Jerez, J.M.: Deep learning to analyze RNA-seq gene expression data. In: Rojas, I., Joya, G., Catala, A. (eds.) IWANN 2017. LNCS, vol. 10306, pp. 50–59. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-59147-6_5
Castillo, D., Gálvez, J.M., Herrera, L.J., Román, B.S., Rojas, F., Rojas, I.: Integration of RNA-SEQ data with heterogeneous microarray data for breast cancer profiling. BMC Bioinform. 18(1) (2017). https://doi.org/10.1186/s12859-017-1925-0
Liñares Blanco, J., Gestal, M., Dorado, J., Fernandez-Lozano, C.: Differential gene expression analysis of RNA-seq data using machine learning for cancer research. In: Tsihrintzis, G.A., Virvou, M., Sakkopoulos, E., Jain, L.C. (eds.) Machine Learning Paradigms. LAIS, vol. 1, pp. 27–65. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-15628-2_3
Wang, D., Zhang, Y., Zhao, Y.: LightGBM. In: Proceedings of the 2017 International Conference on Computational Biology and Bioinformatics - ICCBB 2017 (2017). https://doi.org/10.1145/3155077.3155079
Johnson, N.T., Dhroso, A., Hughes, K.J., Korkin, D.: Biological classification with RNA-SEQ data: can alternatively spliced transcript expression enhance machine learning classifiers? RNA 24(9), 1119–1132 (2018). https://doi.org/10.1261/rna.062802.117
Alharbi, R.: Breast cancer gene expression profiles (METABRIC). Kaggle: Your Machine Learning and Data Science Community, 27 May 2020. https://www.kaggle.com/raghadalharbi/breast-cancer-gene-expression-profiles-metabric
Breiman, L.: Random forests. Mach. Learn. 45(3), 5–32 (2001). https://doi.org/10.1023/a:1017934522171
Ke, G., et al.: Lightgbm: a highly efficient gradient boosting decision tree. Adv. Neural. Inf. Process. Syst. 30, 3146–3154 (2017)
Medicinenet.com. (2022). https://www.medicinenet.com/nature_vs_nurture_theory_genes_or_environment/article.htm. Accessed 3 Jan 2022
How These Common Carcinogens May Be Increasing Your Risk for Cancers. Verywell Health: (2022). https://www.verywellhealth.com/carcinogens-in-cigarettes-how-they-cause-cancer-514412#:~:text=A%20carcinogen%20is%20any%20substance,cancer%20helps%20in%20prevention%20efforts. Accessed 3 Jan 2022
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Park, J., Kim, M. (2023). Utilizing Machine Learning to Predict Breast Cancer: One Step Closer to Bridging the Gap Between the Nature Versus Nurture Debate. In: Arai, K. (eds) Proceedings of the Future Technologies Conference (FTC) 2022, Volume 1. FTC 2022 2022. Lecture Notes in Networks and Systems, vol 559. Springer, Cham. https://doi.org/10.1007/978-3-031-18461-1_41
Download citation
DOI: https://doi.org/10.1007/978-3-031-18461-1_41
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-18460-4
Online ISBN: 978-3-031-18461-1
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)