skip to main content
10.1145/3520304.3528802acmconferencesArticle/Chapter ViewAbstractPublication PagesgeccoConference Proceedingsconference-collections
poster

BOBEA: a bi-objective biclustering evolutionary algorithm for genome-wide association analysis

Published: 19 July 2022 Publication History

Abstract

The behavior of many diseases is still not well understood by researchers. Genome-Wide Association (GWA) analyzes have recently become a popular approach to discovering the genetic causes of many complex diseases. These analyzes could lead to the discovery of genetic factors potentially involved in certain disease susceptibility. These studies typically use the most common genetic variation between individuals, the Single Nucleotide Polymorphism (SNP). Indeed, many complex diseases have been revealed to be associated with combinations of SNP interactions. However, the identification of such interactions is considered difficult. Therefore, various unsupervised data mining methods are often developed in the literature to identify such variation involved in disease. In this work, a biclustering method is adopted to detect possible associations between SNP markers and disease susceptibility. It is an unsupervised classification technique, which plays an increasingly important role in the study of modern biology. We propose an evolutionary algorithm based on a bi-objective approach for the biclustering of the Genome-Wide Association. An experimental study is achieved on synthetic data to evaluate the performance of the proposed algorithm. Promising results are obtained.

References

[1]
Freitas Adelaide, Ayadi Wassim, Elloumi Mourad, Oliveira Joséluis, and Hao JinKao. 2013. Survey on biclustering of gene expression data. Biological Knowledge Discovery Handbook. John Wiley & Sons, Inc (2013), 591--608.
[2]
Syvänen Ann-Christine. 2005. Toward genome-wide SNP genotyping. Nature genetics 37, 6 (2005), S5--S10.
[3]
Wassim Ayadi and Elloumi Mourad. 2011. Biclustering of microarray data. Algorithms in Computational Molecular Biology: Techniques, Approaches and Applications. Wiley Book Series on Bioinformatics: Computational Techniques and Engineering (2011), 651--664.
[4]
Wassim Ayadi, Elloumi Mourad, and Hao Jin-Kao. 2009. A biclustering algorithm based on a bicluster enumeration tree: application to dna microarray data. BioData mining 2, 1 (2009), 1--16.
[5]
Chunteng Bao, Xu Lihong, Goodman Erik D, and Cao Leilei. 2017. A novel non-dominated sorting algorithm for evolutionary multi-objective optimization. Journal of Computational Science 23 (2017), 31--43.
[6]
Amir Ben-Dor, Chor Benny, Karp Richard, and Yakhini Zohar. 2002. Discovering local structure in gene expression data: the order-preserving submatrix problem. In Proceedings of the sixth annual international conference on Computational biology. 49--57.
[7]
Sven Bergmann, Ihmels Jan, and Barkai Naama. 2003. Iterative signature algorithm for the analysis of large-scale gene expression data. Physical review E 67, 3 (2003), 031902.
[8]
William S Bush and Moore Jason H. 2012. Chapter 11: Genome-wide association studies. PLoS computational biology 8, 12 (2012), e1002822.
[9]
Carlos Cano, Adarve L, López J, and Blanco Armando. 2007. Possibilistic approach for biclustering microarray data. Computers in biology and medicine 37, 10 (2007), 1426--1436.
[10]
Yizong Cheng and Church George M. 2000. Biclustering of expression data. In Ismb, Vol. 8. 93--103.
[11]
Gu Dongfeng, Su Shaoyong, Ge Dongliang, Chen Shufeng, Huang Jianfeng, Li Biao, Chen Runsheng, and Qiang Boqin. 2006. Association study with 33 single-nucleotide polymorphisms in 11 candidate genes for hypertension in Chinese. Hypertension 47, 6 (2006), 1147--1154.
[12]
Dongliang Ge, Zhu Haidong, Huang Ying, Treiber Frank A, Harshfield Gregory A, Snieder Harold, and Dong Yanbin. 2007. Multilocus analyses of renin-angiotensin-aldosterone system gene variants on blood pressure at rest and during behavioral stress in young normotensive subjects. Hypertension 49, 1 (2007), 107--112.
[13]
Binder Harald, Müller Tina, Schwender Holger, Golka Klaus, Steffens Michael, Jan G Hengstle, Ickstadt Katja, and Schumacher Martin. 2012. Cluster-localized sparse logistic regression for SNP data. Statistical applications in genetics and molecular biology 11, 4 (2012).
[14]
Ihmels Jan, Bergmann Sven, and Barkai Naama. 2004. Defining transcription modules using large-scale gene expression data. Bioinformatics 20, 13 (2004), 1993--2003.
[15]
Ding Lili, Baye Tesfaye M, He Hua, Zhang Xue, Kurowski Brad G, and Martin Lisa J. 2011. Detection of associations with rare and common SNPs for quantitative traits: a nonparametric Bayes-based approach. In BMC proceedings, Vol. 5. BioMed Central, 1--5.
[16]
Ons Maâtouk, Wassim Ayadi, Hend Bouziri, and Béatrice Duval. 2019. Evolutionary biclustering algorithms: an experimental study on microarray data. Soft Computing 23, 17 (2019), 7671--7697.
[17]
Sara C Madeira, Miguel C Teixeira, Isabel Sa-Correia, and Arlindo L Oliveira. 2008. Identification of regulatory modules in time series gene expression data using a linear time biclustering algorithm. IEEE/ACM Transactions on Computational Biology and Bioinformatics 7, 1 (2008), 153--165.
[18]
Van Emburgh Beth O, Hu Jennifer J, Levine Edward A, Mosley Libyadda J, Case L Douglas, Lin Hui-Yi, Knight Sommer N, Perrier Nancy D, Rubin Peter, Sherrill Gary B, et al. 2008. Polymorphisms in drug metabolism genes, smoking, and p53 mutations in breast cancer. Molecular Carcinogenesis: Published in cooperation with the University of Texas MD Anderson Cancer Center 47, 2 (2008), 88--99.
[19]
Maâtouk Ons, Ayadi Wassim, Bouziri Hend, and Béatrice Duval. 2021. Evolutionary Local Search Algorithm for the biclustering of gene expression data based on biological knowledge. Applied Soft Computing 104 (2021), 107177.
[20]
Amela Prelić, Bleuler Stefan, Zimmermann Philip, Wille Anja, Bühlmann Peter, Gruissem Wilhelm, Hennig Lars, Thiele Lothar, and Zitzler Eckart. 2006. A systematic comparison and evaluation of biclustering methods for gene expression data. Bioinformatics 22, 9 (2006), 1122--1129.
[21]
Cook Nancy R, Zee Robert YL, and Ridker Paul M. 2004. Tree and spline based association analysis of gene-gene interaction models for ischemic stroke. Statistics in medicine 23, 9 (2004), 1439--1453.
[22]
Beatriz Pontesand Giráldez Raúl and Aguilar-Ruiz Jesús S. 2015. Biclustering on expression data: A review. Journal of biomedical informatics 57 (2015), 163--180.
[23]
Pawel Rene, Patryk Orzechowski, Aleksander Byrski, Jaroslaw Was, and Jason H. Moore. 2021. Rapid prototyping of evolution-driven biclustering methods in Julia. In GECCO '21: Proceedings of the Genetic and Evolutionary Computation Conference CompanionJuly. 61--62.
[24]
Henriques Rui and Madeira Sara C. 2018. BSig: evaluating the statistical significance of biclustering solutions. Data Mining and Knowledge Discovery 32, 1 (2018), 124--161.
[25]
Khedidja Seridi, Laetitia Jourdan, and El-Ghazali Talbi. 2013. Preliminary Studies on Biclustering of GWA: A Multiobjective Approach. In International Conference on Artificial Evolution (Evolution Artificielle). Springer, 106--117.
[26]
Andrey A Shabalin, Victor J Weigman, Charles M Perou, and Andrew B Nobel. 2009. Finding large average submatrices in high dimensional data. The Annals of Applied Statistics (2009), 985--1012.
[27]
Nishchal K Verma, Sharma Teena, Dixit Sonal, Agrawal Pooja, Sengupta Sourya, and Singh Vikas. 2021. BIDEAL: A Toolbox for Bicluster Analysis---Generation, Visualization and Validation. SN Computer Science 2, 1 (2021), 1--15.
[28]
Jiong Yang, Haixun Wang, Wei Wang, and Philip S Yu. 2005. An improved biclustering method for analyzing gene expression profiles. International Journal on Artificial Intelligence Tools 14, 05 (2005), 771--789.

Cited By

View all
  • (2024)A parameter free relative density based biclustering method for identifying non-linear feature relationsHeliyon10.1016/j.heliyon.2024.e34736(e34736)Online publication date: Jul-2024
  • (2023)Metaheuristic Biclustering Algorithms: From State-of-the-art to Future OpportunitiesACM Computing Surveys10.1145/361759056:3(1-38)Online publication date: 6-Oct-2023

Index Terms

  1. BOBEA: a bi-objective biclustering evolutionary algorithm for genome-wide association analysis
          Index terms have been assigned to the content through auto-classification.

          Recommendations

          Comments

          Information & Contributors

          Information

          Published In

          cover image ACM Conferences
          GECCO '22: Proceedings of the Genetic and Evolutionary Computation Conference Companion
          July 2022
          2395 pages
          ISBN:9781450392686
          DOI:10.1145/3520304
          Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

          Sponsors

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          Published: 19 July 2022

          Check for updates

          Author Tags

          1. bi-objective approach
          2. biclustering
          3. evolutionary algorithm
          4. genome-wide association
          5. single nucleotide polymorphism

          Qualifiers

          • Poster

          Conference

          GECCO '22
          Sponsor:

          Acceptance Rates

          Overall Acceptance Rate 1,669 of 4,410 submissions, 38%

          Contributors

          Other Metrics

          Bibliometrics & Citations

          Bibliometrics

          Article Metrics

          • Downloads (Last 12 months)12
          • Downloads (Last 6 weeks)1
          Reflects downloads up to 10 Feb 2025

          Other Metrics

          Citations

          Cited By

          View all
          • (2024)A parameter free relative density based biclustering method for identifying non-linear feature relationsHeliyon10.1016/j.heliyon.2024.e34736(e34736)Online publication date: Jul-2024
          • (2023)Metaheuristic Biclustering Algorithms: From State-of-the-art to Future OpportunitiesACM Computing Surveys10.1145/361759056:3(1-38)Online publication date: 6-Oct-2023

          View Options

          Login options

          View options

          PDF

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader

          Figures

          Tables

          Media

          Share

          Share

          Share this Publication link

          Share on social media