skip to main content
10.1145/2147805.2147843acmconferencesArticle/Chapter ViewAbstractPublication PagesbcbConference Proceedingsconference-collections
short-paper

A fuzzy entropy based approach for development of gene prediction networks (GPNs): detecting altered dependency in carcinogenic state

Published: 01 August 2011 Publication History

Abstract

In this article, the dependencies among the genes have been identified from microarray gene expression data. Here we propose a methodology for identifying the dependencies among the genes that have deviated quite significantly from normal stage to diseased stage with respect to their expression patterns. This idea leads to predict the disease mediating genes along with their deviated dependencies. The proposed methodology involves measuring information content of individual genes using fuzzy entropy, conditional fuzzy entropy of a gene on another, dependencies of a pair of genes in both normal and diseased states, and finally identifying the dependencies being deviated from normal to carcinogenic state. The deviated dependencies among the genes have been represented using a network, called gene prediction network (GPN), in which each node represents a gene and a directed edge signifies deviated dependency between a pair of nodes (genes).
The methodology has been demonstrated on two gene expression data sets dealing with human lung cancer and breast cancer. The results are appropriately validated by earlier investigations in terms of gene regulation. We have also used some statistical techniques like t-test, accuracy in terms of sensitivity and specificity to validate the results.

References

[1]
G. D. Beer et. al. Gene-expression profilespredict survival of patients with lung adenocarcinoma. Nature Medicine., 8:816--823, 2002.
[2]
A. S. Butte and I. S. Kohane. Relevance networks: a first step toward finding genetic regulatory networks within microarray data. The Analysis of Gene Expression Data, Springer., pages 428--446, 2003.
[3]
A. Fujita, J. R. Sato, H. M. Garay-Malpartida, P. A. Morettin, M. C. Sogayar, and C. E. Ferreira. Time-varying modeling of gene expression regulatory networks using the wavelet dynamic vector autoregressive method. Bioinformatics., 23:1623--1630, 2007.
[4]
N. Grabe. Alibaba2: context specific identification of transcription factor binding sites. In Silico Biol., 2:S1--S15, 2002.
[5]
M. S. Halfon, Y. Grad, G. M. Church, and A. M. Michelson. Computation-based discovery of related transcriptional regulatory modules and motifs using an experimentally validated combinatorial model. Genome Res., 12:1019--1028, 2002.
[6]
O. Hirose, R. Yoshida, S. Imoto, R. Yamaguchi, T. Higuchi, S. D. Charnock-Jones, C. Print, and S. Miyano. Statistical inference of transcriptional module-based gene networks from time course gene expression profiles by using state space models, module finder on gene expression profiles. Bioinformatics., 24:932--942, 2008.
[7]
S. Kim, S. Imoto, and S. Miyano. Dynamic bayesian network and nonparametric regression for nonlinear modeling of gene networks from time series gene expression data. Biosystems., 75:57--65, 2004.
[8]
W. Krivan and W. W. Wasserman. A predictive model for regulatory sequences directing liver-specific transcription. Genome Res., 11:1559--1566, 2001.
[9]
D. H. Ly, D. J. Lockhart, R. A. Lerner, and P. G. Schultz. Mitotic misregulation and human aging. Science, 287:2486--2492, 2000.
[10]
B. H. Mecham, G. T. Klus, J. Strovel, M. Augustus, D. Byrne, P. Bozso, D. Z. Wetmore, T. J. Mariani, I. S. Kohane, and Z. Szallasil. Sequence-matched probes produce increased cross-platform consistency and more reproducible biological results in microarray-based gene expression measurements. Nucleic Acids Res., 32:e74, 2004.
[11]
I. Nachman, A. Regev, and N. Friedman. Inferring quantitative models of regulatory networks from expression data. Bioinformatics., 4:i248--i256, 2004.
[12]
J. Quinlan. Programs for machine learning. Morgan Kaufmann., 1993.
[13]
J. L. Riechmann, J. Heard, G. Martin, L. Reuber, C. Jiang, J. Keddie, L. Adam, O. Pineda, O. J. Ratcliffe, and R. R. Samaha. Arabidopsis transcription factors: genome-wide comparative analysis among eukaryotes. Science., 290:2105--2110, 2000.
[14]
E. Roulet, I. Fisch, T. Junier, P. Bucher, and N. Mermod. Evaluation of computer tools for the prediction of transcription factor binding sites on genomic dna. In Silico Biol., 1:21--28, 1998.
[15]
J. Schafer and K. Strimmer. An empirical bayes approach to inferring largescale gene association networks. Bioinformatics., 21:754--764, 2005.
[16]
O. Schuldiner, C. Yanover, and N. Benvenisty. Computer analysis of the entire budding yeast genome for putative targets of the GCN4 transcription factor. Curr. Genet., 33:16--20, 1998.
[17]
J. D. Storey and R. Tibshirani. Statistical significance for genomwide studies. Proc. Natl. Acad. Sci., 100:9440--9445, 2003.
[18]
K. Tan, G. Moreno-Hagelsieb, J. Collado-Vides, and G. D. Stormo. A comparative genomics approach to prediction of new members of regulons. Genome Res., 11:566--584, 2001.
[19]
R. Tupler, G. Perini, M. A. Pellegrino, and M. R. Green. Profound misregulation of muscle-specific gene expression in facioscapulohumeral muscular dystrophy. Proc. Natl. Acad. Sci., 96:12650--12654, 1999.
[20]
E. Wingender, X. Chen, E. Fricke, R. Geffers, R. Hehl, I. Liebich, M. Krull, V. Matys, H. Michael, and R. Ohnhauser. The transfac system on gene expression regulation. Nucleic Acids Res., 29:281--283, 2001.
[21]
R. Yamaguchi, R. Yoshida, S. Imoto, T. Higuchi, and S. Miyano. Finding modulebased gene networks in time-course gene expression data with state space models. IEEE Signal Processing Magazine., 24:37--46, 2007.
  1. A fuzzy entropy based approach for development of gene prediction networks (GPNs): detecting altered dependency in carcinogenic state

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      BCB '11: Proceedings of the 2nd ACM Conference on Bioinformatics, Computational Biology and Biomedicine
      August 2011
      688 pages
      ISBN:9781450307963
      DOI:10.1145/2147805
      • General Chairs:
      • Robert Grossman,
      • Andrey Rzhetsky,
      • Program Chairs:
      • Sun Kim,
      • Wei Wang
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 01 August 2011

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. GPN
      2. entropy
      3. fuzzy set
      4. t-test
      5. true positive

      Qualifiers

      • Short-paper

      Conference

      BCB' 11
      Sponsor:

      Acceptance Rates

      Overall Acceptance Rate 254 of 885 submissions, 29%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • 0
        Total Citations
      • 74
        Total Downloads
      • Downloads (Last 12 months)0
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 03 Mar 2025

      Other Metrics

      Citations

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Figures

      Tables

      Media

      Share

      Share

      Share this Publication link

      Share on social media