Abstract
The emergence of systems biology enables us to simulate and analyze organism’s microscope features from the level of genome, proteome and interactome. This article utilized data integration method to predict molecular interaction network of Norway rat following the basic principles of systems biology. This research selects microarray related with cardiac hypertrophy, and built the downstream studies on 730 differentially expressed genes.4 heterogeneous kinds of data type including microarray expression, gene sequence, subcellular localization of protein and orthologous data are selected to make the overall model more comprehensive. After processed by specific algorithms, the 4 data types are transformed to 5 types of evidence: Pearson correlation coefficient, SVM model recognition, similarities between gene sequences, distance between proteins and orthologous alignment. A widely used machine learning algorithm, support vector machines (SVM) is introduced here to help deal with single evidence preparation and multiple evidence integration. This article finds that the prediction accuracy of data integration is obviously higher than that of single evidence. Data integration promised that heterogeneous data types could enhance each other’s advantages by weakening each other’s disadvantages so as to deliver more objective and comprehensive understanding of molecular interactions.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Barrett, T., Troup, D.B., Wilhite, S.E.: NCBI GEO: archive for high-throughput functional genomic data. Nucleic Acids Res. 37(Database issue), 885–890 (2009)
Lu, L.J., Xia, Y., Paccanaro, A., et al.: Accessing the limits of genomic data integration for predicting protein networks. Genome Res. 15, 945–953 (2005)
Hernandez-Toro, J., Prieto, C., Delas, R.J.: APID2NET: unified interactome graphic analyzer. BMC Bioinformatics 23(18), 2495–2497 (2007)
Jiang, C., Xuan, Z., Zhao, F., et al.: TRED: a transcriptional regulatory element database, new entries and other development. Nucleic Acids Res. 35(Database issue), 137–140 (2007)
Chang, C.C., Lin, C.J.: LIBSVM: a library for support vector machines (2001), Software available at http://www.csie.ntu.edu.tw/~cjlin/libsvm
Jansen, R., Bussemaker, H.J., Gerstein, M.: Revisiting the codon adaptation index from a whole-genome perspective: analyzing the relationship between gene expression and codon occurrence in yeast using a variety of models. Nucleic Acids Res. 31, 2242–2251 (2003)
Daubin, V., Perriere, G.: G+C3 structuring along the genome: a common feature in prokaryotes. Molecular Biology and Evolution 20, 471–483 (2003)
Fraser, H.B., Hirsh, A.E., Wall, D.P., et al.: Coevolution of gene expression among interacting proteins. PNAS 101, 9033–9038 (2004)
Hubbard, T.J., Aken, B.L., Ayling, S., et al.: Ensembl 2009. Nucleic Acids Res. 37(Database issue), 690–697 (2009)
Nakamura, Y., Gojobori, T., Ikemura, T.: Codon usage tabulated from international DNA sequence databases: status for the year 2000. Nucleic Acids Res. 28(1), 292 (2000)
Tatusov, R.L., Galperin, M.Y., Natale, D.A., et al.: The COG database: a tool for genome-scale analysis of protein functions and evolution. Nucleic Acids Res. 28(1), 33–36 (2000)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Li, Q., Rong, Q. (2010). Predict Molecular Interaction Network of Norway Rats Using Data Integration. In: Li, K., Jia, L., Sun, X., Fei, M., Irwin, G.W. (eds) Life System Modeling and Intelligent Computing. ICSEE LSMS 2010 2010. Lecture Notes in Computer Science(), vol 6330. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15615-1_21
Download citation
DOI: https://doi.org/10.1007/978-3-642-15615-1_21
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15614-4
Online ISBN: 978-3-642-15615-1
eBook Packages: Computer ScienceComputer Science (R0)