Abstract
Hepatitis B virus (HBV) infection is a worldwide health problem, with more than 1 million people died from liver cirrhosis and hepatocellular carcinoma (HCC) each year. HBV infection could result in the progression from normal to serious cirrhosis which is insidious and asymptomatic in most of the cases. The recent development of DNA microarray technology provides biomedical researchers with a molecular sight to observe thousands of genes simultaneously. How to efficiently extract useful information from these large-scale gene expression data is an important issue. Although there exist a number of interesting researches on this issue, they used to deploy some complicated statistical hypotheses. In this paper, we propose a multi-information-based methodology to score genes based on the microarray expressions. The concept of multi-information here is to combine different scoring functions in different tiers for analyzing gene expressions. The proposed methods can rank the genes according to the degree of relevance to the targeted diseases so as to form a precise prediction base. The experimental results show that our approach delivers accurate prediction through the assessment of QRT-PRC results.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Alizadeh, A.A., Eisen, M.B., Davis, R.E., Ma, C., Lossos, I.S., Rosenwald, A., Boldrick, J.C., Sabet, H., Tran, T., Yu, X., Powell, J.I., Yang, L., Marti, G.E., Moore, T., Hudson, J.: Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling. Nature 403(6769), 503–511 (2000)
Alon, U., et al.: Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays. In: Proceedings of the National Academy of Sciences, vol. 96, pp. 6745–6750 (1999)
Ben-Dor, A., Friedman, N., Yakhini, Z.: Scoring genes for relevance, Technical Report, 2000-38, School of Computer Science & Engineering. Hebrew University, Jerusalem
Ben-Dor, A., Friedman, N., Yakhini, Z.: Overabundance Analysis and Class Discovery in Gene Expression Data, Technical Reports of the Leibniz Center (2002)
Ben-Dor, A., Bruhn, L., Friedman, N., Nachman, I., Schummer, M., Yakhini, Z.: Tissue classification with gene expression profiles. Jour. Of Comp. Bio. 7, 559–584 (2000)
Ben-Dor, A., Shamir, R., Yakhini, Z.: Clustering gene expression patterns. J. Comp. Bio. 6(3-4), 281–297 (1999)
Blum, A., Langley, P.: Selection of relevant features and examples in machine learning. Artificial Intelligence 97, 245–271 (1997)
Chuang, H.Y., Liu, H.F., Brown, S., Cameron, M.C., Kao, C.Y.: Identifying significant genes from microarray data. In: fourth IEEE Symposium on Bioinformatics and Bioengineering (BIBE), pp. 358–366 (2004)
Chuang, H.Y., Tsai, H.K., Tsai, Y.F., Kao, C.Y.: Ranking genes for discriminability on microarray data. Journal of Information Science and Engineering 19, 953–966 (2003)
Cortes, C., Vapnik, V.: Support vector machines. Machine Learning 20, 273–297 (1995)
de Kok, J.B., Roelofs, R.W., Giesendorf, B.A., Pennings, J.L., Waas, E.T., Feuth, T., Swinkels, D.W., Span, P.N.: Normalization of gene expression measurements in tumor tissues: comparison of 13 endogenous control genes. Lab Invest. Jan 85(1), 154–159 (2005)
Eisen, M.B., Spellman, P.T., Brown, P.O., Botstein, D.: Cluster analysis and display of genome-wide expression patterns. PNAS 95(25), 14863–14868 (1998)
Gerard, C.J., Andrejka, L.M., Macina, R.A.: Mitochondrial ATP synthase 6 as an endogenous control in the quantitative RT-PCR analysis of clinical cancer samples. Mol Diagn 5, 39–46 (2000)
Golub, T.R., Slonim, D.K., Tamayo, P., Huard, C., Gaasenbeek, M., Mesirov, J.P., Coller, H., Loh, M.L., Downing, J.R., Caligiuri, M.A., Bloomfield, C.D., Lander, E.S.: Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. Science 286(5439), 531–537 (1999)
Jain, A.K., Dubes, R.C.: Algorithms for Clustering Data. Prentice-Hall, Englewood Cliffs (1988)
Kunth, K., Hofler, H., Atkinson, M.J.: Quantification of messenger RNA expression in tumors: which standard should be used for best RNA normalization? Verh Dtsch Ges Pathol 78, 226–230 (1994)
Marden, J.I.: ‘Analysing and Modeling Rank Data. Chapman and Hall, Boca Raton (1995)
McQueen, J.: Some Methods of Classification and Analysis of Multivariate Observations. In: Proc. of the 5th Berkeley Symp. Mathematical Statistics and Probability, pp. 281–297 (1967)
Park, P.J., Pagano, M., Bonetti, M.: A Nonparametric Scoring Algorithm for Identifying Informative Genes from Microarray Data. Pacific Symposium on Biocomputing 6, 52–63 (2001)
Pavlidis, P., Tang, C.: Classification of genes using probabilistic models of microarray expression profiles. In: Proceedings of BIOKDD 2001 (2001)
Schmittgen, T.D., Zakrajsek, B.A.: Effect of experimental treatment on housekeeping gene expression: validation by real-time, quantitative RT-PCR. J. Biochem. Biophys. Methods 46, 69–81 (2000)
Sharan, R., Shamir, R.: CLICK: A clustering algorithm with applications to gene expression analisys. In: ISMB 2000 (2000)
Slonim, D.K., Tamayo, P., Mesirov, J.P., Golub, T.R., Lander, E.S.: Class prediction and discovery using gene expression data. In: RECOMB 2000 (2000)
Staunton, J.E., Slonim, D.K., Coller, H.A., Tamayo, P., Angelo, M.J., Park, J., Scherf, U., Lee, J.K., Reinhold, W.O., Weinstein, J.N., et al.: Chemosensitivity prediction by transcriptional profiling. In: Proc. Natl. Acad. Sci. USA 2001, vol. 98, pp. 10787–10792 (2000)
Weston, J., Mukherjee, S., Chapelle, O., Pontil, M., Poggio, T., Vapnik, V.: Feature selection for SVMs. In: Advances in Neural Information Processing Systems, vol. 13. MIT Press, Cambridge (2001)
Xu, L., Krzyzak, A., Suen, C.Y.: Method of Combining Multiple Classifiers and their Application to Handwriting Recognition. IEEE Trans SMC 22, 418–435 (1992)
Zuo, F., Kaminski, N., Eugui, E., Allard, J., Yakhini, Z., Ben-Dor, A., Lollini, L., Morris, D., Kim, Y., DeLustro, B., et al.: Gene expression analysis reveals matrilysin as a key regulator of pulmonary fibrosis in mice and humans. In: Proc. Natl. Acad. Sci. USA 2002, vol. 99, pp. 6292–6297 (2000)
Affymetrix. User’s guide to product comparison spreadsheets (2003), http://www.affymetrix.com/support/technical/manual/comparison_spreadsheets_manual.pdf
LocusLink, http://www.ncbi.nlm.nih.gov/LocusLink
UniGene, http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=unigene
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Yu, HH., Tseng, V.S., Chuang, JH. (2006). A Multi-information Based Gene Scoring Method for Analysis of Gene Expression Data. In: Priami, C., Hu, X., Pan, Y., Lin, T.Y. (eds) Transactions on Computational Systems Biology V. Lecture Notes in Computer Science(), vol 4070. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11790105_8
Download citation
DOI: https://doi.org/10.1007/11790105_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-36048-3
Online ISBN: 978-3-540-36049-0
eBook Packages: Computer ScienceComputer Science (R0)