A Multi-information Based Gene Scoring Method for Analysis of Gene Expression Data

Yu, Hsieh-Hui; Tseng, Vincent S.; Chuang, Jiin-Haur

doi:10.1007/11790105_8

Part of the book series: Lecture Notes in Computer Science (TCSB, volume 4070)

Abstract

Hepatitis B virus (HBV) infection is a worldwide health problem, with more than 1 million people dying from liver cirrhosis and hepatocellular carcinoma (HCC) each year. HBV infection can lead to progression from a normal liver to serious cirrhosis, a process that is insidious and asymptomatic in most cases. The recent development of DNA microarray technology gives biomedical researchers a molecular-level view that lets them observe thousands of genes simultaneously. How to efficiently extract useful information from these large-scale gene expression data is an important issue. Although a number of interesting studies address this issue, they tend to rely on complicated statistical hypotheses. In this paper, we propose a multi-information-based methodology for scoring genes based on their microarray expression levels. The concept of multi-information here is to combine different scoring functions in different tiers for analyzing gene expression. The proposed methods rank genes according to their degree of relevance to the targeted disease, so as to form a precise basis for prediction. Experimental results, assessed against QRT-PCR results, show that our approach delivers accurate predictions.
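
The abstract does not specify the scoring functions or how the tiers are organized, so the following is only a minimal sketch of the general idea of combining several scoring functions into one gene ranking. Everything in it is an assumption made for illustration and not the authors' method: the two scorers (`signal_to_noise`, `mean_difference`), the flat average-of-ranks combination in place of tiers, and the made-up toy expression values.

```python
from statistics import mean, stdev

def signal_to_noise(case, control):
    """Illustrative scorer: |difference of class means| / sum of class std devs."""
    denom = stdev(case) + stdev(control)
    return abs(mean(case) - mean(control)) / denom if denom else 0.0

def mean_difference(case, control):
    """Illustrative scorer: absolute difference of class means."""
    return abs(mean(case) - mean(control))

def ranks(scores):
    """Map each gene to its rank under one scorer (1 = highest score)."""
    ordered = sorted(scores, key=scores.get, reverse=True)
    return {gene: i + 1 for i, gene in enumerate(ordered)}

def combined_ranking(expression, scorers):
    """Order genes by the average of their ranks across all scorers.

    This flat average is an assumed stand-in for the paper's tiered
    combination, which the abstract does not describe.
    """
    per_scorer = [
        ranks({g: f(case, control) for g, (case, control) in expression.items()})
        for f in scorers
    ]
    avg_rank = {g: mean(r[g] for r in per_scorer) for g in expression}
    return sorted(expression, key=avg_rank.get)

# Hypothetical toy data: gene -> (disease samples, normal samples).
toy = {
    "geneA": ([8.1, 8.3, 7.9], [5.0, 5.2, 4.9]),
    "geneB": ([6.0, 6.1, 5.9], [5.9, 6.0, 6.2]),
    "geneC": ([7.0, 9.0, 5.0], [5.5, 6.5, 4.5]),
}
ranking = combined_ranking(toy, [signal_to_noise, mean_difference])
print(ranking)
```

In this toy example `geneA` separates the two classes most clearly under both scorers and so ranks first. Averaging ranks rather than raw scores keeps scorers with different scales from dominating one another, which is one common reason to combine at the rank level.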

Author information

Authors and Affiliations

Department of Computer Science and Information Engineering, National Cheng Kung University, Tainan, Taiwan
Hsieh-Hui Yu & Vincent S. Tseng
Department of Surgery and Internal Medicine, Chang Gung Memorial Hospital at Kaohsiung, Kaohsiung, Taiwan
Jiin-Haur Chuang

Editor information

Editors and Affiliations

The Microsoft Research - Centre for Computational and Systems Biology, University of Trento, Piazza Manci, 17, 38050, Povo (TN), Italy
Corrado Priami
College of Computer and Information Engineering, Henan University, Henan, China
Xiaohua Hu
Georgia State University, Dept. of CS, 30302, Atlanta, GA, USA
Yi Pan
Department of Computer Science, San Jose State University, CA 95192, San Jose, USA
Tsau Young Lin

About this paper

Cite this paper

Yu, HH., Tseng, V.S., Chuang, JH. (2006). A Multi-information Based Gene Scoring Method for Analysis of Gene Expression Data. In: Priami, C., Hu, X., Pan, Y., Lin, T.Y. (eds) Transactions on Computational Systems Biology V. Lecture Notes in Computer Science, vol 4070. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11790105_8

DOI: https://doi.org/10.1007/11790105_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-36048-3
Online ISBN: 978-3-540-36049-0
eBook Packages: Computer Science, Computer Science (R0)