Abstract
In functional genomics, a pathway is defined as a set of genes which exhibit similar biological activities. Given a microarray expression data, the corresponding pathway information can be extracted with the use of some public databases. All member genes of a given pathway may not be equally relevant in estimating the activity of that pathway. Some genes can participate adequately in the given pathway, some may have low-associations. Existing literature has either considered all the genes wholly or discarded some genes completely in estimating the corresponding pathway-activity. Inspired by this, the current work reports about an automated approach to measure the degree of relevance of a given gene in predicting the pathway-activity. As a large search space has to be dragged, the exploration properties of particle swarm optimization are utilized in the current context. Particles of the PSO represent different scores of relevance for the member genes of different pathways. In order to deal with the relevance-score, the popular t-score which is widely used in measuring the pathway-activity is expanded in the name of weighted t-score. The proposed PSO-based weighted framework is then evaluated on three gene expression data sets. In order to show the supremacy of the proposed method, top 50% pathway markers are selected for each data set and the quality of these measures is checked after performing 10-fold cross-validation with respect to different quality measures. The results are further validated using biological significance tests.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Alon, U., et al.: Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays. Proc. Natl. Acad. Sci. 96(12), 6745–6750 (1999)
Bandyopadhyay, S., Mallik, S., Mukhopadhyay, A.: A survey and comparative study of statistical tests for identifying differential expression from microarray data. IEEE/ACM Trans. Comput. Biol. Bioinform. 11(1), 95–115 (2014)
Deng, L., Pei, J., Ma, J., Lee, D.L.: A rank sum test method for informative gene discovery. In: Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 410–419. ACM (2004)
Ding, C., Peng, H.: Minimum redundancy feature selection from microarray gene expression data. J. Bioinform. Comput. Biol. 3(02), 185–205 (2005)
Dutta, P., Saha, S.: Fusion of expression values and protein interaction information using multi-objective optimization for improving gene clustering. Comput. Biol. Med. 89, 31–43 (2017)
Ghosh, A., Dhara, B.C., De, R.K.: Selection of genes mediating certain cancers, using a neuro-fuzzy approach. Neurocomputing 133, 122–140 (2014)
Guo, Z., et al.: Towards precise classification of cancers based on robust gene functional expression profiles. BMC Bioinform. 6(1), 58 (2005)
Huang, D.W., Sherman, B.T., Lempicki, R.A.: Systematic and integrative analysis of large gene lists using david bioinformatics resources. Nat. Protoc. 4(1), 44 (2009)
Huang, T., Chen, L., Cai, Y.D., Chou, K.C.: Classification and analysis of regulatory pathways using graph property, biochemical and physicochemical property, and functional property. PLOS ONE 6(9), 1–11 (2011). https://doi.org/10.1371/journal.pone.0025297
Kennedy, J.: Particle swarm optimization. In: Encyclopedia of Machine Learning, pp. 760–766. Springer, Heidelberg (2011)
Khunlertgit, N., Yoon, B.J.: Identification of robust pathway markers for cancer through rank-based pathway activity inference. Adv. Bioinform. 2013 (2013)
Lee, E., Chuang, H.Y., Kim, J.W., Ideker, T., Lee, D.: Inferring pathway activity toward precise disease classification. PLoS Comput. Biol. 4(11), e1000217 (2008)
Ma, S., Kosorok, M.R.: Identification of differential gene pathways with principal component analysis. Bioinformatics 25(7), 882–889 (2009)
Mandal, M., Mondal, J., Mukhopadhyay, A.: A PSO-based approach for pathway marker identification from gene expression data. IEEE Trans. Nanobiosci. 14(6), 591–597 (2015)
Mandal, M., Mukhopadhyay, A.: A graph-theoretic approach for identifying non-redundant and relevant gene markers from microarray data using multiobjective binary PSO. PloS ONE 9(3), e90949 (2014)
Mukhopadhyay, A., Mandal, M.: Identifying non-redundant gene markers from microarray data: a multiobjective variable length PSO-based approach. IEEE/ACM Trans. Comput. Biol. Bioinform. (TCBB) 11(6), 1170–1183 (2014)
Pang, H., Zhao, H.: Building pathway clusters from random forests classification using class votes. BMC Bioinform, 9(1), 87 (2008)
Parsopoulos, K.E.: Particle Swarm Optimization and Intelligence: Advances and Applications: Advances and Applications. IGI Global, Hershey (2010)
Su, J., Yoon, B.J., Dougherty, E.R.: Accurate and reliable cancer classification based on probabilistic inference of pathway activity. PloS ONE 4(12), e8161 (2009)
Wang, K., Li, M., Bucan, M.: Pathway-based approaches for analysis of genomewide association studies. Am. J. Hum. Genet. 81(6), 1278–1283 (2007)
Wang, Y., Makedon, F.S., Ford, J.C., Pearlman, J.: HykGene: a hybrid approach for selecting marker genes for phenotype classification using microarray gene expression data. Bioinformatics 21(8), 1530–1537 (2004)
Yang, K., Cai, Z., Li, J., Lin, G.: A stable gene selection in microarray data analysis. BMC Bioinform. 7, 228 (2006). https://doi.org/10.1186/1471-2105-7-228
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Switzerland AG
About this paper
Cite this paper
Dutta, P., Saha, S., Chauhan, A.B. (2018). Predicting Degree of Relevance of Pathway Markers from Gene Expression Data: A PSO Based Approach. In: Cheng, L., Leung, A., Ozawa, S. (eds) Neural Information Processing. ICONIP 2018. Lecture Notes in Computer Science(), vol 11305. Springer, Cham. https://doi.org/10.1007/978-3-030-04221-9_1
Download citation
DOI: https://doi.org/10.1007/978-3-030-04221-9_1
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-04220-2
Online ISBN: 978-3-030-04221-9
eBook Packages: Computer ScienceComputer Science (R0)