Skip to main content

Predicting Degree of Relevance of Pathway Markers from Gene Expression Data: A PSO Based Approach

  • Conference paper
  • First Online:
Neural Information Processing (ICONIP 2018)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 11305))

Included in the following conference series:

Abstract

In functional genomics, a pathway is defined as a set of genes which exhibit similar biological activities. Given a microarray expression data, the corresponding pathway information can be extracted with the use of some public databases. All member genes of a given pathway may not be equally relevant in estimating the activity of that pathway. Some genes can participate adequately in the given pathway, some may have low-associations. Existing literature has either considered all the genes wholly or discarded some genes completely in estimating the corresponding pathway-activity. Inspired by this, the current work reports about an automated approach to measure the degree of relevance of a given gene in predicting the pathway-activity. As a large search space has to be dragged, the exploration properties of particle swarm optimization are utilized in the current context. Particles of the PSO represent different scores of relevance for the member genes of different pathways. In order to deal with the relevance-score, the popular t-score which is widely used in measuring the pathway-activity is expanded in the name of weighted t-score. The proposed PSO-based weighted framework is then evaluated on three gene expression data sets. In order to show the supremacy of the proposed method, top 50% pathway markers are selected for each data set and the quality of these measures is checked after performing 10-fold cross-validation with respect to different quality measures. The results are further validated using biological significance tests.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Alon, U., et al.: Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays. Proc. Natl. Acad. Sci. 96(12), 6745–6750 (1999)

    Article  Google Scholar 

  2. Bandyopadhyay, S., Mallik, S., Mukhopadhyay, A.: A survey and comparative study of statistical tests for identifying differential expression from microarray data. IEEE/ACM Trans. Comput. Biol. Bioinform. 11(1), 95–115 (2014)

    Article  Google Scholar 

  3. Deng, L., Pei, J., Ma, J., Lee, D.L.: A rank sum test method for informative gene discovery. In: Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 410–419. ACM (2004)

    Google Scholar 

  4. Ding, C., Peng, H.: Minimum redundancy feature selection from microarray gene expression data. J. Bioinform. Comput. Biol. 3(02), 185–205 (2005)

    Article  Google Scholar 

  5. Dutta, P., Saha, S.: Fusion of expression values and protein interaction information using multi-objective optimization for improving gene clustering. Comput. Biol. Med. 89, 31–43 (2017)

    Article  Google Scholar 

  6. Ghosh, A., Dhara, B.C., De, R.K.: Selection of genes mediating certain cancers, using a neuro-fuzzy approach. Neurocomputing 133, 122–140 (2014)

    Article  Google Scholar 

  7. Guo, Z., et al.: Towards precise classification of cancers based on robust gene functional expression profiles. BMC Bioinform. 6(1), 58 (2005)

    Article  MathSciNet  Google Scholar 

  8. Huang, D.W., Sherman, B.T., Lempicki, R.A.: Systematic and integrative analysis of large gene lists using david bioinformatics resources. Nat. Protoc. 4(1), 44 (2009)

    Article  Google Scholar 

  9. Huang, T., Chen, L., Cai, Y.D., Chou, K.C.: Classification and analysis of regulatory pathways using graph property, biochemical and physicochemical property, and functional property. PLOS ONE 6(9), 1–11 (2011). https://doi.org/10.1371/journal.pone.0025297

    Article  Google Scholar 

  10. Kennedy, J.: Particle swarm optimization. In: Encyclopedia of Machine Learning, pp. 760–766. Springer, Heidelberg (2011)

    Google Scholar 

  11. Khunlertgit, N., Yoon, B.J.: Identification of robust pathway markers for cancer through rank-based pathway activity inference. Adv. Bioinform. 2013 (2013)

    Google Scholar 

  12. Lee, E., Chuang, H.Y., Kim, J.W., Ideker, T., Lee, D.: Inferring pathway activity toward precise disease classification. PLoS Comput. Biol. 4(11), e1000217 (2008)

    Article  Google Scholar 

  13. Ma, S., Kosorok, M.R.: Identification of differential gene pathways with principal component analysis. Bioinformatics 25(7), 882–889 (2009)

    Article  Google Scholar 

  14. Mandal, M., Mondal, J., Mukhopadhyay, A.: A PSO-based approach for pathway marker identification from gene expression data. IEEE Trans. Nanobiosci. 14(6), 591–597 (2015)

    Article  Google Scholar 

  15. Mandal, M., Mukhopadhyay, A.: A graph-theoretic approach for identifying non-redundant and relevant gene markers from microarray data using multiobjective binary PSO. PloS ONE 9(3), e90949 (2014)

    Article  Google Scholar 

  16. Mukhopadhyay, A., Mandal, M.: Identifying non-redundant gene markers from microarray data: a multiobjective variable length PSO-based approach. IEEE/ACM Trans. Comput. Biol. Bioinform. (TCBB) 11(6), 1170–1183 (2014)

    Article  Google Scholar 

  17. Pang, H., Zhao, H.: Building pathway clusters from random forests classification using class votes. BMC Bioinform, 9(1), 87 (2008)

    Article  Google Scholar 

  18. Parsopoulos, K.E.: Particle Swarm Optimization and Intelligence: Advances and Applications: Advances and Applications. IGI Global, Hershey (2010)

    Google Scholar 

  19. Su, J., Yoon, B.J., Dougherty, E.R.: Accurate and reliable cancer classification based on probabilistic inference of pathway activity. PloS ONE 4(12), e8161 (2009)

    Article  Google Scholar 

  20. Wang, K., Li, M., Bucan, M.: Pathway-based approaches for analysis of genomewide association studies. Am. J. Hum. Genet. 81(6), 1278–1283 (2007)

    Article  Google Scholar 

  21. Wang, Y., Makedon, F.S., Ford, J.C., Pearlman, J.: HykGene: a hybrid approach for selecting marker genes for phenotype classification using microarray gene expression data. Bioinformatics 21(8), 1530–1537 (2004)

    Article  Google Scholar 

  22. Yang, K., Cai, Z., Li, J., Lin, G.: A stable gene selection in microarray data analysis. BMC Bioinform. 7, 228 (2006). https://doi.org/10.1186/1471-2105-7-228

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Pratik Dutta .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Dutta, P., Saha, S., Chauhan, A.B. (2018). Predicting Degree of Relevance of Pathway Markers from Gene Expression Data: A PSO Based Approach. In: Cheng, L., Leung, A., Ozawa, S. (eds) Neural Information Processing. ICONIP 2018. Lecture Notes in Computer Science(), vol 11305. Springer, Cham. https://doi.org/10.1007/978-3-030-04221-9_1

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-04221-9_1

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-04220-2

  • Online ISBN: 978-3-030-04221-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics