Skip to main content

Predicting the Antigenic Variant of Human Influenza A(H3N2) Virus with a Stacked Auto-Encoder Model

  • Conference paper
  • First Online:
Data Science (ICPCSEE 2017)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 728))

  • 1740 Accesses

Abstract

The influenza virus changes its antigenicity frequently due to rapid mutations, leading to immune escape and failure of vaccination. Rapid determination of the influenza antigenicity could help identify the antigenic variants in time. Here, we built a stacked auto-encoder (SAE) model for predicting the antigenic variant of human influenza A(H3N2) viruses based on the hemagglutinin (HA) protein sequences. The model achieved an accuracy of 0.95 in five-fold cross-validations, better than the logistic regression model did. Further analysis of the model shows that most of the active nodes in the hidden layer reflected the combined contribution of multiple residues to antigenic variation. Besides, some features (residues on HA protein) in the input layer were observed to take part in multiple active nodes, such as residue 189, 145 and 156, which were also reported to mostly determine the antigenic variation of influenza A(H3N2) viruses. Overall, this work is not only useful for rapidly identifying antigenic variants in influenza prevention, but also an interesting attempt in inferring the mechanisms of biological process through analysis of SAE model, which may give some insights into interpretation of the deep learning models.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Thompson, W.W., Shay, D.K., Weintraub, E., Brammer, L., Cox, N., Anderson, L.J., et al.: Mortality associated with influenza and respiratory syncytial virus in the United States. JAMA 289, 179–186 (2003)

    Article  Google Scholar 

  2. Taubenberger, J.K., Kash, J.C.: Influenza virus evolution, host adaptation, and pandemic formation. Cell Host Microbe 7, 440–451 (2010)

    Article  Google Scholar 

  3. Stephenson, I., Heath, A., Major, D., Newman, R.W., Hoschler, K., Junzi, W., et al.: Reproducibility of serologic assays for influenza virus A(H5N1). Emerg. Infect. Dis. 15, 1252–1259 (2009)

    Article  Google Scholar 

  4. Lee, M.S., Chen, J.S.E.: Predicting antigenic variants of influenza A/H3N2 viruses. Emerg. Infect. Dis. 10, 1385–1390 (2004)

    Article  Google Scholar 

  5. Wu, A., Peng, Y., Du, X., Shu, Y., Jiang, T.: Correlation of influenza virus excess mortality with antigenic variation: application to rapid estimation of influenza mortality burden. PLoS Comput. Biol. 6(8), e1000882 (2010)

    Article  Google Scholar 

  6. Du, X., Dong, L., Lan, Y., Peng, Y., Wu, A., Zhang, Y., et al.: Mapping of H3N2 influenza antigenic evolution in China reveals a strategy for vaccine strain recommendation. Nat. Commun. 3, 709 (2012)

    Article  Google Scholar 

  7. Li, H., Peng, Y., Zou, Y., Huang, Z., Wu, A., Li, K., et al.: PREDAC-H5: a user-friendly tool for the automated surveillance of antigenic variants for the HPAI H5N1 virus. Infect. Genet. Evol. 28, 62–63 (2014)

    Article  Google Scholar 

  8. Bengio, Y.: Learning deep architectures for AI. Found. Trends® Mach. Learn. 2(1), 1–127 (2009)

    Article  MATH  Google Scholar 

  9. Eickholt, J., Cheng, J.: Predicting protein residue-residue contacts using deep networks and boosting. Bioinformatics 28, 3066–3072 (2012)

    Article  Google Scholar 

  10. Chen, Y., Li, Y., Narayan, R., Subramanian, A., Xie, X.: Gene expression inference with deep learning. Bioinformatics 32, 1832–1839 (2016)

    Article  Google Scholar 

  11. Leung, M.K., Xiong, H.Y., Lee, L.J., Frey, B.J.: Deep learning of the tissue-regulated splicing code. Bioinformatics 30, i121–i129 (2014)

    Article  Google Scholar 

  12. Smith, D.J., Lapedes, A.S., de Jong, J.C., Bestebroer, T.M., Rimmelzwaan, G.F., Osterhaus, A.D., et al.: Mapping the antigenic and genetic evolution of influenza virus. Science 305, 371–376 (2004)

    Article  Google Scholar 

  13. Bao, Y.M., Bolotov, P., Dernovoy, D., Kiryutin, B., Zaslavsky, L., Tatusova, T., et al.: The influenza virus resource at the national center for biotechnology information. J. Virol. 82, 596–601 (2008)

    Article  Google Scholar 

  14. Palm, R.B.: Prediction as a candidate for learning deep hierarchical models of data, Master, Informatics and Mathematical Modelling. Technical University of Denmark (2012)

    Google Scholar 

  15. Koel, B.F., Burke, D.F., Bestebroer, T.M., van der Vliet, S., Zondag, G.C., Vervaet, G., et al.: Substitutions near the receptor binding site determine major antigenic change during influenza virus evolution. Science 342, 976–979 (2013)

    Article  Google Scholar 

Download references

Acknowledgement

This study was supported by the National Natural Science Foundation (31500126 and 31371338), the National Key Plan for Scientific Research and Development of China (2016YFD0500300 and 2016YFC1200200), and the Chinese Academy of Medical Sciences (2016-I2 M-1-005).

There are no conflicts of interest.

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Taijiao Jiang or Yousong Peng .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer Nature Singapore Pte Ltd.

About this paper

Cite this paper

Tan, Z., Xu, B., Li, K., Jiang, T., Peng, Y. (2017). Predicting the Antigenic Variant of Human Influenza A(H3N2) Virus with a Stacked Auto-Encoder Model. In: Zou, B., Han, Q., Sun, G., Jing, W., Peng, X., Lu, Z. (eds) Data Science. ICPCSEE 2017. Communications in Computer and Information Science, vol 728. Springer, Singapore. https://doi.org/10.1007/978-981-10-6388-6_25

Download citation

  • DOI: https://doi.org/10.1007/978-981-10-6388-6_25

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-10-6387-9

  • Online ISBN: 978-981-10-6388-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics