Combining Protein-Protein Interaction (PPI) Network and Sequence Attributes for Predicting Hypertension Related Proteins

Dobson, Richard J. B.; Munroe, Patricia B.; Mein, Charles A.; Caulfield, Mark J.; Saqi, Mansoor A. S.

doi:10.1007/978-3-540-70600-7_28

Richard J. B. Dobson¹,
Patricia B. Munroe¹,
Charles A. Mein¹,
Mark J. Caulfield¹ &
…
Mansoor A. S. Saqi¹

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 13))

Included in the following conference series:

International Conference on Bioinformatics Research and Development

735 Accesses

Abstract

Cardiovascular disease is set to become the number one cause of deaths worldwide. It is therefore important to understand the etiologic mechanisms for hypertension, in order to identify new routes to improved treatment. Human hypertension arises from a combination of genetic factors and lifestyle influences. Here we study hypertension related proteins from the perspective of protein-protein interaction (PPI) networks, pathways, Gene Ontology (GO) categories and sequence properties. We find that hypertension related proteins are not generally associated with network hubs and do not exhibit high clustering coefficients. Despite this, they tend to be closer and better connected to other hypertension proteins on the interaction network than we would expect, with 23% directly interacting. We find that molecular function category ‘oxidoreductase’ and biological process categories ‘response to stimulus’ and ‘electron transport’ are overrepresented. We also find that functional similarity does not correlate strongly with PPI distance separating hypertension related protein pairs and known hypertension related proteins are spread across 36 KEGG pathways. Finally, weighted Bagged PART classifiers were used to build predictive models that combined amino acid sequence with PPI network and GO properties.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 89.00; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls. Nature 447(7145) 661–678 (2007)
Google Scholar
Adie, E.A., Adams, R.R., Evans, K.L., Porteous, D.J., Pickard, B.S.: Speeding disease gene discovery by sequence based candidate prioritization. BMC Bioinformatics 6, 55 (2005)
Article Google Scholar
Adie, E.A., Adams, R.R., Evans, K.L., Porteous, D.J., Pickard, B.S.: SUSPECTS: enabling fast and effective prioritization of positional candidates. Bioinformatics 22(6), 773–774 (2006)
Article Google Scholar
Ashburner, M., Ball, C.A., Blake, J.A., Botstein, D., Butler, H., Cherry, J.M., Davis, A.P., Dolinski, K., Dwight, S.S., Eppig, J.T., Harris, M.A., Hill, D.P., Issel-Tarver, L., Kasarskis, A., Lewis, S., Matese, J.C., Richardson, J.E., Ringwald, M., Rubin, G.M., Sherlock, G.: Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat. Genet. 25(1), 25–29 (2000)
Article Google Scholar
Boeckmann, B., Bairoch, A., Apweiler, R., Blatter, M.C., Estreicher, A., Gasteiger, E., Martin, M.J., Michoud, K., O’Donovan, C., Phan, I., Pilbout, S., Schneider, M.: The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003. Nucleic Acids Res. 31(1), 365–370 (2003)
Article Google Scholar
Breiman, L.: Bagging predictors. Machine Learning 24(2), 123–140 (1996)
MATH MathSciNet Google Scholar
Brown, K.R., Jurisica, I.: Online predicted human interaction database. Bioinformatics 21(9), 2076–2082 (2005)
Article Google Scholar
Chen, C., Liaw, A., Breiman, L.: Using random forest to learn imbalanced data. Technical Report 666, Department of Statistics, University of California, Berkeley (2004), http://www.stat.berkeley.edu/tech-reports/666.pdf
Chen, J.Y., Shen, C., Sivachenko, A.Y.: Mining Alzheimer disease relevant proteins from integrated protein interactome data. In: Pac. Symp. Biocomput., pp. 367–378 (2006)
Google Scholar
Dijkstra, E.W.: A note on two problems in connexion with graphs. Numerische Mathematik 1, 269–271 (1959)
Article MATH MathSciNet Google Scholar
Dondoshansky, I.: Blastclust (NCBI Software Development Toolkit), 6.1 edn., NCBI, Bethesda, MD (2002)
Google Scholar
Ezzati, M., Vander Hoorn, S., Lawes, C.M., Leach, R., James, W.P., Lopez, A.D., Rodgers, A., Murray, C.J.: Rethinking the ”diseases of affluence” paradigm: global patterns of nutritional risks in relation to economic development. PLoS Med 2(5), e133 (2005)
Article Google Scholar
Frank, E., Witten, I.H.: Generating accurate rule sets without global optimization. In: Proc. 15th International Conf. on Machine Learning, pp. 144–151. Morgan Kaufmann, San Francisco (1998)
Google Scholar
George, R.A., Liu, J.Y., Feng, L.L., Bryson-Richardson, R.J., Fatkin, D., Wouters, M.A.: Analysis of protein sequence and interaction data for candidate disease gene prediction. Nucleic Acids Res. 34(19), e130 (2006)
Article Google Scholar
Goh, K.I., Cusick, M.E., Valle, D., Childs, B., Vidal, M., Barabasi, A.L.: The human disease network. Proc. Natl. Acad. Sci. U S A 104(21), 8685–8690 (2007)
Article Google Scholar
Hamosh, A., Scott, A.F., Amberger, J., Bocchini, C., Valle, D., McKusick, V.A.: Online Mendelian Inheritance in Man (OMIM), a knowledgebase of human genes and genetic disorders. Nucleic Acids Res. 30(1), 52–55 (2002)
Article Google Scholar
Jonsson, P.F., Bates, P.A.: Global topological features of cancer proteins in the human interactome. Bioinformatics 22(18), 2291–2297 (2006)
Article Google Scholar
Kanehisa, M., Goto, S., Hattori, M., Aoki-Kinoshita, K.F., Itoh, M., Kawashima, S., Katayama, T., Araki, M., Hirakawa, M.: From genomics to chemical genomics: new developments in KEGG. Nucleic Acids Res. 34(Database issue), 354–357 (2006)
Article Google Scholar
Kyte, J., Doolittle, R.F.: A simple method for displaying the hydropathic character of a protein. J. Mol. Biol. 157(1), 105–132 (1982)
Article Google Scholar
Lifton, R.P., Gharavi, A.G., Geller, D.S.: Molecular mechanisms of human hypertension. Cell 104(4), 545–556 (2001)
Article Google Scholar
Lopez-Bigas, N., Ouzounis, C.A.: Genome-wide identification of genes likely to be involved in human genetic disease. Nucleic Acids Res. 32(10), 3108–3114 (2004)
Article Google Scholar
Perez-Iratxeta, C., Wjst, M., Bork, P., Andrade, M.A.: G2D: a tool for mining genes associated with disease. BMC Genet. 6, 45 (2005)
Article Google Scholar
Rual, J.F., Venkatesan, K., Hao, T., Hirozane-Kishikawa, T., Dricot, A., Li, N., Berriz, G.F., Gibbons, F.D., Dreze, M., Ayivi-Guedehoussou, N., Klitgord, N., Simon, C., Boxem, M., Milstein, S., Rosenberg, J., Goldberg, D.S., Zhang, L.V., Wong, S.L., Franklin, G., Li, S., Albala, J.S., Lim, J., Fraughton, C., Llamosas, E., Cevik, S., Bex, C., Lamesch, P., Sikorski, R.S., Vandenhaute, J., Zoghbi, H.Y., Smolyar, A., Bosak, S., Sequerra, R., Doucette-Stamm, L., Cusick, M.E., Hill, D.E., Roth, F.P., Vidal, M.: Towards a proteome-scale map of the human protein-protein interaction network. Nature 437(7062), 1173–1178 (2005)
Article Google Scholar
Sladek, R., Rocheleau, G., Rung, J., Dina, C., Shen, L., Serre, D., Boutin, P., Vincent, D., Belisle, A., Hadjadj, S., Balkau, B., Heude, B., Charpentier, G., Hudson, T.J., Montpetit, A., Pshezhetsky, A.V., Prentki, M., Posner, B.I., Balding, D.J., Meyre, D., Polychronakos, C., Froguel, P.: A genome-wide association study identifies novel risk loci for type 2 diabetes. Nature 445(7130), 881–885 (2007)
Article Google Scholar
Stelzl, U., Worm, U., Lalowski, M., Haenig, C., Brembeck, F.H., Goehler, H., Stroedicke, M., Zenkner, M., Schoenherr, A., Koeppen, S., Timm, J., Mintzlaff, S., Abraham, C., Bock, N., Kietzmann, S., Goedde, A., Toksoz, E., Droege, A., Krobitsch, S., Korn, B., Birchmeier, W., Lehrach, H., Wanker, E.E.: A human protein-protein interaction network: a resource for annotating the proteome. Cell 122(6), 957–968 (2005)
Article Google Scholar
Tiffin, N., Kelso, J.F., Powell, A.R., Pan, H., Bajic, V.B., Hide, W.A.: Integration of text- and data-mining using ontologies successfully selects disease gene candidates. Nucleic Acids Res 33(5), 1544–1552 (2005)
Article Google Scholar
Wang, J.Z., Du, Z., Payattakool, R., Yu, P.S., Chen, C.F.: A new method to measure the semantic similarity of GO terms. Bioinformatics 23(10), 1274–1281 (2007)
Article Google Scholar
Watts, D.J., Strogatz, S.H.: Collective dynamics of ’small-world’ networks. Nature 393(6684), 440–442 (1998)
Article Google Scholar
Witten, I., Frank, E.: Data Mining: Practical Machine Learning Tools and Techniques with Java Implementations. Morgan Kaufmann, San Francisco (1999)
Google Scholar
Xu, J., Li, Y.: Discovering disease-genes by topological features in human protein-protein interaction network. Bioinformatics 22(22), 2800–2805 (2006)
Article Google Scholar

Download references

Author information

Authors and Affiliations

The Genome Centre, St Barts and The London School of Medicine and Dentistry, Charterhouse Sq., London, EC1 6BQ
Richard J. B. Dobson, Patricia B. Munroe, Charles A. Mein, Mark J. Caulfield & Mansoor A. S. Saqi

Authors

Richard J. B. Dobson
View author publications
You can also search for this author in PubMed Google Scholar
Patricia B. Munroe
View author publications
You can also search for this author in PubMed Google Scholar
Charles A. Mein
View author publications
You can also search for this author in PubMed Google Scholar
Mark J. Caulfield
View author publications
You can also search for this author in PubMed Google Scholar
Mansoor A. S. Saqi
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Mourad Elloumi Josef Küng Michal Linial Robert F. Murphy Kristan Schneider Cristian Toma

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Dobson, R.J.B., Munroe, P.B., Mein, C.A., Caulfield, M.J., Saqi, M.A.S. (2008). Combining Protein-Protein Interaction (PPI) Network and Sequence Attributes for Predicting Hypertension Related Proteins. In: Elloumi, M., Küng, J., Linial, M., Murphy, R.F., Schneider, K., Toma, C. (eds) Bioinformatics Research and Development. BIRD 2008. Communications in Computer and Information Science, vol 13. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-70600-7_28

Download citation

DOI: https://doi.org/10.1007/978-3-540-70600-7_28
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-70598-7
Online ISBN: 978-3-540-70600-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics