Abstract
Signaling and metabolic pathways, which consist of a series of reactions producing target molecules from source compounds, are cornerstones of cellular biology. The cellular reaction networks containing such pathways can be precisely modeled by directed hypergraphs, where each reaction corresponds to a hyperedge, directed from its set of reactants to its set of products. Given such a network represented by a directed hypergraph, inferring the most likely set of reactions that produce a given target from a given set of sources corresponds to finding a shortest hyperpath, which is NP-complete. The best methods currently available for shortest hyperpaths either offer no guarantee of optimality, or exclude hyperpaths containing cycles even though cycles are abundant in real biological pathways.
We derive a novel graph-theoretic characterization of hyperpaths, leveraged in a new formulation of the general shortest hyperpath problem as an integer linear program that for the first time handles hyperpaths containing cycles, and present a novel cutting-plane algorithm that can solve this integer program to optimality in practice. This represents a major advance over the best prior exact algorithm, which was limited to acyclic hyperpaths (and hence fails to find a solution for the many biological instances where all hyperpaths are in fact cyclic). In comprehensive experiments over thousands of instances from the standard NCI-PID and Reactome databases, we demonstrate that our cutting-plane algorithm quickly finds an optimal hyperpath, with a median running-time of under ten seconds and a maximum time of around thirty minutes, even on large instances with many thousands of reactions.
Source code implementing our cutting-plane algorithm for shortest hyperpaths in a new tool called Mmunin is available free for research use at http://mmunin.cs.arizona.edu.
S. Krieger—Research performed at the Department of Computer Science of the University of Arizona.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Acuña, V., Milreu, P.V., Cottret, L., Marchetti-Spaccamela, A., Stougie, L., Sagot, M.F.: Algorithms and complexity of enumerating minimal precursor sets in genome-wide metabolic networks. Bioinformatics 28(19), 2474–2483 (2012)
Alberts, B., Johnson, A., Lewis, J., Raff, M., Roberts, K., Walter, P.: Molecular Biology of the Cell. Garland Science, New York (2007)
Andrade, R., Wannagat, M., Klein, C.C., Acuña, V., Marchetti-Spaccamela, A., Milreu, P.V., Stougie, L., Sagot, M.F.: Enumeration of minimal stoichiometric precursor sets in metabolic networks. Alg. Mol. Bio. 11(1), 25 (2016)
Ausiello, G., Laura, L.: Directed hypergraphs: introduction and fundamental algorithms–a survey. Theoret. Comput. Sci. 658, 293–306 (2017)
Bodor, J., Habener, J.F.: Role of transcriptional repressor ICER in cyclic amp-mediated attenuation of cytokine gene expression in human thymocytes. J. Biol. Chem. 273(16), 9544–9551 (1998)
Carbonell, P., Fichera, D., Pandit, S.B., Faulon, J.L.: Enumerating metabolic pathways for the production of heterologous target chemicals in chassis organisms. BMC Syst. Biol. 6(1), 10 (2012)
Christensen, T.S., Oliveira, A.P., Nielsen, J.: Reconstruction and logical modeling of glucose repression signaling pathways in Saccharomyces cerevisiae. BMC Syst. Biol. 3(1), 7 (2009)
Cottret, L., et al.: Enumerating precursor sets of target metabolites in a metabolic network. In: Proceedings of the 8th Workshop on Algorithms in Bioinformatics, pp. 233–244 (2008)
Gallo, G., Longo, G., Pallottino, S., Nguyen, S.: Directed hypergraphs and applications. Discret. Appl. Math. 42(2–3), 177–201 (1993)
Heath, L.S., Sioson, A.A.: Semantics of multimodal network models. IEEE/ACM Trans. Comput. Biol. Bioinf. 6(2), 271–280 (2009)
Hu, Z., Mellor, J., Wu, J., Kanehisa, M., Stuart, J.M., DeLisi, C.: Towards zoomable multidimensional maps of the cell. Nat. Biotech. 25(5), 547–554 (2007)
Italiano, G.F., Nanni, U.: Online maintenance of minimal directed hypergraphs. Department of Computer Science, Columbia University, Tech. Rep. (1989)
Joshi-Tope, G., et al.: Reactome: a knowledgebase of biological pathways. Nucleic Acids Res. 33, D428-432 (2005)
Klamt, S., Haus, U.U., Theis, F.: Hypergraphs and cellular networks. PLoS Comput. Biol. 5(5), e1000385 (2009)
Krieger, S.: Algorithmic Inference of Cellular Reaction Pathways and Protein Secondary Structure. Ph.D. dissertation, Department of Computer Science, The University of Arizona (July 2022)
Krieger, S., Kececioglu, J.: Fast approximate shortest hyperpaths for inferring pathways in cell signaling hypergraphs. In: Proceedings of the 21st ISCB Workshop on Algorithms in Bioinformatics (WABI). Leibniz International Proceedings in Informatics, vol. 201, pp. 1–20 (2021)
Krieger, S., Kececioglu, J.: Computing optimal factories in metabolic networks with negative regulation. Bioinformatics, In: Proceedings of the 30th ISCB Conference on Intelligent Systems for Molecular Biology (ISMB) 38(Suppl_1), i369–i377 (2022)
Krieger, S., Kececioglu, J.: Heuristic shortest hyperpaths in cell signaling hypergraphs. Algorithms Mol. Biol. 17(1), 12 (2022)
Krieger, S., Kececioglu, J.: Hhugin: hypergraph heuristic for general shortest source-sink hyperpaths, version 1.0 (2022). http://hhugin.cs.arizona.edu
Krieger, S., Kececioglu, J.: Mmunin: integer-linear-programming-based cutting-plane algorithm for shortest source-sink hyperpaths, version 1.0 (2022). http://mmunin.cs.arizona.edu
Li, Y., McGrail, D.J., Latysheva, N., Yi, S., Babu, M.M., Sahni, N.: Pathway perturbations in signaling networks: linking genotype to phenotype. Semin. Cell Dev. Biol. 99, 3–11 (2020)
Nielsen, L.R., Pretolani, D.: A remark on the definition of a \(B\)-hyperpath. Department of Operations Research, University of Aarhus, Tech. Rep. (2001)
Ramadan, E., Perincheri, S., Tuck, D.: A hyper-graph approach for analyzing transcriptional networks in breast cancer. In: Proceedings of the 1st ACM Conference on Bioinformatics and Computational Biology, pp. 556–562 (2010)
Ramadan, E., Tarafdar, A., Pothen, A.: A hypergraph model for the yeast protein complex network. In: Proceedings of the 18th Parallel and Distributed Processing Symposium, pp. 189–196 (2004)
Ritz, A., Avent, B., Murali, T.: Pathway analysis with signaling hypergraphs. IEEE/ACM Transactions on Comp. Bio. and Bioinf. 14(5), 1042–1055 (2017)
Ritz, A., Murali, T.: Pathway analysis with signaling hypergraphs. In: Proceedings of the 5th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics (ACM-BCB), pp. 249–258 (2014)
Ritz, A., Tegge, A.N., Kim, H., Poirel, C.L., Murali, T.: Signaling hypergraphs. Trends Biotechnol. 32(7), 356–362 (2014)
Schaefer, C.F., et al.: PID: the pathway interaction database. Nucl. Acids Res. 37, 674–679 (2009)
Schwob, M.R., Zhan, J., Dempsey, A.: Modeling cell communication with time-dependent signaling hypergraphs. IEEE/ACM Trans. Comput. Biol. Bioinform. 18, 1151–1163 (2019)
Sharan, R., Ideker, T.: Modeling cellular machinery through biological network comparison. Nat. Biotechnol. 24(4), 427–433 (2006)
Vidal, M., Cusick, M.E., Barabási, A.L.: Interactome networks and human disease. Cell 144(6), 986–998 (2011)
Zarecki, R., Oberhardt, M.A., Reshef, L., Gophna, U., Ruppin, E.: A novel nutritional predictor links microbial fastidiousness with lowered ubiquity, growth rate, and cooperativeness. PLoS Comput. Biol. 10(7), 1–12 (2014)
Zhou, W., Nakhleh, L.: Properties of metabolic graphs: biological organization or representation artifacts? BMC Bioinform. 12(1), 132 (2011)
Acknowledgments
We thank Anna Ritz and T.M. Murali for helpful discussions and for providing the BioPax parser, and the anonymous referees for their useful comments. This research was supported by the US National Science Foundation, through grants CCF-1617192 and IIS-2041613 to JK.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Krieger, S., Kececioglu, J. (2023). Computing Shortest Hyperpaths for Pathway Inference in Cellular Reaction Networks. In: Tang, H. (eds) Research in Computational Molecular Biology. RECOMB 2023. Lecture Notes in Computer Science(), vol 13976. Springer, Cham. https://doi.org/10.1007/978-3-031-29119-7_10
Download citation
DOI: https://doi.org/10.1007/978-3-031-29119-7_10
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-29118-0
Online ISBN: 978-3-031-29119-7
eBook Packages: Computer ScienceComputer Science (R0)