Abstract
Interaction between species in microbial communities plays an important role in the functioning of all ecosystems, from cropland soils to human gut microbiota. Many statistical approaches have been proposed to infer these interactions from microbial abundance information. However, these statistical approaches have no general mechanisms for incorporating existing ecological knowledge in the inference process. We propose an Abductive/Inductive Logic Programming (A/ILP) framework to infer microbial interactions from microbial abundance data, by including logical descriptions of different types of interaction as background knowledge in the learning. This framework also includes a new mechanism for estimating the probability of each interaction based on the frequency and compression of hypotheses computed during the abduction process. This is then used to identify real interactions using a bootstrapping, re-sampling procedure. We evaluate our proposed framework on simulated data previously used to benchmark statistical interaction inference tools. Our approach has comparable accuracy to SparCC, which is one of the state-of-the-art statistical interaction inference algorithms, but with the the advantage of including ecological background knowledge. Our proposed framework opens up the opportunity of inferring ecological interaction information from diverse ecosystems that currently cannot be studied using other methods.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Amini, A., Muggleton, S.H., Lodhi, H., Sternberg, M.J.E.: A novel logic-based approach for quantitative toxicology prediction. J. Chem. Inf. Model. 47(3), 998–1006 (2007). https://doi.org/10.1021/ci600223d
Bryant, C.H., Muggleton, S.H., Oliver, S.G., Kell, D.B., Reiser, P., King, R.D.: Combining inductive logic programming, active learning and robotics to discover the function of genes. Electron. Trans. Artif. Intell. 6, 1–36 (2001)
Derocles, S.A., et al.: Chapter one - biomonitoring for the 21st century: integrating next-generation sequencing into ecological network analysis. In: Advances in Ecological Research, vol. 58. Academic Press (2018). https://doi.org/10.1016/bs.aecr.2017.12.001
Efron, B., Tibshirani, R.J.: An Introduction to the Bootstrap. No. 57 in Monographs on Statistics and Applied Probability, Chapman & Hall/CRC, Boca Raton, Florida, USA (1993)
Faust, K., Raes, J.: Microbial interactions: from networks to models. Nat. Rev. Microbiol. 10, 538–550 (2012). https://doi.org/10.1038/nrmicro2832
Faust, K., Raes, J.: CoNet app: inference of biological association networks using Cytoscape. F1000Research 5 (2016). https://doi.org/10.12688/f1000research.9050.2
Friedman, J., Alm, E.J.: Inferring correlation networks from genomic survey data. PLoS Comput. Biol. 8(9) (2012). https://doi.org/10.1371/journal.pcbi.1002687
Getoor, L., Taskar, B.: Introduction to Statistical Relational Learning (Adaptive Computation and Machine Learning). The MIT Press (2007)
Gloor, G.B., Macklaim, J.M., Pawlowsky-Glahn, V., Egozcue, J.J.: Microbiome datasets are compositional: and this is not optional. Front. Microbiol. 8, 2224 (2017). https://doi.org/10.3389/fmicb.2017.02224
Golubev, W.: Antagonistic Interactions Among Yeasts. Springer, Berlin Heidelberg, Berlin, Heidelberg (2006). https://doi.org/10.1007/3-540-30985-3_10
Li, J., Tai, B.C., Nott, D.J.: Confidence interval for the bootstrap p-value and sample size calculation of the bootstrap test. J. Nonparametric Stat. 21(5), 649–661 (2009). https://doi.org/10.1080/10485250902770035
Muggleton, S.: Inverse entailment and progol. NGCO 13(3), 245–286 (1995). https://doi.org/10.1007/BF03037227
Muggleton, S.H., Bryant, C.H.: Theory Completion Using Inverse Entailment. Springer, Berlin Heidelberg (2000)
Robin, X., et al.: proc: an open-source package for r and s+ to analyze and compare roc curves. BMC Bioinform. 12, 77 (2011)
Röttjers, L., Faust, K.: From hairballs to hypotheses–biological insights from microbial networks. FEMS Microbiol. Rev. 42(6), 761–780 (2018). https://doi.org/10.1093/femsre/fuy030
Shoemaker, W.R., Locey, K.J., Lennon, J.T.: A macroecological theory of microbial biodiversity. Nat. Ecol. Evol. 1(0107), 1–6 (2017). https://doi.org/10.1038/s41559-017-0107
Tamaddoni-Nezhad, A., Bohan, D., Raybould, A., Muggleton, S.H.: Machine learning a probabilistic network of ecological interactions. In: Muggleton, S.H., Tamaddoni-Nezhad, A., Lisi, F.A. (eds.) ILP 2011. LNCS (LNAI), vol. 7207, pp. 332–346. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-31951-8_28
Tamaddoni-Nezhad, A., Milani, G., Raybould, A., Muggleton, S., Bohan, D.: Construction and validation of food-webs using logic-based machine learning and text-mining. Adv. Ecol. Res. 49, 225–289 (2013)
Vacher, C., et al.: Chapter one - learning ecological networks from next-generation sequencing data. In: Advances in Ecological Research, vol. 54. Academic Press (2016). https://doi.org/10.1016/bs.aecr.2015.10.004
Watts, S.C., Ritchie, S.C., Inouye, M., Holt, K.E.: FastSpar: rapid and scalable correlation estimation for compositional data. Bioinformatics 35(6), 1064–1066 (08 2018). https://doi.org/10.1093/bioinformatics/bty734
Weiss, S., et al.: Correlation detection strategies in microbial data sets vary widely in sensitivity and precision. ISME J. 10, 1669–1681 (2016). https://doi.org/10.1038/ismej.2015.235
Acknowledgements
This work was supported by the Agence Nationale de la Recherche, Grant/ Award Number: ANR-17-CE32-0011, and SYNGENTA CROP PROTECTION AG. Corinne Vacher and David A. Bohan acknowledge the support of the Learn-Biocontrol project, funded by the INRAE MEM metaprogramme, and the BCMicrobiome project funded by the Consortium Biocontrôle. Alireza Tamaddoni-Nezhad and Stephen Muggleton were supported by the EPSRC Network Plus grant on Human-Like Computing (HLC).
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 Springer Nature Switzerland AG
About this paper
Cite this paper
Barroso-Bergada, D., Tamaddoni-Nezhad, A., Muggleton, S.H., Vacher, C., Galic, N., Bohan, D.A. (2022). Machine Learning of Microbial Interactions Using Abductive ILP and Hypothesis Frequency/Compression Estimation. In: Katzouris, N., Artikis, A. (eds) Inductive Logic Programming. ILP 2021. Lecture Notes in Computer Science(), vol 13191. Springer, Cham. https://doi.org/10.1007/978-3-030-97454-1_3
Download citation
DOI: https://doi.org/10.1007/978-3-030-97454-1_3
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-97453-4
Online ISBN: 978-3-030-97454-1
eBook Packages: Computer ScienceComputer Science (R0)