Skip to main content

Comparative Analysis of Machine Learning Models for Time-Series Forecasting of Escherichia Coli Contamination in Portuguese Shellfish Production Areas

  • Conference paper
  • First Online:
Machine Learning, Optimization, and Data Science (LOD 2023)

Abstract

Shellfish farming and harvesting have experienced a surge in popularity in recent years. However, the presence of fecal bacteria can contaminate shellfish, posing a risk to human health. This can result in the reclassification of shellfish production areas or even prohibit harvesting, leading to significant economic losses. Therefore, it is crucial to establish effective strategies for predicting contamination of shellfish by the bacteria Escherichia coli (E. coli). In this study, various univariate and multivariate time series forecasting models were investigated to address this problem. These models include autoregressive integrated moving average (ARIMA), vector autoregressive (VAR), and long short-term memory (LSTM) networks. The data used for this study consisted of measurements of both E. coli concentrations and meteorological variables, which were obtained from the Portuguese Institute of Sea and Atmosphere (IPMA) for four shellfish production areas. Overall, the ARIMA models performed the best with the lowest root mean squared error (RMSE) compared to the other models tested. The ARIMA models were able to accurately predict the concentrations of E. coli one week in advance. Additionally, the models were able to detect the peaks of E. coli for all areas, except for one, with recall values ranging from 0.75 to 1. This work represents the initial steps in the search for candidate forecasting models to help the shellfish production sector in anticipating harvesting prohibitions and hence supporting management and regulation decisions.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 69.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 89.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Mateus, M., et al.: Early warning systems for shellfish safety: the pivotal role of computational science. In: Rodrigues, J.M.F., et al. (eds.) ICCS 2019. LNCS, vol. 11539, pp. 361–375. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-22747-0_28

    Chapter  Google Scholar 

  2. Matarazzo Suplicy, F.: A review of the multiple benefits of mussel farming. Rev. Aquac. 12(1), 204–223 (2020)

    Article  Google Scholar 

  3. Hallegraeff, G., Anderson, D., Cembella, A., Enevoldsen, H.: Manual on Harmful Marine Microalgae, 2nd edn. UNESCO (2004)

    Google Scholar 

  4. Mok, J.S., Shim, K.B., Kwon, J.Y., Kim, P.H.: Bacterial quality evaluation on the shellfish-producing area along the south coast of Korea and suitability for the consumption of shellfish products therein. Fisheries Aquatic Sci. 21(36), (2018)

    Google Scholar 

  5. European Union: Commission Implementing Regulation (EU) 2019/ 627 - of 15 March 2019 - Laying down Uniform Practical Arrangements for the Performance of Official Controls on Products of Animal Origin Intended for Human Consumption in Accordance with Regulation (EU) 2017. Offic. J. Eur. Union, 131, 51–100, (2019)

    Google Scholar 

  6. Schmidt, W., et al.: A generic approach for the development of short-term predictions of Escherichia coli and biotoxins in shellfish. In: Aquaculture Environment Interactions, vol. 10, pp. 173–185 (2018)

    Google Scholar 

  7. Chen, Q., Guan, T., Yun, L., Li, R., Recknagel, F.: Online forecasting chlorophyll a concentrations by an auto-regressive integrated moving average model: feasibilities and potentials. In: Harmful Algae, Elsevier B. V., vol. 43, pp. 58–65 (2015)

    Google Scholar 

  8. Cho, H., Choi, U.-J., Park, H.: Deep learning application to time-series prediction of daily chlorophyll-a concentration. In: WIT Transactions on Ecology and the Environment, vol. 215, pp. 157–163. https://doi.org/10.2495/EID180141

  9. Lee, S., Lee, D.: Improved prediction of harmful algal blooms in four Major South Korea’s rivers using deep learning models. Int. J. Environ. Res. Public Health 15 (2018)

    Google Scholar 

  10. Cruz, R.C., Costa, P.R., Krippahl, L., Lopes, M.B.: Forecasting biotoxin contamination in mussels across production areas of the Portuguese coast with artificial neural networks. Knowl. Based Syst. 257 (2022)

    Google Scholar 

  11. Ciccarelli, C., et al.: Assessment of relationship between rainfall and Escherichia coli in clams (Chamelea gallina) using the Bayes Factor. Italian J. Food Saf. 6(6826) (2017)

    Google Scholar 

  12. Jang, J., Hur, H.G., Sadowsky, M.J., Byappanahalli, M.N., Yan, T., Ishii, S.: Environmental Escherichia coli: ecology and public health implications-a review. J. Appl. Microbiol. 123(3), 570–581 (2017)

    Article  Google Scholar 

  13. Anacleto, P., Pedro, S., Nunes, M.L., Rosa, R., Marques, A.: Microbiological composition of native and exotic clams from Tagus estuary: effect of season and environmental parameters. Mar. Pollut. Bull. 74(1), 116–124 (2013)

    Article  Google Scholar 

  14. Campos, C.J.A., Kershaw, S.R., Lee, R.J.: Environmental influences on faecal indicator organisms in coastal waters and their accumulation in bivalve shellfish. Estuaries Coasts 36, 834–853 (2013)

    Article  Google Scholar 

  15. Brockwell, P.J., Davis, R.A.: Introduction to Time Series and Forecasting, 2nd edn. Springer, Berlin (2002)

    Book  Google Scholar 

  16. Wei, W.W.S.: Multivariate Time Series Analysis and Applications, 1st edn. Wiley, Hoboken (2019)

    Book  Google Scholar 

  17. Chatfield, C.: Time-Series Forecasting. CHAPMAN & HALL/CRC (2001)

    Google Scholar 

  18. Cowpertwait, P.S.P., Metcalfe, A.V.: Introductory Time Series with R. Springer, Berlin (2009)

    Google Scholar 

  19. Tsay, R.S.: Multivariate Time Series Analysis: With R and Financial Applications, 1st edn. Willey, Hopboken (2014)

    Google Scholar 

  20. Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9, 1735–80 (1997)

    Article  Google Scholar 

  21. Goodfellow, I., Bengio, Y., Courville, A.: Deep Learning. MIT Press, Cambridge (2016)

    Google Scholar 

  22. Hewamalage, H., Bergmeir, C., Bandara, K.: Recurrent neural networks for time series forecasting: current status and future directions. Int. J. Forecast. 37(1), 388–427 (2021)

    Article  Google Scholar 

Download references

Acknowledgements

This work was supported by national funds through Fundação para a Ciência e a Tecnologia (FCT) through projects UIDB/00297/2020 and UIDP/00297/2020 (NOVA Math), UIDB/00667/2020 and UIDP/00667/2020 (UNIDEMI), UIDB/50008/2020 (IT), UIDB/50021/2020 (INESC-ID), and also the project MATISSE (DSAIPA/DS/0026/2019), and CEECINST/00042/2021, PTDC/CCI-BIO/4180/2020, and PTDC/CTM-REF/2679/2020. This project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 951970 (OLISSIPO project).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Alexandra M. Carvalho .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Ferraz, F., Ribeiro, D., Lopes, M.B., Pedro, S., Vinga, S., Carvalho, A.M. (2024). Comparative Analysis of Machine Learning Models for Time-Series Forecasting of Escherichia Coli Contamination in Portuguese Shellfish Production Areas. In: Nicosia, G., Ojha, V., La Malfa, E., La Malfa, G., Pardalos, P.M., Umeton, R. (eds) Machine Learning, Optimization, and Data Science. LOD 2023. Lecture Notes in Computer Science, vol 14505. Springer, Cham. https://doi.org/10.1007/978-3-031-53969-5_14

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-53969-5_14

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-53968-8

  • Online ISBN: 978-3-031-53969-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics