Abstract
Exploratory data mining, machine learning, and statistical modeling all have a role in discovery science. We describe a paleoecological reconstruction problem where Bayesian methods are useful and allow plausible inferences from the small and vague data sets available. Paleoecological reconstruction aims at estimating temperatures in the past. Knowledge about present day abundances of certain species are combined with data about the same species in fossil assemblages (e.g., lake sediments). Stated formally, the reconstruction task has the form of a typical machine learning problem. However, to obtain useful predictions, a lot of background knowledge about ecological variation is needed. In paleoecological literature the statistical methods are involved variations of regression. We compare these methods with regression trees, nearest neighbor methods, and Bayesian hierarchical models. All the methods achieve about the same prediction accuracy on modern specimens, but the Bayesian methods and the involved regression methods seem to yield the best reconstructions. The advantage of the Bayesian methods is that they also give good estimates on the variability of the reconstructions
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
H. J. B. Birks. Quantitative palaeoenvironmental reconstructions. In D. Maddy and J. Brew, editors, Statistical modelling quaternary science data, pages 161–254. Quaternary Research Association, Cambridge, 1995.
L. Breiman, J. Friedman, R. Olshen, and C. Stone. Classification and Regression Trees. Wadsworth International Group, Belmont,CA, 1984.
A. Gelman, J. B. Carlin, H. S. Stern, and D. B. Rubin. Bayesian Data Analysis. Chapman & Hall, New York, 1995.
W. R. Gilks, S. Richardson, and D. J. Spiegelhalter. Markov Chain Monte Carlo in Practice. Chapman & Hall, London, 1996.
A. Karalic. Employing linear regression in regression tree leaves. In Proceedings of ECAI-92. Wiley & Sons, 1992.
H. Olander, A. Korhola, H. Birks, and T. Blom. An expanded calibration model for inferring lake-water temperatures form fossil chironomid assemblages in northern fennoscandia. Manuscript, 1998.
H. Olander, A. Korhola, and T. Blom. Surface sediment chironomidae (insecta: Diptera) distributions along an ecotonal transect in subarctic fennoscandia: developing a tool for palaeotemperature reconstructions. Journal of Paleolimnology, 1997.
J. R. Quinlan. Combining instance-based and model-based learning. In Proceedings of the Tenth International Conference on Machine Learning, pages 236–243. Morgan Kaufmann Publishers, 1993.
H. Toivonen et al. Bassist. Technical Report C-1998-31, Department of Computer Science, P.O. Box 26, FIN-00014 University of Helsinki, Finland, 1998
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1998 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Mannila, H., Toivonen, H., Korhola, A., Olander, H. (1998). Learning, Mining, or Modeling? A Case Study from Paleoecology. In: Arikawa, S., Motoda, H. (eds) Discovey Science. DS 1998. Lecture Notes in Computer Science(), vol 1532. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-49292-5_2
Download citation
DOI: https://doi.org/10.1007/3-540-49292-5_2
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-65390-5
Online ISBN: 978-3-540-49292-4
eBook Packages: Springer Book Archive