Agenames a stratigraphic information harvester and text parser

  • Research Article
Earth Science Informatics

Abstract

A common task for earth scientists is finding stratigraphic background information on a rock unit, e.g. its age, its properties, and its position within the hierarchy of stratigraphic units. Likewise, when geoscientists search databases or the internet, stratigraphic and geospatial constraints provide a first orientation within a huge amount of data. However, spatio-temporal information is mostly encoded only implicitly, e.g. in the titles and abstracts held in bibliographic databases and library catalogues. Means of decoding spatial information from such texts have become commonly available through gazetteers and geocoding services, but the paleotemporal information remains elusive. Agenames is a stratigraphic information harvester and text parser that offers a web service to parse geological texts and identify stratigraphic terms. The service has both a web-based GUI and a REST interface. The Agenames ontology records the stratigraphic rank of a unit (e.g. a chronostratigraphic or lithostratigraphic unit) and the hierarchical relations between terms. Any geologic body has an associated age of origin, assigned by relation to a geochronologic unit from the geologic time scale. In a given text, Agenames identifies potential stratigraphic keywords and uses these terms to assign a geological age estimate. The web service can also be used to augment web portals or library catalogues: Agenames can generate indices that relate library catalogue entries or web pages to the approximate geological epoch covered in a text. This allows, for instance, stratigraphic age to be included in a catalogue search without the need to specify all possibly relevant terms in complex queries.
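The idea of identifying stratigraphic keywords in a text and deriving an age estimate from them can be illustrated with a minimal sketch. It is not the Agenames implementation or its REST interface: the `Term` structure, the toy vocabulary, the function names, and the rounded age bounds (in Ma) are all assumptions made for illustration only.

```python
import re
from dataclasses import dataclass
from typing import Optional, Tuple

@dataclass(frozen=True)
class Term:
    name: str
    rank: str                                # e.g. "System", "Series", "Formation"
    broader: Optional[str]                   # next term up in the hierarchy
    age_unit: Optional[str]                  # time-scale unit that dates this term
    age_ma: Optional[Tuple[float, float]]    # (older, younger) bounds, approximate

# Hypothetical toy vocabulary; ages are rounded, illustrative values.
VOCAB = {
    t.name.lower(): t
    for t in [
        Term("Mesozoic", "Erathem", None, None, (252.0, 66.0)),
        Term("Jurassic", "System", "Mesozoic", None, (201.0, 145.0)),
        Term("Upper Jurassic", "Series", "Jurassic", None, (164.0, 145.0)),
        Term("Cretaceous", "System", "Mesozoic", None, (145.0, 66.0)),
        # A lithostratigraphic unit carries no age itself; it is dated by relation.
        Term("Solnhofen Limestone", "Formation", None, "Upper Jurassic", None),
    ]
}

def resolve_age(term):
    """Return the term's own age range, or that of the unit it is related to."""
    if term.age_ma is not None:
        return term.age_ma
    if term.age_unit is not None:
        return resolve_age(VOCAB[term.age_unit.lower()])
    return None

def find_terms(text):
    """Return vocabulary terms found in the text, preferring longer names."""
    names = sorted(VOCAB, key=len, reverse=True)
    alternatives = "|".join(re.escape(n) for n in names)
    pattern = re.compile(r"\b(" + alternatives + r")\b", re.IGNORECASE)
    return [VOCAB[m.group(1).lower()] for m in pattern.finditer(text)]

def estimate_age(text):
    """Combine the ranges of all matched terms into one enclosing range."""
    ranges = [r for r in (resolve_age(t) for t in find_terms(text)) if r]
    if not ranges:
        return None
    return (max(r[0] for r in ranges), min(r[1] for r in ranges))
```

In this sketch, a title mentioning only "Solnhofen Limestone" would be assigned the range of the Upper Jurassic through the `age_unit` relation, so a catalogue search for Jurassic material could find it even though the word "Jurassic" never appears. Taking the enclosing range of all matches is one simple design choice; a real service would likely weigh terms by rank or context.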

Author information

Corresponding author

Correspondence to Robert Huber.

Additional information

Communicated by: H. A. Babaie

Published in the Special Issue with Guest Editors Dr. Xiaogang Ma, Dr. Peter Fox, Dr. Thomas Narock and Dr. Brian Wilson.

About this article

Cite this article

Huber, R., Klump, J. Agenames a stratigraphic information harvester and text parser. Earth Sci Inform 8, 125–134 (2015). https://doi.org/10.1007/s12145-014-0171-5

  • DOI: https://doi.org/10.1007/s12145-014-0171-5
