skip to main content
10.1145/3465222.3465223acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicgdaConference Proceedingsconference-collections
Article

Multidimensional analysis of geosciences literature for knowledge discovery

Published:28 July 2021Publication History

ABSTRACT

With the increasing volume of online geosciences data, geoscientists are now facing huge challenges in rapidly discovering and extracting valuable information from a large number of documents. Nowadays, it has become crucial to develop flexible and efficient tools that can help geoscientists to quickly navigate through unstructured texts to reveal hidden patterns and trends. This paper presents a workflow for the multidimensional analysis of geosciences literature. NLP techniques and ontologies are used to automatically identify and extract domain-specific concepts and entities buried in unstructured text. Based on these extracted data, we defined a multidimensional representation form of geosciences text documents which facilitates quantitative and exploratory analysis for knowledge discovery. To illustrate the potential of the proposed workflow, we implemented a pilot system that allows the user to perform multidimensional analysis on large collection of documents through interactive and user-friendly visualizations. We have analyzed the rare earth elements and carbonatites research topic as an example. The obtained visualizations show the usefulness and the efficiency of the proposed system for discovering knowledge and identifying potential research gaps.

References

  1. Sobhana, N., Mitra, P., & Ghosh, S. K. (2010). Conditional random field based named entity recognition in geological text. International Journal of Computer Applications, 1(3), 143-147Google ScholarGoogle ScholarCross RefCross Ref
  2. Leveling, J. (2015, November). Tagging of temporal expressions and geological features in scientific articles. In Proceedings of the 9th Workshop on Geographic Information Retrieval (pp. 1-10).Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Wang, C., Ma, X., Chen, J., & Chen, J. (2018). Information extraction and knowledge graph construction from geoscience literature. Computers & Geosciences, 112, 112-120.Google ScholarGoogle ScholarCross RefCross Ref
  4. Fan, R., Wang, L., Yan, J., Song, W., Zhu, Y., & Chen, X. (2020). Deep Learning-Based Named Entity Recognition and Knowledge Graph Construction for Geological Hazards. ISPRS International Journal of Geo-Information, 9(1), 15.Google ScholarGoogle ScholarCross RefCross Ref
  5. Teufel, S., & Moens, M. (2002). Summarizing scientific articles: experiments with relevance and rhetorical status. Computational linguistics, 28(4), 409-445.Google ScholarGoogle Scholar
  6. Ibekwe-Sanjuan, F., Silvia, F., Eric, S., & Eric, C. (2011). Annotation of scientific summaries for information retrieval. arXiv preprint arXiv:1110.5722.Google ScholarGoogle Scholar
  7. Liu, Y., Wu, F., Liu, M., & Liu, B. (2013). Abstract sentence classification for scientific papers based on transductive SVM. Computer and Information Science, 6(4), 125.Google ScholarGoogle ScholarCross RefCross Ref
  8. Huber, R., & Klump, J. (2015). Agenames a stratigraphic information harvester and text parser. Earth Science Informatics, 8(1), 125-134.Google ScholarGoogle ScholarCross RefCross Ref
  9. Leidner, J. L. (2008). Toponym resolution in text: Annotation, evaluation and applications of spatial grounding of place names. Universal-Publishers..Google ScholarGoogle Scholar
  10. B. Technologies. CLAVIN: Cartographic Location And Vicinity INdexer. http://clavin.bericotechnologies.com/, 2012–2013.s.Google ScholarGoogle Scholar
  11. ORRIS, G. J. and R. I. GRAUCH (2002). Rare earth element mines, deposits and occurrences, Open-File Report 2002-189, US Geological Survey, Reston, Va, 167pGoogle ScholarGoogle Scholar
  12. Kynicky, J., Smith, M. P., & Xu, C. (2012). Diversity of rare earth deposits: the key example of China. Elements, 8(5), 361-367.Google ScholarGoogle ScholarCross RefCross Ref
  13. Annad, O., Bendaoud, A., & Goria, S. (2017). Web information monitoring and crowdsourcing for promoting and enhancing the Algerian geoheritage. Arabian Journal of Geosciences, 10(13), 1-15.Google ScholarGoogle ScholarCross RefCross Ref

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in
  • Published in

    cover image ACM Other conferences
    ICGDA '21: Proceedings of the 2021 4th International Conference on Geoinformatics and Data Analysis
    April 2021
    78 pages
    ISBN:9781450389341
    DOI:10.1145/3465222

    Copyright © 2021 ACM

    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    • Published: 28 July 2021

    Permissions

    Request permissions about this article.

    Request Permissions

    Check for updates

    Qualifiers

    • Article
    • Research
    • Refereed limited
  • Article Metrics

    • Downloads (Last 12 months)7
    • Downloads (Last 6 weeks)2

    Other Metrics

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format .

View HTML Format