Skip to main content

Extracting Temporal Information from Short Messages

  • Conference paper
Data Management. Data, Data Everywhere (BNCOD 2007)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4587))

Included in the following conference series:

  • 617 Accesses

Abstract

Information Extraction, the process of eliciting data from natural language documents, usually relies on the ability to parse the document and then to detect the meaning of the sentences by exploiting the syntactic structures encountered. In previous papers, we have discussed an application to extract information from short (e-mail and text) messages which takes an alternative approach. The application is lightweight and uses pattern matching rather than parsing, since parsing is not feasible for messages in which both the syntax and the spelling are unreliable. The application works in the context of a high level database schema and identifies sentences which make statements about data describable by this schema. The application matches sentences with templates to identify metadata terms and the data values associated with them. However, the initial prototype could only manage simple, time independent assertions about the data, such as "Jane Austen is the author." This paper describes an extension to the application which can extract temporal data, both time instants and time periods. It also manages time stamps - temporal information which partitions the values of time varying attributes, such as the monarch of a country. In order to achieve this, the original data model has had to be extended with a temporal component and a set of sentence templates has been constructed to recognise statements in this model. The paper describes the temporal model and the extensions to the application, concluding with a worked example.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Cardie, C.: Empirical Methods in Information Extraction. AI Magazine 18(4), 1–17 (1997)

    Google Scholar 

  2. Gaizauskas, R., Wilks, Y.: Information Extraction: Beyond Document Retrieval. The Journal of Documentation 54(1), 1–34 (1998)

    Article  Google Scholar 

  3. Cooper, R., Ali, S.: Extracting Data from Short Messages, Natural Language Processing and Information Systems. In: Montoyo, A., Muńoz, R., Métais, E. (eds.) NLDB 2005. LNCS, vol. 3513, pp. 388–391. Springer, Heidelberg (2005)

    Google Scholar 

  4. Cooper, R., Ali, S.: Extracting Data from Personal Text Messages, Technical Report, Computing Science, University of Glasgow (2006)

    Google Scholar 

  5. Kang, I., Na, S., Lee, J., Yang, G.: Lightweight Natural Language Database Interfaces. Meziane & Métais, 76–88 (2004)

    Google Scholar 

  6. Vargas-Vera, M., Dominque, J., Kalfoglou, Y., Motta, E., Buckingham-Schum, S.: Template-driven information extraction for populating ontologies. In: Proceedings of IJCAI’01 Workshop on Ontology Learning, Seattle, WA, USA (2001)

    Google Scholar 

  7. Miller, G.: WordNet: A Lexical Database for English. Communications of the ACM 38(11), 39–4 (1995)

    Article  Google Scholar 

  8. Lorentzos, N.: The Interval-extended Relational Model and Its Application to Valid-time Databases, in [10], pp. 67–91 (1993)

    Google Scholar 

  9. Navathe, S., Ahmed, R.: Temporal Extensions to the Relational Model and SQL, in [10], pp. 92–109 (1993)

    Google Scholar 

  10. Tansel, A., Clifford, J., Gadia, S., Jajodia, S., Segev, A., Snodgrass, R.: Temporal Databases Theory, Design And Implementation, Benjamin Cummings, 1992 (1993)

    Google Scholar 

  11. Cooper, R.: A Strategy for Using More of the Language in Extracting Information from Short Messages (submitted to NLDB) (2007)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Richard Cooper Jessie Kennedy

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Cooper, R., Manson, S. (2007). Extracting Temporal Information from Short Messages. In: Cooper, R., Kennedy, J. (eds) Data Management. Data, Data Everywhere. BNCOD 2007. Lecture Notes in Computer Science, vol 4587. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-73390-4_25

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-73390-4_25

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-73389-8

  • Online ISBN: 978-3-540-73390-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics