Skip to main content

A Term-Based Methodology for Template Creation in Information Extraction

  • Conference paper
  • First Online:
Natural Language Processing — NLP 2000 (NLP 2000)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 1835))

Included in the following conference series:

  • 927 Accesses

Abstract

In this paper, we are concerned with the problem of automatic template creation for Information Extraction (IE) and we present a methodology for the creation of IE templates. Our approach proposes the semi-automatic construction of a semantic representation of textual information based on recognition of multi-word and nested terms and Named Entities (NEs) and subsequent exploitation of term and NE context for the induction of Information Extraction template rules.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Bagga, A., J. Y. Chai and A. Biermann: The Role of WordNet in the Creation of a Trainable Message Understanding System. In Proceedings of the Fourteenth Conference on Artificial Intelligence (AAAI/IAAI-97), (1997) 941–948

    Google Scholar 

  2. Boguraev, B. and C. Kennedy:Technical Terminology for Domain Specification and Content Characterisation. In Information Extraction: A multi-disciplinary approach to an emerging information technology. International Summer School, SCIE-97, Frascati, Italy, July 14–18.1997, M.T. Pazienza (ed.) Springer, (1997) 27–96

    Google Scholar 

  3. Boguraev, B. and C. Kennedy: Salience-Based Content Characterisation of Text Documents. In Proceedings of ACL/EACL’7 Workshop on Intelligent Scalable Text Summarisation, Madrid, Spain, (1997) 2–9

    Google Scholar 

  4. Bourigault, D.: LEXTER, a Terminology Extraction Software for Knowledge Acquisition from Texts. In Proceedings of the Ninth Knowledge Acquisition for Knowledge Based System Workshop (KAW’95), Banff, Canada, (1995)

    Google Scholar 

  5. Califf, M. E. and R. J. Mooney: Relational Learning of Pattern-Match Rules for Information Extraction. In Working Papers of ACL-97 Workshop on Natural Language Learning, (1997) 9–15

    Google Scholar 

  6. Chinchor, N. A.: MUC-7 Named Entity Task Definition. Version 3.4, 13 July 1997.

    Google Scholar 

  7. Chinchor, N. A.: Overview of MUC-7/MET-2. In Science Applications International Corporation (SAIC), (1998 )http://www.muc.saic.com/proceedings/muc_7_proceedings/overview.html

  8. Frantzi, K. T. and S. Ananiadou: The C-Value/NC-Value Domain Independent Method for Multi-Word Term Extraction. In Journal of Natural Language Processing, 6(3) (1999) 145–179

    Google Scholar 

  9. Justeson, J. S. and S. M. Katz: Technical Terminology: Some Linguistic Properties and an Algorithm for Identification in Text. In Natural Language Engineering, 1(1) (1995) 9–27

    Article  Google Scholar 

  10. McNaught, J., W. J. Black, F. Rinaldi, E. Bertino, A. Brasher, D. Deavin, B. Catania, D. Silvestri, B. Armani, P. Leo, A. Persidis, G. Semeraro, F. Esposito, G. P. Zarri and L. Gilardoni: Integrated Document and Knowledge Management for the Knowledge-based Enterprise. In Proceedings of Practical Application of Knowledge Management 2000 (PAKeM 2000) (forthcoming), Manchester, (April 2000) 10–14

    Google Scholar 

  11. Mikheev, A., M. Moens and C. Grover: Named Entity Recognition without Gazetteers. In Proceedings of EACL’99, (1999) 1–8

    Google Scholar 

  12. Miller, G.A., R. Beckwith, C. Fellbaum, D. Gross and K. Miller: Introduction to WordNet: An Online Lexical Database. In Five Papers on WordNet, (1993) 1–9 ftp://ftp.cogsci.princeton.edu/pub/wordnet/5papers.ps

  13. Riloff, E.: Automatically Constructing a Dictionary for Information Extraction Tasks. In Proceedings of the Eleventh National Conference on Artificial Intelligence (AAAI-93), (1993) 811–816

    Google Scholar 

  14. Riloff, E.: Automatically Generating Extraction Patterns from Untagged Text. In Proceedings of the Thirteenth National Conference on Artificial Intelligence (AAAI-96), (1996) 1044–1049

    Google Scholar 

  15. Riloff, E.: An Empirical Study of Automated Dictionary Construction for Information Extraction in Three Domains. AI Journal, 85 (August 1996)

    Google Scholar 

  16. Riloff, E.and R. Jones: Learning Dictionaries for Information Extraction by MultiLevel Bootstrapping. In Proceedings of the Sixteenth National Conference on Artificial Intelligence (AAAI-99), (1999)

    Google Scholar 

  17. Sager, J.C., D. Dungworth and P. F. McDonald: English Special Languages: principles and practice in science and technology. Oscar Brandstetter Verlag KG, Wiesbaden, (1980)

    Google Scholar 

  18. Soderland, S.: Learning Information Extraction Rules for Semi-structured and Free Text. In Machine Learning, C. Cardie and R. Mooney (eds.) Kluwer Academic Publishers, Boston (1999) 1–44

    Google Scholar 

  19. Soderland, S., D. Fisher, J. Aseltine and W. Lehnert: CRYSTAL: Inducing a Conceptual Dictionary. In Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence (IJCAI’ 95), (1995) 1314–1319

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2000 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Zervanou, K., McNaught, J. (2000). A Term-Based Methodology for Template Creation in Information Extraction. In: Christodoulakis, D.N. (eds) Natural Language Processing — NLP 2000. NLP 2000. Lecture Notes in Computer Science(), vol 1835. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45154-4_38

Download citation

  • DOI: https://doi.org/10.1007/3-540-45154-4_38

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-67605-8

  • Online ISBN: 978-3-540-45154-9

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics