A Term-Based Methodology for Template Creation in Information Extraction

Zervanou, Kalliopi; McNaught, John

doi:10.1007/3-540-45154-4_38

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 1835)

Included in the following conference series:

International Conference on Natural Language Processing

Abstract

In this paper, we are concerned with the problem of automatic template creation for Information Extraction (IE) and we present a methodology for the creation of IE templates. Our approach proposes the semi-automatic construction of a semantic representation of textual information based on recognition of multi-word and nested terms and Named Entities (NEs) and subsequent exploitation of term and NE context for the induction of Information Extraction template rules.
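The two ingredients named in the abstract, nested multi-word terms and the context surrounding term occurrences, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `find_nested_terms` and `collect_contexts`, the fixed token window, the whitespace tokenisation, and the toy term list are all assumptions made for illustration. The sketch only shows how nested terms might be detected in a candidate term list and how context words might be gathered as raw material for template rules.

```python
from collections import Counter


def find_nested_terms(terms):
    """Map each term to the longer candidate terms that contain it
    as a contiguous word sequence (hypothetical helper)."""
    tokenized = {t: t.split() for t in terms}
    nested = {}
    for short, s_toks in tokenized.items():
        n = len(s_toks)
        for long, l_toks in tokenized.items():
            if n >= len(l_toks):
                continue
            # Slide a window of length n over the longer term.
            if any(l_toks[i:i + n] == s_toks
                   for i in range(len(l_toks) - n + 1)):
                nested.setdefault(short, []).append(long)
    return nested


def collect_contexts(tokens, terms, window=2):
    """Count the words found within `window` tokens to the left and
    right of every occurrence of each term (hypothetical helper)."""
    contexts = {t: Counter() for t in terms}
    for term in terms:
        t_toks = term.split()
        n = len(t_toks)
        for i in range(len(tokens) - n + 1):
            if tokens[i:i + n] == t_toks:
                left = tokens[max(0, i - window):i]
                right = tokens[i + n:i + n + window]
                contexts[term].update(left + right)
    return contexts


# Toy data, assumed for illustration only.
terms = [
    "information extraction",
    "information extraction template",
    "named entity",
]
text = ("the system fills an information extraction template "
        "for each named entity found")
tokens = text.split()

nested = find_nested_terms(terms)
contexts = collect_contexts(tokens, terms)
```

Here `nested` records that "information extraction" occurs inside "information extraction template", and `contexts` holds, for each term, the neighbouring words that a later rule-induction step could inspect. A real system would replace the toy term list with the output of a term recogniser and a Named Entity recogniser.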

Author information

Authors and Affiliations

Department of Language Engineering, University of Manchester Institute of Science and Technology UMIST, PO Box 88, Manchester, M60 1QD, UK
Kalliopi Zervanou & John McNaught

Editor information

Editors and Affiliations

Computer Engineering Department and Computer Technology Institute, University of Patras, 26500, Patras, Greece
Dimitris N. Christodoulakis

About this paper

Cite this paper

Zervanou, K., McNaught, J. (2000). A Term-Based Methodology for Template Creation in Information Extraction. In: Christodoulakis, D.N. (eds) Natural Language Processing — NLP 2000. NLP 2000. Lecture Notes in Computer Science, vol 1835. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45154-4_38

DOI: https://doi.org/10.1007/3-540-45154-4_38
Published: 25 May 2000
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-67605-8
Online ISBN: 978-3-540-45154-9
eBook Packages: Springer Book Archive
