To read this content please select one of the options below:

(excl. tax) 30 days to view and download

Information extraction: beyond document retrieval

Robert Gaizauskas, Yorick Wilks

Journal of Documentation

ISSN: 0022-0418

Article publication date: 1 March 1998

Downloads

1473

Abstract

In this paper we give a synoptic view of the growth of the text processing technology of information extraction (IE) whose function is to extract information about a pre‐specified set of entities, relations or events from natural language texts and to record this information in structured representations called templates. Here we describe the nature of the IE task, review the history of the area from its origins in AI work in the 1960s and 70s till the present, discuss the techniques being used to carry out the task, describe application areas where IE systems are or are about to be at work, and conclude with a discussion of the challenges facing the area. What emerges is a picture of an exciting new text processing technology with a host of new applications, both on its own and in conjunction with other technologies, such as information retrieval, machine translation and data mining.

Keywords

Citation

Gaizauskas, R. and Wilks, Y. (1998), "Information extraction: beyond document retrieval", Journal of Documentation, Vol. 54 No. 1, pp. 70-105. https://doi.org/10.1108/EUM0000000007162

Publisher

:

MCB UP Ltd

To read this content please select one of the options below:

Please note you do not have access to teaching notes

Information extraction: beyond document retrieval

Abstract

Keywords

Citation

Publisher

Related articles

To read this content please select one of the options below:

Please note you do not have access to teaching notes

Abstract

Keywords

Citation

Publisher

Related articles

All feedback is valuable

Report an issue or find answers to frequently asked questions