ABSTRACT
Two conditions have made it almost inevitable that we have an information-retrieval project at the System Development Corporation (SDC). First, our internal documentation. It has been estimated that we acquire 10,000 documents a year both from internal and external sources, not including books and periodicals. Internal distribution of these documents runs into millions of copies. We have not been able to afford to abstract and subject categorize more than 10 per cent of our 10,000 documents a year. With a good retrieval system we might be able to make many thousands more of our documents accessible by subject without increasing documentation expense.
Recommendations
Approaches to passage retrieval in full text information systems
SIGIR '93: Proceedings of the 16th annual international ACM SIGIR conference on Research and development in information retrievalLarge collections of full-text documents are now commonly used in automated information retrieval. When the stored document texts are long, the retrieval of complete documents may not be in the users' best interest. In such circumstance, efficient and ...
Passage-Based Document Retrieval as a Tool for Text Mining with User's Information Needs
DS '01: Proceedings of the 4th International Conference on Discovery ScienceDocument retrieval can be considered as a basic but important tool for text mining that is capable of taking a user's information need into account. However, document retrieval is a hard task if multitopic lengthy documents have to be retrieved with a ...
Information Retrieval System for XML Documents
DEXA '02: Proceedings of the 13th International Conference on Database and Expert Systems ApplicationsIn the research field of document information retrieval, the unit of retrieval results returned by IR systems is a whole document or a document fragment, like a paragraph in passage retrieval. IR systems based on the vector space model compute feature ...
Comments