Synonyms
Definition
A document database is a collection of stored texts managed by a system that provides query and update facilities. Usually the database includes many documents related by their subject matter, origin, or applicability to an enterprise. The content of each document may be free text, semi-structured text including a few well-identified fields (e.g., title, author, date), or highly structured tagged text such as might be encoded using XML. Occasionally documents may also contain multimedia components.
In contrast, the term corpus (plural corpora) typically refers to a static collection of texts that have been assembled by experts to study linguistic phenomena (e.g., the Brown Corpus, created in 1964 to study American English, and the Swedish Language Bank) or to provide a rich source of text for lexicographic needs (e.g., the Dictionary of Old English Corpus, including all extant texts written in Old English in the period...
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Recommended Reading
Bertino E., Ooi B., Sacks-Davis R., Tan K.-L., and Zobel J. Text databases. In Indexing Techniques for Advanced Database Systems. Kluwer Academic, Norwell, MA, 1997, pp. 151–184.
A.G. (ed.). Chin Text Databases and Document Management: Theory and Practice. Idea Group, Hershey, PA, 2001.
Christophides V., Abiteboul S., Cluet S., and Scholl M. From structured documents to novel query facilities. In Proc. ACM SIGMOD Int. Conf. on Management of Data, 1994, pp. 313–324.
Kilpeläinen P., Lindén G., Mannila H., and Nikunen E. 1A structured text database system. In Proc. Int. Conf. on Electronic Publishing, Document Manipulation and Typography, 1990, pp. 139–151.
Loeffen A. Text databases: a survey of text models and systems. ACM SIGMOD Rec., 23(1):97–106, 1994.
Lowe B., Zobel J., and Sacks-Davis R. A formal model for databases of structured text, In Proc. 4th Int. Conf. on Database Systems for Advanced Applications, 1995, pp. 449–456.
Macleod A. A data base management system for document retrieval applications. Inf. Syst., 6(2):131–137, 1981.
Sacks-Davis R., Arnold-Moore T., and Zobel J. Database systems for structured documents. In Proc. Int. Symp. on Advanced Database Technologies and Their Integration, 1994, pp. 272–283.
Salminen A. and Tompa F.W. Requirements for XML document database systems. In Proc. ACM Symp. on Document Engineering, 2001, pp. 85–94.
Stonebraker M., Stettner H., Lynn N., Kalash J., and Guttman A. Document processing in a relational database system. ACM Trans. Inf. Syst., 1(2):143–158, 1983.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer Science+Business Media, LLC
About this entry
Cite this entry
Tompa, F.W. (2009). Document Databases. In: LIU, L., ÖZSU, M.T. (eds) Encyclopedia of Database Systems. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-39940-9_807
Download citation
DOI: https://doi.org/10.1007/978-0-387-39940-9_807
Publisher Name: Springer, Boston, MA
Print ISBN: 978-0-387-35544-3
Online ISBN: 978-0-387-39940-9
eBook Packages: Computer ScienceReference Module Computer Science and Engineering