Skip to main content
Log in

Integrating statistical theory with statistical databases

  • Published:
Annals of Mathematics and Artificial Intelligence Aims and scope Submit manuscript

Abstract

Statistical databases have traditionally been stored as flat files approximating relations. We propose that by storing statistical data in an object oriented type database, enhanced with knowledge of statistical theory, a more natural and powerful interface to statistical data can be created.

A formalism is proposed for dealing with and combining data that have random components by makingstatistics first class citizens in the database world. Entities in the databases are classified according to whether they areobservations, orstatistics.Estimates are a special type ofstatistics which aremoored toobservation entities.Statistics entities are classified by their statistical properties. A hierarchical structure of random features is provided, with distributions at its leaves. This structure is a DAG (Directed Acyclic Graph), which may be extended or redefined for different applications and contains information used to compare and manipulatestatistics.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. SAS Institute Inc.,SAS Users Guide (Raleigh NC, 1984).

  2. R.A. Becker, J.M. Chambers and A.R. Wilks,The New S Language (Wadsworth and Brooks/Cole, 1988).

  3. H. Boral, D.J. De Witt and D. Bates, A framework for research in database management for statistical analysis, Computer Science Technical Report no. 465, Madison, University of Wisconsin CS Dept. (Feb. 1982).

    Google Scholar 

  4. A. Borgida, Features of languages for the development of information systems at the conceptual level, Rutgers University Technical Report, LCSR-TR-52 (1983).

  5. M.L. Brodie and J. Mylopoulos, AI and databases: semantic vs computational theories of information, in:New Directions for Database Systems (Ablex Publ. Co., 1985).

  6. N.C. Rowe, Rule based statistical calculations on a database abstract, in:Proc. 1st LBL Workshop on Statistical Database Management, LBL, ed. H.T.K. Wong (March 1982).

  7. D. Lubinsky and D. Pregibon, Data analysis as search, J. Econometrics 38 (1988) 247–268.

    Google Scholar 

  8. W.A. Gale, Student Phase 1 — A report on work in progress, in:Artificial Intelligence and Statistics, ed. W.A. Gale (Addison-Wesley, 1987).

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Lubinsky, D.J. Integrating statistical theory with statistical databases. Ann Math Artif Intell 2, 245–259 (1990). https://doi.org/10.1007/BF01531010

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1007/BF01531010

Keywords

Navigation