Articles
Symbolic Data Analysis: A Mathematical Framework and Tool for Data Mining

https://doi.org/10.1016/S1571-0653(04)00047-2Get rights and content

Abstract

Summary

knowledge extraction from large data bases is called “Data Mining”. In Data Bases the descriptions of the units are more complex than the standard ones due to the fact that they can contain internal variation and be structured. Moreover, symbolic data happen from many sources in order to summarise huge sets of data. They need more complex data tables called “symbolic data tables” because a cell of such data table does not necessarily contain as usual, a single quantitative or categorical values. For instance, a cell can contain several values linked by a taxonomy. The need to extend standard data analysis methods (exploratory, clustering, factorial analysis, discrimination,…) to symbolic data table is increasing in order to get more accurate information and summarise extensive data sets contained in Data Bases. We define “Symbolic Data Analysis” (SDA) the extension of standard Data Analysis to such tables. “Symbolic objects” are defined, in order to describe in an explanatory way classes of such units. They constitute an explanatory output of a SDA and they can be used as queries of the Data Base. A symbolic object is “complete” if its “extent” covers exactly the class that it describes. The set of complete symbolic objects constitutes a Galois lattice. The SDA tools developed in the European Community project “SODAS” are finally mentioned.

References (11)

  • P. Brito

    Order structure of symbolic assertion objects

    IEEE TR. on Knowledge and Data Engineering

    (1994)
  • H. Bandemer et al.

    Fuzzy Data Analysis

    (1992)
  • De Carvalho F.A.T. (1998) “New metrics for constrained boolean symbolic objects” Proc. KESDA '98, Eurostat....
  • Ciampi A., Diday E., Lebbe J., Périnel E., Vigne R. (1995) “Recursive partition with probabilistically imprecise data”....
  • Diday E., Emilion R. (1995) “Lattices and Capacities in Analysis of Probabilist Objects”. Proceed. of OSDA '95 (Ordinal...
There are more references available in the full text version of this article.
View full text