Loading [MathJax]/extensions/TeX/color_ieee.js
Visualizing and Querying Large-scale Structured Datasets by Learning Multi-layered 3D Meta-Profiles | IEEE Conference Publication | IEEE Xplore

Visualizing and Querying Large-scale Structured Datasets by Learning Multi-layered 3D Meta-Profiles


Abstract:

Data profiling is a "set of statistical data analysis activities to determine properties of a dataset". Historically, it was aimed at data (not meta-data), but at scale, ...Show More

Abstract:

Data profiling is a "set of statistical data analysis activities to determine properties of a dataset". Historically, it was aimed at data (not meta-data), but at scale, the tables’ meta-data (i.e. title, attribute names, types) becomes abundant, hence its profiling becomes vital, especially in order to understand the contents of large-scale structured datasets.Here we describe and evaluate the algorithms and models behind our scalable Meta-data profiler. It is capable of learning Meta-profiles for a topic of interest in extreme-scale structured datasets, such as WDC [1] or CORD-19 [2] having millions of tables and hundreds of thousands of sources. A 3D Meta-profile visualizes a specific topic (e.g. COVID-19 vaccine side-effects) present in a large-scale structured dataset and simplifies access and comparison for data scientists and end-users.
Date of Conference: 17-20 December 2022
Date Added to IEEE Xplore: 26 January 2023
ISBN Information:
Conference Location: Osaka, Japan

Contact IEEE to Subscribe

References

References is not available for this document.