skip to main content
10.1145/3555041.3589392acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
abstract

Discovering Denial Constraints Using Boolean Patterns

Published: 05 June 2023 Publication History

Abstract

Denial constraints (DCs) are at the heart of maintaining data consistency. Formulating DCs by hand is difficult and susceptible to errors. Automatically discovering DCs from data is an alternative, but this is computationally expensive due to the large search space. We propose a new method for automatically discovering DCs, named Boolean Patterns (BP), that identifies specific patterns in sets of predicates that provide minimal coverage of a set of distinct evidences from which DCs can be extracted. The main appeal of BP is its simplicity, bringing the discovery of DCs from the land of elaborated data structures to the land of boolean signs. We are currently studying two research opportunities. First, BP drastically reduces the memory required to keep intermediates in the evidence set data structures. Second, the execution of boolean signs allows exploring the discovery of DCs in highly parallel emerging hardware, like GPUs/FPGAs and processing-in-memory, offloading the discovery execution and overcoming performance bottlenecks in the CPU. We developed a CPU version of BP and compared it to other algorithms that deal with the problem of discovering minimal coverage sets of an evidence set on real-world datasets used in discovering DCs. In preliminary results, BP demonstrated superior performance, with a fraction of the memory used by its counterparts to hold evidence sets while enabling hardware acceleration.

Supplemental Material

MP4 File
Discovering Denial Constraints Using Boolean Patterns presentation video for ACM SIGMOD 2023 Student Research Competition.

References

[1]
Tobias Bleifuß, Sebastian Kruse, and Felix Naumann. 2017. Efficient denial constraint discovery with hydra. Proceedings of the VLDB Endowment, Vol. 11 (11 2017), 311--323. https://doi.org/10.14778/3157794.3157800
[2]
Xu Chu, Ihab F. Ilyas, and Paolo Papotti. 2013. Discovering Denial Constraints. Proc. VLDB Endow., Vol. 6, 13 (2013), 1498--1509. https://doi.org/10.14778/2536258.2536262
[3]
Andrew Gainer-Dewar and Paola Vera-Licona. 2017. The Minimal Hitting Set Generation Problem: Algorithms and Computation. SIAM Journal on Discrete Mathematics, Vol. 31, 1 (2017), 63--100. https://doi.org/10.1137/15M1055024 https://doi.org/10.1016/j.dam.2014.01.012
[4]
Eduardo H. M. Pena and Eduardo Cunha de Almeida. 2018. BFASTDC: A Bitwise Algorithm for Mining Denial Constraints. In Database and Expert Systems Applications - 29th International Conference, DEXA 2018, Regensburg, Germany, September 3--6, 2018, Proceedings, Part I (Lecture Notes in Computer Science, Vol. 11029), Sven Hartmann, Hui Ma, Abdelkader Hameurlain, Gü nther Pernul, and Roland R. Wagner (Eds.). Springer, Regensburg, Bavaria Land, Germany, 53--68. https://doi.org/10.1007/978--3--319--98809--2_4
[5]
Eduardo H. M. Pena, Eduardo C. de Almeida, and Felix Naumann. 2019. Discovery of Approximate (and Exact) Denial Constraints. Proc. VLDB Endow., Vol. 13, 3 (nov 2019), 266--278. https://doi.org/10.14778/3368289.3368293
[6]
Eduardo H. M. Pena, Eduardo C. de Almeida, and Felix Naumann. 2022. Fast Detection of Denial Constraint Violations. Proc. VLDB Endow., Vol. 15, 4 (apr 2022), 859--871. https://doi.org/10.14778/3503585.3503595
[7]
K. Ramamohanarao, J. Bailey, and T. Manoukian. 2003. A Fast Algorithm for Computing Hypergraph Transversals and its Application in Mining Emerging Patterns. In 2013 IEEE 13th International Conference on Data Mining. IEEE Computer Society, Los Alamitos, CA, USA, 485. https://doi.org/10.1109/ICDM.2003.1250958
[8]
Shaoxu Song, Fei Gao, Ruihong Huang, and Chaokun Wang. 2020. Data Dependencies over Big Data: A Family Tree. IEEE Transactions on Knowledge and Data Engineering, Vol. 34, 10 (2020), 1--1. https://doi.org/10.1109/TKDE.2020.3046443
[9]
Zijing Tan, Ai Ran, Shuai Ma, and Sheng Qin. 2021. Fast Incremental Discovery of Pointwise Order Dependencies. Proc. VLDB Endow., Vol. 13, 10 (mar 2021), 1669--1681. https://doi.org/10.14778/3401960.3401965
[10]
Renjie Xiao, Zijing Tan, Haojin Wang, and Shuai Ma. 2022. Fast Approximate Denial Constraint Discovery. Proc. VLDB Endow., Vol. 16, 2 (nov 2022), 269--281. https://doi.org/10.14778/3565816.3565828

Index Terms

  1. Discovering Denial Constraints Using Boolean Patterns

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    SIGMOD '23: Companion of the 2023 International Conference on Management of Data
    June 2023
    330 pages
    ISBN:9781450395076
    DOI:10.1145/3555041
    Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 05 June 2023

    Check for updates

    Author Tags

    1. denial constraints
    2. embedded devices
    3. fpga accelerators
    4. relational database

    Qualifiers

    • Abstract

    Data Availability

    Discovering Denial Constraints Using Boolean Patterns presentation video for ACM SIGMOD 2023 Student Research Competition. https://dl.acm.org/doi/10.1145/3555041.3589392#SIGMOD23-modug003.mp4

    Conference

    SIGMOD/PODS '23
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 785 of 4,003 submissions, 20%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 84
      Total Downloads
    • Downloads (Last 12 months)27
    • Downloads (Last 6 weeks)2
    Reflects downloads up to 03 Mar 2025

    Other Metrics

    Citations

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media