skip to main content
10.1145/3459930.3469499acmconferencesArticle/Chapter ViewAbstractPublication PagesbcbConference Proceedingsconference-collections
poster

FARM: hierarchical association rule mining and visualization method

Published:01 August 2021Publication History

ABSTRACT

Associations search is one of the methods of data analysis. Association Rule Mining (ARM) approach can construct association rules from observational data, but the most widely used algorithm Apriori typically produces large number of unstructured results without any ranking or statistical significance. We propose a novel method for association rules mining FARM (Fishbone Association Rule Mining) to address these challenges.

First of all, it is necessary to solve the problem with huge number of unstructured rules. It is important because large number of rules results in time costs for their investigation and absence of rule structure gives no information which features are more important. FARM uses hierarchical structure for rules producing which is helpful because priority of features became clearly visible. At each step FARM is trying to increase hierarchical rule complexity by adding additional features in such way that optimization metric (e.g., conviction) would grow. During this procedure it's also being checked that information growth is achieving. Further significance filtering is used to focus on statistically significant results. FARM involves check for statistical significance using hold-out approach which begins with splitting dataset into two parts - first for rules construction and second one for validation. Constructed rules are firstly filtered by chi-squared test, then validated and finally checked using statistical testing with multiple comparisons correction. At this point FARM obtains statistically significant hierarchical rules and they need to be shown in human readable way for which Ishikawa diagram is used. This diagram is based on the idea of causal-like hierarchy structure visualization with the fishbone head target and ordered predicates in the ribs, so it perfectly corresponds to our needs. Final rules are included in result diagrams and interactive filters provide FARM users with ability to set filters to show most significant rules, or rules with minimal required characteristics. Analysis can be run using dedicated web service what improves convenience for everyone who wants to try FARM.

We applied FARM to previously published public datasets achieving rules which included original papers results. After that we used FARM in our recent paper [1] where we found associations between changes in methylome and regulatory regions in the genome.

FARM has shown itself convenient in use and promising due to abilities in detecting significant rules and their apparent visualization. We believe that FARM will accelerate discoveries by producing complete solution for analysis and visualization of data patterns.

References

  1. Irina Shchukina, Juhi Bagaitkar and Oleg Shpynov. 2021. Enhanced epigenetic profiling of classical human monocytes reveals a specific signature of healthy aging in the DNA methylome. Nat Aging, 1 (2021), 124--141. Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. FARM: hierarchical association rule mining and visualization method

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      BCB '21: Proceedings of the 12th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics
      August 2021
      603 pages
      ISBN:9781450384506
      DOI:10.1145/3459930

      Copyright © 2021 Owner/Author

      Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 1 August 2021

      Check for updates

      Qualifiers

      • poster

      Acceptance Rates

      Overall Acceptance Rate254of885submissions,29%
    • Article Metrics

      • Downloads (Last 12 months)8
      • Downloads (Last 6 weeks)0

      Other Metrics

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader