skip to main content
10.1145/2790798.2790819acmotherconferencesArticle/Chapter ViewAbstractPublication PagesuccsConference Proceedingsconference-collections
short-paper

Pragamana: Performance Comparison and Programming Alpha-miner Algorithm in Relational Database Query Language and NoSQL Column-Oriented Using Apache Phoenix

Published: 13 July 2015 Publication History

Abstract

Process-Aware Information Systems (PAIS) is an IT system that support business processes and generate large amounts of event logs from the execution of business processes. An event log is represented as a tuple of CaseID, Timestamp, Activity and Actor. Process Mining is a new and emerging field that aims at analyzing the event logs to discover, enhance and improve business processes and check conformance between run time and design time business processes. The large volume of event logs generated are stored in the databases. Relational databases perform well for a certain class of applications. However, there are a certain class of applications for which relational databases are not able to scale. To handle such class of applications, NoSQL database systems emerged. Discovering a process model (workflow model) from event logs is one of the most challenging and important Process Mining task. The α-miner algorithm is one of the first and most widely used Process Discovery technique. Our objective is to investigate which of the databases (Relational or NoSQL) performs better for a Process Discovery application under Process Mining. We implement the α-miner algorithm on relational (row-oriented) and NoSQL (column-oriented) databases in database query languages so that our algorithm is tightly coupled to the database. We present a performance benchmarking and comparison of the α-miner algorithm on row-oriented database and NoSQL column-oriented database so that we can compare which database can efficiently store massive event logs and analyze it in seconds to discover a process model.

References

[1]
Nick Dimiduk and Amandeep Khurana. HBase In Action.
[2]
D.J.Abadi, S.R.Madden, and N.Hachem. Column-Stores vs. Row-Stores: How Different Are They Really? SIGMOID, 2008.
[3]
Mary A. Finn. Fighting impedance mismatch at the database level.
[4]
K-U.Sattler and O.Dunemann. Sql Database Primitives for Decision Tree Classifiers. Conference on Information and Knowledge Management, pages 379--386, 2001.
[5]
Hasso Plattner. A Common Database Approach for Oltp and Olap using an In-Memory Column Database. ACM SIGMOD International Conference on Management of data, 2009.
[6]
Nicholas Charles Russell. Foundation of Process-Aware Information Systems.
[7]
Vatika Sharma and Meenu Dave. Sql and NoSQL Database. International Journal of Advanced Research in Computer Science and Software Engineering, 2012.
[8]
S.Weerapong, P.Porouhan, and W.Premchaiswadi. Process Mining Using α-Algorithm as a Tool. IEEE, 2012.
[9]
M.N. Vora. Hadoop-Hbase for Large-Scale Data. Computer Science and Network Technology (ICCSNT), 2011 International Conference on, 2011.
[10]
W.V.D.Aalst. Process Mining: Overview and Opportunities. ACM, 2012.

Cited By

View all

Index Terms

  1. Pragamana: Performance Comparison and Programming Alpha-miner Algorithm in Relational Database Query Language and NoSQL Column-Oriented Using Apache Phoenix

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    C3S2E '15: Proceedings of the Eighth International C* Conference on Computer Science & Software Engineering
    July 2015
    166 pages
    ISBN:9781450334198
    DOI:10.1145/2790798
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    In-Cooperation

    • Keio University: Keio University
    • BytePress

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 13 July 2015

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Apache HBase
    2. Apache Hadoop
    3. Apache Phoenix
    4. Column-Oriented Database
    5. Hadoop Distributed File System (HDFS)
    6. MySQL
    7. Process Mining
    8. Row-Oriented Database

    Qualifiers

    • Short-paper
    • Research
    • Refereed limited

    Conference

    C3S2E 2015

    Acceptance Rates

    Overall Acceptance Rate 12 of 42 submissions, 29%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)4
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 20 Feb 2025

    Other Metrics

    Citations

    Cited By

    View all

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media