Skip to main content

Mining Frequent Instances on Workflows

  • Conference paper
  • First Online:
Advances in Knowledge Discovery and Data Mining (PAKDD 2003)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2637))

Included in the following conference series:

Abstract

A workflow is a partial or total automation of a business process, in which a collection of activities must be executed by humans or machines, according to certain procedural rules. This paper deals with an aspect of workflows which has not so far received much attention: providing facilities for the human system administrator to monitor the actual behavior of the workflow system in order to predict the “most probable” workflow executions. In this context, we develop a data mining algorithm for identifying frequent patterns, i.e., the workflow substructures that have been scheduled more frequently by the system. Several experiments show that our algorithm outperforms the standard approaches adapted to mining frequent instances.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. R. Agrawal, D. Gunopulos, and F. Leymann. Mining Process Models from Workflow Logs. In Proc. 6th Int. Conf. on Extending Database Technology (EDBT), pages 469–483, 1998.

    Google Scholar 

  2. R. Agrawal, T. Imielinski, and A. Swami. Mining Association Rules between Sets of Items in Large Databases. In Proc. ACM Conf. on Management of Data (SIGMOD93), pages 207–216, 1993.

    Google Scholar 

  3. R. Agrawal, and R. Srikant. Fast Algorithms for Mining Association Rules in Large Databases In Proc. 20th Int. Conf. on Very Large Data Bases(VLDB94), pages 487–499, 1994.

    Google Scholar 

  4. R. Agrawal, and R. Srikant. Mining Sequential Patterns. In Proc. 11th Int. Conf. on Data Engineering (ICDE95), pages 3–14, 1995.

    Google Scholar 

  5. E. Cohen, M. Datar, S. Fujiwara, A. Gionis, P. Indyk, R. Motwani, J. D. Ullman, and C. Yang. Finding Interesting Associations without Support Pruning. IEEE Transactions on Knowledge and Data Engineering, 13(1), pages 64–78, 2001.

    Article  Google Scholar 

  6. J. E. Cook, and A. L. Wolf. Automating Process Discovery Through Event-Data Analysis. In Proc. 17th Int. Conf. on Software Engineering (ICSE95), pages 73–82, 1995.

    Google Scholar 

  7. H. Davulcu, M. Kifer, C. R. Ramakrishnan, and I. V. Ramakrishnan. Logic Based Modeling and Analysis of Workflows. In Proc. 17th ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems, pages 25–33, 1998.

    Google Scholar 

  8. U. Dayal, M. Hsu, and R. Ladin. Business Process Coordination: State of the Art, Trends, and Open Issues. In Proc. 27th Int. Conf. on Very Large Data Bases (VLBDB01), pages 3–13, 2001.

    Google Scholar 

  9. L. Dehaspe, and H. Toivonen. Discovery of Frequent DATALOG Patterns. Data Mining and Knowledge Discovery, 3(1), pages 7–36, 1999.

    Article  Google Scholar 

  10. T. Feder, P. Hell, S. Klein, and R. Motwani. Complexity of Graph Partition Problems. STOC, pages 464–472, 1999

    Google Scholar 

  11. M. R. Garey, and D. S. Johnson. Computers and Intractability. A Guide to the Theory of NP-completeness, Freeman and Comp., NY, USA, 1979.

    Google Scholar 

  12. D. Georgakopoulos, M. Hornick, and A. Sheth. An overview of Workflow Management: From Process Modeling to Workflow Automation Infrastructure. Distributed and Parallel Databases, 3(2), pages 119–153, 1995.

    Article  Google Scholar 

  13. M. Gillmann, W. Wonner, and G. Weikum. Workflow Management with Service Quality Guarantees. In Proc. ACM Conf. on Management of Data (SIGMOD02), 2002.

    Google Scholar 

  14. P. Grefen, J. Vonk, and P. M. G. Apers. Global transaction support for workflow management systems: from formal specification to practical implementation. VLDB Journal 10(4), pages 316–333, 2001.

    Article  MATH  Google Scholar 

  15. G. Lee, K. L. Lee, and A. L. P. Chen. Efficient Graph-Based Algorithms for Discovering and Maintaining Association Rules in Large Databases. Knowledge and Information Systems 3(3), pages 338–355, 2001.

    Article  MATH  MathSciNet  Google Scholar 

  16. M. Kamath, and K. Ramamritham. Failure handling and coordinated execution of concurrent workflows. In Proc. 14th Int. Conf. on Data Engineering (ICDE98), pages 334–341, 1998.

    Google Scholar 

  17. P. Koksal, S. N. Arpinar, and A. Dogac. Workflow History Management. SIGMOD Record, 27(1), pages 67–75, 1998.

    Article  Google Scholar 

  18. A. Inokuchi, T. Washio, and H. Motoda. An Apriori-Based Algorithm for Mining Frequent Substructures from Graph Data. In Proc. 4th European Conf. on Principles of Data Mining and Knowledge Discovery, pages 13–23, 2000.

    Google Scholar 

  19. T. Miyahara and others. Discovery of Frequent Tag Tree Patterns in Semistructured Web Documents. In Proc 6th Pacific-Asia Conf. on Advances in Knowledge Discovery and Data Mining, pages 356–367, 2002.

    Google Scholar 

  20. W. M. P. van der Aalst, and K. M. van Hee. Workflow Management: Models, Methods, and Systems. MIT Press, 2002.

    Google Scholar 

  21. D. Wodtke, and G. Weikum. A Formal Foundation for Distributed Workflow Execution Based on State Charts. In Proc. 6th Int. Conf. on Database Theory (ICDT97), pages 230–246, 1997.

    Google Scholar 

  22. The Workflow Management Coalition, http://www.wfmc.org/.

  23. M. Zaki. Efficiently Mining Frequent Trees in a Forest. In Proc. 8th Int. Conf. On Knowledge Discovery and Data Mining (SIGKDD02), 2002. to appear.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2003 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Greco, G., Guzzo, A., Manco, G., Saccà, D. (2003). Mining Frequent Instances on Workflows. In: Whang, KY., Jeon, J., Shim, K., Srivastava, J. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2003. Lecture Notes in Computer Science(), vol 2637. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36175-8_21

Download citation

  • DOI: https://doi.org/10.1007/3-540-36175-8_21

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-04760-5

  • Online ISBN: 978-3-540-36175-6

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics