skip to main content
10.1145/1385989.1386022acmotherconferencesArticle/Chapter ViewAbstractPublication PagesdebsConference Proceedingsconference-collections
research-article

Complex event processing over uncertain data

Published:01 July 2008Publication History

ABSTRACT

In recent years, there has been a growing need for active systems that can react automatically to events. Some events are generated externally and deliver data across distributed systems, while others are materialized by the active system itself. Event materialization is hampered by uncertainty that may be attributed to unreliable data sources and networks, or the inability to determine with certainty whether an event has actually occurred. Two main obstacles exist when designing a solution to the problem of event materialization with uncertainty. First, event materialization should be performed efficiently, at times under a heavy load of incoming events from various sources. The second challenge involves the generation of a correct probability space, given uncertain events. We present a solution to both problems by introducing an efficient mechanism for event materialization under uncertainty. A model for representing materialized events is presented and two algorithms for correctly specifying the probability space of an event history are given. The first provides an accurate, albeit expensive method based on the construction of a Bayesian network. The second is a Monte Carlo sampling algorithm that heuristically assesses materialized event probabilities. We experimented with both the Bayesian network and the sampling algorithms, showing the latter to be scalable under an increasing rate of explicit event delivery and an increasing number of uncertain rules (while the former is not). Finally, our sampling algorithm accurately and efficiently estimates the probability space.

References

  1. Adi, A. A Language and an Execution Model for the Detection of Reactive Situations. Ph.D. Thesis, Technion -- Israel Institute of Technology, 2003.Google ScholarGoogle Scholar
  2. Adi, A. and Etzion, O. Amit - the situation manager. The VLDB journal 13, 5 (May 2004), 177--203. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Balazinska, M., Khoussainova, N., and Suciu, D. PEEX: Extracting probabilistic events from rd data. Proceedings of ICDE, (2008).Google ScholarGoogle Scholar
  4. Beeri, C., Eyal, A., Milo, T., and Pilberg A. Query-Based Monitoring of BPEL Business Processes. In Proceedings of the ACM-SIGMOD, (2007), 1122--1124. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Breese, J. S., Goldman, R. P., and Wellman, M. P. Introduction to the Special Section on Knowledge-Based Construction of Probabilistic and Decision Models. IEEE Transactions on Systems, Man and Cybernatics, 24, 11, (1994), 1577.Google ScholarGoogle Scholar
  6. Campbell, M., Li, C.-S., Aggarwal, C., Naphade, M., Wu, K.-L., and Zhang, T. An evaluation of over-the-counter medication sales for syndromic surveillance. IEEE International Conference on Data Mining - Life Sciences Data Mining Workshop, (2004).Google ScholarGoogle Scholar
  7. Chu, D., Deshpande, A., Hellerstein, J., and Hong, W. Approximate data collection in sensor networks using probabilistic models. In Proceedings ICDE, (2007), page 48. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Cowie, J., Ogielski, A. T., Premore, B., and Yuanb, Y. Internet worms and global routing instabilities. SPIE, 4868, (2002).Google ScholarGoogle Scholar
  9. Dayal U. et al. The HiPAC project: Combining active databases and timing constraints. ACM SIGMOD Record, 17, 1, (March 1988), 51--70. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Demers, A., Gehrke, J., Hong, M., Riedewald, M., and White, W. Towards expressive publish/subscribe systems. Advances in Database Technology - EDBT 2006, (2006), 627--644. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Demers, A., Gehrke, J., Panda, B., Riedewald, M., Sharma, V., and White, W. Cayuga: A General Purpose Event Monitoring System. In IDR 2007, Third Biennial Conference on Innovative Data Systems Research, (2007) 412--422.Google ScholarGoogle Scholar
  12. Diaz, O., Jaime, A., and Paton, N. Dear: A Debugger for Active Rules in an Object-Oriented Context. In Proceedings of the 1st International Workshop on Rules in Database Systems, (1993) 180--193.Google ScholarGoogle Scholar
  13. Forgy, C. L. Rete: A fast algorithm for the many pattern/many object pattern match problem. Artificial Intelligence, 19, (1982), 17--37.Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Gehani, N. H., Jagadish, N. H., and Shmueli, O. Composite event specification in active databases: Model and implementation, Proceedings VLDB, (1992), 23--27. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Girod, L., Mei, Y., Newton, R., Rost, S., Thiagarajan, A., Balakrishnan, H., and Madden, S. The Case for a Signal-Oriented Data Stream Management System. In CIDR 2007, (2007) 397--406.Google ScholarGoogle Scholar
  16. Green, T. and Tannen, V. Models for incomplete and probabilistic information. IEEE Data Eng. Bull 29, 1 (2006) 17--24.Google ScholarGoogle Scholar
  17. Halpern, J. Y. Reasoning about Uncertainty. MIT Press, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Li, C.-S., Aggarwal, C., Campbell, M., Chang, Y.-C., Glass, G., Iyengar, V., Joshi, M., Lin, C.-Y., Naphade, M., and Smith, J. R. Epi-spire: A system for environmental and public health activity monitoring. IEEE International Conference on Multimedia and Expo, (2003). Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Paton, N. W. Active Rules in Database Systems. Springer, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Pearl, J. Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference. Morgan Kaufmann, 1988. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Re, C., Dalvi, N., and Suciu, D. Query evaluation on probabilistic databases. IEEE Data Eng. Bull. 29, 1, (2006) 25--31.Google ScholarGoogle Scholar
  22. Shmueli, G. and Fienberg. S. Current and potential statistical methods for monitoring multiple data streams for biosurveillance. In Statistical Methods in Counterterrorism, pages 109--140. Springer Verlag, 2006.Google ScholarGoogle ScholarCross RefCross Ref
  23. Wasserkrug, S., Gal, A., and Etzion, O. A model for reasoning with uncertain rules in event composition systems. Proceedings of the 21st Annual Conference on Uncertainty in Artificial Intelligence (UAI-05), (2005), 599--606.Google ScholarGoogle Scholar
  24. Widom, J. Trio: A system for integrated management of data, accuracy, and lineage. CIDR, (2005), 262--276.Google ScholarGoogle Scholar
  25. Widom, J. and Ceri, S., editors. Triggers and Rules for Advanced Database Processing. Morgan-Kaufmann, San Francisco, CA, 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Zimmer, D. and Unland, R. On the Semantics of Complex Events in Active Database Management Systems. Proceedings ICDE (1999), 392--399 Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Complex event processing over uncertain data

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in
          • Published in

            cover image ACM Other conferences
            DEBS '08: Proceedings of the second international conference on Distributed event-based systems
            July 2008
            377 pages
            ISBN:9781605580906
            DOI:10.1145/1385989

            Copyright © 2008 ACM

            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 1 July 2008

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Qualifiers

            • research-article

            Acceptance Rates

            Overall Acceptance Rate130of553submissions,24%

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader