Abstract
Real-world applications generate uncertain streams due to unreliable equipments and/or data processing such as object identification. However, application context implies specific rules, which are critical in cleaning data and make them closer to the reality. In this paper, we propose a framework for cleaning uncertain streams by Parallelized Probabilistic Graphical Models (P2GM). Making full use of multi-core processing architecture, the system processes parallelized high-volume streams efficiently. With P2GM, users can define their own cleaning algorithms and generate specific parallelized systems. We implement a prototype of video surveillance based on P2GM, and demonstrate the quality and performance of our approaches experimentally.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Kanagal, B., Deshpande, A.: Online Filtering, Smoothing and Probabilistic Modeling of Streaming data. In: ICDE 2008, Cancún, Mexico (2008)
Benjelloun, O., Sarma, A.D., Halevy, A., Widom, J.: ULDBs: databases with uncertainty and lineage. In: VLDB 2006, Seoul, Korea (2006)
Sen, P., Deshpande, A.: Representing and Querying Correlated Tuples in Probabilistic Databases. In: ICDE 2007, Istanbul, Turkey (2007)
PostgreSQL, http://www.postgresql.org/
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zhang, Q., Wang, S., Qin, B. (2010). Cleaning Uncertain Streams by Parallelized Probabilistic Graphical Models. In: Chen, L., Tang, C., Yang, J., Gao, Y. (eds) Web-Age Information Management. WAIM 2010. Lecture Notes in Computer Science, vol 6184. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-14246-8_28
Download citation
DOI: https://doi.org/10.1007/978-3-642-14246-8_28
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-14245-1
Online ISBN: 978-3-642-14246-8
eBook Packages: Computer ScienceComputer Science (R0)