skip to main content
10.1145/2666310.2666393acmconferencesArticle/Chapter ViewAbstractPublication PagesgisConference Proceedingsconference-collections
research-article

Online event clustering in temporal dimension

Published: 04 November 2014 Publication History

Abstract

This work is motivated by a real-life application that exploits sensor data available from traffic light control systems currently deployed in many cities around the world. Each sensor consists of an induction loop that generates a stream of events triggered whenever a metallic object e.g. car, bus, or a bicycle, is detected above the sensor. Because of the red phase of traffic lights objects are usually divided into groups that move together. Detecting these groups of objects as long as they pass through the sensor is useful for estimating the status of the toad networks such as car queue length or detecting traffic anomalies. In this work, given a data stream that contains observations of an event, e.g. detection of a moving object, together with the timestamps indicating when the events happen, we study the problem that clusters the events together in real-time based on the proximity of the event's occurrence time. We propose an efficient real-time algorithm that scales up to the large data streams extracted from thousands of sensors in the city of London. Moreover, our algorithm is better than the baseline algorithms in terms of clustering accuracy. We demonstrate motivations of the work by showing a real-life use-case in which clustering results are used for estimating the car queue lengths on the road and detecting traffic anomalies.

References

[1]
C. C. Aggarwal, J. Han, J. Wang, and P. S. Yu. A framework for clustering evolving data streams. In Proceedings of the 29th International Conference on Very Large Data Bases - Volume 29, VLDB '03, pages 81--92. VLDB Endowment, 2003.
[2]
J. Bacon, A. I. Bejan, A. R. Beresford, D. Evans, R. J. Gibbens, and K. Moody. Using real-time road traffic data to evaluate congestion. In C. B. Jones and J. L. Lloyd, editors, Dependable and Historic Computing, volume 6875 of Lecture Notes in Computer Science, pages 93--117. Springer, 2011.
[3]
M. Basseville and I. V. Nikiforov. Detection of Abrupt Changes: Theory and Application. Prentice-Hall, Inc., Upper Saddle River, NJ, USA, 1993.
[4]
J. Beringer and E. Hüllermeier. Online clustering of parallel data streams. Data Knowl. Eng., 58(2):180--204, 2006.
[5]
A. Bifet, G. Holmes, B. Pfahringer, J. Read, P. Kranen, H. Kremer, T. Jansen, and T. Seidl. Moa: A real-time analytics open source framework. In D. Gunopulos, T. Hofmann, D. Malerba, and M. Vazirgiannis, editors, ECML/PKDD (3), volume 6913 of Lecture Notes in Computer Science, pages 617--620. Springer, 2011.
[6]
J. de Andrade Silva, E. R. Faria, R. C. Barros, E. R. Hruschka, A. C. P. L. F. de Carvalho, and J. Gama. Data stream clustering: A survey. ACM Comput. Surv., 46(1):13, 2013.
[7]
S. Guha, A. Meyerson, N. Mishra, R. Motwani, and L. O'Callaghan. Clustering data streams: Theory and practice. IEEE Trans. on Knowl. and Data Eng., 15(3):515--528, Mar. 2003.
[8]
R. Herring, P. Abbeel, A. Hofleitner, and A. Bayen. Estimating arterial traffic conditions using sparse probe data, 2010.
[9]
R. Herring, A. Hofleitner, P. Abbeel, and A. Bayen. Estimating arterial traffic conditions using sparse probe data. In Intelligent Transportation Systems (ITSC), 2010 13th International IEEE Conference on, pages 929--936, Sept 2010.
[10]
X. Jeff Ban, P. Hao, and Z. Sun. Real time queue length estimation for signalized intersections using travel times from mobile sensors. Transportation Research Part C: Emerging Technologies, Feb. 2011.
[11]
P. Kranen, I. Assent, C. Baldauf, and T. Seidl. The clustree: indexing micro-clusters for anytime stream mining. Knowl. Inf. Syst., 29(2):249--272, 2011.
[12]
H. X. Liu, X. Wu, W. Ma, and H. Hu. Real-time queue length estimation for congested signalized intersections. Transportation Research Part C: Emerging Technologies, 17(4):412--427, Aug. 2009.
[13]
L. O'Callaghan, N. Mishra, A. Meyerson, S. Guha, and R. Motwani. Streaming-data algorithms for high-quality clustering. In Proceedings of IEEE International Conference on Data Engineering, page 685, 2001.
[14]
T. Zhang, R. Ramakrishnan, and M. Livny. Birch: An efficient data clustering method for very large databases. In Proceedings of the 1996 ACM SIGMOD International Conference on Management of Data, SIGMOD '96, pages 103--114, New York, NY, USA, 1996. ACM.
[15]
A. Zhou, F. Cao, W. Qian, and C. Jin. Tracking clusters in evolving data streams over sliding windows. Knowl. Inf. Syst., 15(2):181--214, May 2008.

Index Terms

  1. Online event clustering in temporal dimension

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    SIGSPATIAL '14: Proceedings of the 22nd ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems
    November 2014
    651 pages
    ISBN:9781450331319
    DOI:10.1145/2666310
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 04 November 2014

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. SCOOT data
    2. clustering algorithms
    3. data stream
    4. real-time monitoring
    5. sensor network
    6. social good
    7. transportation

    Qualifiers

    • Research-article

    Conference

    SIGSPATIAL '14
    Sponsor:
    • University of North Texas
    • Microsoft
    • ORACLE
    • Facebook
    • SIGSPATIAL

    Acceptance Rates

    SIGSPATIAL '14 Paper Acceptance Rate 39 of 184 submissions, 21%;
    Overall Acceptance Rate 257 of 1,238 submissions, 21%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 190
      Total Downloads
    • Downloads (Last 12 months)1
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 28 Feb 2025

    Other Metrics

    Citations

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media