skip to main content
10.1145/2926693.2929903acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
research-article

TrailMarker: Automatic Mining of Geographical Complex Sequences

Published:14 June 2016Publication History

ABSTRACT

Given a huge collection of vehicle sensor data consisting of d sensors for w trajectories of duration n, which are accompanied by geographical information, how can we find patterns, rules and outliers? How can we efficiently and effectively find typical patterns and points of variation? In this paper we present TRAILMARKER, a fully automatic mining algorithm for geographical complex sequences. Our method has the following properties: (a) effective: it finds important patterns and outliers in real datasets; (b) scalable: it is linear with respect to the data size; (c) parameter-free: it is fully automatic, and requires no prior training, and no parameter tuning. Extensive experiments on real data demonstrate that TRAILMARKER finds interesting and unexpected patterns and groups accurately. In fact, TRAILMARKER consistently outperforms the best state-of-the-art methods in terms of both accuracy and execution speed.

References

  1. Y. Cai and R. T. Ng. Indexing spatio-temporal trajectories with chebyshev polynomials. In Proceedings of ACM SIGMOD, pages 599--610, Paris, France, June 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. E. J. Keogh, S. Chu, D. Hart, and M. J. Pazzani. An online algorithm for segmenting time series. In ICDM, pages 289--296, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. J.-G. Lee, J. Han, and K.-Y. Whang. Trajectory clustering: a partition-and-group framework. In SIGMOD, pages 593--604, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Y. Matsubara, Y. Sakurai, and C. Faloutsos. Autoplait: Automatic mining of co-evolving time sequences. In SIGMOD, pages 193--204, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Y. Matsubara, Y. Sakurai, and C. Faloutsos. The web as a jungle: Non-linear dynamical systems for co-evolving online activities. In WWW, pages 721--731, 2015. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Y. Matsubara, Y. Sakurai, and C. Faloutsos. Non-linear mining of competing local activities. In WWW, 2016. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Y. Matsubara, Y. Sakurai, C. Faloutsos, T. Iwata, and M. Yoshikawa. Fast mining and forecasting of complex time-stamped events. In KDD, pages 271--279, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Y. Matsubara, Y. Sakurai, W. G. van Panhuis, and C. Faloutsos. FUNNEL: automatic mining of spatially coevolving epidemics. In KDD, pages 105--114, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. M. Vlachos, D. Gunopulos, and G. Kollios. Discovering similar multidimensional trajectories. In ICDE, pages 673--684, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. P. Wang, H. Wang, and W. Wang. Finding semantics in time series. In SIGMOD Conference, pages 385--396, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. J. G. Wilpon, L. R. Rabiner, C. H. Lee, and E. R. Goldman. Automatic recognition of keywords in unconstrained speech using hidden Markov models. IEEE Transactions on Acoustics, Speech, and Signal Processing, 38(11):1870--1878, 1990.Google ScholarGoogle Scholar

Index Terms

  1. TrailMarker: Automatic Mining of Geographical Complex Sequences

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      SIGMOD'16 PhD: Proceedings of the 2016 on SIGMOD'16 PhD Symposium
      June 2016
      54 pages
      ISBN:9781450341929
      DOI:10.1145/2926693

      Copyright © 2016 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 14 June 2016

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      SIGMOD'16 PhD Paper Acceptance Rate9of10submissions,90%Overall Acceptance Rate40of60submissions,67%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader