skip to main content
10.1145/3397056.3397068acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicgdaConference Proceedingsconference-collections
research-article

Clustering Algorithms for Spatial Data Mining

Published:01 July 2020Publication History

ABSTRACT

With the advances in mobile and wireless technologies, there has been a rise in applications that track and share the users' geospatial data. People use several social networking sites such as Twitter, Facebook and Flickr, where they share their status updates. With the integration of Global Positioning System (GPS) with mobile phones, it is now possible to share one's locations on these social networks. GPS allows us to record and track a person's movement along with the timestamp. The data set obtained from these GPS logs is vast and is widely used to analyze the users' movement patterns. Specifically, we can find out significant locations based on the number of users present at that location and the time spent by them at such places. Once significant places have been identified, it is also possible to identify the semantic importance of these locations. This paper presents an overview of the clustering techniques used to find important places of interest using large GPS based mobility datasets. Four clustering algorithms, K-Means, DBSCAN, OPTICS and Hierarchical, are implemented, and performance is tested using real-time data of 50 users collected over 2--5 years. Performance summary depicts that K-Means and DBSCAN perform well for spatial data.

References

  1. Mingqi Lv, Ling Chen, Zhenxing Xu, Yinglong Li, Gencai Chen, "The discovery of personally semantic places based on trajectory data mining", Neurocomputing, 2015 https://doi.org/10.1016/j.neucom.2015.08.071Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. M. Ester, H.-P. Kriegel, J. Sander and X. Xu, "A Density Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise", Proc. Second International Conference on Knowledge Discover and Data Mining (KDD), pp. 226--231, 1996.Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. M. Ankerst, M. Breunig, H-P. Kreigel and J. Sander, "OPTICS: Ordering Poits to Identify the Clustering Structure", Proc. ACM SIGMOD Int'l Conf. Management of Data, pp. 49--60, 1999.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Zhou, C., Frankowski, D., Ludford, P., Shekhar, S. and Terveen, L., "Discovering personal gazetteers: an interactive clustering approach", Proceedings of GIS, pp. 266--273. ACM, New York, 2004Google ScholarGoogle Scholar
  5. A.T. Palma, V. Bogorny, B. Kuijpers and L.O. Alvares, "A clustering-based approach for discovering interesting places in trajectories", Proceedings of SAC, 2008Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Steven Van Canneyt, Steven Schockaert, Olivier Van Laere and Bart Dhoedt "Detecting Places Of Interest using Social Media", 2012 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology, 2012Google ScholarGoogle Scholar
  7. Enrico Steiger, René Westerholt, Bernd Resch and Alexander Zipf, "Twitter as an indicator for whereabouts of people? Correlating Twitter with UK census data", Computers, Environment and Urban Systems, Volume 54, November 2015Google ScholarGoogle Scholar
  8. Gennady Andrienko, Natalia Andrienko, Christophe Hurter, Salvatore Rinzivillo and Stefan Wrobel, "Scalable Analysis of Movement Data for Extracting and Exploring Significant Places", IEEE Transactions on Visualization and Computer Graphics, Vol. 19, No. 7, July 2013Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Md Reaz Uddin, Chinya Ravishankar and Vassilis J. Tsotras, "Finding Regions of Interest from Trajectory Data", 12th International Conference on Mobile Data Management, 2011Google ScholarGoogle Scholar
  10. Tanusri Bhattacharya, Lars Kulik, James Bailey, "Automatically recognizing places of interest from unreliable GPS data using spatio-temporal density estimation and line intersections", Pervasive and Mobile Computing 19, 2015Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Miao Lin and Wen-Jing Hsu, "Mining GPS data for mobility patterns: A survey", Pervasive and Mobile Computing 12, 2014Google ScholarGoogle ScholarCross RefCross Ref
  12. Xin Cao, Gao Cong, Christian and S. Jensen, "Mining Significant Semantic Locations from GPS Data", Proceedings of the VLDB Endowment, Vol. 3, No. 1, 2010Google ScholarGoogle Scholar
  13. Peter J. Rousseeuw, "Silhouettes: A Graphical Aid to the Interpretation and Validation of Cluster Analysis", Computational and Applied Mathematics. 20: 53--65, 1987 https://doi.org/10.1016/0377-0427(87)90125-7Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Chen Guang-xue, Li Xiao-zhou, Chen Qi-feng and Li Xiaozhou, "Clustering Algorithms for Area Geographical Entities in Spatial Data Mining", Seventh International Conference on Fuzzy Systems and Knowledge Discovery, pp. 1630--1633, 2010Google ScholarGoogle ScholarCross RefCross Ref
  15. B. Rama, Jayashree P., Salim Jiwani, "A Survey on Clustering Current Status and Challenging Issues", (IJCSE) International Journal on Computer Science and Engineering Vol. 02, No. 9, pp. 2976--2980, 2010Google ScholarGoogle Scholar
  16. Amit Saxena, Mukesh Prasad, Akshansh Gupta, Neha Bharill, Om Prakash Patel, Aruna Tiwari, Mengjoo Er, Weiping Ding, Chin-Teng Lin, "A Review of Clustering Techniques and Developments", Neurocomputing, 2017, https://doi.org/10.1016/j.neucom.2017.06.053.Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Clustering Algorithms for Spatial Data Mining

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Other conferences
        ICGDA '20: Proceedings of the 2020 3rd International Conference on Geoinformatics and Data Analysis
        April 2020
        176 pages
        ISBN:9781450377416
        DOI:10.1145/3397056

        Copyright © 2020 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 1 July 2020

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article
        • Research
        • Refereed limited

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader