Skip to main content

Automated Discovery of Mobile Users Locations with Improved K-means Clustering

  • Conference paper
Book cover Artificial Intelligence and Soft Computing (ICAISC 2015)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9120))

Included in the following conference series:

Abstract

Location is one of the most commonly used contextual information in mobile context-aware systems. It can be considered on many different levels of granularity, varying from geolocation that is based on GPS systems, up to microlocation that uses Bluetooth Low Energy devices and WiFi access points for locating users inside buildings. Most common use of location is navigation, however recently it is more often considered also as an important component of the user profile. One of the biggest challenges in location-based context-aware systems is the discovery of patterns in user transportation traces and extraction of the most often visited places. In this paper we presented and evaluated a method that allows for automatic extraction of clusters from user location traces. These clusters represents user points of interest like home, work, favourite restaurants, but also transportation routines. The original contribution of this work is a proposal of an approach based on the K-means clustering algorithm equipped with a module for automatic discovery of number of clusters and density-based cluster merging. This method allows for online, adaptable discovery of user points of interests, and transportation routines in mobile systems.

This work was funded by the National Science Centre, Poland as a part of the KnowMe project (reference number 2014/13/N/ST6/01786).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Alvares, L.O., Bogorny, V., Kuijpers, B., de Macedo, J.A.F., Moelans, B., Vaisman, A.: A model for enriching trajectories with semantic geographical information. In: Proceedings of the 15th Annual ACM International Symposium on Advances in Geographic Information Systems, GIS 2007, pp. 22:1–22:8. ACM, New York (2007), http://doi.acm.org/10.1145/1341012.1341041

    Chapter  Google Scholar 

  2. Ashbrook, D., Starner, T.: Using gps to learn significant locations and predict movement across multiple users. Personal Ubiquitous Comput. 7(5), 275–286 (2003), http://dx.doi.org/10.1007/s00779-003-0240-0

    Article  Google Scholar 

  3. Bobek, S., Porzycki, K., Nalepa, G.J.: Learning sensors usage patterns in mobile context-aware systems. In: Proceedings of the FedCSIS 2013 Conference, Krakow, pp. 993–998. IEEE (September 2013)

    Google Scholar 

  4. Debatty, T., Michiardi, P., Thonnard, O., Mees, W.: Determining the k in k-means with MapReduce. In: ICDT 2014, 17th International Conference on Database Theory, in conjunction with EDBT/ICDT 2014, Athens, Greece, March 24-28 (2014), http://www.eurecom.fr/publication/4366

  5. Dey, A.K., Mankoff, J.: Designing mediation for context-aware applications. ACM Trans. Comput.-Hum. Interact. 12(1), 53–80 (2005), http://doi.acm.org/10.1145/1057237.1057241

    Article  Google Scholar 

  6. Ester, M., Peter Kriegel, H., Sander, J., Xu, X.: A density-based algorithm for discovering clusters in large spatial databases with noise, pp. 226–231. AAAI Press (1996)

    Google Scholar 

  7. Ferreira, D.: AWARE: A mobile context instrumentation middleware to collaboratively understand human behavior. Ph.D. thesis (2013)

    Google Scholar 

  8. Flach, P.: Machine Learning: The art and science of algorithms that make sense of data. Cambridge University Press (September 2012)

    Google Scholar 

  9. Foundation, A.S.: Apache Mahout, https://mahout.apache.org/

  10. Hamerly, G., Elkan, C.: Learning the k in k-means. In: Neural Information Processing Systems, p. 2003. MIT Press (2003)

    Google Scholar 

  11. Hartigan, J.A., Wong, M.A.: Algorithm AS 136: A K-Means Clustering Algorithm. Applied Statistics 28(1), 100–108 (1979), http://dx.doi.org/10.2307/2346830

    Article  Google Scholar 

  12. Kang, J.H., Welbourne, W., Stewart, B., Borriello, G.: Extracting places from traces of locations. In: Proceedings of the 2nd ACM International Workshop on Wireless Mobile Applications and Services on WLAN Hotspots, WMASH 2004, pp. 110–118. ACM, New York (2004), http://doi.acm.org/10.1145/1024733.1024748

    Google Scholar 

  13. Leung, K.W.T., Lee, D.L., Lee, W.C.: Clr: A collaborative location recommendation framework based on co-clustering. In: Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2011, pp. 305–314. ACM, New York (2011), http://doi.acm.org/10.1145/2009916.2009960

    Google Scholar 

  14. Lloyd, S.: Least squares quantization in pcm. IEEE Transactions on Information Theory 28(2), 129–137 (1982)

    Article  MathSciNet  Google Scholar 

  15. MacQueen, J.: Some methods for classification and analysis of multivariate observations. In: Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability. Statistics, vol. 1, pp. 281–297. University of California Press, Berkeley (1967), http://projecteuclid.org/euclid.bsmsp/1200512992

    Google Scholar 

  16. Mahalanobis, P.C.: On the generalised distance in statistics. Proceedings of the National Institute of Science 2, 49–55 (1936), http://ir.isical.ac.in/dspace/handle/1/1268

    Google Scholar 

  17. Montoliu, R., Gatica-Perez, D.: Discovering human places of interest from multimodal mobile phone data. In: Proceedings of the 9th International Conference on Mobile and Ubiquitous Multimedia, MUM 2010, pp. 12:1–12:10. ACM, New York (2010), http://doi.acm.org/10.1145/1899475.1899487

    Google Scholar 

  18. Nalepa, G.J., Bobek, S., Ligęza, A., Kaczor, K.: Algorithms for rule inference in modularized rule bases. In: Bassiliades, N., Governatori, G., Paschke, A. (eds.) RuleML 2011 - Europe. LNCS, vol. 6826, pp. 305–312. Springer, Heidelberg (2011)

    Chapter  Google Scholar 

  19. Nalepa, G.J., Bobek, S.: Rule-based solution for context-aware reasoning on mobile devices. Computer Science and Information Systems 11(1), 171–193 (2014)

    Article  Google Scholar 

  20. Palma, A.T., Bogorny, V., Kuijpers, B., Alvares, L.O.: A clustering-based approach for discovering interesting places in trajectories. In: Proceedings of the 2008 ACM Symposium on Applied Computing, SAC 2008, pp. 863–868. ACM, New York (2008), http://doi.acm.org/10.1145/1363686.1363886

  21. Pelleg, D., Moore, A.: X-means: Extending k-means with efficient estimation of the number of clusters. In: Proceedings of the 17th International Conf. on Machine Learning, pp. 727–734. Morgan Kaufmann (2000)

    Google Scholar 

  22. Shindler, M., Wong, A., Meyerson, A.: Fast and accurate k-means for large datasets. In: Shawe-Taylor, J., Zemel, R.S., Bartlett, P.L., Pereira, F.C.N., Weinberger, K.Q. (eds.) NIPS, pp. 2375–2383 (2011), http://dblp.uni-trier.de/db/conf/nips/nips2011.html#ShindlerWM11

  23. Sugar, C.A., James, G.M.: Finding the number of clusters in a data set: An information theoretic approach. Journal of the American Statistical Association 98, 750–763 (2003)

    Article  MathSciNet  Google Scholar 

  24. Tibshirani, R., Walther, G., Hastie, T.: Estimating the number of clusters in a dataset via the gap statistic 63, 411–423 (2000)

    Google Scholar 

  25. Wang, J., Ghosh, R., Das, S.: A survey on sensor localization. Journal of Control Theory and Applications 8(1), 2–11 (2010), http://dx.doi.org/10.1007/s11768-010-9187-7

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Szymon Bobek .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Bobek, S., Nalepa, G.J., Grodzki, O. (2015). Automated Discovery of Mobile Users Locations with Improved K-means Clustering. In: Rutkowski, L., Korytkowski, M., Scherer, R., Tadeusiewicz, R., Zadeh, L., Zurada, J. (eds) Artificial Intelligence and Soft Computing. ICAISC 2015. Lecture Notes in Computer Science(), vol 9120. Springer, Cham. https://doi.org/10.1007/978-3-319-19369-4_50

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-19369-4_50

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-19368-7

  • Online ISBN: 978-3-319-19369-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics