skip to main content
research-article

Discovering Mobile Application Usage Patterns from a Large-Scale Dataset

Published:27 June 2018Publication History
Skip Abstract Section

Abstract

The discovering of patterns regarding how, when, and where users interact with mobile applications reveals important insights for mobile service providers. In this work, we exploit for the first time a real and large-scale dataset representing the records of mobile application usage of 5,342 users during 2014. The data was collected by a software agent, installed at the users’ smartphones, which monitors detailed usage of applications. First, we look for general patterns of how users access some of the most popular mobile applications in terms of frequency, duration, diversity, and data traffic. Next, we mine the dataset looking for temporal patterns in terms of when and how often accesses occur. Finally, we exploit the location of each access to detect users’ points of interest and location-based communities. Based on the results, we derive a model to generate synthetic datasets of mobile application usage and evaluate solutions to predict the next application to be launched. We also discuss a series of implications of the findings regarding telecommunication services, mobile advertisements, and smart cities. This is the first time this dataset is used, and we also make it publicly available for other researchers.

References

  1. Moustafa Alzantot, Supriyo Chakraborty, and Mani Srivastava. 2017. SenseGen: A deep learning architecture for synthetic sensor data generation. In Proceedings of IEEE International Conference on Pervasive Computing and Communications Workshops (PerCom Workshops). IEEE, 188--193.Google ScholarGoogle ScholarCross RefCross Ref
  2. Jason W. Anderson, K. E. Kennedy, Linh B. Ngo, Andre Luckow, and Amy W. Apon. 2014. Synthetic data generation for the internet of things. In Proceedings of IEEE International Conference on Big Data. IEEE, 171--176.Google ScholarGoogle Scholar
  3. Alex Aravind, Hassan Tahir, and Baldeep. 2008. RealMobiSim: Realistic mobility simulator and analyzer. In Proceedings of the 3rd ACM Workshop on Performance Monitoring and Measurement of Heterogeneous Wireless and Wired Networks. ACM, 186--189. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Yakov Bart, Andrew T. Stephen, and Miklos Sarvary. 2014. Which products are best suited to mobile advertising? A field study of mobile display advertising effects on consumer attitudes and intentions. Journal of Marketing Research 51, 3 (2014), 270--285.Google ScholarGoogle ScholarCross RefCross Ref
  5. M. Batty, K. W. Axhausen, F. Giannotti, A. Pozdnoukhov, A. Bazzani, M. Wachowicz, G. Ouzounis, and Y. Portugali. 2012. Smart cities of the future. The European Physical Journal Special Topics 214, 1 (2012), 481--518.Google ScholarGoogle ScholarCross RefCross Ref
  6. Rainer Baumann, Franck Legendre, and Philipp Sommer. 2008. Generic mobility simulation framework (GMSF). In Proceedings of the 1st ACM SIGMOBILE Workshop on Mobility Models. ACM, 49--56. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Fabrício Benevenuto, Tiago Rodrigues, Meeyoung Cha, and Virgílio Almeida. 2012. Characterizing user navigation and interactions in online social networks. Information Sciences 195 (2012), 1--24. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Mate Boban and Tiago T. V. Vinhoza. 2011. Modeling and simulation of vehicular networks: Towards realistic and efficient models. In Proceedings of Mobile Ad-Hoc Networks: Applications. In Tech.Google ScholarGoogle Scholar
  9. Francesco Calabrese, Francisco C. Pereira, Giusy Di Lorenzo, Liang Liu, and Carlo Ratti. 2010. The geography of taste: Analyzing cell-phone mobility and social events. In Proceedings of International Conference on Pervasive Computing. Springer, 22--37. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Juan Pablo Carrascal and Karen Church. 2015. An in-situ study of mobile app 8 mobile search interactions. In Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems. ACM, 2739--2748. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Gokul Chittaranjan, Jan Blom, and Daniel Gatica-Perez. 2011. Who’s who with big-five: Analyzing and classifying personality traits with smartphones. In Proceedings of the 15th Annual International Symposium on Wearable Computers. IEEE, 29--36. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Gokul Chittaranjan, Jan Blom, and Daniel Gatica-Perez. 2013. Mining large-scale smartphone data for personality studies. Personal and Ubiquitous Computing 17, 3 (2013), 433--450. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Karen Church, Denzil Ferreira, Nikola Banovic, and Kent Lyons. 2015. Understanding the challenges of mobile phone usage data. In Proceedings of the 17th International Conference on Human-Computer Interaction with Mobile Devices and Services. ACM, 504--514. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Rafael de A. Oliveira, Wladmir Cardoso Brandão, and Humberto Torres Marques-Neto. 2015. Characterizing user behavior on a mobile SMS-based chat service. In Proceedings of the 33rd Brazilian Symposium on Computer Networks and Distributed Systems (SBRC’15). IEEE, 130--139. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Trinh Minh Tri Do, Jan Blom, and Daniel Gatica-Perez. 2011. Smartphone usage in the wild: A large-scale analysis of applications and context. In Proceedings of the 13th International Conference on Multimodal Interfaces. ACM, 353--360. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Martin Ester, Hans-Peter Kriegel, Jörg Sander, Xiaowei Xu, et al. 1996. A density-based algorithm for discovering clusters in large spatial databases with noise. In Proceedings of KDD, Vol. 96. ACM, 226--231. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Aleksandr Farseev, Liqiang Nie, Mohammad Akbari, and Tat-Seng Chua. 2015. Harvesting multiple sources for user profile learning: A big data study. In Proceedings of the 5th ACM on International Conference on Multimedia Retrieval. ACM, 235--242. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Raihana Ferdous, Venet Osmani, and Oscar Mayora. 2015. Smartphone app usage as a predictor of perceived stress levels at workplace. In Proceedings of the 9th International Conference on Pervasive Computing Technologies for Healthcare. ICST (Institute for Computer Sciences, Social-Informatics and Telecommunications Engineering), 225--228. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Miriam Fernandez, Arno Scharl, Kalina Bontcheva, and Harith Alani. 2014. User profile modelling in online communities. In Proceedings of the 3rd International Conference on Semantic Web Collaborative Spaces-Volume 1275. CEUR-WS, 1--15. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Pierdomenico Fiadino, Pedro Casas, Mirko Schiavone, and Alessandro D’Alconzo. 2015. Online social networks anatomy: On the analysis of Facebook and Whatsapp in cellular networks. In Proceedings of IFIP Networking Conference (IFIP Networking’15). IEEE, 1--9.Google ScholarGoogle ScholarCross RefCross Ref
  21. André Luiz Firmino Alves, Cláudio de Souza Baptista, Anderson Almeida Firmino, Maxwell Guimarães de Oliveira, and Anselmo Cardoso de Paiva. 2014. A comparison of SVM versus naive-bayes techniques for sentiment analysis in tweets: A case study with the 2013 FIFA confederations cup. In Proceedings of the 20th Brazilian Symposium on Multimedia and the Web. ACM, 123--130. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Eugenio Giordano, Enzo De Sena, Giovanni Pau, and Mario Gerla. 2010. Vergilius: A scenario generator for vanet. In Proceedings of 71st Vehicular Technology Conference (VTC’10-Spring). IEEE, 1--5.Google ScholarGoogle ScholarCross RefCross Ref
  23. Ian Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio. 2014. Generative adversarial nets. In Proceedings of Advances in Neural Information Processing Systems. MIT Press, 2672--2680. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Liang Hong, Yu Zheng, Duncan Yung, Jingbo Shang, and Lei Zou. 2015. Detecting urban black holes based on human mobility data. In Proceedings of the 23rd SIGSPATIAL International Conference on Advances in Geographic Information Systems. ACM, 35. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Ke Huang, Chunhui Zhang, Xiaoxiao Ma, and Guanling Chen. 2012. Predicting mobile application usage using contextual information. In Proceedings of the 2012 ACM Conference on Ubiquitous Computing. ACM, 1059--1065. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Md Shahadat Iqbal, Charisma F. Choudhury, Pu Wang, and Marta C. González. 2014. Development of origin--Destination matrices using mobile phone call data. Transportation Research Part C: Emerging Technologies 40 (2014), 63--74.Google ScholarGoogle ScholarCross RefCross Ref
  27. Sibren Isaacman, Richard Becker, Ramón Cáceres, Margaret Martonosi, James Rowland, Alexander Varshavsky, and Walter Willinger. 2012. Human mobility modeling at metropolitan scales. In Proceedings of the 10th International Conference on Mobile Systems, Applications, and Services. ACM, 239--252. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. Mayu Iwata, Hiroki Miyamoto, Takahiro Hara, Daijiro Komaki, Kentaro Shimatani, Tomohiro Mashita, Kiyoshi Kiyokawa, Toshiaki Uemukai, Gen Hattori, Shojiro Nishio, et al. 2013. A content search system considering the activity and context of a mobile user. Personal and Ubiquitous Computing 17, 5 (2013), 1035--1050. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Raj Jain. 1991. The Art of Computer Systems Performance Analysis - Techniques for Experimental Design, Measurement, Simulation, and Modeling. Wiley. I--XXVII, pp. 1--685.Google ScholarGoogle Scholar
  30. Long Jin, Yang Chen, Tianyi Wang, Pan Hui, and Athanasios V. Vasilakos. 2013. Understanding user behavior in online social networks: A survey. IEEE Communications Magazine 51, 9 (2013), 144--150.Google ScholarGoogle ScholarCross RefCross Ref
  31. Sokol Kosta, Alessandro Mei, and Julinda Stefa. 2014. Large-scale synthetic social mobile networks with SWIM. IEEE Transactions on Mobile Computing 13, 1 (2014), 116--129. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. Juha K. Laurila, Daniel Gatica-Perez, Imad Aad, Olivier Bornet, Trinh-Minh-Tri Do, Olivier Dousse, Julien Eberle, Markus Miettinen, et al. 2012. The mobile data challenge: Big data for mobile computing research. In Proceedings of Pervasive Computing. EPFL.Google ScholarGoogle Scholar
  33. Yannick Leo, Anthony Busson, Carlos Sarraute, and Eric Fleury. 2016. Call detail records to characterize usages and mobility events of phone users. Computer Communications 95 (2016), 43--53.Google ScholarGoogle ScholarCross RefCross Ref
  34. Cynthia LeRouge and Jiao Ma. 2010. User profiles and personas in consumer health technologies. In Proceedings of the 43rd Hawaii International Conference on System Sciences (HICSS’10). IEEE, 1--10. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. Huoran Li, Xuan Lu, Xuanzhe Liu, Tao Xie, Kaigui Bian, Felix Xiaozhu Lin, Qiaozhu Mei, and Feng Feng. 2015. Characterizing smartphone usage patterns from millions of android users. In Proceedings of the 2015 ACM Conference on Internet Measurement Conference. ACM, 459--472. Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. Quannan Li, Yu Zheng, Xing Xie, Yukun Chen, Wenyu Liu, and Wei-Ying Ma. 2008. Mining user similarity based on location history. In Proceedings of the 16th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems. ACM, 34. Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. Keun-Woo Lim, Stefano Secci, Lionel Tabourier, and Badis Tebbani. 2016. Characterizing and predicting mobile application usage. Computer Communications 95 (2016), 82--94.Google ScholarGoogle ScholarCross RefCross Ref
  38. Bojin Liu, Behrooz Khorashadi, Haining Du, Dipak Ghosal, Chen-Nee Chuah, and Michael Zhang. 2009. VGSim: An integrated networking and microscopic vehicular mobility simulation platform. IEEE Communications Magazine 47, 5 (2009), 134--141. Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. Eric Malmi and Ingmar Weber. 2016. You are what apps you use: Demographic prediction based on user’s apps. In Proceedings of the International AAAI Conference on Web and Social Media (ICWSM’16). 635--638.Google ScholarGoogle Scholar
  40. Zijian Ming, Chunjie Luo, Wanling Gao, Rui Han, Qiang Yang, Lei Wang, and Jianfeng Zhan. 2013. BDGS: A scalable big data generator suite in big data benchmarking. In Proceedings of Workshop on Big Data Benchmarks. Springer, 138--154.Google ScholarGoogle Scholar
  41. Darakhshan J. Mir, Sibren Isaacman, Ramón Cáceres, Margaret Martonosi, and Rebecca N. Wright. 2013. Dp-where: Differentially private modeling of human mobility. In Proceedings of IEEE International Conference on Big Data. IEEE, 580--588.Google ScholarGoogle Scholar
  42. Raul Montoliu, Jan Blom, and Daniel Gatica-Perez. 2013. Discovering places of interest in everyday life from smartphone data. Multimedia Tools and Applications 62, 1 (2013), 179--207.Google ScholarGoogle ScholarCross RefCross Ref
  43. Diala Naboulsi, Marco Fiore, Stephane Ribot, and Razvan Stanica. 2015. Large-scale mobile traffic analysis: A survey. IEEE Communications Surveys 8 Tutorials 18, 1 (2015), 124--161.Google ScholarGoogle Scholar
  44. Abhinav Parate, Matthias Böhmer, David Chu, Deepak Ganesan, and Benjamin M. Marlin. 2013. Practical prediction and prefetch for faster access to applications on mobile phones. In Proceedings of the 2013 ACM International Joint Conference on Pervasive and Ubiquitous Computing. ACM, 275--284. Google ScholarGoogle ScholarDigital LibraryDigital Library
  45. Marco Pavan, Stefano Mizzaro, and Ivan Scagnetto. 2017. Mining movement data to extract personal points of interest: A feature based approach. In Information Filtering and Retrieval. Springer, 35--61.Google ScholarGoogle Scholar
  46. Dan Pelleg, Andrew W. Moore, et al. 2000. X-means: Extending K-means with efficient estimation of the number of clusters. In Proceedings of International Conference on Machine Learning (ICML’2000), Vol. 1. ICML, 727--734. Google ScholarGoogle ScholarDigital LibraryDigital Library
  47. Yoann Pigné, Grégoire Danoy, and Pascal Bouvry. 2011. A vehicular mobility model based on real traffic counting data. Nets4Cars/Nets4Trains 6596 (2011), 131--142. Google ScholarGoogle ScholarDigital LibraryDigital Library
  48. C. Carl Robusto. 1957. The cosine-haversine formula. The American Mathematical Monthly 64, 1 (1957), 38--40.Google ScholarGoogle ScholarCross RefCross Ref
  49. Christelle Scharff, Khadidiatou Ndiaye, Meghan Jordan, Aminata Niang Diene, and Fatou Maria Drame. 2015. Human mobility during religious festivals and its implications on public health in Senegal: A mobile dataset analysis. In Proceedings of Global Humanitarian Technology Conference (GHTC’15). IEEE, 108--113.Google ScholarGoogle ScholarCross RefCross Ref
  50. Suranga Seneviratne, Aruna Seneviratne, Prasant Mohapatra, and Anirban Mahanti. 2014. Predicting user traits from a snapshot of apps installed on a smartphone. ACM SIGMOBILE Mobile Computing and Communications Review 18, 2 (2014), 1--8. Google ScholarGoogle ScholarDigital LibraryDigital Library
  51. Claude Elwood Shannon. 2001. A mathematical theory of communication. ACM SIGMOBILE Mobile Computing and Communications Review 5, 1 (2001), 3--55. Google ScholarGoogle ScholarDigital LibraryDigital Library
  52. Choonsung Shin, Jin-Hyuk Hong, and Anind K. Dey. 2012. Understanding and prediction of mobile application usage for smart phones. In Proceedings of the 2012 ACM Conference on Ubiquitous Computing. ACM, 173--182. Google ScholarGoogle ScholarDigital LibraryDigital Library
  53. Vijay Srinivasan, Saeed Moghaddam, Abhishek Mukherji, Kiran K. Rachuri, Chenren Xu, and Emmanuel Munguia Tapia. 2014. Mobileminer: Mining your frequent patterns on your phone. In Proceedings of the 2014 ACM International Joint Conference on Pervasive and Ubiquitous Computing. ACM, 389--400. Google ScholarGoogle ScholarDigital LibraryDigital Library
  54. Teleco. 2017. Intelligence in Telecommunications. Retrieved June 2, 2017 from http://www.teleco.com.br/en/.Google ScholarGoogle Scholar
  55. Khoi-Nguyen Tran, Dinusha Vatsalan, and Peter Christen. 2013. GeCo: An online personal data generator and corruptor. In Proceedings of the 22nd ACM International Conference on Information 8 Knowledge Management. ACM, 2473--2476. Google ScholarGoogle ScholarDigital LibraryDigital Library
  56. Sandesh Uppoor, Oscar Trullols-Cruces, Marco Fiore, and Jose M. Barcelo-Ordinas. 2014. Generation and analysis of a large-scale urban vehicular mobility dataset. IEEE Transactions on Mobile Computing 13, 5 (2014), 1061--1075. Google ScholarGoogle ScholarDigital LibraryDigital Library
  57. Huandong Wang, Fengli Xu, Yong Li, Pengyu Zhang, and Depeng Jin. 2015. Understanding mobile traffic patterns of large scale cellular towers in urban environment. In Proceedings of the 2015 ACM Conference on Internet Measurement Conference. ACM, 225--238. Google ScholarGoogle ScholarDigital LibraryDigital Library
  58. Lei Wang, Jianfeng Zhan, Chunjie Luo, Yuqing Zhu, Qiang Yang, Yongqiang He, Wanling Gao, Zhen Jia, Yingjie Shi, Shujie Zhang, et al. 2014. Bigdatabench: A big data benchmark suite from internet services. In Proceedings of the 20th International Symposium on High Performance Computer Architecture (HPCA’14). IEEE, 488--499.Google ScholarGoogle ScholarCross RefCross Ref
  59. Xinwei Wang, Dezhi Wu, and Hock Hai Teo. 2014. Right message at right place: Investigating the effectiveness of location-based mobile advertising on consumers’ redemption responses. In Proceedings of the 35th International Conference on Information Systems. AIS.Google ScholarGoogle Scholar
  60. Faber Henrique Zacarias Xavier, Lucas Maia Silveira, Jussara Marques de Almeida, Artur Ziviani, Carlos Henrique Silva Malab, and Humberto Torres Marques-Neto. 2012. Analyzing the workload dynamics of a mobile phone network in large scale events. In Proceedings of the 1st Workshop on Urban Networking. ACM, 37--42. Google ScholarGoogle ScholarDigital LibraryDigital Library
  61. Runhua Xu, Remo Manuel Frey, Denis Vuckovac, and Alexander Ilic. 2015. Towards understanding the impact of personality traits on mobile app adoption-A scalable approach. In Proceedings of the 23rd European Conference on Information Systems. AIS.Google ScholarGoogle Scholar
  62. Tingxin Yan, David Chu, Deepak Ganesan, Aman Kansal, and Jie Liu. 2012. Fast app launching for mobile devices using predictive user context. In Proceedings of the 10th International Conference on Mobile Systems, Applications, and Services. ACM, 113--126. Google ScholarGoogle ScholarDigital LibraryDigital Library
  63. Jie Yang, Yuanyuan Qiao, Xinyu Zhang, Haiyang He, Fang Liu, and Gang Cheng. 2015. Characterizing user behavior in mobile internet. IEEE Transactions on Emerging Topics in Computing 3, 1 (2015), 95--106.Google ScholarGoogle ScholarCross RefCross Ref
  64. Lantao Yu, Weinan Zhang, Jun Wang, and Yong Yu. 2017. SeqGAN: Sequence generative adversarial nets with policy gradient. In Proceedings of AAAI. 2852--2858.Google ScholarGoogle ScholarCross RefCross Ref
  65. Xun Zou, Wangsheng Zhang, Shijian Li, and Gang Pan. 2013. Prophet: What app you wish to use next. In Proceedings of the 2013 ACM Conference on Pervasive and Ubiquitous Computing Adjunct Publication. ACM, 167--170. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Discovering Mobile Application Usage Patterns from a Large-Scale Dataset

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in

        Full Access

        • Published in

          cover image ACM Transactions on Knowledge Discovery from Data
          ACM Transactions on Knowledge Discovery from Data  Volume 12, Issue 5
          October 2018
          354 pages
          ISSN:1556-4681
          EISSN:1556-472X
          DOI:10.1145/3234931
          Issue’s Table of Contents

          Copyright © 2018 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 27 June 2018
          • Revised: 1 March 2018
          • Accepted: 1 March 2018
          • Received: 1 June 2017
          Published in tkdd Volume 12, Issue 5

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article
          • Research
          • Refereed

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader