Abstract
Social media has drastically entered into a new concept by empowering people to publish their data along with their locations in order to provide benefits to the community and the country overall. There is a significant increase in the use of geosocial networks, such as Twitter, Facebook, Foursquare, and Flickr. Therefore, people worldwide can now voice their opinion, report an event instantly, and connect with others while sharing their views. Thus, geosocial network data provides full information on human current trends in terms of behavior, lifestyle, incidents and events, disasters, current medical infections, and much more with respect to location. Hence, current geosocial media can serve as data assets for countries and their government by analyzing geosocial data in a real time. However, there are millions of geosocial network users who generate terabytes of heterogeneous data with a variety of information every day and at high speed; such information is called “Big Data.” Analyzing such a significant amount of data and making real-time decisions regarding event detection is a challenging task. Therefore, in this paper, we propose an efficient system for exploring geosocial networks while harvesting data in order to make real-time decisions while detecting various events. A novel system architecture is proposed and implemented in a real environment in order to process an abundant amount of various social network data to monitor Earth events, incidents, medical diseases, user trends, and views to make future real-time decisions and facilitate future planning. The proposed system consists of five layers, i.e., data collection, data processing, application, communication, and data storage. The system deploys Spark at the top of the Hadoop ecosystem to run a real-time analysis. Twitter and Flickr data are analyzed using the proposed architecture in order to identify current events or disasters, such as earthquakes, fires, Ebola virus contagion, and snow. The system is evaluated on the Tweeter’s data by considering the recent earthquake detection occurred in New Zealand. The system is also evaluated with respect to efficiency while considering system throughput on large datasets. We prove that the system has higher throughput and is capable of analyzing a huge amount of geosocial network data at a real time while detecting any event.
















Similar content being viewed by others
References
140kit (2016) https://github.com/WebEcologyProject/140kit, Accessed on 19 Feb 2016
Anand Paul, Awais Ahmad, M Mazhar Rathore, Sohail Jabbar (2016) Smartbuddy: defining human behaviors using big data analytics in social internet of things. IEEE Wirel Commun 23(5):68–74
Archive Team (2016) The Twitter Stream Grab. Retrieved from: https://archive.org/details/twitterstream. Accessed on 19 Feb 2016
Becker RA, Caceres R, Hanson K, Loh JM, Urbanek S, Varshavsky A, Volinsky C (2011) A tale of one city: using cellular network data for urban planning. IEEE Pervasive Computing 10(4):18–26
Bobadilla J, Ortega F, Hernando A, Guti_errez A (2013) Recommender systems survey. Knowl-Based Syst 46:109–132
Bodnar T, Dering ML, Tucker C, Hopkinson KM (2016) Using large-scale social media networks as a scalable sensing system for modeling real-time energy utilization patterns. IEEE Trans Syst Man, Cybern Syst PP(99):1–14
Cheng Z, Caverlee J, Lee K, Sui DZ (2011) Exploring millions of footprints in location sharing services. ICWSM, vol 2011:81–88
Chow J Bao, Mokbel M (2010) Towards location-based social networking services. In proceedings of the 2nd ACM SIGSPATIAL International Workshop on location based social networks, pp. 31-38, ACM
Crooks A Croitoru A Stefanidis, Radzikowski J (2012) #Earthquake: Twitter as a distributed sensor system, Transactions in GIS
Eriksson B, Barford P, Sommers J, and Nowak R (2010) A learning-based approach for IP geolocation. In Krishnamurthy A and Plattner B (eds) Passive and active measurement. Berlin, Springer Lecture Notes in Computer Science 6032: 171–80
Ferrari L, Rosi A, Mamei M and Zambonelli F (2011) Extracting urban patterns from location-based social networks. In Proc. of the 3rd ACM LBSN
Fielding RT (2000) “Architectural styles and the design of network-based software architectures,” Thesis (Ph.D.). University of California, Irvine
Haklay M (2010) How good is volunteered geographical information? A comparative study of OpenStreetMap and ORDNANCE Survey datasets. Environ Plann B, Plann Des 37(4):682
Hefez, Kanza Y, Levin R (2011) Tarsius: A system for traffic-aware route search under conditions of uncertainty. In SIGSPATIAL'11, pages 517–520, Chicago
Hern (2013) Online volunteers map Philippines after typhoon Haiyan, The Guardian, [online] 15 Nov Retrived from: http://www.theguardian.com/technology/2013/nov/15/online-volunteers-mapphilippines-after-typhoon-haiyan Accessed:15th November 2013,
Hootsuite (2016) hootsuite.com, Accessed on 19 Feb 2016
Kanza Y, Kravi E and Motchan U (2014) City nexus: Discovering pairs of jointly-visited locations based on geo-tagged posts in social networks. In SIGSPATIAL'14, pages 597–600, Dallas
Lan R, Lieberman MD and Samet H (2012) The picture of health: map-based, collaborative spatio-temporal disease tracking. In HealthGIS'12, pages 27–35, Redondo Beach
Levin R, Kanza Y (2014) Tars: traffic-aware route search. GeoInformatica 18(3):461–500
MAPD Twitter (2016) “http://tweetmap.mapd.com/”, Accessed on 19 Feb 2016
Marco A et al (2012) Landmark-assisted location and tracking in outdoor mobile network. Multimedia Tools and Applications 59(1):89–111
Middleton SE, Middleton L, Modafferi S (2014) Real-time crisis mapping of natural disasters using social media. IEEE Intelligent Systems 29(2):9–17
O'Connor, M, Krieger, Ahn D (2010) Tweetmotif: Exploratory search and topic summarization for twitter. In ICWSM
Papadimitriou P, Symeonidis, Manolopoulos Y (2011) Geo-social recommendations. In ACM Recommender Systems 2011 (RecSys) Workshop on Personalization in Mobile Applications
Poese I, Uhlig S, Kaafar MA, Donnet B, Gueye B (2011) IP Geolocation databases: Unreliable? Computer Communication Review 4(2):53–56
Rathore MM, Ahmad A, Paul A, Wan J, Zhang D (2016a) Real-time medical emergency response system: exploiting IoT and big data for public health. J Med Syst 40(12):283
Rathore MM, Ahmad A, Paul A (2016b) Real time intrusion detection system for ultra-high-speed big data environments. J Supercomput 72(9):3489–3510
Ratti S, Williams D, Frenchman, Pulselli R (2006) Mobile landscapes: using location data from cell phones for urban analysis. Environment and Planning B Planning and Design 33(5):727
Lan Rongjian, Adelfio MD, Samet H (2014) Spatio-temporal disease tracking using news articles. In HealthGIS'14, pages 31–38, Dallas
Shafiq MZ, Ji L, Liu AX, Pang J, Wang J (2015) Geospatial and temporal dynamics of application usage in cellular data networks. IEEE Trans Mob Comput 14(7):1369–1381
Stefanidis A, Crooks A, Radzikowski J (2012) Harvesting ambient geospatial information from social media feeds. GeoJournal:1–20
Waller LA, Gotway CA (2004) Applied spatial statistics for public health data, volume 368. John Wiley & Sons, Hoboken
Xia C, Schwartz, R, Xie Ke, Krebs A, Langdon A, Ting J, Naaman M (2014) CityBeat: real-time social media visualization of hyper-local city data. In Proc. of International World Wide Web Conference Committee ((IW3C2), WWW’14, April 7–11, 2014, Seoul, Korea
Zheng Y, Chen Y, Xie X and Ma W (2009) GeoLife 2.0: a location-based social networking service. In mobile data management: systems, services and middleware, 2009. MDM'09. Tenth international Conference on, pp. 357-358, IEEE
Zhou X, Chen L (2014) Event detection over twitter social media streams. VLDB J 23(3):381–400
Zook M, Graham M, Shelton T, Gorman S (2010) Volunteered geographic information and crowdsourcing disaster relief: a case study of the Haitian earthquake. World Medical & Health Policy 2(2):7–33
Acknowledgements
This work is also supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIP) (NRF- 2016R1A2A1A05005459).
Author information
Authors and Affiliations
Corresponding authors
Rights and permissions
About this article
Cite this article
Mazhar Rathore, M., Ahmad, A., Paul, A. et al. Advanced computing model for geosocial media using big data analytics. Multimed Tools Appl 76, 24767–24787 (2017). https://doi.org/10.1007/s11042-017-4644-7
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-017-4644-7