SocioScope: A framework for understanding Internet of Social Knowledge

https://doi.org/10.1016/j.future.2018.01.064Get rights and content

Abstract

Social data is one of important material for representing social activities. However, efficiently collecting social data from multiple sources is extremely hard due its big-data attributes. Moreover, discovering hidden patterns between social data to understand our society is actually a big challenge. Especially, knowledge around us is not stable but dynamically change over time, therefore, we need to refresh it promptly. SocioScope is created as an automatically tool to overcome aforementioned problems. In addition, other researchers can utilize SocioScope as a framework for reducing their effort time with essential tasks (i.e., collecting data, pre-processing data, and analyzing data).

Introduction

Due to the growth of Internet of Things (IoT), every devices (e.g., wireless sensor networks, GPS, control systems) are connected. Data is generated through every activities and it brings human into era of big data. From a statistics [1] in 2013, social data increased quickly in every sides (i.e., volume, velocity, variety) and reached 4.4 zettabytes. By mining hidden knowledge from this recourse, we can achieve huge benefits for our society. However, social data is still raw and exists without any context and analysis. It cannot be useful itself unless we process to obtain knowledge.

In addition, knowledge is dynamics due to the huge social data generating by time. People even do not know that their activities on the internet fortuitously create knowledge. As a statistics from [2] in 2016, around 7000 tweets and 2,500,000 mails is sent; about 700 photos is uploads to Instagram in every second. Although one second is only a very short time in the real world, it means quite a lot on the internet. In [3], authors shows that sentiment of social media users towards blackberry, iphone, and android devices consecutive changes in a short period of time. Besides, authors mention that there are total 41 local events in Occupy Wall Street (OWS) at New York City during only few hours [4]. Due to aforementioned reason, we need an automated system to obtain our social knowledge rather than conduct manually.

Therefore, we build SocioScope as a framework with three core roles: (i) collecting social data from different sources (e.g., social media, mass media, and sensor), (ii) analyzing social data for understanding its relationships and connections to obtain social information, and (iii) extracting hidden patterns to discover social knowledge [5]. From social knowledge, we can understand about societies around us. The scope of society is very dynamics and depends on the scope of social data that we get. Society can be a university, a company, or even a community.

In this section, we already gave the overview of our motivation. Section 2 provides the definition of Internet of Social Knowledge. Next, we will describe our system in Section 3. Further, the performance of SocioScope will be presented in Section 4. Finally, we conclude and discuss future work in Section 5.

Section snippets

Internet of social knowledge

In this section, we focus on providing detail definitions for clearly comprehending about Internet of Social Knowledge. In another study, authors point out that data is the beginning point of knowledge [6]. Further, Cooley [7] gives explanation that “information consists of organized data and when it is applied by people it may become knowledge”. Fig. 1 shows relationship of data, information, and knowledge under pyramid principle. Based on this idea, we propose definitions of social data,

Overview

We aim to give detail explanation about overview of our system in this section. SocioScope framework contains two services with are web service and background service (i.e., windows service or linux daemon) as shown in Fig. 4. Both of them are implemented by using Java programming language. We choose Model View Controller (MVC) as our programming pattern with Apache Tomcat Servlet Container and Java Server Pages technology for implementing web service side. The advantage of MVC is well

Performance

We use the computer with specifications as follows: Intel(R) Core(TM) i5-4590 CPU 3.30 GHz 12 GB RAM for the experiments. The performance of SocioScope is demonstrate in this section. There are three issues that we want to prove which are: (i) performance of crawling process, (ii) ability of MongoDB when dealing with big data, and (iii) time consuming for conducting data analysis tasks. Twitter is selected as the social data source. The limitation of Twitter API (180 calls every 15 min) is also

Conclusion

We propose SocioScope framework for collecting and analyzing social data to create social knowledge in this paper by providing various essential features. Output of SocialScope could be use as inputs for creating applications to understand our society (e.g., event detection [18], trust-based recommendation system [19]).

For enhancing the utility of our system, other data sources will be investigated for integrating into our system to enrich social data. Besides, we consider applying Hadoop,

Acknowledgment

This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIP) (NRF-2017R1A2B4010774).

Hoang Long Nguyen is a Ph.D. Student in Chung-Ang University, Korea since March 2016. He received the B.S. in Department of Computer Engineering from Ho Chi Minh City University of Technology, Vietnam in April 2013. And he received M.S. degrees in Department of Computer Engineering from Yeungnam University in Korea in February 2016. His research topics include knowledge engineering on social networks by using machine learning, semantic Web mining, and ambient intelligence.

References (19)

  • ErevellesS. et al.

    Big data consumer analytics and the transformation of marketing

    J. Bus. Res.

    (2016)
  • What happens in an internet second, http://www.dailymail.co.uk/sciencetech/article-3662925 (accessed:...
  • NguyenL.T. et al.

    Predicting collective sentiment dynamics from time-series social media

  • DongX. et al.

    Multiscale event detection in social media

    Data Min. Knowl. Discov.

    (2015)
  • AckoffR.L.

    From data to wisdom

    J. Appl. Syst. Anal.

    (1989)
  • ZinsC.

    Conceptual approaches for defining data, information, and knowledge

    J. Assoc. Inf. Sci. Technol.

    (2007)
  • M. Cooley, Architect or bee? Langley Technical Services Slough,...
  • SmithE.A.

    The role of tacit and explicit knowledge in the workplace

    J. Knowl. Manag.

    (2001)
  • De RougemontD. et al.

    Information is not knowledge

    Diogenes

    (1981)
There are more references available in the full text version of this article.

Cited by (0)

Hoang Long Nguyen is a Ph.D. Student in Chung-Ang University, Korea since March 2016. He received the B.S. in Department of Computer Engineering from Ho Chi Minh City University of Technology, Vietnam in April 2013. And he received M.S. degrees in Department of Computer Engineering from Yeungnam University in Korea in February 2016. His research topics include knowledge engineering on social networks by using machine learning, semantic Web mining, and ambient intelligence.

Jai E. Jung is an Associate Professor in Chung-Ang University, Korea, since September 2014. Before joining CAU, he was an Assistant Professor in Yeungnam University, Korea since 2007. Also, He was a postdoctoral researcher in INRIA Rhone-Alpes, France in 2006, and a visiting scientist in Fraunhofer Institute (FIRST) in Berlin, Germany in 2004. He received the B.Eng. in Computer Science and Mechanical Engineering from Inha University in 1999. He received M.S. and Ph.D. degrees in Computer and Information Engineering from Inha University in 2002 and 2005, respectively. His research topics are knowledge engineering on social networks by using many types of AI methodologies, e.g., data mining, machine learning, and logical reasoning. Recently, he has been working on intelligent schemes to understand various social dynamics in large scale social media (e.g., Twitter and Flickr).

View full text