Skip to main content
Log in

Data warehouse design approaches from social media: review and comparison

  • Review Article
  • Published:
Social Network Analysis and Mining Aims and scope Submit manuscript

Abstract

With the rise of social media in our life, several decision makers have worked on these networks to make better decisions. In order to benefit from the data issued from these media, many researchers focused on helping companies understand how to perform a social media competitive analysis and transform these data into knowledge for decision makers. A high number of users interact at any time on different ways in social media such as by expressing their opinions about products, services or transaction related to the organization which can prove very helpful for making better projections. In this paper, we provide a literature review on data warehouse design approaches from social media. More precisely, we start by introducing the main concepts of data warehouse and social media. We also propose two classes of data warehouse design approaches from social media (behavior analysis and integration of sentiment analysis in data warehouse schema) and expose for each one the most representative existing works. Afterward, we propose a comparative study of the existing works.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2

Similar content being viewed by others

Notes

  1. http://www.statista.com/statistics/272014/global-social-networks-ranked-by-number-of-users.

  2. http://blog.kinoa.com/2013/08/05/reseaux-sociaux-et-medias-sociaux-quelle-difference/#comments.

    http://www.reseaux-professionnels.fr/comprendre-ce-sont-les-medias-sociaux/.

  3. http://www.alchemyapi.com.

  4. http://www.opencalais.com.

References

  • Abello A, Samios J, Saltor F (2001) Understandding analysis dimensions in a multidimensional object-oriented model. In: Proceedings of the international workshop on design and management of data warehouses (DMDW), Interlaken, Switzerland, pp 4-1–4-9

  • Aha D, Kibler D, Albert M (1991) Instance-based learning algorithms. J. Mach Learn 6(1):37–66

    Google Scholar 

  • Akar E, Topçu B (2011) An examination of the factors influencing consumers’ attitudes toward social media marketing. J Internet Commer 10(1):35–67

    Article  Google Scholar 

  • Bringay S, Béchet N, Bouillot F, Poncelet P, Roche M, Teisseire M (2011) Towards an on-line analysis of tweets processing. Lecture Notes in Computer Science (including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 6861 LNCS, pp 154–161

  • Costa PR, Souza FF, Times VC, Benevenuto F (2012) Towards integrating online social networks and business intelligence. In: IADIS international conference on web based communities and social media

  • Cuzzocrea A, De Maio C, Fenza G (2015) Towards OLAP analysis of multidimensional tweet streams, 69–73. OLAP analysis of multidimensional tweet streams for supporting advanced analytics. Sac 2016, pp 992–999. doi:10.1145/2811222.2811233

  • Cuzzocrea A, De Maio C, Fenza G, Loia V, Parente M (2016). OLAP analysis of multidimensional tweet streams for supporting advanced analytics. In: Proceedings of the 31st annual ACM symposium on applied computing. ACM, pp. 992–999

  • Eguchi K, Lavrenko V (2006) Sentiment retrieval using generative models. In: Proceedings of the 2006 conference on empirical methods in natural language processing, pp 345–354

  • Fan R-E, Chen P-H, Lin CJ (2005) Working set selection using the second order information for training SVM. J Mach Learn Res 6:1889–1918

    MathSciNet  MATH  Google Scholar 

  • Gallinucci E, Golfarelli M, Rizzi S (2013) Meta-stars: multidimensional modeling for social business intelligence. In: Proceedings of the sixteenth international workshop on Data warehousing and OLAP. ACM, pp 11–18

  • Gallinucci E, Golfarelli M, Rizzi S (2015) Advanced topic modeling for social business intelligence. Inf Syst 53:87–106

    Article  Google Scholar 

  • Ganter B, Wille R (1997) Formal concept analysis: mathematical foundations, vol 1. Springer, New York

    MATH  Google Scholar 

  • Go A, Bhayani R, Huang L (2009) Twitter Sentiment classification using distant supervision. Processing, CS224N Project Report, Stanford, vol 1, p 12

  • Golfarelli M, Maio D, Rizzi S (1998) The dimensional fact model: a conceptual model for data warehouses. Int J Coop Inf Syst 7:215–247

    Article  Google Scholar 

  • Grabs T, Schek H.-J. (2002) ETH Zürich at INEX: Flexible Information Retrieval from XML with PowerDB-XML. Xml With Powerdb-Xml. Inex Workshop, 2002, pp 141–148

  • Inmon WH (1996) Building the data warehouse. Wiley, New York

    Google Scholar 

  • Kaplan AM, Haenlein M (2010) Users of the world, unite! The challenges and opportunities of social media. Bus Horiz 53(1):61

    Article  Google Scholar 

  • Kimball R (1996) The data warehouse toolkit practical techniques for building dimensional data warehouses. Wiley, New York

    Google Scholar 

  • Kraiem MB, Feki J, Khrouf K, Ravat F, Teste O (2015) Modeling and OLAPing social media: the case of Twitter. J Social Netw Anal Inf Syst 5(1):47

    Article  Google Scholar 

  • Liu X, Tang K, Hancock J, Han J, Song M, Xu R, Pokorny B (2012). SocialCube: A text cube framework for analyzing social media data. In: International conference on IEEE social informatics (SocialInformatics), pp 252–259

  • Liu X, Tang K, Hancock J, Han J, Song M, Xu R, Pokorny B (2013) A text cube approach to human, social and cultural behavior in the Twitter stream. Lecture Notes in Computer Science (including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 7812 LNCS, pp 321–330

  • Lujan-Mora S, Trujillo J, Song I (2002) Multidimensional modeling with UML Package diagrams, In: 21st international conference on conceptual modeling (ER2002), 2002

  • Malinowski E, Zimányi E (2005) Spatial hierarchies and topological relationships in the spatial MultiDimER model. In: British national conference on databases. Springer, Berlin, pp. 17–28

  • Mansmann S (2008) Extending the OLAP technology to handle non-conventional and complex data. Ph.D. dissertation, University of Konstanz, Department of Computer and Information Science Germany

  • Moalla I, Nabli A (2014) Towards data mart building from social network for opinion analysis. In: International conference on intelligent data engineering and automated learning IDEAL14, pp 295–302

  • Moya LG, Kudama S, Cabo MJA, Llavori RB (2011) Integrating web feed opinions into a corporate data warehouse. In: Proceedings of the 2nd international workshop on business intelligence and the WEB-BEWEB’11, p 20

  • Nabli, A, Jamel F, Faiez G (2005) Automatic construction of multidimensional schema from olap requirements. In: 3rd ACS/IEEE international conference on computer systems and applications, 2005 2005 (Figure 1), pp 97–103. doi:10.1109/AICCSA.2005.1387025

  • Phipps C, Davis K (2002). Automating data warehouse conceptual schema design and evaluation, DMDW'02

  • Ravat F, Teste O, Zurfluh G (2001) Modélisation multidimensionnelle des systèmes décisionnels. In EGC, pp 201–212

  • Rehman NU (2015) Extending the OLAP technology for social media analysis (Doctoral dissertation)

  • Rehman NU, Mansmann S, Weiler A, Scholl MH (2012) Building a data warehouse for twitter stream exploration. In: Proceedings of the 2012 IEEE/ACM international conference on advances in social networks analysis and mining, ASONAM, pp 1341–1348

  • Rehman NU, Weiler A, Scholl M (2013) OLAPing social media: the case of Twitter. In: International conference on advances in social networks analysis and mining, ASONAM2013, pp 1139–1146

  • Rizzi S (2007) Conceptual modeling solutions for the data warehouse. Data Warehouses and OLAP: Concepts, Architectures and Solutions, pp 1–26

  • Shankar V, Hollinger M (2007) Online and mobile advertising: current scenario, emerging trends, and future directions. Market Sci Inst 31(3):206–207

    Google Scholar 

  • Stone PJ, Dunphy DC, Smith MS, Ogilvie DM (1966) The general inquirer: a computer approach to content analysis. MIT Press, Cambridge

    Google Scholar 

  • Trujillo J, Palomar M (1998) An object oriented approach to multidimensional database conceptual modeling (OOMD). In: Proceeding 1st international workshop on data warehousing and OLAP (DOLAP98)

  • Tryfona N, Busborg F, Borch Christiansen JG (1999) starER: a conceptual model for data warehouse design. In: Proceedings of the 2nd ACM international workshop on Data warehousing and OLAP. ACM. pp 3–8

  • Walha A, Ghozzi F, Gargouri F (2015) ETL design toward social network opinion analysis. In: Computer and information science 2015. Springer International Publishing, p 235–249

  • Witten IH, Frank E (2005) Data mining: practical machine learning tools and techniques, The Morgan Kaufmann Series in Data Management Systems

  • Zhang L, Bing L, Lim SH, O’Brien-Strain E (2010) Extracting and ranking product features in opinion documents. In: Proceedings of the 23rd international conference on computational linguistics: association for computational linguistics. Posters, pp 1462–1470

  • Zhao P, Li X, Xin D, Han J (2011) Graph cube: on warehousing and OLAP multidimensional networks. In: Proceedings of the 2011 ACM SIGMOD international conference on management of data. ACM, pp 853–864

Download references

Acknowledgements

This work is dedicated to the soul of the author of this paper Dr. Lotfi Bouzguenda who passed away a few months ago. We are very grateful for his help, his advice and his prestigious remarks. May God have his soul in blessed mercy.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Imen Moalla.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Moalla, I., Nabli, A., Bouzguenda, L. et al. Data warehouse design approaches from social media: review and comparison. Soc. Netw. Anal. Min. 7, 5 (2017). https://doi.org/10.1007/s13278-017-0423-8

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s13278-017-0423-8

Keywords

Navigation