Skip to main content
Log in

Multimodal social intelligence in a real-time dashboard system

  • Special Issue Paper
  • Published:
The VLDB Journal Aims and scope Submit manuscript

Abstract

Social Networks provide one of the most rapidly evolving data sets in existence today. Traditional Business Intelligence applications struggle to take advantage of such data sets in a timely manner. The BBC SoundIndex, developed by the authors and others, enabled real-time analytics of music popularity using data from a variety of Social Networks. We present this system as a grounding example of how to overcome the challenges of working with this data from social networks. We discuss a variety of technologies to implement near real-time data analytics to transform Social Intelligence into Business Intelligence and evaluate their effectiveness in the music domain. The SoundIndex project helped to highlight a number of key research areas, including named entity recognition and sentiment analysis in Informal English. It also drew attention to the importance of metadata aggregation in multimodal environments. We explored challenges such as drawing data from a wide set of sources spanning a myriad of modalities, developing adjudication techniques to harmonize inputs, and performing deep analytics on extremely challenging Informal English snippets. Ultimately, we seek to provide guidance on developing applications in a variety of domains that allow an analyst to rapidly grasp the evolution in the social landscape, and show how to validate such a system for a real-world application.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Adali, S., Hill, B., Magdon-Ismail, M.: The impact of ranker quality on rank aggregation algorithms: information robustness. In: ICDEW ’06, p. 37 (2006)

  2. Adorno, T.W.: A social critique of radio music. Kenyon Review, pp 208–217 (1945)

  3. Alba, A., Bhagwan, V., Grace, J., Gruhl, D., Haas, K., Nagarajan, M., Pieper, J., Robson, C., Sahoo, N.: Applications of voting theory to information mashups. In: ICSC, pp 10–17 (2008)

  4. Alba, A., Bhagwan, V., Grandison, T.: Accessing the deep web: when good ideas go bad. In: Companion to the 23rd ACM SIGPLAN conference on Object-oriented programming systems languages and applications, OOPSLA Companion ’08, Nashville, TN, USA. ACM, New York, pp 815–818 (2008). http://doi.acm.org/10.1145/1449814.1449871

  5. Arrow, K.J.: Social Choice and Individual Values. Yale University Press (1951, 2nd ed., 1970)

  6. Baeza-Yates R.A., Ribeiro-Neto B.: Modern Information Retrieval. Addison-Wesley Longman Publishing Co. Inc., Boston (1999)

    Google Scholar 

  7. Balinski M., Laraki R.: A theory of measuring, electing, and ranking. Proc. Natl. Acad. Sci. 104(21), 8720 (2007)

    Article  MATH  MathSciNet  Google Scholar 

  8. Balinski M., Laraki R.: A theory of measuring, electing, and ranking. PNAS 104(21), 8720–8725 (2007). doi:10.1073/pnas.0702634104

    Article  MATH  MathSciNet  Google Scholar 

  9. Bergson A.: A reformulation of certain aspects of welfare economics. Quart. J. Econ. 52(2), 310–334 (1938)

    Article  Google Scholar 

  10. Bhagwan, V., Grandison, T., Alba, A., Gruhl, D., Pieper, J.: Mongoose: Monitoring global online opinions via semantic extraction. In: SQAM workshop at IEEE 2009 International Conferenc on Cloud Computing (2009)

  11. Blosser, J., Josephsen, D.: Scalable centralized bayesian spam mitigation with bogofilter. In: USENIX conference on System Administration (2004)

  12. Bunescu, R.C., Pasca, M.: Using encyclopedic knowledge for named entity disambiguation. In: EACL, The Association for Computer Linguistics (2006)

  13. Chang, C.C., Lin, C.J.: LIBSVM: a library for support vector machines. Software available at http://www.csie.ntu.edu.tw/cjlin/libsvm (2001)

  14. Codd E., Codd S., Salley C.: Providing OLAP (on-line Analytical Processing) to User-analysts: An IT Mandate. Codd & Date Inc. (1993)

  15. Cody W.F., Kreulen J.T., Krishna V., Spangler W.S.: The integration of business intelligence and knowledge management. IBM Syst. J. 41(4), 697–713 (2002)

    Article  Google Scholar 

  16. de Borda, J.C.: Memoire sur les elections au Scrutin. Histoire de l’Acad. R. des Sci (1981)

  17. Diaconis P., Graham R.: Spearman’s footrule as a measure of disarray. J. Royal Stat. Soc. Ser. B (Methodological) 39(2), 262–268 (1977)

    MATH  MathSciNet  Google Scholar 

  18. Dwork, C., Kumar, R., Naor, M., Sivakumar, D.: Rank aggregation methods for the Web. In: Proceedings of the 10th International Conference on World Wide Web, pp 613–622 (2001)

  19. Esuli, A.: Survey of Techniques for Opinion Mining. Language and Intelligence Reading Group (2006)

  20. Esuli, A., Sebastiani, F.: Determining the semantic orientation of terms through gloss classification. In: CIKM ’05, ACM Press, pp. 617–624 (2005)

  21. Fagin, R., Kumar, R., Sivakumar, D.: Efficient similarity search and classification via rank aggregation. Proceedings of ACM SIGMOD (2003)

  22. Ferrucci, D., Lally, A.: Uima: an architectural approach to unstructured information processing in the corporate research environment. Nat Lang Eng (2004)

  23. Freitag, D.: Information extraction from html: application of a general machine learning approach. In: Conference on AI/Innovative Applications of AI (1998)

  24. Freitag, D., Kushmerick, N.: Boosted wrapper induction. In: Proceedings of the 17th National Conference on AI/Innovative Applications of AI (2000)

  25. Grace, J., Gruhl, D., Haas, K., Nagarajan, M., Robson, C., Sahoo, N.: Artist ranking through analysis of online community comments. IBM Tech Report (2008)

  26. Gruhl, D., Nagarajan, M., Pieper, J., Robson, C., Sheth, A.: Context and domain knowledge enhanced entity spotting in informal text. In: ISWC (2009)

  27. Hatzivassiloglou, V., McKeown, K.R.: Predicting the semantic orientation of adjectives. In: Association for Computational Linguistics (1997)

  28. Hassell, J, Aleman-Meza, B., Arpinar, I.: Ontology-driven automatic entity disambiguation in unstructured text. In: ISWC ’06 (2006)

  29. Joachims, T.: Text categorization with support vector machines. In: Lecture Notes in Computer Science: Machine Learning. Springer, Berlin (1998)

  30. Kamps, J., Marx, M., Mokken, R., de Rijke, M.: Using wordnet to measure semantic orientation of adjectives. http://citeseer.ist.psu.edu/kamps04using.html (2004)

  31. Koutsoukis N.S., Mitra G., Lucas C.: Adapting on-line analytical processing for decision modelling: the interaction of information and decision technologies. Decis. Support Syst. 26(1), 1–30 (1999)

    Article  Google Scholar 

  32. Kushmerick, N.: Wrapper Induction for Information Extraction. PhD thesis, U. Washington (1997)

  33. Lasswell, H.D.: Listening to popular music. The Communication of Ideas (1948)

  34. Locke L.A.: Super searches. Time Magazine, New York (2004)

    Google Scholar 

  35. Makhoul, J., Kubala, F., Schwartz, R., Weischedel, R.: Performance measures for information extraction. Proceedings of DARPA Broadcast News Workshop (1999)

  36. Mason, J.: Filtering spam with spamassassin. In: Proceedings of HEANet Annual Conference (2002)

  37. Mayzlin, D., Chevalier, J.A.: The effect of word of mouth on sales: Online book reviews. Yale School of Management Working Papers (2003)

  38. McIntyre, M.: Hubbard hot-author status called illusion. http://www.scientology-lies.com/press/san-diego-union/1990-04-15/hubbard-hot-author-status-illusion.html (1990)

  39. Mediamark: Teen market profile. http://www.magazine.org/content/files/teenprofile04.pdf (2004)

  40. Muller, C., Gurevych, I.: Using wikipedia and wiktionary in domain-specific information retrieval. In: Working Notes for the CLEF 2008 Workshop (2008)

  41. Nadeau D., Sekine S.: A survey of named entity recognition and classification. Linguisticae Investigationes, (2007)

  42. Riesman D.: Listening to popular music. American Quarterly 2(4), 359–371 (1950)

    Article  Google Scholar 

  43. Riker W.H.: Liberalism Against Populism. Waveland Press Inc, Prospect Heights, (1982)

  44. Saari D.G.: Geometry of Voting, vol 3 of Studies in Economic Theory. Springer-Verlag, (1994)

  45. Sheth, A.P.: Changing focus on interoperability in information systems: From system, syntax, structure to semantics. Interoperating Geographic Info Sys (1999)

  46. Soderland, S.: Learning to extract text-based information from the world wide web. KDD ’97 (1997)

  47. Surowiecki J.: The wisdom of crowds. Doubleday, (2004)

  48. Thomason, A.: Blog spam: A review. In: Fourth Conference on Email and Anti-Spam CEAS 2007 (2007)

  49. Tufte E.: Beautiful Evidence. Graphics Press, (2006)

  50. Turney P.D., Littman M.L.: Measuring praise and criticism: Inference of semantic orientation from association. ACM Trans Inf Syst 21(4), 315–346 (2003)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Christine Robson.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Gruhl, D., Nagarajan, M., Pieper, J. et al. Multimodal social intelligence in a real-time dashboard system. The VLDB Journal 19, 825–848 (2010). https://doi.org/10.1007/s00778-010-0207-5

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00778-010-0207-5

Keywords

Navigation