Abstract
Online social networks (OSNs) have become popular platforms for people to interact with each other in the cyber space. Users use OSNs to talk about their daily activities, mood, health status, sports events, travel experiences, political campaigns, entertainment events, and commercial products, among other things. Conversations between users on an OSN site could reflect the current social trends that are of great interest and importance for individuals, businesses, and government agencies alike. In this paper we design and develop a comprehensive system to collect, store, query, and analyze OSN data for effective discovery of online social trends. Our system consists of three parts: (1) an OSN data collection engine; (2) a spatio-temporal database for storing, indexing, and querying data; and (3) a set of analytical tools for online social trend discovery. We demonstrate the effectiveness of our system using a recent result of predicting seasonal flu trends using Twitter data.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Achrekar, H., Gandhe, A., Lazarus, R., Yu, S.H., Liu, B.: Predicting flu trends using twitter data. In: International Workshop on Cyber-Physical Networking Systems (CPNS) in conjunction with IEEE Infocom (2011)
Bayer, R., McCreight, E.M.: Organization and maintenance of large ordered indices. Acta Inf. 1, 173–189 (1972)
Beckmann, N., Kriegel, H.P., Schneider, R., Seeger, B.: The R*-tree: An efficient and robust access method for points and rectangles. In: Garcia-Molina, H., Jagadish, H.V. (eds.) Proceedings of the 1990 ACM SIGMOD International Conference on Management of Data, Atlantic City, NJ, May 23-25, pp. 322–331. ACM Press, New York (1990)
Berchtold, S., Böhm, C., Kriegel, H.P.: The pyramid-technique: Towards breaking the curse of dimensionality. In: SIGMOD Conference. pp. 142–153 (1998)
Berchtold, S., Keim, D.A., Kriegel, H.P.: The x-tree: An index structure for high-dimensional data. In: VLDB. pp. 28–39 (1996)
Chai, J., Pan, S., Zho, M.: MIND: A Context-based Multimodal Interpretation Framework, Conversational Systems, Natural, Intelligent and Effective Interaction in Multimodal Dialogue Systems. Kluwer Academic Publishers, Dordrecht (2005)
Chen, C.X., Wang, H., Zaniolo, C.: Toward extensible spatio-temporal databases: an approach based on user-defined aggregates. In: de Caluwe, R., de Tré, G., Bordogna, G. (eds.) Spatio-Temporal Databases, Flexible Querying and Reasoning, pp. 55–74. Springer, Heidelberg (2004)
Chen, C.X., Zaniolo, C.: SQL T: A spatio-temporal data model and query language. In: Laender, A.H.F., Liddle, S.W., Storey, V.C. (eds.) ER 2000. LNCS, vol. 1920, pp. 96–111. Springer, Heidelberg (2000)
Eisenberg, A., Melton, J., Kulkarni, K.G., Michels, J.E., Zemke, F.: SQL:2003 has been published. SIGMOD Record 33(1), 119–126 (2004)
Espino, J., Hogan, W., Wagner, M.: Telephone triage: A timely data source for surveillance of influenza-like diseases. In: AMIA: Annual Symposium Proceedings (2003)
Ferguson, N.M., Cummings, D.A., Cauchemez, S., Fraser, C., Riley, S., Meeyai, A., Iamsirithaworn, S., Burke, D.S.: Strategies for containing an emerging influenza pandemic in southeast asia. Nature 437 (2005)
Gauvin, W., Ribeiro, B., Towsley, D., Liu, B., Wang, J.: Measurement and gender-specific analysis of user publishing characteristics on myspace. IEEE Networks (September 2010)
Ginsberg, J., Mohebbi, M.H., Patel, R.S., Brammer, L., Smolinski, M.S., Brilliant, L.: Detecting influenza epidemics using search engine query data. Nature 457 (2009)
Goodwin, G.C., Sin, K.S.: Adaptive Filtering Prediction and Control. Prentice-Hall, Inc., Englewood Cliffs (1984)
Guttman, A.: R-trees: A dynamic index structure for spatial searching. In: SIGMOD Conference, pp. 47–57 (1984)
Jordans, F.: WHO working on formulas to model swine flu spread (2009)
Koudas, N.: Stream data management: Research directions and opportunities. In: IDEAS (2002)
Lazarus, R., Kleinman, K., Dashevsky, I., Adams, C., Kludt, P., DeMaria, Jr., A. R.: Platt: Use of automated ambulatory-care encounter records for detection of acute illness clusters, including potential bioterrorism events (2002)
Ljung, L.: System Identification: Theory for the User. Prentice-Hall, Inc., Upper Saddle River (1999)
Longini, I., Nizam, A., Xu, S., Ungchusak, K., Hanshaoworakul, W., Cummings, D., Halloran, M.: Containing pandemic influenza at the source. Science 309(5737) (2005)
Qiao, L., Agrawal, D., Abbadi, A.E.: Supporting sliding window queries for continuous data streams. In: SSDBM (2003)
Magruder. S.: Evaluation of over-the-counter pharmaceutical sales as a possible early warning indicator of human disease. Johns Hopkins University APL Technical Digest (2003)
Sellis, T.K., Roussopoulos, N., Faloutsos, C.: The R+-tree: A dynamic index for multi-dimensional objects. In: Stocker, P.M., Kent, W., Hammersley, P. (eds.) Proceedings of 13th International Conference on Very Large Data Bases, VLDB 1987, Brighton, England, September 1-4, pp. 507–518. Morgan Kaufmann, San Francisco (1987)
Wang, J., Fang, Z., Chen, C.X.: The pl-tree: A fast high-dimensional access method for range queries. Technical Report, Department of Computer Science, University of Massachusetts Lowell (2009)
Yu, C., Ooi, B.C., Tan, K.L., Jagadish, H.V.: Indexing the distance: An efficient method to knn processing. In: VLDB. pp. 421–430 (2001)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Achrekar, H., Fang, Z., Li, Y., Chen, C., Liu, B., Wang, J. (2011). A Spatio-Temporal Approach to the Discovery of Online Social Trends. In: Wang, W., Zhu, X., Du, DZ. (eds) Combinatorial Optimization and Applications. COCOA 2011. Lecture Notes in Computer Science, vol 6831. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-22616-8_40
Download citation
DOI: https://doi.org/10.1007/978-3-642-22616-8_40
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-22615-1
Online ISBN: 978-3-642-22616-8
eBook Packages: Computer ScienceComputer Science (R0)