Skip to main content

Detect and Analyze Flu Outlier Events via Social Network

  • Conference paper
Web Technologies and Applications (APWeb 2014)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 8710))

Included in the following conference series:

Abstract

The popularity of social networks provides a new way for constant surveillance of unusual events related to a certain disease. Some researchers have begun to use twitter to estimate the situation of public health, as well as predict disease trends. However, previous studies usually focused on the infection data but not the data judged as non-infection, which was usually filtered directly in their studies. We believe that the non-infection data is also essential for monitoring disease activity, because of their inherently subtle connections. Firstly, we construct a time series outlier model that can detect flu outlier events of different region in China with high precision and good recall by mining all the flu related data. Secondly, those outlier events are used to find out hot topics by SN-TDT and use the twice iteration classification method which is designed to analyze users’ status who published a flu-related weibo. These results could provide science reference for deploying sickness prevention resources, and make recommendation about which place pose a high risk of getting infected.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Signorini, A., Segre, A.M., Polgreen, P.M.: The Use of Twitter to Track Levels of Disease Activity and Public Concern in the U.S. during the Influenza A H1N1 Pandemic. PLoS One 6(5), e19467 (1946), doi:10.1371/journal.pone.0019467

    Google Scholar 

  2. Achrekar, H., Gandhe, A., et al.: Predicting Flu Trends using Twitter Data. In: The First International Workshop on Cyber-Physical Networking Systems, pp. 702–707 (2011)

    Google Scholar 

  3. Sadilek, A., Kautz, H.: Vincent Silenzio: Predicting Disease Transmission from Geo-Tagged Micro-Blog Data. In: Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, pp. 136–142 (2012)

    Google Scholar 

  4. Prieto, V.M., Matos, S., Alvarez, M., et al.: Twitter: A Good Place to Detect Health Conditions. PLoS One 9(1), e86191 (2014), doi:10.1371/journal.pone.0086191

    Google Scholar 

  5. Sakaki, T., Okazaki, M., Matsuo, Y.: Earthquake Shakes Twitter Users: Real-time Event Detection by Social Sensors. In: 19th International Conference on World Wide Web, pp. 851–860 (2010)

    Google Scholar 

  6. Breunig, M.M., Kriegel, H.-P., et al.: LOF: Identifying Density-Based Local Outliers. In: Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data (2000)

    Google Scholar 

  7. Ferdousi, Z., Maeda, A.: Unsupervised Outlier Detection in Time Series Data. In: The 22nd International Conference on Data Engineering Workshops, 0-7695-2571-7 (2006)

    Google Scholar 

  8. Lee, C., Lee, G.G.: Information gain and divergence-based feature selection for machine learning-based text categorization. Information Processing & Management 42(1), 155–165 (2006)

    Article  Google Scholar 

  9. Chen, Y.T., Chen, M.C.: Using chi-square statistics to measure similarities for text categorization. Expert Systems with Applications 38(4), 3085–3090 (2010)

    Article  Google Scholar 

  10. Azam, N., Yao, J.: Comparison of term frequency and document frequency based feature selection metrics in text categorization. Expert Systems with Applications 39(5), 4760–4768 (2012)

    Article  Google Scholar 

  11. Joachims, T.: A Probabilistic Analysis of the Rocchio Algorithm with TFIDF for Text Categorization, Carnegie-Mellon Univ. Pittsburgh Pa Dept of Computer Science. No. CMU-CS 96-118 (1996)

    Google Scholar 

  12. Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.H.: The WEKA data mining software: an update. ACM SIGKDD Explorations Newsletter 11(1), 10–18 (2009)

    Article  Google Scholar 

  13. Joachims, T.: Learning to classify text using support vector machines: Methods, theory and algorithms, p. 205. Kluwer Academic Publishers (2005)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Fu, Q., Hu, C., Xu, W., He, X., Zhang, T. (2014). Detect and Analyze Flu Outlier Events via Social Network. In: Han, W., Huang, Z., Hu, C., Zhang, H., Guo, L. (eds) Web Technologies and Applications. APWeb 2014. Lecture Notes in Computer Science, vol 8710. Springer, Cham. https://doi.org/10.1007/978-3-319-11119-3_13

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-11119-3_13

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-11118-6

  • Online ISBN: 978-3-319-11119-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics