Improving Personal Health Mention Detection on Twitter Using Permutation Based Word Representation Learning

  • Conference paper

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 12532)

Abstract

As social media increasingly substitutes for in-person social interaction, the amount of medical and clinical information on the web is growing. Monitoring personal health mentions (PHM) on social media is an active area of research that aims to predict whether a given piece of text reports a health condition. The main idea is to examine how disease or symptom words are used in the text. However, because such words are often used figuratively, they do not always indicate the presence of a health condition. Prior work attempts to address this by combining contextual word representations with sentiment information, but these methods are unable to capture the complete context in which a symptom word is used. In this work, we incorporate a permutation-based contextual word representation for the task of health mention detection, which captures the context of disease words in the given text efficiently and hence improves the performance of the classifier. To evaluate the proposed method, we perform experiments on a public benchmark dataset; the results show an improvement of 5.5% in F-score over the state-of-the-art health mention detection classifier. (Code is available at https://github.com/pervaizniazi/Figurative-Mention).
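
The abstract does not spell out how a permutation-based representation sees context. The following is a minimal sketch, assuming the representation follows the permutation language modeling idea used by XLNet: a factorization order over token positions is sampled, and each token is predicted from the tokens that precede it in that order rather than only from its left neighbours. The example sentence, the function name `permutation_contexts`, and the seed are hypothetical and are not taken from the paper or its repository.

```python
import random


def permutation_contexts(tokens, seed=0):
    """Sample one factorization order and return, for every position,
    the positions that are visible when that token is predicted."""
    rng = random.Random(seed)
    order = list(range(len(tokens)))
    rng.shuffle(order)  # one sampled factorization order over positions

    contexts = {}
    seen = []
    for pos in order:
        # The token at `pos` may condition on every token that comes
        # earlier in the sampled order, wherever it sits in the sentence.
        contexts[pos] = sorted(seen)
        seen.append(pos)
    return order, contexts


# Hypothetical tweet with a figurative use of a symptom word.
tokens = ["this", "traffic", "gives", "me", "a", "headache"]
order, contexts = permutation_contexts(tokens)

for pos in order:
    visible = [tokens[i] for i in contexts[pos]]
    print(f"predict {tokens[pos]!r} from {visible}")
```

Averaged over many sampled orders, a symptom word such as "headache" is predicted from words on both sides of it, which is the sense in which such a representation can use the full surrounding context.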

Notes

  1. This is an original tweet taken from Twitter.

Acknowledgement

The authors would like to thank Shoaib Ahmed Siddiqui and Muhammad Nabeel Asim for providing useful feedback during this work.

Author information

Corresponding author

Correspondence to Pervaiz Iqbal Khan.

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Cite this paper

Khan, P.I., Razzak, I., Dengel, A., Ahmed, S. (2020). Improving Personal Health Mention Detection on Twitter Using Permutation Based Word Representation Learning. In: Yang, H., Pasupa, K., Leung, A.CS., Kwok, J.T., Chan, J.H., King, I. (eds) Neural Information Processing. ICONIP 2020. Lecture Notes in Computer Science, vol 12532. Springer, Cham. https://doi.org/10.1007/978-3-030-63830-6_65

  • DOI: https://doi.org/10.1007/978-3-030-63830-6_65

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-63829-0

  • Online ISBN: 978-3-030-63830-6

  • eBook Packages: Computer Science, Computer Science (R0)