Improving Personal Health Mention Detection on Twitter Using Permutation Based Word Representation Learning

Khan, Pervaiz Iqbal; Razzak, Imran; Dengel, Andreas; Ahmed, Sheraz

doi:10.1007/978-3-030-63830-6_65

Improving Personal Health Mention Detection on Twitter Using Permutation Based Word Representation Learning

Conference paper
First Online: 19 November 2020

2252 Accesses
4 Citations

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 12532))

Abstract

Social media has become a substitute for social interaction, thus the amount of medical and clinical-related information on the web is increasing. Monitoring of Personal Health Mentioning (PHM) on social media is an active area of research that predicts whether a given piece of text contains a health condition or not. To this end, the main idea is to consider the usage of disease or symptom words in the text. However, due to their usage in a figurative sense, disease or symptom words may not always indicate the presence of the health condition. Prior work attempts to address this by considering contextual word representations along with the utilization of the sentiment information. However, these methods are unable to capture the complete context in which symptom word is used. In this work, we incorporate permutation-based contextual word representation for the task of health mention detection which captures the context of disease words efficiently, in the given piece of text, and hence improves the performance of the classifier. To evaluate the integrity of the proposed method, we perform experimentation on the public benchmark dataset that shows an improvement of 5.5% in F-score in comparison to the state of the art health mention detection classifier. (Code is available at https://github.com/pervaizniazi/Figurative-Mention).

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

1.
This is an original tweet taken from Twitter.

References

WHO. Epidemic intelligence - systematic event detection (2017)
Google Scholar
Biddle, R., Joshi, A., Liu, S., Paris, C., Guandong, X.: Leveraging sentiment distributions to distinguish figurative from literal health reports on Twitter. In: Proceedings of The Web Conference 2020, pp. 1217–1227 (2020)
Google Scholar
Yang, Z., Dai, Z., Yang, Y., Carbonell, J., Salakhutdinov, R.R., Le, Q.V.: XLNet: generalized autoregressive pretraining for language understanding. In: Advances in Neural Information Processing Systems, pp. 5754–5764 (2019)
Google Scholar
Dai, A.M., Le, Q.V.: Semi-supervised sequence learning. In: Advances in Neural Information Processing Systems, pp. 3079–3087 (2015)
Google Scholar
Devlin, J., Chang, M.-W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018)
Saeed, Z., Ayaz Abbasi, R., Razzak, I.: EveSense: what can you sense from Twitter? In: Jose, J.M., et al. (eds.) ECIR 2020. LNCS, vol. 12036, pp. 491–495. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-45442-5_64
Chapter Google Scholar
McCann, B., Bradbury, J., Xiong, C., Socher, R.: Learned in translation: contextualized word vectors. In: Advances in Neural Information Processing Systems, pp. 6294–6305 (2017)
Google Scholar
Peters, M.E., et al.: Deep contextualized word representations. arXiv preprint arXiv:1802.05365 (2018)
Saeed, Z., et al.: What’s happening around the world? A survey and framework on event detection techniques on twitter. J. Grid Comput. 17(2), 279–312 (2019)
Article MathSciNet Google Scholar
Radford, A., Narasimhan, K., Salimans, T., Sutskever, I.: Improving language understanding by generative pre-training (2018)
Google Scholar
Saeed, Z., Abbasi, R.A., Razzak, I., Maqbool, O., Sadaf, A., Xu, G.: Enhanced heartbeat graph for emerging event detection on twitter using time series networks. Expert Syst. Appl. 136, 115–132 (2019)
Article Google Scholar
Jiang, K., Feng, S., Song, Q., Calix, R.A., Gupta, M., Bernard, G.R.: Identifying tweets of personal health experience through word embedding and LSTM neural network. BMC Bioinf. 19(8), 210 (2018)
Article Google Scholar
Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)
Article Google Scholar
Karisani, P., Agichtein, E.: Did you really just have a heart attack? Towards robust detection of personal health mentions in social media. In: Proceedings of the 2018 World Wide Web Conference, pp. 137–146 (2018)
Google Scholar
Iyer, A., Joshi, A., Karimi, S., Sparks, R., Paris, C.: Figurative usage detection of symptom words to improve personal health mention detection. arXiv preprint arXiv:1906.05466 (2019)
Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S., Dean, J.: Distributed representations of words and phrases and their compositionality. In: Advances in Neural Information Processing Systems, pp. 3111–3119 (2013)
Google Scholar
Baccianella, S., Esuli, A., Sebastiani, F.: SentiWordNet 3.0: an enhanced lexical resource for sentiment analysis and opinion mining. In: LREC, vol. 10, pp. 2200–2204 (2010)
Google Scholar
Mohammad, S.: Obtaining reliable human ratings of valence, arousal, and dominance for 20,000 English words. In: Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 174–184 (2018)
Google Scholar
Howard, J., Ruder, S.: Universal language model fine-tuning for text classification. arXiv preprint arXiv:1801.06146 (2018)
Graves, A., Mohamed, A., Hinton, G.: Speech recognition with deep recurrent neural networks. In: 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 6645–6649. IEEE (2013)
Google Scholar
Zhu, Y., et al.: Aligning books and movies: towards story-like visual explanations by watching movies and reading books. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 19–27 (2015)
Google Scholar
Parker, R., Graff, D., Kong, J., Chen, K., Maeda, K.: English gigaword fifth edition LDC2011T07 (technical report). Technical report. Linguistic Data Consortium, Philadelphia (2011)
Google Scholar
Callan, J.: The lemur project and its ClueWeb12 dataset. In: Invited Talk at the SIGIR 2012 Workshop on Open-Source Information Retrieval (2012)
Google Scholar
Common Crawl. Common crawl corpus (2019). http://commoncrawl.org

Download references

Acknowledgement

The authors would like to thank Shoaib Ahmed Siddiqui and Muhammad Nabeel Asim for providing useful feedback during this work.

Author information

Authors and Affiliations

German Research Center for Artificial Intelligence (DFKI), Kaiserslautern, Germany
Pervaiz Iqbal Khan, Andreas Dengel & Sheraz Ahmed
TU Kaiserslautern, Kaiserslautern, Germany
Pervaiz Iqbal Khan & Andreas Dengel
Deakin University, Geelong, Australia
Imran Razzak

Authors

Pervaiz Iqbal Khan
View author publications
You can also search for this author in PubMed Google Scholar
Imran Razzak
View author publications
You can also search for this author in PubMed Google Scholar
Andreas Dengel
View author publications
You can also search for this author in PubMed Google Scholar
Sheraz Ahmed
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Pervaiz Iqbal Khan .

Editor information

Editors and Affiliations

Department of AI, Ping An Life, Shenzhen, China
Haiqin Yang
Faculty of Information Technology, King Mongkut's Institute of Technology Ladkrabang, Bangkok, Thailand
Kitsuchart Pasupa
City University of Hong Kong, Kowloon, Hong Kong
Andrew Chi-Sing Leung
Department of Computer Science and Engineering, Hong Kong University of Science and Technology, Hong Kong, Hong Kong
James T. Kwok
School of Information Technology, King Mongkut’s University of Technology Thonburi, Bangkok, Thailand
Jonathan H. Chan
The Chinese University of Hong Kong, New Territories, Hong Kong
Irwin King

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Khan, P.I., Razzak, I., Dengel, A., Ahmed, S. (2020). Improving Personal Health Mention Detection on Twitter Using Permutation Based Word Representation Learning. In: Yang, H., Pasupa, K., Leung, A.CS., Kwok, J.T., Chan, J.H., King, I. (eds) Neural Information Processing. ICONIP 2020. Lecture Notes in Computer Science(), vol 12532. Springer, Cham. https://doi.org/10.1007/978-3-030-63830-6_65

Download citation

DOI: https://doi.org/10.1007/978-3-030-63830-6_65
Published: 19 November 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-63829-0
Online ISBN: 978-3-030-63830-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics