Opinion Mining on Small and Noisy Samples of Health-Related Texts

Akhtyamova, Liliya; Alexandrov, Mikhail; Cardiff, John; Koshulko, Oleksiy

doi:10.1007/978-3-030-01069-0_27

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 871)

Included in the following conference series:

Conference on Computer Science and Information Technologies

Abstract

People's health has always attracted the attention of public and private organizations, of patients themselves and, consequently, of researchers. Social networks provide an immense amount of data for the analysis of health-related issues; however, researchers do not always have enough data to build sophisticated models. In this paper, we create this limitation artificially in order to test the performance and stability of several popular algorithms on small samples of texts. Apart from the sample size, the study has two specific features: (a) instead of the usual 5-star classification, we use combined classes that reflect a more practical view of medicines and treatments; (b) we consider both original and noisy data. The experiments were carried out on data extracted from the popular forum AskaPatient. Parameters were tuned with the GridSearchCV technique. The results show that, on small and noisy data samples, GMDH Shell is superior to the other methods. The work has a practical orientation.
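
As an illustration of the two data-preparation ideas named in the abstract (combined classes instead of 5-star ratings, and a noisy copy of the data), a minimal Python sketch follows. The abstract does not give the actual grouping of stars or the noise model, so the mapping `STAR_TO_CLASS`, the label-flipping noise in `add_label_noise`, the noise rate and the toy ratings are all assumptions made for illustration only.

```python
import random
from collections import Counter

# Hypothetical grouping of 5-star ratings into combined classes;
# the grouping actually used in the paper is not stated in the abstract.
STAR_TO_CLASS = {1: "negative", 2: "negative", 3: "neutral",
                 4: "positive", 5: "positive"}


def combine_classes(stars):
    """Map each 1-5 star rating to its combined class label."""
    return [STAR_TO_CLASS[s] for s in stars]


def add_label_noise(labels, rate, seed=0):
    """Return a copy of labels in which round(rate * n) randomly chosen
    entries are replaced by a different class (an assumed noise model)."""
    rng = random.Random(seed)
    classes = sorted(set(labels))
    noisy = list(labels)
    if len(classes) < 2:
        return noisy  # nothing to flip to
    n_flip = round(rate * len(labels))
    for i in rng.sample(range(len(labels)), n_flip):
        alternatives = [c for c in classes if c != labels[i]]
        noisy[i] = rng.choice(alternatives)
    return noisy


# Toy ratings standing in for a small sample of forum reviews.
stars = [1, 2, 3, 4, 5, 5, 4, 1, 3, 2]
clean = combine_classes(stars)
noisy = add_label_noise(clean, rate=0.2)
changed = sum(a != b for a, b in zip(clean, noisy))
print(Counter(clean))
print(f"{changed} of {len(clean)} labels changed")
```

A classifier could then be trained once on `clean` and once on `noisy` to compare how much its quality degrades, which is the kind of stability comparison the abstract describes.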

Author information

Authors and Affiliations

Institute of Technology Tallaght, Dublin, Ireland
Liliya Akhtyamova & John Cardiff
Autonomous University of Barcelona, Barcelona, Spain
Mikhail Alexandrov
Russian Presidential Academy of National Economy and Public Administration, Moscow, Russia
Mikhail Alexandrov
Glushkov Institute of Cybernetics, Kyiv, Ukraine
Oleksiy Koshulko

Corresponding author

Correspondence to Liliya Akhtyamova.

Editor information

Editors and Affiliations

Lviv Polytechnic National University, Lviv, Ukraine
Natalia Shakhovska
Institute of Computer Science and Information Technologies, Lviv Polytechnic National University, Lviv, Ukraine
Mykola O. Medykovskyy

About this paper

Cite this paper

Akhtyamova, L., Alexandrov, M., Cardiff, J., Koshulko, O. (2019). Opinion Mining on Small and Noisy Samples of Health-Related Texts. In: Shakhovska, N., Medykovskyy, M. (eds) Advances in Intelligent Systems and Computing III. CSIT 2018. Advances in Intelligent Systems and Computing, vol 871. Springer, Cham. https://doi.org/10.1007/978-3-030-01069-0_27

DOI: https://doi.org/10.1007/978-3-030-01069-0_27
Published: 20 November 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-01068-3
Online ISBN: 978-3-030-01069-0
eBook Packages: Intelligent Technologies and Robotics (R0)
