Skip to main content

Advertisement

Log in

A Robust User Sentiment Biterm Topic Mixture Model Based on User Aggregation Strategy to Avoid Data Sparsity for Short Text

  • Mobile & Wireless Health
  • Published:
Journal of Medical Systems Aims and scope Submit manuscript

Abstract

Sentiment analysis is a process of computationally finding the opinions that are expressed in a short text or a feedback by a writer towards a particular topic, product, service. The short piece of review from the user can help a business determine or understand the attitude of the user thereby predict the customer’s behaviour and itsubstantiallyimproves the quality of service parameters. The proposed Robust User Sentiment Biterm Topic Mixture (RUSBTM)model discovers the user preference and their sentiment orientation views for effective Topic Modelling using Biterms or word-pair from the short text of a particular venue. Since short review or text suffers from data sparse, the user aggregation strategy is adapted to form a pseudo document and the word pairset is created for the whole corpus. The RUSBTM learns topics by generating the word co-occurrence patterns thereby inferring topics with rich corpus-level information. By analysing the sentiments of the paired words and their corresponding topics in the review corpus of the particular venue, prediction can be done that exactly portrays the user interest, preference and expectation from a particular venue. The RUSBTM model proved to be more robust and also, the extracted topics are more coherent and informative. Also the method uses accurate sentiment polarity techniques to exactly capture the sentiment orientation and the model proves to be outperforming better when compared to other state of art methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

References

  1. Blei, D. M., Andrew, Y. N. G., and Jordan, M. I., Latent dirichlet allocation. J. Mach. Learn. Res. 3:993–1022, 2003.

    Google Scholar 

  2. Zuo, Y., Wu, J., Zhang, H., Lin, H., Wang, F., Xu, K., and Xiong, H., Topic modeling of short texts: a pseudo- document view. KDD ‘16, 2016.

  3. Hong, L., and Davison, B. D., Emprical study of topic modeling Twitter. In: Proceedings of SOMA, pages 80–88. ACM, 2010.

  4. Devi, G. U., Priyan, M. K., & Gokulnath, C. (2018). Wireless camera network with enhanced SIFT algorithm for human tracking mechanism. International Journal of Internet Technology and Secured Transactions, 8(2), 185–194.

  5. Cheng, X., Yan, X., Lan, Y., and Guo, J., BTM: topic modelling over short text. IEEE Trans. Knowl. Data Eng. 26(12), 2014.

  6. Xia, Y., Tang, N., Hussian, A., and Cambria, E., Discriminative bi-term topic model for headline-based social news clustering. Proceedings of the Twenty-Eighth International Florida Artificial Intelligence Research society conference.

  7. Vijayakumar, V., Priyan, M. K., Ushadevi, G., Varatharajan, R., Manogaran, G., & Tarare, P. V. (2018). E-health cloud security using timing enabled proxy re-encryption. Mobile Networks and Applications, 1–12.

  8. Li, W., Feng, Y., Li, D., and Yu, Z., Micro-blog topic detection method based on BTM topic model and K-means clustering algorithm. Autom. Control. Comput. Sci. 50(4):271–277, 2016.

    Article  Google Scholar 

  9. Pan, Y., Yin, J., Liu, S., and Li, J., A biterm-based Dirichlet process topic model for short texts. International Conference on Computer Science and Service system, CSSS, 2014.

  10. Li, C., Zhang, J., Sun, J. T., and Chen, Z., Sentiment topic model with decomposed prior. In: Proceedings of SDM, pages 767–775. SAIM, 2013.

  11. Mehrotra, R., Sanner, S., Buntine, W., and Xie, L., Improving lda topic modelsfor microblogs via tweet pooling and automatic labeling. In: Proceedings of SIGIR, pages 889–892. ACM, 2013.

  12. Mei, Q., Ling, X., Wondra, M., Su, H., and Zhai, C., Topic sentiment mixture:modeling facets and opinions in weblogs. In: Proceedings of WWW, pages 171–180. ACM, 2007.

  13. Nguyen, T.-S., Lauw, H. W., and Tsaparas, P., Review synthesis for micro-review summarization. In: Proceedings ofthe Eighth ACM International Conference on Web Search and Data Mining, WSDM ‘15, 2015.

  14. Lu, Z., Mamoulis, N., Pituoria, E., and Tsaparas, P., Sentiment-based topic suggestion for micro reviews. Proceedings of the Tenth International AAAI conference on Web and social media, 2016.

  15. Kumar, P. M., Devi, U., Manogaran, G., Sundarasekar, R., Chilamkurti, N., & Varatharajan, R. (2018). Ant colony optimization algorithm with Internet of Vehicles for intelligent traffic control system. Computer Networks, 144, 154-162.

  16. Manogaran, G., Shakeel, P. M., Hassanein, A. S., Priyan, M. K., & Gokulnath, C. (2018). Machine-learning approach based gamma distribution for Brian abnormalities detection and data sample imbalance analysis. IEEE Access.

  17. Mukherjee, S., and Bhat-tacharyya, P., Wikisent: weakly supervised sentiment analysis through extractive summarization withwikipedia, ECML PKDD’12, pp. 774–793, 2012.

  18. Bicalho, P., Pita, M., Pedrosa, G., Lacerda, A., and Pappa, G. L., A general framework to expand short text for topic modeling. Inf. Sci. 393:66–81, 2017.

    Article  Google Scholar 

  19. Rosen-Zvi, M., Griffiths, T., Steyvers, M., and Smyth, P., The author-topic model for authors and documents. In: Proceedings of the 20th Conference on Uncertainty in ArtificialIntelligence, UAI ‘04, 487–494, 2004.

  20. Mukherjee, S., Basu, G., and Joshi, S., Joint author sentiment topic model. In: Proceedings of the 2014 SIAM International Conferenceon Data Mining, pages 370–378. SIAM, 2014.

  21. Balan, E. V., Priyan, M. K., Nath, C. G., & Devi, G. U. (2014, December). Efficient energy scheme for wireless sensor network application. In 2014 IEEE International Conference on Computational Intelligence and Computing Research (pp. 1–5). IEEE.

  22. Lin, C., He, Y.. Joint sentiment/topic model for sentiment analysis. In: Proceedings of CIKM, pages 375–384, Hong Kong, China. ACM, 2009.

  23. Jo, Y., and Oh, A. H., Aspect and sentiment unification model for online reviewanalysis. In: Proceedings of WSDM, pages 815–824. ACM, 2011.

  24. Sundarasekar, R., Thanjaivadivel, M., Manogaran, G., Kumar, P. M., Varatharajan, R., Chilamkurti, N., & Hsu, C. H. (2018). Internet of things with maximal overlap discrete wavelet transform for remote health monitoring of abnormal ECG signals. Journal of medical systems, 42(11), 228.

  25. Priyan, M. K., & Devi, G. U. (2019) A survey on internet of vehicles: applications, technologies, challenges and opportunities. International Journal of Advanced Intelligence Paradigms 12 (1–2):98

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Nimala K.

Ethics declarations

Conflict of interest

The authors declare that this article content has no conflict of interest.

Ethical approval

This article does not contain any studies with human participants or animals performed by any of the authors.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This article is part of the Topical Collection on Mobile & Wireless Health

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

K, N., R, J. A Robust User Sentiment Biterm Topic Mixture Model Based on User Aggregation Strategy to Avoid Data Sparsity for Short Text. J Med Syst 43, 93 (2019). https://doi.org/10.1007/s10916-019-1225-5

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s10916-019-1225-5

Keywords

Navigation