Abstract
In order to enhance the real-time performance of Internet public opinion recognizing and early warning, and improve the accuracy of the analysis of Internet public opinion for hot spots, similarity analysis methods of Internet public opinion are put forward. Firstly, web crawler technology is introduced for obtaining accurate and comprehensive public opinion. Secondly, propose similarity algorithms from the aspects of known and unknown of the subject. At the same time, considering the uncertainty and fuzzy of Internet public opinion, the concept of information entropy is introduced, and present a similarity analysis approach of Internet public opinion based on information entropy, and can cluster and identify hot spots and crisis events of Internet public opinion. Experimental results show that the proposed methods can quickly obtain the Internet public opinion, and has high accuracy rate of clustering, which provide an important technical support for Internet public opinion monitoring and recognizing.
Similar content being viewed by others
References
Zheng, Y.: On the target positioning and realization guarantee of network public. Inf. Sci. 33(6), 81–85 (2015)
Wang, G.: Research on Hotspot Discovery in Internet Public Opinions Based on Improved K-Means. Computational Intelligence and Neuroscience. ID 230946 (2013)
Xue, Y., Xu, L., Qiu, B.: Relationship discovery in public opinion and actual behavior for social media stock data space. EURASIP J. Wirel. Commun. Netw. 2016, 216 (2016)
Wang, S., Peng, Y., Wang, J.: Research of the text clustering based on LDA using in network public opinion analysis. J. Shandong Univ. 49(9), 129–133 (2014)
Yang, P., Gui, X., Tian, F.: Efficient keywords clustering method for topic detection. J. Xi’an Jiaotong Univ. 46(10), 873–876 (2012)
Tang, H., Wang, H.: Application of improved K-means algorithm to analysis of online public opinions. Syst. Appl. Comput. 20(3), 165–168 (2011)
Zhang, Y., Wang, Z.: Forum user social network mining based on content similarity. J. Intell. 29(8), 165–168 (2010)
Yang, Z., Duan, L., Lai, Y.: Online public opinion hot spot detection and analysis based on short text clustering using string distance. J. Beijing Univ. Technol. 36(5), 669–673 (2010)
Wang, H., Cao, C., Gao, S.: Research on text clustering of micro-blog public opinion: word sense cluster and collocation-based method. J. Nanjing Normal Univ. 38(1), 57–64 (2015)
Liu, Y., Lv, K., Liu, J.: Design of public sentiment monitoring system based on co-ICIB co-clustering. J. Henan Polytech. Univ. 32(5), 592–595 (2013)
Tong, L.: Research and Implementation of Network Public Opinion Analysis System Based on Hadoop Platform. Jilin University, Changchun (2015)
Wang, Z., He, M., Du, Y.: Text similarity computing based on topic model LDA. Comput. Sci. 40(12), 229–232 (2013)
Gu, C., Xu, H., Zhou, H., Zhang, J.: Text similarity computing based on lexical semantic information. Appl. Res. Comput. 35(2) (2018)
Li, L., Zhu, A., Su, T.: Research and implementation of an improved vsm-based text similarity algorithm. Comput. Appl. Softw. 29(2), 282–284 (2012)
Li, S., Ling, W., Gong, J., Zhou, C.: Text-similarity method based on entropy. Appl. Res. Comput. 33(3), 665–668 (2016)
Huang, C., Yin, J., Hou, F.: A text similarity measurement combining word semantic information with TF-IDF method. Chin. J. Comput. 34(5), 856–864 (2011)
Hua, X., Zhu, Q., Li, P.: Chinese text similarity method research by combining semantic analysis with statistics. Appl. Res. Comput. 29(3), 833–836 (2012)
Ju, X., Chen, J., Shao, H.: Hierarchical web page classification method based on vector space model. J. Nantong Univ. 9(1), 24–29 (2010)
Li, D., Liao, X., Fan, F.: A focused network crawler with topic knowledge automatically growing. Comput. Appl. Softw. 31(5), 29–33 (2014)
Jin, J.: Review of clustering method. Comput. Sci. 41(11A), 288–293 (2014)
Tang, L.: Text feature selection method based on information entropy and dynamic clustering. Comput. Eng. Appl. 51(19), 152–157 (2015)
Niewiarowski, A., Stanuszek, M.: Mechanism of analysis of similarity short texts based on the Levenshtein distance. Stud. Inform. 34(1), 107–114 (2013)
Yang, K., Zhang, Y., Li, Y.: Feature selection method based on document frequency. Comput. Eng. 36(17), 33–35 (2010)
Chen, X.: Research on fast autonomous clustering method of microblog public opinion based on big data technology. J. Intell. 36(5), 113117 (2017)
Li, H., Shi, Z., Yi, J.: Secondary clustering recommendation algorithm based on information entropy. Comput. Eng. 42(5), 213–217 (2016)
Chen, X., Zhang, J., Cheng, J.: Analysis of similarity of DNA sequences based on information quantity. Appl. Res. Comput. 30(5), 1381–1384 (2013)
Tao, Z., Liu, X., Chen, H.: Entropy measures for linguistic information and its application to decision making. J. Intell. Fuzzy Syst. 29(2), 747–759 (2015)
Park, I.-K., Choi, G.-S.: A variable-precision information-entropy rough set approach for job searching. Inf. Syst. 48, 279–288 (2015)
Li, X., Li, G., Xiao, M.: Novel classification method for remote sensing images based on information entropy discretization algorithm and vector space model. Comput. Geosci. 89, 252–259 (2016)
Zhou, R., Yang, Z., Yu, M., Dan, A.R.: A portfolio optimization model based on information entropy and fuzzy time series. Fuzzy Optim. Decis. Mak. 14, 381–397 (2015)
Zhang, X., Mei, C., Chen, D., Li, J.: Feature selection in mixed data: A method using a novel fuzzy rough set-based information entropy. Pattern Recognit. 56, 1–15 (2016)
Wang, H., Yao, X.: Objective reduction based on nonlinear correlation information entropy. Soft Comput. 20, 2393–2407 (2016)
Navarrete, J., Viejo, D., Cazorla, M.: Color smoothing for RGB-D data using entropy information. Appl. Soft Comput. 46, 361–380 (2016)
Chen, S., Chen, Q., Wu, Z.: A hierarchical clustering algorithm based on kernel function. J. Jinan Univ. 32(1), 31–34 (2011)
Acknowledgements
The authors would like to thank for financial support by youth fund project of the humanities and social sciences of Education Ministry (No. 15YJC870004), science and technology innovation team of XiangNan University (Recognition and analysis based on big data public opinion), Social Sciences fund project of Hunan Province (No. 13YBA302) and education Science in Hunan province in 12th Five-Year planning project (XJK014CGD081, XJK011BXJ004).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Chen, X., Duan, S. & Wang, Ld. Research on clustering analysis of Internet public opinion. Cluster Comput 22 (Suppl 3), 5997–6007 (2019). https://doi.org/10.1007/s10586-018-1781-3
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10586-018-1781-3