Abstract
Similar measures play an important role in information processing and have been widely investigated in computer science. With the exploration of social media such as Youtube, Wikipedia, Facebook etc., a huge number of entries have been posted on these portals. They are often described by means of short text or sets of words. Discovering similar entries based on such texts has become challenges in constructing information searching or filtering engines and attracted several research interests. In this paper, we firstly introduce a model of entries posted on media or entertainment portals, which is based on their features composed of title, category, tags, and content. Then, we present a novel similar measure among entries that incorporates their features. The experimental results show the superiority of our incorporation similarity measure compared with the other ones.
Keywords
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Lin, D.: An information-theoretic definition of similarity. In: Proceedings of the 15th International Conference on Machine Learning, pp. 296–304. Morgan Kaufmann, San Francisco (1998)
Sayal, R., Kumar, V.V.: A novel similarity measure for clustering categorical data sets. Int. J. Comput. Appl. 17(1), 25–30 (2011). Published by Foundation of Computer Science
Reddy, G.S., Krishnaiah, R.V.: A novel similarity measure for clustering categorical data sets. IOSR J. Comput. Eng. (IOSRJCE) 4(6), 37–42 (2012)
Nguyen, M.H., Nguyen, T.H.: A general model for similarity measurement between objects. Int. J. Adv. Comput. Sci. Appl. (IJACSA) 6(2), 235–239 (2015)
Buscaldi, D., Roux, J.L., Flores, J.J.G., Popescu, A.: Lipn-core: Semantic text similarity using n-grams, wordnet, syntactic analysis, esa and information retrieval based features (2013)
Han, L., Kashyap, A.L., Finin, T., Mayfield, J., Weese, J.: Semantic textual similarity systems. In: Second Joint Conference on Lexical and Computational Semantics (*SEM), Volume 1: Proceedings of the Main Conference and the Shared Task: Semantic Textual Similarity, pp. 44–52. Association for Computational Linguistics, Atlanta, June 2013
Lee, M.C., Chang, J.W., Hsieh, T.C.: A grammar-based semantic similarity algorithm for natural language sentences. Sci. World J. 2014, 17 (2014)
Marsi, E., Moen, H., Bungum, L., Sizov, G., Gambäck, B., Lynum, A.: Combining strong features for semantic similarity. In: Second Joint Conference on Lexical and Computational Semantics (*SEM), Volume 1: Proceedings of the Main Conference and the Shared Task: Semantic Textual Similarity, pp. 66–73. Association for Computational Linguistics, Atlanta, June 2013
Oliva, J., Serrano, J.I., del Castillo, M.D., Iglesias, Á.: Symss: a syntax-based measure for short-text semantic similarity. Data Knowl. Eng. 70(4), 390–405 (2011)
Agirre, E., Cer, D., Diab, M., Gonzalez-Agirre, A., Guo, W.: Semantic textual similarity. In: Second Joint Conference on Lexical and Computational Semantics (*SEM), Volume 1: Proceedings of the Main Conference and the Shared Task: Semantic Textual Similarity, pp. 32–43. Association for Computational Linguistics, Atlanta, June 2013
Nguyen, M.H., Tran, D.Q.: A semantic similarity measure between sentences. South-East Asian J. Sci. 3(1), 63–75 (2014)
Tran, D.Q., Nguyen, M.H.: A mathematical model for semantic similarity measures. South-East Asian J. Sci. 1(1), 32–45 (2012)
Novelli, A.D.P., Oliveira, J.M.P.D.: Article: a method for measuring semantic similarity of documents. Int. J. Comput. Appl. 60(7), 17–22 (2012)
Bollegala, D., Matsuo, Y., Ishizuka, M.: A web search engine-based approach to measure semantic similarity between words. IEEE Trans. Knowl. Data Eng. 23(7), 977–990 (2011)
Buscaldi, D., Rosso, P., Gomez-Soriano, J.M., Sanchis, E.: Answering questions with an n-gram based passage retrieval engine. J. Intell. Inf. Syst. 34(2), 113–134 (2010)
Croce, D., Storch, V., Basili, R.: Combining text similarity and semantic filters through sv regression. In: Second Joint Conference on Lexical and Computational Semantics (*SEM), Volume 1: Proceedings of the Main Conference and the Shared Task: Semantic Textual Similarity, pp. 59–65. Association for Computational Linguistics, Atlanta, June 2013
Finkel, J.R., Grenager, T., Manning, C.: Incorporating non-local information into information extraction systems by gibbs sampling. In: Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics, ACL 2005, pp. 363–370. Association for Computational Linguistics, Stroudsburg (2005)
Lintean, M.C., Rus, V.: Measuring semantic similarity in short texts through greedy pairing and word semantics. In: Youngblood, G.M., McCarthy, P.M. (eds.) Proceedings of the Twenty-Fifth International Florida Artificial Intelligence Research Society Conference, Marco Island, Florida, 23–25 May 2012. AAAI Press (2012)
Proisl, T., Evert, T., Greiner, P., Kabashi, B.: Robust semantic similarity at multiple levels using maximum weight matching. In: Proceedings of the 8th International Workshop on Semantic Evaluation (SemEval 2014), pp. 532–540. Association for Computational Linguistics and Dublin City University, Dublin, August 2014
Šarić, F., Glavaš, G., Karan, M., Šnajder, J., Bašić, B.D.: Takelab: systems for measuring semantic text similarity. In: Proceedings of the First Joint Conference on Lexical and Computational Semantics - Volume 1: Proceedings of the Main Conference and the Shared Task, and Volume 2: Proceedings of the Sixth International Workshop on Semantic Evaluation, SemEval 2012, pp. 441–448. Association for Computational Linguistics, Stroudsburg (2012)
Severyn, A., Nicosia, M., Moschitti, A.: Tree kernel learning for textual similarity. In: Second Joint Conference on Lexical and Computational Semantics (*SEM), Volume 1: Proceedings of the Main Conference and the Shared Task: Semantic Textual Similarity, pp. 53–58. Association for Computational Linguistics, Atlanta, June 2013
Sultan, M.A., Bethard, S., Sumner, T.: Sentence similarity from word alignment. In: Proceedings of the 8th International Workshop on Semantic Evaluation (SemEval 2014), pp. 241–246. Association for Computational Linguistics and Dublin City University, Dublin, August 2014
Xu, J., Lu, Q.: Computing semantic textual similarity using overlapped senses. In: Second Joint Conference on Lexical and Computational Semantics (*SEM), Volume 1: Proceedings of the Main Conference and the Shared Task: Semantic Textual Similarity, pp. 90–95. Association for Computational Linguistics, Atlanta, June 2013
Nguyen, T.H., Tran, D.Q., Dam, G.M., Nguyen, M.H.: Multi-feature based similarity among entries on media portals. In: Akagi, M., Nguyen, T.-T., Vu, D.-T., Phung, T.-N., Huynh, V.-N. (eds.) ICTA 2016. AISC, vol. 538, pp. 373–382. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-49073-1_41
Salton, G., McGill, M.J.: Introduction to Modern Information Retrieval. McGraw-Hill Inc., New York (1986)
Ortony, A., Clore, G.L., Collins, A.: The Congnitive Structure of Emotions. The Cambridge University Press, Cambridge (1988)
Frijda, N.H.: The Emotions: Studies in Emotion & Social Interaction. Edition de la Maison des Sciences de l’Homme, ser edn. Cambridge University Press, Paris (1986)
Reisenzein, R.: Emotions as metarepresentational states of mind: Naturalizing the belief-desire theory of emotion. Cognit. Syst. Res. 10(1), 6–20 (2009)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 ICST Institute for Computer Sciences, Social Informatics and Telecommunications Engineering
About this paper
Cite this paper
Nguyen, T.H., Tran, D.Q., Dam, G.M., Nguyen, M.H. (2018). Integrated Sentiment and Emotion into Estimating the Similarity Among Entries on Social Network. In: Chen, Y., Duong, T. (eds) Industrial Networks and Intelligent Systems. INISCOM 2017. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 221. Springer, Cham. https://doi.org/10.1007/978-3-319-74176-5_21
Download citation
DOI: https://doi.org/10.1007/978-3-319-74176-5_21
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-74175-8
Online ISBN: 978-3-319-74176-5
eBook Packages: Computer ScienceComputer Science (R0)