Abstract
Latent entity associations (EAs) describe two entities that are associated indirectly through multiple intermediate entities across different textual Web contents (TWCs), such as e-mails, Web news, and social network pages. In this paper, we adopt the Bayesian network as the framework for representing and inferring latent EAs together with the probabilities of the associations, and we propose the concept of the entity association Bayesian network (EABN). To construct the EABN efficiently, we use a self-organizing map to divide the TWC dataset, so that the co-occurrence-based dependence of each pair of entities involves only a small set of documents. Using probabilistic inference on the EABN, we evaluate and rank the EAs of all possible entity pairs, through which novel latent EAs can be found. Experimental results show the effectiveness and efficiency of our approach.
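To make the idea of an indirect association concrete, the following is a minimal sketch and not the EABN construction from the paper: it assumes a hand-fixed chain source -> mid -> target, a toy document collection, and conditional probabilities estimated from raw document co-occurrence counts. The entity names and the helper functions `cond_prob` and `chain_inference` are hypothetical, and the self-organizing map step is omitted entirely.

```python
from fractions import Fraction

# Toy TWC collection (assumed data): each document is the set of entities it mentions.
docs = [
    {"alice", "acme"},
    {"alice", "acme"},
    {"acme", "bob"},
    {"acme", "bob"},
    {"carol"},
    {"carol", "dave"},
]


def cond_prob(docs, target, given, given_present):
    """Estimate P(target present | given present/absent) from document counts."""
    matching = [d for d in docs if (given in d) == given_present]
    if not matching:
        return Fraction(0)
    return Fraction(sum(target in d for d in matching), len(matching))


def chain_inference(docs, source, mid, target):
    """P(target=1 | source=1) in the assumed chain source -> mid -> target.

    Marginalizes over the intermediate entity:
    sum over m of P(mid=m | source=1) * P(target=1 | mid=m).
    """
    p_mid_present = cond_prob(docs, mid, source, True)
    total = Fraction(0)
    for mid_present in (True, False):
        p_mid = p_mid_present if mid_present else 1 - p_mid_present
        total += p_mid * cond_prob(docs, target, mid, mid_present)
    return total


# "alice" and "bob" never share a document, so the direct estimate is zero,
# while the chain through "acme" still yields a nonzero association.
direct = cond_prob(docs, "bob", "alice", True)
latent = chain_inference(docs, "alice", "acme", "bob")
print(direct, latent)
```

In this toy collection the direct co-occurrence estimate for the pair is zero, yet marginalizing over the shared intermediate entity gives a positive probability, which is the kind of latent association the abstract refers to. Ranking all entity pairs by such inferred probabilities would require a learned network structure, which this sketch does not attempt.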
Acknowledgement
This paper was supported by the National Natural Science Foundation of China (U1802271), the Program for the Second Batch of Yunling Scholar of Yunnan Province (C6153001), the Donglu Scholar Cultivation Project of Yunnan University, and the Research Foundation of Educational Department of Yunnan Province (2016ZZX006).
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Li, L., Yue, K., Zhang, B., Sun, Z. (2019). A Probabilistic Approach for Inferring Latent Entity Associations in Textual Web Contents. In: Li, G., Yang, J., Gama, J., Natwichai, J., Tong, Y. (eds) Database Systems for Advanced Applications. DASFAA 2019. Lecture Notes in Computer Science, vol 11448. Springer, Cham. https://doi.org/10.1007/978-3-030-18590-9_1
DOI: https://doi.org/10.1007/978-3-030-18590-9_1
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-18589-3
Online ISBN: 978-3-030-18590-9
eBook Packages: Computer Science (R0)