Skip to main content

A Probabilistic Approach for Inferring Latent Entity Associations in Textual Web Contents

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 11448))

Abstract

Latent entity associations (EA) represent that two entities associate with each other indirectly through multiple intermediate entities in different textual Web contents (TWCs) including e-mails, Web news, social network pages, etc. In this paper, by adopting Bayesian Network as the framework to represent and infer latent EAs as well as the probabilities of associations, we propose the concept of entity association Bayesian Network (EABN). To construct EABN efficiently, we employ self-organizing map for TWC dataset division to make the co-occurrence-based dependence of each pair of entities concern just a small set of documents. Using probabilistic inferences of EABN, we evaluate and rank EAs in all possible entity pairs, by which novel latent EAs could be found. Experimental results show the effectiveness and efficiency of our approach.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   69.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   89.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Yin, Z., Yue, K., Wu, H., Su, Y.: Adaptive and parallel data acquisition from online big graphs. In: Pei, J., Manolopoulos, Y., Sadiq, S., Li, J. (eds.) DASFAA 2018, Part I. LNCS, vol. 10827, pp. 323–331. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-91452-7_21

    Chapter  Google Scholar 

  2. Liao, C., Xiong, Y., Kong, X., Zhu, Y., Zhao, S., Li, S.: Functional-oriented relationship strength estimation: from online events to offline interactions. In: Pei, J., Manolopoulos, Y., Sadiq, S., Li, J. (eds.) DASFAA 2018, Part I. LNCS, vol. 10827, pp. 442–459. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-91452-7_29

    Chapter  Google Scholar 

  3. Zhang, J., Tan, L., Tao, X., Zheng, X., Luo, Y., Lin, J.C.-W.: SLIND: identifying stable links in online social networks. In: Pei, J., Manolopoulos, Y., Sadiq, S., Li, J. (eds.) DASFAA 2018, Part II. LNCS, vol. 10828, pp. 813–816. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-91458-9_54

    Chapter  Google Scholar 

  4. Liu, W., Yue, K., Yue, M., et al.: A Bayesian network-based approach for incremental learning of uncertain knowledge. Int. J. Uncertain. Fuzziness Knowl.-Based Syst. 26(1), 87–108 (2018)

    Article  MathSciNet  Google Scholar 

  5. Teye, M., Azizpour, H., Smith, K.: Bayesian uncertainty estimation for batch normalized deep networks. In: Proceedings of the 35th International Conference on Machine Learning, pp. 4914–4923. ACM, New York (2018)

    Google Scholar 

  6. Ishak, R., Messaouda, F., Hafida, B.: Toward a general formalism of fuzzy multi-entity Bayesian networks for representing and reasoning with uncertain knowledge. In: Proceedings of the 19th International Conference on Enterprise Information Systems, ICEIS 2017, vol. 1, pp. 520–528. SciTePress, Setúbal (2017)

    Google Scholar 

  7. Kuo, R.J., Rizki, M., Zulvia, F.E., et al.: Integration of growing self-organizing map and bee colony optimization algorithm for part clustering. Comput. Ind. Eng. 120, 251–265 (2018)

    Article  Google Scholar 

  8. Li, Z., Fang, H., Huang, M., et al.: Data-driven bearing fault identification using improved hidden markov model and self-organizing map. Comput. Ind. Eng. 116, 37–46 (2018)

    Article  Google Scholar 

  9. Saraswati, A., Nguyen, V.T., Hagenbuchner, M., et al.: High-resolution self-organizing maps for advanced visualization and dimension reduction. Neural Netw. 105, 166–184 (2018)

    Article  Google Scholar 

  10. Daphne, K., Nir, F.: Probabilistic Graphical Models: Principles and Techniques, 1st edn. The MIT Press, Cambridge (2009)

    MATH  Google Scholar 

  11. Zhang, W., Pan, T., Wang, Y., et al.: UT-LDA based similarity computing in microblog. In: 2015 IEEE International Conference on Software Quality, Reliability and Security, pp. 197–201. IEEE, Piscataway (2015)

    Google Scholar 

  12. Wood, J., Tan, P., Wang, W., et al.: Source-LDA: enhancing probabilistic topic models using prior knowledge sources. In: 33rd IEEE International Conference on Data Engineering, pp. 411–422. IEEE, Piscataway (2017)

    Google Scholar 

  13. Poria, S., Chaturvedi, I., Bisio, F., et al.: Sentic LDA: improving on LDA with semantic similarity for aspect-based sentiment analysis. In: 2016 International Joint Conference on Neural Networks, pp. 4465–4473. IEEE, Piscataway (2016)

    Google Scholar 

  14. Mintz, M., Bills, S., Snow, R., et al.: Distant supervision for relation extraction without labeled data. In: ACL 2009, Proceedings of the 47th Annual Meeting of the Association for Computational Linguistics and the 4th International Joint Conference on Natural Language Processing of the AFNLP, pp. 1003–1011. ACL, Pennsylvania (2009)

    Google Scholar 

  15. Surdeanu, M., Tibshirani, J., Nallapati, R., et al.: Multi-instance multi-label learning for relation extraction. In: Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, pp. 455–465. ACL, Pennsylvania (2012)

    Google Scholar 

  16. Ren, X., Wu, Z., He, W., et al.: CoType: joint extraction of typed entities and relations with knowledge bases. In: Proceedings of the 26th International Conference on World Wide Web, pp. 1015–1024. ACM, New York (2017)

    Google Scholar 

  17. Idoudi, R., Ettabaâ, K.S., Solaiman, B.: Association rules-based ontology enrichment. Int. J. Web Appl. 8(1), 16–25 (2016)

    Google Scholar 

  18. Ahmed, E.B., Gargouri, F.: Enhanced association rules over ontology resources. Int. J. Web Appl. 7(1), 10–22 (2015)

    Google Scholar 

  19. Erlandsson, F., Bródka, P., Borg, A., et al.: Enhanced association rules over ontology resources. Entropy 18(5), 164–178 (2016)

    Article  Google Scholar 

  20. Wikidata. https://www.wikidata.org/wiki/Wikidata:Main_Page. Accessed 01 Oct 2018

  21. Färber, M., Bartscherer, F., Menne, C., et al.: Linked data quality of DBpedia, Freebase, OpenCyc, Wikidata, and YAGO. Seman. Web 9(1), 77–129 (2018)

    Article  Google Scholar 

  22. Ismayilov, A., Kontokostas, D., Auer, S., et al.: Wikidata through the Eyes of DBpedia. Seman. Web 9(4), 493–503 (2018)

    Article  Google Scholar 

  23. Cormen, T.H., Leiserson, C.E., Rivest, R., et al.: Introduction to Algorithms, 2nd edn. The MIT Press, Cambridge (2001)

    MATH  Google Scholar 

  24. Machine-Learning-with-R-datasets/groceries.csv. https://github.com/stedy/MachineLearning-with-R-datasets/blob/master/groceries.csv. Accessed 15 Nov 2018

  25. News Popularity in Multiple Social Media Platforms Data Set. http://archive.ics.uci.edu/ml/datasets/News+Popularity+in+Multiple+Social+Media+Platforms. Accessed 01 Nov 2018

Download references

Acknowledgement

This paper was supported by the National Natural Science Foundation of China (U1802271), Program for the second Batch of Yunling Scholar of Yunnan Province (C6153001), Donglu Scholar Cultivation Project of Yunnan University, and Research Foundation of Educational Department of Yunnan Province (2016ZZX006).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Kun Yue .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Li, L., Yue, K., Zhang, B., Sun, Z. (2019). A Probabilistic Approach for Inferring Latent Entity Associations in Textual Web Contents. In: Li, G., Yang, J., Gama, J., Natwichai, J., Tong, Y. (eds) Database Systems for Advanced Applications. DASFAA 2019. Lecture Notes in Computer Science(), vol 11448. Springer, Cham. https://doi.org/10.1007/978-3-030-18590-9_1

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-18590-9_1

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-18589-3

  • Online ISBN: 978-3-030-18590-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics