Skip to main content

Knowledge Fragment Enrichment Using Domain Knowledge Base

  • Conference paper
  • First Online:
  • 1213 Accesses

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 669))

Abstract

Knowledge fragment enrichment aims to complete user input concept fragment by augmenting each concept with rich domain information. This is a widely studied problem in cognitive science, but has not been intensively investigated in computer science. In this paper, we formally define the problem of knowledge fragment enrichment in domain knowledge base and develop a probabilistic graphical model to tackle the problem. The proposed model is able to model the dependencies among concepts in the input knowledge fragment and also capture the probabilistic relationship between concepts and domain entities. We empirically evaluate the proposed model on two different genres of datasets: PubMed and NSFC. On both datasets, the proposed model significantly improves the accuracy of label prediction task by up to 3–9 % (in terms of MAP) compared with several alternative enrichment methods.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    http://en.wikipedia.org/wiki/Jeopardy!.

  2. 2.

    http://www.nlm.nih.gov/bsd/disted/meshtutorial/.

  3. 3.

    https://www.acm.org/about/class.

  4. 4.

    http://www.nsfc.gov.cn/nsfc/cen/daima/.

  5. 5.

    http://www.nsfc.gov.cn/nsfc/cen/daima/.

  6. 6.

    http://www.ncbi.nlm.nih.gov/pubmed.

  7. 7.

    http://www.nlm.nih.gov/mesh/trees.html.

References

  1. Asuncion, A., Welling, M., Smyth, P., Teh, Y.W.: On smoothing and inference for topic models. In: UAI 2009, pp. 27–34. AUAI Press (2009)

    Google Scholar 

  2. Bakalov, A., McCallum, A., Wallach, H., Mimno, D.: Topic models for taxonomies. In: JCDL 2012, pp. 237–240 (2012)

    Google Scholar 

  3. Blei, D.M., Griffiths, T.L., Jordan, M.I., Tenenbaum, J.B.: Hierarchical topic models and the nested chinese restaurant process. In: NIPS 2003 (2003)

    Google Scholar 

  4. Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent dirichlet allocation. JMLR 3, 993–1022 (2003)

    MATH  Google Scholar 

  5. Chemudugunta, C., Smyth, P., Steyvers, M.: Text modeling using unsupervised topic models and concept hierarchies. arXiv preprint arXiv:0808.0973 (2008)

  6. Chen, X., Zhou, M., Carin, L.: The contextual focused topic model. In: KDD 2012, pp. 96–104 (2012)

    Google Scholar 

  7. Collins, A.M., Loffus, E.F.: A spreading activation theory of semnatic processing. Psychol. Rev. 82, 407–428 (1975)

    Article  Google Scholar 

  8. Collins, A.M., Quiliam, M.K.: Retrieval time from semantic memory. J. Verbal Learn. Verbal Behav. 8, 240–247 (1969)

    Article  Google Scholar 

  9. Kang, D., Park, Y., Chari, S.N.: Hetero-labeled LDA: a partially supervised topic model with heterogeneous labels. In: Calders, T., Esposito, F., Hüllermeier, E., Meo, R. (eds.) ECML PKDD 2014, Part I. LNCS, vol. 8724, pp. 640–655. Springer, Heidelberg (2014)

    Google Scholar 

  10. Kim, D.k., Voelker, G., Saul, L.K.: A variational approximation for topic modeling of hierarchical corpora. In: ICML 2013, pp. 55–63 (2013)

    Google Scholar 

  11. Mimno, D., Li, W., McCallum, A.: Mixtures of hierarchical topics with pachinko allocation. In: ICML 2007, pp. 633–640 (2007)

    Google Scholar 

  12. Mimno, D., Wallach, H.M., Talley, E., Leenders, M., McCallum, A.: Optimizing semantic coherence in topic models. In: EMNLP 2011, pp. 262–272 (2011)

    Google Scholar 

  13. Sun, Y., Yu, Y., Han, J.: Ranking-based clustering of heterogeneous information networks with star network schema. In: KDD 2009, pp. 797–806 (2009)

    Google Scholar 

  14. Teh, Y.W., Newman, D., Welling, M.: A collapsed variational bayesian inference algorithm for latent dirichlet allocation. In: NIPS, vol. 6, pp. 1378–1385 (2006)

    Google Scholar 

  15. Tolman, E.C.: Cognitive maps in rats and men. Psychol. Rev. 55(4), 189–208 (1984)

    Article  Google Scholar 

  16. Wang, C., Danilevsky, M., Liu, J., Desai, N., Ji, H., Han, J.: Constructing topical hierarchies in heterogeneous information networks. In: ICDM 2013, pp. 767–776 (2013)

    Google Scholar 

  17. Zhang, H.P., Yu, H.K., Xiong, D.Y., Liu, Q.: HHMM-based Chinese lexical analyzer ICTCLAS. In: SIGHAN Workshop on Chinese Language Processing, pp. 184–187 (2003)

    Google Scholar 

  18. Zhang, K., Xu, H., Tang, J., Li, J.: Keyword extraction using support vector machine. In: Advances in Web-Age Information Management, pp. 85–96 (2006)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jing Zhang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer Nature Singapore Pte Ltd.

About this paper

Cite this paper

Zhang, J. et al. (2016). Knowledge Fragment Enrichment Using Domain Knowledge Base. In: Li, Y., Xiang, G., Lin, H., Wang, M. (eds) Social Media Processing. SMP 2016. Communications in Computer and Information Science, vol 669. Springer, Singapore. https://doi.org/10.1007/978-981-10-2993-6_24

Download citation

  • DOI: https://doi.org/10.1007/978-981-10-2993-6_24

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-10-2992-9

  • Online ISBN: 978-981-10-2993-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics