Knowledge Fragment Enrichment Using Domain Knowledge Base

Zhang, Jing; Zhuang, Honglei; Song, Yanglei; Han, Jiawei; Zhang, Yutao; Tang, Jie; Li, Juanzi

doi:10.1007/978-981-10-2993-6_24

Conference paper
First Online: 19 October 2016

Part of the book series: Communications in Computer and Information Science (CCIS, volume 669)

Abstract

Knowledge fragment enrichment aims to complete a user-supplied concept fragment by augmenting each concept with rich domain information. The problem is widely studied in cognitive science but has not been intensively investigated in computer science. In this paper, we formally define the problem of knowledge fragment enrichment over a domain knowledge base and develop a probabilistic graphical model to tackle it. The proposed model captures both the dependencies among concepts in the input knowledge fragment and the probabilistic relationships between concepts and domain entities. We empirically evaluate the model on two datasets of different genres: PubMed and NSFC. On both datasets, the proposed model significantly improves accuracy on the label prediction task, by up to 3–9% in terms of MAP, compared with several alternative enrichment methods.
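The abstract reports results in terms of MAP but does not spell out the computation. As a minimal sketch, assuming MAP here means the standard mean average precision over ranked label predictions, the metric could be computed as follows. The function names `average_precision` and `mean_average_precision` are illustrative and do not come from the paper, and each ranking is assumed to contain no duplicate labels.

```python
from typing import Hashable, Sequence, Set


def average_precision(ranked: Sequence[Hashable], relevant: Set[Hashable]) -> float:
    """Average precision for one query.

    Averages precision@k over the ranks k at which a relevant label appears,
    normalised by the total number of relevant labels.
    Assumes `ranked` contains no duplicates (illustrative helper, not from the paper).
    """
    if not relevant:
        return 0.0
    hits = 0
    precision_sum = 0.0
    for k, label in enumerate(ranked, start=1):
        if label in relevant:
            hits += 1
            precision_sum += hits / k  # precision at rank k
    return precision_sum / len(relevant)


def mean_average_precision(
    rankings: Sequence[Sequence[Hashable]],
    relevants: Sequence[Set[Hashable]],
) -> float:
    """Mean of per-query average precision over all queries."""
    pairs = list(zip(rankings, relevants))
    if not pairs:
        return 0.0
    return sum(average_precision(r, rel) for r, rel in pairs) / len(pairs)


if __name__ == "__main__":
    # Toy rankings with made-up labels, purely for illustration.
    rankings = [["a", "b", "c", "d"], ["x", "y"]]
    relevants = [{"a", "c"}, {"y"}]
    print(mean_average_precision(rankings, relevants))
```

Under this definition, a reported gain of a few MAP points means that correct labels are, on average, ranked closer to the top of the predicted list.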

Author information

Authors and Affiliations

Renmin University of China, Beijing, China
Jing Zhang
University of Illinois at Urbana-Champaign, Champaign, USA
Honglei Zhuang, Yanglei Song & Jiawei Han
Tsinghua University, Beijing, China
Jing Zhang, Yutao Zhang, Jie Tang & Juanzi Li

Corresponding author

Correspondence to Jing Zhang.

Editor information

Editors and Affiliations

Beijing Language and Culture University, Beijing, China
Yuming Li
Jiangxi Normal University, Nanchang, China
Guoxiong Xiang
Dalian University of Technology, Dalian, China
Hongfei Lin
Jiangxi Normal University, Nanchang, China
Mingwen Wang

About this paper

Cite this paper

Zhang, J. et al. (2016). Knowledge Fragment Enrichment Using Domain Knowledge Base. In: Li, Y., Xiang, G., Lin, H., Wang, M. (eds) Social Media Processing. SMP 2016. Communications in Computer and Information Science, vol 669. Springer, Singapore. https://doi.org/10.1007/978-981-10-2993-6_24

DOI: https://doi.org/10.1007/978-981-10-2993-6_24
Published: 19 October 2016
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-2992-9
Online ISBN: 978-981-10-2993-6
eBook Packages: Computer Science, Computer Science (R0)
