Abstract
As the Web is becoming the largest knowledge repository which contains various entities and their relations, the task of related entity retrieval excites interest in the field of information retrieval. This challenging task is introduced in TREC 2009 Entity Track. In this task, given an entity and the type of the target entity, as well as the nature of their relation described in free text, a retrieval system is required to return a ranked list of related entities that are of the target type. It means that entity ranking goes beyond entity relevance and integrates the judgment of relation into the evaluation of the retrieved entities. In this paper, we propose a probability model using relation pattern to address the task of related entity retrieval. This model takes into account both relevance and relation between entities. We focus on using relation patterns to measure the level of relation matching between entities, and then to estimate the probability of occurrence of relation between two entities. In addition, we represent entity by its context language model and measure the relevance between two entities by a language model approach. Experimental results on TREC Entity Track dataset show that our proposed model significantly improves retrieval performances over baseline. The comparison with other approaches also reveals the effectiveness of our model.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Balog, K., de Vries, A.P.: Overview of the TREC 2009 Entity Track. In: Proceedings of TREC 2009, Gaithersburg, USA (2009)
Fang, Y., et al.: Entity Retrieval by Hierarchical Relevance Model, Exploiting the Structure of Tables and Learning Homepage Classifiers. In: Proceedings of TREC 2009 (2009)
McCreadie, R., et al.: University of Glasgow at TREC 2009: Experiments with Terrier. In: Proceedings of TREC 2009 (2009)
Wu, Y., Kashioka, H.: NiCT at TREC 2009: Employing Three Models for Entity Ranking Track. In: Proceedings of TREC 2009 (2009)
Hu, G., et al.: A Supervised Learning Approach to Entity Search. In: Proceedings of Asian Information Retrieval Symposium 2006, pp. 54–66 (2006)
Mikhail, B., Steven, S.: Concordance-Based Entity-Oriented Search. In: Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence. IEEE Computer Society (2007)
Hugo, Z., et al.: Ranking very many typed entities on wikipedia. In: Proceedings of the Sixteenth ACM Conference on Information and Knowledge Management. ACM, Lisbon (2007)
Henning, R., Pavel, S., Djoerd, H.: Combining document- and paragraph-based entity ranking. In: Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, Singapore (2008)
Tao, C., Xifeng, Y., Kevin Chen-Chuan, C.: EntityRank: searching entities directly and holistically. In: Proceedings of the 33rd International Conference on Very Large Data Bases, pp. 387–398. VLDB Endowment, Vienna (2007)
Balog, K., Bron, M., de Rijke, M.: Category-based Query Modeling for Entity Search. In: 32nd European Conference on Information Retrieval (2010)
Krisztian, B., Leif, A., de Maarten, R.: Formal models for expert finding in enterprise corpora. In: Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 43–50. ACM, Seattle (2006)
Yupeng, F., et al.: A CDD-based formal model for expert finding. In: Proceedings of the Sixteenth ACM Conference on Information and Knowledge Management, pp. 881–884. ACM, Lisbon (2007)
Bron, M., Balog, K., de Rijke, M.: Related Entity Finding Based on Co-Occurrence. In: Proceedings of TREC 2009 (2009)
Schlaefer, N., Gieselmann, P., Schaaf, T., Waibel, A.: A Pattern Learning Approach to Question Answering within the Ephyra Framework. In: Sojka, P., Kopeček, I., Pala, K. (eds.) TSD 2006. LNCS (LNAI), vol. 4188, pp. 687–694. Springer, Heidelberg (2006)
Zhang, D., Lee, W.: Web based pattern mining and matching approach to question answering. In: Proceedings of the 11th Text Retrieval Conference (2002)
Ogilvie, P., Callan, J.: Experiments using the Lemur toolkit. In: Proceedings of the 2001 TREC Conference (2002)
Chengxiang, Z., John, L.: A study of smoothing methods for language models applied to information retrieval. ACM Trans. Inf. Syst. 22(2), 179–214 (2004)
Daniel, S.W., Raphael, H., Fei, W.: Using Wikipedia to bootstrap open information extraction. SIGMOD Rec. 37(4), 62–68 (2008)
Chia-Hui, C., Shao-Chen, L.: IEPAD: information extraction based on pattern discovery. In: Proceedings of the 10th International Conference on World Wide Web, pp. 681–688. ACM, Hong Kong (2001)
Rajasekar, K., et al.: SystemT: a system for declarative information extraction. SIGMOD Rec. 37(4), 7–13 (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Jiang, P., Yang, Q., Zhang, C., Niu, Z., Fu, H. (2011). A Probability Model for Related Entity Retrieval Using Relation Pattern. In: Xiong, H., Lee, W.B. (eds) Knowledge Science, Engineering and Management. KSEM 2011. Lecture Notes in Computer Science(), vol 7091. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-25975-3_28
Download citation
DOI: https://doi.org/10.1007/978-3-642-25975-3_28
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-25974-6
Online ISBN: 978-3-642-25975-3
eBook Packages: Computer ScienceComputer Science (R0)