ABSTRACT
With the creation and rapid development of knowledge bases, it has become easier to understand the underlying semantics of unstructured text (short or long) on the web. In this work we examine the impact of entity linking on search logs. Search queries follow a Zipfian distribution: apart from a few popular queries (head queries), a significant percentage of queries (tail queries) occur rarely. A search log therefore contains sufficient data to analyze head queries but insufficient data (low frequency, limited clicks) to draw any conclusions about tail queries. We focus on quantifying the extent of overlap between long-tail and head queries by means of entity linking. Specifically, we analyze the frequency distribution of entities in head and tail queries. Our analysis shows that entity linking can indeed bridge the gap between the head and the tail.
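The overlap measurement described above can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the toy query log, the frequency threshold separating head from tail, and the dictionary-based `link_entities` function, which merely stands in for a real entity linker. It is not the authors' pipeline; it only shows one plausible way to count entities in head and tail queries and to measure how many tail queries share an entity with the head.

```python
from collections import Counter

# Hypothetical toy log mapping query -> frequency (not data from the paper).
QUERY_LOG = {
    "facebook": 900,
    "youtube": 700,
    "barack obama": 300,
    "paris": 120,
    "barack obama childhood home": 2,
    "cheap hotels near paris airport": 1,
    "youtube video upload error": 1,
    "homemade sourdough starter tips": 1,
}

# Hypothetical mention -> entity dictionary standing in for a real entity linker.
MENTION_TO_ENTITY = {
    "facebook": "Facebook",
    "youtube": "YouTube",
    "barack obama": "Barack_Obama",
    "paris": "Paris",
}


def link_entities(query):
    """Return entities whose mention occurs as a whole-word phrase in the query."""
    padded = f" {query.lower()} "
    return {
        entity
        for mention, entity in MENTION_TO_ENTITY.items()
        if f" {mention} " in padded
    }


def split_head_tail(log, head_threshold):
    """Split the log into head (frequency >= threshold) and tail queries."""
    head = {q: f for q, f in log.items() if f >= head_threshold}
    tail = {q: f for q, f in log.items() if f < head_threshold}
    return head, tail


def entity_counts(queries):
    """Count the number of distinct queries in which each entity is linked."""
    counts = Counter()
    for query in queries:
        counts.update(link_entities(query))
    return counts


def tail_coverage(log, head_threshold):
    """Fraction of tail queries sharing at least one entity with a head query."""
    head, tail = split_head_tail(log, head_threshold)
    head_entities = set(entity_counts(head))
    covered = [q for q in tail if link_entities(q) & head_entities]
    return len(covered) / len(tail) if tail else 0.0


if __name__ == "__main__":
    head, tail = split_head_tail(QUERY_LOG, head_threshold=100)
    print("head entity counts:", dict(entity_counts(head)))
    print("tail entity counts:", dict(entity_counts(tail)))
    print("tail coverage:", tail_coverage(QUERY_LOG, head_threshold=100))
```

In this toy log, three of the four tail queries mention an entity that also appears in a head query, even though none of the tail query strings themselves recur. A real study would replace the dictionary lookup with a trained entity linker and choose the head/tail threshold from the observed frequency distribution.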
Index Terms
- Bringing Head Closer to the Tail with Entity Linking