Abstract
The automatic annotation of textual content with entities from a knowledge base is a well established field. Applications, such as DBpedia Spotlight and GATE enable to identify and disambiguate entities of text at high levels of accuracy. The output of such systems can be used in many different ways. One way is to show knowledge panels which provide a fact-based summary of an entity and provides further information as well as browsing options. Such fact-based summaries are produced by entity summarization systems.
This paper presents ELES, a lightweight combination of DBpedia Spotlight and the SUMMA entity summarization interface. DBpedia Spotlight analyzes text and links fragments to entities of the DBpedia knowledge base. The LinkSUM summarizer (interfaced via the SUMMA API definition) produces fact-based summaries of DBpedia entities. The two applications are combined on the client side through the “Internationalization Tag Set 2.0” W3C recommendation and lightweight jQuery-based interfaces.
You have full access to this open access chapter, Download conference paper PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
1 Introduction
The field of linking fragments from text to entities of a knowledge base is currently at an advanced stage: words and compounds can be identified and disambiguated at high levels of accuracy [1, 2, 8]. Entity linking usually enables to provide further information on the entities, browsing, or recommendation. In this work, we consider the use case of providing a knowledge panel that pops up on mouse over entities that were identified through annotation. The knowledge panel is used to explain the entities identified by the annotator with concise fact-based information units and also enables further browsing (i.e. navigating through related entities). Entity summarization enables to filter all information that is available about an entity (often more than 1000 facts) and to select a small fraction that will be presented to the user.
With this demo, we propose loose coupling between automatic entity linking and entity summarization systems with the “Internationalization Tag Set 2.0” (ITS 2.0) W3C recommendation [3]. We exemplify the feasibility of the lightweight integration with the applications DBpedia Spotlight [1] (as a entity linking tool) and LinkSUM [4] (as an entity summarization tool interfaced via the SUMMA API [5]). Both applications use DBpediaFootnote 1 as a knowledge base.
2 Implementation
We make use of ITS 2.0 by using the its-ta-ident-ref attribute as the combining element between entity linking and entity summaries. This attribute can be used with HTML elements such as <span> tags (for an example see Listing 1). The system is implemented in accordance to the following workflow:
-
1.
Automatically identify and annotate entities of a text.
-
2.
Register a knowledge panel for each identified entity (on mouseover).
We had to extend the DBpedia Spotlight jQuery plugin in order to enable ITS 2.0 output.Footnote 2 The system uses a Web service in order to anntoate one or more text paragraphs with entities from the DBpedia knowledge base. The entity summaries are produced by LinkSUM [4]. The produced summaries are displayed as knowledge panels via the summaClientFootnote 3 implementation [5]. The original implementation has been extended by the qSUM method. It registers mouseover events for all elements with an its-ta-ident-ref attribute. The respective knowledge panel is then shown at the position of the respective annotation. The LinkSUM entity summarization system can be easily exchanged by another entity summarization system that implements the SUMMA API.
A screenshot of an automatically annotated text is provided in Fig. 1 and a do-it-yourself example is provided in Listing 2.
3 Related Work
Our work on ELES was inspired by Denny Vrandečić’s qLabelFootnote 4 project. qLabel is a jQuery-based application that uses text fragments, annotated with its-ta-ident-ref references to Wikidata [7], for translation between different languages. For this, qLabel leverages the wealth of multi-lingual labels available in Wikidata. A Wikidata-based summarization system that uses the SUMMA API can be easily combined with qLabel via qSUM.
There are also a number of proprietary solutions that combine annotation and knowledge panels. The referFootnote 5 application, partly described in [6], supports automatic annotation in combination with knowledge panels. The application is well integrated with additional browsing features and a complete graph panel that can be enabled at the top of each page. Other proprietary solutions include the Bing Knowledge WidgetFootnote 6 and Ontotext’s NowFootnote 7. Most of the proprietary solutions are highly customized and the annotation and knowledge panel parts are often strongly connected.
4 Summary
With ELES, we propose loose coupling between automatic entity linking and entity summarization systems via ITS 2.0. We exemplify the lightweight integration approach with the applications DBpedia Spotlight and the qSUM method of the SUMMA entity summarization interface.
Notes
- 1.
DBpedia – http://dbpedia.org.
- 2.
ITS 2.0 for DBpedia Spotlight – https://github.com/dbpedia-spotlight/demo/pull/5.
- 3.
summaClient – http://athalhammer.github.io/summaClient/.
- 4.
qLabel – http://googleknowledge.github.io/qlabel/.
- 5.
refer – http://refer.cx/.
- 6.
Bing Knowledge Widget – https://www.bing.com/widget/knowledge.
- 7.
Ontotext Now – http://now.ontotext.com/.
References
Daiber, J., Jakob, M., Hokamp, C., Mendes, P.N.: Improving efficiency and accuracy in multilingual entity extraction. In: Proceedings of the 9th International Conference on Semantic Systems (I-Semantics) (2013)
Damljanovic, D., Bontcheva, K.: Named entity disambiguation using linked data. In: Proceedings of the 9th Extended Semantic Web Conference (2012)
Filip, D., McCance, S., Lewis, D., Lieske, C., Lommel, A., Kosek, J., Sasaki, F., Savourel, Y.: Internationalization Tag Set (ITS) Version 2.0. W3C recommendation. W3C, October 2013. http://www.w3.org/TR/2013/REC-its20-20131029/
Thalhammer, A., Lasierra, N., Rettinger, A.: LinkSUM: using link analysis to summarize entity data. In: Proceedings of the 16th International Conference on Web Engineering (ICWE 2016) (2016, to appear)
Thalhammer, A., Stadtmüller, S.: SUMMA: a common API for linked data entity summaries. In: Cimiano, P., Frasincar, F., Houben, G.-J., Schwabe, D. (eds.) ICWE 2015. LNCS, vol. 9114, pp. 430–446. Springer, Heidelberg (2015)
Tietz, T., Waitelonis, J., Jäger, J., Sack, H., Navigator, S.M.: Visualizing recommendations based on linked data. In: Industry Track at the International Semantic Web Conference (ISWC 2014), vol. 1383. CEUR-WS (2015)
Vrandečić, D., Krötzsch, M.: Wikidata: a free collaborative knowledgebase. Commun. ACM 57, 78–85 (2014)
Zhang, L., Rettinger, A.: X-LiSA: cross-lingual semantic annotation. In: Proceedings of the VLDB Endowment (PVLDB), the 40th International Conference on Very Large Data Bases (VLDB), vol. 7, no. 13, pp. 1693–1696, September 2014
Acknowledgements
The research leading to these results has received funding from the European Union Seventh Framework Programme (FP7/2007–2013) under grant agreement no. 611346 and by the German Federal Ministry of Education and Research (BMBF) within the Software Campus project “SumOn” (grant no. 01IS12051).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Thalhammer, A., Rettinger, A. (2016). ELES: Combining Entity Linking and Entity Summarization. In: Bozzon, A., Cudre-Maroux, P., Pautasso, C. (eds) Web Engineering. ICWE 2016. Lecture Notes in Computer Science(), vol 9671. Springer, Cham. https://doi.org/10.1007/978-3-319-38791-8_45
Download citation
DOI: https://doi.org/10.1007/978-3-319-38791-8_45
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-38790-1
Online ISBN: 978-3-319-38791-8
eBook Packages: Computer ScienceComputer Science (R0)