Abstract
WC3 (Wikipedia Category Consistency Checker) is a system that supports the analysis of the metadata-annotation style in Wikipedia articles belonging to a particular Wikipedia category (the subcategory of “Categories by parameter”) by using the DBpedia metadata database. This system aims to construct an appropriate SPARQL query to represent the category and compares the retrieved results and articles that belong to the category. In this paper, we introduce WC3 and extend the algorithm to analyze efficiently additional varieties of Wikipedia category. We also discuss the metadata-annotation quality of the Wikipedia by using WC3. URL of WC3 is http://wnews.ist.hokudai.ac.jp/wc3/ and related files are available at http://wnews.ist.hokudai.ac.jp/wc3/KEKI.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
- 2.
The link for WC3 has moved from http://wnews.ist.hokudai.ac.jp/wc3/ to http://wnews.ist.hokudai.ac.jp/wc3/old.
- 3.
- 4.
- 5.
In this paper, we use the abbreviations “dbo”, “dbp”, “dbr” and “rdf”, for “http://dbpedia.org/ontology”, “http://dbpedia.org/property”, “http://dbpedia.org/resource” and “http://www.w3.org/1999/02/22-rdf-syntax-ns#type”.
- 6.
Metadata related to YAGO [4] are excluded, because it uses Wikipedia category information as a resource to extract the data.
- 7.
- 8.
Files related to the experiments are available at http://wnews.ist.hokudai.ac.jp/wc3/KEKI.
- 9.
- 10.
All English Wikipedia articles referred to in this section were accessed on March 7, 2017.
- 11.
Because the rendering time for displaying the results was less than a second, the time for SPARQL query generation is almost equivalent to the total response time.
- 12.
- 13.
- 14.
References
Bizer, C., Lehmann, J., Kobilarov, G., Auer, S., Becker, C., Cyganiak, R., Hellmann, S.: DBpedia - a crystallization point for the web of data. Web Seman. Sci. Serv. Agents World Wide Web 7, 154–165 (2009)
Bizer, C., Heath, T., Berners-Lee, T.: Linked data - the story so far. Int. J. Seman. Web Inf. Syst. 5, 1–22 (2009)
Mendes, P.N., Jakob, M., García-Silva, A., Bizer, C.: DBpedia spotlight: shedding light on the web of documents. In: Proceedings of the 7th International Conference on Semantic Systems. I-Semantics 2011, pp. 1–8. ACM, New York (2011)
Hoffart, J., Suchanek, F.M., Berberich, K., Weikum, G.: YAGO2: a spatially and temporally enhanced knowledge base from Wikipedia. Artif. Intell. 194, 28–61 (2013)
Yoshioka, M., Loban, R.: WC3: Wikipedia category consistency checker based on DBPedia. In: Proceedings of 11th International Conference on Signal-Image Technology & Internet-Based Systems, pp. 712–718 (2015)
Giles, J.: Internet encyclopaedias go head to head. Nature 438, 900–901 (2005)
Stvilia, B., Gasser, L., Twidale, M.B., Smith, L.C.: A framework for information quality assessment. J. Am. Soc. Inform. Sci. Technol. 58, 1720–1733 (2007)
Kittur, A., Kraut, R.E.: Harnessing the wisdom of crowds in Wikipedia: quality through coordination. In: Proceedings of the 2008 ACM Conference on Computer Supported Cooperative Work, CSCW 2008, pp. 37–46. ACM, New York (2008)
Hu, M., Lim, E.P., Sun, A., Lauw, H.W., Vuong, B.Q.: Measuring article quality in Wikipedia: models and evaluation. In: Proceedings of the Sixteenth ACM Conference on Information and Knowledge Management, CIKM 2007, pp. 243–252. ACM, New York (2007)
Mendes, P.N., Mühleisen, H., Bizer, C.: Sieve: linked data quality assessment and fusion. In: Proceedings of the 2012 Joint EDBT/ICDT Workshops, EDBT-ICDT 2012, pp. 116–123. ACM, New York (2012)
Yoshioka, M., Kando, N.: Issues for linking geographical open data of GeoNames and Wikipedia. In: Takeda, H., Qu, Y., Mizoguchi, R., Kitamura, Y. (eds.) JIST 2012. LNCS, vol. 7774, pp. 375–381. Springer, Heidelberg (2013). doi:10.1007/978-3-642-37996-3_32
Orlandi, F., Passant, A.: Modelling provenance of DBpedia resources using Wikipedia contributions. Web Seman. Sci. Serv. Agents World Wide Web 9, 149–164 (2011). Provenance in the Semantic Web
Mihalcea, R., Csomai, A.: Wikify!: linking documents to encyclopedic knowledge. In: Proceedings of the Sixteenth ACM Conference on Information and Knowledge Management, CIKM 2007, pp. 233–242. ACM, New York (2007)
Xu, M., Wang, Z., Bie, R., Li, J., Zheng, C., Ke, W., Zhou, M.: Discovering missing semantic relations between entities in Wikipedia. In: Alani, H., et al. (eds.) ISWC 2013. LNCS, vol. 8218, pp. 673–686. Springer, Heidelberg (2013). doi:10.1007/978-3-642-41335-3_42
Tang, L.X., Kang, I.S., Kimura, F., Lee, Y.H., Trotman, A., Geva, S., Xu, Y.: Overview of the NTCIR-10 cross-lingual link discovery task. In: Proceedings of the 10th NTCIR Workshop Meeting on Evaluation of Information Access Technologies: Information Retrieval, Quesiton Answering, and Cross-Lingual Information Access, pp. 8–38 (2013)
Torres, D., Molli, P., Skaf-Molli, H., Diaz, A.: Improving Wikipedia with DBpedia. In: Proceedings of the 21st International Conference on World Wide Web. WWW 2012 Companion, pp. 1107–1112. ACM, New York (2012)
Acknowledgment
This work was partially supported by JSPS KAKENHI Grant Number 25280035 and 16H01756.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Yoshioka, M. (2017). WC3: Analyzing the Style of Metadata Annotation Among Wikipedia Articles by Using Wikipedia Category and the DBpedia Metadata Database. In: van Erp, M., et al. Knowledge Graphs and Language Technology. ISWC 2016. Lecture Notes in Computer Science(), vol 10579. Springer, Cham. https://doi.org/10.1007/978-3-319-68723-0_10
Download citation
DOI: https://doi.org/10.1007/978-3-319-68723-0_10
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-68722-3
Online ISBN: 978-3-319-68723-0
eBook Packages: Computer ScienceComputer Science (R0)