Good Quality Complementary Information for Multilingual Wikipedia

Suzuki, Yu; Fujiwara, Yuya; Konishi, Yukio; Nadamoto, Akiyo

doi:10.1007/978-3-642-35063-4_14

Yu Suzuki²¹,
Yuya Fujiwara²⁰,
Yukio Konishi²⁰ &
…
Akiyo Nadamoto²⁰

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 7651))

Included in the following conference series:

International Conference on Web Information Systems Engineering

2534 Accesses
1 Citations

Abstract

Many Wikipedia articles lack information, because not all users submit truly complete information to Wikipedia. However, Wikipedia has many language versions that have been developed independently. Therefore, if we supply these complementary information from many language versions, the users must satisfy the amount of information of Wikipedia articles with the complementary information, instead of only one language version of Wikipedia articles. In this study, we specifically examine multilingual Wikipedia and propose a method of extracting good quality complementary information from Wikipedia of other languages. Specifically, we compare Wikipedia articles with less information to those with more information. From Wikipedia articles, which can have the same theme and different languages, we extract different information as complementary information. As described herein, we extract comparison target articles of Wikipedia based on a link graph, because cases exist in which information included in an articles is written in multiple pages of different languages. Furthermore, some low-quality information is extracted as complementary information because Wikipedia articles are written by not only good editors but also bad editors such as vandals. We propose a method to calculate the quality of information based on the editors, and we extract good quality complementary information.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Enrichment of Information in Multilingual Wikipedia Based on Quality Analysis

Measures for Quality Assessment of Articles and Infoboxes in Multilingual Wikipedia

Analysis of References Across Wikipedia Languages

References

Adar, E., Skinner, M., Weld, D.S.: Information arbitrage across multi-lingual wikipedia. In: Proceedings of the Second ACM International Conference on Web Search and Data Mining, WSDM 2009, pp. 94–103. ACM Press, New York (2009)
Google Scholar
Adler, B., de Alfaro, L.: A content-driven reputation system for the Wikipedia. In: Proceedings of the 16th International Conference on World Wide Web (WWW 2007), pp. 261–270 (2007)
Google Scholar
Chen, Z., Liu, S., Wenyin, L., Pu, G., Ma, W.Y.: Building a web thesaurus from web link structure. In: Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Informaion Retrieval, pp. 48–55 (2003)
Google Scholar
Eklou, D., Asano, Y., Yoshikawa, M.: How the web can help wikipedia: a study on information complementation of wikipedia by the web. In: Proceedings of the 6th International Conference on Ubiquitous Information Management and Communication, ICUIMC 2012, pp. 9:1–9:10. ACM, New York (2012), http://doi.acm.org/10.1145/2184751.2184763
Fujiwara, Y., Suzuki, Y., Konishi, Y., Nadamoto, A.: Extracting Difference Information from Multilingual Wikipedia. In: Sheng, Q.Z., Wang, G., Jensen, C.S., Xu, G. (eds.) APWeb 2012. LNCS, vol. 7235, pp. 496–503. Springer, Heidelberg (2012)
Chapter Google Scholar
Kamps, J., Koolen, M.: Is wikipedia link structure different? In: Proceedings of the Second ACM International Conference on Web Search and Data Mining, pp. 232–241 (2009)
Google Scholar
Ma, Q., Nadamoto, A., Tanaka, K.: Complementary information retrieval for cross-media news content. Inf. Syst. 31(7), 659–678 (2006), http://dx.doi.org/10.1016/j.is.2005.12.004
Article Google Scholar
Milne, D.: Computing semantic relatedness using wikipedia link structure. In: Proc. of New Zealand Computer Science Research Student Conference, NZCSRSC 2007, CDROM (2007)
Google Scholar
Milne, D., Medelyan, O., Witten, I.H.: Mining Domain-Specific thesauri from wikipedia: A case study. In: WI 2006: Proceedings of the 2006 IEEE/WIC/ACM International Conference on Web Intelligence, pp. 442–448 (2006)
Google Scholar
Nakatani, M., Jatowt, A., Tanaka, K.: Adaptive ranking of search results by considering user’s comprehension. In: Proceedings of the 4th International Conference on Ubiquitous Information Management and Communication, ICUIMC 2010, CDROM (2010)
Google Scholar
Nakayama, K., Hara, T., Nishio, S.: Wikipedia Mining for an Association Web Thesaurus Construction. In: Benatallah, B., Casati, F., Georgakopoulos, D., Bartolini, C., Sadiq, W., Godart, C. (eds.) WISE 2007. LNCS, vol. 4831, pp. 322–334. Springer, Heidelberg (2007)
Chapter Google Scholar
Strube, M., Ponzetto, S.P.: WikiRelate! computing semantic relatedness using wikipedia. In: Proceedings of the 21st International conference on Artificial intelligence (AAAI 2006), pp. 1419–1424 (2006)
Google Scholar
Stvilia, B., Gasser, L., Twidale, M.B., Smith, L.C.: A framework for information quality assessment. Journal of the American Society for Information Science and Technology 58(12), 1720–1733 (2007)
Article Google Scholar
Stvilia, B., Twidale, M.B., Smith, L.C., Gasser, L.: Information quality work organization in wikipedia. Journal of the American Society for Information Science and Technology 59(6), 983–1001 (2008)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Konan University, 8-9-1 Okamoto, Higashi-Nada, Kobe, Hyogo, 6588501, Japan
Yuya Fujiwara, Yukio Konishi & Akiyo Nadamoto
Nagoya University, Furo, Chikusa, Nagoya, Aichi, 4648601, Japan
Yu Suzuki

Authors

Yu Suzuki
View author publications
You can also search for this author in PubMed Google Scholar
Yuya Fujiwara
View author publications
You can also search for this author in PubMed Google Scholar
Yukio Konishi
View author publications
You can also search for this author in PubMed Google Scholar
Akiyo Nadamoto
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Computer Science, Fudan University, 825 Zhangheng Rd., Shanghai, 201203, China
X. Sean Wang
Department of Computer Science, College of Engineering, Science and Engineering Offices, The University of Illinois at Chicago, 851 South Morgan Street (M/C 152), 60607-7053, Chicago, Illinois, USA
Isabel Cruz
Department of Informatics and Telecommunications, University of Athens, GR15784, Ilisia, Athens, Greece
Alex Delis
Centre for Applied Informatics, Victoria University, PO Box 14428, 8001, Melbourne, VIC, Australia
Guangyan Huang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Suzuki, Y., Fujiwara, Y., Konishi, Y., Nadamoto, A. (2012). Good Quality Complementary Information for Multilingual Wikipedia. In: Wang, X.S., Cruz, I., Delis, A., Huang, G. (eds) Web Information Systems Engineering - WISE 2012. WISE 2012. Lecture Notes in Computer Science, vol 7651. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35063-4_14

Download citation

DOI: https://doi.org/10.1007/978-3-642-35063-4_14
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-35062-7
Online ISBN: 978-3-642-35063-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Good Quality Complementary Information for Multilingual Wikipedia

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Enrichment of Information in Multilingual Wikipedia Based on Quality Analysis

Measures for Quality Assessment of Articles and Infoboxes in Multilingual Wikipedia

Analysis of References Across Wikipedia Languages

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Good Quality Complementary Information for Multilingual Wikipedia

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Enrichment of Information in Multilingual Wikipedia Based on Quality Analysis

Measures for Quality Assessment of Articles and Infoboxes in Multilingual Wikipedia

Analysis of References Across Wikipedia Languages

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation