Quality and Importance of Wikipedia Articles in Different Languages

Conference paper in: Information and Software Technologies (ICIST 2016)

Abstract

This article analyses the importance of Wikipedia articles in four language versions (English, French, Russian, and Polish) and how importance affects article quality. Drawing on a literature review and our own experience, we collected article-level measures that capture various aspects of quality and that will be used to build models of article importance. For each language version, we select the influential parameters that may allow automatic assessment of an article’s validity. Links between articles in different languages make it possible to compare and verify the quality of information provided by the various Wikipedia communities. The model can therefore be used not only for a relative assessment of the content of a whole article, but also for a relative assessment of the quality of data contained in its structural parts, the so-called infoboxes.

Corresponding author

Correspondence to Włodzimierz Lewoniewski.

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Lewoniewski, W., Węcel, K., Abramowicz, W. (2016). Quality and Importance of Wikipedia Articles in Different Languages. In: Dregvaite, G., Damasevicius, R. (eds) Information and Software Technologies. ICIST 2016. Communications in Computer and Information Science, vol 639. Springer, Cham. https://doi.org/10.1007/978-3-319-46254-7_50

  • DOI: https://doi.org/10.1007/978-3-319-46254-7_50

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-46253-0

  • Online ISBN: 978-3-319-46254-7

  • eBook Packages: Computer Science (R0)
