Abstract
In this paper, we propose a method to identify high-quality Wikipedia articles by mutually evaluating editors and text using implicit positive and negative ratings. One of major approaches for assessing Wikipedia articles is a text survival ratio based approach. However, the problem of this approach is that many low quality articles are misjudged as high quality, because of two issues. This is because, every editor does not always read the whole articles. Therefore, if there is a low quality text at the bottom of a long article, and the text have not seen by the other editors, then the text survives beyond many edits, and the survival ratio of the text is high. To solve this problem, we use a section or a paragraph as a unit of remaining instead of a whole page. This means that if an editor edits an article, the system treats that the editor gives positive ratings to the section or the paragraph that the editor edits. This is because, we believe that if editors edit articles, the editors may not read the whole page, but the editors should read the whole sections or paragraphs, and delete low-quality texts. From experimental evaluation, we confirmed that the proposed method could improve the accuracy of quality values for articles.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Adler, B.T., Chatterjee, K., de Alfaro, L., Faella, M., Pye, I., Raman, V.: Assigning Trust to Wikipedia Content. In: Proceedings of the International Symposium on Wikis (WikiSym 2008). ACM (2008)
Adler, B., de Alfaro, L.: A content-driven reputation system for the Wikipedia. In: Proceedings of the 16th International Conference on World Wide Web (WWW 2007), pp. 261–270 (2007)
Adler, B., Chatterjee, K., de Alfaro, L., Faella, M., Pye, I., Raman, V.: Measuting Author Contributions to the Wikipedia. In: Proceedings of the 2008 International Symposium on Wikis (WikiSym 2008) (2008)
Baeza-Yates, R., Ribeiro-Neto, B.: Modern Information Retrieval: the concepts and technology behind search. Addison-Wesley (2011)
Hu, M., Lim, E., Sun, A., Lauw, H.W., Vuong, B.: Measuring Article Quality in Wikipedia: Models and Evaluation. In: Proceedings of ACM International Conference on Information and Knowledge Management (CIKM 2007), pp. 243–252 (2007)
Kittur, A., Kraut, R.E.: Harnessing the wisdom of crowds in wikipedia: quality through coordination. In: Proceedings of the 2008 ACM Conference on Computer Supported Cooperative Work, CSCW 2008, pp. 37–46. ACM, New York (2008)
Sabel, M.: Structuring Wiki revision history. In: Proceedings of the 2007 International Symposium on Wikis (WikiSym 2007), pp. 125–130. ACM (2007)
Suzuki, Y., Yoshikawa, M.: Qualityrank: Assessing quality of wikipedia articles by mutually evaluating editors and text. In: Proceedings of the 23rd ACM Conference on Hypertext and Social Media (HT 2012). ACM Press (to appear, 2012)
Toms, E.G., Mackenzie, T., Jordan, C., Hall, S.: wikiSearch: enabling interactivity in search. In: Proceedings of the 32nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2009), p. 843 (2009)
Wilkinson, D.M., Huberman, B.A.: Cooperation and quality in wikipedia. In: Proceedings of the 2007 International Symposium on Wikis (WikiSym 2007), pp. 157–164. ACM (2007)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Suzuki, Y. (2012). Assessing Quality Values of Wikipedia Articles Using Implicit Positive and Negative Ratings. In: Gao, H., Lim, L., Wang, W., Li, C., Chen, L. (eds) Web-Age Information Management. WAIM 2012. Lecture Notes in Computer Science, vol 7418. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-32281-5_13
Download citation
DOI: https://doi.org/10.1007/978-3-642-32281-5_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-32280-8
Online ISBN: 978-3-642-32281-5
eBook Packages: Computer ScienceComputer Science (R0)