Evaluating Article Quality and Editor Reputation in Wikipedia

  • Conference paper
Linked Data and Knowledge Graph (CSWS 2013)

Part of the book series: Communications in Computer and Information Science (CCIS, volume 406)

Abstract

We study the problem of jointly evaluating article quality and editor reputation in Wikipedia. The central question is how to generate reasonable article quality scores and editor reputations within a single framework. We propose a dual wing factor graph (DWFG) model, which exploits the mutual reinforcement between articles and editors to estimate both quantities. To learn the model, we design an efficient algorithm. Experiments validate the model's effectiveness: by leveraging belief propagation between articles and editors, our approach obtains a significant improvement over several alternative methods (SVM, LR, PR, CRF).
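
The mutual reinforcement idea mentioned in the abstract can be illustrated with a small sketch. This is not the DWFG model or its learning algorithm, which the abstract does not specify; it is a simple HITS-style iteration, swapped in here only to show how article scores and editor scores can update each other. The function name `mutual_reinforcement`, the `(editor, article, weight)` input format, and the normalization step are all assumptions made for illustration.

```python
def mutual_reinforcement(edits, iterations=50):
    """Illustrative HITS-style iteration (an assumption, not the paper's DWFG model).

    edits: list of (editor, article, weight) tuples, where weight is a
    hypothetical non-negative contribution size.
    Returns (quality, reputation) dicts, each normalized to sum to 1.
    """
    articles = {a for _, a, _ in edits}
    editors = {e for e, _, _ in edits}
    quality = {a: 1.0 for a in articles}
    reputation = {e: 1.0 for e in editors}

    for _ in range(iterations):
        # An article's score is the weighted sum of its editors' reputations.
        new_quality = {a: 0.0 for a in articles}
        for e, a, w in edits:
            new_quality[a] += w * reputation[e]
        norm = sum(new_quality.values()) or 1.0
        quality = {a: v / norm for a, v in new_quality.items()}

        # An editor's score is the weighted sum of the scores of edited articles.
        new_reputation = {e: 0.0 for e in editors}
        for e, a, w in edits:
            new_reputation[e] += w * quality[a]
        norm = sum(new_reputation.values()) or 1.0
        reputation = {e: v / norm for e, v in new_reputation.items()}

    return quality, reputation


# Toy input: alice edits articles A and B, bob edits only B.
edits = [("alice", "A", 1.0), ("alice", "B", 1.0), ("bob", "B", 1.0)]
quality, reputation = mutual_reinforcement(edits)
```

In this toy input, article B is touched by both editors and alice contributes to both articles, so B ends up with a higher score than A and alice with a higher score than bob. A factor graph model such as the one the abstract describes would instead define a joint probability over article and editor labels and run belief propagation on it.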

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Lu, Y., Zhang, L., Li, J. (2013). Evaluating Article Quality and Editor Reputation in Wikipedia. In: Qi, G., Tang, J., Du, J., Pan, J.Z., Yu, Y. (eds) Linked Data and Knowledge Graph. CSWS 2013. Communications in Computer and Information Science, vol 406. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-54025-7_19

  • DOI: https://doi.org/10.1007/978-3-642-54025-7_19

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-54024-0

  • Online ISBN: 978-3-642-54025-7

  • eBook Packages: Computer Science (R0)