skip to main content
10.1145/2160881.2160890acmconferencesArticle/Chapter ViewAbstractPublication PagesicicConference Proceedingsconference-collections
research-article

Supporting collaboration in Wikipedia between language communities

Authors Info & Claims
Published:21 March 2012Publication History

ABSTRACT

This paper describes an application of machine translation technology for supporting collaboration in Wikipedia. Wikipedia hosts separate language Wikipedias for hundreds of different languages. While some content is specific to these different versions of Wikipedia, some topics have pages within multiple different Wikipedias. Similarly, while some users participate only in one Wikipedia, we find users who play a bridging role between these sub-communities and participate in the process of maintaining similar pages in different Wikipedias. Since these are not the majority of users, a support tool that allows stretching the effort of these specialized users further by indicating where their effort is needed could be a tremendous benefit to the community. An evaluation of the proposed approach demonstrates promise that such a tool could substantially reduce the effort involved in playing this bridging role on Wikipedia.

References

  1. Christof Müller and Iryna Gurevych. 2009 Using Wikipedia and Wiktionary in Domain-Specific Information Retrieval Evaluating Systems for Multilingual and Multimodal Information Access, Springer Berlin /Heidelberg, pp. 219--226. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Steinberger, Ralf and Pouliquen, Bruno and Hagman, Johan 2002. Cross-Lingual Document Similarity Calculation Using the Multilingual Thesaurus EUROVOC EProceedings of the Third International Conference on Computational Linguistics and Intelligent Text Processing, pp. 415--424. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Aminul Islam and Diana Inkpen. 2008, Jul. Semantic Text Similarity Using Corpus-Based Word Similarity and String Similarity ACM Transaction on Knowledge Discovery from Data, Vol. 2, No. 2, Article 10. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Wikipedia Infoboxes Help. (2010, Dec.) {Online}. Available: http://en.wikipedia.org/wiki/Help:InfoboxGoogle ScholarGoogle Scholar
  5. Wikipedia Infoboxes Categories. (2010, Dec.){Online}. Available http://en.wikipedia.org/wiki/Category:InfoboxtemplatesGoogle ScholarGoogle Scholar
  6. MediaWiki API Documentation. (2010, Dec.) {Online}. Available: http://www.mediawiki.org/wiki/APIoxGoogle ScholarGoogle Scholar
  7. GoogleTranslate API, developer's guide (v2): Using REST. (2010, Dec.) {Online}. Available: http://code.google.com/apis/language/translate/v2/usingrest.htmlGoogle ScholarGoogle Scholar
  8. Libcurl - C API documentation. (2010, Dec.) {Online}. Available: http://curl.haxx.se/libcurl/c/Google ScholarGoogle Scholar
  9. PHP similar text function documentation (2010, Dec.) {Online}. Available: http://php.net/manual/en/function.similar-text.phpGoogle ScholarGoogle Scholar
  10. Jonathan J. Oliver. 2008, Jul. Decision Graphs - An Extension of Decision Trees. Available: http://www.cs.monash.edu.au/jono/TechReports/TR173.dgraph.psGoogle ScholarGoogle Scholar
  11. Metzler, Donald and Dumais, Susan and Meek, Christopher 2007. Similarity Measures for Short Segments of Text Advances in Information Retrieval Vol. 4425, Springer Berlin / Heidelberg, pp. 16--27. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. C. Fellbaum. 1998. WordNet: An Electronical Lexical Database. The MIT Press, Cambridge, MA.Google ScholarGoogle Scholar
  13. PHP metaphone code generation function by Lawrence Philips. (2010, Dec.) {Online}. Available: http://php.net/manual/en/function.metaphone.phpGoogle ScholarGoogle Scholar
  14. Binstock & Rex. 1995. Practical Algorithms for Programmers Addison Wesley. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Parts Of Speech Tagging, PHP/ir, Information Retrieval and other interesting topics. (2010, Dec.) {Online}. Available: http://phpir.com/part-of-speechtaggingGoogle ScholarGoogle Scholar
  16. Adar, Skinner and Weld 2009, Information Arbitrage Across Multi-lingual Wikipedia WSDM'09, Barcelona, Spain. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Ulrike Pfeil, Panayiotis Zaphiris, Chee Siang Ang 2006, Cultural Differences in Collaborative Authoring of Wikipedia.Google ScholarGoogle Scholar
  18. B. Latane, K. Williams, and S. Harkins. Many hands make light the work: The causes and consequences of social loafing. J. Pers. Soc. Psych., 37:822--832, 1979.Google ScholarGoogle Scholar
  19. D. Cosley, D. Frankowski, L. Terveen... - 2007, SuggestBot: Using Intelligent Task Routing to Help People Find Work in Wikipedia.Google ScholarGoogle Scholar
  20. S. L. Bryant, A Forte... - 2005, Becoming Wikipedian: Transformation of Participation in a Collaborative Online Encyclopedia.Google ScholarGoogle Scholar
  21. Slattery, S. P. (2009). "Edit this page": the socio-technological infrastructure of a Wikipedia article. In Proc. of the 27th ACM international conference on Design of communication (pp. 289--296). Bloomington, Indiana, USA: ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Liu, Y., Liu, Q., & Lin, S. (2006). Tree-to-string alignment template for statistical machine translation, Proceedings of the 44th Annual Meeting of the Association for Computational Linguistics. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Gildea, D. (2003). Loosely tree-based alignment for machine translation, Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Och, F. & Ney, H. (2000). Improved statistical alignment models, Proceedings of the 28th Annual Meeting of the Association for Computational Linguistics. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Mohler, M. & and Mihalcea, R. (2009). Text-to-text Semantic Similarity for Automatic Short Answer Grading, in Proceedings of the European Chapter of the Association for Computational Linguistics (EACL 2009), Athens, Greece. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Gbrilovich, E. & Markovitch, S. (2009). Wikipedia-based semantic interpretation for natural language processing, Journal of Artificial Intelligence Research 34(1). Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Metzler, D., Dumais, S., & Meek, C. (2007). Similarity Measures for Short Segments of Text, Advances in Information Retrieval, Volume 4425, pp 16--27. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Supporting collaboration in Wikipedia between language communities

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        ICIC '12: Proceedings of the 4th international conference on Intercultural Collaboration
        March 2012
        170 pages
        ISBN:9781450308182
        DOI:10.1145/2160881

        Copyright © 2012 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 21 March 2012

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

        Acceptance Rates

        Overall Acceptance Rate47of77submissions,61%
      • Article Metrics

        • Downloads (Last 12 months)0
        • Downloads (Last 6 weeks)0

        Other Metrics

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader