skip to main content
10.1145/1458527.1458532acmconferencesArticle/Chapter ViewAbstractPublication PageswwwConference Proceedingsconference-collections
research-article

Detecting reviewer bias through web-based association mining

Published:30 October 2008Publication History

ABSTRACT

Online retailers and content distributors benefit from an active community that shares credible reviews and recommendations. Today, the most popular approach to encouraging credibility in these communities is self-regulation; community members rate reviews according to their accuracy and usefulness, thus helping to weed out reviews that are inaccurate. This self-regulation, while powerful, is limited by its insularity. Community members generally base their assessments on a reviewer's comments and actions only within the community. This ignores relationships the reviewer has outside the community that may be quite relevant to evaluating the reviewer's comments; for example, a relationship between an author and reviewer. We present a simple method for mining the Web to detect many such associations. Our method, together with self-regulation, provides for more comprehensive detection of bias in reviews by alerting the user to the potential for an undisclosed relationship between a reviewer and author. We provide preliminary results using book reviews in Amazon.com demonstrating that our approach is a high-precision method for detecting strong relationships between reviewers and authors that may contribute to reviewer bias.

References

  1. B. Adler and L. de Alfaro. A content-driven reputation system for the wikipedia. WWW 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. R. Agrawal and R. Srikant. Fast algorithms for mining association rules. Proceedings of the 20th VLDB Conference, Santiago, Chile, 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. R. Agrawal, H. Mannila, R. Srikant, H. Toivonen and A. Verkamo. Fast discovery of association rules, Advances in Knowledge Discovery and Data Mining, AAAI/MIT Press, Cambridge, MA. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. D. Bollegala, Y. Matsuo and M. Ishizuka. Measuring semanticsimilarity between words using Web search engines. WWW 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. J. Chevalier and D. Mayzlin. The Effect of Word of Mouth on Sales: Online Book Reviews. August 6, 2003. Yale SOM Working Paper No's. ES-28 & MK-15.Google ScholarGoogle Scholar
  6. R. Chow, P. Golle and J. Staddon. Detecting privacy leaks using corpus-based association rules. In ACM SIGKDD 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. P. Cimiano and S. Staab. Learning by Googling, ACM SIGKDD Explorations Newsletter, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. comScore and The Kelsey Group. November 29,2007.Google ScholarGoogle Scholar
  9. M. Dowman, V. Tablan, H. Cunningham and B. Popov. Web-Assisted Annotation, Semantic Indexing and Search of Television and Radio News. WWW 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. eModeration. http://www.emoderation.comGoogle ScholarGoogle Scholar
  11. N. Glance. Community search assistant. The 6th International Conference on Intelligent User Interfaces, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. R. Guha, R. Kumar, P. Raghavan and A. Tomkins. Propagation of trust and distrust. WWW 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. H. Lauw, E-P.Lim and K. Wang. Bias and controversy:beyond statistical deviation. KDD 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. V. Griffith. WikiScanner. http://virgil.grGoogle ScholarGoogle Scholar
  15. A. Gome. Web two point uh-oh! "It's my shout" Blog on Smart Company. March, 2007.Google ScholarGoogle Scholar
  16. Hakia. http://www.hakia.com/Google ScholarGoogle Scholar
  17. V. Mickunas. Amazon.com book reviewer shakeout. Dayton Daily News. April 14, 2007.Google ScholarGoogle Scholar
  18. . Mowshowitz and A. Kawaguchi. Bias on the Web.Communications of the ACM (CACM), September 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. A. Nakov and M. Hearst. Solving relational similarityproblems using the Web as a corpus. Proceedings of ACL/HLT2008.Google ScholarGoogle Scholar
  20. PowerReviews. Social Shopping Study 2007. http://www.powerreviews.comGoogle ScholarGoogle Scholar
  21. D. Quercia, S. Hailes and L. Capra. Lightweight distributed trust propagation. ICDM2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. P. Resnick, R. Zeckhauser, E. Friedman, K. Kuwabara. Reputation Systems. Communications of the ACM, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. T. Riggs and R. Wilensky. An algorithm for automatedrating of reviewers. JCDL 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. G. Sandoval. Digg continues to battle phony stories. CNET.com, December 18, 2006.Google ScholarGoogle Scholar
  25. P. Turney. Expressing implicit semantic relationswithout supervision. Proceedings of ACL 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. R. Verkaik. Google sued over defamatory posts found on Web search. The Independent. June, 2007.Google ScholarGoogle Scholar
  27. Yahoo! Search BOSS. http://developer.yahoo.com/search/boss/Google ScholarGoogle Scholar

Index Terms

  1. Detecting reviewer bias through web-based association mining

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        WICOW '08: Proceedings of the 2nd ACM workshop on Information credibility on the web
        October 2008
        100 pages
        ISBN:9781605582597
        DOI:10.1145/1458527

        Copyright © 2008 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 30 October 2008

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

        Acceptance Rates

        Overall Acceptance Rate9of19submissions,47%

        Upcoming Conference

        WWW '24
        The ACM Web Conference 2024
        May 13 - 17, 2024
        Singapore , Singapore

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader