DOI: 10.1145/3477495.3531945
Research Article | Public Access

Bias Mitigation for Toxicity Detection via Sequential Decisions

Published: 07 July 2022

Abstract

Increased social media use has contributed to the greater prevalence of abusive, rude, and offensive textual comments. Machine learning models have been developed to detect toxic comments online, yet these models tend to show biases against users with marginalized or minority identities (e.g., females and African Americans). Established research in debiasing toxicity classifiers often (1) takes a static or batch approach, assuming that all information is available and then making a one-time decision; and (2) uses a generic strategy to mitigate different biases (e.g., gender and racial biases) that assumes the biases are independent of one another. However, in real scenarios, the input typically arrives as a sequence of comments/words over time instead of all at once. Thus, decisions based on partial information must be made while additional input is arriving. Moreover, social bias is complex by nature. Each type of bias is defined within its unique context, which, consistent with intersectionality theory within the social sciences, might be correlated with the contexts of other forms of bias. In this work, we consider debiasing toxicity detection as a sequential decision-making process where different biases can be interdependent. In particular, we study debiasing toxicity detection with two aims: (1) to examine whether different biases tend to correlate with each other; and (2) to investigate how to jointly mitigate these correlated biases in an interactive manner to minimize the total amount of bias. At the core of our approach is a framework built upon theories of sequential Markov Decision Processes that seeks to maximize the prediction accuracy and minimize the bias measures tailored to individual biases. Evaluations on two benchmark datasets empirically validate the hypothesis that biases tend to be correlated and corroborate the effectiveness of the proposed sequential debiasing strategy.
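
The framework described in the abstract reduces to two ideas: decisions are made on partial input as words arrive, and the reward trades prediction accuracy off against penalties for each (possibly correlated) bias. The Python sketch below illustrates that framing only, under stated assumptions; it is not the authors' method. Every name (toxicity_score, bias_penalty, sequential_decision), the toxic-word and identity-term lists, and the weights in lambdas are hypothetical stand-ins, and a fixed confidence threshold substitutes for the MDP policy the paper learns.

```python
# Illustrative sketch of sequential, bias-penalized toxicity decisions.
# All word lists, weights, and function names are hypothetical stand-ins.

BIAS_DIMENSIONS = ("gender", "race")  # the two bias types studied in the paper

def toxicity_score(tokens):
    """Stand-in scorer; a real system would use a trained classifier."""
    toxic_words = {"idiot", "stupid", "hate"}
    return sum(t in toxic_words for t in tokens) / max(len(tokens), 1)

def bias_penalty(tokens, dimension):
    """Stand-in penalty: fraction of identity terms whose mere presence
    should not drive a toxic prediction (the unintended bias)."""
    identity_terms = {
        "gender": {"she", "her", "woman", "female"},
        "race": {"black", "african"},
    }
    return sum(t in identity_terms[dimension] for t in tokens) / max(len(tokens), 1)

def reward(tokens, action, label, lambdas):
    """Accuracy term minus weighted penalties for each bias dimension.
    Correlated biases are mitigated jointly because they share one reward."""
    accuracy = 1.0 if action == label else -1.0
    penalty = sum(lambdas[d] * bias_penalty(tokens, d) for d in BIAS_DIMENSIONS)
    return accuracy - penalty

def sequential_decision(comment, label, threshold=0.25, lambdas=None):
    """Read a comment word by word and classify once evidence suffices,
    rather than waiting for the full text as a batch model would."""
    lambdas = lambdas or {d: 0.5 for d in BIAS_DIMENSIONS}
    seen = []
    for word in comment.lower().split():
        seen.append(word)
        if toxicity_score(seen) >= threshold:  # act on partial input
            return "toxic", reward(seen, "toxic", label, lambdas)
    return "non_toxic", reward(seen, "non_toxic", label, lambdas)

if __name__ == "__main__":
    action, r = sequential_decision("She is an idiot", label="toxic")
    print(action, round(r, 2))  # toxic 0.88: correct prediction, minus gender penalty
```

In the full approach, the threshold heuristic would be replaced by a policy trained to maximize this kind of reward over comment sequences, which is what allows correlated bias penalties to be mitigated jointly rather than independently.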

Supplementary Material

MP4 File (SIGIR22-fp0979.mp4)
This video is a brief presentation of the paper. Please refer to the paper for more details.


Cited By

  • (2024) A Comprehensive Approach to Bias Mitigation for Sentiment Analysis of Social Media Data. Applied Sciences 14(23), 11471. DOI: 10.3390/app142311471. Online publication date: 9-Dec-2024.
  • (2024) SMART-TBI: Design and Evaluation of the Social Media Accessibility and Rehabilitation Toolkit for Users with Traumatic Brain Injury. Proceedings of the 26th International ACM SIGACCESS Conference on Computers and Accessibility, 1-19. DOI: 10.1145/3663548.3675641. Online publication date: 27-Oct-2024.
  • (2023) Anatomy of Hate Speech Datasets. Proceedings of the 34th ACM Conference on Hypertext and Social Media, 1-11. DOI: 10.1145/3603163.3609158. Online publication date: 4-Sep-2023.


Published In

SIGIR '22: Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval
July 2022
3569 pages
ISBN:9781450387323
DOI:10.1145/3477495


Publisher

Association for Computing Machinery, New York, NY, United States


Author Tags

  1. sequential decision-making
  2. social media
  3. toxicity detection
  4. unintended bias


Acceptance Rates

Overall Acceptance Rate 792 of 3,983 submissions, 20%

Article Metrics

  • Downloads (Last 12 months): 195
  • Downloads (Last 6 weeks): 26
Reflects downloads up to 28 Feb 2025

