research-article

Fair pattern discovery

Authors:

Dino Pedreschi,

Josep Domingo-Ferrer,

Fosca Giannotti

SAC '14: Proceedings of the 29th Annual ACM Symposium on Applied Computing

Pages 113 - 120

https://doi.org/10.1145/2554850.2555043

Published: 24 March 2014

Abstract

Data mining is gaining societal momentum due to the ever-increasing availability of large amounts of human data, easily collected by a variety of sensing technologies. We are witnessing unprecedented opportunities to understand human and societal behavior, which are unfortunately overshadowed by several risks to human rights; one of these is unfair discrimination based on the extracted patterns and profiles. Consider the case where a set of patterns extracted from the personal data of a population of individuals is released for subsequent use in a decision-making process, such as granting or denying credit. Decision rules based on such patterns may lead to unfair discrimination, depending on what is represented in the training cases. In this context, we address the discrimination risks resulting from publishing frequent patterns. We present a set of pattern sanitization methods, one for each discrimination measure used in the legal literature, for fair (discrimination-protected) publishing of frequent pattern mining results. Our proposed pattern sanitization methods yield discrimination-protected patterns while introducing reasonable (controlled) pattern distortion. Finally, the effectiveness of our proposals is assessed through extensive experiments.
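To make the idea of a discrimination measure over frequent patterns concrete, the following minimal sketch computes extended lift (elift), one measure from the discrimination-aware data mining literature, on a toy transaction set. The abstract does not say which measures the paper covers, so the choice of elift, the item names, the threshold `alpha`, and all function names here are illustrative assumptions rather than the authors' method.

```python
from typing import FrozenSet, List

Transaction = FrozenSet[str]


def support(db: List[Transaction], itemset: FrozenSet[str]) -> int:
    """Count the transactions that contain every item in `itemset`."""
    return sum(1 for t in db if itemset <= t)


def confidence(db: List[Transaction],
               premise: FrozenSet[str],
               conclusion: FrozenSet[str]) -> float:
    """Confidence of the rule premise -> conclusion, from pattern supports."""
    premise_support = support(db, premise)
    if premise_support == 0:
        raise ValueError("premise never occurs in the data")
    return support(db, premise | conclusion) / premise_support


def elift(db: List[Transaction],
          protected: FrozenSet[str],
          context: FrozenSet[str],
          decision: FrozenSet[str]) -> float:
    """Extended lift: conf(protected, context -> decision) / conf(context -> decision)."""
    base = confidence(db, context, decision)
    if base == 0:
        raise ValueError("decision never occurs in this context")
    return confidence(db, protected | context, decision) / base


def is_alpha_protective(db: List[Transaction],
                        protected: FrozenSet[str],
                        context: FrozenSet[str],
                        decision: FrozenSet[str],
                        alpha: float) -> bool:
    """A rule is treated as alpha-protective when its elift stays below alpha."""
    return elift(db, protected, context, decision) < alpha


def _rows(n: int, *items: str) -> List[Transaction]:
    """Helper that builds n identical toy transactions."""
    return [frozenset(items)] * n


# Hypothetical toy data: 10 credit applications from one city.
DB: List[Transaction] = (
    _rows(3, "sex=female", "city=NYC", "credit=deny")
    + _rows(1, "sex=female", "city=NYC", "credit=grant")
    + _rows(2, "sex=male", "city=NYC", "credit=deny")
    + _rows(4, "sex=male", "city=NYC", "credit=grant")
)

PROTECTED = frozenset({"sex=female"})
CONTEXT = frozenset({"city=NYC"})
DECISION = frozenset({"credit=deny"})

# conf(female, NYC -> deny) = 3/4 and conf(NYC -> deny) = 5/10, so elift = 1.5
print(elift(DB, PROTECTED, CONTEXT, DECISION))
print(is_alpha_protective(DB, PROTECTED, CONTEXT, DECISION, alpha=1.2))
```

In this toy data, adding the protected item to the context raises the denial rate from 50% to 75%, so the rule would be flagged under a hypothetical threshold of 1.2. A sanitization method in the sense of the abstract would then have to alter the released pattern supports so that such rules fall below the chosen threshold.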


Cited By

Corrales-Barquero R., Marin-Raventos G., Barrantes E. (2021). A Review of Gender Bias Mitigation in Credit Scoring Models. In 2021 Ethics and Explainability for Responsible Data Science (EE-RDS), pp. 1--10. Online publication date: 27 October 2021. https://doi.org/10.1109/EE-RDS53766.2021.9708589

Cardoso R. L., Meira Jr. W., Almeida V., Zaki M. J. (2019). A Framework for Benchmarking Discrimination-Aware Models in Machine Learning. In Proceedings of the 2019 AAAI/ACM Conference on AI, Ethics, and Society (eds. Conitzer V., Hadfield G., Vallor S.), pp. 437--444. Online publication date: 27 January 2019. https://dl.acm.org/doi/10.1145/3306618.3314262

Index Terms

Fair pattern discovery
1. Information systems
  1. Information systems applications

Published In

SAC '14: Proceedings of the 29th Annual ACM Symposium on Applied Computing

March 2014

1890 pages

ISBN:9781450324694

DOI:10.1145/2554850

Conference Chairs: Yookun Cho (Seoul National University, Korea), Sung Y. Shin (South Dakota State University)

Program Chairs: Sangwook Kim (Kyungpook National University, Korea), Chih-Cheng Hung (Southern Polytechnic State University), Jiman Hong (Soongsil University, South Korea)

Copyright © 2014 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGAPP: ACM Special Interest Group on Applied Computing

Publisher

Association for Computing Machinery

New York, NY, United States

Funding Sources

Institució Catalana de Recerca i Estudis Avançats

Conference

SAC 2014

Sponsor:

SIGAPP

SAC 2014: Symposium on Applied Computing

March 24 - 28, 2014

Gyeongju, Republic of Korea

Acceptance Rates

SAC '14 Paper Acceptance Rate 218 of 939 submissions, 23%;

Overall Acceptance Rate 1,650 of 6,669 submissions, 25%
