Abstract
Discrimination in decision making is prohibited on many attributes (religion, gender, etc…), but often present in historical decisions. Use of such discriminatory historical decision making as training data can perpetuate discrimination, even if the protected attributes are not directly present in the data. This work focuses on discovering discrimination in instances and preventing discrimination in classification. First, we propose a discrimination discovery method based on modeling the probability distribution of a class using Bayesian networks. This measures the effect of a protected attribute (e.g., gender) in a subset of the dataset using the estimated probability distribution (via a Bayesian network). Second, we propose a classification method that corrects for the discovered discrimination without using protected attributes in the decision process. We evaluate the discrimination discovery and discrimination prevention approaches on two different datasets. The empirical results show that a substantial amount of discrimination identified in instances is prevented in future decisions.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Calders T, Verwer S (2010) Three naive Bayes approaches for discrimination-free classification. Data Min J (special issue with selected papers from ECML/PKDD)
Cooper GF, Herskovits E (1991) A Bayesian method for the induction of probabilistic networks from data. Mach Learn BMIR-1991-0293
Dwork C, Hardt M, Pitassi T, Reingold O, Zemel R (2012) Fairness through awareness. In: ITCS, pp 214–226
Hajian S, Domingo-Ferrer J (2012) A study on the impact of data anonymization on anti-discrimination. In: IEEE ICDM international workshop on discrimination and privacy-aware data mining
Hajian S, Monreale A, Pedreschi D, Domingo-Ferrer J, Giannotti F (2012) Injecting discrimination and privacy awareness into pattern discovery. In: IEEE ICDM international workshop on discrimination and privacy-aware data mining
Kamiran F, Calders T (2009) Classifying without discriminating. IEEE Press, New York
Kamiran F, Calders T (2011) Data preprocessing techniques for classification without discrimination. Knowl Inf Syst
Kamiran F, Calders T, Pechenizkiy M (2010) Discrimination aware decision tree learning. In: Proceedings IEEE ICDM international conference on data mining
Kamiran F, Karim A, Zhang X (2012) Decision theory for discrimination-aware classification. In: IEEE international conference on data mining
Luong BT, Ruggieri S, Turini F (2011) k-NN as an implementation of situation testing for discrimination discovery and prevention. In: 17th ACM international conference on knowledge discovery and data mining (KDD 2011). ACM, pp 502–510
Mancuhan K, Clifton C (2012) Discriminatory decision policy aware classification. In: IEEE ICDM international workshop on discrimination and privacy-aware data mining
Newman DJ, Hettich S, Blake CL, Merz CJ UCI repository of machine learning databases, http://archive.ics.uci.edu/ml/
Pedreschi D, Ruggieri S, Turini F (2008) Discrimination-aware data mining. In: KDD conference
Pedreschi D, Ruggieri S, Turini F (2009) Measuring discrimination in socially-sensitive decision records. In: 9th SIAM conference on data mining (SDM 2009). SIAM, pp 581–592
Romei A, Ruggieri S (2013) A multidisciplinary survey on discrimination analysis. Knowl Eng Rev 1–57
Ruggieri S, Pedreschi D, Turini F (2011) DCUBE: discrimination discovery in databases. In: ACM international conference on knowledge discovery and data mining (KDD 2011). ACM, pp 502–510
Tan P.-N, Steinbach M, Kumar V (2006) Introduction to data mining. Addison-Wesley, Reading, MA, pp 227-246
Witten IH, Frank E (2011) Data mining: practical machine learning tools and techniques. 3rd edn. Morgan Kaufmann, Los Altos, CA
Zemel R, Wu Y, Swersky K, Pitassi T, Dwork C (2013) Learning fair representations. In: ICML
Zliobaite I, Kamiran F, Calders T (2011) Handling conditional discrimination. In: Proceedings IEEE ICDM international conference on data mining
Acknowledgments
We wish to thank Alysa C. Rollock, J.D., Vice President for Ethics and Compliance at Purdue University, for discussion and pointers to relevant U.S. law. We also wish to thank journal reviewers for their helpful comments.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Mancuhan, K., Clifton, C. Combating discrimination using Bayesian networks. Artif Intell Law 22, 211–238 (2014). https://doi.org/10.1007/s10506-014-9156-4
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10506-014-9156-4