tutorial

Public Access

Dealing with Bias and Fairness in Data Science Systems: A Practical Hands-on Tutorial

Authors:

Kit T. Rodolfa,

Rayid GhaniAuthors Info & Claims

KDD '20: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining

Pages 3513 - 3514

https://doi.org/10.1145/3394486.3406708

Published: 20 August 2020 Publication History

Abstract

Tackling issues of bias and fairness when building and deploying data science systems has received increased attention from the research community in recent years, yet a lot of the research has focused on theoretical aspects and very limited set of application areas and data sets. There is a lack of 1) practical training materials, 2) methodologies, and 3) tools for researchers and developers working on real-world algorithmic decision making system to deal with issues of bias and fairness. Today, treating bias and fairness as primary metrics of interest, and building, selecting, and validating models using those metrics is not standard practice for data scientists. In this hands-on tutorial we will try to bridge the gap between research and practice, by deep diving into algorithmic fairness, from metrics and definitions to practical case studies, including bias audits using the Aequitas toolkit (http://github.com/dssg/aequitas). By the end of this hands-on tutorial, the audience will be familiar with bias mitigation frameworks and tools to help them making decisions during a project based on intervention and deployment contexts in which their system will be used.

References

[1]

Kit T. Rodolfa, Pedro Saleiro, and Rayid Ghani. Chapter 11: Bias and fairness. In Ian Foster, Rayid Ghani, Ron S Jarmin, Frauke Kreuter, and Julia Lane, editors, Big data and social science: A practical guide to methods and tools. crc Press, 2020.

[2]

Kit T Rodolfa, Erika Salomon, Lauren Haynes, Iván Higuera Mendieta, Jamie Larson, and Rayid Ghani. Case study: predictive fairness to reduce misdemeanor recidivism through social service interventions. In Proceedings of the 2020 Conference on Fairness, Accountability, and Transparency, pages 142--153, 2020.

Digital Library

[3]

Pedro Saleiro, Benedict Kuester, Loren Hinkson, Jesse London, Abby Stevens, Ari Anisfeld, Kit T. Rodolfa, and Rayid Ghani. Aequitas: A Bias and Fairness Audit Toolkit. (2018), nov 2018.

[4]

Moritz Hardt, Eric Price, and Nathan Srebro. Equality of Opportunity in Supervised Learning. Advances in Neural Information Processing Systems, (Nips):1--22, 2016.

[5]

Solon Barocas and Andrew D. Selbst. Big Data's Disparate Impact. California Law Review, 104(3):671--732, 2016.

[6]

Shira Mitchell, Eric Potash, Solon Barocas, Alexander D'Amour, and Kristian Lum. Prediction-Based Decisions and Fairness: A Catalogue of Choices, Assumptions, and Definitions. nov 2018.

[7]

Alexandra Chouldechova, Diana Benavides-Prado, Oleksandr Fialko, and Rhema Vaithianathan. A case study of algorithm-assisted decision making in child maltreatment hotline screening decisions. In Conference on Fairness, Accountability and Transparency, pages 134--148, 2018.

[8]

Alexandra Chouldechova and Aaron Roth. The frontiers of fairness in machine learning. arXiv preprint arXiv:1810.08810, 2018.

[9]

Geoff Pleiss, Manish Raghavan, Felix Wu, Jon Kleinberg, and Kilian Q Weinberger. On Fairness and Calibration. In I Guyon, U V Luxburg, S Bengio, H Wallach, R Fergus, S Vishwanathan, and R Garnett, editors, Advances in Neural Information Processing Systems 30, pages 5680--5689. Curran Associates, Inc., 2017.

[10]

Muhammad Bilal Zafar, Isabel Valera, Manuel Gomez Rodriguez, and Krishna P. Gummadi. Fairness constraints: Mechanisms for fair classification. Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, AISTATS 2017, 54, 2017.

[11]

Muhammad Bilal Zafar, Isabel Valera, Manuel Gomez Rodriguez, and Krishna P. Gummadi. Fairness beyond disparate treatment & disparate impact: Learning classification without disparate mistreatment. 26th International World Wide Web Conference, WWW 2017, pages 1171--1180, 2017.

Digital Library

[12]

Andrew Cotter, Maya Gupta, Heinrich Jiang, Nathan Srebro, Karthik Sridharan, Serena Wang, Blake Woodworth, and Seungil You. Training Well-Generalizing Classifiers for Fairness Metrics and Other Data-Dependent Constraints. In Proceedings of the 36th International Conference on Machine Learning, volume 97, pages 1397--1405, Long Beach, California, USA, jun 2019. PMLR.

[13]

Indre liobait ? e. Measuring discrimination in algorithmic decision making. Data Mining and Knowledge Discovery, 31(4):1060--1089, jul 2017.

Digital Library

[14]

Sahil Verma and Julia Rubin. Fairness definitions explained. In Proceedings of the International Workshop on Software Fairness - FairWare '18, pages 1--7, New York, New York, USA, 2018. ACM Press.

Digital Library

[15]

Alexandra Chouldechova. Fair Prediction with Disparate Impact: A Study of Bias in Recidivism Prediction Instruments. Big Data, 5(2):153--163, jun 2017.

[16]

Jon Kleinberg, Himabindu Lakkaraju, Jure Leskovec, Jens Ludwig, and Sendhill Mullainathan. Human Decisions and Machine Predictions*. The Quarterly Journal of Economics, 133(January):237--293, aug 2017.

[17]

Chris Russell, Matt J Kusner, Joshua Loftus, and Ricardo Silva. When Worlds Collide: Integrating Different Counterfactual Assumptions in Fairness. In I Guyon, U V Luxburg, S Bengio, H Wallach, R Fergus, S Vishwanathan, and R Garnett, editors, Advances in Neural Information Processing Systems 30, pages 6414--6423. Curran Associates, Inc., 2017.

[18]

Matt J Kusner, Joshua Loftus, Chris Russell, and Ricardo Silva. Counterfactual Fairness. In I Guyon, U V Luxburg, S Bengio, H Wallach, R Fergus, S Vishwanathan, and R Garnett, editors, Advances in Neural Information Processing Systems 30, pages 4066--4076. Curran Associates, Inc., 2017.

[19]

Julia Angwin, Jeff Larson, Surya Mattu, and Lauren Kirchner. Machine bias. ProPublica, May, 23:2016, 2016.

[20]

Michael Feldman, Sorelle A. Friedler, John Moeller, Carlos Scheidegger, and Suresh Venkatasubramanian. Certifying and Removing Disparate Impact. In Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining - KDD '15, pages 259--268, New York, New York, USA, 2015. ACM Press.

Digital Library

[21]

Alekh Agarwal, Aliiia Beygelzimer, Miroslav Dudfk, John Langford, and Wallach Hanna. A reductions approach to fair classification. 35th International Conference on Machine Learning, ICML 2018, 1:102--119, 2018.

[22]

Yahav Bechavod and Katrina Ligett. Penalizing Unfairness in Binary Classification. jun 2017.

[23]

Naman Goel, Mohammad Yaghini, and Boi Faltings. Non-Discriminatory Machine Learning through Convex Fairness Criteria. In Proceedings of the 2018 AAAI/ACM Conference on AI, Ethics, and Society - AIES '18, pages 116--116, New York, New York, USA, 2018. ACM Press.

Digital Library

[24]

Blake Woodworth, Suriya Gunasekar, Mesrob I. Ohannessian, and Nathan Srebro. Learning Non-Discriminatory Predictors. In Satyen Kale and Ohad Shamir, editors, Proceedings of the 2017 Conference on Learning Theory, volume 65, pages 1920--1953, Amsterdam, Netherlands, jul 2017. PMLR.

Cited By

Criscuolo CDolci T(2024)Exploring Fairness Interpretability with FairnessFriend: A Chatbot Solution2024 IEEE 40th International Conference on Data Engineering Workshops (ICDEW)10.1109/ICDEW61823.2024.00037(246-253)Online publication date: 13-May-2024
https://doi.org/10.1109/ICDEW61823.2024.00037
Asif RHassan SParr G(2023)Integrating a Blockchain-Based Governance Framework for Responsible AIFuture Internet10.3390/fi1503009715:3(97)Online publication date: 28-Feb-2023
https://doi.org/10.3390/fi15030097
Bell ABynum LDrushchak NZakharchenko TRosenblatt LStoyanovich J(2023)The Possibility of Fairness: Revisiting the Impossibility Theorem in PracticeProceedings of the 2023 ACM Conference on Fairness, Accountability, and Transparency10.1145/3593013.3594007(400-422)Online publication date: 12-Jun-2023
https://dl.acm.org/doi/10.1145/3593013.3594007
Show More Cited By

Index Terms

Dealing with Bias and Fairness in Data Science Systems: A Practical Hands-on Tutorial
1. Computing methodologies
  1. Machine learning
2. General and reference
  1. Cross-computing tools and techniques
    1. Evaluation

Recommendations

Introduction to AI Fairness
CHI EA '20: Extended Abstracts of the 2020 CHI Conference on Human Factors in Computing Systems

Today, AI is used in many high-stakes decision-making applications in which fairness is an important concern. Already, there are many examples of AI being biased and making questionable and unfair decisions. Recently, the AI research community has ...
Introduction to AI Fairness
CHI EA '21: Extended Abstracts of the 2021 CHI Conference on Human Factors in Computing Systems

Today, AI is used in many high-stakes decision-making applications in which fairness is an important concern. Already, there are many examples of AI being biased and making questionable and unfair decisions. Recently, the AI research community has ...
Fairness metrics and bias mitigation strategies for rating predictions
Abstract
Algorithm fairness is an established line of research in the machine learning domain with substantial work while the equivalent in the recommender system domain is relatively new. In this article, we consider rating-based recommender ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

KDD '20: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining

August 2020

3664 pages

ISBN:9781450379984

DOI:10.1145/3394486

General Chairs:
Rajesh Gupta
UC San Diego, USA
,
Yan Liu
USC, USA
,
Program Chairs:
Mohak Shah
LG Electronics, USA
,
Suju Rajan
Linkedin, USA
,
Publications Chairs:
Jiliang Tang
Michigan State, USA
,
B. Aditya Prakash
Georgia Tech, USA

Copyright © 2020 Owner/Author.

Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 20 August 2020

Check for updates

Author Tags

Qualifiers

Tutorial

Funding Sources

National Science Foundation

Conference

KDD '20

Sponsor:

KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining

July 6 - 10, 2020

CA, Virtual Event, USA

Acceptance Rates

Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

Upcoming Conference

KDD '24

Sponsor:
sigkdd
sigkdd

The 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining

August 25 - 29, 2024

Barcelona , Spain

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

11
Total Citations
View Citations
1,260
Total Downloads

Downloads (Last 12 months)344
Downloads (Last 6 weeks)30

Reflects downloads up to 10 Aug 2024

Other Metrics

View Author Metrics

Citations

Cited By

Criscuolo CDolci T(2024)Exploring Fairness Interpretability with FairnessFriend: A Chatbot Solution2024 IEEE 40th International Conference on Data Engineering Workshops (ICDEW)10.1109/ICDEW61823.2024.00037(246-253)Online publication date: 13-May-2024
https://doi.org/10.1109/ICDEW61823.2024.00037
Asif RHassan SParr G(2023)Integrating a Blockchain-Based Governance Framework for Responsible AIFuture Internet10.3390/fi1503009715:3(97)Online publication date: 28-Feb-2023
https://doi.org/10.3390/fi15030097
Bell ABynum LDrushchak NZakharchenko TRosenblatt LStoyanovich J(2023)The Possibility of Fairness: Revisiting the Impossibility Theorem in PracticeProceedings of the 2023 ACM Conference on Fairness, Accountability, and Transparency10.1145/3593013.3594007(400-422)Online publication date: 12-Jun-2023
https://dl.acm.org/doi/10.1145/3593013.3594007
Baresi LCriscuolo CGhezzi C(2023)Understanding Fairness Requirements for ML-based Software2023 IEEE 31st International Requirements Engineering Conference (RE)10.1109/RE57278.2023.00046(341-346)Online publication date: Sep-2023
https://doi.org/10.1109/RE57278.2023.00046
Zawad SAnwar AZhou YBaracaldo NYan F(2023)HDFL: A Heterogeneity and Client Dropout-Aware Federated Learning Framework2023 IEEE/ACM 23rd International Symposium on Cluster, Cloud and Internet Computing (CCGrid)10.1109/CCGrid57682.2023.00037(311-321)Online publication date: May-2023
https://doi.org/10.1109/CCGrid57682.2023.00037
Rawal YSoni HDani RBagchi P(2022)A Review on Service Delivery in Tourism and Hospitality Industry Through Artificial IntelligenceProceedings of Third International Conference on Computing, Communications, and Cyber-Security10.1007/978-981-19-1142-2_34(427-436)Online publication date: 3-Jul-2022
https://doi.org/10.1007/978-981-19-1142-2_34
Sharma SRawal YPal SDani R(2022)Fairness, Accountability, Sustainability, Transparency (FAST) of Artificial Intelligence in Terms of Hospitality IndustryICT Analysis and Applications10.1007/978-981-16-5655-2_48(495-504)Online publication date: 7-Jan-2022
https://doi.org/10.1007/978-981-16-5655-2_48
Flannagan C(2022)Big Data in Road Transport and Mobility ResearchAI-enabled Technologies for Autonomous and Connected Vehicles10.1007/978-3-031-06780-8_19(523-546)Online publication date: 8-Sep-2022
https://doi.org/10.1007/978-3-031-06780-8_19
Zawad SYan FAnwar A(2022)Systems Bias in Federated LearningFederated Learning10.1007/978-3-030-96896-0_12(259-278)Online publication date: 8-Feb-2022
https://doi.org/10.1007/978-3-030-96896-0_12
Lamba HRodolfa KGhani R(2021)An Empirical Comparison of Bias Reduction Methods on Real-World Problems in High-Stakes Policy SettingsACM SIGKDD Explorations Newsletter10.1145/3468507.346851823:1(69-85)Online publication date: 29-May-2021
https://dl.acm.org/doi/10.1145/3468507.3468518
Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Media

Figures

Other

Tables

View Table of Contents