DOI: 10.1145/3394486.3406708

Dealing with Bias and Fairness in Data Science Systems: A Practical Hands-on Tutorial

Published: 20 August 2020

    Abstract

    Tackling issues of bias and fairness when building and deploying data science systems has received increased attention from the research community in recent years, yet much of the research has focused on theoretical aspects and on a very limited set of application areas and data sets. There is a lack of 1) practical training materials, 2) methodologies, and 3) tools for researchers and developers working on real-world algorithmic decision-making systems to deal with issues of bias and fairness. Today, treating bias and fairness as primary metrics of interest, and building, selecting, and validating models using those metrics, is not standard practice for data scientists. In this hands-on tutorial we will try to bridge the gap between research and practice by taking a deep dive into algorithmic fairness, from metrics and definitions to practical case studies, including bias audits using the Aequitas toolkit (http://github.com/dssg/aequitas). By the end of this hands-on tutorial, the audience will be familiar with bias mitigation frameworks and tools that help them make decisions during a project based on the intervention and deployment contexts in which their system will be used.
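
    As a rough illustration of the kind of quantity a bias audit reports, the sketch below computes a false positive rate per group and its ratio to a reference group. It is plain Python and does not use the Aequitas API; the function names, group labels, and toy data are hypothetical, and false positive rate is only one of several metrics an audit might examine.

```python
from collections import defaultdict


def group_false_positive_rates(rows):
    """Return {group: FPR}, where FPR = FP / (FP + TN).

    Each row is (group, true_label, predicted_label) with binary labels.
    Only rows whose true label is 0 contribute to the rate.
    """
    false_positives = defaultdict(int)
    negatives = defaultdict(int)
    for group, label, prediction in rows:
        if label == 0:
            negatives[group] += 1
            if prediction == 1:
                false_positives[group] += 1
    return {g: false_positives[g] / negatives[g] for g in negatives}


def disparity(rates, reference_group):
    """Return each group's rate divided by the reference group's rate."""
    reference_rate = rates[reference_group]
    if reference_rate == 0:
        raise ValueError("reference group rate is zero; ratio is undefined")
    return {g: rate / reference_rate for g, rate in rates.items()}


# Hypothetical toy data: (group, true_label, predicted_label).
rows = [
    ("a", 0, 1), ("a", 0, 0), ("a", 0, 0), ("a", 0, 0), ("a", 1, 1),
    ("b", 0, 1), ("b", 0, 1), ("b", 0, 0), ("b", 0, 0), ("b", 1, 0),
]

rates = group_false_positive_rates(rows)
ratios = disparity(rates, reference_group="a")
print(rates)   # per-group false positive rates
print(ratios)  # ratios relative to group "a"
```

    In this toy data, group "b" has twice the false positive rate of group "a". Whether such a gap matters, and which metric to audit, depends on the intervention and deployment context discussed above.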

        Published In

        KDD '20: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining
        August 2020
        3664 pages
        ISBN: 9781450379984
        DOI: 10.1145/3394486
        Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Author Tags

        1. ai ethics
        2. algorithmic fairness
        3. bias

        Qualifiers

        • Tutorial

        Conference

        KDD '20

        Acceptance Rates

        Overall Acceptance Rate 1,133 of 8,635 submissions, 13%
