Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–23 of 23 results for author: Zafar, M B

.
  1. arXiv:2407.12872  [pdf, other

    cs.CL cs.LG

    Evaluating Large Language Models with fmeval

    Authors: Pola Schwöbel, Luca Franceschi, Muhammad Bilal Zafar, Keerthan Vasist, Aman Malhotra, Tomer Shenhar, Pinal Tailor, Pinar Yilmaz, Michael Diamond, Michele Donini

    Abstract: fmeval is an open source library to evaluate large language models (LLMs) in a range of tasks. It helps practitioners evaluate their model for task performance and along multiple responsible AI dimensions. This paper presents the library and exposes its underlying design principles: simplicity, coverage, extensibility and performance. We then present how these were implemented in the scientific an… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

  2. arXiv:2312.14183  [pdf, other

    cs.CL cs.AI

    On Early Detection of Hallucinations in Factual Question Answering

    Authors: Ben Snyder, Marius Moisescu, Muhammad Bilal Zafar

    Abstract: While large language models (LLMs) have taken great strides towards helping humans with a plethora of tasks like search and summarization, hallucinations remain a major impediment towards gaining user trust. The fluency and coherence of model generations even when hallucinating makes it difficult to detect whether or not a model is hallucinating. In this work, we explore if the artifacts associate… ▽ More

    Submitted 27 December, 2023; v1 submitted 19 December, 2023; originally announced December 2023.

  3. arXiv:2302.13319  [pdf, other

    stat.ML cs.CY cs.LG

    Efficient fair PCA for fair representation learning

    Authors: Matthäus Kleindessner, Michele Donini, Chris Russell, Muhammad Bilal Zafar

    Abstract: We revisit the problem of fair principal component analysis (PCA), where the goal is to learn the best low-rank linear approximation of the data that obfuscates demographic information. We propose a conceptually simple approach that allows for an analytic solution similar to standard PCA and can be kernelized. Our methods have the same complexity as standard PCA, or kernel PCA, and run much faster… ▽ More

    Submitted 26 February, 2023; originally announced February 2023.

  4. arXiv:2212.13897  [pdf, other

    cs.IR

    What You Like: Generating Explainable Topical Recommendations for Twitter Using Social Annotations

    Authors: Parantapa Bhattacharya, Saptarshi Ghosh, Muhammad Bilal Zafar, Soumya K. Ghosh, Niloy Ganguly

    Abstract: With over 500 million tweets posted per day, in Twitter, it is difficult for Twitter users to discover interesting content from the deluge of uninteresting posts. In this work, we present a novel, explainable, topical recommendation system, that utilizes social annotations, to help Twitter users discover tweets, on topics of their interest. A major challenge in using traditional rating dependent r… ▽ More

    Submitted 23 December, 2022; originally announced December 2022.

  5. arXiv:2203.11103  [pdf, other

    cs.LG stat.ML

    Diverse Counterfactual Explanations for Anomaly Detection in Time Series

    Authors: Deborah Sulem, Michele Donini, Muhammad Bilal Zafar, Francois-Xavier Aubet, Jan Gasthaus, Tim Januschowski, Sanjiv Das, Krishnaram Kenthapadi, Cedric Archambeau

    Abstract: Data-driven methods that detect anomalies in times series data are ubiquitous in practice, but they are in general unable to provide helpful explanations for the predictions they make. In this work we propose a model-agnostic algorithm that generates counterfactual ensemble explanations for time series anomaly detection models. Our method generates a set of diverse counterfactual examples, i.e, mu… ▽ More

    Submitted 21 March, 2022; originally announced March 2022.

    Comments: 24 pages, 11 figures

  6. arXiv:2112.12444  [pdf, other

    cs.CL

    More Than Words: Towards Better Quality Interpretations of Text Classifiers

    Authors: Muhammad Bilal Zafar, Philipp Schmidt, Michele Donini, Cédric Archambeau, Felix Biessmann, Sanjiv Ranjan Das, Krishnaram Kenthapadi

    Abstract: The large size and complex decision mechanisms of state-of-the-art text classifiers make it difficult for humans to understand their predictions, leading to a potential lack of trust by the users. These issues have led to the adoption of methods like SHAP and Integrated Gradients to explain classification decisions by assigning importance scores to input tokens. However, prior work, using differen… ▽ More

    Submitted 23 December, 2021; originally announced December 2021.

  7. arXiv:2111.13657  [pdf, other

    cs.LG cs.AI stat.ML

    Amazon SageMaker Model Monitor: A System for Real-Time Insights into Deployed Machine Learning Models

    Authors: David Nigenda, Zohar Karnin, Muhammad Bilal Zafar, Raghu Ramesha, Alan Tan, Michele Donini, Krishnaram Kenthapadi

    Abstract: With the increasing adoption of machine learning (ML) models and systems in high-stakes settings across different industries, guaranteeing a model's performance after deployment has become crucial. Monitoring models in production is a critical aspect of ensuring their continued performance and reliability. We present Amazon SageMaker Model Monitor, a fully managed service that continuously monitor… ▽ More

    Submitted 5 August, 2022; v1 submitted 26 November, 2021; originally announced November 2021.

  8. Amazon SageMaker Clarify: Machine Learning Bias Detection and Explainability in the Cloud

    Authors: Michaela Hardt, Xiaoguang Chen, Xiaoyi Cheng, Michele Donini, Jason Gelman, Satish Gollaprolu, John He, Pedro Larroy, Xinyu Liu, Nick McCarthy, Ashish Rathi, Scott Rees, Ankit Siva, ErhYuan Tsai, Keerthan Vasist, Pinar Yilmaz, Muhammad Bilal Zafar, Sanjiv Das, Kevin Haas, Tyler Hill, Krishnaram Kenthapadi

    Abstract: Understanding the predictions made by machine learning (ML) models and their potential biases remains a challenging and labor-intensive task that depends on the application, the dataset, and the specific model. We present Amazon SageMaker Clarify, an explainability feature for Amazon SageMaker that launched in December 2020, providing insights into data and ML models by identifying biases and expl… ▽ More

    Submitted 7 September, 2021; originally announced September 2021.

    Journal ref: In Proc. ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), 2974-2983 (2021)

  9. arXiv:2107.05978  [pdf, other

    cs.LG cs.AI cs.CY

    DIVINE: Diverse Influential Training Points for Data Visualization and Model Refinement

    Authors: Umang Bhatt, Isabel Chien, Muhammad Bilal Zafar, Adrian Weller

    Abstract: As the complexity of machine learning (ML) models increases, resulting in a lack of prediction explainability, several methods have been developed to explain a model's behavior in terms of the training data points that most influence the model. However, these methods tend to mark outliers as highly influential points, limiting the insights that practitioners can draw from points that are not repre… ▽ More

    Submitted 13 July, 2021; originally announced July 2021.

    Comments: 30 pages, 32 figures

  10. arXiv:2106.12639  [pdf, other

    stat.ML cs.LG

    Multi-objective Asynchronous Successive Halving

    Authors: Robin Schmucker, Michele Donini, Muhammad Bilal Zafar, David Salinas, Cédric Archambeau

    Abstract: Hyperparameter optimization (HPO) is increasingly used to automatically tune the predictive performance (e.g., accuracy) of machine learning models. However, in a plethora of real-world applications, accuracy is only one of the multiple -- often conflicting -- performance criteria, necessitating the adoption of a multi-objective (MO) perspective. While the literature on MO optimization is rich, fe… ▽ More

    Submitted 23 June, 2021; originally announced June 2021.

  11. arXiv:2106.04631  [pdf, other

    cs.CL cs.LG

    On the Lack of Robust Interpretability of Neural Text Classifiers

    Authors: Muhammad Bilal Zafar, Michele Donini, Dylan Slack, Cédric Archambeau, Sanjiv Das, Krishnaram Kenthapadi

    Abstract: With the ever-increasing complexity of neural language models, practitioners have turned to methods for understanding the predictions of these models. One of the most well-adopted approaches for model interpretability is feature-based interpretability, i.e., ranking the features in terms of their impact on model predictions. Several prior studies have focused on assessing the fidelity of feature-b… ▽ More

    Submitted 8 June, 2021; originally announced June 2021.

    Comments: Appearing at ACL Findings 2021

  12. Loss-Aversively Fair Classification

    Authors: Junaid Ali, Muhammad Bilal Zafar, Adish Singla, Krishna P. Gummadi

    Abstract: The use of algorithmic (learning-based) decision making in scenarios that affect human lives has motivated a number of recent studies to investigate such decision making systems for potential unfairness, such as discrimination against subjects based on their sensitive features like gender or race. However, when judging the fairness of a newly designed decision making system, these studies have ove… ▽ More

    Submitted 10 May, 2021; originally announced May 2021.

    Comments: 8 pages, Accepted at AIES 2019

    Journal ref: In AAAI/ACM Conference on AI, Ethics, and Society (AIES 2019), January 27-28 2019 Honolulu, HI, USA

  13. arXiv:2105.03153  [pdf, other

    stat.ML cs.LG

    Pairwise Fairness for Ordinal Regression

    Authors: Matthäus Kleindessner, Samira Samadi, Muhammad Bilal Zafar, Krishnaram Kenthapadi, Chris Russell

    Abstract: We initiate the study of fairness for ordinal regression. We adapt two fairness notions previously considered in fair ranking and propose a strategy for training a predictor that is approximately fair according to either notion. Our predictor has the form of a threshold model, composed of a scoring function and a set of thresholds, and our strategy is based on a reduction to fair binary classifica… ▽ More

    Submitted 11 February, 2022; v1 submitted 7 May, 2021; originally announced May 2021.

  14. arXiv:2007.00251  [pdf, other

    cs.AI cs.CY cs.LG

    Unifying Model Explainability and Robustness via Machine-Checkable Concepts

    Authors: Vedant Nanda, Till Speicher, John P. Dickerson, Krishna P. Gummadi, Muhammad Bilal Zafar

    Abstract: As deep neural networks (DNNs) get adopted in an ever-increasing number of applications, explainability has emerged as a crucial desideratum for these models. In many real-world tasks, one of the principal reasons for requiring explainability is to in turn assess prediction robustness, where predictions (i.e., class labels) that do not conform to their respective explanations (e.g., presence or ab… ▽ More

    Submitted 2 July, 2020; v1 submitted 1 July, 2020; originally announced July 2020.

    Comments: 22 pages, 12 figures, 11 tables

  15. arXiv:2006.05109  [pdf, other

    stat.ML cs.LG

    Fair Bayesian Optimization

    Authors: Valerio Perrone, Michele Donini, Muhammad Bilal Zafar, Robin Schmucker, Krishnaram Kenthapadi, Cédric Archambeau

    Abstract: Given the increasing importance of machine learning (ML) in our lives, several algorithmic fairness techniques have been proposed to mitigate biases in the outcomes of the ML models. However, most of these techniques are specialized to cater to a single family of ML models and a specific definition of fairness, limiting their adaptibility in practice. We introduce a general constrained Bayesian op… ▽ More

    Submitted 18 June, 2021; v1 submitted 9 June, 2020; originally announced June 2020.

  16. arXiv:1807.00787  [pdf, other

    cs.LG cs.CY stat.ML

    A Unified Approach to Quantifying Algorithmic Unfairness: Measuring Individual & Group Unfairness via Inequality Indices

    Authors: Till Speicher, Hoda Heidari, Nina Grgic-Hlaca, Krishna P. Gummadi, Adish Singla, Adrian Weller, Muhammad Bilal Zafar

    Abstract: Discrimination via algorithmic decision making has received considerable attention. Prior work largely focuses on defining conditions for fairness, but does not define satisfactory measures of algorithmic unfairness. In this paper, we focus on the following question: Given two unfair algorithms, how should we determine which of the two is more unfair? Our core idea is to use existing inequality in… ▽ More

    Submitted 2 July, 2018; originally announced July 2018.

    Comments: 12 pages 7 figures To be published in: KDD '18: The 24th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining Proceedings

  17. arXiv:1707.00010  [pdf, other

    stat.ML cs.LG

    From Parity to Preference-based Notions of Fairness in Classification

    Authors: Muhammad Bilal Zafar, Isabel Valera, Manuel Gomez Rodriguez, Krishna P. Gummadi, Adrian Weller

    Abstract: The adoption of automated, data-driven decision making in an ever expanding range of applications has raised concerns about its potential unfairness towards certain social groups. In this context, a number of recent studies have focused on defining, detecting, and removing unfairness from data-driven decision systems. However, the existing notions of fairness, based on parity (equality) in treatme… ▽ More

    Submitted 28 November, 2017; v1 submitted 30 June, 2017; originally announced July 2017.

    Comments: To appear in Proceedings of the 31st Conference on Neural Information Processing Systems (NIPS 2017). Code available at: https://github.com/mbilalzafar/fair-classification

  18. arXiv:1706.10208  [pdf, other

    stat.ML cs.LG

    On Fairness, Diversity and Randomness in Algorithmic Decision Making

    Authors: Nina Grgić-Hlača, Muhammad Bilal Zafar, Krishna P. Gummadi, Adrian Weller

    Abstract: Consider a binary decision making process where a single machine learning classifier replaces a multitude of humans. We raise questions about the resulting loss of diversity in the decision making process. We study the potential benefits of using random classifier ensembles instead of a single classifier in the context of fairness-aware learning and demonstrate various attractive properties: (i) a… ▽ More

    Submitted 30 June, 2017; originally announced June 2017.

    Comments: Presented as a poster at the 2017 Workshop on Fairness, Accountability, and Transparency in Machine Learning (FAT/ML 2017)

  19. arXiv:1704.01442  [pdf, other

    cs.SI cs.CY cs.HC

    Characterizing Information Diets of Social Media Users

    Authors: Juhi Kulshrestha, Muhammad Bilal Zafar, Lisette Espin-Noboa, Krishna P. Gummadi, Saptarshi Ghosh

    Abstract: With the widespread adoption of social media sites like Twitter and Facebook, there has been a shift in the way information is produced and consumed. Earlier, the only producers of information were traditional news organizations, which broadcast the same carefully-edited information to all consumers over mass media channels. Whereas, now, in online social media, any user can be a producer of infor… ▽ More

    Submitted 5 April, 2017; originally announced April 2017.

    Comments: In Proceeding of International AAAI Conference on Web and Social Media (ICWSM), Oxford, UK, May 2015

  20. arXiv:1704.01347  [pdf, ps, other

    cs.SI cs.CY cs.HC

    Quantifying Search Bias: Investigating Sources of Bias for Political Searches in Social Media

    Authors: Juhi Kulshrestha, Motahhare Eslami, Johnnatan Messias, Muhammad Bilal Zafar, Saptarshi Ghosh, Krishna P. Gummadi, Karrie Karahalios

    Abstract: Search systems in online social media sites are frequently used to find information about ongoing events and people. For topics with multiple competing perspectives, such as political events or political candidates, bias in the top ranked results significantly shapes public opinion. However, bias does not emerge from an algorithm alone. It is important to distinguish between the bias that arises f… ▽ More

    Submitted 5 April, 2017; originally announced April 2017.

    Comments: In Proceedings of ACM Conference on Computer Supported Cooperative Work & Social Computing (CSCW), Portland, USA, February 2017

  21. arXiv:1610.10064  [pdf, other

    stat.ML cs.CY

    The Case for Temporal Transparency: Detecting Policy Change Events in Black-Box Decision Making Systems

    Authors: Miguel Ferreira, Muhammad Bilal Zafar, Krishna P. Gummadi

    Abstract: Bringing transparency to black-box decision making systems (DMS) has been a topic of increasing research interest in recent years. Traditional active and passive approaches to make these systems transparent are often limited by scalability and/or feasibility issues. In this paper, we propose a new notion of black-box DMS transparency, named, temporal transparency, whose goal is to detect if/when t… ▽ More

    Submitted 31 October, 2016; originally announced October 2016.

  22. Fairness Beyond Disparate Treatment & Disparate Impact: Learning Classification without Disparate Mistreatment

    Authors: Muhammad Bilal Zafar, Isabel Valera, Manuel Gomez Rodriguez, Krishna P. Gummadi

    Abstract: Automated data-driven decision making systems are increasingly being used to assist, or even replace humans in many settings. These systems function by learning from historical decisions, often taken by humans. In order to maximize the utility of these systems (or, classifiers), their training involves minimizing the errors (or, misclassifications) over the given historical data. However, it is qu… ▽ More

    Submitted 8 March, 2017; v1 submitted 26 October, 2016; originally announced October 2016.

    Comments: To appear in Proceedings of the 26th International World Wide Web Conference (WWW), 2017. Code available at: https://github.com/mbilalzafar/fair-classification

  23. arXiv:1507.05259  [pdf, other

    stat.ML cs.LG

    Fairness Constraints: Mechanisms for Fair Classification

    Authors: Muhammad Bilal Zafar, Isabel Valera, Manuel Gomez Rodriguez, Krishna P. Gummadi

    Abstract: Algorithmic decision making systems are ubiquitous across a wide variety of online as well as offline services. These systems rely on complex learning methods and vast amounts of data to optimize the service functionality, satisfaction of the end user and profitability. However, there is a growing concern that these automated decisions can lead, even in the absence of intent, to a lack of fairness… ▽ More

    Submitted 23 March, 2017; v1 submitted 19 July, 2015; originally announced July 2015.

    Comments: To appear in Proceedings of the 20th International Conference on Artificial Intelligence and Statistics (AISTATS). Open-source code implementation of our scheme is available at: https://github.com/mbilalzafar/fair-classification