Showing 1–50 of 105 results for author: Rudin, C

Searching in archive cs.
  1. arXiv:2407.04846  [pdf, other]

    cs.LG cs.AI

    Amazing Things Come From Having Many Good Models

    Authors: Cynthia Rudin, Chudi Zhong, Lesia Semenova, Margo Seltzer, Ronald Parr, Jiachang Liu, Srikar Katta, Jon Donnelly, Harry Chen, Zachery Boner

    Abstract: The Rashomon Effect, coined by Leo Breiman, describes the phenomenon that there exist many equally good predictive models for the same dataset. This phenomenon happens for many real datasets and when it does, it sparks both magic and consternation, but mostly magic. In light of the Rashomon Effect, this perspective piece proposes reshaping the way we think about machine learning, particularly for…

    Submitted 9 July, 2024; v1 submitted 5 July, 2024; originally announced July 2024.

    Journal ref: ICML (spotlight), 2024

  2. arXiv:2406.14675  [pdf, other]

    cs.CV cs.AI cs.LG

    This Looks Better than That: Better Interpretable Models with ProtoPNeXt

    Authors: Frank Willard, Luke Moffett, Emmanuel Mokel, Jon Donnelly, Stark Guo, Julia Yang, Giyoung Kim, Alina Jade Barnett, Cynthia Rudin

    Abstract: Prototypical-part models are a popular interpretable alternative to black-box deep learning models for computer vision. However, they are difficult to train, with high sensitivity to hyperparameter tuning, inhibiting their application to new datasets and our understanding of which methods truly improve their performance. To facilitate the careful study of prototypical-part networks (ProtoPNets), w…

    Submitted 20 June, 2024; originally announced June 2024.

  3. arXiv:2406.06386  [pdf, other]

    cs.CV

    FPN-IAIA-BL: A Multi-Scale Interpretable Deep Learning Model for Classification of Mass Margins in Digital Mammography

    Authors: Julia Yang, Alina Jade Barnett, Jon Donnelly, Satvik Kishore, Jerry Fang, Fides Regina Schwartz, Chaofan Chen, Joseph Y. Lo, Cynthia Rudin

    Abstract: Digital mammography is essential to breast cancer detection, and deep learning offers promising tools for faster and more accurate mammogram analysis. In radiology and other high-stakes environments, uninterpretable ("black box") deep learning models are unsuitable and there is a call in these fields to make interpretable models. Recent work in interpretable computer vision provides transparency t…

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 8 pages, 6 figures, Accepted for oral presentation at the 2024 CVPR Workshop on Domain adaptation, Explainability, Fairness in AI for Medical Image Analysis (DEF-AI-MIA)

  4. arXiv:2404.17667  [pdf, other]

    eess.SP cs.LG

    SiamQuality: A ConvNet-Based Foundation Model for Imperfect Physiological Signals

    Authors: Cheng Ding, Zhicheng Guo, Zhaoliang Chen, Randall J Lee, Cynthia Rudin, Xiao Hu

    Abstract: Foundation models, especially those using transformers as backbones, have gained significant popularity, particularly in language and language-vision tasks. However, large foundation models are typically trained on high-quality data, which poses a significant challenge, given the prevalence of poor-quality real-world data. This challenge is more pronounced for developing foundation models for phys…

    Submitted 26 April, 2024; originally announced April 2024.

  5. arXiv:2404.04714  [pdf, other]

    cs.LG cs.AI cs.CR

    Data Poisoning Attacks on Off-Policy Policy Evaluation Methods

    Authors: Elita Lobo, Harvineet Singh, Marek Petrik, Cynthia Rudin, Himabindu Lakkaraju

    Abstract: Off-policy Evaluation (OPE) methods are a crucial tool for evaluating policies in high-stakes domains such as healthcare, where exploration is often infeasible, unethical, or expensive. However, the extent to which such methods can be trusted under adversarial threats to data quality is largely unexplored. In this work, we make the first attempt at investigating the sensitivity of OPE methods to m…

    Submitted 6 April, 2024; originally announced April 2024.

    Comments: Accepted at UAI 2022

  6. arXiv:2403.05652  [pdf, other]

    cs.LG cs.AI

    What is different between these datasets?

    Authors: Varun Babbar, Zhicheng Guo, Cynthia Rudin

    Abstract: The performance of machine learning models heavily depends on the quality of input data, yet real-world applications often encounter various data-related challenges. One such challenge could arise when curating training data or deploying the model in the real world - two comparable datasets in the same domain may have different distributions. While numerous techniques exist for detecting distribut…

    Submitted 8 March, 2024; originally announced March 2024.

  7. arXiv:2402.09702  [pdf, other]

    cs.LG stat.ML

    Sparse and Faithful Explanations Without Sparse Models

    Authors: Yiyang Sun, Zhi Chen, Vittorio Orlandi, Tong Wang, Cynthia Rudin

    Abstract: Even if a model is not globally sparse, it is possible for decisions made from that model to be accurately and faithfully described by a small number of features. For instance, an application for a large loan might be denied to someone because they have no credit history, which overwhelms any evidence towards their creditworthiness. In this work, we introduce the Sparse Explanation Value (SEV), a…

    Submitted 8 March, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

    Comments: Accepted in AISTATS 2024

  8. arXiv:2401.15330  [pdf, other]

    cs.LG

    Optimal Sparse Survival Trees

    Authors: Rui Zhang, Rui Xin, Margo Seltzer, Cynthia Rudin

    Abstract: Interpretability is crucial for doctors, hospitals, pharmaceutical companies and biotechnology corporations to analyze and make decisions for high stakes problems that involve human health. Tree-based methods have been widely adopted for survival analysis due to their appealing interpretability and their ability to capture complex relationships. However, most existing methods to produce survival…

    Submitted 22 May, 2024; v1 submitted 27 January, 2024; originally announced January 2024.

    Comments: AISTATS2024 camera ready version. arXiv admin note: text overlap with arXiv:2211.14980

  9. arXiv:2312.10569  [pdf, other]

    cs.LG eess.SP stat.ME

    Interpretable Causal Inference for Analyzing Wearable, Sensor, and Distributional Data

    Authors: Srikar Katta, Harsh Parikh, Cynthia Rudin, Alexander Volfovsky

    Abstract: Many modern causal questions ask how treatments affect complex outcomes that are measured using wearable devices and sensors. Current analysis approaches require summarizing these data into scalar statistics (e.g., the mean), but these summaries can be misleading. For example, disparate distributions can have the same means, variances, and other statistics. Researchers can overcome the loss of inf…

    Submitted 20 March, 2024; v1 submitted 16 December, 2023; originally announced December 2023.

  10. arXiv:2312.10056  [pdf, other]

    eess.SP cs.LG

    ProtoEEGNet: An Interpretable Approach for Detecting Interictal Epileptiform Discharges

    Authors: Dennis Tang, Frank Willard, Ronan Tegerdine, Luke Triplett, Jon Donnelly, Luke Moffett, Lesia Semenova, Alina Jade Barnett, Jin Jing, Cynthia Rudin, Brandon Westover

    Abstract: In electroencephalogram (EEG) recordings, the presence of interictal epileptiform discharges (IEDs) serves as a critical biomarker for seizures or seizure-like events. Detecting IEDs can be difficult; even highly trained experts disagree on the same sample. As a result, specialists have turned to machine-learning models for assistance. However, many existing models are black boxes and do not provid…

    Submitted 3 December, 2023; originally announced December 2023.

    Comments: 11 pages, 4 figures

  11. arXiv:2312.02300  [pdf]

    cs.LG eess.SP

    Reconsideration on evaluation of machine learning models in continuous monitoring using wearables

    Authors: Cheng Ding, Zhicheng Guo, Cynthia Rudin, Ran Xiao, Fadi B Nahab, Xiao Hu

    Abstract: This paper explores the challenges in evaluating machine learning (ML) models for continuous health monitoring using wearable devices beyond conventional metrics. We state the complexities posed by real-world variability, disease dynamics, user-specific characteristics, and the prevalence of false notifications, necessitating novel evaluation strategies. Drawing insights from large-scale heart stu…

    Submitted 4 December, 2023; originally announced December 2023.

  12. arXiv:2311.13015  [pdf, other]

    cs.LG cs.CY

    Fast and Interpretable Mortality Risk Scores for Critical Care Patients

    Authors: Chloe Qinyu Zhu, Muhang Tian, Lesia Semenova, Jiachang Liu, Jack Xu, Joseph Scarpa, Cynthia Rudin

    Abstract: Prediction of mortality in intensive care unit (ICU) patients is an important task in critical care medicine. Prior work in creating mortality risk models falls into two major categories: domain-expert-created scoring systems, and black box machine learning (ML) models. Both of these have disadvantages: black box models are unacceptable for use in hospitals, whereas manual creation of models (incl…

    Submitted 21 November, 2023; originally announced November 2023.

  13. arXiv:2310.19726  [pdf, other]

    cs.LG cs.AI stat.ML

    A Path to Simpler Models Starts With Noise

    Authors: Lesia Semenova, Harry Chen, Ronald Parr, Cynthia Rudin

    Abstract: The Rashomon set is the set of models that perform approximately equally well on a given dataset, and the Rashomon ratio is the fraction of all models in a given hypothesis space that are in the Rashomon set. Rashomon ratios are often large for tabular datasets in criminal justice, healthcare, lending, education, and in other areas, which has practical implications about whether simpler models can…

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: NeurIPS 2023

  14. arXiv:2310.18589  [pdf, other]

    cs.CV

    This Looks Like Those: Illuminating Prototypical Concepts Using Multiple Visualizations

    Authors: Chiyu Ma, Brandon Zhao, Chaofan Chen, Cynthia Rudin

    Abstract: We present ProtoConcepts, a method for interpretable image classification combining deep learning and case-based reasoning using prototypical parts. Existing work in prototype-based image classification uses a "this looks like that" reasoning process, which dissects a test image by finding prototypical parts and combining evidence from these prototypes to make a final classification. However, al…

    Submitted 28 October, 2023; originally announced October 2023.

  15. arXiv:2310.15333  [pdf, other]

    cs.LG stat.AP stat.ME

    Safe and Interpretable Estimation of Optimal Treatment Regimes

    Authors: Harsh Parikh, Quinn Lanners, Zade Akras, Sahar F. Zafar, M. Brandon Westover, Cynthia Rudin, Alexander Volfovsky

    Abstract: Recent statistical and reinforcement learning methods have significantly advanced patient care strategies. However, these approaches face substantial challenges in high-stakes contexts, including missing data, inherent stochasticity, and the critical requirements for interpretability and patient safety. Our work operationalizes a safe and interpretable framework to identify optimal treatment regim…

    Submitted 1 April, 2024; v1 submitted 23 October, 2023; originally announced October 2023.

    Comments: Accepted for publication in the proceedings of AISTATS 2025

  16. arXiv:2310.12869  [pdf, other]

    cs.SD eess.AS physics.app-ph physics.data-an

    Uncertainty Quantification of Bandgaps in Acoustic Metamaterials with Stochastic Geometric Defects and Material Properties

    Authors: Han Zhang, Rayehe Karimi Mahabadi, Cynthia Rudin, Johann Guilleminot, L. Catherine Brinson

    Abstract: This paper studies the utility of techniques within uncertainty quantification, namely spectral projection and polynomial chaos expansion, in reducing sampling needs for characterizing acoustic metamaterial dispersion band responses given stochastic material properties and geometric defects. A novel method of encoding geometric defects in an interpretable, resolution independent is showcased in th…

    Submitted 19 October, 2023; originally announced October 2023.

  17. arXiv:2310.09203  [pdf, other]

    cs.LG cs.AI

    SiamAF: Learning Shared Information from ECG and PPG Signals for Robust Atrial Fibrillation Detection

    Authors: Zhicheng Guo, Cheng Ding, Duc H. Do, Amit Shah, Randall J. Lee, Xiao Hu, Cynthia Rudin

    Abstract: Atrial fibrillation (AF) is the most common type of cardiac arrhythmia. It is associated with an increased risk of stroke, heart failure, and other cardiovascular complications, but can be clinically silent. Passive AF monitoring with wearables may help reduce adverse clinical outcomes related to AF. Detecting AF in noisy wearable data poses a significant challenge, leading to the emergence of var…

    Submitted 8 March, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

  18. arXiv:2309.13775  [pdf, other]

    cs.LG q-bio.GN stat.ML

    The Rashomon Importance Distribution: Getting RID of Unstable, Single Model-based Variable Importance

    Authors: Jon Donnelly, Srikar Katta, Cynthia Rudin, Edward P. Browne

    Abstract: Quantifying variable importance is essential for answering high-stakes questions in fields like genetics, public policy, and medicine. Current methods generally calculate variable importance for a given model trained on a given dataset. However, for a given dataset, there may be many models that explain the target outcome equally well; without accounting for all possible explanations, different re…

    Submitted 1 April, 2024; v1 submitted 24 September, 2023; originally announced September 2023.

    Comments: Appeared in NeurIPS 2023 as a spotlight paper

  19. arXiv:2307.05385  [pdf, other]

    eess.SP cs.AI cs.LG

    Learned Kernels for Sparse, Interpretable, and Efficient Medical Time Series Processing

    Authors: Sully F. Chen, Zhicheng Guo, Cheng Ding, Xiao Hu, Cynthia Rudin

    Abstract: Background: Rapid, reliable, and accurate interpretation of medical signals is crucial for high-stakes clinical decision-making. The advent of deep learning allowed for an explosion of new models that offered unprecedented performance in medical time series processing but at a cost: deep learning models are often compute-intensive and lack interpretability. Methods: We propose Sparse Mixture of…

    Submitted 2 April, 2024; v1 submitted 6 July, 2023; originally announced July 2023.

    Comments: 26 pages, 9 figures

    Journal ref: Nature Machine Intelligence, 2024

  20. arXiv:2307.05339  [pdf, other]

    eess.SP cs.LG

    A Self-Supervised Algorithm for Denoising Photoplethysmography Signals for Heart Rate Estimation from Wearables

    Authors: Pranay Jain, Cheng Ding, Cynthia Rudin, Xiao Hu

    Abstract: Smart watches and other wearable devices are equipped with photoplethysmography (PPG) sensors for monitoring heart rate and other aspects of cardiovascular health. However, PPG signals collected from such devices are susceptible to corruption from noise and motion artifacts, which cause errors in heart rate estimation. Typical denoising approaches filter or reconstruct the signal in ways that elim…

    Submitted 7 July, 2023; originally announced July 2023.

    Comments: 13 pages, 6 figures

  21. arXiv:2307.01449  [pdf, other]

    stat.ME cs.AI cs.LG econ.EM

    A Double Machine Learning Approach to Combining Experimental and Observational Data

    Authors: Harsh Parikh, Marco Morucci, Vittorio Orlandi, Sudeepa Roy, Cynthia Rudin, Alexander Volfovsky

    Abstract: Experimental and observational studies often lack validity due to untestable assumptions. We propose a double machine learning approach to combine experimental and observational studies, allowing practitioners to test for assumption violations and estimate treatment effects consistently. Our framework tests for violations of external validity and ignorability under milder assumptions. When only on…

    Submitted 2 April, 2024; v1 submitted 3 July, 2023; originally announced July 2023.

  22. arXiv:2304.11749  [pdf, other]

    cs.LG

    Missing Values and Imputation in Healthcare Data: Can Interpretable Machine Learning Help?

    Authors: Zhi Chen, Sarah Tan, Urszula Chajewska, Cynthia Rudin, Rich Caruana

    Abstract: Missing values are a fundamental problem in data science. Many datasets have missing values that must be properly handled because the way missing values are treated can have a large impact on the resulting machine learning model. In medical applications, the consequences may affect healthcare decisions. There are many methods in the literature for dealing with missing values, including state-of-the-…

    Submitted 23 April, 2023; originally announced April 2023.

    Comments: Preprint of a paper accepted by CHIL 2023

  23. arXiv:2304.06686  [pdf, other]

    cs.LG stat.ML

    OKRidge: Scalable Optimal k-Sparse Ridge Regression

    Authors: Jiachang Liu, Sam Rosen, Chudi Zhong, Cynthia Rudin

    Abstract: We consider an important problem in scientific discovery, namely identifying sparse governing equations for nonlinear dynamical systems. This involves solving sparse ridge regression problems to provable optimality in order to determine which terms drive the underlying dynamics. We propose a fast algorithm, OKRidge, for sparse ridge regression, using a novel lower bound calculation involving, firs…

    Submitted 11 January, 2024; v1 submitted 13 April, 2023; originally announced April 2023.

    Comments: NeurIPS 2023 Spotlight

  24. arXiv:2303.16047  [pdf, other]

    cs.LG cs.AI stat.ML

    Exploring and Interacting with the Set of Good Sparse Generalized Additive Models

    Authors: Chudi Zhong, Zhi Chen, Jiachang Liu, Margo Seltzer, Cynthia Rudin

    Abstract: In real applications, interaction between machine learning models and domain experts is critical; however, the classical machine learning paradigm that usually produces only a single model does not facilitate such interaction. Approximating and exploring the Rashomon set, i.e., the set of all near-optimal models, addresses this practical challenge by providing the user with a searchable space cont…

    Submitted 17 November, 2023; v1 submitted 28 March, 2023; originally announced March 2023.

    Comments: NeurIPS 2023

  25. arXiv:2302.11715  [pdf, other]

    stat.ME cs.LG econ.EM

    Variable Importance Matching for Causal Inference

    Authors: Quinn Lanners, Harsh Parikh, Alexander Volfovsky, Cynthia Rudin, David Page

    Abstract: Our goal is to produce methods for observational causal inference that are auditable, easy to troubleshoot, accurate for treatment effect estimation, and scalable to high-dimensional data. We describe a general framework called Model-to-Match that achieves these goals by (i) learning a distance metric via outcome modeling, (ii) creating matched groups using the distance metric, and (iii) using the…

    Submitted 28 June, 2023; v1 submitted 22 February, 2023; originally announced February 2023.

    Journal ref: Proceedings of the Thirty-Ninth Conference on Uncertainty in Artificial Intelligence, PMLR 216:1174-1184, 2023

  26. arXiv:2211.14980  [pdf, other]

    cs.LG

    Optimal Sparse Regression Trees

    Authors: Rui Zhang, Rui Xin, Margo Seltzer, Cynthia Rudin

    Abstract: Regression trees are one of the oldest forms of AI models, and their predictions can be made without a calculator, which makes them broadly useful, particularly for high-stakes applications. Within the large literature on regression trees, there has been little effort towards full provable optimization, mainly due to the computational hardness of the problem. This work proposes a dynamic-programmi…

    Submitted 9 April, 2023; v1 submitted 27 November, 2022; originally announced November 2022.

    Comments: AAAI 2023, final archival version

  27. arXiv:2211.05207  [pdf, other]

    cs.CV cs.AI cs.LG

    Interpretable Machine Learning System to EEG Patterns on the Ictal-Interictal-Injury Continuum

    Authors: Alina Jade Barnett, Zhicheng Guo, Jin Jing, Wendong Ge, Cynthia Rudin, M. Brandon Westover

    Abstract: In intensive care units (ICUs), critically ill patients are monitored with electroencephalograms (EEGs) to prevent serious brain injury. The number of patients who can be monitored is constrained by the availability of trained physicians to read EEGs, and EEG interpretation can be subjective and prone to inter-observer variability. Automated deep learning systems for EEG could reduce human bias an…

    Submitted 11 April, 2023; v1 submitted 9 November, 2022; originally announced November 2022.

    Comments: 20 pages including appendices, 7 figures, submitted for peer review

    ACM Class: I.2.6; I.4.9; I.5.4

  28. arXiv:2210.06825  [pdf, other]

    cs.LG cs.AI

    Fast Optimization of Weighted Sparse Decision Trees for use in Optimal Treatment Regimes and Optimal Policy Design

    Authors: Ali Behrouz, Mathias Lecuyer, Cynthia Rudin, Margo Seltzer

    Abstract: Sparse decision trees are one of the most common forms of interpretable models. While recent advances have produced algorithms that fully optimize sparse decision trees for prediction, that work does not address policy design, because the algorithms cannot handle weighted data samples. Specifically, they rely on the discreteness of the loss function, which means that real-valued weights cannot be…

    Submitted 25 October, 2022; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: Advances in Interpretable Machine Learning, AIMLAI 2022. arXiv admin note: text overlap with arXiv:2112.00798

  29. arXiv:2210.05846  [pdf, other]

    cs.LG

    FasterRisk: Fast and Accurate Interpretable Risk Scores

    Authors: Jiachang Liu, Chudi Zhong, Boxuan Li, Margo Seltzer, Cynthia Rudin

    Abstract: Over the last century, risk scores have been the most popular form of predictive model used in healthcare and criminal justice. Risk scores are sparse linear models with integer coefficients; often these models can be memorized or placed on an index card. Typically, risk scores have been created either without data or by rounding logistic regression coefficients, but these methods do not reliably…

    Submitted 11 October, 2022; originally announced October 2022.

    Comments: NeurIPS 2022

  30. TimberTrek: Exploring and Curating Sparse Decision Trees with Interactive Visualization

    Authors: Zijie J. Wang, Chudi Zhong, Rui Xin, Takuya Takagi, Zhi Chen, Duen Horng Chau, Cynthia Rudin, Margo Seltzer

    Abstract: Given thousands of equally accurate machine learning (ML) models, how can users choose among them? A recent ML technique enables domain experts and data scientists to generate a complete Rashomon set for sparse decision trees--a huge set of almost-optimal interpretable ML models. To help ML practitioners identify models with desirable properties from this Rashomon set, we develop TimberTrek, the f…

    Submitted 19 September, 2022; originally announced September 2022.

    Comments: Accepted at IEEE VIS 2022. 5 pages, 6 figures. For a demo video, see https://youtu.be/3eGqTmsStJM. For a live demo, visit https://poloclub.github.io/timbertrek

  31. arXiv:2209.08040  [pdf, other]

    cs.LG cs.AI

    Exploring the Whole Rashomon Set of Sparse Decision Trees

    Authors: Rui Xin, Chudi Zhong, Zhi Chen, Takuya Takagi, Margo Seltzer, Cynthia Rudin

    Abstract: In any given machine learning problem, there may be many models that could explain the data almost equally well. However, most learning algorithms return only one of these models, leaving practitioners with no practical way to explore alternative models that might have desirable properties beyond what could be expressed within a loss function. The Rashomon set is the set of these all almost-optima…

    Submitted 25 October, 2022; v1 submitted 16 September, 2022; originally announced September 2022.

    Comments: NeurIPS 2022 (Oral)

  32. arXiv:2206.04266  [pdf, other]

    cs.LG

    There is no Accuracy-Interpretability Tradeoff in Reinforcement Learning for Mazes

    Authors: Yishay Mansour, Michal Moshkovitz, Cynthia Rudin

    Abstract: Interpretability is an essential building block for trustworthiness in reinforcement learning systems. However, interpretability might come at the cost of deteriorated performance, leading many researchers to build complex models. Our goal is to analyze the cost of interpretability. We show that in certain cases, one can achieve policy interpretability while maintaining its optimality. We focus on…

    Submitted 9 June, 2022; originally announced June 2022.

  33. arXiv:2204.10926  [pdf, other]

    cs.CV cs.AI cs.LG

    SegDiscover: Visual Concept Discovery via Unsupervised Semantic Segmentation

    Authors: Haiyang Huang, Zhi Chen, Cynthia Rudin

    Abstract: Visual concept discovery has long been deemed important to improve interpretability of neural networks, because a bank of semantically meaningful concepts would provide us with a starting point for building machine learning models that exhibit an intelligible reasoning process. Previous methods have disadvantages: either they rely on labelled support sets that incorporate human biases for objects tha…

    Submitted 22 April, 2022; originally announced April 2022.

  34. Effects of Epileptiform Activity on Discharge Outcome in Critically Ill Patients

    Authors: Harsh Parikh, Kentaro Hoffman, Haoqi Sun, Wendong Ge, Jin Jing, Rajesh Amerineni, Lin Liu, Jimeng Sun, Sahar Zafar, Aaron Struck, Alexander Volfovsky, Cynthia Rudin, M. Brandon Westover

    Abstract: Epileptiform activity (EA) is associated with worse outcomes including increased risk of disability and death. However, the effect of EA on the neurologic outcome is confounded by the feedback between treatment with anti-seizure medications (ASM) and EA burden. A randomized clinical trial is challenging due to the sequential nature of EA-ASM feedback, as well as ethical reasons. However, some mech…

    Submitted 11 March, 2023; v1 submitted 9 March, 2022; originally announced March 2022.

    Comments: 4 Figures

  35. arXiv:2202.11389  [pdf, other]

    cs.LG stat.ML

    Fast Sparse Classification for Generalized Linear and Additive Models

    Authors: Jiachang Liu, Chudi Zhong, Margo Seltzer, Cynthia Rudin

    Abstract: We present fast classification techniques for sparse generalized linear and additive models. These techniques can handle thousands of features and thousands of observations in minutes, even in the presence of many highly correlated features. For fast sparse logistic regression, our computational speed-up over other best-subset search techniques owes to linear and quadratic surrogate cuts for the l…

    Submitted 29 October, 2022; v1 submitted 23 February, 2022; originally announced February 2022.

    Comments: AISTATS 2022

  36. arXiv:2112.00798  [pdf, other]

    cs.LG cs.AI

    Fast Sparse Decision Tree Optimization via Reference Ensembles

    Authors: Hayden McTavish, Chudi Zhong, Reto Achermann, Ilias Karimalis, Jacques Chen, Cynthia Rudin, Margo Seltzer

    Abstract: Sparse decision tree optimization has been one of the most fundamental problems in AI since its inception and is a challenge at the core of interpretable machine learning. Sparse decision tree optimization is computationally hard, and despite steady effort since the 1960s, breakthroughs have only been made on the problem within the past few years, primarily on the problem of finding optimal spars…

    Submitted 5 July, 2022; v1 submitted 1 December, 2021; originally announced December 2021.

    Comments: AAAI 2022

  37. arXiv:2111.05949  [pdf, other]

    cs.LG cs.CE physics.app-ph

    How to See Hidden Patterns in Metamaterials with Interpretable Machine Learning

    Authors: Zhi Chen, Alexander Ogren, Chiara Daraio, L. Catherine Brinson, Cynthia Rudin

    Abstract: Machine learning models can assist with metamaterials design by approximating computationally expensive simulators or solving inverse design problems. However, past work has usually relied on black box deep neural networks, whose reasoning processes are opaque and require enormous datasets that are expensive to obtain. In this work, we develop two novel machine learning approaches to metamaterials…

    Submitted 1 October, 2022; v1 submitted 10 November, 2021; originally announced November 2021.

    Comments: Accepted to Extreme Mechanics Letters, 2022

  38. arXiv:2109.07623  [pdf, other]

    cs.SD cs.LG eess.AS stat.ML

    BacHMMachine: An Interpretable and Scalable Model for Algorithmic Harmonization for Four-part Baroque Chorales

    Authors: Yunyao Zhu, Stephen Hahn, Simon Mak, Yue Jiang, Cynthia Rudin

    Abstract: Algorithmic harmonization - the automated harmonization of a musical piece given its melodic line - is a challenging problem that has garnered much interest from both music theorists and computer scientists. One genre of particular interest is the four-part Baroque chorales of J.S. Bach. Methods for algorithmic chorale harmonization typically adopt a black-box, "data-driven" approach: they do not…

    Submitted 22 February, 2022; v1 submitted 15 September, 2021; originally announced September 2021.

    Comments: 7 pages, 7 figures

  39. arXiv:2107.05605  [pdf, other]

    cs.CV cs.LG

    Interpretable Mammographic Image Classification using Case-Based Reasoning and Deep Learning

    Authors: Alina Jade Barnett, Fides Regina Schwartz, Chaofan Tao, Chaofan Chen, Yinhao Ren, Joseph Y. Lo, Cynthia Rudin

    Abstract: When we deploy machine learning models in high-stakes medical settings, we must ensure these models make accurate predictions that are consistent with known medical science. Inherently interpretable networks address this need by explaining the rationale behind each decision while maintaining equal or higher accuracy compared to black-box models. In this work, we present a novel interpretable neura…

    Submitted 4 October, 2021; v1 submitted 12 July, 2021; originally announced July 2021.

    Comments: 10 pages, 6 figures, accepted for oral presentation at the IJCAI-21 Workshop on Deep Learning, Case-Based Reasoning, and AutoML: Present and Future Synergies. arXiv admin note: substantial text overlap with arXiv:2103.12308

    ACM Class: I.2.6; I.4.9; I.2.10

  40. arXiv:2106.13275  [pdf, other]

    cs.LG stat.ML

    Multitask Learning for Citation Purpose Classification

    Authors: Alex Oesterling, Angikar Ghosal, Haoyang Yu, Rui Xin, Yasa Baig, Lesia Semenova, Cynthia Rudin

    Abstract: We present our entry into the 2021 3C Shared Task Citation Context Classification based on Purpose competition. The goal of the competition is to classify a citation in a scientific article based on its purpose. This task is important because it could potentially lead to more comprehensive ways of summarizing the purpose and uses of scientific articles, but it is also difficult, mainly due to the…

    Submitted 24 June, 2021; originally announced June 2021.

    Comments: Second Workshop on Scholarly Document Processing

    Journal ref: Proceedings of the Second Workshop on Scholarly Document Processing, 2021

  41. arXiv:2106.02605  [pdf, other]

    cs.LG

    A Holistic Approach to Interpretability in Financial Lending: Models, Visualizations, and Summary-Explanations

    Authors: Chaofan Chen, Kangcheng Lin, Cynthia Rudin, Yaron Shaposhnik, Sijia Wang, Tong Wang

    Abstract: Lending decisions are usually made with proprietary models that provide minimally acceptable explanations to users. In a future world without such secrecy, what decision support tools would one want to use for justified lending decisions? This question is timely, since the economy has dramatically shifted due to a pandemic, and a massive number of new loans will be necessary in the short term. We…

    Submitted 4 June, 2021; originally announced June 2021.

  42. arXiv:2105.05885  [pdf, other]

    cs.CL cs.AI cs.LG

    Playing Codenames with Language Graphs and Word Embeddings

    Authors: Divya Koyyalagunta, Anna Sun, Rachel Lea Draelos, Cynthia Rudin

    Abstract: Although board games and video games have been studied for decades in artificial intelligence research, challenging word games remain relatively unexplored. Word games are not as constrained as games like chess or poker. Instead, word game strategy is defined by the players' understanding of the way words relate to each other. The word game Codenames provides a unique opportunity to investigate co…

    Submitted 12 May, 2021; originally announced May 2021.

    Comments: Divya Koyyalagunta and Anna Sun contributed equally to this work. This is an arXiv version of the paper that has been accepted for publication in the Journal of Artificial Intelligence Research (JAIR)

  43. Ethical Implementation of Artificial Intelligence to Select Embryos in In Vitro Fertilization

    Authors: Michael Anis Mihdi Afnan, Cynthia Rudin, Vincent Conitzer, Julian Savulescu, Abhishek Mishra, Yanhe Liu, Masoud Afnan

    Abstract: AI has the potential to revolutionize many areas of healthcare. Radiology, dermatology, and ophthalmology are some of the areas most likely to be impacted in the near future, and they have received significant attention from the broader research community. But AI techniques are now also starting to be used in in vitro fertilization (IVF), in particular for selecting which embryos to transfer to th…

    Submitted 30 April, 2021; originally announced May 2021.

    Journal ref: AIES 2021

  44. arXiv:2103.12308  [pdf, other]

    cs.LG cs.AI cs.CV

    IAIA-BL: A Case-based Interpretable Deep Learning Model for Classification of Mass Lesions in Digital Mammography

    Authors: Alina Jade Barnett, Fides Regina Schwartz, Chaofan Tao, Chaofan Chen, Yinhao Ren, Joseph Y. Lo, Cynthia Rudin

    Abstract: Interpretability in machine learning models is important in high-stakes decisions, such as whether to order a biopsy based on a mammographic exam. Mammography poses important challenges that are not present in other computer vision tasks: datasets are small, confounding information is present, and it can be difficult even for a radiologist to decide between watchful waiting and biopsy based on a m…

    Submitted 23 March, 2021; originally announced March 2021.

    Comments: 24 pages, 5 figures, 2 tables

    ACM Class: I.2.6; I.4.9; I.2.10

  45. arXiv:2103.11251  [pdf, other]

    cs.LG stat.ML

    Interpretable Machine Learning: Fundamental Principles and 10 Grand Challenges

    Authors: Cynthia Rudin, Chaofan Chen, Zhi Chen, Haiyang Huang, Lesia Semenova, Chudi Zhong

    Abstract: Interpretability in machine learning (ML) is crucial for high stakes decisions and troubleshooting. In this work, we provide fundamental principles for interpretable ML, and dispel common misunderstandings that dilute the importance of this crucial topic. We also identify 10 technical challenge areas in interpretable machine learning and provide history and background on each problem. Some of thes…

    Submitted 9 July, 2021; v1 submitted 20 March, 2021; originally announced March 2021.

    MSC Class: 68T01 ACM Class: I.2.6

    Journal ref: Statistics Surveys, 2021

  46. arXiv:2103.03775  [pdf, other]

    cs.CL

    There Once Was a Really Bad Poet, It Was Automated but You Didn't Know It

    Authors: Jianyou Wang, Xiaoxuan Zhang, Yuren Zhou, Christopher Suh, Cynthia Rudin

    Abstract: Limerick generation exemplifies some of the most difficult challenges faced in poetry generation, as the poems must tell a story in only five lines, with constraints on rhyme, stress, and meter. To address these challenges, we introduce LimGen, a novel and fully automated system for limerick generation that outperforms state-of-the-art neural network-based poetry models, as well as prior rule-base…

    Submitted 5 March, 2021; originally announced March 2021.

    Comments: Paper accepted and will be published at TACL (Transactions of the Association for Computational Linguistics) 2021

  47. arXiv:2101.01867  [pdf, other]

    cs.LG cs.MS

    dame-flame: A Python Library Providing Fast Interpretable Matching for Causal Inference

    Authors: Neha R. Gupta, Vittorio Orlandi, Chia-Rui Chang, Tianyu Wang, Marco Morucci, Pritam Dey, Thomas J. Howell, Xian Sun, Angikar Ghosal, Sudeepa Roy, Cynthia Rudin, Alexander Volfovsky

    Abstract: dame-flame is a Python package for performing matching for observational causal inference on datasets containing discrete covariates. This package implements the Dynamic Almost Matching Exactly (DAME) and Fast Large-Scale Almost Matching Exactly (FLAME) algorithms, which match treatment and control units on subsets of the covariates. The resulting matched groups are interpretable, because the matc…

    Submitted 2 April, 2023; v1 submitted 5 January, 2021; originally announced January 2021.

    Comments: 26 pages, 2 figures

  48. arXiv:2012.04456  [pdf, other]

    cs.LG stat.ML

    Understanding How Dimension Reduction Tools Work: An Empirical Approach to Deciphering t-SNE, UMAP, TriMAP, and PaCMAP for Data Visualization

    Authors: Yingfan Wang, Haiyang Huang, Cynthia Rudin, Yaron Shaposhnik

    Abstract: Dimension reduction (DR) techniques such as t-SNE, UMAP, and TriMAP have demonstrated impressive visualization performance on many real world datasets. One tension that has always faced these methods is the trade-off between preservation of global structure and preservation of local structure: these methods can either handle one or the other, but not both. In this work, our main goal is to underst…

    Submitted 24 August, 2021; v1 submitted 8 December, 2020; originally announced December 2020.

    Journal ref: Journal of Machine Learning Research 22(2021) 1-73

  49. arXiv:2011.11020  [pdf, other]

    eess.IV cs.CV cs.LG q-bio.BM

    Cryo-ZSSR: multiple-image super-resolution based on deep internal learning

    Authors: Qinwen Huang, Ye Zhou, Xiaochen Du, Reed Chen, Jianyou Wang, Cynthia Rudin, Alberto Bartesaghi

    Abstract: Single-particle cryo-electron microscopy (cryo-EM) is an emerging imaging modality capable of visualizing proteins and macro-molecular complexes at near-atomic resolution. The low electron-doses used to prevent sample radiation damage result in images where the power of the noise is 100 times greater than the power of the signal. To overcome the low-SNRs, hundreds of thousands of particle project…

    Submitted 22 November, 2020; originally announced November 2020.

    Comments: 11 pages, 4 figures

  50. arXiv:2007.08703  [pdf, other]

    cs.LG stat.ML

    Bandits for BMO Functions

    Authors: Tianyu Wang, Cynthia Rudin

    Abstract: We study the bandit problem where the underlying expected reward is a Bounded Mean Oscillation (BMO) function. BMO functions are allowed to be discontinuous and unbounded, and are useful in modeling signals with infinities in the domain. We develop a toolset for BMO bandits, and provide an algorithm that can achieve poly-log $δ$-regret -- a regret measured against an arm that is optimal after rem…

    Submitted 16 July, 2020; originally announced July 2020.