Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 104 results for author: Hardt, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.13882  [pdf, other

    cs.LG cs.CY econ.TH

    Allocation Requires Prediction Only if Inequality Is Low

    Authors: Ali Shirali, Rediet Abebe, Moritz Hardt

    Abstract: Algorithmic predictions are emerging as a promising solution concept for efficiently allocating societal resources. Fueling their use is an underlying assumption that such systems are necessary to identify individuals for interventions. We propose a principled framework for assessing this assumption: Using a simple mathematical model, we evaluate the efficacy of prediction-based allocations in set… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: Appeared in Forty-first International Conference on Machine Learning (ICML), 2024

  2. arXiv:2406.03422  [pdf, other

    cs.GT

    Causal Inference from Competing Treatments

    Authors: Ana-Andreea Stoica, Vivian Y. Nastl, Moritz Hardt

    Abstract: Many applications of RCTs involve the presence of multiple treatment administrators -- from field experiments to online advertising -- that compete for the subjects' attention. In the face of competition, estimating a causal effect becomes difficult, as the position at which a subject sees a treatment influences their response, and thus the treatment effect. In this paper, we build a game-theoreti… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 37 pages, 3 figures, accepted at ICML'24

  3. arXiv:2405.19073  [pdf, other

    cs.CY cs.IR

    An engine not a camera: Measuring performative power of online search

    Authors: Celestine Mendler-Dünner, Gabriele Carovano, Moritz Hardt

    Abstract: The power of digital platforms is at the center of major ongoing policy and regulatory efforts. To advance existing debates, we designed and executed an experiment to measure the power of online search providers, building on the recent definition of performative power. Instantiated in our setting, performative power quantifies the ability of a search engine to steer web traffic by rearranging resu… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  4. arXiv:2405.01719  [pdf, other

    cs.LG

    Inherent Trade-Offs between Diversity and Stability in Multi-Task Benchmarks

    Authors: Guanhua Zhang, Moritz Hardt

    Abstract: We examine multi-task benchmarks in machine learning through the lens of social choice theory. We draw an analogy between benchmarks and electoral systems, where models are candidates and tasks are voters. This suggests a distinction between cardinal and ordinal benchmark systems. The former aggregate numerical scores into one model ranking; the latter aggregate rankings for each task. We apply Ar… ▽ More

    Submitted 6 May, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

    Comments: To be published in ICML 2024

  5. arXiv:2404.02112  [pdf, other

    cs.LG cs.CV

    ImageNot: A contrast with ImageNet preserves model rankings

    Authors: Olawale Salaudeen, Moritz Hardt

    Abstract: We introduce ImageNot, a dataset designed to match the scale of ImageNet while differing drastically in other aspects. We show that key model architectures developed for ImageNet over the years rank identically when trained and evaluated on ImageNot to how they rank on ImageNet. This is true when training models from scratch or fine-tuning them. Moreover, the relative improvements of each model ov… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

  6. arXiv:2402.09891  [pdf, other

    cs.LG stat.ML

    Predictors from causal features do not generalize better to new domains

    Authors: Vivian Y. Nastl, Moritz Hardt

    Abstract: We study how well machine learning models trained on causal features generalize across domains. We consider 16 prediction tasks on tabular datasets covering applications in health, employment, education, social benefits, and politics. Each dataset comes with multiple domains, allowing us to test how well a model trained in one domain performs in another. For each prediction task, we select feature… ▽ More

    Submitted 15 February, 2024; originally announced February 2024.

    Comments: 13 pages, 7 figures

  7. arXiv:2402.02249  [pdf, other

    cs.LG

    Don't Label Twice: Quantity Beats Quality when Comparing Binary Classifiers on a Budget

    Authors: Florian E. Dorner, Moritz Hardt

    Abstract: We study how to best spend a budget of noisy labels to compare the accuracy of two binary classifiers. It's common practice to collect and aggregate multiple noisy labels for a given data point into a less noisy label via a majority vote. We prove a theorem that runs counter to conventional wisdom. If the goal is to identify the better of two classifiers, we show it's best to spend the budget on c… ▽ More

    Submitted 3 February, 2024; originally announced February 2024.

    Comments: 34 pages, 3 Figures

  8. arXiv:2311.04806  [pdf, other

    cs.DC cs.LG

    The PetShop Dataset -- Finding Causes of Performance Issues across Microservices

    Authors: Michaela Hardt, William R. Orchard, Patrick Blöbaum, Shiva Kasiviswanathan, Elke Kirschbaum

    Abstract: Identifying root causes for unexpected or undesirable behavior in complex systems is a prevalent challenge. This issue becomes especially crucial in modern cloud applications that employ numerous microservices. Although the machine learning and systems research communities have proposed various techniques to tackle this problem, there is currently a lack of standardized datasets for quantitative b… ▽ More

    Submitted 8 April, 2024; v1 submitted 8 November, 2023; originally announced November 2023.

    Comments: 22 pages, 6 figures, 10 tables, for associated git repo see https://github.com/amazon-science/petshop-root-cause-analysis/, to be published in Proceedings of Machine Learning Research vol 236, 2024, 3rd Conference on Causal Learning and Reasoning

    ACM Class: E.0

  9. arXiv:2310.16608  [pdf, other

    cs.LG

    Performative Prediction: Past and Future

    Authors: Moritz Hardt, Celestine Mendler-Dünner

    Abstract: Predictions in the social world generally influence the target of prediction, a phenomenon known as performativity. Self-fulfilling and self-negating predictions are examples of performativity. Of fundamental importance to economics, finance, and the social sciences, the notion has been absent from the development of machine learning. In machine learning applications, performativity often surfaces… ▽ More

    Submitted 25 October, 2023; originally announced October 2023.

  10. arXiv:2306.15769  [pdf, other

    cs.LG cs.CV

    What Makes ImageNet Look Unlike LAION

    Authors: Ali Shirali, Moritz Hardt

    Abstract: ImageNet was famously created from Flickr image search results. What if we recreated ImageNet instead by searching the massive LAION dataset based on image captions alone? In this work, we carry out this counterfactual investigation. We find that the resulting ImageNet recreation, which we call LAIONet, looks distinctly unlike the original. Specifically, the intra-class similarity of images in the… ▽ More

    Submitted 27 June, 2023; originally announced June 2023.

  11. arXiv:2306.07951  [pdf, other

    cs.CL

    Questioning the Survey Responses of Large Language Models

    Authors: Ricardo Dominguez-Olmedo, Moritz Hardt, Celestine Mendler-Dünner

    Abstract: As large language models increase in capability, researchers have started to conduct surveys of all kinds on these models in order to investigate the population represented by their responses. In this work, we critically examine language models' survey responses on the basis of the well-established American Community Survey by the U.S. Census Bureau and investigate whether they elicit a faithful r… ▽ More

    Submitted 28 February, 2024; v1 submitted 13 June, 2023; originally announced June 2023.

  12. arXiv:2306.07261  [pdf, other

    cs.LG cs.CY

    Unprocessing Seven Years of Algorithmic Fairness

    Authors: André F. Cruz, Moritz Hardt

    Abstract: Seven years ago, researchers proposed a postprocessing method to equalize the error rates of a model across different demographic groups. The work launched hundreds of papers purporting to improve over the postprocessing baseline. We empirically evaluate these claims through thousands of model evaluations on several tabular datasets. We find that the fairness-accuracy Pareto frontier achieved by p… ▽ More

    Submitted 15 March, 2024; v1 submitted 12 June, 2023; originally announced June 2023.

    Journal ref: ICLR 2024

  13. arXiv:2305.18466  [pdf, other

    cs.CL cs.LG

    Test-Time Training on Nearest Neighbors for Large Language Models

    Authors: Moritz Hardt, Yu Sun

    Abstract: Many recent efforts augment language models with retrieval, by adding retrieved data to the input context. For this approach to succeed, the retrieved data must be added at both training and test time. Moreover, as input length grows linearly with the size of retrieved data, cost in computation and memory grows quadratically for modern Transformers. To avoid these complications, we simply fine-tun… ▽ More

    Submitted 2 February, 2024; v1 submitted 29 May, 2023; originally announced May 2023.

    Comments: ICLR final version

  14. arXiv:2305.09565  [pdf, other

    stat.ML cs.LG

    Toward Falsifying Causal Graphs Using a Permutation-Based Test

    Authors: Elias Eulig, Atalanti A. Mastakouri, Patrick Blöbaum, Michaela Hardt, Dominik Janzing

    Abstract: Understanding the causal relationships among the variables of a system is paramount to explain and control its behaviour. Inferring the causal graph from observational data without interventions, however, requires a lot of strong assumptions that are not always realistic. Even for domain experts it can be challenging to express the causal graph. Therefore, metrics that quantitatively assess the go… ▽ More

    Submitted 16 May, 2023; originally announced May 2023.

    Comments: 23 pages, 9 figures

  15. arXiv:2305.05832  [pdf, other

    cs.LG cs.AI cs.IT stat.ME

    Causal Information Splitting: Engineering Proxy Features for Robustness to Distribution Shifts

    Authors: Bijan Mazaheri, Atalanti Mastakouri, Dominik Janzing, Michaela Hardt

    Abstract: Statistical prediction models are often trained on data from different probability distributions than their eventual use cases. One approach to proactively prepare for these shifts harnesses the intuition that causal mechanisms should remain invariant between environments. Here we focus on a challenging setting in which the causal and anticausal variables of the target are unobserved. Leaning on i… ▽ More

    Submitted 31 July, 2023; v1 submitted 9 May, 2023; originally announced May 2023.

    Comments: 29th Conference on Uncertainty in Artificial Intelligence (2023)

  16. arXiv:2304.06205  [pdf, other

    cs.CY cs.LG econ.GN stat.AP

    Difficult Lessons on Social Prediction from Wisconsin Public Schools

    Authors: Juan C. Perdomo, Tolani Britton, Moritz Hardt, Rediet Abebe

    Abstract: Early warning systems (EWS) are predictive tools at the center of recent efforts to improve graduation rates in public schools across the United States. These systems assist in targeting interventions to individual students by predicting which students are at risk of dropping out. Despite significant investments in their widespread adoption, there remain large gaps in our understanding of the effi… ▽ More

    Submitted 18 September, 2023; v1 submitted 12 April, 2023; originally announced April 2023.

  17. arXiv:2302.04989  [pdf, other

    cs.LG cs.CY stat.ML

    Causal Inference out of Control: Estimating the Steerability of Consumption

    Authors: Gary Cheng, Moritz Hardt, Celestine Mendler-Dünner

    Abstract: Regulators and academics are increasingly interested in the causal effect that algorithmic actions of a digital platform have on consumption. We introduce a general causal inference problem we call the steerability of consumption that abstracts many settings of interest. Focusing on observational designs and exploiting the structure of the problem, we exhibit a set of assumptions for causal identi… ▽ More

    Submitted 9 February, 2023; originally announced February 2023.

  18. arXiv:2302.04262  [pdf, other

    cs.LG cs.GT stat.ML

    Algorithmic Collective Action in Machine Learning

    Authors: Moritz Hardt, Eric Mazumdar, Celestine Mendler-Dünner, Tijana Zrnic

    Abstract: We initiate a principled study of algorithmic collective action on digital platforms that deploy machine learning algorithms. We propose a simple theoretical model of a collective interacting with a firm's learning algorithm. The collective pools the data of participating individuals and executes an algorithmic strategy by instructing participants how to modify their own data to achieve a collecti… ▽ More

    Submitted 21 June, 2023; v1 submitted 8 February, 2023; originally announced February 2023.

    Comments: accepted at ICML 2023, camera-ready updates

  19. arXiv:2211.08667  [pdf, other

    cs.SI

    County-level Algorithmic Audit of Racial Bias in Twitter's Home Timeline

    Authors: Luca Belli, Kyra Yee, Uthaipon Tantipongpipat, Aaron Gonzales, Kristian Lum, Moritz Hardt

    Abstract: We report on the outcome of an audit of Twitter's Home Timeline ranking system. The goal of the audit was to determine if authors from some racial groups experience systematically higher impression counts for their Tweets than others. A central obstacle for any such audit is that Twitter does not ordinarily collect or associate racial information with its users, thus prohibiting an analysis at the… ▽ More

    Submitted 10 February, 2023; v1 submitted 15 November, 2022; originally announced November 2022.

  20. arXiv:2210.03165  [pdf, other

    cs.LG stat.ML

    A Theory of Dynamic Benchmarks

    Authors: Ali Shirali, Rediet Abebe, Moritz Hardt

    Abstract: Dynamic benchmarks interweave model fitting and data collection in an attempt to mitigate the limitations of static benchmarks. In contrast to an extensive theoretical and empirical study of the static setting, the dynamic counterpart lags behind due to limited empirical studies and no apparent theoretical foundation to date. Responding to this deficit, we initiate a theoretical study of dynamic b… ▽ More

    Submitted 1 March, 2023; v1 submitted 6 October, 2022; originally announced October 2022.

    Comments: ICLR 2023 Version

  21. Is your model predicting the past?

    Authors: Moritz Hardt, Michael P. Kim

    Abstract: When does a machine learning model predict the future of individuals and when does it recite patterns that predate the individuals? In this work, we propose a distinction between these two pathways of prediction, supported by theoretical, empirical, and normative arguments. At the center of our proposal is a family of simple and efficient statistical tests, called backward baselines, that demonstr… ▽ More

    Submitted 10 March, 2024; v1 submitted 23 June, 2022; originally announced June 2022.

    Comments: Code available at: https://github.com/socialfoundations/backward_baselines

  22. Adversarial Scrutiny of Evidentiary Statistical Software

    Authors: Rediet Abebe, Moritz Hardt, Angela Jin, John Miller, Ludwig Schmidt, Rebecca Wexler

    Abstract: The U.S. criminal legal system increasingly relies on software output to convict and incarcerate people. In a large number of cases each year, the government makes these consequential decisions based on evidence from statistical software -- such as probabilistic genotyping, environmental audio detection, and toolmark analysis tools -- that defense counsel cannot fully cross-examine or scrutinize.… ▽ More

    Submitted 30 September, 2022; v1 submitted 18 June, 2022; originally announced June 2022.

    Comments: Typos corrected, appendix B removed

    ACM Class: K.4.1; I.2.1; G.3; D.2.5

  23. arXiv:2203.17232  [pdf, other

    cs.LG cs.CY cs.GT econ.TH

    Performative Power

    Authors: Moritz Hardt, Meena Jagadeesan, Celestine Mendler-Dünner

    Abstract: We introduce the notion of performative power, which measures the ability of a firm operating an algorithmic system, such as a digital content recommendation platform, to cause change in a population of participants. We relate performative power to the economic study of competition in digital economies. Traditional economic concepts struggle with identifying anti-competitive patterns in digital pl… ▽ More

    Submitted 3 November, 2022; v1 submitted 31 March, 2022; originally announced March 2022.

    Comments: to appear at NeurIPS 2022

  24. arXiv:2203.08074  [pdf, other

    eess.SP cs.LG

    Combining AI/ML and PHY Layer Rule Based Inference -- Some First Results

    Authors: Brenda Vilas Boas, Wolfgang Zirwas, Martin Haardt

    Abstract: In 3GPP New Radio (NR) Release 18 we see the first study item starting in May 2022, which will evaluate the potential of AI/ML methods for Radio Access Network (RAN) 1, i.e., for mobile radio PHY and MAC layer applications. We use the profiling method for accurate iterative estimation of multipath component parameters for PHY layer reference, as it promises a large channel prediction horizon. We i… ▽ More

    Submitted 14 March, 2022; originally announced March 2022.

    Comments: submitted to SPAWC 2022

  25. arXiv:2111.09831  [pdf, other

    stat.ML cs.LG

    Causal Forecasting:Generalization Bounds for Autoregressive Models

    Authors: Leena Chennuru Vankadara, Philipp Michael Faller, Michaela Hardt, Lenon Minorics, Debarghya Ghoshdastidar, Dominik Janzing

    Abstract: Despite the increasing relevance of forecasting methods, causal implications of these algorithms remain largely unexplored. This is concerning considering that, even under simplifying assumptions such as causal sufficiency, the statistical risk of a model can differ significantly from its \textit{causal risk}. Here, we study the problem of \textit{causal generalization} -- generalizing from the ob… ▽ More

    Submitted 8 September, 2022; v1 submitted 18 November, 2021; originally announced November 2021.

  26. arXiv:2111.07858  [pdf, other

    eess.SP cs.LG

    Transfer Learning Capabilities of Untrained Neural Networks for MIMO CSI Recreation

    Authors: Brenda Vilas Boas, Wolfgang Zirwas, Martin Haardt

    Abstract: Machine learning (ML) applications for wireless communications have gained momentum on the standardization discussions for 5G advanced and beyond. One of the biggest challenges for real world ML deployment is the need for labeled signals and big measurement campaigns. To overcome those problems, we propose the use of untrained neural networks (UNNs) for MIMO channel recreation/estimation and low o… ▽ More

    Submitted 15 November, 2021; originally announced November 2021.

    Comments: to be published

  27. arXiv:2111.07854  [pdf, other

    eess.SP cs.IT cs.LG

    Machine Learning for CSI Recreation Based on Prior Knowledge

    Authors: Brenda Vilas Boas, Wolfgang Zirwas, Martin Haardt

    Abstract: Knowledge of channel state information (CSI) is fundamental to many functionalities within the mobile wireless communications systems. With the advance of machine learning (ML) and digital maps, i.e., digital twins, we have a big opportunity to learn the propagation environment and design novel methods to derive and report CSI. In this work, we propose to combine untrained neural networks (UNNs) a… ▽ More

    Submitted 15 November, 2021; originally announced November 2021.

    Comments: submitted for publication

  28. Algorithmic Amplification of Politics on Twitter

    Authors: Ferenc Huszár, Sofia Ira Ktena, Conor O'Brien, Luca Belli, Andrew Schlaikjer, Moritz Hardt

    Abstract: Content on Twitter's home timeline is selected and ordered by personalization algorithms. By consistently ranking certain content higher, these algorithms may amplify some messages while reducing the visibility of others. There's been intense public and scholarly debate about the possibility that some political groups benefit more from algorithmic amplification than others. We provide quantitative… ▽ More

    Submitted 21 October, 2021; originally announced October 2021.

  29. Amazon SageMaker Clarify: Machine Learning Bias Detection and Explainability in the Cloud

    Authors: Michaela Hardt, Xiaoguang Chen, Xiaoyi Cheng, Michele Donini, Jason Gelman, Satish Gollaprolu, John He, Pedro Larroy, Xinyu Liu, Nick McCarthy, Ashish Rathi, Scott Rees, Ankit Siva, ErhYuan Tsai, Keerthan Vasist, Pinar Yilmaz, Muhammad Bilal Zafar, Sanjiv Das, Kevin Haas, Tyler Hill, Krishnaram Kenthapadi

    Abstract: Understanding the predictions made by machine learning (ML) models and their potential biases remains a challenging and labor-intensive task that depends on the application, the dataset, and the specific model. We present Amazon SageMaker Clarify, an explainability feature for Amazon SageMaker that launched in December 2020, providing insights into data and ML models by identifying biases and expl… ▽ More

    Submitted 7 September, 2021; originally announced September 2021.

    Journal ref: In Proc. ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), 2974-2983 (2021)

  30. arXiv:2108.04884  [pdf, other

    cs.LG stat.ML

    Retiring Adult: New Datasets for Fair Machine Learning

    Authors: Frances Ding, Moritz Hardt, John Miller, Ludwig Schmidt

    Abstract: Although the fairness community has recognized the importance of data, researchers in the area primarily rely on UCI Adult when it comes to tabular data. Derived from a 1994 US Census survey, this dataset has appeared in hundreds of research papers where it served as the basis for the development and comparison of many algorithmic fairness interventions. We reconstruct a superset of the UCI Adult… ▽ More

    Submitted 9 January, 2022; v1 submitted 10 August, 2021; originally announced August 2021.

  31. Causal Inference Struggles with Agency on Online Platforms

    Authors: Smitha Milli, Luca Belli, Moritz Hardt

    Abstract: Online platforms regularly conduct randomized experiments to understand how changes to the platform causally affect various outcomes of interest. However, experimentation on online platforms has been criticized for having, among other issues, a lack of meaningful oversight and user consent. As platforms give users greater agency, it becomes possible to conduct observational studies in which users… ▽ More

    Submitted 10 May, 2022; v1 submitted 19 July, 2021; originally announced July 2021.

    Comments: Accepted to FaccT'22

  32. arXiv:2106.12705  [pdf, other

    cs.LG cs.CY cs.GT econ.TH

    Alternative Microfoundations for Strategic Classification

    Authors: Meena Jagadeesan, Celestine Mendler-Dünner, Moritz Hardt

    Abstract: When reasoning about strategic behavior in a machine learning context it is tempting to combine standard microfoundations of rational agents with the statistical decision theory underlying classification. In this work, we argue that a direct combination of these standard ingredients leads to brittle solution concepts of limited descriptive and prescriptive value. First, we show that rational agent… ▽ More

    Submitted 23 June, 2021; originally announced June 2021.

    Comments: Accepted for publication at ICML 2021

  33. arXiv:2106.11633  [pdf, other

    eess.SP cs.LG

    Machine Learning for Model Order Selection in MIMO OFDM Systems

    Authors: Brenda Vilas Boas, Wolfgang Zirwas, Martin Haardt

    Abstract: A variety of wireless channel estimation methods, e.g., MUSIC and ESPRIT, rely on prior knowledge of the model order. Therefore, it is important to correctly estimate the number of multipath components (MPCs) which compose such channels. However, environments with many scatterers may generate MPCs which are closely spaced. This clustering of MPCs in addition to noise makes the model order selectio… ▽ More

    Submitted 22 June, 2021; originally announced June 2021.

    Comments: to be published

  34. arXiv:2106.09375  [pdf, other

    cs.IR cs.IT

    Recovery under Side Constraints

    Authors: Khaled Ardah, Martin Haardt, Tianyi Liu, Frederic Matter, Marius Pesavento, Marc E. Pfetsch

    Abstract: This paper addresses sparse signal reconstruction under various types of structural side constraints with applications in multi-antenna systems. Side constraints may result from prior information on the measurement system and the sparse signal structure. They may involve the structure of the sensing matrix, the structure of the non-zero support values, the temporal structure of the sparse represen… ▽ More

    Submitted 17 June, 2021; originally announced June 2021.

  35. arXiv:2103.00971  [pdf, other

    cs.IT eess.SP

    Low-Complexity Zero-Forcing Precoding for XL-MIMO Transmissions

    Authors: Lucas N. Ribeiro, Stefan Schwarz, Martin Haardt

    Abstract: Deploying antenna arrays with an asymptotically large aperture will be central to achieving the theoretical gains of massive MIMO in beyond-5G systems. Such extra-large MIMO (XL-MIMO) systems experience propagation conditions which are not typically observed in conventional massive MIMO systems, such as spatial non-stationarities and near-field propagation. Moreover, standard precoding schemes, su… ▽ More

    Submitted 1 March, 2021; originally announced March 2021.

    Comments: Submitted to Eusipco 2021

  36. arXiv:2102.05242  [pdf, other

    cs.LG stat.ML

    Patterns, predictions, and actions: A story about machine learning

    Authors: Moritz Hardt, Benjamin Recht

    Abstract: This graduate textbook on machine learning tells a story of how patterns in data support predictions and consequential actions. Starting with the foundations of decision making, we cover representation, optimization, and generalization as the constituents of supervised learning. A chapter on datasets as benchmarks examines their histories and scientific bases. Self-contained introductions to causa… ▽ More

    Submitted 26 October, 2021; v1 submitted 9 February, 2021; originally announced February 2021.

    Comments: Manuscript submitted to publisher for copy editing

  37. Two-step Machine Learning Approach for Channel Estimation with Mixed Resolution RF Chains

    Authors: Brenda Vilas Boas, Wolfgang Zirwas, Martin Haardt

    Abstract: Massive MIMO is one of the main features of 5G mobile radio systems. However, it often leads to high cost, size and power consumption. To overcome these issues, the use of constrained radio frequency (RF) frontends has been proposed, as well as novel precoders, e.g., a multi-antenna, greedy, iterative and quantized precoding algorithm (MAGIQ). Nevertheless, the best performance of MAGIQ assumes ac… ▽ More

    Submitted 24 January, 2021; originally announced January 2021.

    Comments: to be published

  38. arXiv:2009.10897  [pdf, other

    cs.LG stat.ML

    Revisiting Design Choices in Proximal Policy Optimization

    Authors: Chloe Ching-Yun Hsu, Celestine Mendler-Dünner, Moritz Hardt

    Abstract: Proximal Policy Optimization (PPO) is a popular deep policy gradient algorithm. In standard implementations, PPO regularizes policy updates with clipped probability ratios, and parameterizes policies with either continuous Gaussian distributions or discrete Softmax distributions. These design choices are widely accepted, and motivated by empirical performance comparisons on MuJoCo and Atari benchm… ▽ More

    Submitted 22 September, 2020; originally announced September 2020.

  39. arXiv:2008.12623  [pdf, other

    cs.SI cs.LG stat.ML

    From Optimizing Engagement to Measuring Value

    Authors: Smitha Milli, Luca Belli, Moritz Hardt

    Abstract: Most recommendation engines today are based on predicting user engagement, e.g. predicting whether a user will click on an item or not. However, there is potentially a large gap between engagement signals and a desired notion of "value" that is worth optimizing for. We use the framework of measurement theory to (a) confront the designer with a normative question about what the designer values, (b)… ▽ More

    Submitted 19 July, 2021; v1 submitted 20 August, 2020; originally announced August 2020.

    Comments: Published at FAccT'21

  40. arXiv:2006.06887  [pdf, other

    cs.LG cs.GT stat.ML

    Stochastic Optimization for Performative Prediction

    Authors: Celestine Mendler-Dünner, Juan C. Perdomo, Tijana Zrnic, Moritz Hardt

    Abstract: In performative prediction, the choice of a model influences the distribution of future data, typically through actions taken based on the model's predictions. We initiate the study of stochastic optimization for performative prediction. What sets this setting apart from traditional stochastic optimization is the difference between merely updating model parameters and deploying the new model. Th… ▽ More

    Submitted 19 February, 2021; v1 submitted 11 June, 2020; originally announced June 2020.

    Comments: published at NeurIPS 2020

  41. arXiv:2003.06740  [pdf, other

    cs.LG stat.ML

    Balancing Competing Objectives with Noisy Data: Score-Based Classifiers for Welfare-Aware Machine Learning

    Authors: Esther Rolf, Max Simchowitz, Sarah Dean, Lydia T. Liu, Daniel Björkegren, Moritz Hardt, Joshua Blumenstock

    Abstract: While real-world decisions involve many competing objectives, algorithmic decisions are often evaluated with a single objective function. In this paper, we study algorithmic policies which explicitly trade off between a private objective (such as profit) and a public objective (such as social welfare). We analyze a natural class of policies which trace an empirical Pareto frontier based on learned… ▽ More

    Submitted 15 July, 2020; v1 submitted 14 March, 2020; originally announced March 2020.

  42. arXiv:2002.06673  [pdf, other

    cs.LG cs.GT stat.ML

    Performative Prediction

    Authors: Juan C. Perdomo, Tijana Zrnic, Celestine Mendler-Dünner, Moritz Hardt

    Abstract: When predictions support decisions they may influence the outcome they aim to predict. We call such predictions performative; the prediction influences the target. Performativity is a well-studied phenomenon in policy-making that has so far been neglected in supervised learning. When ignored, performativity surfaces as undesirable distribution shift, routinely addressed with retraining. We devel… ▽ More

    Submitted 26 February, 2021; v1 submitted 16 February, 2020; originally announced February 2020.

    Comments: published at ICML'20; fixed some typos

  43. arXiv:1910.10362  [pdf, other

    cs.LG stat.ML

    Strategic Classification is Causal Modeling in Disguise

    Authors: John Miller, Smitha Milli, Moritz Hardt

    Abstract: Consequential decision-making incentivizes individuals to strategically adapt their behavior to the specifics of the decision rule. While a long line of work has viewed strategic adaptation as gaming and attempted to mitigate its effects, recent work has instead sought to design classifiers that incentivize individuals to improve a desired quality. Key to both accounts is a cost function that dict… ▽ More

    Submitted 17 February, 2020; v1 submitted 23 October, 2019; originally announced October 2019.

    Comments: This paper was previously titled "Strategic Adaptation to Classifiers: A Causal Perspective." The current version subsumes all previous versions

  44. arXiv:1909.13231  [pdf, other

    cs.LG cs.CV stat.ML

    Test-Time Training with Self-Supervision for Generalization under Distribution Shifts

    Authors: Yu Sun, Xiaolong Wang, Zhuang Liu, John Miller, Alexei A. Efros, Moritz Hardt

    Abstract: In this paper, we propose Test-Time Training, a general approach for improving the performance of predictive models when training and test data come from different distributions. We turn a single unlabeled test sample into a self-supervised learning problem, on which we update the model parameters before making a prediction. This also extends naturally to data in an online stream. Our simple appro… ▽ More

    Submitted 1 July, 2020; v1 submitted 29 September, 2019; originally announced September 2019.

    Comments: ICML 2020

  45. arXiv:1908.01039  [pdf, other

    cs.LG stat.ML

    Linear Dynamics: Clustering without identification

    Authors: Chloe Ching-Yun Hsu, Michaela Hardt, Moritz Hardt

    Abstract: Linear dynamical systems are a fundamental and powerful parametric model class. However, identifying the parameters of a linear dynamical system is a venerable task, permitting provably efficient solutions only in special cases. This work shows that the eigenspectrum of unknown linear dynamics can be identified without full system identification. We analyze a computationally efficient and provably… ▽ More

    Submitted 29 February, 2020; v1 submitted 2 August, 2019; originally announced August 2019.

  46. arXiv:1907.04911  [pdf, other

    cs.LG cs.CY stat.AP stat.ML

    Explaining an increase in predicted risk for clinical alerts

    Authors: Michaela Hardt, Alvin Rajkomar, Gerardo Flores, Andrew Dai, Michael Howell, Greg Corrado, Claire Cui, Moritz Hardt

    Abstract: Much work aims to explain a model's prediction on a static input. We consider explanations in a temporal setting where a stateful dynamical model produces a sequence of risk estimates given an input at each time step. When the estimated risk increases, the goal of the explanation is to attribute the increase to a few relevant inputs from the past. While our formal setup and techniques are general,… ▽ More

    Submitted 10 July, 2019; originally announced July 2019.

  47. arXiv:1905.12580  [pdf, other

    cs.LG stat.ML

    Model Similarity Mitigates Test Set Overuse

    Authors: Horia Mania, John Miller, Ludwig Schmidt, Moritz Hardt, Benjamin Recht

    Abstract: Excessive reuse of test data has become commonplace in today's machine learning workflows. Popular benchmarks, competitions, industrial scale tuning, among other applications, all involve test data reuse beyond guidance by statistical confidence bounds. Nonetheless, recent replication studies give evidence that popular benchmarks continue to support progress despite years of extensive reuse. We pr… ▽ More

    Submitted 29 May, 2019; originally announced May 2019.

    Comments: 18 pages, 7 figures

  48. arXiv:1905.10360  [pdf, other

    cs.LG cs.DS stat.ML

    The advantages of multiple classes for reducing overfitting from test set reuse

    Authors: Vitaly Feldman, Roy Frostig, Moritz Hardt

    Abstract: Excessive reuse of holdout data can lead to overfitting. However, there is little concrete evidence of significant overfitting due to holdout reuse in popular multiclass benchmarks today. Known results show that, in the worst-case, revealing the accuracy of $k$ adaptively chosen classifiers on a data set of size $n$ allows to create a classifier with bias of $Θ(\sqrt{k/n})$ for any binary predicti… ▽ More

    Submitted 24 May, 2019; originally announced May 2019.

  49. arXiv:1902.04698  [pdf, other

    stat.ML cs.AI cs.LG

    Identity Crisis: Memorization and Generalization under Extreme Overparameterization

    Authors: Chiyuan Zhang, Samy Bengio, Moritz Hardt, Michael C. Mozer, Yoram Singer

    Abstract: We study the interplay between memorization and generalization of overparameterized networks in the extreme case of a single training example and an identity-mapping task. We examine fully-connected and convolutional networks (FCN and CNN), both linear and nonlinear, initialized randomly and then trained to minimize the reconstruction error. The trained networks stereotypically take one of two for… ▽ More

    Submitted 8 January, 2020; v1 submitted 12 February, 2019; originally announced February 2019.

    Comments: ICLR 2020

  50. arXiv:1901.11143  [pdf, ps, other

    cs.LG math.ST stat.ML

    Natural Analysts in Adaptive Data Analysis

    Authors: Tijana Zrnic, Moritz Hardt

    Abstract: Adaptive data analysis is frequently criticized for its pessimistic generalization guarantees. The source of these pessimistic bounds is a model that permits arbitrary, possibly adversarial analysts that optimally use information to bias results. While being a central issue in the field, still lacking are notions of natural analysts that allow for more optimistic bounds faithful to the reality tha… ▽ More

    Submitted 11 May, 2019; v1 submitted 30 January, 2019; originally announced January 2019.

    Comments: 22 pages