Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- invited-talkJuly 2024
Computational algebraic geometry for evolutionary biology
ISSAC '24: Proceedings of the 2024 International Symposium on Symbolic and Algebraic ComputationPages 11–12https://doi.org/10.1145/3666000.3672623We discuss the role computational algebraic geometry and symbolic computation has played in regards to statistical problems related to inferring phylogenetic networks with a focus on identifiability. This article is an accompanying extended abstract of ...
- research-articleMay 2024
Towards the Identifiability and Explainability for Personalized Learner Modeling: An Inductive Paradigm
WWW '24: Proceedings of the ACM Web Conference 2024Pages 3420–3431https://doi.org/10.1145/3589334.3645437Personalized learner modeling using cognitive diagnosis (CD), which aims to model learners' cognitive states by diagnosing learner traits from behavioral data, is a fundamental yet significant task in many web learning services. Existing cognitive ...
- research-articleMarch 2024
Clustering and structural robustness in causal diagrams
The Journal of Machine Learning Research (JMLR), Volume 24, Issue 1Article No.: 195, Pages 9356–9387Graphs are commonly used to represent and visualize causal relations. For a small number of variables, this approach provides a succinct and clear view of the scenario at hand. As the number of variables under study increases, the graphical approach may ...
- research-articleMarch 2024
Dimension-grouped mixed membership models for multivariate categorical data
The Journal of Machine Learning Research (JMLR), Volume 24, Issue 1Article No.: 88, Pages 4012–4060Mixed Membership Models (MMMs) are a popular family of latent structure models for complex multivariate data. Instead of forcing each subject to belong to a single cluster, MMMs incorporate a vector of subject-specific weights characterizing partial ...
- research-articleMarch 2024
Inference for a large directed acyclic graph with unspecified interventions
The Journal of Machine Learning Research (JMLR), Volume 24, Issue 1Article No.: 73, Pages 3261–3308Statistical inference of directed relations given some unspecified interventions (i.e., the intervention targets are unknown) is challenging. In this article, we test hypothesized directed relations with unspecified interventions. First, we derive ...
-
- research-articleAugust 2022
pureGAM: Learning an Inherently Pure Additive Model
KDD '22: Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data MiningPages 1728–1738https://doi.org/10.1145/3534678.3539256Including pairwise or higher-order interactions among predictors of a Generalized Additive Model (GAM) is gaining increasing attention in the literature. However, existing models face anidentifiability challenge. In this paper, we propose pureGAM, an ...
- tutorialJuly 2022
Applications of Computer Algebra to Parameter Analysis of Dynamical Systems
ISSAC '22: Proceedings of the 2022 International Symposium on Symbolic and Algebraic ComputationPages 33–38https://doi.org/10.1145/3476446.3535473The purpose of this article is to present some recent applications of computer algebra to answer structural and numerical questions in applied sciences. A first example concerns identifiability which is a pre-condition for safely running parameter ...
- research-articleJune 2022
Parameter Identifiability for Nonlinear LPV Models
International Journal of Applied Mathematics and Computer Science (IJAMCS), Volume 32, Issue 2Pages 255–269https://doi.org/10.34768/amcs-2022-0019AbstractLinear parameter varying (LPV) models are being increasingly used as a bridge between linear and nonlinear models. From a mathematical point of view, a large class of nonlinear models can be rewritten in LPV or quasi-LPV forms easing their ...
- research-articleJanuary 2022
Underspecification presents challenges for credibility in modern machine learning
- Alexander D'Amour,
- Katherine Heller,
- Dan Moldovan,
- Ben Adlam,
- Babak Alipanahi,
- Alex Beutel,
- Christina Chen,
- Jonathan Deaton,
- Jacob Eisenstein,
- Matthew D. Hoffman,
- Farhad Hormozdiari,
- Neil Houlsby,
- Shaobo Hou,
- Ghassen Jerfel,
- Alan Karthikesalingam,
- Mario Lucic,
- Yian Ma,
- Cory McLean,
- Diana Mincu,
- Akinori Mitani,
- Andrea Montanari,
- Zachary Nado,
- Vivek Natarajan,
- Christopher Nielson,
- Thomas F. Osborne,
- Rajiv Raman,
- Kim Ramasamy,
- Rory Sayres,
- Jessica Schrouff,
- Martin Seneviratne,
- Shannon Sequeira,
- Harini Suresh,
- Victor Veitch,
- Max Vladymyrov,
- Xuezhi Wang,
- Kellie Webster,
- Steve Yadlowsky,
- Taedong Yun,
- Xiaohua Zhai,
- D. Sculley
The Journal of Machine Learning Research (JMLR), Volume 23, Issue 1Article No.: 226, Pages 10237–10297Machine learning (ML) systems often exhibit unexpectedly poor behavior when they are deployed in real-world domains. We identify underspecification in ML pipelines as a key reason for these failures. An ML pipeline is the full procedure followed to train ...
- research-articleJanuary 2022
A Sampling Algorithm to Compute the Set of Feasible Solutions for NonNegative Matrix Factorization with an Arbitrary Rank
SIAM Journal on Matrix Analysis and Applications (SIMAX), Volume 43, Issue 1Pages 257–273https://doi.org/10.1137/20M1378971Nonnegative matrix factorization (NMF) is a useful method to extract features from multivariate data, but an important and sometimes neglected concern is that NMF can result in nonunique solutions. Often, there exist a set of feasible solutions (SFS), ...
- research-articleJanuary 2021
Identifiability of Infection Model Parameters Early in an Epidemic
SIAM Journal on Control and Optimization (SICON), Volume 60, Issue 2Pages S27–S48https://doi.org/10.1137/20M1353289It is known that the parameters in the deterministic and stochastic SEIR epidemic models are structurally identifiable. For example, from knowledge of the infected population time series $I(t)$ during the entire epidemic, the parameters can be successfully ...
- research-articleNovember 2020
Power in Skin: The Interplay of Self-Presentation, Tactical Play, and Spending in Fortnite
CHI PLAY '20: Proceedings of the Annual Symposium on Computer-Human Interaction in PlayPages 71–80https://doi.org/10.1145/3410404.3414262This paper endeavors to explain how and why self-presentation can affect in-game purchase behavior in Fortnite. As one of the most popular battle royale games in the world, Fortnite employs a free-to-play business model but enjoys a high revenue by ...
- research-articleJune 2020
Structure Learning of H-Colorings
ACM Transactions on Algorithms (TALG), Volume 16, Issue 3Article No.: 36, Pages 1–28https://doi.org/10.1145/3382207We study the following structure learning problem for H-colorings. For a fixed (and known) constraint graph H with q colors, given access to uniformly random H-colorings of an unknown graph G=(V,E), how many samples are required to learn the edges of G? ...
- research-articleJanuary 2020
Identifiability of additive noise models using conditional variances
The Journal of Machine Learning Research (JMLR), Volume 21, Issue 1Article No.: 75, Pages 2896–2929This paper considers a new identifiability condition for additive noise models (ANMs) in which each variable is determined by an arbitrary Borel measurable function of its parents plus an independent error. It has been shown that ANMs are fully ...
- research-articleJuly 2019
Identifiability of Cause and Effect using Regularized Regression
KDD '19: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data MiningPages 852–861https://doi.org/10.1145/3292500.3330854We consider the problem of telling apart cause from effect between two univariate continuous-valued random variables X and Y. In general, it is impossible to make definite statements about causality without making assumptions on the underlying model; ...
- research-articleJuly 2019
Algebraic Methods in the Design of Experiments
ISSAC '19: Proceedings of the 2019 International Symposium on Symbolic and Algebraic ComputationPages 15–20https://doi.org/10.1145/3326229.3326278Applications of Gröbner basis theory to various problems of statistics arises in early 1990s. One of the first works in this field, a computational algebraic statistics, is given by Pistone and Wynn \citePistone-Wynn-1996, where the Gröbner basis theory ...
- research-articleJuly 2019
Linear Compartmental Models: Input-Output Equations and Operations That Preserve Identifiability
SIAM Journal on Applied Mathematics (SJAM), Volume 79, Issue 4Pages 1423–1447https://doi.org/10.1137/18M1204826This work focuses on the question of how identifiability of a mathematical model, that is, whether parameters can be recovered from data, is related to identifiability of its submodels. We look specifically at linear compartmental models and investigate ...
- research-articleJanuary 2019
Learning Paths from Signature Tensors
SIAM Journal on Matrix Analysis and Applications (SIMAX), Volume 40, Issue 2Pages 394–416https://doi.org/10.1137/18M1212331Matrix congruence extends naturally to the setting of tensors. We apply methods from tensor decomposition, algebraic geometry, and numerical optimization to this group action. Given a tensor in the orbit of another tensor, we compute a matrix which ...
- research-articleJuly 2017
Probabilistic Perspectives on Collecting Human Uncertainty in Predictive Data Mining
UMAP '17: Proceedings of the 25th Conference on User Modeling, Adaptation and PersonalizationPages 104–112https://doi.org/10.1145/3079628.3079675In many areas of data mining, data is collected from human beings. In this contribution, we ask the question of how people actually respond to ordinal scales. The main problem observed is that users tend to be volatile in their choices, i.e. complex ...
- research-articleApril 2017
Optimizing Campaigns for Changing Routine Behaviors by Using an Empirically Calibrated Microsimulation Model
Social Science Computer Review (SSCR), Volume 35, Issue 2Pages 184–202https://doi.org/10.1177/0894439315620866We used the model of prospective memory and habit development to derive recommendations for designing behavior-change campaigns that used prompts or household visits as reminders. We followed an exemplary procedure comprising the calibration of the ...