Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleMarch 2024
Hard-constrained deep learning for climate downscaling
- Paula Harder,
- Alex Hernandez-Garcia,
- Venkatesh Ramesh,
- Qidong Yang,
- Prasanna Sattegeri,
- Daniela Szwarcman,
- Campbell D. Watson,
- David Rolnick
The Journal of Machine Learning Research (JMLR), Volume 24, Issue 1Article No.: 365, Pages 17534–17573The availability of reliable, high-resolution climate and weather data is important to inform long-term decisions on climate adaptation and mitigation and to guide rapid responses to extreme events. Forecasting models are limited by computational costs ...
- research-articleMarch 2024
Compression, generalization and learning
The Journal of Machine Learning Research (JMLR), Volume 24, Issue 1Article No.: 339, Pages 16184–16257A compression function is a map that slims down an observational set into a subset of reduced size, while preserving its informational content. In multiple applications, the condition that one new observation makes the compressed set change is ...
- research-articleMarch 2024
Adaptive learning of density ratios in RKHS
The Journal of Machine Learning Research (JMLR), Volume 24, Issue 1Article No.: 395, Pages 18863–18891Estimating the ratio of two probability densities from finitely many observations of the densities is a central problem in machine learning and statistics with applications in two-sample testing, divergence estimation, generative modeling, covariate ...
- research-articleMarch 2024
Revisiting inference after prediction
The Journal of Machine Learning Research (JMLR), Volume 24, Issue 1Article No.: 394, Pages 18845–18862Recent work has focused on the very common practice of prediction-based inference: that is, (i) using a pre-trained machine learning model to predict an unobserved response variable, and then (ii) conducting inference on the association between that ...
- research-articleMarch 2024
A unified approach to controlling implicit regularization via mirror descent
The Journal of Machine Learning Research (JMLR), Volume 24, Issue 1Article No.: 393, Pages 18787–18844Inspired by the remarkable success of large neural networks, there has been significant interest in understanding the generalization performance of over-parameterized models. Substantial efforts have been invested in characterizing how optimization ...
-
- research-articleMarch 2024
Instance-dependent confidence and early stopping for reinforcement learning
The Journal of Machine Learning Research (JMLR), Volume 24, Issue 1Article No.: 392, Pages 18744–18786Reinforcement learning algorithms are known to exhibit a variety of convergence rates depending on the problem structure. Recent years have witnessed considerable progress in developing theory that is instance-dependent, along with algorithms that ...
- research-articleMarch 2024
Hierarchical kernels in deep kernel learning
The Journal of Machine Learning Research (JMLR), Volume 24, Issue 1Article No.: 391, Pages 18714–18743Kernel methods are built upon the mathematical theory of reproducing kernels and reproducing kernel Hilbert spaces. They enjoy good interpretability thanks to the solid mathematical foundation. Recently, motivated by deep neural networks in deep learning,...
- research-articleMarch 2024
A scalable and efficient iterative method for copying machine learning classifiers
The Journal of Machine Learning Research (JMLR), Volume 24, Issue 1Article No.: 390, Pages 18680–18713Differential replication through copying refers to the process of replicating the decision behavior of a machine learning model using another model that possesses enhanced features and attributes. This process is relevant when external constraints limit ...
- research-articleMarch 2024
Fourier neural operator with learned deformations for PDEs on general geometries
The Journal of Machine Learning Research (JMLR), Volume 24, Issue 1Article No.: 388, Pages 18593–18618Deep learning surrogate models have shown promise in solving partial differential equations (PDEs). Among them, the Fourier neural operator (FNO) achieves good accuracy, and is significantly faster compared to numerical solvers, on a variety of PDEs, ...
- research-articleMarch 2024
Scalable PAC-bayesian meta-learning via the PAC-optimal hyper-posterior: from theory to practice
The Journal of Machine Learning Research (JMLR), Volume 24, Issue 1Article No.: 386, Pages 18474–18535Meta-Learning aims to speed up the learning process on new tasks by acquiring useful inductive biases from datasets of related learning tasks. While, in practice, the number of related tasks available is often small, most of the existing approaches ...
- research-articleMarch 2024
On unbalanced optimal transport: gradient methods, sparsity and approximation error
The Journal of Machine Learning Research (JMLR), Volume 24, Issue 1Article No.: 384, Pages 18390–18430We study the Unbalanced Optimal Transport (UOT) between two measures of possibly different masses with at most n components, where the marginal constraints of standard Optimal Transport (OT) are relaxed via Kullback-Leibler divergence with regularization ...
- research-articleMarch 2024
Over-parameterized deep nonparametric regression for dependent data with its applications to reinforcement learning
The Journal of Machine Learning Research (JMLR), Volume 24, Issue 1Article No.: 383, Pages 18350–18389In this paper, we provide statistical guarantees for over-parameterized deep nonparametric regression in the presence of dependent data. By decomposing the error, we establish nonasymptotic error bounds for deep estimation, which is achieved by ...
- research-articleMarch 2024
Low-rank tensor estimation via Riemannian Gauss-Newton: statistical optimality and second-order convergence
The Journal of Machine Learning Research (JMLR), Volume 24, Issue 1Article No.: 381, Pages 18274–18321In this paper, we consider the estimation of a low Tucker rank tensor from a number of noisy linear measurements. The general problem covers many specific examples arising from applications, including tensor regression, tensor completion, and tensor PCA/...
- research-articleMarch 2024
On learning rates and schrödinger operators
The Journal of Machine Learning Research (JMLR), Volume 24, Issue 1Article No.: 379, Pages 18153–18205Understanding the iterative behavior of stochastic optimization algorithms for minimizing nonconvex functions remains a crucial challenge in demystifying deep learning. In particular, it is not yet understood why certain simple techniques are remarkably ...
- research-articleMarch 2024
Principled out-of-distribution detection via multiple testing
The Journal of Machine Learning Research (JMLR), Volume 24, Issue 1Article No.: 378, Pages 18118–18152We study the problem of out-of-distribution (OOD) detection, that is, detecting whether a machine learning (ML) model's output can be trusted at inference time. While a number of tests for OOD detection have been proposed in prior work, a formal ...
- research-articleMarch 2024
Scaling up models and data with t5x and seqio
- Adam Roberts,
- Hyung Won Chung,
- Gaurav Mishra,
- Anselm Levskaya,
- James Bradbury,
- Daniel Andor,
- Sharan Narang,
- Brian Lester,
- Colin Gaffney,
- Afroz Mohiuddin,
- Curtis Hawthorne,
- Aitor Lewkowycz,
- Alex Salcianu,
- Haitang Hu,
- Sasha Tsvyashchenko,
- Aakanksha Chowdhery,
- Jasmijn Bastings,
- Jannis Bulian,
- Xavier Garcia,
- Jianmo Ni,
- Andrew Chen,
- Kathleen Kenealy,
- Kehang Han,
- Michelle Casbon,
- Jonathan H. Clark,
- Stephan Lee,
- Dan Garrette,
- James Lee-Thorp,
- Colin Raffel,
- Noam Shazeer,
- Marvin Ritter,
- Maarten Bosma,
- Alexandre Passos,
- Jeremy Maitin-Shepard,
- Noah Fiedel,
- Mark Omernick,
- Brennan Saeta,
- Ryan Sepassi,
- Alexander Spiridonov,
- Joshua Newlan,
- Andrea Gesmundo,
- Marc Van Zee,
- Jacob Austin,
- Sebastian Goodman,
- Livio Baldini Soares
The Journal of Machine Learning Research (JMLR), Volume 24, Issue 1Article No.: 377, Pages 18110–18117Scaling up training datasets and model parameters have benefited neural network-based language models, but also present challenges like distributed compute, input data bottlenecks and reproducibility of results. We introduce two simple and scalable ...
- research-articleMarch 2024
On the dynamics under the unhinged loss and beyond
The Journal of Machine Learning Research (JMLR), Volume 24, Issue 1Article No.: 376, Pages 18048–18109Recent works have studied implicit biases in deep learning, especially the behavior of last-layer features and classifier weights. However, they usually need to simplify the intermediate dynamics under gradient ow or gradient descent due to the ...
- research-articleMarch 2024
Set-valued classification with out-of-distribution detection for many classes
The Journal of Machine Learning Research (JMLR), Volume 24, Issue 1Article No.: 375, Pages 18009–18047Set-valued classification, a new classification paradigm that aims to identify all the plausible classes that an observation belongs to, improves over the traditional classification paradigms in multiple aspects. Existing set-valued classification ...
- research-articleMarch 2024
Diffusion bridge mixture transports, schrödinger bridge problems and generative modeling
The Journal of Machine Learning Research (JMLR), Volume 24, Issue 1Article No.: 374, Pages 17958–18008The dynamic Schrödinger bridge problem seeks a stochastic process that defines a transport between two target probability measures, while optimally satisfying the criteria of being closest, in terms of Kullback-Leibler divergence, to a reference process. ...
- research-articleMarch 2024
Multilevel CNNs for parametric PDEs
The Journal of Machine Learning Research (JMLR), Volume 24, Issue 1Article No.: 373, Pages 17916–17957We combine concepts from multilevel solvers for partial differential equations (PDEs) with neural network based deep learning and propose a new methodology for the efficient numerical solution of high-dimensional parametric PDEs. An in-depth theoretical ...