Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–13 of 13 results for author: Jeon, H J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.12288  [pdf, other

    stat.ML cs.AI cs.LG

    Information-Theoretic Foundations for Machine Learning

    Authors: Hong Jun Jeon, Benjamin Van Roy

    Abstract: The staggering progress of machine learning in the past decade has been a sight to behold. In retrospect, it is both remarkable and unsettling that these milestones were achievable with little to no rigorous theory to guide experimentation. Despite this fact, practitioners have been able to guide their future experimentation via observations from previous large-scale empirical investigations. Howe… ▽ More

    Submitted 18 July, 2024; v1 submitted 16 July, 2024; originally announced July 2024.

  2. arXiv:2407.01456  [pdf, other

    cs.LG cs.AI

    Information-Theoretic Foundations for Neural Scaling Laws

    Authors: Hong Jun Jeon, Benjamin Van Roy

    Abstract: Neural scaling laws aim to characterize how out-of-sample error behaves as a function of model and training dataset size. Such scaling laws guide allocation of a computational resources between model and data processing to minimize error. However, existing theoretical support for neural scaling laws lacks rigor and clarity, entangling the roles of information and optimization. In this work, we dev… ▽ More

    Submitted 27 June, 2024; originally announced July 2024.

    Comments: arXiv admin note: text overlap with arXiv:2212.01365

  3. arXiv:2401.15530  [pdf, ps, other

    cs.LG cs.IT

    An Information-Theoretic Analysis of In-Context Learning

    Authors: Hong Jun Jeon, Jason D. Lee, Qi Lei, Benjamin Van Roy

    Abstract: Previous theoretical results pertaining to meta-learning on sequences build on contrived assumptions and are somewhat convoluted. We introduce new information-theoretic tools that lead to an elegant and very general decomposition of error into three components: irreducible error, meta-learning error, and intra-task error. These tools unify analyses across many meta-learning challenges. To illustra… ▽ More

    Submitted 27 January, 2024; originally announced January 2024.

  4. arXiv:2401.13239  [pdf, other

    cs.LG cs.HC

    Adaptive Crowdsourcing Via Self-Supervised Learning

    Authors: Anmol Kagrecha, Henrik Marklund, Benjamin Van Roy, Hong Jun Jeon, Richard Zeckhauser

    Abstract: Common crowdsourcing systems average estimates of a latent quantity of interest provided by many crowdworkers to produce a group estimate. We develop a new approach -- predict-each-worker -- that leverages self-supervised learning and a novel aggregation scheme. This approach adapts weights assigned to crowdworkers based on estimates they provided for previous quantities. When skills vary across c… ▽ More

    Submitted 1 February, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

    Comments: 33 pages, 3 figures

  5. arXiv:2307.04345  [pdf, other

    cs.LG cs.AI

    Continual Learning as Computationally Constrained Reinforcement Learning

    Authors: Saurabh Kumar, Henrik Marklund, Ashish Rao, Yifan Zhu, Hong Jun Jeon, Yueyang Liu, Benjamin Van Roy

    Abstract: An agent that efficiently accumulates knowledge to develop increasingly sophisticated skills over a long lifetime could advance the frontier of artificial intelligence capabilities. The design of such agents, which remains a long-standing challenge of artificial intelligence, is addressed by the subject of continual learning. This monograph clarifies and formalizes concepts of continual learning,… ▽ More

    Submitted 20 August, 2023; v1 submitted 10 July, 2023; originally announced July 2023.

  6. arXiv:2212.01365  [pdf, other

    cs.LG

    An Information-Theoretic Analysis of Compute-Optimal Neural Scaling Laws

    Authors: Hong Jun Jeon, Benjamin Van Roy

    Abstract: We study the compute-optimal trade-off between model and training data set sizes for large neural networks. Our result suggests a linear relation similar to that supported by the empirical analysis of chinchilla. While that work studies transformer-based large language models trained on the MassiveText corpus gopher, as a starting point for development of a mathematical theory, we focus on a simpl… ▽ More

    Submitted 18 October, 2023; v1 submitted 2 December, 2022; originally announced December 2022.

  7. arXiv:2209.08627  [pdf, other

    cs.LG

    Is Stochastic Gradient Descent Near Optimal?

    Authors: Yifan Zhu, Hong Jun Jeon, Benjamin Van Roy

    Abstract: The success of neural networks over the past decade has established them as effective models for many relevant data generating processes. Statistical theory on neural networks indicates graceful scaling of sample complexity. For example, Joen & Van Roy (arXiv:2203.00246) demonstrate that, when data is generated by a ReLU teacher network with $W$ parameters, an optimal learner needs only… ▽ More

    Submitted 6 October, 2022; v1 submitted 18 September, 2022; originally announced September 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2203.00246

  8. arXiv:2203.00246  [pdf, other

    cs.LG cs.AI stat.ML

    An Information-Theoretic Framework for Supervised Learning

    Authors: Hong Jun Jeon, Yifan Zhu, Benjamin Van Roy

    Abstract: Each year, deep learning demonstrates new and improved empirical results with deeper and wider neural networks. Meanwhile, with existing theoretical frameworks, it is difficult to analyze networks deeper than two layers without resorting to counting parameters or encountering sample complexity bounds that are exponential in depth. Perhaps it may be fruitful to try to analyze modern machine learnin… ▽ More

    Submitted 24 March, 2023; v1 submitted 1 March, 2022; originally announced March 2022.

  9. arXiv:2108.01903  [pdf, other

    cs.LG cs.AI

    Personalized Federated Learning with Clustering: Non-IID Heart Rate Variability Data Application

    Authors: Joo Hun Yoo, Ha Min Son, Hyejun Jeong, Eun-Hye Jang, Ah Young Kim, Han Young Yu, Hong Jin Jeon, Tai-Myoung Chung

    Abstract: While machine learning techniques are being applied to various fields for their exceptional ability to find complex relations in large datasets, the strengthening of regulations on data ownership and privacy is causing increasing difficulty in its application to medical data. In light of this, Federated Learning has recently been proposed as a solution to train on private data without breach of co… ▽ More

    Submitted 10 August, 2021; v1 submitted 4 August, 2021; originally announced August 2021.

    Comments: 6 pages with two columns, 4 figures, 3 tables

  10. arXiv:2107.02907  [pdf, other

    cs.RO

    Learning Latent Actions to Control Assistive Robots

    Authors: Dylan P. Losey, Hong Jun Jeon, Mengxi Li, Krishnan Srinivasan, Ajay Mandlekar, Animesh Garg, Jeannette Bohg, Dorsa Sadigh

    Abstract: Assistive robot arms enable people with disabilities to conduct everyday tasks on their own. These arms are dexterous and high-dimensional; however, the interfaces people must use to control their robots are low-dimensional. Consider teleoperating a 7-DoF robot arm with a 2-DoF joystick. The robot is helping you eat dinner, and currently you want to cut a piece of tofu. Today's robots assume a pre… ▽ More

    Submitted 10 July, 2021; v1 submitted 6 July, 2021; originally announced July 2021.

  11. arXiv:2005.03210  [pdf, other

    cs.RO

    Shared Autonomy with Learned Latent Actions

    Authors: Hong Jun Jeon, Dylan P. Losey, Dorsa Sadigh

    Abstract: Assistive robots enable people with disabilities to conduct everyday tasks on their own. However, these tasks can be complex, containing both coarse reaching motions and fine-grained manipulation. For example, when eating, not only does one need to move to the correct food item, but they must also precisely manipulate the food in different ways (e.g., cutting, stabbing, scooping). Shared autonomy… ▽ More

    Submitted 11 May, 2020; v1 submitted 6 May, 2020; originally announced May 2020.

  12. arXiv:2002.04833  [pdf, other

    cs.LG cs.AI cs.HC cs.RO

    Reward-rational (implicit) choice: A unifying formalism for reward learning

    Authors: Hong Jun Jeon, Smitha Milli, Anca D. Dragan

    Abstract: It is often difficult to hand-specify what the correct reward function is for a task, so researchers have instead aimed to learn reward functions from human behavior or feedback. The types of behavior interpreted as evidence of the reward function have expanded greatly in recent years. We've gone from demonstrations, to comparisons, to reading into the information leaked when the human is pushing… ▽ More

    Submitted 11 December, 2020; v1 submitted 12 February, 2020; originally announced February 2020.

    Comments: Published at NeurIPS 2020

  13. arXiv:1808.03891  [pdf, other

    cs.RO

    Configuration Space Metrics

    Authors: Hong Jun Jeon, Anca Diana Dragan

    Abstract: When robot manipulators decide how to reach for an object, hand it over, or obey some task constraint, they implicitly assume a Euclidean distance metric in their configuration space. Their notion of what makes a configuration closer or further is dictated by this assumption. But different distance metrics will lead to different solutions. What is efficient under a Euclidean metric might not neces… ▽ More

    Submitted 12 August, 2018; originally announced August 2018.

    Comments: 8 Pages, 12 Figures, IROS 2018

    MSC Class: 68T40