Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–14 of 14 results for author: Huh, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.02890  [pdf, other

    cs.MA cs.AI cs.LG

    Representation Learning For Efficient Deep Multi-Agent Reinforcement Learning

    Authors: Dom Huh, Prasant Mohapatra

    Abstract: Sample efficiency remains a key challenge in multi-agent reinforcement learning (MARL). A promising approach is to learn a meaningful latent representation space through auxiliary learning objectives alongside the MARL objective to aid in learning a successful control policy. In our work, we present MAPO-LSO (Multi-Agent Policy Optimization with Latent Space Optimization) which applies a form of c… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  2. arXiv:2404.18638  [pdf, other

    cs.AI cs.CL

    Reinforcement Learning Problem Solving with Large Language Models

    Authors: Sina Gholamian, Domingo Huh

    Abstract: Large Language Models (LLMs) encapsulate an extensive amount of world knowledge, and this has enabled their application in various domains to improve the performance of a variety of Natural Language Processing (NLP) tasks. This has also facilitated a more accessible paradigm of conversation-based interactions between humans and AI systems to solve intended problems. However, one interesting avenue… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

  3. arXiv:2402.17002  [pdf, other

    cs.LG math.GR math.RT

    Discovering Abstract Symbolic Relations by Learning Unitary Group Representations

    Authors: Dongsung Huh

    Abstract: We investigate a principled approach for symbolic operation completion (SOC), a minimal task for studying symbolic reasoning. While conceptually similar to matrix completion, SOC poses a unique challenge in modeling abstract relationships between discrete symbols. We demonstrate that SOC can be efficiently solved by a minimal model - a bilinear map - with a novel factorized architecture. Inspired… ▽ More

    Submitted 22 May, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

  4. arXiv:2312.10256  [pdf, other

    cs.MA cs.AI cs.LG

    Multi-agent Reinforcement Learning: A Comprehensive Survey

    Authors: Dom Huh, Prasant Mohapatra

    Abstract: Multi-agent systems (MAS) are widely prevalent and crucially important in numerous real-world applications, where multiple agents must make decisions to achieve their objectives in a shared environment. Despite their ubiquity, the development of intelligent decision-making agents in MAS poses several open challenges to their effective implementation. This survey examines these challenges, placing… ▽ More

    Submitted 2 July, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

  5. arXiv:2305.00604  [pdf, other

    cs.LG cs.CV math.OC stat.ML

    ISAAC Newton: Input-based Approximate Curvature for Newton's Method

    Authors: Felix Petersen, Tobias Sutter, Christian Borgelt, Dongsung Huh, Hilde Kuehne, Yuekai Sun, Oliver Deussen

    Abstract: We present ISAAC (Input-baSed ApproximAte Curvature), a novel method that conditions the gradient using selected second-order information and has an asymptotically vanishing computational overhead, assuming a batch size smaller than the number of neurons. We show that it is possible to compute a good conditioner based on only the input to a respective layer without a substantial computational over… ▽ More

    Submitted 30 April, 2023; originally announced May 2023.

    Comments: Published at ICLR 2023, Code @ https://github.com/Felix-Petersen/isaac, Video @ https://youtu.be/7RKRX-MdwqM

  6. arXiv:2301.08864  [pdf, other

    cs.MA cs.AI

    Decentralized Multi-agent Filtering

    Authors: Dom Huh, Prasant Mohapatra

    Abstract: This paper addresses the considerations that comes along with adopting decentralized communication for multi-agent localization applications in discrete state spaces. In this framework, we extend the original formulation of the Bayes filter, a foundational probabilistic tool for discrete state estimation, by appending a step of greedy belief sharing as a method to propagate information and improve… ▽ More

    Submitted 20 January, 2023; originally announced January 2023.

  7. arXiv:2205.14546  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    The Missing Invariance Principle Found -- the Reciprocal Twin of Invariant Risk Minimization

    Authors: Dongsung Huh, Avinash Baidya

    Abstract: Machine learning models often generalize poorly to out-of-distribution (OOD) data as a result of relying on features that are spuriously correlated with the label during training. Recently, the technique of Invariant Risk Minimization (IRM) was proposed to learn predictors that only use invariant features by conserving the feature-conditioned label expectation $\mathbb{E}_e[y|f(x)]$ across environ… ▽ More

    Submitted 16 January, 2023; v1 submitted 28 May, 2022; originally announced May 2022.

    Comments: NeurIPS 2022

  8. arXiv:2106.13037  [pdf, other

    cs.LG cs.AI

    Mix and Mask Actor-Critic Methods

    Authors: Dom Huh

    Abstract: Shared feature spaces for actor-critic methods aims to capture generalized latent representations to be used by the policy and value function with the hopes for a more stable and sample-efficient optimization. However, such a paradigm present a number of challenges in practice, as parameters generating a shared representation must learn off two distinct objectives, resulting in competing updates a… ▽ More

    Submitted 24 June, 2021; originally announced June 2021.

  9. arXiv:2101.00728  [pdf, other

    cs.LG cs.AI

    Synthetic Embedding-based Data Generation Methods for Student Performance

    Authors: Dom Huh

    Abstract: Given the inherent class imbalance issue within student performance datasets, samples belonging to the edges of the target class distribution pose a challenge for predictive machine learning algorithms to learn. In this paper, we introduce a general framework for synthetic embedding-based data generation (SEDG), a search-based approach to generate new synthetic samples using embeddings to correct… ▽ More

    Submitted 3 January, 2021; originally announced January 2021.

  10. arXiv:2007.16001  [pdf, other

    cs.LG cs.AI

    Greedy Bandits with Sampled Context

    Authors: Dom Huh

    Abstract: Bayesian strategies for contextual bandits have proved promising in single-state reinforcement learning tasks by modeling uncertainty using context information from the environment. In this paper, we propose Greedy Bandits with Sampled Context (GB-SC), a method for contextual multi-armed bandits to develop the prior from the context information using Thompson Sampling, and arm selection using an e… ▽ More

    Submitted 27 July, 2020; originally announced July 2020.

  11. arXiv:2007.16000  [pdf, other

    cs.LG cs.IR stat.ML

    Hierarchical BiGraph Neural Network as Recommendation Systems

    Authors: Dom Huh

    Abstract: Graph neural networks emerge as a promising modeling method for applications dealing with datasets that are best represented in the graph domain. In specific, developing recommendation systems often require addressing sparse structured data which often lacks the feature richness in either the user and/or item side and requires processing within the correct context for optimal performance. These da… ▽ More

    Submitted 27 July, 2020; originally announced July 2020.

  12. arXiv:2003.08743  [pdf, other

    cs.CV

    Generative Multi-Stream Architecture For American Sign Language Recognition

    Authors: Dom Huh, Sai Gurrapu, Frederick Olson, Huzefa Rangwala, Parth Pathak, Jana Kosecka

    Abstract: With advancements in deep model architectures, tasks in computer vision can reach optimal convergence provided proper data preprocessing and model parameter initialization. However, training on datasets with low feature-richness for complex applications limit and detriment optimal convergence below human performance. In past works, researchers have provided external sources of complementary data a… ▽ More

    Submitted 9 March, 2020; originally announced March 2020.

  13. arXiv:1706.04698  [pdf, other

    q-bio.NC cs.LG cs.NE stat.ML

    Gradient Descent for Spiking Neural Networks

    Authors: Dongsung Huh, Terrence J. Sejnowski

    Abstract: Much of studies on neural computation are based on network models of static neurons that produce analog output, despite the fact that information processing in the brain is predominantly carried out by dynamic neurons that produce discrete pulses called spikes. Research in spike-based computation has been impeded by the lack of efficient supervised learning algorithm for spiking networks. Here, we… ▽ More

    Submitted 19 June, 2017; v1 submitted 14 June, 2017; originally announced June 2017.

  14. arXiv:1506.07515  [pdf, other

    cs.GR cs.RO math.DG q-bio.NC

    The Vector Space of Convex Curves: How to Mix Shapes

    Authors: Dongsung Huh

    Abstract: We present a novel, log-radius profile representation for convex curves and define a new operation for combining the shape features of curves. Unlike the standard, angle profile-based methods, this operation accurately combines the shape features in a visually intuitive manner. This method have implications in shape analysis as well as in investigating how the brain perceives and generates curved… ▽ More

    Submitted 24 June, 2015; originally announced June 2015.