Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–20 of 20 results for author: Lee, H B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.17918  [pdf, other

    cs.LG cs.AI

    Cost-Sensitive Multi-Fidelity Bayesian Optimization with Transfer of Learning Curve Extrapolation

    Authors: Dong Bok Lee, Aoxuan Silvia Zhang, Byungjoo Kim, Junhyeon Park, Juho Lee, Sung Ju Hwang, Hae Beom Lee

    Abstract: In this paper, we address the problem of cost-sensitive multi-fidelity Bayesian Optimization (BO) for efficient hyperparameter optimization (HPO). Specifically, we assume a scenario where users want to early-stop the BO when the performance improvement is not satisfactory with respect to the required computational cost. Motivated by this scenario, we introduce utility, which is a function predefin… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  2. arXiv:2310.02423  [pdf, other

    cs.LG stat.ML

    Delta-AI: Local objectives for amortized inference in sparse graphical models

    Authors: Jean-Pierre Falet, Hae Beom Lee, Nikolay Malkin, Chen Sun, Dragos Secrieru, Thomas Jiralerspong, Dinghuai Zhang, Guillaume Lajoie, Yoshua Bengio

    Abstract: We present a new algorithm for amortized inference in sparse probabilistic graphical models (PGMs), which we call $Δ$-amortized inference ($Δ$-AI). Our approach is based on the observation that when the sampling of variables in a PGM is seen as a sequence of actions taken by an agent, sparsity of the PGM enables local credit assignment in the agent's policy learning objective. This yields a local… ▽ More

    Submitted 13 March, 2024; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: ICLR 2024; 19 pages, code: https://github.com/GFNOrg/Delta-AI/

  3. arXiv:2208.10494  [pdf, other

    cs.LG cs.AI

    Dataset Condensation with Latent Space Knowledge Factorization and Sharing

    Authors: Hae Beom Lee, Dong Bok Lee, Sung Ju Hwang

    Abstract: In this paper, we introduce a novel approach for systematically solving dataset condensation problem in an efficient manner by exploiting the regularity in a given dataset. Instead of condensing the dataset directly in the original input space, we assume a generative process of the dataset with a set of learnable codes defined in a compact latent space followed by a set of tiny decoders which maps… ▽ More

    Submitted 21 August, 2022; originally announced August 2022.

  4. arXiv:2203.02711  [pdf, other

    cs.LG math.OC

    Meta Mirror Descent: Optimiser Learning for Fast Convergence

    Authors: Boyan Gao, Henry Gouk, Hae Beom Lee, Timothy M. Hospedales

    Abstract: Optimisers are an essential component for training machine learning models, and their design influences learning speed and generalisation. Several studies have attempted to learn more effective gradient-descent optimisers via solving a bi-level optimisation problem where generalisation error is minimised with respect to optimiser parameters. However, most existing optimiser learning methods are in… ▽ More

    Submitted 5 March, 2022; originally announced March 2022.

  5. arXiv:2110.06381  [pdf, other

    stat.ML cs.LG

    Meta Learning Low Rank Covariance Factors for Energy-Based Deterministic Uncertainty

    Authors: Jeffrey Willette, Hae Beom Lee, Juho Lee, Sung Ju Hwang

    Abstract: Numerous recent works utilize bi-Lipschitz regularization of neural network layers to preserve relative distances between data instances in the feature spaces of each layer. This distance sensitivity with respect to the data aids in tasks such as uncertainty calibration and out-of-distribution (OOD) detection. In previous works, features extracted with a distance sensitive model are used to constr… ▽ More

    Submitted 15 March, 2022; v1 submitted 12 October, 2021; originally announced October 2021.

  6. arXiv:2110.02600  [pdf, other

    cs.CL

    Sequential Reptile: Inter-Task Gradient Alignment for Multilingual Learning

    Authors: Seanie Lee, Hae Beom Lee, Juho Lee, Sung Ju Hwang

    Abstract: Multilingual models jointly pretrained on multiple languages have achieved remarkable performance on various multilingual downstream tasks. Moreover, models finetuned on a single monolingual downstream task have shown to generalize to unseen languages. In this paper, we first show that it is crucial for those tasks to align gradients between them in order to maximize knowledge transfer while minim… ▽ More

    Submitted 28 February, 2022; v1 submitted 6 October, 2021; originally announced October 2021.

    Comments: ICLR 2022

  7. arXiv:2110.02508  [pdf, other

    cs.LG

    Online Hyperparameter Meta-Learning with Hypergradient Distillation

    Authors: Hae Beom Lee, Hayeon Lee, Jaewoong Shin, Eunho Yang, Timothy Hospedales, Sung Ju Hwang

    Abstract: Many gradient-based meta-learning methods assume a set of parameters that do not participate in inner-optimization, which can be considered as hyperparameters. Although such hyperparameters can be optimized using the existing gradient-based hyperparameter optimization (HO) methods, they suffer from the following issues. Unrolled differentiation methods do not scale well to high-dimensional hyperpa… ▽ More

    Submitted 11 February, 2022; v1 submitted 6 October, 2021; originally announced October 2021.

  8. arXiv:2102.07215  [pdf, other

    cs.LG

    Large-Scale Meta-Learning with Continual Trajectory Shifting

    Authors: Jaewoong Shin, Hae Beom Lee, Boqing Gong, Sung Ju Hwang

    Abstract: Meta-learning of shared initialization parameters has shown to be highly effective in solving few-shot learning tasks. However, extending the framework to many-shot scenarios, which may further enhance its practicality, has been relatively overlooked due to the technical difficulties of meta-learning over long chains of inner-gradient steps. In this paper, we first show that allowing the meta-lear… ▽ More

    Submitted 16 February, 2022; v1 submitted 14 February, 2021; originally announced February 2021.

    Journal ref: Proceedings of the 38th International Conference on Machine Learning, PMLR 139:9603-9613, 2021

  9. arXiv:2102.05195  [pdf, other

    cs.CR

    DOVE: A Data-Oblivious Virtual Environment

    Authors: Hyun Bin Lee, Tushar M. Jois, Christopher W. Fletcher, Carl A. Gunter

    Abstract: Users can improve the security of remote communications by using Trusted Execution Environments (TEEs) to protect against direct introspection and tampering of sensitive data. This can even be done with applications coded in high-level languages with complex programming stacks such as R, Python, and Ruby. However, this creates a trade-off between programming convenience versus the risk of attacks… ▽ More

    Submitted 9 February, 2021; originally announced February 2021.

    Comments: Appears in the proceedings of the 28th Network and Distributed System Security Symposium (NDSS), 2021

  10. arXiv:2006.07540  [pdf, other

    cs.LG stat.ML

    MetaPerturb: Transferable Regularizer for Heterogeneous Tasks and Architectures

    Authors: Jeongun Ryu, Jaewoong Shin, Hae Beom Lee, Sung Ju Hwang

    Abstract: Regularization and transfer learning are two popular techniques to enhance generalization on unseen data, which is a fundamental problem of machine learning. Regularization techniques are versatile, as they are task- and architecture-agnostic, but they do not exploit a large amount of data available. Transfer learning methods learn to transfer knowledge from one domain to another, but may not gene… ▽ More

    Submitted 15 February, 2022; v1 submitted 12 June, 2020; originally announced June 2020.

    Report number: Advances in Neural Information Processing Systems 33 (NeurIPS 2020)

  11. arXiv:2004.02863  [pdf, other

    eess.AS cs.LG cs.SD stat.ML

    Meta-Learning for Short Utterance Speaker Recognition with Imbalance Length Pairs

    Authors: Seong Min Kye, Youngmoon Jung, Hae Beom Lee, Sung Ju Hwang, Hoirin Kim

    Abstract: In practical settings, a speaker recognition system needs to identify a speaker given a short utterance, while the enrollment utterance may be relatively long. However, existing speaker recognition models perform poorly with such short utterances. To solve this problem, we introduce a meta-learning framework for imbalance length pairs. Specifically, we use a Prototypical Networks and train it with… ▽ More

    Submitted 10 August, 2020; v1 submitted 6 April, 2020; originally announced April 2020.

    Comments: Accepted to Interspeech 2020. The codes are available at https://github.com/seongmin-kye/meta-SR

  12. arXiv:2002.12017  [pdf, other

    cs.LG cs.CV stat.ML

    Meta-Learned Confidence for Few-shot Learning

    Authors: Seong Min Kye, Hae Beom Lee, Hoirin Kim, Sung Ju Hwang

    Abstract: Transductive inference is an effective means of tackling the data deficiency problem in few-shot learning settings. A popular transductive inference technique for few-shot metric-based approaches, is to update the prototype of each class with the mean of the most confident query examples, or confidence-weighted average of all the query samples. However, a caveat here is that the model confidence m… ▽ More

    Submitted 24 June, 2020; v1 submitted 27 February, 2020; originally announced February 2020.

  13. arXiv:1908.01457  [pdf, other

    cs.LG stat.ML

    Learning to Generalize to Unseen Tasks with Bilevel Optimization

    Authors: Hayeon Lee, Donghyun Na, Hae Beom Lee, Sung Ju Hwang

    Abstract: Recent metric-based meta-learning approaches, which learn a metric space that generalizes well over combinatorial number of different classification tasks sampled from a task distribution, have been shown to be effective for few-shot classification tasks of unseen classes. They are often trained with episodic training where they iteratively train a common metric space that reduces distance between… ▽ More

    Submitted 5 August, 2019; originally announced August 2019.

    Comments: 9 pages, 3 figures

  14. arXiv:1905.12917  [pdf, other

    cs.LG stat.ML

    Learning to Balance: Bayesian Meta-Learning for Imbalanced and Out-of-distribution Tasks

    Authors: Hae Beom Lee, Hayeon Lee, Donghyun Na, Saehoon Kim, Minseop Park, Eunho Yang, Sung Ju Hwang

    Abstract: While tasks could come with varying the number of instances and classes in realistic settings, the existing meta-learning approaches for few-shot classification assume that the number of instances per task and class is fixed. Due to such restriction, they learn to equally utilize the meta-knowledge across all the tasks, even when the number of instances per task and class largely varies. Moreover,… ▽ More

    Submitted 12 February, 2022; v1 submitted 30 May, 2019; originally announced May 2019.

  15. arXiv:1905.12914  [pdf, other

    cs.LG stat.ML

    Meta Dropout: Learning to Perturb Features for Generalization

    Authors: Hae Beom Lee, Taewook Nam, Eunho Yang, Sung Ju Hwang

    Abstract: A machine learning model that generalizes well should obtain low errors on unseen test examples. Thus, if we know how to optimally perturb training examples to account for test examples, we may achieve better generalization performance. However, obtaining such perturbation is not possible in standard machine learning frameworks as the distribution of the test data is unknown. To tackle this challe… ▽ More

    Submitted 12 February, 2022; v1 submitted 30 May, 2019; originally announced May 2019.

  16. arXiv:1805.10896  [pdf, other

    stat.ML cs.LG

    Adaptive Network Sparsification with Dependent Variational Beta-Bernoulli Dropout

    Authors: Juho Lee, Saehoon Kim, Jaehong Yoon, Hae Beom Lee, Eunho Yang, Sung Ju Hwang

    Abstract: While variational dropout approaches have been shown to be effective for network sparsification, they are still suboptimal in the sense that they set the dropout rate for each neuron without consideration of the input data. With such input-independent dropout, each neuron is evolved to be generic across inputs, which makes it difficult to sparsify networks without accuracy loss. To overcome this l… ▽ More

    Submitted 3 March, 2019; v1 submitted 28 May, 2018; originally announced May 2018.

  17. arXiv:1805.09653  [pdf, other

    stat.ML cs.AI cs.LG

    Uncertainty-Aware Attention for Reliable Interpretation and Prediction

    Authors: Jay Heo, Hae Beom Lee, Saehoon Kim, Juho Lee, Kwang Joon Kim, Eunho Yang, Sung Ju Hwang

    Abstract: Attention mechanism is effective in both focusing the deep learning models on relevant features and interpreting them. However, attentions may be unreliable since the networks that generate them are often trained in a weakly-supervised manner. To overcome this limitation, we introduce the notion of input-dependent uncertainty to the attention mechanism, such that it generates attention for each fe… ▽ More

    Submitted 24 May, 2018; originally announced May 2018.

  18. arXiv:1712.07834  [pdf, other

    cs.LG

    DropMax: Adaptive Variational Softmax

    Authors: Hae Beom Lee, Juho Lee, Saehoon Kim, Eunho Yang, Sung Ju Hwang

    Abstract: We propose DropMax, a stochastic version of softmax classifier which at each iteration drops non-target classes according to dropout probabilities adaptively decided for each instance. Specifically, we overlay binary masking variables over class output probabilities, which are input-adaptively learned via variational inference. This stochastic regularization has an effect of building an ensemble c… ▽ More

    Submitted 2 November, 2018; v1 submitted 21 December, 2017; originally announced December 2017.

  19. arXiv:1708.00260  [pdf, other

    cs.LG stat.ML

    Deep Asymmetric Multi-task Feature Learning

    Authors: Hae Beom Lee, Eunho Yang, Sung Ju Hwang

    Abstract: We propose Deep Asymmetric Multitask Feature Learning (Deep-AMTFL) which can learn deep representations shared across multiple tasks while effectively preventing negative transfer that may happen in the feature sharing process. Specifically, we introduce an asymmetric autoencoder term that allows reliable predictors for the easy tasks to have high contribution to the feature learning while suppres… ▽ More

    Submitted 30 June, 2018; v1 submitted 1 August, 2017; originally announced August 2017.

  20. arXiv:cmp-lg/9606032  [pdf, ps

    cs.CL

    Integrating Multiple Knowledge Sources to Disambiguate Word Sense: An Exemplar-Based Approach

    Authors: Hwee Tou Ng, Hian Beng Lee

    Abstract: In this paper, we present a new approach for word sense disambiguation (WSD) using an exemplar-based learning algorithm. This approach integrates a diverse set of knowledge sources to disambiguate word sense, including part of speech of neighboring words, morphological form, the unordered set of surrounding words, local collocations, and verb-object syntactic relation. We tested our WSD program,… ▽ More

    Submitted 29 June, 1996; originally announced June 1996.

    Comments: In Proceedings of ACL96, 8 pages

    Journal ref: ACL-96