Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–47 of 47 results for author: Kim, T

Searching in archive stat. Search in all archives.
.
  1. arXiv:2408.13751  [pdf, other

    stat.ML cs.LG math.OC

    Improved identification of breakpoints in piecewise regression and its applications

    Authors: Taehyeong Kim, Hyungu Lee, Hayoung Choi

    Abstract: Identifying breakpoints in piecewise regression is critical in enhancing the reliability and interpretability of data fitting. In this paper, we propose novel algorithms based on the greedy algorithm to accurately and efficiently identify breakpoints in piecewise polynomial regression. The algorithm updates the breakpoints to minimize the error by exploring the neighborhood of each breakpoint. It… ▽ More

    Submitted 27 August, 2024; v1 submitted 25 August, 2024; originally announced August 2024.

    Comments: 13 pages, 6 figures

  2. arXiv:2407.10784  [pdf, other

    cs.LG cs.AI stat.ML

    AdapTable: Test-Time Adaptation for Tabular Data via Shift-Aware Uncertainty Calibrator and Label Distribution Handler

    Authors: Changhun Kim, Taewon Kim, Seungyeon Woo, June Yong Yang, Eunho Yang

    Abstract: In real-world scenarios, tabular data often suffer from distribution shifts that threaten the performance of machine learning models. Despite its prevalence and importance, handling distribution shifts in the tabular domain remains underexplored due to the inherent challenges within the tabular data itself. In this sense, test-time adaptation (TTA) offers a promising solution by adapting models to… ▽ More

    Submitted 26 August, 2024; v1 submitted 15 July, 2024; originally announced July 2024.

    Comments: Under Review at AAAI 2025

  3. arXiv:2404.13321  [pdf

    stat.AP eess.SY

    Accelerated System-Reliability-based Disaster Resilience Analysis for Structural Systems

    Authors: Taeyong Kim, Sang-ri Yi

    Abstract: Resilience has emerged as a crucial concept for evaluating structural performance under disasters because of its ability to extend beyond traditional risk assessments, accounting for a system's ability to minimize disruptions and maintain functionality during recovery. To facilitate the holistic understanding of resilience performance in structural systems, a system-reliability-based disaster resi… ▽ More

    Submitted 20 April, 2024; originally announced April 2024.

    Comments: 25 pages, 18 figures

  4. arXiv:2312.03386  [pdf, other

    cs.LG stat.ML

    An Infinite-Width Analysis on the Jacobian-Regularised Training of a Neural Network

    Authors: Taeyoung Kim, Hongseok Yang

    Abstract: The recent theoretical analysis of deep neural networks in their infinite-width limits has deepened our understanding of initialisation, feature learning, and training of those networks, and brought new practical techniques for finding appropriate hyperparameters, learning network weights, and performing inference. In this paper, we broaden this line of research by showing that this infinite-width… ▽ More

    Submitted 21 August, 2024; v1 submitted 6 December, 2023; originally announced December 2023.

    Comments: Accepted at ICML 2024. 74 pages, 18 figures

  5. arXiv:2310.13349  [pdf, other

    stat.ML cs.CV cs.LG

    DeepFDR: A Deep Learning-based False Discovery Rate Control Method for Neuroimaging Data

    Authors: Taehyo Kim, Hai Shu, Qiran Jia, Mony J. de Leon

    Abstract: Voxel-based multiple testing is widely used in neuroimaging data analysis. Traditional false discovery rate (FDR) control methods often ignore the spatial dependence among the voxel-based tests and thus suffer from substantial loss of testing power. While recent spatial FDR control methods have emerged, their validity and optimality remain questionable when handling the complex spatial dependencie… ▽ More

    Submitted 10 March, 2024; v1 submitted 20 October, 2023; originally announced October 2023.

    Journal ref: Proceedings of The 27th International Conference on Artificial Intelligence and Statistics (AISTATS 2024), PMLR 238:946-954, 2024

  6. arXiv:2307.09254  [pdf, other

    cs.LG cs.CL stat.ML

    PAC Neural Prediction Set Learning to Quantify the Uncertainty of Generative Language Models

    Authors: Sangdon Park, Taesoo Kim

    Abstract: Uncertainty learning and quantification of models are crucial tasks to enhance the trustworthiness of the models. Importantly, the recent surge of generative language models (GLMs) emphasizes the need for reliable uncertainty quantification due to the concerns on generating hallucinated facts. In this paper, we propose to learn neural prediction set models that comes with the probably approximatel… ▽ More

    Submitted 18 July, 2023; originally announced July 2023.

  7. arXiv:2307.08150  [pdf, other

    stat.ME

    Efficient Treatment Effect Estimation with Out-of-bag Post-stratification

    Authors: Taebin Kim, Lili Wang, Randy Lai, Sangho Yoon

    Abstract: Post-stratification is often used to estimate treatment effects with higher efficiency. However, the majority of existing post-stratification frameworks depend on prior knowledge of the distributions of covariates and assume that the units are classified into post-strata without error. We propose a novel method to determine a proper stratification rule by mapping the covariates into a post-stratif… ▽ More

    Submitted 12 September, 2023; v1 submitted 16 July, 2023; originally announced July 2023.

  8. arXiv:2304.04221  [pdf, other

    stat.ME

    Maximum Agreement Linear Prediction via the Concordance Correlation Coefficient

    Authors: Taeho Kim, George Luta, Matteo Bottai, Pierre Chausse, Gheorghe Doros, Edsel A. Pena

    Abstract: This paper examines distributional properties and predictive performance of the estimated maximum agreement linear predictor (MALP) introduced in Bottai, Kim, Lieberman, Luta, and Pena (2022) paper in The American Statistician, which is the linear predictor maximizing Lin's concordance correlation coefficient (CCC) between the predictor and the predictand. It is compared and contrasted, theoretica… ▽ More

    Submitted 10 February, 2024; v1 submitted 9 April, 2023; originally announced April 2023.

    MSC Class: 62J99; 62H20; 62F99

  9. arXiv:2303.15833  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Complementary Domain Adaptation and Generalization for Unsupervised Continual Domain Shift Learning

    Authors: Wonguk Cho, Jinha Park, Taesup Kim

    Abstract: Continual domain shift poses a significant challenge in real-world applications, particularly in situations where labeled data is not available for new domains. The challenge of acquiring knowledge in this problem setting is referred to as unsupervised continual domain shift learning. Existing methods for domain adaptation and generalization have limitations in addressing this issue, as they focus… ▽ More

    Submitted 13 October, 2023; v1 submitted 28 March, 2023; originally announced March 2023.

    Comments: ICCV 2023

  10. arXiv:2210.13533  [pdf, other

    cs.LG cs.AI stat.ML

    Sufficient Invariant Learning for Distribution Shift

    Authors: Taero Kim, Sungjun Lim, Kyungwoo Song

    Abstract: Machine learning algorithms have shown remarkable performance in diverse applications. However, it is still challenging to guarantee performance in distribution shifts when distributions of training and test datasets are different. There have been several approaches to improve the performance in distribution shift cases by learning invariant features across groups or domains. However, we observe t… ▽ More

    Submitted 28 August, 2023; v1 submitted 24 October, 2022; originally announced October 2022.

  11. arXiv:2209.05150  [pdf, other

    cs.LG stat.ML

    Bounding the Rademacher Complexity of Fourier neural operators

    Authors: Taeyoung Kim, Myungjoo Kang

    Abstract: A Fourier neural operator (FNO) is one of the physics-inspired machine learning methods. In particular, it is a neural operator. In recent times, several types of neural operators have been developed, e.g., deep operator networks, Graph neural operator (GNO), and Multiwavelet-based operator (MWTO). Compared with other models, the FNO is computationally efficient and can learn nonlinear operators b… ▽ More

    Submitted 26 September, 2022; v1 submitted 12 September, 2022; originally announced September 2022.

    Comments: 21 pages, 19 figures

  12. arXiv:2207.07533  [pdf, ps, other

    stat.ME cs.LG stat.ML

    Selection of the Most Probable Best

    Authors: Taeho Kim, Kyoung-kuk Kim, Eunhye Song

    Abstract: We consider an expected-value ranking and selection (R&S) problem where all k solutions' simulation outputs depend on a common parameter whose uncertainty can be modeled by a distribution. We define the most probable best (MPB) to be the solution that has the largest probability of being optimal with respect to the distribution and design an efficient sequential sampling algorithm to learn the MPB… ▽ More

    Submitted 20 April, 2024; v1 submitted 15 July, 2022; originally announced July 2022.

  13. arXiv:2104.14695  [pdf, other

    stat.ME stat.AP

    Dynamic Gene Coexpression Analysis with Correlation Modeling

    Authors: Tae Hyun Kim, Dan Nicolae

    Abstract: In many transcriptomic studies, the correlation of genes might fluctuate with quantitative factors such as genetic ancestry. We propose a method that models the covariance between two variables to vary against a continuous covariate. For the bivariate case, the proposed score test statistic is computationally simple and robust to model misspecification of the covariance term. Subsequently, the met… ▽ More

    Submitted 29 April, 2021; originally announced April 2021.

  14. arXiv:2103.00083  [pdf, other

    stat.ML cs.LG

    Flexible Model Aggregation for Quantile Regression

    Authors: Rasool Fakoor, Taesup Kim, Jonas Mueller, Alexander J. Smola, Ryan J. Tibshirani

    Abstract: Quantile regression is a fundamental problem in statistical learning motivated by a need to quantify uncertainty in predictions, or to model a diverse population without being overly reductive. For instance, epidemiological forecasts, cost estimates, and revenue predictions all benefit from being able to quantify the range of possible values accurately. As such, many models have been developed for… ▽ More

    Submitted 15 April, 2023; v1 submitted 26 February, 2021; originally announced March 2021.

    Comments: Accepted at JMLR 2023

  15. arXiv:2101.02491  [pdf, ps, other

    math.ST stat.ME

    Density Deconvolution with Non-Standard Error Distributions: Rates of Convergence and Adaptive Estimation

    Authors: Alexander Goldenshluger, Taeho Kim

    Abstract: It is a typical standard assumption in the density deconvolution problem that the characteristic function of the measurement error distribution is non-zero on the real line. While this condition is assumed in the majority of existing works on the topic, there are many problem instances of interest where it is violated. In this paper we focus on non--standard settings where the characteristic funct… ▽ More

    Submitted 7 January, 2021; originally announced January 2021.

    Comments: 32 pages

    MSC Class: 62G07; 62G20

  16. arXiv:2012.03501  [pdf, other

    cs.LG stat.ML

    Adaptive Local Bayesian Optimization Over Multiple Discrete Variables

    Authors: Taehyeon Kim, Jaeyeon Ahn, Nakyil Kim, Seyoung Yun

    Abstract: In the machine learning algorithms, the choice of the hyperparameter is often an art more than a science, requiring labor-intensive search with expert experience. Therefore, automation on hyperparameter optimization to exclude human intervention is a great appeal, especially for the black-box functions. Recently, there have been increasing demands of solving such concealed tasks for better general… ▽ More

    Submitted 7 December, 2020; originally announced December 2020.

    Comments: workshop at NeurIPS 2020 Competition Track on Black-Box Optimization Challenge

  17. arXiv:2010.01792  [pdf, other

    cs.LG cs.CV cs.MA stat.ML

    Can we Generalize and Distribute Private Representation Learning?

    Authors: Sheikh Shams Azam, Taejin Kim, Seyyedali Hosseinalipour, Carlee Joe-Wong, Saurabh Bagchi, Christopher Brinton

    Abstract: We study the problem of learning representations that are private yet informative, i.e., provide information about intended "ally" targets while hiding sensitive "adversary" attributes. We propose Exclusion-Inclusion Generative Adversarial Network (EIGAN), a generalized private representation learning (PRL) architecture that accounts for multiple ally and adversary attributes unlike existing PRL s… ▽ More

    Submitted 30 January, 2022; v1 submitted 5 October, 2020; originally announced October 2020.

    Comments: In Proceedings of the 25th International Conference on Artificial Intelligence and Statistics (AISTATS) 2022

  18. arXiv:2007.02105  [pdf, other

    stat.ME stat.AP

    Prediction Regions for Poisson and Over-Dispersed Poisson Regression Models with Applications to Forecasting Number of Deaths during the COVID-19 Pandemic

    Authors: T. KIm, B. Lieberman, G. Luta, E. Pena

    Abstract: Motivated by the current Coronavirus Disease (COVID-19) pandemic, which is due to the SARS-CoV-2 virus, and the important problem of forecasting daily deaths and cumulative deaths, this paper examines the construction of prediction regions or intervals under the Poisson regression model and for an over-dispersed Poisson regression model. For the Poisson regression model, several prediction regions… ▽ More

    Submitted 6 July, 2020; v1 submitted 4 July, 2020; originally announced July 2020.

    Comments: There are 16 Figures with some containing one to four plot panels. The appendix section are supplementary materials. Without these supplementary materials, there are 35 pages in this manuscript

    MSC Class: Primary: 62J02; 62P99; Secondary: 62F99; 62M10

  19. arXiv:2006.09679  [pdf, other

    cs.LG cs.CV stat.ML

    FrostNet: Towards Quantization-Aware Network Architecture Search

    Authors: Taehoon Kim, YoungJoon Yoo, Jihoon Yang

    Abstract: INT8 quantization has become one of the standard techniques for deploying convolutional neural networks (CNNs) on edge devices to reduce the memory and computational resource usages. By analyzing quantized performances of existing mobile-target network architectures, we can raise an issue regarding the importance of network architecture for optimal INT8 quantization. In this paper, we present a ne… ▽ More

    Submitted 30 November, 2020; v1 submitted 17 June, 2020; originally announced June 2020.

  20. arXiv:2003.01860  [pdf, ps, other

    stat.AP

    Designing a Bonus-Malus system reflecting the claim size under the dependent frequency-severity model

    Authors: Rosy Oh, Joseph H. T. Kim, Jae Youn Ahn

    Abstract: In auto insurance, a Bonus-Malus System (BMS) is commonly used as a posteriori risk classification mechanism to set the premium for the next contract period based on a policyholder's claim history. Even though recent literature reports evidence of a significant dependence between frequency and severity, the current BMS practice is to use a frequency-based transition rule while ignoring severity in… ▽ More

    Submitted 3 March, 2020; originally announced March 2020.

  21. arXiv:2002.11903  [pdf, other

    cs.LG stat.ML

    Acceleration of Actor-Critic Deep Reinforcement Learning for Visual Grasping in Clutter by State Representation Learning Based on Disentanglement of a Raw Input Image

    Authors: Taewon Kim, Yeseong Park, Youngbin Park, Il Hong Suh

    Abstract: For a robotic grasping task in which diverse unseen target objects exist in a cluttered environment, some deep learning-based methods have achieved state-of-the-art results using visual input directly. In contrast, actor-critic deep reinforcement learning (RL) methods typically perform very poorly when grasping diverse objects, especially when learning from raw images and sparse rewards. To make t… ▽ More

    Submitted 26 February, 2020; originally announced February 2020.

  22. arXiv:1912.13366  [pdf, other

    cs.LG cs.AI stat.ML

    Fast and Accurate Transferability Measurement for Heterogeneous Multivariate Data

    Authors: Seungcheol Park, Huiwen Xu, Taehun Kim, Inhwan Hwang, Kyung-Jun Kim, U Kang

    Abstract: Given a set of heterogeneous source datasets with their classifiers, how can we quickly find the most useful source dataset for a specific target task? We address the problem of measuring transferability between source and target datasets, where the source and the target have different feature spaces and distributions. We propose Transmeter, a fast and accurate method to estimate the transferabili… ▽ More

    Submitted 29 January, 2021; v1 submitted 23 December, 2019; originally announced December 2019.

  23. arXiv:1912.04871  [pdf, other

    cs.LG stat.ML

    Deep symbolic regression: Recovering mathematical expressions from data via risk-seeking policy gradients

    Authors: Brenden K. Petersen, Mikel Landajuela, T. Nathan Mundhenk, Claudio P. Santiago, Soo K. Kim, Joanne T. Kim

    Abstract: Discovering the underlying mathematical expressions describing a dataset is a core challenge for artificial intelligence. This is the problem of $\textit{symbolic regression}$. Despite recent advances in training neural networks to solve complex tasks, deep learning approaches to symbolic regression are underexplored. We propose a framework that leverages deep learning for symbolic regression via… ▽ More

    Submitted 5 April, 2021; v1 submitted 10 December, 2019; originally announced December 2019.

    Comments: Published at International Conference on Learning Representations, 2021

    Report number: LLNL-CONF-790457

    Journal ref: International Conference on Learning Representations, 2021

  24. arXiv:1912.03756  [pdf, other

    stat.ME

    Improved Multiple Confidence Intervals via Thresholding Informed by Prior Information

    Authors: Taeho Kim, Edsel A. Pena

    Abstract: Consider a statistical problem where a set of parameters are of interest to a researcher. Then multiple confidence intervals can be constructed to infer the set of parameters simultaneously. The constructed multiple confidence intervals are the realization of a multiple interval estimator (MIE), the main focus of this study. In particular, a thresholding approach is introduced to improve the perfo… ▽ More

    Submitted 8 December, 2019; originally announced December 2019.

    Comments: 34 pages and 7 figures

    MSC Class: 62F25; 62H12; 62H15

  25. arXiv:1910.00775  [pdf, other

    cs.LG cs.AI stat.ML

    Variational Temporal Abstraction

    Authors: Taesup Kim, Sungjin Ahn, Yoshua Bengio

    Abstract: We introduce a variational approach to learning and inference of temporally hierarchical structure and representation for sequential data. We propose the Variational Temporal Abstraction (VTA), a hierarchical recurrent state space model that can infer the latent temporal structure and thus perform the stochastic state transition hierarchically. We also propose to apply this model to implement the… ▽ More

    Submitted 2 October, 2019; originally announced October 2019.

    Comments: Accepted in NeurIPS 2019

  26. arXiv:1906.05956  [pdf, other

    cs.LG cs.CV eess.IV stat.ML

    Scalable Neural Architecture Search for 3D Medical Image Segmentation

    Authors: Sungwoong Kim, Ildoo Kim, Sungbin Lim, Woonhyuk Baek, Chiheon Kim, Hyungjoo Cho, Boogeon Yoon, Taesup Kim

    Abstract: In this paper, a neural architecture search (NAS) framework is proposed for 3D medical image segmentation, to automatically optimize a neural architecture from a large design space. Our NAS framework searches the structure of each layer including neural connectivities and operation types in both of the encoder and decoder. Since optimizing over a large discrete architecture space is difficult due… ▽ More

    Submitted 13 June, 2019; originally announced June 2019.

    Comments: 9 pages, 3 figures

  27. arXiv:1906.04691  [pdf, other

    cs.LG cs.CV stat.ML

    On Single Source Robustness in Deep Fusion Models

    Authors: Taewan Kim, Joydeep Ghosh

    Abstract: Algorithms that fuse multiple input sources benefit from both complementary and shared information. Shared information may provide robustness against faulty or noisy inputs, which is indispensable for safety-critical applications like self-driving cars. We investigate learning fusion algorithms that are robust against noise added to a single source. We first demonstrate that robustness against sin… ▽ More

    Submitted 16 October, 2019; v1 submitted 11 June, 2019; originally announced June 2019.

    Comments: Accepted to NeurIPS 2019

  28. arXiv:1905.13536  [pdf, other

    cs.CV cs.LG cs.PF eess.IV stat.ML

    Scaling Video Analytics on Constrained Edge Nodes

    Authors: Christopher Canel, Thomas Kim, Giulio Zhou, Conglong Li, Hyeontaek Lim, David G. Andersen, Michael Kaminsky, Subramanya R. Dulloor

    Abstract: As video camera deployments continue to grow, the need to process large volumes of real-time data strains wide area network infrastructure. When per-camera bandwidth is limited, it is infeasible for applications such as traffic monitoring and pedestrian tracking to offload high-quality video streams to a datacenter. This paper presents FilterForward, a new edge-to-cloud system that enables datacen… ▽ More

    Submitted 24 May, 2019; originally announced May 2019.

    Comments: This paper is an extended version of a paper with the same title published in the 2nd SysML Conference, SysML '19 (Canel et. al., 2019)

  29. arXiv:1905.00397  [pdf, other

    cs.LG cs.CV stat.ML

    Fast AutoAugment

    Authors: Sungbin Lim, Ildoo Kim, Taesup Kim, Chiheon Kim, Sungwoong Kim

    Abstract: Data augmentation is an essential technique for improving generalization ability of deep learning models. Recently, AutoAugment has been proposed as an algorithm to automatically search for augmentation policies from a dataset and has significantly enhanced performances on many image recognition tasks. However, its search method requires thousands of GPU hours even for a relatively small dataset.… ▽ More

    Submitted 25 May, 2019; v1 submitted 1 May, 2019; originally announced May 2019.

    Comments: 8 pages, 2 figure

    Report number: NeurIPS/2019/12

  30. arXiv:1902.06562  [pdf, other

    cs.LG eess.SP stat.ML

    Intra- and Inter-epoch Temporal Context Network (IITNet) Using Sub-epoch Features for Automatic Sleep Scoring on Raw Single-channel EEG

    Authors: Hogeon Seo, Seunghyeok Back, Seongju Lee, Deokhwan Park, Tae Kim, Kyoobin Lee

    Abstract: A deep learning model, named IITNet, is proposed to learn intra- and inter-epoch temporal contexts from raw single-channel EEG for automatic sleep scoring. To classify the sleep stage from half-minute EEG, called an epoch, sleep experts investigate sleep-related events and consider the transition rules between the found events. Similarly, IITNet extracts representative features at a sub-epoch leve… ▽ More

    Submitted 10 June, 2020; v1 submitted 18 February, 2019; originally announced February 2019.

    Comments: First three authors contributed equally to this work; Accepted manuscript for Biomedical Signal Processing and Control (BSPC); 12 pages, 6 figures;

  31. arXiv:1902.04224  [pdf, other

    cs.LG stat.ML

    Effective Network Compression Using Simulation-Guided Iterative Pruning

    Authors: Dae-Woong Jeong, Jaehun Kim, Youngseok Kim, Tae-Ho Kim, Myungsu Chae

    Abstract: Existing high-performance deep learning models require very intensive computing. For this reason, it is difficult to embed a deep learning model into a system with limited resources. In this paper, we propose the novel idea of the network compression as a method to solve this limitation. The principle of this idea is to make iterative pruning more effective and sophisticated by simulating the redu… ▽ More

    Submitted 11 February, 2019; originally announced February 2019.

    Comments: Submitted to NIPS 2018 MLPCD2

    MSC Class: 68T05

  32. arXiv:1812.08997  [pdf, other

    cs.LG stat.ML

    Stochastic Doubly Robust Gradient

    Authors: Kanghoon Lee, Jihye Choi, Moonsu Cha, Jung-Kwon Lee, Taeyoon Kim

    Abstract: When training a machine learning model with observational data, it is often encountered that some values are systemically missing. Learning from the incomplete data in which the missingness depends on some covariates may lead to biased estimation of parameters and even harm the fairness of decision outcome. This paper proposes how to adjust the causal effect of covariates on the missingness when t… ▽ More

    Submitted 21 December, 2018; originally announced December 2018.

    Comments: 9 pages, 2 figures

  33. arXiv:1812.02341  [pdf, other

    cs.LG stat.ML

    Quantifying Generalization in Reinforcement Learning

    Authors: Karl Cobbe, Oleg Klimov, Chris Hesse, Taehoon Kim, John Schulman

    Abstract: In this paper, we investigate the problem of overfitting in deep reinforcement learning. Among the most common benchmarks in RL, it is customary to use the same environments for both training and testing. This practice offers relatively little insight into an agent's ability to generalize. We address this issue by using procedurally generated environments to construct distinct training and test se… ▽ More

    Submitted 14 July, 2019; v1 submitted 5 December, 2018; originally announced December 2018.

  34. arXiv:1810.02358  [pdf, other

    cs.LG cs.CL cs.CV stat.ML

    Transfer Learning via Unsupervised Task Discovery for Visual Question Answering

    Authors: Hyeonwoo Noh, Taehoon Kim, Jonghwan Mun, Bohyung Han

    Abstract: We study how to leverage off-the-shelf visual and linguistic data to cope with out-of-vocabulary answers in visual question answering task. Existing large-scale visual datasets with annotations such as image class labels, bounding boxes and region descriptions are good sources for learning rich and diverse visual concepts. However, it is not straightforward how the visual concepts can be captured… ▽ More

    Submitted 7 April, 2019; v1 submitted 3 October, 2018; originally announced October 2018.

    Comments: CVPR 2019

  35. arXiv:1809.00758  [pdf

    cs.LG cs.CV cs.SD eess.AS stat.ML

    End-to-end Multimodal Emotion and Gender Recognition with Dynamic Joint Loss Weights

    Authors: Myungsu Chae, Tae-Ho Kim, Young Hoon Shin, June-Woo Kim, Soo-Young Lee

    Abstract: Multi-task learning is a method for improving the generalizability of multiple tasks. In order to perform multiple classification tasks with one neural network model, the losses of each task should be combined. Previous studies have mostly focused on multiple prediction tasks using joint loss with static weights for training models, choosing the weights between tasks without making sufficient cons… ▽ More

    Submitted 2 October, 2018; v1 submitted 3 September, 2018; originally announced September 2018.

    Comments: IROS 2018 Workshop on Crossmodal Learning for Intelligent Robotics

    MSC Class: 68T05

  36. arXiv:1806.03836  [pdf, other

    cs.LG stat.ML

    Bayesian Model-Agnostic Meta-Learning

    Authors: Taesup Kim, Jaesik Yoon, Ousmane Dia, Sungwoong Kim, Yoshua Bengio, Sungjin Ahn

    Abstract: Learning to infer Bayesian posterior from a few-shot dataset is an important step towards robust meta-learning due to the model uncertainty inherent in the problem. In this paper, we propose a novel Bayesian model-agnostic meta-learning method. The proposed method combines scalable gradient-based meta-learning with nonparametric variational inference in a principled probabilistic framework. During… ▽ More

    Submitted 18 November, 2018; v1 submitted 11 June, 2018; originally announced June 2018.

    Comments: First two authors contributed equally. 15 pages with appendix including experimental details. Accepted in NIPS 2018

  37. arXiv:1806.02071  [pdf, other

    cs.LG cs.GR physics.comp-ph physics.flu-dyn stat.ML

    Deep Fluids: A Generative Network for Parameterized Fluid Simulations

    Authors: Byungsoo Kim, Vinicius C. Azevedo, Nils Thuerey, Theodore Kim, Markus Gross, Barbara Solenthaler

    Abstract: This paper presents a novel generative model to synthesize fluid simulations from a set of reduced parameters. A convolutional neural network is trained on a collection of discrete, parameterizable fluid simulation velocity fields. Due to the capability of deep learning architectures to learn representative features of the data, our generative model is able to accurately approximate the training d… ▽ More

    Submitted 1 February, 2019; v1 submitted 6 June, 2018; originally announced June 2018.

    Comments: Computer Graphics Forum (Proceedings of EUROGRAPHICS 2019), additional materials: http://www.byungsoo.me/project/deep-fluids/

    Journal ref: Computer Graphics Forum (Proc. Eurographics), 38, 2 (2019), 59-70

  38. arXiv:1805.10724  [pdf, other

    cs.LG cs.HC stat.ML

    RetainVis: Visual Analytics with Interpretable and Interactive Recurrent Neural Networks on Electronic Medical Records

    Authors: Bum Chul Kwon, Min-Je Choi, Joanne Taery Kim, Edward Choi, Young Bin Kim, Soonwook Kwon, Jimeng Sun, Jaegul Choo

    Abstract: We have recently seen many successful applications of recurrent neural networks (RNNs) on electronic medical records (EMRs), which contain histories of patients' diagnoses, medications, and other various events, in order to predict the current and future states of patients. Despite the strong performance of RNNs, it is often challenging for users to understand why the model makes a particular pred… ▽ More

    Submitted 23 October, 2018; v1 submitted 27 May, 2018; originally announced May 2018.

    Comments: Accepted at IEEE VIS 2018. To appear in IEEE Transactions on Visualization and Computer Graphics in January 2019

  39. arXiv:1801.06700  [pdf, other

    cs.CL cs.AI cs.LG cs.NE stat.ML

    A Deep Reinforcement Learning Chatbot (Short Version)

    Authors: Iulian V. Serban, Chinnadhurai Sankar, Mathieu Germain, Saizheng Zhang, Zhouhan Lin, Sandeep Subramanian, Taesup Kim, Michael Pieper, Sarath Chandar, Nan Rosemary Ke, Sai Rajeswar, Alexandre de Brebisson, Jose M. R. Sotelo, Dendi Suhubdy, Vincent Michalski, Alexandre Nguyen, Joelle Pineau, Yoshua Bengio

    Abstract: We present MILABOT: a deep reinforcement learning chatbot developed by the Montreal Institute for Learning Algorithms (MILA) for the Amazon Alexa Prize competition. MILABOT is capable of conversing with humans on popular small talk topics through both speech and text. The system consists of an ensemble of natural language generation and retrieval models, including neural network and template-based… ▽ More

    Submitted 20 January, 2018; originally announced January 2018.

    Comments: 9 pages, 1 figure, 2 tables; presented at NIPS 2017, Conversational AI: "Today's Practice and Tomorrow's Potential" Workshop

    ACM Class: I.5.1; I.2.7

  40. arXiv:1711.07433  [pdf, other

    stat.ML cs.LG

    Relaxed Oracles for Semi-Supervised Clustering

    Authors: Taewan Kim, Joydeep Ghosh

    Abstract: Pairwise "same-cluster" queries are one of the most widely used forms of supervision in semi-supervised clustering. However, it is impractical to ask human oracles to answer every query correctly. In this paper, we study the influence of allowing "not-sure" answers from a weak oracle and propose an effective algorithm to handle such uncertainties in query responses. Two realistic weak oracle model… ▽ More

    Submitted 20 November, 2017; originally announced November 2017.

    Comments: NIPS 2017 Workshop: Learning with Limited Labeled Data (LLD 2017)

  41. arXiv:1709.03202  [pdf, other

    stat.ML cs.LG

    Semi-Supervised Active Clustering with Weak Oracles

    Authors: Taewan Kim, Joydeep Ghosh

    Abstract: Semi-supervised active clustering (SSAC) utilizes the knowledge of a domain expert to cluster data points by interactively making pairwise "same-cluster" queries. However, it is impractical to ask human oracles to answer every pairwise query. In this paper, we study the influence of allowing "not-sure" answers from a weak oracle and propose algorithms to efficiently handle uncertainties. Different… ▽ More

    Submitted 10 September, 2017; originally announced September 2017.

  42. arXiv:1709.02349  [pdf, other

    cs.CL cs.AI cs.LG cs.NE stat.ML

    A Deep Reinforcement Learning Chatbot

    Authors: Iulian V. Serban, Chinnadhurai Sankar, Mathieu Germain, Saizheng Zhang, Zhouhan Lin, Sandeep Subramanian, Taesup Kim, Michael Pieper, Sarath Chandar, Nan Rosemary Ke, Sai Rajeshwar, Alexandre de Brebisson, Jose M. R. Sotelo, Dendi Suhubdy, Vincent Michalski, Alexandre Nguyen, Joelle Pineau, Yoshua Bengio

    Abstract: We present MILABOT: a deep reinforcement learning chatbot developed by the Montreal Institute for Learning Algorithms (MILA) for the Amazon Alexa Prize competition. MILABOT is capable of conversing with humans on popular small talk topics through both speech and text. The system consists of an ensemble of natural language generation and retrieval models, including template-based models, bag-of-wor… ▽ More

    Submitted 5 November, 2017; v1 submitted 7 September, 2017; originally announced September 2017.

    Comments: 40 pages, 9 figures, 11 tables

    ACM Class: I.5.1; I.2.7

  43. arXiv:1707.08774  [pdf, other

    q-bio.QM stat.AP

    Topological Data Analysis of Clostridioides difficile Infection and Fecal Microbiota Transplantation

    Authors: Pavel Petrov, Stephen T Rush, Zhichun Zhai, Christine H Lee, Peter T Kim, Giseon Heo

    Abstract: Computational topologists recently developed a method, called persistent homology to analyze data presented in terms of similarity or dissimilarity. Indeed, persistent homology studies the evolution of topological features in terms of a single index, and is able to capture higher order features beyond the usual clustering techniques. There are three descriptive statistics of persistent homology, n… ▽ More

    Submitted 31 July, 2017; v1 submitted 27 July, 2017; originally announced July 2017.

    Comments: 20 pages, 8 figures

    MSC Class: 62-07

  44. arXiv:1607.08877  [pdf, other

    stat.ML q-bio.QM

    The Phylogenetic LASSO and the Microbiome

    Authors: Stephen T Rush, Christine H Lee, Washington Mio, Peter T Kim

    Abstract: Scientific investigations that incorporate next generation sequencing involve analyses of high-dimensional data where the need to organize, collate and interpret the outcomes are pressingly important. Currently, data can be collected at the microbiome level leading to the possibility of personalized medicine whereby treatments can be tailored at this scale. In this paper, we lay down a statistical… ▽ More

    Submitted 29 July, 2016; originally announced July 2016.

    Comments: 31 pages, 6 figures, 5 tables

    MSC Class: 62P10

  45. arXiv:1606.03439  [pdf, other

    cs.LG stat.ML

    Deep Directed Generative Models with Energy-Based Probability Estimation

    Authors: Taesup Kim, Yoshua Bengio

    Abstract: Training energy-based probabilistic models is confronted with apparently intractable sums, whose Monte Carlo estimation requires sampling from the estimated probability distribution in the inner loop of training. This can be approximately achieved by Markov chain Monte Carlo methods, but may still face a formidable obstacle that is the difficulty of mixing between modes with sharp concentrations o… ▽ More

    Submitted 10 June, 2016; originally announced June 2016.

  46. arXiv:1605.04955  [pdf, other

    stat.ML

    Probing the Geometry of Data with Diffusion Fréchet Functions

    Authors: Diego Hernán Díaz Martínez, Christine H. Lee, Peter T. Kim, Washington Mio

    Abstract: Many complex ecosystems, such as those formed by multiple microbial taxa, involve intricate interactions amongst various sub-communities. The most basic relationships are frequently modeled as co-occurrence networks in which the nodes represent the various players in the community and the weighted edges encode levels of interaction. In this setting, the composition of a community may be viewed as… ▽ More

    Submitted 7 March, 2017; v1 submitted 16 May, 2016; originally announced May 2016.

    Comments: 26 pages, 8 figures. Lemma 1b and Theorem 2 have been revised, as well as the results derived from them

    MSC Class: 62-07; 92C50

  47. arXiv:1410.3752  [pdf, ps, other

    cs.CV stat.ML

    Enhanced Random Forest with Image/Patch-Level Learning for Image Understanding

    Authors: Wai Lam Hoo, Tae-Kyun Kim, Yuru Pei, Chee Seng Chan

    Abstract: Image understanding is an important research domain in the computer vision due to its wide real-world applications. For an image understanding framework that uses the Bag-of-Words model representation, the visual codebook is an essential part. Random forest (RF) as a tree-structure discriminative codebook has been a popular choice. However, the performance of the RF can be degraded if the local pa… ▽ More

    Submitted 14 October, 2014; originally announced October 2014.

    Comments: Accepted in ICPR 2014 (Oral)