
Showing 1–28 of 28 results for author: Yun, T

Searching in archive cs.
  1. arXiv:2407.01624  [pdf, other]

    cs.LG cs.AI

    Guided Trajectory Generation with Diffusion Models for Offline Model-based Optimization

    Authors: Taeyoung Yun, Sujin Yun, Jaewoo Lee, Jinkyoo Park

    Abstract: Optimizing complex and high-dimensional black-box functions is ubiquitous in science and engineering fields. Unfortunately, the online evaluation of these functions is restricted due to time and safety constraints in most cases. In offline model-based optimization (MBO), we aim to find a design that maximizes the target function using only a pre-existing offline dataset. While prior methods consid…

    Submitted 29 June, 2024; originally announced July 2024.

    Comments: 29 pages, 11 figures, 17 tables

  2. arXiv:2405.16907  [pdf, other]

    cs.AI cs.LG

    GTA: Generative Trajectory Augmentation with Guidance for Offline Reinforcement Learning

    Authors: Jaewoo Lee, Sujin Yun, Taeyoung Yun, Jinkyoo Park

    Abstract: Offline Reinforcement Learning (Offline RL) presents challenges of learning effective decision-making policies from static datasets without any online interactions. Data augmentation techniques, such as noise injection and data synthesizing, aim to improve Q-function approximation by smoothing the learned state-action region. However, these methods often fall short of directly improving the qualit…

    Submitted 12 June, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

    Comments: Accepted (Spotlight) to ICLR 2024 Workshop on Generative Models for Decision Making. Jaewoo Lee and Sujin Yun are equal contribution authors

  3. arXiv:2404.12652  [pdf, other]

    cs.CV cs.AI cs.CL cs.LG

    Pre-trained Vision-Language Models Learn Discoverable Visual Concepts

    Authors: Yuan Zang, Tian Yun, Hao Tan, Trung Bui, Chen Sun

    Abstract: Do vision-language models (VLMs) pre-trained to caption an image of a "durian" learn visual concepts such as "brown" (color) and "spiky" (texture) at the same time? We aim to answer this question as visual concepts learned "for free" would enable wide applications such as neuro-symbolic reasoning or human-interpretable object classification. We assume that the visual concepts, if captured by pre-t…

    Submitted 19 April, 2024; originally announced April 2024.

  4. arXiv:2404.12444  [pdf, other]

    cs.CL cs.AI

    mOthello: When Do Cross-Lingual Representation Alignment and Cross-Lingual Transfer Emerge in Multilingual Models?

    Authors: Tianze Hua, Tian Yun, Ellie Pavlick

    Abstract: Many pretrained multilingual models exhibit cross-lingual transfer ability, which is often attributed to a learned language-neutral representation during pretraining. However, it remains unclear what factors contribute to the learning of a language-neutral representation, and whether the learned language-neutral representation suffices to facilitate cross-lingual transfer. We propose a synthetic t…

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: Accepted at Findings of NAACL 2024. Project Webpage: https://multilingual-othello.github.io/

  5. arXiv:2402.13562  [pdf, other]

    cs.CL

    Analysis of Multi-Source Language Training in Cross-Lingual Transfer

    Authors: Seong Hoon Lim, Taejun Yun, Jinhyeon Kim, Jihun Choi, Taeuk Kim

    Abstract: The successful adaptation of multilingual language models (LMs) to a specific language-task pair critically depends on the availability of data tailored for that condition. While cross-lingual transfer (XLT) methods have contributed to addressing this data scarcity problem, there still exists ongoing debate about the mechanisms behind their effectiveness. In this work, we focus on one of promising…

    Submitted 4 June, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

    Comments: Accepted to ACL 2024

  6. arXiv:2401.12987  [pdf, other]

    cs.CL cs.LG cs.SD eess.AS

    TelME: Teacher-leading Multimodal Fusion Network for Emotion Recognition in Conversation

    Authors: Taeyang Yun, Hyunkuk Lim, Jeonghwan Lee, Min Song

    Abstract: Emotion Recognition in Conversation (ERC) plays a crucial role in enabling dialogue systems to effectively respond to user requests. The emotions in a conversation can be identified by the representations from various modalities, such as audio, visual, and text. However, due to the weak contribution of non-verbal modalities to recognize emotions, multimodal ERC has always been considered a challen…

    Submitted 31 March, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

    Comments: NAACL 2024 main conference

  7. arXiv:2401.11246  [pdf]

    cs.CL cs.IR

    Prompt-RAG: Pioneering Vector Embedding-Free Retrieval-Augmented Generation in Niche Domains, Exemplified by Korean Medicine

    Authors: Bongsu Kang, Jundong Kim, Tae-Rim Yun, Chang-Eop Kim

    Abstract: We propose a natural language prompt-based retrieval augmented generation (Prompt-RAG), a novel approach to enhance the performance of generative large language models (LLMs) in niche domains. Conventional RAG methods mostly require vector embeddings, yet the suitability of generic LLM-based embedding representations for specialized domains remains uncertain. To explore and exemplify this point, w…

    Submitted 20 January, 2024; originally announced January 2024.

    Comments: 26 pages, 4 figures, 5 tables

    ACM Class: I.2.7; H.3.3; J.3

  8. arXiv:2311.02171  [pdf, other]

    cs.LG cs.AI

    Emergence of Abstract State Representations in Embodied Sequence Modeling

    Authors: Tian Yun, Zilai Zeng, Kunal Handa, Ashish V. Thapliyal, Bo Pang, Ellie Pavlick, Chen Sun

    Abstract: Decision making via sequence modeling aims to mimic the success of language models, where actions taken by an embodied agent are modeled as tokens to predict. Despite their promising performance, it remains unclear if embodied sequence modeling leads to the emergence of internal representations that represent the environmental state information. A model that lacks abstract state representations wo…

    Submitted 7 November, 2023; v1 submitted 3 November, 2023; originally announced November 2023.

    Comments: Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023). Project webpage: https://abstract-state-seqmodel.github.io/

  9. arXiv:2310.17166  [pdf, other]

    cs.CL

    X-SNS: Cross-Lingual Transfer Prediction through Sub-Network Similarity

    Authors: Taejun Yun, Jinhyeon Kim, Deokyeong Kang, Seong Hoon Lim, Jihoon Kim, Taeuk Kim

    Abstract: Cross-lingual transfer (XLT) is an emergent ability of multilingual language models that preserves their performance on a task to a significant extent when evaluated in languages that were not included in the fine-tuning process. While English, due to its widespread usage, is typically regarded as the primary language for model adaption in various tasks, recent studies have revealed that the effic…

    Submitted 26 October, 2023; originally announced October 2023.

    Comments: Accepted to EMNLP 2023 (Findings)

  10. arXiv:2310.12407  [pdf, other]

    cs.LG cs.AI eess.SY

    Classification-Aided Robust Multiple Target Tracking Using Neural Enhanced Message Passing

    Authors: Xianglong Bai, Zengfu Wang, Quan Pan, Tao Yun, Hua Lan

    Abstract: We address the challenge of tracking an unknown number of targets in strong clutter environments using measurements from a radar sensor. Leveraging the range-Doppler spectra information, we identify the measurement classes, which serve as additional information to enhance clutter rejection and data association, thus bolstering the robustness of target tracking. We first introduce a novel neural en…

    Submitted 18 October, 2023; originally announced October 2023.

    Comments: 15 pages

  11. arXiv:2310.02823  [pdf, other]

    cs.LG stat.ML

    Learning to Scale Logits for Temperature-Conditional GFlowNets

    Authors: Minsu Kim, Joohwan Ko, Taeyoung Yun, Dinghuai Zhang, Ling Pan, Woochang Kim, Jinkyoo Park, Emmanuel Bengio, Yoshua Bengio

    Abstract: GFlowNets are probabilistic models that sequentially generate compositional structures through a stochastic policy. Among GFlowNets, temperature-conditional GFlowNets can introduce temperature-based controllability for exploration and exploitation. We propose \textit{Logit-scaling GFlowNets} (Logit-GFN), a novel architectural design that greatly accelerates the training of temperature-conditional…

    Submitted 2 June, 2024; v1 submitted 4 October, 2023; originally announced October 2023.

    Comments: ICML 2024, 23 pages, 21 figures

  12. arXiv:2310.02710  [pdf, other]

    cs.LG stat.ML

    Local Search GFlowNets

    Authors: Minsu Kim, Taeyoung Yun, Emmanuel Bengio, Dinghuai Zhang, Yoshua Bengio, Sungsoo Ahn, Jinkyoo Park

    Abstract: Generative Flow Networks (GFlowNets) are amortized sampling methods that learn a distribution over discrete objects proportional to their rewards. GFlowNets exhibit a remarkable ability to generate diverse samples, yet occasionally struggle to consistently produce samples with high rewards due to over-exploration on wide sample space. This paper proposes to train GFlowNets with local search, which…

    Submitted 22 March, 2024; v1 submitted 4 October, 2023; originally announced October 2023.

    Comments: ICLR 2024 (Spotlight paper), 18 pages, 17 figures

  13. arXiv:2310.02462  [pdf, other]

    cs.RO cs.AI cs.HC

    Improved Inference of Human Intent by Combining Plan Recognition and Language Feedback

    Authors: Ifrah Idrees, Tian Yun, Naveen Sharma, Yunxin Deng, Nakul Gopalan, George Konidaris, Stefanie Tellex

    Abstract: Conversational assistive robots can aid people, especially those with cognitive impairments, to accomplish various tasks such as cooking meals, performing exercises, or operating machines. However, to interact with people effectively, robots must recognize human plans and goals from noisy observations of human actions, even when the user acts sub-optimally. Previous works on Plan and Goal Recognit…

    Submitted 3 October, 2023; originally announced October 2023.

    Comments: Published in IROS 2023

  14. arXiv:2310.02013  [pdf, other]

    cs.LG

    Spectral operator learning for parametric PDEs without data reliance

    Authors: Junho Choi, Taehyun Yun, Namjung Kim, Youngjoon Hong

    Abstract: In this paper, we introduce the Spectral Coefficient Learning via Operator Network (SCLON), a novel operator learning-based approach for solving parametric partial differential equations (PDEs) without the need for data harnessing. The cornerstone of our method is the spectral methodology that employs expansions using orthogonal functions, such as Fourier series and Legendre polynomials, enabling…

    Submitted 3 October, 2023; originally announced October 2023.

    Comments: 28 pages, 8 figures

  15. arXiv:2307.08893  [pdf, other]

    cs.LG q-bio.GN stat.ML

    Evaluating unsupervised disentangled representation learning for genomic discovery and disease risk prediction

    Authors: Taedong Yun

    Abstract: High-dimensional clinical data have become invaluable resources for genetic studies, due to their accessibility in biobank-scale datasets and the development of high performance modeling techniques especially using deep learning. Recent work has shown that low dimensional embeddings of these clinical data learned by variational autoencoders (VAE) can be used for genome-wide association studies and…

    Submitted 17 July, 2023; originally announced July 2023.

    Comments: Accepted to the 2023 ICML Workshop on Computational Biology. Honolulu, Hawaii, USA, 2023

  16. GPT-4 can pass the Korean National Licensing Examination for Korean Medicine Doctors

    Authors: Dongyeop Jang, Tae-Rim Yun, Choong-Yeol Lee, Young-Kyu Kwon, Chang-Eop Kim

    Abstract: Traditional Korean medicine (TKM) emphasizes individualized diagnosis and treatment. This uniqueness makes AI modeling difficult due to limited data and implicit processes. Large language models (LLMs) have demonstrated impressive medical inference, even without advanced training in medical texts. This study assessed the capabilities of GPT-4 in TKM, using the Korean National Licensing Examination…

    Submitted 16 November, 2023; v1 submitted 31 March, 2023; originally announced March 2023.

    Comments: 23 pages, 4 figures

    ACM Class: J.3

  17. arXiv:2301.12055  [pdf, other]

    cs.LG

    TIDo: Source-free Task Incremental Learning in Non-stationary Environments

    Authors: Abhinit Kumar Ambastha, Leong Tze Yun

    Abstract: This work presents an incremental learning approach for autonomous agents to learn new tasks in a non-stationary environment. Updating a DNN model-based agent to learn new target tasks requires us to store past training data and needs a large labeled target task dataset. Few-shot task incremental learning methods overcome the limitation of labeled target datasets by adapting trained models to lear…

    Submitted 27 January, 2023; originally announced January 2023.

  18. arXiv:2301.12054  [pdf, other]

    cs.LG

    Adversarial Learning Networks: Source-free Unsupervised Domain Incremental Learning

    Authors: Abhinit Kumar Ambastha, Leong Tze Yun

    Abstract: This work presents an approach for incrementally updating deep neural network (DNN) models in a non-stationary environment. DNN models are sensitive to changes in input data distribution, which limits their application to problem settings with stationary input datasets. In a non-stationary environment, updating a DNN model requires parameter re-training or model fine-tuning. We propose an unsuperv…

    Submitted 27 January, 2023; originally announced January 2023.

  19. arXiv:2211.05100  [pdf, other]

    cs.CL

    BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

    Authors: BigScience Workshop: Teven Le Scao, Angela Fan, Christopher Akiki, Ellie Pavlick, Suzana Ilić, Daniel Hesslow, Roman Castagné, Alexandra Sasha Luccioni, François Yvon, Matthias Gallé, Jonathan Tow, Alexander M. Rush, Stella Biderman, Albert Webson, Pawan Sasanka Ammanamanchi, Thomas Wang, Benoît Sagot, Niklas Muennighoff, Albert Villanova del Moral, Olatunji Ruwase, Rachel Bawden, Stas Bekman, Angelina McMillan-Major, et al. (369 additional authors not shown)

    Abstract: Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these capabilities have led to widespread adoption, most LLMs are developed by resource-rich organizations and are frequently kept from the public. As a step towards democratizing this powerful technology, we present BLOOM, a 176B-parameter open-access…

    Submitted 27 June, 2023; v1 submitted 9 November, 2022; originally announced November 2022.

  20. arXiv:2203.17271  [pdf, other]

    cs.CV cs.AI

    Do Vision-Language Pretrained Models Learn Composable Primitive Concepts?

    Authors: Tian Yun, Usha Bhalla, Ellie Pavlick, Chen Sun

    Abstract: Vision-language (VL) pretrained models have achieved impressive performance on multimodal reasoning and zero-shot recognition tasks. Many of these VL models are pretrained on unlabeled image and caption pairs from the internet. In this paper, we study whether representations of primitive concepts--such as colors, shapes, or the attributes of object parts--emerge automatically within these pretrain…

    Submitted 27 May, 2023; v1 submitted 31 March, 2022; originally announced March 2022.

    Comments: Published in Transactions on Machine Learning Research (TMLR) 2023

  21. arXiv:2203.03147  [pdf, other]

    cs.AI cs.MA

    Automatic Calibration Framework of Agent-Based Models for Dynamic and Heterogeneous Parameters

    Authors: Dongjun Kim, Tae-Sub Yun, Il-Chul Moon, Jang Won Bae

    Abstract: Agent-based models (ABMs) highlight the importance of simulation validation, such as qualitative face validation and quantitative empirical validation. In particular, we focused on quantitative validation by adjusting simulation input parameters of the ABM. This study introduces an automatic calibration framework that combines the suggested dynamic and heterogeneous calibration methods. Specifical…

    Submitted 7 March, 2022; originally announced March 2022.

    Comments: 3 pages, 6 figures, Autonomous Agents and Multiagent Systems (AAMAS 2022)

  22. arXiv:2109.10246  [pdf, other]

    cs.CL cs.AI cs.CV

    Does Vision-and-Language Pretraining Improve Lexical Grounding?

    Authors: Tian Yun, Chen Sun, Ellie Pavlick

    Abstract: Linguistic representations derived from text alone have been criticized for their lack of grounding, i.e., connecting words to their meanings in the physical world. Vision-and-Language (VL) models, trained jointly on text and image or video data, have been offered as a response to such criticisms. However, while VL pretraining has shown success on multimodal tasks such as visual question answering…

    Submitted 21 September, 2021; originally announced September 2021.

    Comments: Camera ready for Findings of EMNLP 2021

  23. arXiv:2103.12725  [pdf, other]

    stat.ML cs.LG math.ST

    SLOE: A Faster Method for Statistical Inference in High-Dimensional Logistic Regression

    Authors: Steve Yadlowsky, Taedong Yun, Cory McLean, Alexander D'Amour

    Abstract: Logistic regression remains one of the most widely used tools in applied statistics, machine learning and data science. However, in moderately high-dimensional problems, where the number of features $d$ is a non-negligible fraction of the sample size $n$, the logistic regression maximum likelihood estimator (MLE), and statistical procedures based on the large-sample approximation of its distribution,…

    Submitted 25 May, 2021; v1 submitted 23 March, 2021; originally announced March 2021.

  24. arXiv:2011.03395  [pdf, other]

    cs.LG stat.ML

    Underspecification Presents Challenges for Credibility in Modern Machine Learning

    Authors: Alexander D'Amour, Katherine Heller, Dan Moldovan, Ben Adlam, Babak Alipanahi, Alex Beutel, Christina Chen, Jonathan Deaton, Jacob Eisenstein, Matthew D. Hoffman, Farhad Hormozdiari, Neil Houlsby, Shaobo Hou, Ghassen Jerfel, Alan Karthikesalingam, Mario Lucic, Yian Ma, Cory McLean, Diana Mincu, Akinori Mitani, Andrea Montanari, Zachary Nado, Vivek Natarajan, Christopher Nielson, Thomas F. Osborne, et al. (15 additional authors not shown)

    Abstract: ML models often exhibit unexpectedly poor behavior when they are deployed in real-world domains. We identify underspecification as a key reason for these failures. An ML pipeline is underspecified when it can return many predictors with equivalently strong held-out performance in the training domain. Underspecification is common in modern ML pipelines, such as those based on deep learning. Predict…

    Submitted 24 November, 2020; v1 submitted 6 November, 2020; originally announced November 2020.

    Comments: Updates: Updated statistical analysis in Section 6; Additional citations

  25. arXiv:1908.03309  [pdf, other]

    cs.MA cs.CY cs.LG

    Automatic Calibration of Dynamic and Heterogeneous Parameters in Agent-based Model

    Authors: Dongjun Kim, Tae-Sub Yun, Il-Chul Moon

    Abstract: While simulations have been utilized in diverse domains, such as urban growth modeling, market dynamics modeling, etc; some of these applications may require validations based upon some real-world observations modeled in the simulation, as well. This validation has been categorized into either qualitative face-validation or quantitative empirical validation, but as the importance and the accumulat…

    Submitted 9 August, 2019; originally announced August 2019.

    Comments: 31 pages, 12 figures, Journal of Autonomous Agents and Multi-Agent Systems

  26. Eat & Tell: A Randomized Trial of Random-Loss Incentive to Increase Dietary Self-Tracking Compliance

    Authors: Palakorn Achananuparp, Ee-Peng Lim, Vibhanshu Abhishek, Tianjiao Yun

    Abstract: A growing body of evidence has shown that incorporating behavioral economics principles into the design of financial incentive programs helps improve their cost-effectiveness, promote individuals' short-term engagement, and increase compliance in health behavior interventions. Yet, their effects on long-term engagement have not been fully examined. In study designs where repeated administration of…

    Submitted 3 May, 2018; originally announced May 2018.

    Comments: Published at Digital Health 2018

  27. arXiv:1408.5552  [pdf]

    cs.CV

    Fuzzy and entropy facial recognition

    Authors: Jaejun Lee, Taeseon Yun

    Abstract: This paper suggests an effective method for facial recognition using fuzzy theory and Shannon entropy. Combination of fuzzy theory and Shannon entropy eliminates the complication of other methods. Shannon entropy calculates the ratio of an element between faces, and fuzzy theory calculates the membership of the entropy with 1. More details will be mentioned in Section 3. The learning performance…

    Submitted 24 August, 2014; originally announced August 2014.

    Comments: 5 pages

    MSC Class: 68T10

  28. arXiv:1208.4270  [pdf, ps, other]

    cs.DB

    ODYS: A Massively-Parallel Search Engine Using a DB-IR Tightly-Integrated Parallel DBMS

    Authors: Kyu-Young Whang, Tae-Seob Yun, Yeon-Mi Yeo, Il-Yeol Song, Hyuk-Yoon Kwon, In-Joong Kim

    Abstract: Recently, parallel search engines have been implemented based on scalable distributed file systems such as Google File System. However, we claim that building a massively-parallel search engine using a parallel DBMS can be an attractive alternative since it supports a higher-level (i.e., SQL-level) interface than that of a distributed file system for easy and less error-prone application developme…

    Submitted 21 August, 2012; originally announced August 2012.

    Comments: 34 pages, 13 figures