Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 225 results for author: Cho, Y

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.16994  [pdf, other

    eess.SP cs.AI

    Quantum Multi-Agent Reinforcement Learning for Cooperative Mobile Access in Space-Air-Ground Integrated Networks

    Authors: Gyu Seon Kim, Yeryeong Cho, Jaehyun Chung, Soohyun Park, Soyi Jung, Zhu Han, Joongheon Kim

    Abstract: Achieving global space-air-ground integrated network (SAGIN) access only with CubeSats presents significant challenges such as the access sustainability limitations in specific regions (e.g., polar regions) and the energy efficiency limitations in CubeSats. To tackle these problems, high-altitude long-endurance unmanned aerial vehicles (HALE-UAVs) can complement these CubeSat shortcomings for prov… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: 17 pages, 22 figures

  2. arXiv:2406.10590  [pdf, other

    cs.HC

    LLM-Mediated Domain-Specific Voice Agents: The Case of TextileBot

    Authors: Shu Zhong, Elia Gatti, James Hardwick, Miriam Ribul, Youngjun Cho, Marianna Obrist

    Abstract: Developing domain-specific conversational agents (CAs) has been challenged by the need for extensive domain-focused data. Recent advancements in Large Language Models (LLMs) make them a viable option as a knowledge backbone. LLMs behaviour can be enhanced through prompting, instructing them to perform downstream tasks in a zero-shot fashion (i.e. without training). To this end, we incorporated str… ▽ More

    Submitted 15 June, 2024; originally announced June 2024.

  3. arXiv:2406.06587  [pdf, other

    cs.CL cs.AI cs.HC

    Exploring Human-AI Perception Alignment in Sensory Experiences: Do LLMs Understand Textile Hand?

    Authors: Shu Zhong, Elia Gatti, Youngjun Cho, Marianna Obrist

    Abstract: Aligning large language models (LLMs) behaviour with human intent is critical for future AI. An important yet often overlooked aspect of this alignment is the perceptual alignment. Perceptual modalities like touch are more multifaceted and nuanced compared to other sensory modalities such as vision. This work investigates how well LLMs align with human touch experiences using the "textile hand" ta… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  4. arXiv:2406.05761  [pdf, other

    cs.CL

    The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models

    Authors: Seungone Kim, Juyoung Suk, Ji Yong Cho, Shayne Longpre, Chaeeun Kim, Dongkeun Yoon, Guijin Son, Yejin Cho, Sheikh Shafayat, Jinheon Baek, Sue Hyun Park, Hyeonbin Hwang, Jinkyung Jo, Hyowon Cho, Haebin Shin, Seongyun Lee, Hanseok Oh, Noah Lee, Namgyu Ho, Se June Joo, Miyoung Ko, Yoonjoo Lee, Hyungjoo Chae, Jamin Shin, Joel Jang , et al. (7 additional authors not shown)

    Abstract: As language models (LMs) become capable of handling a wide range of tasks, their evaluation is becoming as challenging as their development. Most generation benchmarks currently assess LMs using abstract evaluation criteria like helpfulness and harmlessness, which often lack the flexibility and granularity of human assessment. Additionally, these benchmarks tend to focus disproportionately on spec… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: Work in Progress

  5. arXiv:2406.01506  [pdf, other

    cs.CL cs.AI cs.LG stat.ML

    The Geometry of Categorical and Hierarchical Concepts in Large Language Models

    Authors: Kiho Park, Yo Joong Choe, Yibo Jiang, Victor Veitch

    Abstract: Understanding how semantic meaning is encoded in the representation spaces of large language models is a fundamental problem in interpretability. In this paper, we study the two foundational questions in this area. First, how are categorical concepts, such as {'mammal', 'bird', 'reptile', 'fish'}, represented? Second, how are hierarchical relations between concepts encoded? For example, how is the… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: Code is available at https://github.com/KihoPark/LLM_Categorical_Hierarchical_Representations

  6. arXiv:2405.18832  [pdf, other

    cs.LG cs.AI cs.AR

    MoNDE: Mixture of Near-Data Experts for Large-Scale Sparse Models

    Authors: Taehyun Kim, Kwanseok Choi, Youngmock Cho, Jaehoon Cho, Hyuk-Jae Lee, Jaewoong Sim

    Abstract: Mixture-of-Experts (MoE) large language models (LLM) have memory requirements that often exceed the GPU memory capacity, requiring costly parameter movement from secondary memories to the GPU for expert computation. In this work, we present Mixture of Near-Data Experts (MoNDE), a near-data computing solution that efficiently enables MoE LLM inference. MoNDE reduces the volume of MoE parameter move… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: Accepted to DAC 2024

  7. arXiv:2405.18732  [pdf, other

    physics.geo-ph cs.AI cs.LG physics.app-ph

    Gemini & Physical World: Large Language Models Can Estimate the Intensity of Earthquake Shaking from Multi-Modal Social Media Posts

    Authors: S. Mostafa Mousavi, Marc Stogaitis, Tajinder Gadh, Richard M Allen, Alexei Barski, Robert Bosch, Patrick Robertson, Nivetha Thiruverahan, Youngmin Cho, Aman Raj

    Abstract: This paper presents a novel approach to extract scientifically valuable information about Earth's physical phenomena from unconventional sources, such as multi-modal social media posts. Employing a state-of-the-art large language model (LLM), Gemini 1.5 Pro (Reid et al. 2024), we estimate earthquake ground shaking intensity from these unstructured posts. The model's output, in the form of Modified… ▽ More

    Submitted 14 June, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

  8. arXiv:2405.18148  [pdf, other

    cs.CV cs.AI

    Learning to Detour: Shortcut Mitigating Augmentation for Weakly Supervised Semantic Segmentation

    Authors: JuneHyoung Kwon, Eunju Lee, Yunsung Cho, YoungBin Kim

    Abstract: Weakly supervised semantic segmentation (WSSS) employing weak forms of labels has been actively studied to alleviate the annotation cost of acquiring pixel-level labels. However, classifiers trained on biased datasets tend to exploit shortcut features and make predictions based on spurious correlations between certain backgrounds and objects, leading to a poor generalization performance. In this p… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: Accepted to WACV 2024

  9. arXiv:2405.16424  [pdf, other

    cs.HC cs.AI cs.LG

    Improving Health Professionals' Onboarding with AI and XAI for Trustworthy Human-AI Collaborative Decision Making

    Authors: Min Hun Lee, Silvana Xin Yi Choo, Shamala D/O Thilarajah

    Abstract: With advanced AI/ML, there has been growing research on explainable AI (XAI) and studies on how humans interact with AI and XAI for effective human-AI collaborative decision-making. However, we still have a lack of understanding of how AI systems and XAI should be first presented to users without technical backgrounds. In this paper, we present the findings of semi-structured interviews with healt… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

  10. arXiv:2405.11855  [pdf, other

    cs.RO

    Salience-guided Ground Factor for Robust Localization of Delivery Robots in Complex Urban Environments

    Authors: Jooyong Park, Jungwoo Lee, Euncheol Choi, Younggun Cho

    Abstract: In urban environments for delivery robots, particularly in areas such as campuses and towns, many custom features defy standard road semantic categorizations. Addressing this challenge, our paper introduces a method leveraging Salient Object Detection (SOD) to extract these unique features, employing them as pivotal factors for enhanced robot loop closure and localization. Traditional geometric fe… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: 8 pages, 9 figures, 2024 IEEE International Conference on Robotics and Automation (ICRA 2024)

  11. arXiv:2405.08142  [pdf

    cs.CL cs.CY

    Discursive objection strategies in online comments: Developing a classification schema and validating its training

    Authors: Ashley L. Shea, Aspen K. B. Omapang, Ji Yong Cho, Miryam Y. Ginsparg, Natalie Bazarova, Winice Hui, René F. Kizilcec, Chau Tong, Drew Margolin

    Abstract: Most Americans agree that misinformation, hate speech and harassment are harmful and inadequately curbed on social media through current moderation practices. In this paper, we aim to understand the discursive strategies employed by people in response to harmful speech in news comments. We conducted a content analysis of more than 6500 comment replies to trending news videos on YouTube and Twitter… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: This paper was accepted and presented at the 73rd Annual International Communication Association International Conference, May 2023

    ACM Class: I.2.7, J.4

  12. arXiv:2405.04359  [pdf, other

    cs.RO

    A Personalizable Controller for the Walking Assistive omNi-Directional Exo-Robot (WANDER)

    Authors: A. Fortuna, M. Lorenzini, M. Leonori, JM. Gandarias, P. Balatti, Y. Cho, E. De Momi, A. Ajoudani

    Abstract: Preserving and encouraging mobility in the elderly and adults with chronic conditions is of paramount importance. However, existing walking aids are either inadequate to provide sufficient support to users' stability or too bulky and poorly maneuverable to be used outside hospital environments. In addition, they all lack adaptability to individual requirements. To address these challenges, this pa… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: 6 pages, 4 figures, IEEE International Conference on Robotics and Automation (2024)

  13. arXiv:2405.03929  [pdf, other

    cs.AI physics.ao-ph

    Unicorn: U-Net for Sea Ice Forecasting with Convolutional Neural Ordinary Differential Equations

    Authors: Jaesung Park, Sungchul Hong, Yoonseo Cho, Jong-June Jeon

    Abstract: Sea ice at the North Pole is vital to global climate dynamics. However, accurately forecasting sea ice poses a significant challenge due to the intricate interaction among multiple variables. Leveraging the capability to integrate multiple inputs and powerful performances seamlessly, many studies have turned to neural networks for sea ice forecasting. This paper introduces a novel deep architectur… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

  14. arXiv:2404.18395  [pdf, other

    cs.RO

    Mesh-based Photorealistic and Real-time 3D Mapping for Robust Visual Perception of Autonomous Underwater Vehicle

    Authors: Jungwoo Lee, Younggun Cho

    Abstract: This paper proposes a photorealistic real-time dense 3D mapping system that utilizes a learning-based image enhancement method and mesh-based map representation. Due to the characteristics of the underwater environment, where problems such as hazing and low contrast occur, it is hard to apply conventional simultaneous localization and mapping (SLAM) methods. Furthermore, for sensitive tasks like i… ▽ More

    Submitted 28 April, 2024; originally announced April 2024.

    Comments: 7 pages, 15 figures, IEEE ICRA Workshop on Field Robotics 2024

  15. arXiv:2404.08611  [pdf, other

    cs.CV cs.AI physics.med-ph

    Automatic Quantification of Serial PET/CT Images for Pediatric Hodgkin Lymphoma Patients Using a Longitudinally-Aware Segmentation Network

    Authors: Xin Tie, Muheon Shin, Changhee Lee, Scott B. Perlman, Zachary Huemann, Amy J. Weisman, Sharon M. Castellino, Kara M. Kelly, Kathleen M. McCarten, Adina L. Alazraki, Junjie Hu, Steve Y. Cho, Tyler J. Bradshaw

    Abstract: $\textbf{Purpose}$: Automatic quantification of longitudinal changes in PET scans for lymphoma patients has proven challenging, as residual disease in interim-therapy scans is often subtle and difficult to detect. Our goal was to develop a longitudinally-aware segmentation network (LAS-Net) that can quantify serial PET/CT images for pediatric Hodgkin lymphoma patients. $\textbf{Materials and Metho… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

    Comments: 6 figures, 4 tables in the main text

  16. arXiv:2404.05417  [pdf, other

    cs.HC cs.AI cs.CY

    Indexing Analytics to Instances: How Integrating a Dashboard can Support Design Education

    Authors: Ajit Jain, Andruid Kerne, Nic Lupfer, Gabriel Britain, Aaron Perrine, Yoonsuck Choe, John Keyser, Ruihong Huang, Jinsil Seo, Annie Sungkajun, Robert Lightfoot, Timothy McGuire

    Abstract: We investigate how to use AI-based analytics to support design education. The analytics at hand measure multiscale design, that is, students' use of space and scale to visually and conceptually organize their design work. With the goal of making the analytics intelligible to instructors, we developed a research artifact integrating a design analytics dashboard with design instances, and the design… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

    Comments: 22 pages, 4 figures, Submitted to ACM DIS

    ACM Class: H.5.2

  17. arXiv:2404.04241  [pdf, other

    cs.RO

    Modeling Kinematic Uncertainty of Tendon-Driven Continuum Robots via Mixture Density Networks

    Authors: Jordan Thompson, Brian Y. Cho, Daniel S. Brown, Alan Kuntz

    Abstract: Tendon-driven continuum robot kinematic models are frequently computationally expensive, inaccurate due to unmodeled effects, or both. In particular, unmodeled effects produce uncertainties that arise during the robot's operation that lead to variability in the resulting geometry. We propose a novel solution to these issues through the development of a Gaussian mixture kinematic model. We train a… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

  18. arXiv:2404.03816  [pdf, other

    cs.RO

    Accounting for Hysteresis in the Forward Kinematics of Nonlinearly-Routed Tendon-Driven Continuum Robots via a Learned Deep Decoder Network

    Authors: Brian Y. Cho, Daniel S. Esser, Jordan Thompson, Bao Thach, Robert J. Webster III, Alan Kuntz

    Abstract: Tendon-driven continuum robots have been gaining popularity in medical applications due to their ability to curve around complex anatomical structures, potentially reducing the invasiveness of surgery. However, accurate modeling is required to plan and control the movements of these flexible robots. Physics-based models have limitations due to unmodeled effects, leading to mismatches between model… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

    Comments: 8 pages, 9 figures, Submitted to IEEE Robotics and Automation Letters

  19. arXiv:2404.01954  [pdf, other

    cs.CL cs.AI

    HyperCLOVA X Technical Report

    Authors: Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seongjin Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han , et al. (371 additional authors not shown)

    Abstract: We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t… ▽ More

    Submitted 13 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 44 pages; updated authors list and fixed author names

  20. arXiv:2404.00670  [pdf, other

    cs.CV q-bio.QM stat.AP

    Statistical Analysis by Semiparametric Additive Regression and LSTM-FCN Based Hierarchical Classification for Computer Vision Quantification of Parkinsonian Bradykinesia

    Authors: Youngseo Cho, In Hee Kwak, Dohyeon Kim, Jinhee Na, Hanjoo Sung, Jeongjae Lee, Young Eun Kim, Hyeo-il Ma

    Abstract: Bradykinesia, characterized by involuntary slowing or decrement of movement, is a fundamental symptom of Parkinson's Disease (PD) and is vital for its clinical diagnosis. Despite various methodologies explored to quantify bradykinesia, computer vision-based approaches have shown promising results. However, these methods often fall short in adequately addressing key bradykinesia characteristics in… ▽ More

    Submitted 31 March, 2024; originally announced April 2024.

  21. arXiv:2403.14176  [pdf, other

    cs.RO

    ReFeree: Radar-based efficient global descriptor using a Feature and Free space for Place Recognition

    Authors: Byunghee Choi, Hogyun Kim, Younggun Cho

    Abstract: Radar is highlighted for robust sensing capabilities in adverse weather conditions (e.g. dense fog, heavy rain, or snowfall). In addition, Radar can cover wide areas and penetrate small particles. Despite these advantages, Radar-based place recognition remains in the early stages compared to other sensors due to its unique characteristics such as low resolution, and significant noise. In this pape… ▽ More

    Submitted 6 May, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

    Comments: 5 pages, 4 figures

  22. arXiv:2403.10760  [pdf, other

    cs.RO

    CORN: Contact-based Object Representation for Nonprehensile Manipulation of General Unseen Objects

    Authors: Yoonyoung Cho, Junhyek Han, Yoontae Cho, Beomjoon Kim

    Abstract: Nonprehensile manipulation is essential for manipulating objects that are too thin, large, or otherwise ungraspable in the wild. To sidestep the difficulty of contact modeling in conventional modeling-based approaches, reinforcement learning (RL) has recently emerged as a promising alternative. However, previous RL approaches either lack the ability to generalize over diverse object shapes, or use… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: ICLR 2024

  23. arXiv:2403.06342  [pdf, other

    math.NA cs.LG

    Separable Physics-informed Neural Networks for Solving the BGK Model of the Boltzmann Equation

    Authors: Jaemin Oh, Seung Yeon Cho, Seok-Bae Yun, Eunbyung Park, Youngjoon Hong

    Abstract: In this study, we introduce a method based on Separable Physics-Informed Neural Networks (SPINNs) for effectively solving the BGK model of the Boltzmann equation. While the mesh-free nature of PINNs offers significant advantages in handling high-dimensional partial differential equations (PDEs), challenges arise when applying quadrature rules for accurate integral evaluation in the BGK operator, w… ▽ More

    Submitted 10 March, 2024; originally announced March 2024.

    MSC Class: 68T20; 35R09

  24. arXiv:2403.05861  [pdf, ps, other

    cs.DC

    DeepVM: Integrating Spot and On-Demand VMs for Cost-Efficient Deep Learning Clusters in the Cloud

    Authors: Yoochan Kim, Kihyun Kim, Yonghyeon Cho, Jinwoo Kim, Awais Khan, Ki-Dong Kang, Baik-Song An, Myung-Hoon Cha, Hong-Yeon Kim, Youngjae Kim

    Abstract: Distributed Deep Learning (DDL), as a paradigm, dictates the use of GPU-based clusters as the optimal infrastructure for training large-scale Deep Neural Networks (DNNs). However, the high cost of such resources makes them inaccessible to many users. Public cloud services, particularly Spot Virtual Machines (VMs), offer a cost-effective alternative, but their unpredictable availability poses a sig… ▽ More

    Submitted 14 March, 2024; v1 submitted 9 March, 2024; originally announced March 2024.

    Comments: 14 pages, 8 figures

  25. arXiv:2403.02870  [pdf, other

    cs.AI cs.CR cs.LG

    Precise Extraction of Deep Learning Models via Side-Channel Attacks on Edge/Endpoint Devices

    Authors: Younghan Lee, Sohee Jun, Yungi Cho, Woorim Han, Hyungon Moon, Yunheung Paek

    Abstract: With growing popularity, deep learning (DL) models are becoming larger-scale, and only the companies with vast training datasets and immense computing power can manage their business serving such large models. Most of those DL models are proprietary to the companies who thus strive to keep their private models safe from the model extraction attack (MEA), whose aim is to steal the model by training… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: Accepted by 27th European Symposium on Research in Computer Security (ESORICS 2022)

  26. arXiv:2403.02846  [pdf, other

    cs.LG cs.AI cs.CR cs.DC

    FLGuard: Byzantine-Robust Federated Learning via Ensemble of Contrastive Models

    Authors: Younghan Lee, Yungi Cho, Woorim Han, Ho Bae, Yunheung Paek

    Abstract: Federated Learning (FL) thrives in training a global model with numerous clients by only sharing the parameters of their local models trained with their private training datasets. Therefore, without revealing the private dataset, the clients can obtain a deep learning (DL) model with high performance. However, recent research proposed poisoning attacks that cause a catastrophic loss in the accurac… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: Accepted by 28th European Symposium on Research in Computer Security (ESORICS 2023)

  27. arXiv:2402.11477  [pdf, other

    cs.CY

    Studying Differential Mental Health Expressions in India

    Authors: Khushi Shelat, Sunny Rai, Devansh R Jain, Kishen Sivabalan, Young Min Cho, Maitreyi Redkar, Samindara Sawant, Sharath Chandra Guntuku

    Abstract: Psychosocial stressors and the symptomatology of mental disorders vary across cultures. However, current understandings of mental health expressions on social media are predominantly derived from studies in WEIRD (Western, Educated, Industrialized, Rich, and Democratic) contexts. In this paper, we analyze mental health posts on Reddit made by individuals in India, to identify variations in online… ▽ More

    Submitted 16 June, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

  28. arXiv:2402.09698  [pdf, other

    stat.ME cs.LG math.PR math.ST stat.ML

    Combining Evidence Across Filtrations Using Adjusters

    Authors: Yo Joong Choe, Aaditya Ramdas

    Abstract: In anytime-valid sequential inference, it is known that any admissible procedure must be based on e-processes, which are composite generalizations of test martingales that quantify the accumulated evidence against a composite null hypothesis at any arbitrary stopping time. This paper studies methods for combining e-processes constructed using different information sets (filtrations) for the same n… ▽ More

    Submitted 28 May, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

    Comments: Substantially revised with new results in Sections 5 and 6. Code is available at https://github.com/yjchoe/CombiningEvidenceAcrossFiltrations

  29. arXiv:2402.08966  [pdf, other

    cs.CV cs.CL

    Pretraining Vision-Language Model for Difference Visual Question Answering in Longitudinal Chest X-rays

    Authors: Yeongjae Cho, Taehee Kim, Heejun Shin, Sungzoon Cho, Dongmyung Shin

    Abstract: Difference visual question answering (diff-VQA) is a challenging task that requires answering complex questions based on differences between a pair of images. This task is particularly important in reading chest X-ray images because radiologists often compare multiple images of the same patient taken at different times to track disease progression and changes in its severity in their clinical prac… ▽ More

    Submitted 17 June, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

  30. arXiv:2401.16437  [pdf, other

    physics.ao-ph cs.LG

    A Benchmark Dataset for Tornado Detection and Prediction using Full-Resolution Polarimetric Weather Radar Data

    Authors: Mark S. Veillette, James M. Kurdzo, Phillip M. Stepanian, John Y. N. Cho, Siddharth Samsi, Joseph McDonald

    Abstract: Weather radar is the primary tool used by forecasters to detect and warn for tornadoes in near-real time. In order to assist forecasters in warning the public, several algorithms have been developed to automatically detect tornadic signatures in weather radar observations. Recently, Machine Learning (ML) algorithms, which learn directly from large amounts of labeled data, have been shown to be hig… ▽ More

    Submitted 26 January, 2024; originally announced January 2024.

    Comments: 37 pages, 15 Figures, 2 Tables

  31. arXiv:2401.13087  [pdf, other

    cs.CV stat.AP

    Open-source data pipeline for street-view images: a case study on community mobility during COVID-19 pandemic

    Authors: Matthew Martell, Nick Terry, Ribhu Sengupta, Chris Salazar, Nicole A. Errett, Scott B. Miles, Joseph Wartman, Youngjun Choe

    Abstract: Street View Images (SVI) are a common source of valuable data for researchers. Researchers have used SVI data for estimating pedestrian volumes, demographic surveillance, and to better understand built and natural environments in cityscapes. However, the most common source of publicly available SVI data is Google Street View. Google Street View images are collected infrequently, making temporal an… ▽ More

    Submitted 23 January, 2024; originally announced January 2024.

    Comments: 16 pages, 4 figures, two tables. Martell and Terry are equally contributing first authors

  32. arXiv:2401.06799  [pdf, other

    cs.CL cs.LG

    Make Prompts Adaptable: Bayesian Modeling for Vision-Language Prompt Learning with Data-Dependent Prior

    Authors: Youngjae Cho, HeeSun Bae, Seungjae Shin, Yeo Dong Youn, Weonyoung Joo, Il-Chul Moon

    Abstract: Recent Vision-Language Pretrained (VLP) models have become the backbone for many downstream tasks, but they are utilized as frozen model without learning. Prompt learning is a method to improve the pre-trained VLP model by adding a learnable context vector to the inputs of the text encoder. In a few-shot learning scenario of the downstream task, MLE training can lead the context vector to over-fit… ▽ More

    Submitted 9 January, 2024; originally announced January 2024.

    Comments: Accepted to AAAI-2024

  33. arXiv:2401.06432  [pdf, other

    cs.LG cs.DC

    Heterogeneous LoRA for Federated Fine-tuning of On-Device Foundation Models

    Authors: Yae Jee Cho, Luyang Liu, Zheng Xu, Aldi Fahrezi, Gauri Joshi

    Abstract: Foundation models (FMs) adapt well to specific domains or tasks with fine-tuning, and federated learning (FL) enables the potential for privacy-preserving fine-tuning of the FMs with on-device local data. For federated fine-tuning of FMs, we consider the FMs with small to medium parameter sizes of single digit billion at maximum, referred to as on-device FMs (ODFMs) that can be deployed on devices… ▽ More

    Submitted 20 February, 2024; v1 submitted 12 January, 2024; originally announced January 2024.

  34. arXiv:2401.06400  [pdf, other

    cs.CL cs.CV

    Generalizing Visual Question Answering from Synthetic to Human-Written Questions via a Chain of QA with a Large Language Model

    Authors: Taehee Kim, Yeongjae Cho, Heejun Shin, Yohan Jo, Dongmyung Shin

    Abstract: Visual question answering (VQA) is a task where an image is given, and a series of questions are asked about the image. To build an efficient VQA algorithm, a large amount of QA data is required which is very expensive. Generating synthetic QA pairs based on templates is a practical way to obtain data. However, VQA models trained on those data do not perform well on complex, human-written question… ▽ More

    Submitted 16 January, 2024; v1 submitted 12 January, 2024; originally announced January 2024.

  35. arXiv:2401.05254  [pdf, other

    cs.CY cs.CL

    Language-based Valence and Arousal Expressions between the United States and China: a Cross-Cultural Examination

    Authors: Young-Min Cho, Dandan Pang, Stuti Thapa, Garrick Sherman, Lyle Ungar, Louis Tay, Sharath Chandra Guntuku

    Abstract: Although affective expressions of individuals have been extensively studied using social media, research has primarily focused on the Western context. There are substantial differences among cultures that contribute to their affective expressions. This paper examines the differences between Twitter (X) in the United States and Sina Weibo posts in China on two primary dimensions of affect - valence… ▽ More

    Submitted 11 January, 2024; v1 submitted 10 January, 2024; originally announced January 2024.

  36. arXiv:2401.04139  [pdf

    cs.LG

    CCNETS: A Novel Brain-Inspired Approach for Enhanced Pattern Recognition in Imbalanced Datasets

    Authors: Hanbeot Park, Yunjeong Cho, Hoon-Hee Kim

    Abstract: This study introduces CCNETS (Causal Learning with Causal Cooperative Nets), a novel generative model-based classifier designed to tackle the challenge of generating data for imbalanced datasets in pattern recognition. CCNETS is uniquely crafted to emulate brain-like information processing and comprises three main components: Explainer, Producer, and Reasoner. Each component is designed to mimic s… ▽ More

    Submitted 25 January, 2024; v1 submitted 7 January, 2024; originally announced January 2024.

    Comments: 31 pages, authors (3) is Corresponding Author

  37. arXiv:2401.02710  [pdf, other

    cs.CE cs.AI

    Synergistic Formulaic Alpha Generation for Quantitative Trading based on Reinforcement Learning

    Authors: Hong-Gi Shin, Sukhyun Jeong, Eui-Yeon Kim, Sungho Hong, Young-Jin Cho, Yong-Hoon Choi

    Abstract: Mining of formulaic alpha factors refers to the process of discovering and developing specific factors or indicators (referred to as alpha factors) for quantitative trading in stock market. To efficiently discover alpha factors in vast search space, reinforcement learning (RL) is commonly employed. This paper proposes a method to enhance existing alpha factor mining approaches by expanding a searc… ▽ More

    Submitted 5 January, 2024; originally announced January 2024.

    Comments: Accepted by ICOIN 2024

  38. arXiv:2312.12488  [pdf, other

    cs.LG cs.CR cs.CV

    Foreseeing Reconstruction Quality of Gradient Inversion: An Optimization Perspective

    Authors: HyeongGwon Hong, Yooshin Cho, Hanbyel Cho, Jaesung Ahn, Junmo Kim

    Abstract: Gradient inversion attacks can leak data privacy when clients share weight updates with the server in federated learning (FL). Existing studies mainly use L2 or cosine distance as the loss function for gradient matching in the attack. Our empirical investigation shows that the vulnerability ranking varies with the loss function used. Gradient norm, which is commonly used as a vulnerability proxy f… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

    Comments: To appear in AAAI 2024

  39. arXiv:2312.04005  [pdf, other

    cs.CV cs.AI

    KOALA: Empirical Lessons Toward Memory-Efficient and Fast Diffusion Models for Text-to-Image Synthesis

    Authors: Youngwan Lee, Kwanyong Park, Yoorhim Cho, Yong-Ju Lee, Sung Ju Hwang

    Abstract: As text-to-image (T2I) synthesis models increase in size, they demand higher inference costs due to the need for more expensive GPUs with larger memory, which makes it challenging to reproduce these models in addition to the restricted access to training datasets. Our study aims to reduce these inference costs and explores how far the generative capabilities of T2I models can be extended using onl… ▽ More

    Submitted 28 May, 2024; v1 submitted 6 December, 2023; originally announced December 2023.

    Comments: Project page: https://youngwanlee.github.io/KOALA/

  40. arXiv:2311.18270  [pdf, other

    cs.CV

    Beyond Entropy: Style Transfer Guided Single Image Continual Test-Time Adaptation

    Authors: Younggeol Cho, Youngrae Kim, Dongman Lee

    Abstract: Continual test-time adaptation (cTTA) methods are designed to facilitate the continual adaptation of models to dynamically changing real-world environments where computational resources are limited. Due to this inherent limitation, existing approaches fail to simultaneously achieve accuracy and efficiency. In detail, when using a single image, the instability caused by batch normalization layers a… ▽ More

    Submitted 30 November, 2023; originally announced November 2023.

    Comments: 17 pages, 11 figures

  41. arXiv:2311.13934  [pdf, other

    cs.CV

    Robustness-Reinforced Knowledge Distillation with Correlation Distance and Network Pruning

    Authors: Seonghak Kim, Gyeongdo Ham, Yucheol Cho, Daeshik Kim

    Abstract: The improvement in the performance of efficient and lightweight models (i.e., the student model) is achieved through knowledge distillation (KD), which involves transferring knowledge from more complex models (i.e., the teacher model). However, most existing KD techniques rely on Kullback-Leibler (KL) divergence, which has certain limitations. First, if the teacher distribution has high entropy, t… ▽ More

    Submitted 23 November, 2023; originally announced November 2023.

    Comments: 11 pages, 7 figures. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  42. arXiv:2311.09243  [pdf, ps, other

    cs.HC cs.AI

    Evaluating the Efficacy of Interactive Language Therapy Based on LLM for High-Functioning Autistic Adolescent Psychological Counseling

    Authors: Yujin Cho, Mingeon Kim, Seojin Kim, Oyun Kwon, Ryan Donghan Kwon, Yoonha Lee, Dohyun Lim

    Abstract: This study investigates the efficacy of Large Language Models (LLMs) in interactive language therapy for high-functioning autistic adolescents. With the rapid advancement of artificial intelligence, particularly in natural language processing, LLMs present a novel opportunity to augment traditional psychological counseling methods. This research primarily focuses on evaluating the LLM's ability to… ▽ More

    Submitted 12 November, 2023; originally announced November 2023.

  43. arXiv:2311.03658  [pdf, other

    cs.CL cs.AI cs.LG stat.ML

    The Linear Representation Hypothesis and the Geometry of Large Language Models

    Authors: Kiho Park, Yo Joong Choe, Victor Veitch

    Abstract: Informally, the 'linear representation hypothesis' is the idea that high-level concepts are represented linearly as directions in some representation space. In this paper, we address two closely related questions: What does "linear representation" actually mean? And, how do we make sense of geometric notions (e.g., cosine similarity or projection) in the representation space? To answer these, we u… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: Accepted for an oral presentation at NeurIPS 2023 Workshop on Causal Representation Learning. Code is available at https://github.com/KihoPark/linear_rep_geometry

  44. arXiv:2311.01908  [pdf, other

    eess.IV cs.CV

    LLM-driven Multimodal Target Volume Contouring in Radiation Oncology

    Authors: Yujin Oh, Sangjoon Park, Hwa Kyung Byun, Yeona Cho, Ik Jae Lee, Jin Sung Kim, Jong Chul Ye

    Abstract: Target volume contouring for radiation therapy is considered significantly more challenging than the normal organ segmentation tasks as it necessitates the utilization of both image and text-based clinical information. Inspired by the recent advancement of large language models (LLMs) that can facilitate the integration of the textural information and images, here we present a novel LLM-driven mul… ▽ More

    Submitted 15 April, 2024; v1 submitted 3 November, 2023; originally announced November 2023.

  45. arXiv:2310.17017  [pdf, other

    cs.CL cs.AI

    An Integrative Survey on Mental Health Conversational Agents to Bridge Computer Science and Medical Perspectives

    Authors: Young Min Cho, Sunny Rai, Lyle Ungar, João Sedoc, Sharath Chandra Guntuku

    Abstract: Mental health conversational agents (a.k.a. chatbots) are widely studied for their potential to offer accessible support to those experiencing mental health challenges. Previous surveys on the topic primarily consider papers published in either computer science or medicine, leading to a divide in understanding and hindering the sharing of beneficial knowledge between both domains. To bridge this g… ▽ More

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: Accepted in EMNLP 2023 Main Conference, camera ready

  46. arXiv:2310.14347  [pdf

    cs.HC

    Self-Assistant: Portable Progressive Muscle Relaxation Training Interface for Anxiety Reduction in Office Workers

    Authors: Lingqian Yang, Katherine Wang, Youngjun Cho

    Abstract: Workload often triggers anxiety for office workers. While a variety of stress intervention and management techniques have been explored, there exist only a few of portable tangible interfaces for anxiety reduction. Contributing to the body of work, we introduce Self-Assistant, a portable anxiety intervention training interface. This is based on progressive muscle relaxation training which is an ef… ▽ More

    Submitted 22 October, 2023; originally announced October 2023.

    Comments: Short Technical Report: Best student-led project in COMP0145: Research Methods and Making Skills (2022/23)

  47. arXiv:2310.14343  [pdf

    cs.HC

    Exploring Artistic Visualization of Physiological Signals for Mindfulness and Relaxation: A Pilot Study

    Authors: Zihan Xu, Youngjun Cho

    Abstract: Mindfulness and relaxation techniques for mental health are increasingly being explored in the human-computer interaction community. Physiological signals and their visualization have often been exploited together in a form of biofeedback with other intervention methods. Here, we aim to contribute to the body of existing work on biofeedback interfaces for mindfulness, with a particular focus on in… ▽ More

    Submitted 22 October, 2023; originally announced October 2023.

  48. arXiv:2310.14342  [pdf

    cs.HC cs.CY

    PulmoBell: Home-based Pulmonary Rehabilitation Assistive Technology for People with COPD

    Authors: Yuanxiang Ma, Andreas Polydorides, Jitesh Joshi, Youngjun Cho

    Abstract: Chronic Obstructive Pulmonary Disease (COPD) can be fatal and is challenging to live with due to its severe symptoms. Pulmonary rehabilitation (PR) is one of the managements means to maintain COPD in a stable status. However, implementation of PR in the UK has been challenging due to the environmental and personal barriers faced by patients, which hinder their uptake, adherence, and completion of… ▽ More

    Submitted 22 October, 2023; originally announced October 2023.

    Comments: Short Technical Report: Best student-led project in COMP0145: Research Methods and Making Skills (2022/23)

  49. arXiv:2310.13895  [pdf, other

    cs.CL cs.LG

    RTSUM: Relation Triple-based Interpretable Summarization with Multi-level Salience Visualization

    Authors: Seonglae Cho, Yonggi Cho, HoonJae Lee, Myungha Jang, Jinyoung Yeo, Dongha Lee

    Abstract: In this paper, we present RTSUM, an unsupervised summarization framework that utilizes relation triples as the basic unit for summarization. Given an input document, RTSUM first selects salient relation triples via multi-level salience scoring and then generates a concise summary from the selected relation triples by using a text-to-text language model. On the basis of RTSUM, we also develop a web… ▽ More

    Submitted 25 March, 2024; v1 submitted 20 October, 2023; originally announced October 2023.

    Comments: 8 pages, 2 figures

  50. arXiv:2310.09614  [pdf, other

    cs.HC

    Bridging the Divide: Unraveling the Knowledge Gap in Data Visualization Research and Practice

    Authors: Nam Wook Kim, Grace Myers, Jinhan Choi, Yoonsuh Cho, Changhoon Oh, Yea-Seul Kim

    Abstract: Empirical research on perception and cognition has laid the foundation for visualization design, often yielding useful design guidelines for practitioners. However, it remains uncertain how well practitioners stay informed about such crucial visualization design knowledge. In this paper, we employed a mixed-method approach to explore the knowledge gap between visualization research and real-world… ▽ More

    Submitted 30 January, 2024; v1 submitted 14 October, 2023; originally announced October 2023.

    Comments: 15 pages, 5 figures