Showing 1–50 of 293 results for author: Kim, E

Searching in archive cs.
  1. arXiv:2408.14611  [pdf]

    cs.DC cs.DB

    Scalable, reproducible, and cost-effective processing of large-scale medical imaging datasets

    Authors: Michael E. Kim, Karthik Ramadass, Chenyu Gao, Praitayini Kanakaraj, Nancy R. Newlin, Gaurav Rudravaram, Kurt G. Schilling, Blake E. Dewey, Derek Archer, Timothy J. Hohman, Zhiyuan Li, Shunxing Bao, Bennett A. Landman, Nazirah Mohd Khairi

    Abstract: Curating, processing, and combining large-scale medical imaging datasets from national studies is a non-trivial task due to the intense computation and data throughput required, variability of acquired data, and associated financial overhead. Existing platforms or tools for large-scale data curation, processing, and storage have difficulty achieving a viable cost-to-scale ratio of computation spee…

    Submitted 26 August, 2024; originally announced August 2024.

  2. arXiv:2408.12692  [pdf, other]

    cs.AI

    Unlocking Intrinsic Fairness in Stable Diffusion

    Authors: Eunji Kim, Siwon Kim, Rahim Entezari, Sungroh Yoon

    Abstract: Recent text-to-image models like Stable Diffusion produce photo-realistic images but often show demographic biases. Previous debiasing methods focused on training-based approaches, failing to explore the root causes of bias and overlooking Stable Diffusion's potential for unbiased image generation. In this paper, we demonstrate that Stable Diffusion inherently possesses fairness, which can be unlo…

    Submitted 22 August, 2024; originally announced August 2024.

    Comments: 21 pages, 20 figures; First two authors contributed equally

  3. arXiv:2408.06010  [pdf, other]

    cs.CV

    DEEPTalk: Dynamic Emotion Embedding for Probabilistic Speech-Driven 3D Face Animation

    Authors: Jisoo Kim, Jungbin Cho, Joonho Park, Soonmin Hwang, Da Eun Kim, Geon Kim, Youngjae Yu

    Abstract: Speech-driven 3D facial animation has garnered lots of attention thanks to its broad range of applications. Despite recent advancements in achieving realistic lip motion, current methods fail to capture the nuanced emotional undertones conveyed through speech and produce monotonous facial motion. These limitations result in blunt and repetitive facial animations, reducing user engagement and hinde…

    Submitted 12 August, 2024; originally announced August 2024.

    Comments: First two authors contributed equally

  4. arXiv:2408.03541  [pdf, ps, other]

    cs.CL cs.AI

    EXAONE 3.0 7.8B Instruction Tuned Language Model

    Authors: LG AI Research, :, Soyoung An, Kyunghoon Bae, Eunbi Choi, Stanley Jungkyu Choi, Yemuk Choi, Seokhee Hong, Yeonjung Hong, Junwon Hwang, Hyojin Jeon, Gerrard Jeongwon Jo, Hyunjik Jo, Jiyeon Jung, Yountae Jung, Euisoon Kim, Hyosang Kim, Joonkee Kim, Seonghwan Kim, Soyeon Kim, Sunkyoung Kim, Yireun Kim, Youchul Kim, Edward Hwayoung Lee, Haeju Lee , et al. (14 additional authors not shown)

    Abstract: We introduce EXAONE 3.0 instruction-tuned language model, the first open model in the family of Large Language Models (LLMs) developed by LG AI Research. Among different model sizes, we publicly release the 7.8B instruction-tuned model to promote open research and innovations. Through extensive evaluations across a wide range of public and in-house benchmarks, EXAONE 3.0 demonstrates highly compet…

    Submitted 13 August, 2024; v1 submitted 7 August, 2024; originally announced August 2024.

  5. arXiv:2407.20542  [pdf, other]

    cs.CV cs.HC

    HandDAGT: A Denoising Adaptive Graph Transformer for 3D Hand Pose Estimation

    Authors: Wencan Cheng, Eunji Kim, Jong Hwan Ko

    Abstract: The extraction of keypoint positions from input hand frames, known as 3D hand pose estimation, is crucial for various human-computer interaction applications. However, current approaches often struggle with the dynamic nature of self-occlusion of hands and intra-occlusion with interacting objects. To address this challenge, this paper proposes the Denoising Adaptive Graph Transformer, HandDAGT, fo…

    Submitted 30 July, 2024; originally announced July 2024.

    Comments: Accepted as a conference paper to European Conference on Computer Vision (ECCV) 2024

  6. arXiv:2407.15554  [pdf, other]

    cs.CV

    Decomposition of Neural Discrete Representations for Large-Scale 3D Mapping

    Authors: Minseong Park, Suhan Woo, Euntai Kim

    Abstract: Learning efficient representations of local features is a key challenge in feature volume-based 3D neural mapping, especially in large-scale environments. In this paper, we introduce Decomposition-based Neural Mapping (DNMap), a storage-efficient large-scale 3D mapping method that employs a discrete representation based on a decomposition strategy. This decomposition strategy aims to efficiently c…

    Submitted 22 July, 2024; originally announced July 2024.

    Comments: ECCV 2024

  7. arXiv:2407.13280  [pdf, other]

    cs.SE cs.DB

    AI-Assisted SQL Authoring at Industry Scale

    Authors: Chandra Maddila, Negar Ghorbani, Kosay Jabre, Vijayaraghavan Murali, Edwin Kim, Parth Thakkar, Nikolay Pavlovich Laptev, Olivia Harman, Diana Hsu, Rui Abreu, Peter C. Rigby

    Abstract: SqlCompose brings generative AI into the data analytics domain. SQL is declarative, has formal table schemas, and is often written in a non-linear manner. We address each of these challenges and develop a set of models that shows the importance of each problem. We first develop an internal SQL benchmark to perform offline tests at Meta. We evaluate how well the Public Llama model performs. We atta…

    Submitted 19 July, 2024; v1 submitted 18 July, 2024; originally announced July 2024.

    Comments: 11 pages

  8. arXiv:2407.12982  [pdf, other]

    cs.LG cs.CL cs.IR

    Retrieval-Enhanced Machine Learning: Synthesis and Opportunities

    Authors: To Eun Kim, Alireza Salemi, Andrew Drozdov, Fernando Diaz, Hamed Zamani

    Abstract: In the field of language modeling, models augmented with retrieval components have emerged as a promising solution to address several challenges faced in the natural language processing (NLP) field, including knowledge grounding, interpretability, and scalability. Despite the primary focus on NLP, we posit that the paradigm of retrieval-enhancement can be extended to a broader spectrum of machine…

    Submitted 17 July, 2024; originally announced July 2024.

  9. arXiv:2407.09578  [pdf, other]

    cs.CV cs.LG

    Unsupervised Anomaly Detection Using Diffusion Trend Analysis

    Authors: Eunwoo Kim, Un Yang, Cheol Lae Roh, Stefano Ermon

    Abstract: Conventional anomaly detection techniques based on reconstruction via denoising diffusion model are widely used due to their ability to identify anomaly locations and shapes with high performance. However, there is a limitation in determining appropriate noise parameters that can degrade anomalies while preserving normal characteristics. Also, due to the volatility of the diffusion model, normal r…

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: 4 pages, 4 figures, 1 table

    MSC Class: 68T45 (Primary) 68T27 (Secondary) ACM Class: I.2.10

  10. arXiv:2407.06206  [pdf, other]

    cs.LG cs.AI cs.CV eess.IV

    The Impact of an XAI-Augmented Approach on Binary Classification with Scarce Data

    Authors: Ximing Wen, Rosina O. Weber, Anik Sen, Darryl Hannan, Steven C. Nesbit, Vincent Chan, Alberto Goffi, Michael Morris, John C. Hunninghake, Nicholas E. Villalobos, Edward Kim, Christopher J. MacLellan

    Abstract: Point-of-Care Ultrasound (POCUS) is the practice of clinicians conducting and interpreting ultrasound scans right at the patient's bedside. However, the expertise needed to interpret these images is considerable and may not always be present in emergency situations. This reality makes algorithms such as machine learning classifiers extremely valuable to augment human decisions. POCUS devices are b…

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: 7 pages, 3 figures, accepted by XAI 2024 workshop @ IJCAI

  11. arXiv:2406.19648  [pdf]

    cs.HC cs.AI cs.CL

    Designing and Evaluating Multi-Chatbot Interface for Human-AI Communication: Preliminary Findings from a Persuasion Task

    Authors: Sion Yoon, Tae Eun Kim, Yoo Jung Oh

    Abstract: The dynamics of human-AI communication have been reshaped by language models such as ChatGPT. However, extant research has primarily focused on dyadic communication, leaving much to be explored regarding the dynamics of human-AI communication in group settings. The availability of multiple language model chatbots presents a unique opportunity for scholars to better understand the interaction betwe…

    Submitted 28 June, 2024; originally announced June 2024.

  12. arXiv:2406.14706  [pdf]

    cs.ET cs.AR

    SWANN: Shuffling Weights in Crossbar Arrays for Enhanced DNN Accuracy in Deeply Scaled Technologies

    Authors: Jeffry Victor, Dong Eun Kim, Chunguang Wang, Kaushik Roy, Sumeet Gupta

    Abstract: Deep neural network (DNN) accelerators employing crossbar arrays capable of in-memory computing (IMC) are highly promising for neural computing platforms. However, in deeply scaled technologies, interconnect resistance severely impairs IMC robustness, leading to a drop in the system accuracy. To address this problem, we propose SWANN - a technique based on shuffling weights in crossbar arrays whic…

    Submitted 20 June, 2024; originally announced June 2024.

  13. arXiv:2406.12254  [pdf, other]

    eess.IV cs.CV

    Enhancing Single-Slice Segmentation with 3D-to-2D Unpaired Scan Distillation

    Authors: Xin Yu, Qi Yang, Han Liu, Ho Hin Lee, Yucheng Tang, Lucas W. Remedios, Michael E. Kim, Rendong Zhang, Shunxing Bao, Yuankai Huo, Ann Zenobia Moore, Luigi Ferrucci, Bennett A. Landman

    Abstract: 2D single-slice abdominal computed tomography (CT) enables the assessment of body habitus and organ health with low radiation exposure. However, single-slice data necessitates the use of 2D networks for segmentation, but these networks often struggle to capture contextual information effectively. Consequently, even when trained on identical datasets, 3D networks typically achieve superior segmenta…

    Submitted 12 July, 2024; v1 submitted 18 June, 2024; originally announced June 2024.

  14. arXiv:2406.12202  [pdf, other]

    cs.RO

    Fast Global Localization on Neural Radiance Field

    Authors: Mangyu Kong, Seongwon Lee, Jaewon Lee, Euntai Kim

    Abstract: Neural Radiance Fields (NeRF) presented a novel way to represent scenes, allowing for high-quality 3D reconstruction from 2D images. Following its remarkable achievements, global localization within NeRF maps is an essential task for enabling a wide range of applications. Recently, Loc-NeRF demonstrated a localization approach that combines traditional Monte Carlo Localization with NeRF, showing p…

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: Preprint, Under review

  15. arXiv:2406.11210  [pdf, other]

    cs.CV

    Zero-Shot Scene Change Detection

    Authors: Kyusik Cho, Dong Yeop Kim, Euntai Kim

    Abstract: We present a novel, training-free approach to scene change detection. Our method leverages tracking models, which inherently perform change detection between consecutive frames of video by identifying common objects and detecting new or missing objects. Specifically, our method takes advantage of the change detection effect of the tracking model by inputting reference and query images instead of c…

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: Preprint. Under review

  16. arXiv:2406.09948  [pdf, other]

    cs.CL

    BLEnD: A Benchmark for LLMs on Everyday Knowledge in Diverse Cultures and Languages

    Authors: Junho Myung, Nayeon Lee, Yi Zhou, Jiho Jin, Rifki Afina Putri, Dimosthenis Antypas, Hsuvas Borkakoty, Eunsu Kim, Carla Perez-Almendros, Abinew Ali Ayele, Víctor Gutiérrez-Basulto, Yazmín Ibáñez-García, Hwaran Lee, Shamsuddeen Hassan Muhammad, Kiwoong Park, Anar Sabuhi Rzayev, Nina White, Seid Muhie Yimam, Mohammad Taher Pilehvar, Nedjma Ousidhoum, Jose Camacho-Collados, Alice Oh

    Abstract: Large language models (LLMs) often lack culture-specific knowledge of daily life, especially across diverse regions and non-English languages. Existing benchmarks for evaluating LLMs' cultural sensitivities are limited to a single language or collected from online sources such as Wikipedia, which do not reflect the mundane everyday lifestyles of diverse regions. That is, information about the food…

    Submitted 14 June, 2024; originally announced June 2024.

  17. arXiv:2406.07909  [pdf, other]

    eess.AS cs.CL cs.SD stat.ML

    Guiding Frame-Level CTC Alignments Using Self-knowledge Distillation

    Authors: Eungbeom Kim, Hantae Kim, Kyogu Lee

    Abstract: Transformer encoder with connectionist temporal classification (CTC) framework is widely used for automatic speech recognition (ASR). However, knowledge distillation (KD) for ASR displays a problem of disagreement between teacher-student models in frame-level alignment which ultimately hinders it from improving the student model's performance. In order to resolve this problem, this paper introduce…

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: Accepted by Interspeech 2024

  18. arXiv:2406.05963  [pdf, other]

    cs.CV cs.AI

    Solution for SMART-101 Challenge of CVPR Multi-modal Algorithmic Reasoning Task 2024

    Authors: Jinwoo Ahn, Junhyeok Park, Min-Jun Kim, Kang-Hyeon Kim, So-Yeong Sohn, Yun-Ji Lee, Du-Seong Chang, Yu-Jung Heo, Eun-Sol Kim

    Abstract: In this paper, the solution of HYU MLLAB KT Team to the Multimodal Algorithmic Reasoning Task: SMART-101 CVPR 2024 Challenge is presented. Beyond conventional visual question-answering problems, the SMART-101 challenge aims to achieve human-level multimodal understanding by tackling complex visio-linguistic puzzles designed for children in the 6-8 age group. To solve this problem, we suggest two m…

    Submitted 9 June, 2024; originally announced June 2024.

  19. arXiv:2406.05295  [pdf, other]

    cs.LG cs.CV

    Information Geometry of Evolution of Neural Network Parameters While Training

    Authors: Abhiram Anand Thiruthummal, Eun-jin Kim, Sergiy Shelyag

    Abstract: Artificial neural networks (ANNs) are powerful tools capable of approximating any arbitrary mathematical function, but their interpretability remains limited, rendering them as black box models. To address this issue, numerous methods have been proposed to enhance the explainability and interpretability of ANNs. In this study, we introduce the application of information geometric framework to inve…

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: Accepted for publication in Neurocomputing journal

  20. arXiv:2406.05290  [pdf, other]

    cs.LG cs.CE math.NA physics.comp-ph

    Extremization to Fine Tune Physics Informed Neural Networks for Solving Boundary Value Problems

    Authors: Abhiram Anand Thiruthummal, Sergiy Shelyag, Eun-jin Kim

    Abstract: We propose a novel method for fast and accurate training of physics-informed neural networks (PINNs) to find solutions to boundary value problems (BVPs) and initial boundary value problems (IBVPs). By combining the methods of training deep neural networks (DNNs) and Extreme Learning Machines (ELMs), we develop a model which has the expressivity of DNNs with the fine-tuning ability of ELMs. We show…

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: Accepted for publication in CNSNS

  21. arXiv:2405.07267  [pdf, other]

    cs.HC

    Fields, Bridges, and Foundations: How Researchers Browse Citation Network Visualizations

    Authors: Kiroong Choe, Eunhye Kim, Sangwon Park, Jinwook Seo

    Abstract: Visualizing citation relations with network structures is widely used, but the visual complexity can make it challenging for individual researchers to navigate through them. We collected data from 18 researchers using an interface that we designed using network simplification methods and analyzed how users browsed and identified important papers. Our analysis reveals six major patterns used for id…

    Submitted 12 May, 2024; originally announced May 2024.

  22. arXiv:2405.04093  [pdf, other]

    cs.CV cs.AI

    DCNN: Dual Cross-current Neural Networks Realized Using An Interactive Deep Learning Discriminator for Fine-grained Objects

    Authors: Da Fu, Mingfei Rong, Eun-Hu Kim, Hao Huang, Witold Pedrycz

    Abstract: Accurate classification of fine-grained images remains a challenge in backbones based on convolutional operations or self-attention mechanisms. This study proposes novel dual-current neural networks (DCNN), which combine the advantages of convolutional operations and self-attention mechanisms to improve the accuracy of fine-grained image classification. The main novel design features for construct…

    Submitted 7 May, 2024; originally announced May 2024.

  23. arXiv:2404.16397  [pdf, other]

    eess.IV cs.CV cs.LG q-bio.QM

    Deep Learning-based Prediction of Breast Cancer Tumor and Immune Phenotypes from Histopathology

    Authors: Tiago Gonçalves, Dagoberto Pulido-Arias, Julian Willett, Katharina V. Hoebel, Mason Cleveland, Syed Rakin Ahmed, Elizabeth Gerstner, Jayashree Kalpathy-Cramer, Jaime S. Cardoso, Christopher P. Bridge, Albert E. Kim

    Abstract: The interactions between tumor cells and the tumor microenvironment (TME) dictate therapeutic efficacy of radiation and many systemic therapies in breast cancer. However, to date, there is not a widely available method to reproducibly measure tumor and immune phenotypes for each patient's tumor. Given this unmet clinical need, we applied multiple instance learning (MIL) algorithms to assess activi…

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: Paper accepted at the First Workshop on Imageomics (Imageomics-AAAI-24) - Discovering Biological Knowledge from Images using AI (https://sites.google.com/vt.edu/imageomics-aaai-24/home), held as part of the 38th Annual AAAI Conference on Artificial Intelligence (https://aaai.org/aaai-conference/)

    MSC Class: 92C55 ACM Class: I.5.1; I.5.4; I.2.10; J.3

  24. arXiv:2404.14687  [pdf, other]

    cs.MM cs.AI cs.CL cs.CV

    Pegasus-v1 Technical Report

    Authors: Raehyuk Jung, Hyojun Go, Jaehyuk Yi, Jiho Jang, Daniel Kim, Jay Suh, Aiden Lee, Cooper Han, Jae Lee, Jeff Kim, Jin-Young Kim, Junwan Kim, Kyle Park, Lucas Lee, Mars Ha, Minjoon Seo, Abraham Jo, Ed Park, Hassan Kianinejad, SJ Kim, Tony Moon, Wade Jeong, Andrei Popescu, Esther Kim, EK Yoon , et al. (19 additional authors not shown)

    Abstract: This technical report introduces Pegasus-1, a multimodal language model specialized in video content understanding and interaction through natural language. Pegasus-1 is designed to address the unique challenges posed by video data, such as interpreting spatiotemporal information, to offer nuanced video content comprehension across various lengths. This technical report overviews Pegasus-1's archi…

    Submitted 22 April, 2024; originally announced April 2024.

  25. arXiv:2404.13808  [pdf, other]

    cs.IR cs.LG cs.MM

    General Item Representation Learning for Cold-start Content Recommendations

    Authors: Jooeun Kim, Jinri Kim, Kwangeun Yeo, Eungi Kim, Kyoung-Woon On, Jonghwan Mun, Joonseok Lee

    Abstract: Cold-start item recommendation is a long-standing challenge in recommendation systems. A common remedy is to use a content-based approach, but rich information from raw contents in various forms has not been fully utilized. In this paper, we propose a domain/data-agnostic item representation learning framework for cold-start recommendations, naturally equipped with multimodal alignment among vario…

    Submitted 21 April, 2024; originally announced April 2024.

    Comments: 14 pages

  26. arXiv:2404.13113  [pdf, other]

    quant-ph cs.ET

    Towards quantum computing for clinical trial design and optimization: A perspective on new opportunities and challenges

    Authors: Hakan Doga, M. Emre Sahin, Joao Bettencourt-Silva, Anh Pham, Eunyoung Kim, Alan Andress, Sudhir Saxena, Aritra Bose, Laxmi Parida, Jan Lukas Robertus, Hideaki Kawaguchi, Radwa Soliman, Daniel Blankenberg

    Abstract: Clinical trials are pivotal in the drug discovery process to determine the safety and efficacy of a drug candidate. The high failure rates of these trials are attributed to deficiencies in clinical model development and protocol design. Improvements in the clinical drug design process could therefore yield significant benefits for all stakeholders involved. This paper examines the current challeng…

    Submitted 19 April, 2024; originally announced April 2024.

  27. arXiv:2404.08814  [pdf, other]

    cs.CV cs.AI cs.LG

    E3: Ensemble of Expert Embedders for Adapting Synthetic Image Detectors to New Generators Using Limited Data

    Authors: Aref Azizpour, Tai D. Nguyen, Manil Shrestha, Kaidi Xu, Edward Kim, Matthew C. Stamm

    Abstract: As generative AI progresses rapidly, new synthetic image generators continue to emerge at a swift pace. Traditional detection methods face two main challenges in adapting to these generators: the forensic traces of synthetic images from new techniques can vastly differ from those learned during training, and access to data for these new generators is often limited. To address these issues, we intr…

    Submitted 16 April, 2024; v1 submitted 12 April, 2024; originally announced April 2024.

    Comments: 11 pages, 4 figures, To be published in CVPRWMF24

  28. arXiv:2404.01954  [pdf, other]

    cs.CL cs.AI

    HyperCLOVA X Technical Report

    Authors: Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seongjin Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han , et al. (371 additional authors not shown)

    Abstract: We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t…

    Submitted 13 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 44 pages; updated authors list and fixed author names

  29. arXiv:2404.01808  [pdf, other]

    cs.CR

    Software-Defined Cryptography: A Design Feature of Cryptographic Agility

    Authors: Jihoon Cho, Changhoon Lee, Eunkyung Kim, Jieun Lee, Beumjin Cho

    Abstract: Given the widespread use of cryptography in Enterprise IT, migration to post-quantum cryptography (PQC) is not drop-in replacement at all. Cryptographic agility, or crypto-agility, is a design feature that enables seamless updates to new cryptographic algorithms and standards without the need to modify or replace the surrounding infrastructure. This paper introduces a notion of software-defined cr…

    Submitted 1 September, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

  30. arXiv:2404.00670  [pdf, other]

    cs.CV q-bio.QM stat.AP

    Statistical Analysis by Semiparametric Additive Regression and LSTM-FCN Based Hierarchical Classification for Computer Vision Quantification of Parkinsonian Bradykinesia

    Authors: Youngseo Cho, In Hee Kwak, Dohyeon Kim, Jinhee Na, Hanjoo Sung, Jeongjae Lee, Young Eun Kim, Hyeo-il Ma

    Abstract: Bradykinesia, characterized by involuntary slowing or decrement of movement, is a fundamental symptom of Parkinson's Disease (PD) and is vital for its clinical diagnosis. Despite various methodologies explored to quantify bradykinesia, computer vision-based approaches have shown promising results. However, these methods often fall short in adequately addressing key bradykinesia characteristics in…

    Submitted 31 March, 2024; originally announced April 2024.

  31. arXiv:2403.17428  [pdf, other]

    cs.AI cs.CL

    Aligning Large Language Models for Enhancing Psychiatric Interviews through Symptom Delineation and Summarization

    Authors: Jae-hee So, Joonhwan Chang, Eunji Kim, Junho Na, JiYeon Choi, Jy-yong Sohn, Byung-Hoon Kim, Sang Hui Chu

    Abstract: Recent advancements in Large Language Models (LLMs) have accelerated their usage in various domains. Given the fact that psychiatric interviews are goal-oriented and structured dialogues between the professional interviewer and the interviewee, it is one of the most underexplored areas where LLMs can contribute substantial value. Here, we explore the use of LLMs for enhancing psychiatric interview…

    Submitted 26 March, 2024; originally announced March 2024.

  32. arXiv:2403.06412  [pdf, other]

    cs.CL

    CLIcK: A Benchmark Dataset of Cultural and Linguistic Intelligence in Korean

    Authors: Eunsu Kim, Juyoung Suk, Philhoon Oh, Haneul Yoo, James Thorne, Alice Oh

    Abstract: Despite the rapid development of large language models (LLMs) for the Korean language, there remains an obvious lack of benchmark datasets that test the requisite Korean cultural and linguistic knowledge. Because many existing Korean benchmark datasets are derived from the English counterparts through translation, they often overlook the different cultural contexts. For the few benchmark datasets…

    Submitted 4 July, 2024; v1 submitted 10 March, 2024; originally announced March 2024.

  33. arXiv:2403.03154  [pdf, other]

    physics.comp-ph cond-mat.other cs.AI

    Quantum Many-Body Physics Calculations with Large Language Models

    Authors: Haining Pan, Nayantara Mudur, Will Taranto, Maria Tikhanovskaya, Subhashini Venugopalan, Yasaman Bahri, Michael P. Brenner, Eun-Ah Kim

    Abstract: Large language models (LLMs) have demonstrated an unprecedented ability to perform complex tasks in multiple domains, including mathematical and scientific reasoning. We demonstrate that with carefully designed prompts, LLMs can accurately carry out key calculations in research papers in theoretical physics. We focus on a broadly used approximation method in quantum physics: the Hartree-Fock metho…

    Submitted 22 August, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

    Comments: 9 pages, 4 figures. Supplemental material in the source file

  34. arXiv:2403.02236  [pdf, other]

    eess.IV cs.CV

    Interpretable Models for Detecting and Monitoring Elevated Intracranial Pressure

    Authors: Darryl Hannan, Steven C. Nesbit, Ximing Wen, Glen Smith, Qiao Zhang, Alberto Goffi, Vincent Chan, Michael J. Morris, John C. Hunninghake, Nicholas E. Villalobos, Edward Kim, Rosina O. Weber, Christopher J. MacLellan

    Abstract: Detecting elevated intracranial pressure (ICP) is crucial in diagnosing and managing various neurological conditions. These fluctuations in pressure are transmitted to the optic nerve sheath (ONS), resulting in changes to its diameter, which can then be detected using ultrasound imaging devices. However, interpreting sonographic images of the ONS can be challenging. In this work, we propose two sy…

    Submitted 4 March, 2024; originally announced March 2024.

    Comments: 5 pages, 2 figures, ISBI 2024

  35. arXiv:2402.18045  [pdf, other]

    cs.CL

    Multi-FAct: Assessing Multilingual LLMs' Multi-Regional Knowledge using FActScore

    Authors: Sheikh Shafayat, Eunsu Kim, Juhyun Oh, Alice Oh

    Abstract: Large Language Models (LLMs) are prone to factuality hallucination, generating text that contradicts established knowledge. While extensive research has addressed this in English, little is known about multilingual LLMs. This paper systematically evaluates multilingual LLMs' factual accuracy across languages and geographic regions. We introduce a novel pipeline for multilingual factuality evaluati…

    Submitted 1 March, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

  36. arXiv:2402.12298  [pdf, other]

    cs.CL cs.AI

    Is Open-Source There Yet? A Comparative Study on Commercial and Open-Source LLMs in Their Ability to Label Chest X-Ray Reports

    Authors: Felix J. Dorfner, Liv Jürgensen, Leonhard Donle, Fares Al Mohamad, Tobias R. Bodenmann, Mason C. Cleveland, Felix Busch, Lisa C. Adams, James Sato, Thomas Schultz, Albert E. Kim, Jameson Merkow, Keno K. Bressem, Christopher P. Bridge

    Abstract: Introduction: With the rapid advances in large language models (LLMs), there have been numerous new open source as well as commercial models. While recent publications have explored GPT-4 in its application to extracting information of interest from radiology reports, there has not been a real-world comparison of GPT-4 to different leading open-source models. Materials and Methods: Two different…

    Submitted 19 February, 2024; originally announced February 2024.

  37. arXiv:2402.06805  [pdf, other]

    cs.CV

    Event-to-Video Conversion for Overhead Object Detection

    Authors: Darryl Hannan, Ragib Arnab, Gavin Parpart, Garrett T. Kenyon, Edward Kim, Yijing Watkins

    Abstract: Collecting overhead imagery using an event camera is desirable due to the energy efficiency of the image sensor compared to standard cameras. However, event cameras complicate downstream image processing, especially for complex tasks such as object detection. In this paper, we investigate the viability of event streams for overhead object detection. We demonstrate that across a number of standard…

    Submitted 9 February, 2024; originally announced February 2024.

    Comments: 5 pages, 1 figure, SSIAI 2024

  38. arXiv:2402.06204  [pdf, other]

    cs.CL cs.AI

    The Generative AI Paradox on Evaluation: What It Can Solve, It May Not Evaluate

    Authors: Juhyun Oh, Eunsu Kim, Inha Cha, Alice Oh

    Abstract: This paper explores the assumption that Large Language Models (LLMs) skilled in generation tasks are equally adept as evaluators. We assess the performance of three LLMs and one open-source LM in Question-Answering (QA) and evaluation tasks using the TriviaQA (Joshi et al., 2017) dataset. Results indicate a significant disparity, with LLMs exhibiting lower performance in evaluation tasks compared…

    Submitted 9 February, 2024; originally announced February 2024.

  39. arXiv:2402.03303  [pdf, other]

    cs.CL cs.AI cs.LG

    Nevermind: Instruction Override and Moderation in Large Language Models

    Authors: Edward Kim

    Abstract: Given the impressive capabilities of recent Large Language Models (LLMs), we investigate and benchmark the most popular proprietary and different sized open source models on the task of explicit instruction following in conflicting situations, e.g. overrides. These include the ability of the model to override the knowledge within the weights of the model, the ability to override (or moderate) extr…

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: 11 pages

  40. arXiv:2401.03060  [pdf]

    eess.IV cs.CV

    Super-resolution multi-contrast unbiased eye atlases with deep probabilistic refinement

    Authors: Ho Hin Lee, Adam M. Saunders, Michael E. Kim, Samuel W. Remedios, Lucas W. Remedios, Yucheng Tang, Qi Yang, Xin Yu, Shunxing Bao, Chloe Cho, Louise A. Mawn, Tonia S. Rex, Kevin L. Schey, Blake E. Dewey, Jeffrey M. Spraggins, Jerry L. Prince, Yuankai Huo, Bennett A. Landman

    Abstract: Purpose: Eye morphology varies significantly across the population, especially for the orbit and optic nerve. These variations limit the feasibility and robustness of generalizing population-wise features of eye organs to an unbiased spatial reference. Approach: To tackle these limitations, we propose a process for creating high-resolution unbiased eye atlases. First, to restore spatial details…

    Submitted 14 June, 2024; v1 submitted 5 January, 2024; originally announced January 2024.

    Comments: Revised for submission to SPIE Journal of Medical Imaging. 26 pages, 6 figures

  41. arXiv:2401.02710  [pdf, other]

    cs.CE cs.AI

    Synergistic Formulaic Alpha Generation for Quantitative Trading based on Reinforcement Learning

    Authors: Hong-Gi Shin, Sukhyun Jeong, Eui-Yeon Kim, Sungho Hong, Young-Jin Cho, Yong-Hoon Choi

    Abstract: Mining of formulaic alpha factors refers to the process of discovering and developing specific factors or indicators (referred to as alpha factors) for quantitative trading in the stock market. To efficiently discover alpha factors in a vast search space, reinforcement learning (RL) is commonly employed. This paper proposes a method to enhance existing alpha factor mining approaches by expanding a searc…

    Submitted 7 July, 2024; v1 submitted 5 January, 2024; originally announced January 2024.

    Comments: Accepted by ICOIN 2024

  42. arXiv:2312.16580  [pdf, other]

    cs.CV

    VLCounter: Text-aware Visual Representation for Zero-Shot Object Counting

    Authors: Seunggu Kang, WonJun Moon, Euiyeon Kim, Jae-Pil Heo

    Abstract: Zero-Shot Object Counting (ZSOC) aims to count referred instances of arbitrary classes in a query image without human-annotated exemplars. To deal with ZSOC, preceding studies proposed a two-stage pipeline: discovering exemplars and counting. However, there remains a challenge of vulnerability to error propagation of the sequentially designed two-stage process. In this work, a one-stage baseline,…

    Submitted 30 December, 2023; v1 submitted 27 December, 2023; originally announced December 2023.

    Comments: Accepted to AAAI 2024. Code is available at https://github.com/Seunggu0305/VLCounter

  43. GRIL-Calib: Targetless Ground Robot IMU-LiDAR Extrinsic Calibration Method using Ground Plane Motion Constraints

    Authors: TaeYoung Kim, Gyuhyeon Pak, Euntai Kim

    Abstract: Targetless IMU-LiDAR extrinsic calibration methods are gaining significant attention as the importance of the IMU-LiDAR fusion system increases. Notably, existing calibration methods derive calibration parameters under the assumption that the methods require full motion in all axes. When IMU and LiDAR are mounted on a ground robot the motion of which is restricted to planar motion, existing calibr…

    Submitted 24 May, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

    Comments: 8 pages, 7 figures

  44. arXiv:2312.02796  [pdf, other]

    cond-mat.mtrl-sci cond-mat.str-el cs.LG physics.data-an

    Materials Expert-Artificial Intelligence for Materials Discovery

    Authors: Yanjun Liu, Milena Jovanovic, Krishnanand Mallayya, Wesley J. Maddox, Andrew Gordon Wilson, Sebastian Klemenz, Leslie M. Schoop, Eun-Ah Kim

    Abstract: The advent of material databases provides an unprecedented opportunity to uncover predictive descriptors for emergent material properties from a vast data space. However, common reliance on high-throughput ab initio data necessarily inherits limitations of such data: mismatch with experiments. On the other hand, experimental decisions are often guided by an expert's intuition honed from experiences…

    Submitted 5 December, 2023; originally announced December 2023.

    Comments: 8 pages main text, 4 figs, 8 pages Supplementary material

  45. arXiv:2311.17295  [pdf, other]

    cs.CL cs.AI

    Elo Uncovered: Robustness and Best Practices in Language Model Evaluation

    Authors: Meriem Boubdir, Edward Kim, Beyza Ermis, Sara Hooker, Marzieh Fadaee

    Abstract: In Natural Language Processing (NLP), the Elo rating system, originally designed for ranking players in dynamic games such as chess, is increasingly being used to evaluate Large Language Models (LLMs) through "A vs B" paired comparisons. However, while popular, the system's suitability for assessing entities with constant skill levels, such as LLMs, remains relatively unexplored. We study two fund…

    Submitted 28 November, 2023; originally announced November 2023.

    Comments: 22 pages, 7 figures, 2 tables. Revised version of the paper accepted at GEM Workshop, EMNLP 2023

  46. arXiv:2311.14237  [pdf, other]

    cs.LG cs.CV

    Pseudo-label Correction for Instance-dependent Noise Using Teacher-student Framework

    Authors: Eugene Kim

    Abstract: The high capacity of deep learning models to learn complex patterns poses a significant challenge when confronted with label noise. The inability to differentiate clean and noisy labels ultimately results in poor generalization. We approach this problem by reassigning the label for each image using a new teacher-student based framework termed P-LC (pseudo-label correction). Traditional teacher-stu…

    Submitted 23 November, 2023; originally announced November 2023.

  47. arXiv:2311.03500  [pdf]

    eess.IV cs.CV q-bio.NC

    Predicting Age from White Matter Diffusivity with Residual Learning

    Authors: Chenyu Gao, Michael E. Kim, Ho Hin Lee, Qi Yang, Nazirah Mohd Khairi, Praitayini Kanakaraj, Nancy R. Newlin, Derek B. Archer, Angela L. Jefferson, Warren D. Taylor, Brian D. Boyd, Lori L. Beason-Held, Susan M. Resnick, The BIOCARD Study Team, Yuankai Huo, Katherine D. Van Schaik, Kurt G. Schilling, Daniel Moyer, Ivana Išgum, Bennett A. Landman

    Abstract: Imaging findings inconsistent with those expected at specific chronological age ranges may serve as early indicators of neurological disorders and increased mortality risk. Estimation of chronological age, and deviations from expected results, from structural MRI data has become an important task for developing biomarkers that are sensitive to such deviations. Complementary to structural analysis,…

    Submitted 21 January, 2024; v1 submitted 6 November, 2023; originally announced November 2023.

    Comments: SPIE Medical Imaging: Image Processing. San Diego, CA. February 2024 (accepted as poster presentation)

  48. arXiv:2310.14424  [pdf, other]

    cs.CL cs.AI

    Which Prompts Make The Difference? Data Prioritization For Efficient Human LLM Evaluation

    Authors: Meriem Boubdir, Edward Kim, Beyza Ermis, Marzieh Fadaee, Sara Hooker

    Abstract: Human evaluation is increasingly critical for assessing large language models, capturing linguistic nuances, and reflecting user preferences more accurately than traditional automated metrics. However, the resource-intensive nature of this type of annotation process poses significant challenges. The key question driving our work: "is it feasible to minimize human-in-the-loop feedback by prioritizi…

    Submitted 22 October, 2023; originally announced October 2023.

    Comments: 37 pages, 8 figures

  49. arXiv:2310.04941  [pdf, other]

    cs.LG cs.AI

    Reliable Test-Time Adaptation via Agreement-on-the-Line

    Authors: Eungyeup Kim, Mingjie Sun, Aditi Raghunathan, Zico Kolter

    Abstract: Test-time adaptation (TTA) methods aim to improve robustness to distribution shifts by adapting models using unlabeled data from the shifted test distribution. However, there remain unresolved challenges that undermine the reliability of TTA, which include difficulties in evaluating TTA performance, miscalibration after TTA, and unreliable hyperparameter tuning for adaptation. In this work, we mak…

    Submitted 7 October, 2023; originally announced October 2023.

    Comments: 19 pages, 9 figures

  50. arXiv:2310.02692  [pdf, other]

    cs.CV cs.AI

    Clustering-based Image-Text Graph Matching for Domain Generalization

    Authors: Nokyung Park, Daewon Chae, Jeongyong Shim, Sangpil Kim, Eun-Sol Kim, Jinkyu Kim

    Abstract: Learning domain-invariant visual representations is important to train a model that can generalize well to unseen target task domains. Recent works demonstrate that text descriptions contain high-level class-discriminative information and such auxiliary semantic cues can be used as an effective pivot embedding for the domain generalization problem. However, they use the pivot embedding in a global manner (i.e.…

    Submitted 15 April, 2024; v1 submitted 4 October, 2023; originally announced October 2023.