
Showing 1–50 of 230 results for author: Park, K

Searching in archive cs.
  1. arXiv:2407.16171  [pdf, other]

    cs.CV cs.AI cs.MM

    Learning Trimodal Relation for Audio-Visual Question Answering with Missing Modality

    Authors: Kyu Ri Park, Hong Joo Lee, Jung Uk Kim

    Abstract: Recent Audio-Visual Question Answering (AVQA) methods rely on complete visual and audio input to answer questions accurately. However, in real-world scenarios, issues such as device malfunctions and data transmission errors frequently result in missing audio or visual modality. In such cases, existing AVQA methods suffer significant performance degradation. In this paper, we propose a framework th…

    Submitted 23 July, 2024; originally announced July 2024.

    Comments: Accepted at ECCV 2024

  2. arXiv:2407.15296  [pdf, other]

    cs.CV cs.CL cs.LG

    Weak-to-Strong Compositional Learning from Generative Models for Language-based Object Detection

    Authors: Kwanyong Park, Kuniaki Saito, Donghyun Kim

    Abstract: Vision-language (VL) models often exhibit a limited understanding of complex expressions of visual objects (e.g., attributes, shapes, and their relations), given complex and diverse language queries. Traditional approaches attempt to improve VL models using hard negative synthetic text, but their effectiveness is limited. In this paper, we harness the exceptional compositional understanding capabi…

    Submitted 21 July, 2024; originally announced July 2024.

    Comments: ECCV 2024

  3. arXiv:2407.13942  [pdf, other]

    cs.CY cs.AI cs.CL cs.SI

    Harmful Suicide Content Detection

    Authors: Kyumin Park, Myung Jae Baik, YeongJun Hwang, Yen Shin, HoJae Lee, Ruda Lee, Sang Min Lee, Je Young Hannah Sun, Ah Rah Lee, Si Yeun Yoon, Dong-ho Lee, Jihyung Moon, JinYeong Bak, Kyunghyun Cho, Jong-Woo Paik, Sungjoon Park

    Abstract: Harmful suicide content on the Internet is a significant risk factor inducing suicidal thoughts and behaviors among vulnerable populations. Despite global efforts, existing resources are insufficient, specifically in high-risk regions like the Republic of Korea. Current research mainly focuses on understanding negative effects of such content or suicide risk in individuals, rather than on automati…

    Submitted 2 June, 2024; originally announced July 2024.

    Comments: 30 pages, 7 figures

  4. arXiv:2407.08464  [pdf, other]

    cs.LG cs.AI

    TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations

    Authors: Junik Bae, Kwanyoung Park, Youngwoon Lee

    Abstract: Unsupervised goal-conditioned reinforcement learning (GCRL) is a promising paradigm for developing diverse robotic skills without external supervision. However, existing unsupervised GCRL methods often struggle to cover a wide range of states in complex environments due to their limited exploration and sparse or noisy rewards for GCRL. To overcome these challenges, we propose a novel unsupervised…

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: Website: https://heatz123.github.io/tldr

  5. arXiv:2407.07314  [pdf, ps, other]

    cs.IT

    Proactive Eavesdropping in Relay Systems via Trajectory and Power Optimization

    Authors: Qian Dan, Hongjiang Lei, Ki-Hong Park, Weijia Lei, Gaofeng Pan

    Abstract: Wireless relays can effectively extend the transmission range of information. However, if relay technology is utilized unlawfully, it can amplify potential harm. Effectively surveilling illegitimate relay links poses a challenging problem. Unmanned aerial vehicles (UAVs) can proactively surveil wireless relay systems due to their flexible mobility. This work focuses on maximizing the eavesdropping…

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: 14 pages, 8 figures, submitted to IEEE Journal for review

  6. arXiv:2407.06521  [pdf, ps, other]

    cs.IT eess.SP

    Beamforming Design for Joint Target Sensing and Proactive Eavesdropping

    Authors: Qian Dan, Hongjiang Lei, Ki-Hong Park, Gaofeng Pan, Mohamed-Slim Alouini

    Abstract: This work studies the beamforming design in the joint target sensing and proactive eavesdropping (JTSAPE) system. The JTSAPE base station (BS) receives the information transmitted by the illegal transmitter and transmits the waveform for target sensing. The shared waveform also serves as artificial noise to interfere with the illegal receiver, thereby achieving proactive eavesdropping. We firstly…

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: 26 pages, 6 figures, submitted to IEEE Journal for review

  7. arXiv:2407.06333  [pdf, ps, other]

    cs.LG cs.NE math.NA

    A third-order finite difference weighted essentially non-oscillatory scheme with shallow neural network

    Authors: Kwanghyuk Park, Xinjuan Chen, Dongjin Lee, Jiaxi Gu, Jae-Hun Jung

    Abstract: In this paper, we introduce the finite difference weighted essentially non-oscillatory (WENO) scheme based on the neural network for hyperbolic conservation laws. We employ the supervised learning and design two loss functions, one with the mean squared error and the other with the mean squared logarithmic error, where the WENO3-JS weights are computed as the labels. Each loss function consists of…

    Submitted 10 July, 2024; v1 submitted 8 July, 2024; originally announced July 2024.

  8. arXiv:2407.00699  [pdf, other]

    cs.LG cs.AI

    Tackling Long-Horizon Tasks with Model-based Offline Reinforcement Learning

    Authors: Kwanyoung Park, Youngwoon Lee

    Abstract: Model-based offline reinforcement learning (RL) is a compelling approach that addresses the challenge of learning from limited, static data by generating imaginary trajectories using learned models. However, it falls short in solving long-horizon tasks due to high bias in value estimation from model rollouts. In this paper, we introduce a novel model-based offline RL method, Lower Expectile Q-lear…

    Submitted 30 June, 2024; originally announced July 2024.

    Comments: https://kwanyoungpark.github.io/LEQ/

  9. arXiv:2406.18898  [pdf, other]

    cs.CV cs.AI

    360 in the Wild: Dataset for Depth Prediction and View Synthesis

    Authors: Kibaek Park, Francois Rameau, Jaesik Park, In So Kweon

    Abstract: The large abundance of perspective camera datasets facilitated the emergence of novel learning-based strategies for various tasks, such as camera localization, single image depth estimation, or view synthesis. However, panoramic or omnidirectional image datasets, including essential information, such as pose and depth, are mostly made with synthetic scenes. In this work, we introduce a large scale…

    Submitted 4 July, 2024; v1 submitted 27 June, 2024; originally announced June 2024.

  10. arXiv:2406.09948  [pdf, other]

    cs.CL

    BLEnD: A Benchmark for LLMs on Everyday Knowledge in Diverse Cultures and Languages

    Authors: Junho Myung, Nayeon Lee, Yi Zhou, Jiho Jin, Rifki Afina Putri, Dimosthenis Antypas, Hsuvas Borkakoty, Eunsu Kim, Carla Perez-Almendros, Abinew Ali Ayele, Víctor Gutiérrez-Basulto, Yazmín Ibáñez-García, Hwaran Lee, Shamsuddeen Hassan Muhammad, Kiwoong Park, Anar Sabuhi Rzayev, Nina White, Seid Muhie Yimam, Mohammad Taher Pilehvar, Nedjma Ousidhoum, Jose Camacho-Collados, Alice Oh

    Abstract: Large language models (LLMs) often lack culture-specific knowledge of daily life, especially across diverse regions and non-English languages. Existing benchmarks for evaluating LLMs' cultural sensitivities are limited to a single language or collected from online sources such as Wikipedia, which do not reflect the mundane everyday lifestyles of diverse regions. That is, information about the food…

    Submitted 14 June, 2024; originally announced June 2024.

  11. arXiv:2406.06842  [pdf, ps, other]

    cs.IT eess.SP

    Aerial Relay to Achieve Covertness and Security

    Authors: Jiacheng Jiang, Hongjiang Lei, Ki-Hong Park, Gaofeng Pan, Mohamed-Slim Alouini

    Abstract: In this work, a delay-tolerant unmanned aerial vehicle (UAV) relayed covert and secure communication framework is investigated. In this framework, a legitimate UAV serves as an aerial relay to realize communication when the direct link between the terrestrial transmitter and receiver is blocked and also acts as a friendly jammer to suppress the malicious nodes presented on the ground. Subsequently…

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 12 pages, 6 figures, submitted to IEEE Journal for review

  12. arXiv:2406.06527  [pdf, other]

    cs.CV cs.AI cs.GR

    IllumiNeRF: 3D Relighting without Inverse Rendering

    Authors: Xiaoming Zhao, Pratul P. Srinivasan, Dor Verbin, Keunhong Park, Ricardo Martin Brualla, Philipp Henzler

    Abstract: Existing methods for relightable view synthesis -- using a set of images of an object under unknown lighting to recover a 3D representation that can be rendered from novel viewpoints under a target illumination -- are based on inverse rendering, and attempt to disentangle the object geometry, materials, and lighting that explain the input images. Furthermore, this typically involves optimization t…

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: Project page: https://illuminerf.github.io/

  13. arXiv:2406.05936  [pdf, ps, other]

    cs.IT

    Multi-UAV Trajectory Design for Fair and Secure Communication

    Authors: Hongjiang Lei, Dongyang Meng, Haoxiang Ran, Ki-Hong Park, Gaofeng Pan, Mohamed-Slim Alouini

    Abstract: Unmanned aerial vehicles (UAVs) play an essential role in future wireless communication networks due to their high mobility, low cost, and on-demand deployment. In air-to-ground links, UAVs are widely used to enhance the performance of wireless communication systems due to the presence of high-probability line-of-sight (LoS) links. However, the high probability of LoS links also increases the risk…

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: 14 pages, 10 figures, submitted to IEEE Journal for review

  14. arXiv:2406.01506  [pdf, other]

    cs.CL cs.AI cs.LG stat.ML

    The Geometry of Categorical and Hierarchical Concepts in Large Language Models

    Authors: Kiho Park, Yo Joong Choe, Yibo Jiang, Victor Veitch

    Abstract: Understanding how semantic meaning is encoded in the representation spaces of large language models is a fundamental problem in interpretability. In this paper, we study the two foundational questions in this area. First, how are categorical concepts, such as {'mammal', 'bird', 'reptile', 'fish'}, represented? Second, how are hierarchical relations between concepts encoded? For example, how is the…

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: Code is available at https://github.com/KihoPark/LLM_Categorical_Hierarchical_Representations

  15. arXiv:2406.01313  [pdf, ps, other]

    cs.IT eess.SP

    3D Trajectory Design for Energy-constrained Aerial CRNs Under Probabilistic LoS Channel

    Authors: Hongjiang Lei, Xiaqiu Wu, Ki-Hong Park, Gaofeng Pan

    Abstract: Unmanned aerial vehicles (UAVs) have been attracting significant attention because there is a high probability of line-of-sight links being obtained between them and terrestrial nodes in high-rise urban areas. In this work, we investigate cognitive radio networks (CRNs) by jointly designing three-dimensional (3D) trajectory, the transmit power of the UAV, and user scheduling. Considering the UAV's…

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 13 pages, 6 figures, submitted to the IEEE journal for review

  16. arXiv:2405.21047  [pdf, other]

    cs.AI cs.CL cs.LG

    Grammar-Aligned Decoding

    Authors: Kanghee Park, Jiayu Wang, Taylor Berg-Kirkpatrick, Nadia Polikarpova, Loris D'Antoni

    Abstract: Large Language Models (LLMs) struggle with reliably generating highly structured outputs, such as program code, mathematical formulas, or well-formed markup. Constrained decoding approaches mitigate this problem by greedily restricting what tokens an LLM can output at each step to guarantee that the output matches a given constraint. Specifically, in grammar-constrained decoding (GCD), the LLM's o…

    Submitted 31 May, 2024; originally announced May 2024.

  17. arXiv:2405.19899  [pdf, other]

    cs.CV cs.AI

    Open-Set Domain Adaptation for Semantic Segmentation

    Authors: Seun-An Choe, Ah-Hyung Shin, Keon-Hee Park, Jinwoo Choi, Gyeong-Moon Park

    Abstract: Unsupervised domain adaptation (UDA) for semantic segmentation aims to transfer the pixel-wise knowledge from the labeled source domain to the unlabeled target domain. However, current UDA methods typically assume a shared label space between source and target, limiting their applicability in real-world scenarios where novel categories may emerge in the target domain. In this paper, we introduce O…

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: 14 pages, 5 figures, 13 tables, CVPR 2024 Poster

  18. arXiv:2405.11911  [pdf, other]

    cs.AI cs.LG cs.SI

    PULL: PU-Learning-based Accurate Link Prediction

    Authors: Junghun Kim, Ka Hyun Park, Hoyoung Yoon, U Kang

    Abstract: Given an edge-incomplete graph, how can we accurately find the missing links? The link prediction in edge-incomplete graphs aims to discover the missing relations between entities when their relationships are represented as a graph. Edge-incomplete graphs are prevalent in real-world due to practical limitations, such as not checking all users when adding friends in a social network. Addressing the…

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: 11 pages

  19. arXiv:2405.03162  [pdf, other]

    cs.CV cs.AI cs.CL cs.LG

    Advancing Multimodal Medical Capabilities of Gemini

    Authors: Lin Yang, Shawn Xu, Andrew Sellergren, Timo Kohlberger, Yuchen Zhou, Ira Ktena, Atilla Kiraly, Faruk Ahmed, Farhad Hormozdiari, Tiam Jaroensri, Eric Wang, Ellery Wulczyn, Fayaz Jamil, Theo Guidroz, Chuck Lau, Siyuan Qiao, Yun Liu, Akshay Goel, Kendall Park, Arnav Agharwal, Nick George, Yang Wang, Ryutaro Tanno, David G. T. Barrett, Wei-Hung Weng, et al. (22 additional authors not shown)

    Abstract: Many clinical tasks require an understanding of specialized data, such as medical images and genomics, which is not typically found in general-purpose large multimodal models. Building upon Gemini's multimodal models, we develop several models within the new Med-Gemini family that inherit core capabilities of Gemini and are optimized for medical use via fine-tuning with 2D and 3D radiology, histop…

    Submitted 6 May, 2024; originally announced May 2024.

  20. arXiv:2405.01554  [pdf, other]

    cs.LG cs.AI q-bio.NC

    Early-stage detection of cognitive impairment by hybrid quantum-classical algorithm using resting-state functional MRI time-series

    Authors: Junggu Choi, Tak Hur, Daniel K. Park, Na-Young Shin, Seung-Koo Lee, Hakbae Lee, Sanghoon Han

    Abstract: Following the recent development of quantum machine learning techniques, the literature has reported several quantum machine learning algorithms for disease detection. This study explores the application of a hybrid quantum-classical algorithm for classifying region-of-interest time-series data obtained from resting-state functional magnetic resonance imaging in patients with early-stage cognitive…

    Submitted 16 March, 2024; originally announced May 2024.

    Comments: 28 pages, 10 figures

  21. arXiv:2404.15882  [pdf, ps, other]

    cs.CV cs.AI

    Unexplored Faces of Robustness and Out-of-Distribution: Covariate Shifts in Environment and Sensor Domains

    Authors: Eunsu Baek, Keondo Park, Jiyoon Kim, Hyung-Sin Kim

    Abstract: Computer vision applications predict on digital images acquired by a camera from physical scenes through light. However, conventional robustness benchmarks rely on perturbations in digitized images, diverging from distribution shifts occurring in the image acquisition process. To bridge this gap, we introduce a new distribution shift dataset, ImageNet-ES, comprising variations in environmental and…

    Submitted 25 April, 2024; v1 submitted 24 April, 2024; originally announced April 2024.

    Comments: Published as a conference paper at CVPR 2024

  22. arXiv:2404.14687  [pdf, other]

    cs.MM cs.AI cs.CL cs.CV

    Pegasus-v1 Technical Report

    Authors: Raehyuk Jung, Hyojun Go, Jaehyuk Yi, Jiho Jang, Daniel Kim, Jay Suh, Aiden Lee, Cooper Han, Jae Lee, Jeff Kim, Jin-Young Kim, Junwan Kim, Kyle Park, Lucas Lee, Mars Ha, Minjoon Seo, Abraham Jo, Ed Park, Hassan Kianinejad, SJ Kim, Tony Moon, Wade Jeong, Andrei Popescu, Esther Kim, EK Yoon, et al. (19 additional authors not shown)

    Abstract: This technical report introduces Pegasus-1, a multimodal language model specialized in video content understanding and interaction through natural language. Pegasus-1 is designed to address the unique challenges posed by video data, such as interpreting spatiotemporal information, to offer nuanced video content comprehension across various lengths. This technical report overviews Pegasus-1's archi…

    Submitted 22 April, 2024; originally announced April 2024.

  23. arXiv:2404.02117  [pdf, other]

    cs.CV

    Pre-trained Vision and Language Transformers Are Few-Shot Incremental Learners

    Authors: Keon-Hee Park, Kyungwoo Song, Gyeong-Moon Park

    Abstract: Few-Shot Class Incremental Learning (FSCIL) is a task that requires a model to learn new classes incrementally without forgetting when only a few samples for each class are given. FSCIL encounters two significant challenges: catastrophic forgetting and overfitting, and these challenges have driven prior studies to primarily rely on shallow models, such as ResNet-18. Even though their limited capac…

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: Accepted by CVPR 2024

  24. arXiv:2404.01954  [pdf, other]

    cs.CL cs.AI

    HyperCLOVA X Technical Report

    Authors: Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seongjin Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han, et al. (371 additional authors not shown)

    Abstract: We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t…

    Submitted 13 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 44 pages; updated authors list and fixed author names

  25. arXiv:2403.20225  [pdf, other]

    cs.CV

    MTMMC: A Large-Scale Real-World Multi-Modal Camera Tracking Benchmark

    Authors: Sanghyun Woo, Kwanyong Park, Inkyu Shin, Myungchul Kim, In So Kweon

    Abstract: Multi-target multi-camera tracking is a crucial task that involves identifying and tracking individuals over time using video streams from multiple cameras. This task has practical applications in various fields, such as visual surveillance, crowd behavior analysis, and anomaly detection. However, due to the difficulty and cost of collecting and labeling data, existing datasets for this task are e…

    Submitted 29 March, 2024; originally announced March 2024.

    Comments: Accepted at CVPR 2024

  26. arXiv:2403.19868  [pdf, other]

    cs.IT

    Jamming Intrusions in Extreme Bandwidth Communication: A Comprehensive Overview

    Authors: Richa Priyadarshani, Ki-Hong Park, Yalcin Ata, Mohamed-Slim Alouini

    Abstract: As the evolution of wireless communication progresses towards 6G networks, extreme bandwidth communication (EBC) emerges as a key enabler to meet the ambitious key performance indicator set for this next-generation technology. 6G aims for peak data rates of 1 Tb/s, peak spectral efficiency of 60 b/s/Hz, maximum bandwidth of 100 GHz, and mobility support up to 1000 km/h, while maintaining a high le…

    Submitted 28 March, 2024; originally announced March 2024.

  27. arXiv:2403.19099  [pdf, other]

    quant-ph cs.LG

    Optimizing Quantum Convolutional Neural Network Architectures for Arbitrary Data Dimension

    Authors: Changwon Lee, Israel F. Araujo, Dongha Kim, Junghan Lee, Siheon Park, Ju-Young Ryu, Daniel K. Park

    Abstract: Quantum convolutional neural networks (QCNNs) represent a promising approach in quantum machine learning, paving new directions for both quantum and classical data analysis. This approach is particularly attractive due to the absence of the barren plateau problem, a fundamental challenge in training quantum neural networks (QNNs), and its feasibility. However, a limitation arises when applying QCN…

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: 17 pages, 7 figures

  28. arXiv:2403.17368  [pdf, other]

    cs.CL cs.AI

    ChatGPT Rates Natural Language Explanation Quality Like Humans: But on Which Scales?

    Authors: Fan Huang, Haewoon Kwak, Kunwoo Park, Jisun An

    Abstract: As AI becomes more integral in our lives, the need for transparency and responsibility grows. While natural language explanations (NLEs) are vital for clarifying the reasoning behind AI decisions, evaluating them through human judgments is complex and resource-intensive due to subjectivity and the need for fine-grained ratings. This study explores the alignment between ChatGPT and human assessment…

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: Accepted by LREC-COLING 2024 main conference, long paper

  29. arXiv:2403.14140  [pdf, other]

    cs.CV cs.LG

    Learning Decomposable and Debiased Representations via Attribute-Centric Information Bottlenecks

    Authors: Jinyung Hong, Eun Som Jeon, Changhoon Kim, Keun Hee Park, Utkarsh Nath, Yezhou Yang, Pavan Turaga, Theodore P. Pavlic

    Abstract: Biased attributes, spuriously correlated with target labels in a dataset, can problematically lead to neural networks that learn improper shortcuts for classifications and limit their capabilities for out-of-distribution (OOD) generalization. Although many debiasing approaches have been proposed to ensure correct predictions from biased datasets, few studies have considered learning latent embeddi…

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: 24 pages, 16 figures, 3 tables

  30. arXiv:2403.06433  [pdf, other]

    cs.CV cs.AI

    Fine-Grained Pillar Feature Encoding Via Spatio-Temporal Virtual Grid for 3D Object Detection

    Authors: Konyul Park, Yecheol Kim, Junho Koh, Byungwoo Park, Jun Won Choi

    Abstract: Developing high-performance, real-time architectures for LiDAR-based 3D object detectors is essential for the successful commercialization of autonomous vehicles. Pillar-based methods stand out as a practical choice for onboard deployment due to their computational efficiency. However, despite their efficiency, these methods can sometimes underperform compared to alternative point encoding techniq…

    Submitted 11 March, 2024; originally announced March 2024.

    Comments: ICRA 2024

  31. arXiv:2402.11159  [pdf, other]

    cs.CL cs.CV

    Assessing News Thumbnail Representativeness: Counterfactual text can enhance the cross-modal matching ability

    Authors: Yejun Yoon, Seunghyun Yoon, Kunwoo Park

    Abstract: This paper addresses the critical challenge of assessing the representativeness of news thumbnail images, which often serve as the first visual engagement for readers when an article is disseminated on social media. We focus on whether a news image represents the actors discussed in the news text. To serve the challenge, we introduce NewsTT, a manually annotated dataset of 1000 news thumbnail imag…

    Submitted 6 June, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

    Comments: ACL 2024 (findings), 16 pages

  32. arXiv:2402.08979  [pdf, ps, other]

    eess.SY cs.AI cs.LG

    Learning-enabled Flexible Job-shop Scheduling for Scalable Smart Manufacturing

    Authors: Sihoon Moon, Sanghoon Lee, Kyung-Joon Park

    Abstract: In smart manufacturing systems (SMSs), flexible job-shop scheduling with transportation constraints (FJSPT) is essential to optimize solutions for maximizing productivity, considering production flexibility based on automated guided vehicles (AGVs). Recent developments in deep reinforcement learning (DRL)-based methods for FJSPT have encountered a scale generalization challenge. These methods unde…

    Submitted 14 February, 2024; originally announced February 2024.

  33. arXiv:2402.08958  [pdf, other]

    cs.LG cs.AI

    Towards Next-Level Post-Training Quantization of Hyper-Scale Transformers

    Authors: Junhan Kim, Kyungphil Park, Chungman Lee, Ho-young Kim, Joonyoung Kim, Yongkweon Jeon

    Abstract: With the increasing complexity of generative AI models, post-training quantization (PTQ) has emerged as a promising solution for deploying hyper-scale models on edge devices such as mobile devices and TVs. Existing PTQ schemes, however, consume considerable time and resources, which could be a bottleneck in real situations where frequent model updates and multiple hyper-parameter tunings are requi…

    Submitted 14 February, 2024; originally announced February 2024.

    Comments: 17 pages, under review

  34. arXiv:2312.11890  [pdf, other]

    cs.CL cs.SI

    Difficulty-Focused Contrastive Learning for Knowledge Tracing with a Large Language Model-Based Difficulty Prediction

    Authors: Unggi Lee, Sungjun Yoon, Joon Seo Yun, Kyoungsoo Park, YoungHoon Jung, Damji Stratton, Hyeoncheol Kim

    Abstract: This paper presents novel techniques for enhancing the performance of knowledge tracing (KT) models by focusing on the crucial factor of question and concept difficulty level. Despite the acknowledged significance of difficulty, previous KT research has yet to exploit its potential for model optimization and has struggled to predict difficulty from unseen data. To address these problems, we propos…

    Submitted 19 December, 2023; originally announced December 2023.

    Comments: 10 pages, 4 figures, 2 tables

  35. arXiv:2312.10486  [pdf, other]

    cs.DB

    Time-Constrained Continuous Subgraph Matching Using Temporal Information for Filtering and Backtracking

    Authors: Seunghwan Min, Jihoon Jang, Kunsoo Park, Dora Giammarresi, Giuseppe F. Italiano, Wook-Shin Han

    Abstract: Real-time analysis of graphs containing temporal information, such as social media streams, Q&A networks, and cyber data sources, plays an important role in various applications. Among them, detecting patterns is one of the fundamental graph analysis problems. In this paper, we study time-constrained continuous subgraph matching, which detects a pattern with a strict partial order on the edge set…

    Submitted 16 December, 2023; originally announced December 2023.

  36. arXiv:2312.04005  [pdf, other]

    cs.CV cs.AI

    KOALA: Empirical Lessons Toward Memory-Efficient and Fast Diffusion Models for Text-to-Image Synthesis

    Authors: Youngwan Lee, Kwanyong Park, Yoorhim Cho, Yong-Ju Lee, Sung Ju Hwang

    Abstract: As text-to-image (T2I) synthesis models increase in size, they demand higher inference costs due to the need for more expensive GPUs with larger memory, which makes it challenging to reproduce these models in addition to the restricted access to training datasets. Our study aims to reduce these inference costs and explores how far the generative capabilities of T2I models can be extended using onl…

    Submitted 28 May, 2024; v1 submitted 6 December, 2023; originally announced December 2023.

    Comments: Project page: https://youngwanlee.github.io/KOALA/

  37. arXiv:2312.02981  [pdf, other]

    cs.CV

    ReconFusion: 3D Reconstruction with Diffusion Priors

    Authors: Rundi Wu, Ben Mildenhall, Philipp Henzler, Keunhong Park, Ruiqi Gao, Daniel Watson, Pratul P. Srinivasan, Dor Verbin, Jonathan T. Barron, Ben Poole, Aleksander Holynski

    Abstract: 3D reconstruction methods such as Neural Radiance Fields (NeRFs) excel at rendering photorealistic novel views of complex scenes. However, recovering a high-quality NeRF typically requires tens to hundreds of input images, resulting in a time-consuming capture process. We present ReconFusion to reconstruct real-world scenes using only a few photos. Our approach leverages a diffusion prior for nove…

    Submitted 5 December, 2023; originally announced December 2023.

    Comments: Project page: https://reconfusion.github.io/

  38. arXiv:2311.15208  [pdf, other]

    cs.CL cs.AI

    LongStory: Coherent, Complete and Length Controlled Long story Generation

    Authors: Kyeongman Park, Nakyeong Yang, Kyomin Jung

    Abstract: A human author can write any length of story without losing coherence. Also, they always bring the story to a proper ending, an ability that current language models lack. In this work, we present the LongStory for coherent, complete, and length-controlled long story generation. LongStory introduces two novel methodologies: (1) the long and short-term contexts weight calibrator (CWC) and (2) long s…

    Submitted 26 November, 2023; originally announced November 2023.

  39. arXiv:2311.11412  [pdf, other]

    quant-ph cs.ET

    Neural Quantum Embedding: Pushing the Limits of Quantum Supervised Learning

    Authors: Tak Hur, Israel F. Araujo, Daniel K. Park

    Abstract: Quantum embedding is indispensable for applying quantum machine learning techniques to classical data, and has substantial impacts on performance outcomes. In this study, we present Neural Quantum Embedding (NQE), a method that efficiently optimizes quantum embedding by leveraging classical deep learning techniques. NQE enhances the lower bound of the empirical risk, leading to substantial improve…

    Submitted 19 November, 2023; originally announced November 2023.

    Comments: 13 pages, 7 figures

  40. arXiv:2311.03658  [pdf, other]

    cs.CL cs.AI cs.LG stat.ML

    The Linear Representation Hypothesis and the Geometry of Large Language Models

    Authors: Kiho Park, Yo Joong Choe, Victor Veitch

    Abstract: Informally, the 'linear representation hypothesis' is the idea that high-level concepts are represented linearly as directions in some representation space. In this paper, we address two closely related questions: What does "linear representation" actually mean? And, how do we make sense of geometric notions (e.g., cosine similarity or projection) in the representation space? To answer these, we u…

    Submitted 17 July, 2024; v1 submitted 6 November, 2023; originally announced November 2023.

    Comments: Accepted for a presentation at ICML 2024 and an oral presentation at NeurIPS 2023 Workshop on Causal Representation Learning. Code is available at https://github.com/KihoPark/linear_rep_geometry

  41. arXiv:2310.15464  [pdf, other]

    cs.CL

    Interpreting Answers to Yes-No Questions in User-Generated Content

    Authors: Shivam Mathur, Keun Hee Park, Dhivya Chinnappa, Saketh Kotamraju, Eduardo Blanco

    Abstract: Interpreting answers to yes-no questions in social media is difficult. Yes and no keywords are uncommon, and the few answers that include them are rarely to be interpreted what the keywords suggest. In this paper, we present a new corpus of 4,442 yes-no question-answer pairs from Twitter. We discuss linguistic characteristics of answers whose interpretation is yes or no, as well as answers whose i…

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: Accepted at the Findings of EMNLP 2023

  42. arXiv:2310.15439  [pdf, other]

    cs.CL cs.SI

    K-HATERS: A Hate Speech Detection Corpus in Korean with Target-Specific Ratings

    Authors: Chaewon Park, Soohwan Kim, Kyubyong Park, Kunwoo Park

    Abstract: Numerous datasets have been proposed to combat the spread of online hate. Despite these efforts, a majority of these resources are English-centric, primarily focusing on overt forms of hate. This research gap calls for developing high-quality corpora in diverse languages that also encapsulate more subtle hate expressions. This study introduces K-HATERS, a new corpus for hate speech detection in Ko…

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: 15 pages, EMNLP 2023 (Findings)

  43. arXiv:2310.14200  [pdf, ps, other]

    cs.IT

    Dynamic Resource Management in CDRT Systems through Adaptive NOMA

    Authors: Hongjiang Lei, Mingxu Yang, Ki-Hong Park, Nasir Saeed, Xusheng She, Jianling Cao

    Abstract: This paper introduces a novel adaptive transmission scheme to amplify the prowess of coordinated direct and relay transmission (CDRT) systems rooted in non-orthogonal multiple access principles. Leveraging the maximum ratio transmission scheme, we seamlessly meet the prerequisites of CDRT while harnessing the potential of dynamic power allocation and directional antennas to elevate the system's op…

    Submitted 22 October, 2023; originally announced October 2023.

    Comments: 11 pages, 7 figures, submitted to IEEE journal for review

  44. arXiv:2310.13931  [pdf, ps, other]

    cs.IT eess.SP

    Trajectory and power design for aerial CRNs with colluding eavesdroppers

    Authors: Hongjiang Lei, Jiacheng Jiang, Haosi Yang, Ki-Hong Park, Imran Shafique Ansari, Gaofeng Pan, Mohamed-Slim Alouini

    Abstract: Unmanned aerial vehicles (UAVs) can provide wireless access services to terrestrial users without geographical limitations and will become an essential part of the future communication system. However, the openness of wireless channels and the mobility of UAVs make the security of UAV-based communication systems particularly challenging. This work investigates the security of aerial cognitive radi…

    Submitted 21 October, 2023; originally announced October 2023.

    Comments: 10 pages, 7 figures. Submitted to the IEEE journal for review

  45. arXiv:2310.13290  [pdf, other]

    cs.CL

    Interpreting Indirect Answers to Yes-No Questions in Multiple Languages

    Authors: Zijie Wang, Md Mosharaf Hossain, Shivam Mathur, Terry Cruz Melo, Kadir Bulut Ozler, Keun Hee Park, Jacob Quintero, MohammadHossein Rezaei, Shreya Nupur Shakya, Md Nayem Uddin, Eduardo Blanco

    Abstract: Yes-no questions expect a yes or no for an answer, but people often skip polar keywords. Instead, they answer with long explanations that must be interpreted. In this paper, we focus on this challenging problem and release new benchmarks in eight languages. We present a distant supervision approach to collect training data. We also demonstrate that direct answers (i.e., with polar keywords) are us…

    Submitted 20 October, 2023; originally announced October 2023.

    Comments: Accepted to EMNLP 2023 Findings

  46. arXiv:2309.15433  [pdf, other]

    cs.DB

    Cardinality Estimation of Subgraph Matching: A Filtering-Sampling Approach

    Authors: Wonseok Shin, Siwoo Song, Kunsoo Park, Wook-Shin Han

    Abstract: Subgraph counting is a fundamental problem in understanding and analyzing graph structured data, yet computationally challenging. This calls for an accurate and efficient algorithm for Subgraph Cardinality Estimation, which is to estimate the number of all isomorphic embeddings of a query graph in a data graph. We present FaSTest, a novel algorithm that combines (1) a powerful filtering technique…

    Submitted 15 April, 2024; v1 submitted 27 September, 2023; originally announced September 2023.

  47. arXiv:2309.10278  [pdf, other]

    eess.SY cs.RO math.OC

    Parameter-Varying Koopman Operator for Nonlinear System Modeling and Control

    Authors: Changyu Lee, Kiyong Park, Jinwhan Kim

    Abstract: This paper proposes a novel approach for modeling and controlling nonlinear systems with varying parameters. The approach introduces the use of a parameter-varying Koopman operator (PVKO) in a lifted space, which provides an efficient way to understand system behavior and design control algorithms that account for underlying dynamics and changing parameters. The PVKO builds on a conventional Koopm…

    Submitted 18 September, 2023; originally announced September 2023.

    Comments: 62nd IEEE Conference on Decision and Control (CDC 2023)

  48. arXiv:2309.03227   

    cs.AI cs.CL cs.LG q-bio.QM

    Learning a Patent-Informed Biomedical Knowledge Graph Reveals Technological Potential of Drug Repositioning Candidates

    Authors: Yongseung Jegal, Jaewoong Choi, Jiho Lee, Ki-Su Park, Seyoung Lee, Janghyeok Yoon

    Abstract: Drug repositioning-a promising strategy for discovering new therapeutic uses for existing drugs-has been increasingly explored in the computational science literature using biomedical databases. However, the technological potential of drug repositioning candidates has often been overlooked. This study presents a novel protocol to comprehensively analyse various sources such as pharmaceutical paten…

    Submitted 24 July, 2024; v1 submitted 3 September, 2023; originally announced September 2023.

    Comments: We are sorry to withdraw this paper. We found some critical errors in the introduction and results sections. Specifically, we found that the first author has wrongly inserted citations on background works and made mistakes in the graph embedding methods, and the relevant results are wrongly calculated. In this regard, we tried to revise this paper and withdraw the current version. Thank you.

  49. MvFS: Multi-view Feature Selection for Recommender System

    Authors: Youngjune Lee, Yeongjong Jeong, Keunchan Park, SeongKu Kang

    Abstract: Feature selection, which is a technique to select key features in recommender systems, has received increasing research attention. Recently, Adaptive Feature Selection (AdaFS) has shown remarkable performance by adaptively selecting features for each data instance, considering that the importance of a given feature field can vary significantly across data. However, this method still has limitation…

    Submitted 6 September, 2023; v1 submitted 5 September, 2023; originally announced September 2023.

    Comments: CIKM 2023

  50. arXiv:2308.16659  [pdf, other]

    physics.ins-det cs.LG hep-ex physics.data-an

    Autoencoder-based Online Data Quality Monitoring for the CMS Electromagnetic Calorimeter

    Authors: Abhirami Harilal, Kyungmin Park, Michael Andrews, Manfred Paulini

    Abstract: The online Data Quality Monitoring system (DQM) of the CMS electromagnetic calorimeter (ECAL) is a crucial operational tool that allows ECAL experts to quickly identify, localize, and diagnose a broad range of detector issues that would otherwise hinder physics-quality data taking. Although the existing ECAL DQM system has been continuously updated to respond to new problems, it remains one step b…

    Submitted 31 August, 2023; originally announced August 2023.

    Comments: Submitted to the Proceedings of 21st International Workshop on Advanced Computing and Analysis Techniques in Physics Research ACAT 2022 conference