Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 377 results for author: Oh, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.13278  [pdf, other

    cs.CR cs.LG

    Randomization Techniques to Mitigate the Risk of Copyright Infringement

    Authors: Wei-Ning Chen, Peter Kairouz, Sewoong Oh, Zheng Xu

    Abstract: In this paper, we investigate potential randomization approaches that can complement current practices of input-based methods (such as licensing data and prompt filtering) and output-based methods (such as recitation checker, license checker, and model-based similarity score) for copyright protection. This is motivated by the inherent ambiguity of the rules that determine substantial similarity in… ▽ More

    Submitted 21 August, 2024; originally announced August 2024.

  2. arXiv:2408.10830  [pdf, other

    cs.DC cs.ET

    Single Bridge Formation in Self-Organizing Particle Systems

    Authors: Joseph Briones, Jacob Calvert, Noah Egan, Shunhao Oh, Dana Randall, Andréa W. Richa

    Abstract: Local interactions of uncoordinated individuals produce the collective behaviors of many biological systems, inspiring much of the current research in programmable matter. A striking example is the spontaneous assembly of fire ants into "bridges" comprising their own bodies to traverse obstacles and reach sources of food. Experiments and simulations suggest that, remarkably, these ants always form… ▽ More

    Submitted 20 August, 2024; originally announced August 2024.

  3. arXiv:2408.09727  [pdf, other

    cs.RO

    Quantitative 3D Map Accuracy Evaluation Hardware and Algorithm for LiDAR(-Inertial) SLAM

    Authors: Sanghyun Hahn, Seunghun Oh, Minwoo Jung, Ayoung Kim, Sangwoo Jung

    Abstract: Accuracy evaluation of a 3D pointcloud map is crucial for the development of autonomous driving systems. In this work, we propose a user-independent software/hardware system that can quantitatively evaluate the accuracy of a 3D pointcloud map acquired from LiDAR(-Inertial) SLAM. We introduce a LiDAR target that functions robustly in the outdoor environment, while remaining observable by LiDAR. We… ▽ More

    Submitted 19 August, 2024; originally announced August 2024.

    Comments: ICCAS 2024 accepted, 5 pages, 6 figures, 2 Tables

  4. arXiv:2408.05353  [pdf, other

    cs.IR cs.LG

    IntentRec: Predicting User Session Intent with Hierarchical Multi-Task Learning

    Authors: Sejoon Oh, Moumita Bhattacharya, Yesu Feng, Sudarshan Lamkhede

    Abstract: Recommender systems have played a critical role in diverse digital services such as e-commerce, streaming media, social networks, etc. If we know what a user's intent is in a given session (e.g. do they want to watch short videos or a movie or play games; are they shopping for a camping trip), it becomes easier to provide high-quality recommendations. In this paper, we introduce IntentRec, a novel… ▽ More

    Submitted 25 July, 2024; originally announced August 2024.

  5. arXiv:2408.04614  [pdf, other

    cs.CL cs.AI cs.LG

    Better Alignment with Instruction Back-and-Forth Translation

    Authors: Thao Nguyen, Jeffrey Li, Sewoong Oh, Ludwig Schmidt, Jason Weston, Luke Zettlemoyer, Xian Li

    Abstract: We propose a new method, instruction back-and-forth translation, to construct high-quality synthetic data grounded in world knowledge for aligning large language models (LLMs). Given documents from a web corpus, we generate and curate synthetic instructions using the backtranslation approach proposed by Li et al.(2023a), and rewrite the responses to improve their quality further based on the initi… ▽ More

    Submitted 13 August, 2024; v1 submitted 8 August, 2024; originally announced August 2024.

  6. arXiv:2408.01040  [pdf, other

    cs.DC cs.CR cs.CV cs.LG

    Privacy-Preserving Split Learning with Vision Transformers using Patch-Wise Random and Noisy CutMix

    Authors: Seungeun Oh, Sihun Baek, Jihong Park, Hyelin Nam, Praneeth Vepakomma, Ramesh Raskar, Mehdi Bennis, Seong-Lyun Kim

    Abstract: In computer vision, the vision transformer (ViT) has increasingly superseded the convolutional neural network (CNN) for improved accuracy and robustness. However, ViT's large model sizes and high sample complexity make it difficult to train on resource-constrained edge devices. Split learning (SL) emerges as a viable solution, leveraging server-side resources to train ViTs while utilizing private… ▽ More

    Submitted 2 August, 2024; originally announced August 2024.

    Comments: 23 pages, 11 figures, 8 tables, to be published in Transactions on Machine Learning Research (TMLR)

  7. arXiv:2408.00312  [pdf, other

    cs.IR cs.CR cs.LG cs.SI

    Adversarial Text Rewriting for Text-aware Recommender Systems

    Authors: Sejoon Oh, Gaurav Verma, Srijan Kumar

    Abstract: Text-aware recommender systems incorporate rich textual features, such as titles and descriptions, to generate item recommendations for users. The use of textual features helps mitigate cold-start problems, and thus, such recommender systems have attracted increased attention. However, we argue that the dependency on item descriptions makes the recommender system vulnerable to manipulation by adve… ▽ More

    Submitted 1 August, 2024; originally announced August 2024.

    Comments: Accepted for publication at: 33rd ACM International Conference on Information and Knowledge Management (CIKM 2024). Code and data at: https://github.com/sejoonoh/ATR

  8. arXiv:2407.16607  [pdf, other

    cs.CL cs.LG

    Data Mixture Inference: What do BPE Tokenizers Reveal about their Training Data?

    Authors: Jonathan Hayase, Alisa Liu, Yejin Choi, Sewoong Oh, Noah A. Smith

    Abstract: The pretraining data of today's strongest language models is opaque; in particular, little is known about the proportions of various domains or languages represented. In this work, we tackle a task which we call data mixture inference, which aims to uncover the distributional make-up of training data. We introduce a novel attack based on a previously overlooked source of information: byte-pair enc… ▽ More

    Submitted 5 September, 2024; v1 submitted 23 July, 2024; originally announced July 2024.

    Comments: new robustness experiments; new baselines; include Mistral, Mistral-Nemo and GPT-NeoX; link to code

  9. arXiv:2407.04600  [pdf, other

    cs.LG stat.ML

    Understanding the Gains from Repeated Self-Distillation

    Authors: Divyansh Pareek, Simon S. Du, Sewoong Oh

    Abstract: Self-Distillation is a special type of knowledge distillation where the student model has the same architecture as the teacher model. Despite using the same architecture and the same training data, self-distillation has been empirically observed to improve performance, especially when applied repeatedly. For such a process, there is a fundamental question of interest: How much gain is possible by… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: 31 pages, 10 figures

  10. arXiv:2407.02447  [pdf, other

    cs.LG

    PLeaS -- Merging Models with Permutations and Least Squares

    Authors: Anshul Nasery, Jonathan Hayase, Pang Wei Koh, Sewoong Oh

    Abstract: The democratization of machine learning systems has made the process of fine-tuning accessible to a large number of practitioners, leading to a wide range of open-source models fine-tuned on specialized tasks and datasets. Recent work has proposed to merge such models to combine their functionalities. However, prior approaches are restricted to models that are fine-tuned from the same base model.… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

  11. arXiv:2407.02245  [pdf, other

    cs.RO cs.AI

    Safe CoR: A Dual-Expert Approach to Integrating Imitation Learning and Safe Reinforcement Learning Using Constraint Rewards

    Authors: Hyeokjin Kwon, Gunmin Lee, Junseo Lee, Songhwai Oh

    Abstract: In the realm of autonomous agents, ensuring safety and reliability in complex and dynamic environments remains a paramount challenge. Safe reinforcement learning addresses these concerns by introducing safety constraints, but still faces challenges in navigating intricate environments such as complex driving situations. To overcome these challenges, we present the safe constraint reward (Safe CoR)… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: Accepted to the Proc. of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2024

  12. arXiv:2407.01540  [pdf, other

    cs.NI

    Towards a Partial Computation offloading in In-networking Computing-Assisted MEC: A Digital Twin Approach

    Authors: Ibrahim Aliyu, Awwal Arigi, Seungmin Oh, Tai-Won Um, Jinsul Kim

    Abstract: This paper addresses the problem of minimizing latency with partial computation offloading within Industrial Internet-of-Things (IoT) systems in in-network computing (COIN)-assisted Multiaccess Edge Computing (C-MEC) via ultra-reliable and low latency communications (URLLC) links. We propose a digital twin (DT) scheme for a multiuser scenario, allowing collaborative partial task offloading from us… ▽ More

    Submitted 8 April, 2024; originally announced July 2024.

    Comments: 9 pages, 3 figures

  13. arXiv:2406.11794  [pdf, other

    cs.LG cs.CL

    DataComp-LM: In search of the next generation of training sets for language models

    Authors: Jeffrey Li, Alex Fang, Georgios Smyrnis, Maor Ivgi, Matt Jordan, Samir Gadre, Hritik Bansal, Etash Guha, Sedrick Keh, Kushal Arora, Saurabh Garg, Rui Xin, Niklas Muennighoff, Reinhard Heckel, Jean Mercat, Mayee Chen, Suchin Gururangan, Mitchell Wortsman, Alon Albalak, Yonatan Bitton, Marianna Nezhurina, Amro Abbas, Cheng-Yu Hsieh, Dhruba Ghosh, Josh Gardner , et al. (34 additional authors not shown)

    Abstract: We introduce DataComp for Language Models (DCLM), a testbed for controlled dataset experiments with the goal of improving language models. As part of DCLM, we provide a standardized corpus of 240T tokens extracted from Common Crawl, effective pretraining recipes based on the OpenLM framework, and a broad suite of 53 downstream evaluations. Participants in the DCLM benchmark can experiment with dat… ▽ More

    Submitted 20 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: Project page: https://www.datacomp.ai/dclm/

  14. arXiv:2406.08527  [pdf, other

    cs.LG cs.AI

    Optimized Feature Generation for Tabular Data via LLMs with Decision Tree Reasoning

    Authors: Jaehyun Nam, Kyuyoung Kim, Seunghyuk Oh, Jihoon Tack, Jaehyung Kim, Jinwoo Shin

    Abstract: Learning effective representations from raw data is crucial for the success of deep learning methods. However, in the tabular domain, practitioners often prefer augmenting raw column features over using learned representations, as conventional tree-based algorithms frequently outperform competing approaches. As a result, feature engineering methods that automatically generate candidate features ha… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 18 pages

  15. arXiv:2406.06009  [pdf

    cs.DL cs.AI cs.CY

    The Impact of AI on Academic Research and Publishing

    Authors: Brady Lund, Manika Lamba, Sang Hoo Oh

    Abstract: Generative artificial intelligence (AI) technologies like ChatGPT, have significantly impacted academic writing and publishing through their ability to generate content at levels comparable to or surpassing human writers. Through a review of recent interdisciplinary literature, this paper examines ethical considerations surrounding the integration of AI into academia, focusing on the potential for… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  16. arXiv:2406.03665  [pdf, other

    cs.LG cs.AI

    Towards Dynamic Trend Filtering through Trend Point Detection with Reinforcement Learning

    Authors: Jihyeon Seong, Sekwang Oh, Jaesik Choi

    Abstract: Trend filtering simplifies complex time series data by applying smoothness to filter out noise while emphasizing proximity to the original data. However, existing trend filtering methods fail to reflect abrupt changes in the trend due to `approximateness,' resulting in constant smoothness. This approximateness uniformly filters out the tail distribution of time series data, characterized by extrem… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 18 pages, 11 figures

    Journal ref: IJCAI 2024

  17. arXiv:2405.18698  [pdf, other

    cs.LG cs.AI

    Spectral-Risk Safe Reinforcement Learning with Convergence Guarantees

    Authors: Dohyeong Kim, Taehyun Cho, Seungyub Han, Hojun Chung, Kyungjae Lee, Songhwai Oh

    Abstract: The field of risk-constrained reinforcement learning (RCRL) has been developed to effectively reduce the likelihood of worst-case scenarios by explicitly handling risk-measure-based constraints. However, the nonlinearity of risk measures makes it challenging to achieve convergence and optimality. To overcome the difficulties posed by the nonlinearity, we propose a spectral risk measure-constrained… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: 26 pages

  18. arXiv:2405.16915  [pdf, other

    cs.CV cs.LG

    Multilingual Diversity Improves Vision-Language Representations

    Authors: Thao Nguyen, Matthew Wallingford, Sebastin Santy, Wei-Chiu Ma, Sewoong Oh, Ludwig Schmidt, Pang Wei Koh, Ranjay Krishna

    Abstract: Massive web-crawled image-text datasets lay the foundation for recent progress in multimodal learning. These datasets are designed with the goal of training a model to do well on standard computer vision benchmarks, many of which, however, have been shown to be English-centric (e.g., ImageNet). Consequently, existing data curation techniques gravitate towards using predominantly English image-text… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  19. arXiv:2405.15640  [pdf, other

    cs.CL cs.AI

    GECKO: Generative Language Model for English, Code and Korean

    Authors: Sungwoo Oh, Donggyu Kim

    Abstract: We introduce GECKO, a bilingual large language model (LLM) optimized for Korean and English, along with programming languages. GECKO is pretrained on the balanced, high-quality corpus of Korean and English employing LLaMA architecture. In this report, we share the experiences of several efforts to build a better data pipeline for the corpus and to train our model. GECKO shows great efficiency in t… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  20. arXiv:2405.13065  [pdf, other

    cs.HC cs.AI cs.CY

    Exploring Teachers' Perception of Artificial Intelligence: The Socio-emotional Deficiency as Opportunities and Challenges in Human-AI Complementarity in K-12 Education

    Authors: Soon-young Oh, Yongsu Ahn

    Abstract: In schools, teachers play a multitude of roles, serving as educators, counselors, decision-makers, and members of the school community. With recent advances in artificial intelligence (AI), there is increasing discussion about how AI can assist, complement, and collaborate with teachers. To pave the way for better teacher-AI complementary relationships in schools, our study aims to expand the disc… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

  21. arXiv:2405.05175  [pdf, other

    cs.CR cs.CL cs.LG

    Air Gap: Protecting Privacy-Conscious Conversational Agents

    Authors: Eugene Bagdasaryan, Ren Yi, Sahra Ghalebikesabi, Peter Kairouz, Marco Gruteser, Sewoong Oh, Borja Balle, Daniel Ramage

    Abstract: The growing use of large language model (LLM)-based conversational agents to manage sensitive user data raises significant privacy concerns. While these agents excel at understanding and acting on context, this capability can be exploited by malicious actors. We introduce a novel threat model where adversarial third-party apps manipulate the context of interaction to trick LLM-based agents into re… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

  22. arXiv:2405.02341  [pdf, other

    cs.CR cs.LG

    Improved Communication-Privacy Trade-offs in $L_2$ Mean Estimation under Streaming Differential Privacy

    Authors: Wei-Ning Chen, Berivan Isik, Peter Kairouz, Albert No, Sewoong Oh, Zheng Xu

    Abstract: We study $L_2$ mean estimation under central differential privacy and communication constraints, and address two key challenges: firstly, existing mean estimation schemes that simultaneously handle both constraints are usually optimized for $L_\infty$ geometry and rely on random rotation or Kashin's representation to adapt to $L_2$ geometry, resulting in suboptimal leading constants in mean square… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

  23. arXiv:2404.16035  [pdf, other

    cs.CV cs.AI

    MaGGIe: Masked Guided Gradual Human Instance Matting

    Authors: Chuong Huynh, Seoung Wug Oh, Abhinav Shrivastava, Joon-Young Lee

    Abstract: Human matting is a foundation task in image and video processing, where human foreground pixels are extracted from the input. Prior works either improve the accuracy by additional guidance or improve the temporal consistency of a single instance across frames. We propose a new framework MaGGIe, Masked Guided Gradual Human Instance Matting, which predicts alpha mattes progressively for each human i… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: CVPR 2024. Project link: https://maggie-matt.github.io

  24. arXiv:2404.16032  [pdf, other

    cs.LG

    Studying Large Language Model Behaviors Under Realistic Knowledge Conflicts

    Authors: Evgenii Kortukov, Alexander Rubinstein, Elisa Nguyen, Seong Joon Oh

    Abstract: Retrieval-augmented generation (RAG) mitigates many problems of fully parametric language models, such as temporal degradation, hallucinations, and lack of grounding. In RAG, the model's knowledge can be updated from documents provided in context. This leads to cases of conflict between the model's parametric knowledge and the contextual information, where the model may not always update its knowl… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

  25. arXiv:2404.15409  [pdf, ps, other

    cs.LG cs.CR stat.ML

    Insufficient Statistics Perturbation: Stable Estimators for Private Least Squares

    Authors: Gavin Brown, Jonathan Hayase, Samuel Hopkins, Weihao Kong, Xiyang Liu, Sewoong Oh, Juan C. Perdomo, Adam Smith

    Abstract: We present a sample- and time-efficient differentially private algorithm for ordinary least squares, with error that depends linearly on the dimension and is independent of the condition number of $X^\top X$, where $X$ is the design matrix. All prior private algorithms for this task require either $d^{3/2}$ examples, error growing polynomially with the condition number, or exponential time. Our ne… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: 42 pages, 3 figures

  26. Minimum Description Feature Selection for Complexity Reduction in Machine Learning-based Wireless Positioning

    Authors: Myeung Suk Oh, Anindya Bijoy Das, Taejoon Kim, David J. Love, Christopher G. Brinton

    Abstract: Recently, deep learning approaches have provided solutions to difficult problems in wireless positioning (WP). Although these WP algorithms have attained excellent and consistent performance against complex channel environments, the computational complexity coming from processing high-dimensional features can be prohibitive for mobile applications. In this work, we design a novel positioning neura… ▽ More

    Submitted 18 August, 2024; v1 submitted 21 April, 2024; originally announced April 2024.

    Comments: This paper has been accepted for the publication in IEEE Journal on Selected Areas in Communications. arXiv admin note: text overlap with arXiv:2402.09580

  27. arXiv:2404.10308  [pdf, other

    cs.LG cs.AI

    Hierarchical Context Merging: Better Long Context Understanding for Pre-trained LLMs

    Authors: Woomin Song, Seunghyuk Oh, Sangwoo Mo, Jaehyung Kim, Sukmin Yun, Jung-Woo Ha, Jinwoo Shin

    Abstract: Large language models (LLMs) have shown remarkable performance in various natural language processing tasks. However, a primary constraint they face is the context limit, i.e., the maximum number of tokens they can process. Previous works have explored architectural changes and modifications in positional encoding to relax the constraint, but they often require expensive training or do not address… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: Accepted to ICLR 2024. The first two authors contributed equally

  28. arXiv:2404.05767  [pdf, other

    cs.SE cs.AI

    CSA-Trans: Code Structure Aware Transformer for AST

    Authors: Saeyoon Oh, Shin Yoo

    Abstract: When applying the Transformer architecture to source code, designing a good self-attention mechanism is critical as it affects how node relationship is extracted from the Abstract Syntax Trees (ASTs) of the source code. We present Code Structure Aware Transformer (CSA-Trans), which uses Code Structure Embedder (CSE) to generate specific PE for each node in AST. CSE generates node Positional Encodi… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

  29. arXiv:2404.04913  [pdf, other

    cs.CV

    CodecNeRF: Toward Fast Encoding and Decoding, Compact, and High-quality Novel-view Synthesis

    Authors: Gyeongjin Kang, Younggeun Lee, Seungjun Oh, Eunbyung Park

    Abstract: Neural Radiance Fields (NeRF) have achieved huge success in effectively capturing and representing 3D objects and scenes. However, several factors have impeded its further proliferation as next-generation 3D media. To establish a ubiquitous presence in everyday media formats, such as images and videos, it is imperative to devise a solution that effectively fulfills three key objectives: fast encod… ▽ More

    Submitted 28 May, 2024; v1 submitted 7 April, 2024; originally announced April 2024.

    Comments: Project page: https://gynjn.github.io/Codec-NeRF/

  30. arXiv:2404.04096  [pdf, other

    cs.IT eess.SP

    Machine Learning-Aided Cooperative Localization under Dense Urban Environment

    Authors: Hoon Lee, Hong Ki Kim, Seung Hyun Oh, Sang Hyun Lee

    Abstract: Future wireless network technology provides automobiles with the connectivity feature to consolidate the concept of vehicular networks that collaborate on conducting cooperative driving tasks. The full potential of connected vehicles, which promises road safety and quality driving experience, can be leveraged if machine learning models guarantee the robustness in performing core functions includin… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

  31. arXiv:2404.01954  [pdf, other

    cs.CL cs.AI

    HyperCLOVA X Technical Report

    Authors: Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seongjin Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han , et al. (371 additional authors not shown)

    Abstract: We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t… ▽ More

    Submitted 13 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 44 pages; updated authors list and fixed author names

  32. arXiv:2404.00060  [pdf, other

    q-fin.ST cs.AI cs.LG

    Temporal Graph Networks for Graph Anomaly Detection in Financial Networks

    Authors: Yejin Kim, Youngbin Lee, Minyoung Choe, Sungju Oh, Yongjae Lee

    Abstract: This paper explores the utilization of Temporal Graph Networks (TGN) for financial anomaly detection, a pressing need in the era of fintech and digitized financial transactions. We present a comprehensive framework that leverages TGN, capable of capturing dynamic changes in edges within financial networks, for fraud detection. Our study compares TGN's performance against static Graph Neural Networ… ▽ More

    Submitted 27 March, 2024; originally announced April 2024.

    Comments: Presented at the AAAI 2024 Workshop on AI in Finance for Social Impact (https://sites.google.com/view/aifin-aaai2024)

  33. arXiv:2403.16509  [pdf, other

    cs.LG

    Human Understanding AI Paper Challenge 2024 -- Dataset Design

    Authors: Se Won Oh, Hyuntae Jeong, Jeong Mook Lim, Seungeun Chung, Kyoung Ju Noh

    Abstract: In 2024, we will hold a research paper competition (the third Human Understanding AI Paper Challenge) for the research and development of artificial intelligence technologies to understand human daily life. This document introduces the datasets that will be provided to participants in the competition, and summarizes the issues to consider in data processing and learning model development.

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: 7 pages, 3 figures

    ACM Class: J.7; E.m

  34. DaCapo: Accelerating Continuous Learning in Autonomous Systems for Video Analytics

    Authors: Yoonsung Kim, Changhun Oh, Jinwoo Hwang, Wonung Kim, Seongryong Oh, Yubin Lee, Hardik Sharma, Amir Yazdanbakhsh, Jongse Park

    Abstract: Deep neural network (DNN) video analytics is crucial for autonomous systems such as self-driving vehicles, unmanned aerial vehicles (UAVs), and security robots. However, real-world deployment faces challenges due to their limited computational resources and battery power. To tackle these challenges, continuous learning exploits a lightweight "student" model at deployment (inference), leverages a l… ▽ More

    Submitted 16 July, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

    Journal ref: ISCA 2024

  35. arXiv:2403.07968  [pdf, other

    cs.LG cs.AI

    Do Deep Neural Network Solutions Form a Star Domain?

    Authors: Ankit Sonthalia, Alexander Rubinstein, Ehsan Abbasnejad, Seong Joon Oh

    Abstract: It has recently been conjectured that neural network solution sets reachable via stochastic gradient descent (SGD) are convex, considering permutation invariances (Entezari et al., 2022). This means that a linear path can connect two independent solutions with low loss, given the weights of one of the models are appropriately permuted. However, current methods to test this theory often require ver… ▽ More

    Submitted 9 June, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

  36. arXiv:2403.05973  [pdf, other

    cs.CL cs.AI cs.LG

    Calibrating Large Language Models Using Their Generations Only

    Authors: Dennis Ulmer, Martin Gubri, Hwaran Lee, Sangdoo Yun, Seong Joon Oh

    Abstract: As large language models (LLMs) are increasingly deployed in user-facing applications, building trust and maintaining safety by accurately quantifying a model's confidence in its prediction becomes even more important. However, finding effective ways to calibrate LLMs - especially when the only interface to the models is their generated text - remains a challenge. We propose APRICOT (auxiliary pre… ▽ More

    Submitted 9 March, 2024; originally announced March 2024.

  37. arXiv:2403.00282  [pdf, other

    cs.LG

    Conflict-Averse Gradient Aggregation for Constrained Multi-Objective Reinforcement Learning

    Authors: Dohyeong Kim, Mineui Hong, Jeongho Park, Songhwai Oh

    Abstract: In many real-world applications, a reinforcement learning (RL) agent should consider multiple objectives and adhere to safety guidelines. To address these considerations, we propose a constrained multi-objective RL algorithm named Constrained Multi-Objective Gradient Aggregator (CoMOGA). In the field of multi-objective optimization, managing conflicts between the gradients of the multiple objectiv… ▽ More

    Submitted 31 May, 2024; v1 submitted 29 February, 2024; originally announced March 2024.

    Comments: 25 pages

  38. arXiv:2402.19460  [pdf, other

    cs.LG stat.ML

    Benchmarking Uncertainty Disentanglement: Specialized Uncertainties for Specialized Tasks

    Authors: Bálint Mucsányi, Michael Kirchhof, Seong Joon Oh

    Abstract: Uncertainty quantification, once a singular task, has evolved into a spectrum of tasks, including abstained prediction, out-of-distribution detection, and aleatoric uncertainty quantification. The latest goal is disentanglement: the construction of multiple estimators that are each tailored to one and only one task. Hence, there is a plethora of recent advances with different intentions - that oft… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

    Comments: 43 pages

  39. arXiv:2402.18905  [pdf, other

    cs.LG cs.AI cs.CR math.OC

    On the Convergence of Differentially-Private Fine-tuning: To Linearly Probe or to Fully Fine-tune?

    Authors: Shuqi Ke, Charlie Hou, Giulia Fanti, Sewoong Oh

    Abstract: Differentially private (DP) machine learning pipelines typically involve a two-phase process: non-private pre-training on a public dataset, followed by fine-tuning on private data using DP optimization techniques. In the DP setting, it has been observed that full fine-tuning may not always yield the best test accuracy, even for in-distribution data. This paper (1) analyzes the training dynamics of… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

  40. arXiv:2402.17127  [pdf, other

    cs.SD eess.AS

    Experimental Study: Enhancing Voice Spoofing Detection Models with wav2vec 2.0

    Authors: Taein Kang, Soyul Han, Sunmook Choi, Jaejin Seo, Sanghyeok Chung, Seungeun Lee, Seungsang Oh, Il-Youp Kwak

    Abstract: Conventional spoofing detection systems have heavily relied on the use of handcrafted features derived from speech data. However, a notable shift has recently emerged towards the direct utilization of raw speech waveforms, as demonstrated by methods like SincNet filters. This shift underscores the demand for more sophisticated audio sample features. Moreover, the success of deep learning models, p… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

    Comments: 5 pages

    MSC Class: 00A71 ACM Class: I.2.6

  41. arXiv:2402.16832  [pdf, other

    cs.CL cs.AI cs.CV

    Cross-Modal Projection in Multimodal LLMs Doesn't Really Project Visual Attributes to Textual Space

    Authors: Gaurav Verma, Minje Choi, Kartik Sharma, Jamelle Watson-Daniels, Sejoon Oh, Srijan Kumar

    Abstract: Multimodal large language models (MLLMs) like LLaVA and GPT-4(V) enable general-purpose conversations about images with the language modality. As off-the-shelf MLLMs may have limited capabilities on images from domains like dermatology and agriculture, they must be fine-tuned to unlock domain-specific applications. The prevalent architecture of current open-source MLLMs comprises two major modules… ▽ More

    Submitted 21 July, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

    Comments: Accepted at ACL 2024 (Main, Short)

  42. arXiv:2402.16569  [pdf, other

    cs.CV cs.LG

    Pretrained Visual Uncertainties

    Authors: Michael Kirchhof, Mark Collier, Seong Joon Oh, Enkelejda Kasneci

    Abstract: Accurate uncertainty estimation is vital to trustworthy machine learning, yet uncertainties typically have to be learned for each task anew. This work introduces the first pretrained uncertainty modules for vision models. Similar to standard pretraining this enables the zero-shot transfer of uncertainties learned on a large pretraining dataset to specialized downstream datasets. We enable our larg… ▽ More

    Submitted 27 February, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

  43. arXiv:2402.15962  [pdf

    cs.LG

    Hierarchical energy signatures using machine learning for operational visibility and diagnostics in automotive manufacturing

    Authors: Ankur Verma, Seog-Chan Oh, Jorge Arinez, Soundar Kumara

    Abstract: Manufacturing energy consumption data contains important process signatures required for operational visibility and diagnostics. These signatures may be of different temporal scales, ranging from monthly to sub-second resolutions. We introduce a hierarchical machine learning approach to identify automotive process signatures from paint shop electricity consumption data at varying temporal scales (… ▽ More

    Submitted 24 February, 2024; originally announced February 2024.

    Comments: 5 pages, 4 figures

  44. arXiv:2402.13781  [pdf, other

    cs.LG cs.DC

    Preserving Near-Optimal Gradient Sparsification Cost for Scalable Distributed Deep Learning

    Authors: Daegun Yoon, Sangyoon Oh

    Abstract: Communication overhead is a major obstacle to scaling distributed training systems. Gradient sparsification is a potential optimization approach to reduce the communication volume without significant loss of model fidelity. However, existing gradient sparsification methods have low scalability owing to inefficient design of their algorithms, which raises the communication overhead significantly. I… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: 24th IEEE/ACM International Symposium on Cluster, Cloud, and Internet Computing (CCGrid 2024). Code: https://github.com/kljp/exdyna

  45. arXiv:2402.13659  [pdf, other

    cs.CR cs.CL

    Privacy-Preserving Instructions for Aligning Large Language Models

    Authors: Da Yu, Peter Kairouz, Sewoong Oh, Zheng Xu

    Abstract: Service providers of large language model (LLM) applications collect user instructions in the wild and use them in further aligning LLMs with users' intentions. These instructions, which potentially contain sensitive information, are annotated by human workers in the process. This poses a new privacy risk not addressed by the typical private optimization. To this end, we propose using synthetic in… ▽ More

    Submitted 2 July, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

    Comments: ICML 2024. Code available at https://github.com/google-research/google-research/tree/master/dp_instructions

  46. arXiv:2402.12991  [pdf, other

    cs.LG cs.AI cs.CL cs.CR

    TRAP: Targeted Random Adversarial Prompt Honeypot for Black-Box Identification

    Authors: Martin Gubri, Dennis Ulmer, Hwaran Lee, Sangdoo Yun, Seong Joon Oh

    Abstract: Large Language Model (LLM) services and models often come with legal rules on who can use them and how they must use them. Assessing the compliance of the released LLMs is crucial, as these rules protect the interests of the LLM contributor and prevent misuse. In this context, we describe the novel fingerprinting problem of Black-box Identity Verification (BBIV). The goal is to determine whether a… ▽ More

    Submitted 6 June, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

    Comments: Accepted at ACL 2024 (findings)

  47. arXiv:2402.09580  [pdf, other

    cs.LG eess.SP

    Complexity Reduction in Machine Learning-Based Wireless Positioning: Minimum Description Features

    Authors: Myeung Suk Oh, Anindya Bijoy Das, Taejoon Kim, David J. Love, Christopher G. Brinton

    Abstract: A recent line of research has been investigating deep learning approaches to wireless positioning (WP). Although these WP algorithms have demonstrated high accuracy and robust performance against diverse channel conditions, they also have a major drawback: they require processing high-dimensional features, which can be prohibitive for mobile applications. In this work, we design a positioning neur… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

    Comments: This paper has been accepted in IEEE International Conference on Communications (ICC) 2024

  48. arXiv:2402.08864  [pdf, other

    cs.IT cs.LG

    DeepPolar: Inventing Nonlinear Large-Kernel Polar Codes via Deep Learning

    Authors: S Ashwin Hebbar, Sravan Kumar Ankireddy, Hyeji Kim, Sewoong Oh, Pramod Viswanath

    Abstract: Progress in designing channel codes has been driven by human ingenuity and, fittingly, has been sporadic. Polar codes, developed on the foundation of Arikan's polarization kernel, represent the latest breakthrough in coding theory and have emerged as the state-of-the-art error-correction code for short-to-medium block length regimes. In an effort to automate the invention of good channel codes, es… ▽ More

    Submitted 4 June, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

    Comments: 22 pages, 24 figures

  49. arXiv:2402.03481  [pdf, other

    cs.IR cs.LG cs.SI

    FINEST: Stabilizing Recommendations by Rank-Preserving Fine-Tuning

    Authors: Sejoon Oh, Berk Ustun, Julian McAuley, Srijan Kumar

    Abstract: Modern recommender systems may output considerably different recommendations due to small perturbations in the training data. Changes in the data from a single user will alter the recommendations as well as the recommendations of other users. In applications like healthcare, housing, and finance, this sensitivity can have adverse effects on user experience. We propose a method to stabilize a given… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: Accepted at the 6th FAccTRec Workshop on Responsible Recommendation @ ACM RecSys 2023

  50. arXiv:2401.08178  [pdf, other

    cs.CV

    Key-point Guided Deformable Image Manipulation Using Diffusion Model

    Authors: Seok-Hwan Oh, Guil Jung, Myeong-Gee Kim, Sang-Yun Kim, Young-Min Kim, Hyeon-Jik Lee, Hyuk-Sool Kwon, Hyeon-Min Bae

    Abstract: In this paper, we introduce a Key-point-guided Diffusion probabilistic Model (KDM) that gains precise control over images by manipulating the object's key-point. We propose a two-stage generative model incorporating an optical flow map as an intermediate output. By doing so, a dense pixel-wise understanding of the semantic relation between the image and sparse key point is configured, leading to m… ▽ More

    Submitted 18 March, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

    Comments: 24 pages