Showing 1–15 of 15 results for author: Byun, S

Searching in archive cs.
  1. arXiv:2406.13502  [pdf, other]

    cs.CL cs.SD eess.AS

    ManWav: The First Manchu ASR Model

    Authors: Jean Seo, Minha Kang, Sungjoo Byun, Sangah Lee

    Abstract: This study addresses the widening gap in Automatic Speech Recognition (ASR) research between high resource and extremely low resource languages, with a particular focus on Manchu, a critically endangered language. Manchu exemplifies the challenges faced by marginalized linguistic communities in accessing state-of-the-art technologies. In a pioneering effort, we introduce the first-ever Manchu ASR…

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: ACL2024/Field Matters

  2. arXiv:2403.16447  [pdf, ps, other]

    cs.CL

    A Study on How Attention Scores in the BERT Model are Aware of Lexical Categories in Syntactic and Semantic Tasks on the GLUE Benchmark

    Authors: Dongjun Jang, Sungjoo Byun, Hyopil Shin

    Abstract: This study examines whether the attention scores between tokens in the BERT model significantly vary based on lexical categories during the fine-tuning process for downstream tasks. Drawing inspiration from the notion that in human language processing, syntactic and semantic information is parsed differently, we categorize tokens in sentences according to their lexical categories and focus on chan…

    Submitted 25 March, 2024; originally announced March 2024.

  3. arXiv:2403.16444  [pdf, other]

    cs.CL

    KIT-19: A Comprehensive Korean Instruction Toolkit on 19 Tasks for Fine-Tuning Korean Large Language Models

    Authors: Dongjun Jang, Sungjoo Byun, Hyemi Jo, Hyopil Shin

    Abstract: Instruction Tuning on Large Language Models is an essential process for model to function well and achieve high performance in specific tasks. Accordingly, in mainstream languages such as English, instruction-based datasets are being constructed and made publicly available. In the case of Korean, publicly available models and datasets all rely on using the output of ChatGPT or translating datasets…

    Submitted 25 March, 2024; originally announced March 2024.

  4. arXiv:2403.16158  [pdf, other]

    cs.CL

    Korean Bio-Medical Corpus (KBMC) for Medical Named Entity Recognition

    Authors: Sungjoo Byun, Jiseung Hong, Sumin Park, Dongjun Jang, Jean Seo, Minseok Kim, Chaeyoung Oh, Hyopil Shin

    Abstract: Named Entity Recognition (NER) plays a pivotal role in medical Natural Language Processing (NLP). Yet, there has not been an open-source medical NER dataset specifically for the Korean language. To address this, we utilized ChatGPT to assist in constructing the KBMC (Korean Bio-Medical Corpus), which we are now presenting to the public. With the KBMC dataset, we noticed an impressive 20% increase…

    Submitted 24 March, 2024; originally announced March 2024.

    Journal ref: LREC-COLING 2024

  5. arXiv:2402.15046  [pdf, other]

    cs.CL

    CARBD-Ko: A Contextually Annotated Review Benchmark Dataset for Aspect-Level Sentiment Classification in Korean

    Authors: Dongjun Jang, Jean Seo, Sungjoo Byun, Taekyoung Kim, Minseok Kim, Hyopil Shin

    Abstract: This paper explores the challenges posed by aspect-based sentiment classification (ABSC) within pretrained language models (PLMs), with a particular focus on contextualization and hallucination issues. In order to tackle these challenges, we introduce CARBD-Ko (a Contextually Annotated Review Benchmark Dataset for Aspect-Based Sentiment Classification in Korean), a benchmark dataset that incorpora…

    Submitted 22 February, 2024; originally announced February 2024.

  6. arXiv:2402.05448  [pdf, other]

    cs.CV cs.AI cs.GR cs.LG cs.MM

    Minecraft-ify: Minecraft Style Image Generation with Text-guided Image Editing for In-Game Application

    Authors: Bumsoo Kim, Sanghyun Byun, Yonghoon Jung, Wonseop Shin, Sareer UI Amin, Sanghyun Seo

    Abstract: In this paper, we first present the character texture generation system Minecraft-ify, specified to Minecraft video game toward in-game application. Ours can generate face-focused image for texture mapping tailored to 3D virtual character having cube manifold. While existing projects or works only generate texture, proposed system can inverse the user-provided real image, or generate aver…

    Submitted 3 March, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

    Comments: 2 pages, 2 figures. Accepted as Spotlight to NeurIPS 2023 Workshop on Machine Learning for Creativity and Design

  7. arXiv:2311.18215  [pdf, other]

    cs.CL

    Automatic Construction of a Korean Toxic Instruction Dataset for Ethical Tuning of Large Language Models

    Authors: Sungjoo Byun, Dongjun Jang, Hyemi Jo, Hyopil Shin

    Abstract: Caution: this paper may include material that could be offensive or distressing. The advent of Large Language Models (LLMs) necessitates the development of training approaches that mitigate the generation of unethical language and aptly manage toxic user queries. Given the challenges related to human labor and the scarcity of data, we present KoTox, comprising 39K unethical instruction-output pa…

    Submitted 29 November, 2023; originally announced November 2023.

    Comments: NeurIPS 2023 Workshop on Instruction Tuning and Instruction Following

  8. arXiv:2311.17492  [pdf, other]

    cs.CL

    Mergen: The First Manchu-Korean Machine Translation Model Trained on Augmented Data

    Authors: Jean Seo, Sungjoo Byun, Minha Kang, Sangah Lee

    Abstract: The Manchu language, with its roots in the historical Manchurian region of Northeast China, is now facing a critical threat of extinction, as there are very few speakers left. In our efforts to safeguard the Manchu language, we introduce Mergen, the first-ever attempt at a Manchu-Korean Machine Translation (MT) model. To develop this model, we utilize valuable resources such as the Manwen Laodang(…

    Submitted 12 January, 2024; v1 submitted 29 November, 2023; originally announced November 2023.

    Comments: EMNLP 2023/MRL 2023

  9. arXiv:2311.13784  [pdf, other]

    cs.CL

    DaG LLM ver 1.0: Pioneering Instruction-Tuned Language Modeling for Korean NLP

    Authors: Dongjun Jang, Sangah Lee, Sungjoo Byun, Jinwoong Kim, Jean Seo, Minseok Kim, Soyeon Kim, Chaeyoung Oh, Jaeyoon Kim, Hyemi Jo, Hyopil Shin

    Abstract: This paper presents the DaG LLM (David and Goliath Large Language Model), a language model specialized for Korean and fine-tuned through Instruction Tuning across 41 tasks within 13 distinct categories.

    Submitted 22 November, 2023; originally announced November 2023.

  10. arXiv:2310.02588  [pdf, other]

    cs.CV cs.LG

    ViT-ReciproCAM: Gradient and Attention-Free Visual Explanations for Vision Transformer

    Authors: Seok-Yong Byun, Wonju Lee

    Abstract: This paper presents a novel approach to address the challenges of understanding the prediction process and debugging prediction errors in Vision Transformers (ViT), which have demonstrated superior performance in various computer vision tasks such as image classification and object detection. While several visual explainability techniques, such as CAM, Grad-CAM, Score-CAM, and Recipro-CAM, have be…

    Submitted 4 October, 2023; originally announced October 2023.

  11. arXiv:2209.14074  [pdf, other]

    cs.CV cs.LG

    Recipro-CAM: Fast gradient-free visual explanations for convolutional neural networks

    Authors: Seok-Yong Byun, Wonju Lee

    Abstract: The Convolutional Neural Network (CNN) is a widely used deep learning architecture for computer vision. However, its black box nature makes it difficult to interpret the behavior of the model. To mitigate this issue, AI practitioners have explored explainable AI methods like Class Activation Map (CAM) and Grad-CAM. Although these methods have shown promise, they are limited by architectural constr…

    Submitted 12 March, 2023; v1 submitted 28 September, 2022; originally announced September 2022.

  12. arXiv:2107.00191  [pdf, other]

    cs.LG cs.CV

    Unsupervised Model Drift Estimation with Batch Normalization Statistics for Dataset Shift Detection and Model Selection

    Authors: Wonju Lee, Seok-Yong Byun, Jooeun Kim, Minje Park, Kirill Chechil

    Abstract: While many real-world data streams imply that they change frequently in a nonstationary way, most of deep learning methods optimize neural networks on training data, and this leads to severe performance degradation when dataset shift happens. However, it is less possible to annotate or inspect newly streamed data by humans, and thus it is desired to measure model drift at inference time in an unsu…

    Submitted 30 June, 2021; originally announced July 2021.

    Comments: 11 pages, 5 figures, 2 tables

  13. arXiv:1904.10788  [pdf, other]

    eess.AS cs.AI cs.CL cs.LG cs.SD

    Speech Emotion Recognition Using Multi-hop Attention Mechanism

    Authors: Seunghyun Yoon, Seokhyun Byun, Subhadeep Dey, Kyomin Jung

    Abstract: In this paper, we are interested in exploiting textual and acoustic data of an utterance for the speech emotion classification task. The baseline approach models the information from audio and text independently using two deep neural networks (DNNs). The outputs from both the DNNs are then fused for classification. As opposed to using knowledge from both the modalities separately, we propose a fra…

    Submitted 9 May, 2019; v1 submitted 23 April, 2019; originally announced April 2019.

    Comments: 5 pages, Accepted as a conference paper at ICASSP 2019 (oral presentation)

  14. arXiv:1810.04635  [pdf, other]

    cs.CL

    Multimodal Speech Emotion Recognition Using Audio and Text

    Authors: Seunghyun Yoon, Seokhyun Byun, Kyomin Jung

    Abstract: Speech emotion recognition is a challenging task, and extensive reliance has been placed on models that use audio features in building well-performing classifiers. In this paper, we propose a novel deep dual recurrent encoder model that utilizes text data and audio signals simultaneously to obtain a better understanding of speech data. As emotional dialogue is composed of sound and spoken content,…

    Submitted 10 October, 2018; originally announced October 2018.

    Comments: 7 pages, Accepted as a conference paper at IEEE SLT 2018

  15. arXiv:cs/0609098  [pdf]

    cs.NI

    Reducing the Makespan in Hierarchical Reliable Multicast Tree

    Authors: Sang-Seon Byun, Chuck Yoo

    Abstract: In hierarchical reliable multicast environment, makespan is the time that is required to fully and successfully transmit a packet from the sender to all receivers. Low makespan is vital for achieving high throughput with a TCP-like window based sending scheme. In hierarchical reliable multicast methods, the number of repair servers and their locations influence the makespan. In this paper we pro…

    Submitted 17 September, 2006; originally announced September 2006.

    Comments: 26 pages, 9 figures, submitted to IEEE Transactions on Information Theory