
Showing 1–50 of 136 results for author: Chung, Y

Searching in archive cs.
  1. arXiv:2406.18375  [pdf, other]

    cs.CV

    From Majority to Minority: A Diffusion-based Augmentation for Underrepresented Groups in Skin Lesion Analysis

    Authors: Janet Wang, Yunsung Chung, Zhengming Ding, Jihun Hamm

    Abstract: AI-based diagnoses have demonstrated dermatologist-level performance in classifying skin cancer. However, such systems are prone to under-performing when tested on data from minority groups that lack sufficient representation in the training sets. Although data collection and annotation offer the best means for promoting minority groups, these processes are costly and time-consuming. Prior works h…

    Submitted 26 June, 2024; originally announced June 2024.

  2. arXiv:2405.13954  [pdf, other]

    cs.LG cs.AI cs.CL

    What is Your Data Worth to GPT? LLM-Scale Data Valuation with Influence Functions

    Authors: Sang Keun Choe, Hwijeen Ahn, Juhan Bae, Kewen Zhao, Minsoo Kang, Youngseog Chung, Adithya Pratapa, Willie Neiswanger, Emma Strubell, Teruko Mitamura, Jeff Schneider, Eduard Hovy, Roger Grosse, Eric Xing

    Abstract: Large language models (LLMs) are trained on a vast amount of human-written data, but data providers often remain uncredited. In response to this issue, data valuation (or data attribution), which quantifies the contribution or value of each data to the model output, has been discussed as a potential solution. Nevertheless, applying existing data valuation methods to recent LLMs and their vast trai…

    Submitted 22 May, 2024; originally announced May 2024.

  3. arXiv:2405.11703  [pdf, other]

    cs.LG

    QComp: A QSAR-Based Data Completion Framework for Drug Discovery

    Authors: Bingjia Yang, Yunsie Chung, Archer Y. Yang, Bo Yuan, Xiang Yu

    Abstract: In drug discovery, in vitro and in vivo experiments reveal biochemical activities related to the efficacy and toxicity of compounds. The experimental data accumulate into massive, ever-evolving, and sparse datasets. Quantitative Structure-Activity Relationship (QSAR) models, which predict biochemical activities using only the structural information of compounds, face challenges in integrating the…

    Submitted 19 May, 2024; originally announced May 2024.

  4. arXiv:2405.10345  [pdf, other]

    q-bio.QM cs.AI cs.LG

    Machine Learning Driven Biomarker Selection for Medical Diagnosis

    Authors: Divyagna Bavikadi, Ayushi Agarwal, Shashank Ganta, Yunro Chung, Lusheng Song, Ji Qiu, Paulo Shakarian

    Abstract: Recent advances in experimental methods have enabled researchers to collect data on thousands of analytes simultaneously. This has led to correlational studies that associated molecular measurements with diseases such as Alzheimer's, Liver, and Gastric Cancer. However, the use of thousands of biomarkers selected from the analytes is not practical for real-world medical diagnosis and is likely unde…

    Submitted 15 May, 2024; originally announced May 2024.

  5. arXiv:2405.05581  [pdf, other]

    cs.HC cs.AI cs.CL

    One vs. Many: Comprehending Accurate Information from Multiple Erroneous and Inconsistent AI Generations

    Authors: Yoonjoo Lee, Kihoon Son, Tae Soo Kim, Jisu Kim, John Joon Young Chung, Eytan Adar, Juho Kim

    Abstract: As Large Language Models (LLMs) are nondeterministic, the same input can generate different outputs, some of which may be incorrect or hallucinated. If run again, the LLM may correct itself and produce the correct answer. Unfortunately, most LLM-powered systems resort to single results which, correct or not, users accept. Having the LLM produce multiple outputs may help identify disagreements or a…

    Submitted 9 May, 2024; originally announced May 2024.

    Comments: Accepted to FAccT 2024

  6. arXiv:2404.12416  [pdf, other]

    physics.plasm-ph cs.LG

    Full Shot Predictions for the DIII-D Tokamak via Deep Recurrent Networks

    Authors: Ian Char, Youngseog Chung, Joseph Abbate, Egemen Kolemen, Jeff Schneider

    Abstract: Although tokamaks are one of the most promising devices for realizing nuclear fusion as an energy source, there are still key obstacles when it comes to understanding the dynamics of the plasma and controlling it. As such, it is crucial that high quality models are developed to assist in overcoming these obstacles. In this work, we take an entirely data driven approach to learn such a model. In pa…

    Submitted 17 April, 2024; originally announced April 2024.

  7. arXiv:2403.20103  [pdf, other]

    cs.CL

    NLP for Counterspeech against Hate: A Survey and How-To Guide

    Authors: Helena Bonaldi, Yi-Ling Chung, Gavin Abercrombie, Marco Guerini

    Abstract: In recent years, counterspeech has emerged as one of the most promising strategies to fight online hate. These non-escalatory responses tackle online abuse while preserving the freedom of speech of the users, and can have a tangible impact in reducing online and offline violence. Recently, there has been growing interest from the Natural Language Processing (NLP) community in addressing the challe…

    Submitted 29 March, 2024; originally announced March 2024.

    Comments: To appear in Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics (findings)

  8. A Design Space for Intelligent and Interactive Writing Assistants

    Authors: Mina Lee, Katy Ilonka Gero, John Joon Young Chung, Simon Buckingham Shum, Vipul Raheja, Hua Shen, Subhashini Venugopalan, Thiemo Wambsganss, David Zhou, Emad A. Alghamdi, Tal August, Avinash Bhat, Madiha Zahrah Choksi, Senjuti Dutta, Jin L. C. Guo, Md Naimul Hoque, Yewon Kim, Simon Knight, Seyed Parsa Neshaei, Agnia Sergeyuk, Antonette Shibani, Disha Shrivastava, Lila Shroff, Jessi Stark, Sarah Sterman, et al. (11 additional authors not shown)

    Abstract: In our era of rapid technological advancement, the research landscape for writing assistants has become increasingly fragmented across various research communities. We seek to address this challenge by proposing a design space as a structured way to examine and explore the multidimensional space of intelligent and interactive writing assistants. Through a large community collaboration, we explore…

    Submitted 26 March, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

    Comments: Published as a conference paper at CHI 2024

  9. arXiv:2403.09159  [pdf, ps, other]

    cs.CL

    Basque and Spanish Counter Narrative Generation: Data Creation and Evaluation

    Authors: Jaione Bengoetxea, Yi-Ling Chung, Marco Guerini, Rodrigo Agerri

    Abstract: Counter Narratives (CNs) are non-negative textual responses to Hate Speech (HS) aiming at defusing online hatred and mitigating its spreading across media. Despite the recent increase in HS content posted online, research on automatic CN generation has been relatively scarce and predominantly focused on English. In this paper, we present CONAN-EUS, a new Basque and Spanish dataset for CN generatio…

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: Accepted for the Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING) 2024

  10. arXiv:2403.07592  [pdf, other]

    cs.CV

    Accurate Spatial Gene Expression Prediction by integrating Multi-resolution features

    Authors: Youngmin Chung, Ji Hun Ha, Kyeong Chan Im, Joo Sang Lee

    Abstract: Recent advancements in Spatial Transcriptomics (ST) technology have facilitated detailed gene expression analysis within tissue contexts. However, the high costs and methodological limitations of ST necessitate a more robust predictive model. In response, this paper introduces TRIPLEX, a novel deep learning framework designed to predict spatial gene expression from Whole Slide Images (WSIs). TRIPL…

    Submitted 25 April, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

    Comments: Accepted to CVPR 2024

  11. Authors' Values and Attitudes Towards AI-bridged Scalable Personalization of Creative Language Arts

    Authors: Taewook Kim, Hyomin Han, Eytan Adar, Matthew Kay, John Joon Young Chung

    Abstract: Generative AI has the potential to create a new form of interactive media: AI-bridged creative language arts (CLA), which bridge the author and audience by personalizing the author's vision to the audience's context and taste at scale. However, it is unclear what the authors' values and attitudes would be regarding AI-bridged CLA. To identify these values and attitudes, we conducted an interview s…

    Submitted 1 March, 2024; originally announced March 2024.

    Comments: 16 pages, 6 figures, 2 tables. Accepted to ACM CHI 2024

  12. arXiv:2402.11223  [pdf, other]

    cs.LG

    HEAL: Brain-inspired Hyperdimensional Efficient Active Learning

    Authors: Yang Ni, Zhuowen Zou, Wenjun Huang, Hanning Chen, William Youngwoo Chung, Samuel Cho, Ranganath Krishnan, Pietro Mercati, Mohsen Imani

    Abstract: Drawing inspiration from the outstanding learning capability of our human brains, Hyperdimensional Computing (HDC) emerges as a novel computing paradigm, and it leverages high-dimensional vector presentation and operations for brain-like lightweight Machine Learning (ML). Practical deployments of HDC have significantly enhanced the learning efficiency compared to current deep ML methods on a broad…

    Submitted 17 February, 2024; originally announced February 2024.

  13. arXiv:2402.08025  [pdf, other]

    cs.CV

    Beyond the Mud: Datasets and Benchmarks for Computer Vision in Off-Road Racing

    Authors: Jacob Tyo, Motolani Olarinre, Youngseog Chung, Zachary C. Lipton

    Abstract: Despite significant progress in optical character recognition (OCR) and computer vision systems, robustly recognizing text and identifying people in images taken in unconstrained in-the-wild environments remain an ongoing challenge. However, such obstacles must be overcome in practical applications of vision systems, such as identifying racers in photos taken during off-road racing events.…

    Submitted 12 February, 2024; originally announced February 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2311.09256

  14. arXiv:2401.12295  [pdf, other]

    cs.CL

    Cheap Learning: Maximising Performance of Language Models for Social Data Science Using Minimal Data

    Authors: Leonardo Castro-Gonzalez, Yi-Ling Chung, Hannah Rose Kirk, John Francis, Angus R. Williams, Pica Johansson, Jonathan Bright

    Abstract: The field of machine learning has recently made significant progress in reducing the requirements for labelled training data when building new models. These 'cheaper' learning techniques hold significant potential for the social sciences, where development of large labelled training datasets is often a significant practical impediment to the use of machine learning for analytical tasks. In this ar…

    Submitted 22 January, 2024; originally announced January 2024.

    Comments: 39 pages, 10 figures, 6 tables

    ACM Class: I.2.7; J.4

  15. arXiv:2401.09294  [pdf, other]

    cs.SD cs.AI cs.LG eess.AS eess.SP

    T-FOLEY: A Controllable Waveform-Domain Diffusion Model for Temporal-Event-Guided Foley Sound Synthesis

    Authors: Yoonjin Chung, Junwon Lee, Juhan Nam

    Abstract: Foley sound, audio content inserted synchronously with videos, plays a critical role in the user experience of multimedia content. Recently, there has been active research in Foley sound synthesis, leveraging the advancements in deep generative models. However, such works mainly focus on replicating a single sound class or a textual sound description, neglecting temporal information, which is cruc…

    Submitted 17 January, 2024; originally announced January 2024.

  16. arXiv:2401.08117  [pdf, other]

    cs.CV cs.AI cs.MM

    E2HQV: High-Quality Video Generation from Event Camera via Theory-Inspired Model-Aided Deep Learning

    Authors: Qiang Qu, Yiran Shen, Xiaoming Chen, Yuk Ying Chung, Tongliang Liu

    Abstract: The bio-inspired event cameras or dynamic vision sensors are capable of asynchronously capturing per-pixel brightness changes (called event-streams) in high temporal resolution and high dynamic range. However, the non-structural spatial-temporal event-streams make it challenging for providing intuitive visualization with rich semantic information for human vision. It calls for events-to-video (E2V…

    Submitted 16 January, 2024; originally announced January 2024.

    Comments: Accepted in AAAI2024

  17. arXiv:2401.00740  [pdf, other]

    eess.IV cs.CV

    Beyond Subspace Isolation: Many-to-Many Transformer for Light Field Image Super-resolution

    Authors: Zeke Zexi Hu, Xiaoming Chen, Vera Yuk Ying Chung, Yiran Shen

    Abstract: The effective extraction of spatial-angular features plays a crucial role in light field image super-resolution (LFSR) tasks, and the introduction of convolution and Transformers leads to significant improvement in this area. Nevertheless, due to the large 4D data volume of light field images, many existing methods opted to decompose the data into a number of lower-dimensional subspaces and perfor…

    Submitted 1 January, 2024; originally announced January 2024.

  18. arXiv:2312.11949  [pdf, other]

    cs.HC

    CreativeConnect: Supporting Reference Recombination for Graphic Design Ideation with Generative AI

    Authors: DaEun Choi, Sumin Hong, Jeongeon Park, John Joon Young Chung, Juho Kim

    Abstract: Graphic designers often get inspiration through the recombination of references. Our formative study (N=6) reveals that graphic designers focus on conceptual keywords during this process, and want support for discovering the keywords, expanding them, and exploring diverse recombination options of them, while still having room for designers' creativity. We propose CreativeConnect, a system with gen…

    Submitted 6 March, 2024; v1 submitted 19 December, 2023; originally announced December 2023.

  19. arXiv:2312.06279  [pdf, other]

    cs.LG cs.AI

    Regional Correlation Aided Mobile Traffic Prediction with Spatiotemporal Deep Learning

    Authors: JeongJun Park, Lusungu J. Mwasinga, Huigyu Yang, Syed M. Raza, Duc-Tai Le, Moonseong Kim, Min Young Chung, Hyunseung Choo

    Abstract: Mobile traffic data in urban regions shows differentiated patterns during different hours of the day. The exploitation of these patterns enables highly accurate mobile traffic prediction for proactive network management. However, recent Deep Learning (DL) driven studies have only exploited spatiotemporal features and have ignored the geographical correlations, causing high complexity and erroneous…

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: 4 pages, 5 figures, 1 table. This paper has been accepted at the IEEE Consumer Communications & Networking Conference (CCNC) 2024

  20. arXiv:2312.05187  [pdf, other]

    cs.CL cs.SD eess.AS

    Seamless: Multilingual Expressive and Streaming Speech Translation

    Authors: Seamless Communication, Loïc Barrault, Yu-An Chung, Mariano Coria Meglioli, David Dale, Ning Dong, Mark Duppenthaler, Paul-Ambroise Duquenne, Brian Ellis, Hady Elsahar, Justin Haaheim, John Hoffman, Min-Jae Hwang, Hirofumi Inaguma, Christopher Klaiber, Ilia Kulikov, Pengwei Li, Daniel Licht, Jean Maillard, Ruslan Mavlyutov, Alice Rakotoarison, Kaushik Ram Sadagopan, Abinesh Ramakrishnan, Tuan Tran, Guillaume Wenzek, et al. (40 additional authors not shown)

    Abstract: Large-scale automatic speech translation systems today lack key features that help machine-mediated communication feel seamless when compared to human-to-human dialogue. In this work, we introduce a family of models that enable end-to-end expressive and multilingual translations in a streaming fashion. First, we contribute an improved version of the massively multilingual and multimodal SeamlessM4…

    Submitted 8 December, 2023; originally announced December 2023.

  21. arXiv:2311.09256  [pdf, other]

    cs.CV

    Reading Between the Mud: A Challenging Motorcycle Racer Number Dataset

    Authors: Jacob Tyo, Youngseog Chung, Motolani Olarinre, Zachary C. Lipton

    Abstract: This paper introduces the off-road motorcycle Racer number Dataset (RnD), a new challenging dataset for optical character recognition (OCR) research. RnD contains 2,411 images from professional motorsports photographers that depict motorcycle racers in off-road competitions. The images exhibit a wide variety of factors that make OCR difficult, including mud occlusions, motion blur, non-standard fo…

    Submitted 14 November, 2023; originally announced November 2023.

  22. arXiv:2311.08488  [pdf, other]

    cs.CV

    MUDD: A New Re-Identification Dataset with Efficient Annotation for Off-Road Racers in Extreme Conditions

    Authors: Jacob Tyo, Motolani Olarinre, Youngseog Chung, Zachary C. Lipton

    Abstract: Re-identifying individuals in unconstrained environments remains an open challenge in computer vision. We introduce the Muddy Racer re-IDentification Dataset (MUDD), the first large-scale benchmark for matching identities of motorcycle racers during off-road competitions. MUDD exhibits heavy mud occlusion, motion blurring, complex poses, and extreme lighting conditions previously unseen in existin…

    Submitted 14 November, 2023; originally announced November 2023.

  23. arXiv:2309.07707  [pdf, other]

    cs.CL cs.SD eess.AS

    CoLLD: Contrastive Layer-to-layer Distillation for Compressing Multilingual Pre-trained Speech Encoders

    Authors: Heng-Jui Chang, Ning Dong, Ruslan Mavlyutov, Sravya Popuri, Yu-An Chung

    Abstract: Large-scale self-supervised pre-trained speech encoders outperform conventional approaches in speech recognition and translation tasks. Due to the high cost of developing these large models, building new encoders for new tasks and deploying them to on-device applications are infeasible. Prior studies propose model compression methods to address this issue, but those works focus on smaller models a…

    Submitted 27 December, 2023; v1 submitted 14 September, 2023; originally announced September 2023.

    Comments: Accepted to ICASSP 2024

  24. arXiv:2308.11596  [pdf, other]

    cs.CL

    SeamlessM4T: Massively Multilingual & Multimodal Machine Translation

    Authors: Seamless Communication, Loïc Barrault, Yu-An Chung, Mariano Coria Meglioli, David Dale, Ning Dong, Paul-Ambroise Duquenne, Hady Elsahar, Hongyu Gong, Kevin Heffernan, John Hoffman, Christopher Klaiber, Pengwei Li, Daniel Licht, Jean Maillard, Alice Rakotoarison, Kaushik Ram Sadagopan, Guillaume Wenzek, Ethan Ye, Bapi Akula, Peng-Jen Chen, Naji El Hachem, Brian Ellis, Gabriel Mejia Gonzalez, Justin Haaheim, et al. (43 additional authors not shown)

    Abstract: What does it take to create the Babel Fish, a tool that can help individuals translate speech between any two languages? While recent breakthroughs in text-based models have pushed machine translation coverage beyond 200 languages, unified speech-to-speech translation models have yet to achieve similar strides. More specifically, conventional speech-to-speech translation systems rely on cascaded s…

    Submitted 24 October, 2023; v1 submitted 22 August, 2023; originally announced August 2023.

    ACM Class: I.2.7

  25. PromptPaint: Steering Text-to-Image Generation Through Paint Medium-like Interactions

    Authors: John Joon Young Chung, Eytan Adar

    Abstract: While diffusion-based text-to-image (T2I) models provide a simple and powerful way to generate images, guiding this generation remains a challenge. For concepts that are difficult to describe through language, users may struggle to create prompts. Moreover, many of these models are built as end-to-end systems, lacking support for iterative shaping of the image. In response, we introduce PromptPain…

    Submitted 9 August, 2023; originally announced August 2023.

    Comments: Accepted to UIST2023

  26. arXiv:2307.16811  [pdf, other]

    cs.CL cs.CY

    DoDo Learning: DOmain-DemOgraphic Transfer in Language Models for Detecting Abuse Targeted at Public Figures

    Authors: Angus R. Williams, Hannah Rose Kirk, Liam Burke, Yi-Ling Chung, Ivan Debono, Pica Johansson, Francesca Stevens, Jonathan Bright, Scott A. Hale

    Abstract: Public figures receive a disproportionate amount of abuse on social media, impacting their active participation in public life. Automated systems can identify abuse at scale but labelling training data is expensive, complex and potentially harmful. So, it is desirable that systems are efficient and generalisable, handling both shared and specific aspects of online abuse. We explore the dynamics of…

    Submitted 25 April, 2024; v1 submitted 31 July, 2023; originally announced July 2023.

    Comments: 15 pages, 7 figures, 4 tables

  27. arXiv:2307.04761  [pdf, other]

    cs.CL cs.AI cs.CY

    Understanding Counterspeech for Online Harm Mitigation

    Authors: Yi-Ling Chung, Gavin Abercrombie, Florence Enock, Jonathan Bright, Verena Rieser

    Abstract: Counterspeech offers direct rebuttals to hateful speech by challenging perpetrators of hate and showing support to targets of abuse. It provides a promising alternative to more contentious measures, such as content moderation and deplatforming, by contributing a greater amount of positive online speech rather than attempting to mitigate harmful content through removal. Advances in the development…

    Submitted 1 July, 2023; originally announced July 2023.

    Comments: 21 pages, 2 figures, 2 tables

  28. arXiv:2306.15189  [pdf, ps, other]

    cs.CV

    FBA-Net: Foreground and Background Aware Contrastive Learning for Semi-Supervised Atrium Segmentation

    Authors: Yunsung Chung, Chanho Lim, Chao Huang, Nassir Marrouche, Jihun Hamm

    Abstract: Medical image segmentation of gadolinium enhancement magnetic resonance imaging (GE MRI) is an important task in clinical applications. However, manual annotation is time-consuming and requires specialized expertise. Semi-supervised segmentation methods that leverage both labeled and unlabeled data have shown promise, with contrastive learning emerging as a particularly effective approach. In this…

    Submitted 27 June, 2023; originally announced June 2023.

    Comments: 11 pages, 2 figures

  29. Increasing Diversity While Maintaining Accuracy: Text Data Generation with Large Language Models and Human Interventions

    Authors: John Joon Young Chung, Ece Kamar, Saleema Amershi

    Abstract: Large language models (LLMs) can be used to generate text data for training and evaluating other models. However, creating high-quality datasets with LLMs can be challenging. In this work, we explore human-AI partnerships to facilitate high diversity and accuracy in LLM-based text data generation. We first examine two approaches to diversify text generation: 1) logit suppression, which minimizes t…

    Submitted 7 June, 2023; originally announced June 2023.

    Comments: Accepted as a long paper at ACL 2023

  30. arXiv:2305.18655  [pdf, other]

    cs.LG cs.AI stat.ML

    Parity Calibration

    Authors: Youngseog Chung, Aaron Rumack, Chirag Gupta

    Abstract: In a sequential regression setting, a decision-maker may be primarily concerned with whether the future observation will increase or decrease compared to the current one, rather than the actual value of the future observation. In this context, we introduce the notion of parity calibration, which captures the goal of calibrated forecasting for the increase-decrease (or "parity") event in a timeseri…

    Submitted 7 June, 2023; v1 submitted 29 May, 2023; originally announced May 2023.

    Comments: To appear at UAI 2023 (Oral); 19 pages and 10 figures

  31. arXiv:2305.13931  [pdf, other]

    cs.IR

    Position Bias Estimation with Item Embedding for Sparse Dataset

    Authors: Shion Ishikawa, Yun Ching Liu, Young-Joo Chung, Yu Hirate

    Abstract: Estimating position bias is a well-known challenge in Learning to Rank (L2R). Click data in e-commerce applications, such as targeted advertisements and search engines, provides implicit but abundant feedback to improve personalized rankings. However, click data inherently includes various biases like position bias. Based on the position-based click model, Result Randomization and Regression Expec…

    Submitted 11 March, 2024; v1 submitted 9 May, 2023; originally announced May 2023.

  32. arXiv:2304.01919  [pdf, other]

    cs.HC

    viz2viz: Prompt-driven stylized visualization generation using a diffusion model

    Authors: Jiaqi Wu, John Joon Young Chung, Eytan Adar

    Abstract: Creating stylized visualization requires going beyond the limited, abstract, geometric marks produced by most tools. Rather, the designer builds stylized idioms where the marks are both transformed (e.g., photographs of candles instead of bars) and also synthesized into a 'scene' that pushes the boundaries of traditional visualizations. To support this, we introduce viz2viz, a system for transform…

    Submitted 4 April, 2023; originally announced April 2023.

    Comments: 17 pages

  33. arXiv:2303.17595  [pdf, other]

    cs.CV cs.LG

    Neglected Free Lunch -- Learning Image Classifiers Using Annotation Byproducts

    Authors: Dongyoon Han, Junsuk Choe, Seonghyeok Chun, John Joon Young Chung, Minsuk Chang, Sangdoo Yun, Jean Y. Song, Seong Joon Oh

    Abstract: Supervised learning of image classifiers distills human knowledge into a parametric model through pairs of images and corresponding labels (X,Y). We argue that this simple and widely used representation of human knowledge neglects rich auxiliary information from the annotation procedure, such as the time-series of mouse traces and clicks left after image selection. Our insight is that such annotat…

    Submitted 26 July, 2023; v1 submitted 30 March, 2023; originally announced March 2023.

    Comments: Code & data at https://github.com/naver-ai/NeglectedFreeLunch. To be presented at ICCV'23

  34. arXiv:2303.15110  [pdf, other]

    cs.CL cs.AI

    Beyond Toxic: Toxicity Detection Datasets are Not Enough for Brand Safety

    Authors: Elizaveta Korotkova, Isaac Kwan Yin Chung

    Abstract: The rapid growth in user generated content on social media has resulted in a significant rise in demand for automated content moderation. Various methods and frameworks have been proposed for the tasks of hate speech detection and toxic comment classification. In this work, we combine common datasets to extend these tasks to brand safety. Brand safety aims to protect commercial branding by identif…

    Submitted 27 March, 2023; originally announced March 2023.

  35. arXiv:2303.14870  [pdf, other]

    cs.RO cs.AI cs.LG

    Bi-Manual Block Assembly via Sim-to-Real Reinforcement Learning

    Authors: Satoshi Kataoka, Youngseog Chung, Seyed Kamyar Seyed Ghasemipour, Pannag Sanketi, Shixiang Shane Gu, Igor Mordatch

    Abstract: Most successes in robotic manipulation have been restricted to single-arm gripper robots, whose low dexterity limits the range of solvable tasks to pick-and-place, insertion, and object rearrangement. More complex tasks such as assembly require dual and multi-arm platforms, but entail a suite of unique challenges such as bi-arm coordination and collision avoidance, robust grasping, and long-horiz…

    Submitted 26 March, 2023; originally announced March 2023.

    Comments: Our accompanying project webpage can be found at: https://sites.google.com/view/u-shape-block-assembly. arXiv admin note: substantial text overlap with arXiv:2203.08277

  36. Distributed Timed Elastic Band (DTEB) Planner: Trajectory Sharing and Collision Prediction for Multi-Robot Systems

    Authors: Yiu Ming Chung, Hazem Youssef, Moritz Roidl

    Abstract: Autonomous navigation of mobile robots is a well studied problem in robotics. However, the navigation task becomes challenging when multi-robot systems have to cooperatively navigate dynamic environments with deadlock-prone layouts. We present a Distributed Timed Elastic Band (DTEB) Planner that combines Prioritized Planning with the online TEB trajectory Planner, in order to extend the capabiliti…

    Submitted 20 March, 2023; originally announced March 2023.

    Comments: Published in the International Conference on Robotics and Automation (ICRA) - 2022 https://ieeexplore.ieee.org/document/9811762

    Journal ref: ICRA (2022) pp. 10702-10708

  37. arXiv:2303.10961  [pdf, other]

    eess.IV cs.CV cs.MM

    LFACon: Introducing Anglewise Attention to No-Reference Quality Assessment in Light Field Space

    Authors: Qiang Qu, Xiaoming Chen, Yuk Ying Chung, Weidong Cai

    Abstract: Light field imaging can capture both the intensity information and the direction information of light rays. It naturally enables a six-degrees-of-freedom viewing experience and deep user engagement in virtual reality. Compared to 2D image assessment, light field image quality assessment (LFIQA) needs to consider not only the image quality in the spatial domain but also the quality consistency in t…

    Submitted 20 March, 2023; originally announced March 2023.

    Comments: Accepted for IEEE VR 2023 (TVCG Special Issues) (Early Access)

  38. arXiv:2212.08055  [pdf, other]

    cs.CL cs.SD eess.AS

    UnitY: Two-pass Direct Speech-to-speech Translation with Discrete Units

    Authors: Hirofumi Inaguma, Sravya Popuri, Ilia Kulikov, Peng-Jen Chen, Changhan Wang, Yu-An Chung, Yun Tang, Ann Lee, Shinji Watanabe, Juan Pino

    Abstract: Direct speech-to-speech translation (S2ST), in which all components can be optimized jointly, is advantageous over cascaded approaches to achieve fast inference with a simplified pipeline. We present a novel two-pass direct S2ST architecture, UnitY, which first generates textual representations and predicts discrete acoustic units subsequently. We enhance the model performance by subword predictio…

    Submitted 26 May, 2023; v1 submitted 15 December, 2022; originally announced December 2022.

    Comments: ACL 2023 (main conference)

  39. arXiv:2212.01414  [pdf, other]

    cs.IR cs.LG

    Meta-Shop: Improving Item Advertisement For Small Businesses

    Authors: Yang Shi, Guannan Liang, Young-joo Chung

    Abstract: In this paper, we study item advertisements for small businesses. This application recommends prospective customers to specific items requested by businesses. From analysis, we found that the existing Recommender Systems (RS) were ineffective for small/new businesses with a few sales history. Training samples in RS can be highly biased toward popular businesses with sufficient sales and can decrea…

    Submitted 2 December, 2022; originally announced December 2022.

  40. arXiv:2211.07131  [pdf, other]

    cs.SD cs.LG cs.MM eess.AS

    YM2413-MDB: A Multi-Instrumental FM Video Game Music Dataset with Emotion Annotations

    Authors: Eunjin Choi, Yoonjin Chung, Seolhee Lee, JongIk Jeon, Taegyun Kwon, Juhan Nam

    Abstract: Existing multi-instrumental datasets tend to be biased toward pop and classical music. In addition, they generally lack high-level annotations such as emotion tags. In this paper, we propose YM2413-MDB, an 80s FM video game music dataset with multi-label emotion annotations. It includes 669 audio and MIDI files of music from Sega and MSX PC games in the 80s using YM2413, a programmable sound gener…

    Submitted 14 November, 2022; originally announced November 2022.

    Comments: The paper has been accepted for publication at ISMIR 2022

    ACM Class: I.2.1; I.2.7

  41. arXiv:2211.06474  [pdf, other]

    cs.CL cs.SD eess.AS

    Speech-to-Speech Translation For A Real-world Unwritten Language

    Authors: Peng-Jen Chen, Kevin Tran, Yilin Yang, Jingfei Du, Justine Kao, Yu-An Chung, Paden Tomasello, Paul-Ambroise Duquenne, Holger Schwenk, Hongyu Gong, Hirofumi Inaguma, Sravya Popuri, Changhan Wang, Juan Pino, Wei-Ning Hsu, Ann Lee

    Abstract: We study speech-to-speech translation (S2ST) that translates speech from one language into another language and focuses on building systems to support languages without standard text writing systems. We use English-Taiwanese Hokkien as a case study, and present an end-to-end solution from training data collection, modeling choices to benchmark dataset release. First, we present efforts on creating…

    Submitted 11 November, 2022; originally announced November 2022.

  42. arXiv:2211.04011  [pdf, other]

    cs.LG cs.CV

    An Incremental Phase Mapping Approach for X-ray Diffraction Patterns using Binary Peak Representations

    Authors: Dipendra Jha, K. V. L. V. Narayanachari, Ruifeng Zhang, Justin Liao, Denis T. Keane, Wei-keng Liao, Alok Choudhary, Yip-Wah Chung, Michael Bedzyk, Ankit Agrawal

    Abstract: Despite the huge advancement in knowledge discovery and data mining techniques, the X-ray diffraction (XRD) analysis process has mostly remained untouched and still involves manual investigation, comparison, and verification. Due to the large volume of XRD samples from high-throughput XRD experiments, it has become impossible for domain scientists to process them manually. Recently, they have star…

    Submitted 8 November, 2022; originally announced November 2022.

    Comments: Accepted and presented at the International Workshop on Domain-Driven Data Mining (DDDM) as a part of the SIAM International Conference on Data Mining (SDM 2021). Contains 11 pages and 5 figures

  43. arXiv:2210.14191  [pdf]

    cond-mat.mtrl-sci cs.LG physics.chem-ph

    A Database of Ultrastable MOFs Reassembled from Stable Fragments with Machine Learning Models

    Authors: Aditya Nandy, Shuwen Yue, Changhwan Oh, Chenru Duan, Gianmarco G. Terrones, Yongchul G. Chung, Heather J. Kulik

    Abstract: High-throughput screening of large hypothetical databases of metal-organic frameworks (MOFs) can uncover new materials, but their stability in real-world applications is often unknown. We leverage community knowledge and machine learning (ML) models to identify MOFs that are thermally stable and stable upon activation. We separate these MOFs into their building blocks and recombine them to make a…

    Submitted 25 October, 2022; originally announced October 2022.

  44. arXiv:2208.14594  [pdf, other]

    cs.IR cs.AI

    One-class Recommendation Systems with the Hinge Pairwise Distance Loss and Orthogonal Representations

    Authors: Ramin Raziperchikolaei, Young-joo Chung

    Abstract: In one-class recommendation systems, the goal is to learn a model from a small set of interacted users and items and then identify the positively-related user-item pairs among a large number of pairs with unknown interactions. Most previous loss functions rely on dissimilar pairs of users and items, which are selected from the ones with unknown interactions, to obtain better prediction performance…

    Submitted 30 August, 2022; originally announced August 2022.

    Comments: 16 pages, 7 figures

  45. arXiv:2208.11926  [pdf, other]

    cs.IR cs.AI

    Dynamic collaborative filtering Thompson Sampling for cross-domain advertisements recommendation

    Authors: Shion Ishikawa, Young-joo Chung, Yu Hirate

    Abstract: Recently online advertisers utilize Recommender systems (RSs) for display advertising to improve users' engagement. The contextual bandit model is a widely used RS to exploit and explore users' engagement and maximize the long-term rewards such as clicks or conversions. However, the current models aim to optimize a set of ads only in a specific domain and do not share information with other models…

    Submitted 26 October, 2022; v1 submitted 25 August, 2022; originally announced August 2022.

    Comments: Published at ADKDD 2022

  46. arXiv:2205.11353  [pdf, ps, other]

    cs.CG math.AT

    Gaussian Persistence Curves

    Authors: Yu-Min Chung, Michael Hull, Austin Lawson, Neil Pritchard

    Abstract: Topological data analysis (TDA) is a rising field in the intersection of mathematics, statistics, and computer science/data science. The cornerstone of TDA is persistent homology, which produces a summary of topological information called a persistence diagram. To utilize machine and deep learning methods on persistence diagrams, these diagrams are further summarized by transforming them into func…

    Submitted 23 May, 2022; originally announced May 2022.

    Comments: 19 pages

  47. arXiv:2205.10439  [pdf, other]

    cs.LG

    How Useful are Gradients for OOD Detection Really?

    Authors: Conor Igoe, Youngseog Chung, Ian Char, Jeff Schneider

    Abstract: One critical challenge in deploying highly performant machine learning models in real-life applications is out of distribution (OOD) detection. Given a predictive model which is accurate on in distribution (ID) data, an OOD detection system will further equip the model with the option to defer prediction when the input is novel and the model has little confidence in prediction. There has been some…

    Submitted 20 May, 2022; originally announced May 2022.

  48. arXiv:2204.11593  [pdf, other]

    cs.IR cs.CV

    Scaling Cross-Domain Content-Based Image Retrieval for E-commerce Snap and Search Application

    Authors: Isaac Kwan Yin Chung, Minh Tran, Eran Nussinovitch

    Abstract: In this industry talk at ECIR 2022, we illustrate how we approach the main challenges from large scale cross-domain content-based image retrieval using a cascade method and a combination of our visual search and classification capabilities. Specifically, we present a system that is able to handle the scale of the data for e-commerce usage and the cross-domain nature of the query and gallery image…

    Submitted 13 April, 2022; originally announced April 2022.

    Comments: ECIR 2022 Industry Day

  49. arXiv:2204.08569  [pdf, other]

    cs.IR cs.LG

    Learning Similarity Preserving Binary Codes for Recommender Systems

    Authors: Yang Shi, Young-joo Chung

    Abstract: Hashing-based Recommender Systems (RSs) are widely studied to provide scalable services. The existing methods for the systems combine three modules to achieve efficiency: feature extraction, interaction modeling, and binarization. In this paper, we study an unexplored module combination for the hashing-based recommender systems, namely Compact Cross-Similarity Recommender (CCSR). Inspired by cross…

    Submitted 18 April, 2022; originally announced April 2022.

  50. Exploring the Universality of Hadronic Jet Classification

    Authors: Kingman Cheung, Yi-Lun Chung, Shih-Chieh Hsu, Benjamin Nachman

    Abstract: The modeling of jet substructure significantly differs between Parton Shower Monte Carlo (PSMC) programs. Despite this, we observe that machine learning classifiers trained on different PSMCs learn nearly the same function. This means that when these classifiers are applied to the same PSMC for testing, they result in nearly the same performance. This classifier universality indicates that a machi…

    Submitted 7 April, 2022; originally announced April 2022.

    Comments: 25 pages, 7 figures, 7 tables