-
Investigating How Large Language Models Leverage Internal Knowledge to Perform Complex Reasoning
Authors:
Miyoung Ko,
Sue Hyun Park,
Joonsuk Park,
Minjoon Seo
Abstract:
Despite significant advancements, there is a limited understanding of how large language models (LLMs) utilize knowledge for reasoning. To address this, we propose a method that deconstructs complex real-world questions into a graph, representing each question as a node with parent nodes of background knowledge needed to solve the question. We develop the DepthQA dataset, deconstructing questions into three depths: (i) recalling conceptual knowledge, (ii) applying procedural knowledge, and (iii) analyzing strategic knowledge. Based on a hierarchical graph, we quantify forward discrepancy, discrepancies in LLMs' performance on simpler sub-problems versus complex questions. We also measure backward discrepancy, where LLMs answer complex questions but struggle with simpler ones. Our analysis shows that smaller models have more discrepancies than larger models. Additionally, guiding models from simpler to complex questions through multi-turn interactions improves performance across model sizes, highlighting the importance of structured intermediate steps in knowledge reasoning. This work enhances our understanding of LLM reasoning and suggests ways to improve their problem-solving abilities.
Submitted 27 June, 2024;
originally announced June 2024.
-
VS-PINN: A Fast and efficient training of physics-informed neural networks using variable-scaling methods for solving PDEs with stiff behavior
Authors:
Seungchan Ko,
Sang Hyeon Park
Abstract:
Physics-informed neural networks (PINNs) have recently emerged as a promising way to compute the solutions of partial differential equations (PDEs) using deep neural networks. However, despite their significant success in various fields, it remains unclear in many aspects how to effectively train PINNs if the solutions of PDEs exhibit stiff behaviors or high frequencies. In this paper, we propose a new method for training PINNs using variable-scaling techniques. This method is simple and can be applied to a wide range of problems, including PDEs with rapidly varying solutions. Through various numerical experiments, we demonstrate the effectiveness of the proposed method for these problems and confirm that it can significantly improve the training efficiency and performance of PINNs. Furthermore, based on the analysis of the neural tangent kernel (NTK), we provide theoretical evidence for this phenomenon and show that our methods can indeed improve the performance of PINNs.
Submitted 10 June, 2024;
originally announced June 2024.
-
The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models
Authors:
Seungone Kim,
Juyoung Suk,
Ji Yong Cho,
Shayne Longpre,
Chaeeun Kim,
Dongkeun Yoon,
Guijin Son,
Yejin Cho,
Sheikh Shafayat,
Jinheon Baek,
Sue Hyun Park,
Hyeonbin Hwang,
Jinkyung Jo,
Hyowon Cho,
Haebin Shin,
Seongyun Lee,
Hanseok Oh,
Noah Lee,
Namgyu Ho,
Se June Joo,
Miyoung Ko,
Yoonjoo Lee,
Hyungjoo Chae,
Jamin Shin,
Joel Jang
, et al. (7 additional authors not shown)
Abstract:
As language models (LMs) become capable of handling a wide range of tasks, their evaluation is becoming as challenging as their development. Most generation benchmarks currently assess LMs using abstract evaluation criteria like helpfulness and harmlessness, which often lack the flexibility and granularity of human assessment. Additionally, these benchmarks tend to focus disproportionately on specific capabilities such as instruction following, leading to coverage bias. To overcome these limitations, we introduce the BiGGen Bench, a principled generation benchmark designed to thoroughly evaluate nine distinct capabilities of LMs across 77 diverse tasks. A key feature of the BiGGen Bench is its use of instance-specific evaluation criteria, closely mirroring the nuanced discernment of human evaluation. We apply this benchmark to assess 103 frontier LMs using five evaluator LMs. Our code, data, and evaluation results are all publicly available at https://github.com/prometheus-eval/prometheus-eval/tree/main/BiGGen-Bench.
Submitted 9 June, 2024;
originally announced June 2024.
-
Improving Text Generation on Images with Synthetic Captions
Authors:
Jun Young Koh,
Sang Hyun Park,
Joy Song
Abstract:
The recent emergence of latent diffusion models such as SDXL and SD 1.5 has shown significant capability in generating highly detailed and realistic images. Despite their remarkable ability to produce images, generating accurate text within images still remains a challenging task. In this paper, we examine the validity of fine-tuning approaches in generating legible text within the image. We propose a low-cost approach by leveraging SDXL without any time-consuming training on large-scale datasets. The proposed strategy employs a fine-tuning technique that examines the effects of data refinement levels and synthetic captions. Moreover, our results demonstrate how our small scale fine-tuning approach can improve the accuracy of text generation in different scenarios without the need of additional multimodal encoders. Our experiments show that with the addition of random letters to our raw dataset, our model's performance improves in producing well-formed visual text.
Submitted 1 June, 2024;
originally announced June 2024.
-
Subject-Adaptive Transfer Learning Using Resting State EEG Signals for Cross-Subject EEG Motor Imagery Classification
Authors:
Sion An,
Myeongkyun Kang,
Soopil Kim,
Philip Chikontwe,
Li Shen,
Sang Hyun Park
Abstract:
Electroencephalography (EEG) motor imagery (MI) classification is a fundamental yet challenging task due to the variation of signals between individuals, i.e., inter-subject variability. Previous approaches try to mitigate this using task-specific (TS) EEG signals from the target subject in training. However, recording TS EEG signals requires time and limits their applicability in various fields. In contrast, resting state (RS) EEG signals are a viable alternative due to ease of acquisition with rich subject information. In this paper, we propose a novel subject-adaptive transfer learning strategy that utilizes RS EEG signals to adapt models on unseen subject data. Specifically, we disentangle extracted features into task- and subject-dependent features and use them to calibrate RS EEG signals for obtaining task information while preserving subject characteristics. The calibrated signals are then used to adapt the model to the target subject, enabling the model to simulate processing TS EEG signals of the target subject. The proposed method achieves state-of-the-art accuracy on three public benchmarks, demonstrating the effectiveness of our method in cross-subject EEG MI classification. Our findings highlight the potential of leveraging RS EEG signals to advance practical brain-computer interface systems.
Submitted 17 May, 2024;
originally announced May 2024.
-
Aligning to Thousands of Preferences via System Message Generalization
Authors:
Seongyun Lee,
Sue Hyun Park,
Seungone Kim,
Minjoon Seo
Abstract:
Although humans inherently have diverse values, current large language model (LLM) alignment methods often assume that aligning LLMs with the general public's preferences is optimal. A major challenge in adopting a more individualized approach to LLM alignment is its lack of scalability, as it involves repeatedly acquiring preference data and training new reward models and LLMs for each individual's preferences. To address these challenges, we propose a new paradigm where users specify what they value most within the system message, steering the LLM's generation behavior to better align with the user's intentions. However, a naive application of such an approach is non-trivial since LLMs are typically trained on a uniform system message (e.g., "You are a helpful assistant") which limits their ability to generalize to diverse, unseen system messages. To improve this generalization, we create the Multifaceted Collection, a preference dataset with 192k combinations of values beyond generic helpfulness and harmlessness, spanning 65k user instructions. Using this dataset, we train a 7B LLM called Janus and test it on 921 prompts from 5 benchmarks (AlpacaEval 2.0, FLASK, Koala, MT-Bench, and Self-Instruct) by adding various unseen system messages that reflect user preferences. Janus achieves tie+win rate of 75.2%, 72.4%, and 66.4% against Mistral 7B Instruct v0.2, GPT-3.5 Turbo, and GPT-4, respectively. Unexpectedly, on three benchmarks focused on response helpfulness (AlpacaEval 2.0, MT-Bench, Arena Hard Auto v0.1), Janus also outperforms LLaMA 3 8B Instruct by a +4.0%, +0.1%, +3.0% margin, underscoring that training with a vast array of system messages could also enhance alignment to the general public's preference as well. Our code, dataset, benchmark, and models are available at https://github.com/kaistAI/Janus.
Submitted 28 May, 2024;
originally announced May 2024.
-
CAT: Contrastive Adapter Training for Personalized Image Generation
Authors:
Jae Wan Park,
Sang Hyun Park,
Jun Young Koh,
Junha Lee,
Min Song
Abstract:
The emergence of various adapters, including Low-Rank Adaptation (LoRA) applied from the field of natural language processing, has allowed diffusion models to personalize image generation at a low cost. However, due to the various challenges including limited datasets and shortage of regularization and computation resources, adapter training often results in unsatisfactory outcomes, leading to the corruption of the backbone model's prior knowledge. One of the well known phenomena is the loss of diversity in object generation, especially within the same class which leads to generating almost identical objects with minor variations. This poses challenges in generation capabilities. To solve this issue, we present Contrastive Adapter Training (CAT), a simple yet effective strategy to enhance adapter training through the application of CAT loss. Our approach facilitates the preservation of the base model's original knowledge when the model initiates adapters. Furthermore, we introduce the Knowledge Preservation Score (KPS) to evaluate CAT's ability to keep the former information. We qualitatively and quantitatively compare CAT's improvement. Finally, we mention the possibility of CAT in the aspects of multi-concept adapter and optimization.
Submitted 11 April, 2024;
originally announced April 2024.
-
HyperCLOVA X Technical Report
Authors:
Kang Min Yoo,
Jaegeun Han,
Sookyo In,
Heewon Jeon,
Jisu Jeong,
Jaewook Kang,
Hyunwook Kim,
Kyung-Min Kim,
Munhyong Kim,
Sungju Kim,
Donghyun Kwak,
Hanock Kwak,
Se Jung Kwon,
Bado Lee,
Dongsoo Lee,
Gichang Lee,
Jooho Lee,
Baeseong Park,
Seongjin Shin,
Joonsang Yu,
Seolki Baek,
Sumin Byeon,
Eungsup Cho,
Dooseok Choe,
Jeesung Han
, et al. (371 additional authors not shown)
Abstract:
We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment to responsible AI. The model is evaluated across various benchmarks, including comprehensive reasoning, knowledge, commonsense, factuality, coding, math, chatting, instruction-following, and harmlessness, in both Korean and English. HyperCLOVA X exhibits strong reasoning capabilities in Korean backed by a deep understanding of the language and cultural nuances. Further analysis of the inherent bilingual nature and its extension to multilingualism highlights the model's cross-lingual proficiency and strong generalization ability to untargeted languages, including machine translation between several language pairs and cross-lingual inference tasks. We believe that HyperCLOVA X can provide helpful guidance for regions or countries in developing their sovereign LLMs.
Submitted 13 April, 2024; v1 submitted 2 April, 2024;
originally announced April 2024.
-
Achieving Optical Refractive Index of 10-Plus by Colloidal Self-Assembly
Authors:
NaYeoun Kim,
Ji-Hyeok Huh,
YongDeok Cho,
Sung Hun Park,
Hyeon Ho Kim,
Kyung Hun Rho,
Jaewon Lee,
Seungwoo Lee
Abstract:
This study demonstrates the development of self-assembled optical metasurfaces to overcome inherent limitations in polarization density (P) within natural materials, which hinder achieving high refractive indices (n) at optical frequencies. The Maxwellian macroscopic description establishes a link between P and n, revealing a static limit in natural materials, restricting n to approximately 4.0 at optical frequencies. Optical metasurfaces, utilizing metallic colloids on a deep-subwavelength scale, offer a solution by unnaturally enhancing n through electric dipolar (ED) resonances. Self-assembly enables the creation of nanometer-scale metallic gaps between metallic nanoparticles (NPs), paving the way for achieving exceptionally high n at optical frequencies. This study focuses on assembling polyhedral gold (Au) NPs into a closely packed monolayer by rationally designing the polymeric ligand to balance attractive and repulsive forces, such that polymeric brush-mediated self-assembly of the close-packed Au NP monolayer is robustly achieved over a large area. The resulting monolayer of Au nanospheres (NSs), nanooctahedra (NOs), and nanocubes (NCs) exhibits high macroscopic integrity and crystallinity, sufficient to push n to record-high regimes. The study underscores the significance of capacitive coupling in achieving an unnaturally high n and explores fine-tuning Au NC size to optimize this coupling. The achieved n of 10.12 at optical frequencies stands as a benchmark, highlighting the potential of polyhedral Au NPs in advancing optical metasurfaces.
Submitted 25 March, 2024;
originally announced March 2024.
-
Tunable incommensurability and spontaneous symmetry breaking in the reconstructed moiré-of-moiré lattices
Authors:
Daesung Park,
Changwon Park,
Eunjung Ko,
Kunihiro Yananose,
Rebecca Engelke,
Xi Zhang,
Konstantin Davydov,
Matthew Green,
Sang Hwa Park,
Jae Heon Lee,
Kenji Watanabe,
Takashi Taniguchi,
Sang Mo Yang,
Ke Wang,
Philip Kim,
Young-Woo Son,
Hyobin Yoo
Abstract:
Imposing incommensurable periodicity on the periodic atomic lattice can lead to complex structural phases consisting of locally periodic structure bounded by topological defects. Twisted trilayer graphene (TTG) is an ideal material platform to study the interplay between different atomic periodicities, which can be tuned by twist angles between the layers, leading to moiré-of-moiré lattices. Interlayer and intralayer interactions between two interfaces in TTG transform this moiré-of-moiré lattice into an intricate network of domain structures at small twist angles, which can harbor exotic electronic behaviors. Here we report a complete structural phase diagram of TTG with atomic scale lattice reconstruction. Using transmission electron microscopy combined with a new interatomic potential simulation, we show that a cornucopia of large-scale moiré lattices, ranging from triangular, kagome, and a corner-shared hexagram-shaped domain pattern, are present. For small twist angles below 0.1°, all domains are bounded by a network of two-dimensional domain wall lattices. In particular, in the limit of small twist angles, the competition between interlayer stacking energy and the formation of discommensurate domain walls leads to unique spontaneous symmetry breaking structures with nematic orders, suggesting the pivotal role of long-range interactions across entire layers. The diverse tessellation of distinct domains, whose topological network can be tuned by the adjustment of the twist angles, establishes TTG as a platform for exploring the interplay between emerging quantum properties and controllable nontrivial lattices.
Submitted 24 February, 2024;
originally announced February 2024.
-
Prometheus-Vision: Vision-Language Model as a Judge for Fine-Grained Evaluation
Authors:
Seongyun Lee,
Seungone Kim,
Sue Hyun Park,
Geewook Kim,
Minjoon Seo
Abstract:
Assessing long-form responses generated by Vision-Language Models (VLMs) is challenging. It not only requires checking whether the VLM follows the given instruction but also verifying whether the text output is properly grounded on the given image. Inspired by the recent approach of evaluating LMs with LMs, in this work, we propose to evaluate VLMs with VLMs. For this purpose, we present a new feedback dataset called the Perception Collection, encompassing 15K customized score rubrics that users might care about during assessment. Using the Perception Collection, we train Prometheus-Vision, the first open-source VLM evaluator model that can understand the user-defined score criteria during evaluation. Prometheus-Vision shows the highest Pearson correlation with human evaluators and GPT-4V among open-source models, showing its effectiveness for transparent and accessible evaluation of VLMs. We open-source our code, dataset, and model at https://github.com/kaistAI/prometheus-vision
Submitted 12 January, 2024;
originally announced January 2024.
-
Few Shot Part Segmentation Reveals Compositional Logic for Industrial Anomaly Detection
Authors:
Soopil Kim,
Sion An,
Philip Chikontwe,
Myeongkyun Kang,
Ehsan Adeli,
Kilian M. Pohl,
Sang Hyun Park
Abstract:
Logical anomalies (LA) refer to data violating underlying logical constraints, e.g., the quantity, arrangement, or composition of components within an image. Accurately detecting such anomalies requires models to reason about various component types through segmentation. However, curation of pixel-level annotations for semantic segmentation is both time-consuming and expensive. Although there are some prior few-shot or unsupervised co-part segmentation algorithms, they often fail on images of industrial objects. These images have components with similar textures and shapes, so precise differentiation proves challenging. In this study, we introduce a novel component segmentation model for LA detection that leverages a few labeled samples and unlabeled images sharing logical constraints. To ensure consistent segmentation across unlabeled images, we employ a histogram matching loss in conjunction with an entropy loss. As segmentation predictions play a crucial role, we propose to enhance both local and global sample validity detection by capturing key aspects from visual semantics via three memory banks: class histograms, component composition embeddings, and patch-level representations. For effective LA detection, we propose an adaptive scaling strategy to standardize anomaly scores from different memory banks in inference. Extensive experiments on the public benchmark MVTec LOCO AD reveal that our method achieves 98.1% AUROC in LA detection vs. 89.6% from competing methods.
Submitted 15 April, 2024; v1 submitted 21 December, 2023;
originally announced December 2023.
-
Volcano: Mitigating Multimodal Hallucination through Self-Feedback Guided Revision
Authors:
Seongyun Lee,
Sue Hyun Park,
Yongrae Jo,
Minjoon Seo
Abstract:
Large multimodal models suffer from multimodal hallucination, where they provide incorrect responses misaligned with the given visual information. Recent works have conjectured that one of the reasons behind multimodal hallucination is the vision encoder failing to ground on the image properly. To mitigate this issue, we propose a novel approach that leverages self-feedback as visual cues. Building on this approach, we introduce Volcano, a multimodal self-feedback guided revision model. Volcano generates natural language feedback to its initial response based on the provided visual information and utilizes this feedback to self-revise its initial response. Volcano effectively reduces multimodal hallucination and achieves state-of-the-art on MMHal-Bench, POPE, and GAVIE. It also improves on general multimodal abilities and outperforms previous models on MM-Vet and MMBench. Through qualitative analysis, we show that Volcano's feedback is more properly grounded on the image than the initial response. This indicates that Volcano can provide itself with richer visual information through feedback generation, leading it to self-correct hallucinations. We publicly release our model, data, and code at https://github.com/kaistAI/Volcano
Submitted 2 April, 2024; v1 submitted 13 November, 2023;
originally announced November 2023.
-
The effects of Thomson scattering and chemical mixing on early-time light curves of double peaked type IIb supernovae
Authors:
Seong Hyun Park,
Sung-Chul Yoon,
Sergei Blinnikov
Abstract:
Previous numerical simulations of double-peaked SNe IIb light curves have demonstrated that the radius and mass of the hydrogen-rich envelope of the progenitor star can significantly influence the brightness and timescale of the early-time light curve around the first peak. In this study, we investigate how Thomson scattering and chemical mixing in the SN ejecta affect the optical light curves during the early stages of the SNe IIb using radiation hydrodynamics simulations. By comparing the results from two different numerical codes (i.e., STELLA and SNEC), we find that the optical brightness of the first peak can be reduced by more than a factor of 3 due to the effect of Thomson scattering that causes the thermalization depth to be located below the Rosseland-mean photosphere, compared to the corresponding case where this effect is ignored. We also observe a short-lived plateau-like feature lasting for a few days in the early-time optical light curves of our models, in contrast to typical observed SNe IIb that show a quasi-linear decrease in optical magnitudes after the first peak. A significant degree of chemical mixing between the hydrogen-rich envelope and the helium core in SN ejecta is required to reconcile this discrepancy between the model prediction and observation. Meanwhile, to properly reproduce the first peak, a significant mixing of $^{56}$Ni into the hydrogen-rich outermost layers should be restricted. Our findings indicate that inferring the SN IIb progenitor structure from a simplified approach that ignores these two factors may introduce substantial uncertainty.
Submitted 24 October, 2023;
originally announced October 2023.
-
Characterization of Broadband Purcell Filters with Compact Footprint for Fast Multiplexed Superconducting Qubit Readout
Authors:
Seong Hyeon Park,
Gahyun Choi,
Gyunghun Kim,
Jaehyeong Jo,
Bumsung Lee,
Geonyoung Kim,
Kibog Park,
Yong-Ho Lee,
Seungyong Hahn
Abstract:
Engineering the admittance of external environments connected to superconducting qubits is essential, as increasing the measurement speed introduces spontaneous emission loss to superconducting qubits, known as Purcell loss. Here, we report a broad-bandwidth Purcell filter design within a small footprint, which effectively suppresses Purcell loss without sacrificing fast measurement speed. We characterize the filter's frequency response at 4.3 K and also estimate Purcell loss suppression by finite-element-method simulations of superconducting planar circuit layouts with the proposed filter design. The measured bandwidth is over 790 MHz within 0.29 mm$^2$, while the estimated lifetime enhancement can be over 5000 times with multiple Purcell filters. The presented filter design is expected to be easily integrated on existing superconducting quantum circuits for fast and multiplexed readout without occupying a large footprint.
Submitted 27 December, 2023; v1 submitted 20 October, 2023;
originally announced October 2023.
-
4$f$ electron temperature driven ultrafast electron localization
Authors:
Kohei Yamagami,
Hiroki Ueda,
Urs Staub,
Yujun Zhang,
Kohei Yamamoto,
Sang Han Park,
Soonnam Kwon,
Akihiro Mitsuda,
Hirofumi Wada,
Takayuki Uozumi,
Kojiro Mimura,
Hiroki Wadati
Abstract:
Valence transitions in strongly correlated electron systems are caused by orbital hybridization and Coulomb interactions between localized and delocalized electrons. The transition can be triggered by changes in the electronic structure and is sensitive to temperature variations, applications of magnetic fields, and physical or chemical pressure. Launching the transition by photoelectric fields can directly excite the electronic states and thus provides an ideal platform to study the correlation among electrons on ultrafast timescales. The EuNi$_2$(Si$_{0.21}$Ge$_{0.79}$)$_2$ mixed-valence metal is an ideal material to investigate the valence transition of the Eu ions via the amplified orbital hybridization by the photoelectric field on sub-picosecond timescales. A direct view on the 4$f$ electron occupancy of the Eu ions is required to understand the microscopic origin of the transition. Here we probe the 4$f$ electron states of EuNi$_2$(Si$_{0.21}$Ge$_{0.79}$)$_2$ at the sub-ps timescale after photoexcitation by X-ray absorption spectroscopy across the Eu $M_5$-absorption edge. The observed spectral changes due to the excitation indicate a population change of total angular momentum multiplet states $J$ = 0, 1, 2, and 3 of Eu$^{3+}$, and the Eu$^{2+}$ $J$ = 7/2 multiplet state caused by an increase in 4$f$ electron temperature that results in a 4$f$ localization process. This electronic temperature increase combined with fluence-dependent screening accounts for the strongly non-linear effective valence change. The data allow us to extract a time-dependent determination of an effective temperature of the 4$f$ shell, which is also of great relevance in the understanding of metallic systems' properties, such as the ultrafast demagnetization of ferromagnetic rare-earth intermetallics and their all-optical magnetization switching.
Submitted 27 October, 2023; v1 submitted 12 September, 2023;
originally announced September 2023.
-
Helical boundary modes from synthetic spin in a plasmonic lattice
Authors:
Sang Hyun Park,
Michael Sammon,
Eugene Mele,
Tony Low
Abstract:
Artificial lattices have been used as a platform to extend the application of topological physics beyond electronic systems. Here, using the two-dimensional Lieb lattice as a prototypical example, we show that an array of disks, each supporting localized plasmon modes, gives rise to an analog of the quantum spin Hall state enforced by a synthetic time reversal symmetry. We find that an effective next-nearest-neighbor coupling mechanism intrinsic to the plasmonic disk array introduces a nontrivial $Z_2$ topological order and gaps out the Bloch spectrum. A faithful mapping of the plasmonic system onto a tight-binding model is developed and shown to capture its essential topological signatures. Full wave numerical simulations of graphene disks arranged in a Lieb lattice confirm the existence of propagating helical boundary modes in the nontrivial band gap.
Submitted 21 May, 2023;
originally announced May 2023.
-
IFSeg: Image-free Semantic Segmentation via Vision-Language Model
Authors:
Sukmin Yun,
Seong Hyeon Park,
Paul Hongsuck Seo,
Jinwoo Shin
Abstract:
Vision-language (VL) pre-training has recently gained much attention for its transferability and flexibility in novel concepts (e.g., cross-modality transfer) across various visual tasks. However, VL-driven segmentation has been under-explored, and the existing approaches still have the burden of acquiring additional training images or even segmentation annotations to adapt a VL model to downstream segmentation tasks. In this paper, we introduce a novel image-free segmentation task where the goal is to perform semantic segmentation given only a set of the target semantic categories, but without any task-specific images and annotations. To tackle this challenging task, our proposed method, coined IFSeg, generates VL-driven artificial image-segmentation pairs and updates a pre-trained VL model to a segmentation task. We construct this artificial training data by creating a 2D map of random semantic categories and another map of their corresponding word tokens. Given that a pre-trained VL model projects visual and text tokens into a common space where tokens that share the semantics are located closely, this artificially generated word map can replace the real image inputs for such a VL model. Through an extensive set of experiments, our model not only establishes an effective baseline for this novel task but also demonstrates strong performances compared to existing methods that rely on stronger supervision, such as task-specific images and segmentation masks. Code is available at https://github.com/alinlab/ifseg.
Submitted 25 March, 2023;
originally announced March 2023.
-
Epitaxially strained ultrathin LaNiO$_3$/LaAlO$_3$ and LaNiO$_3$/SrTiO$_3$ superlattices: a density functional theory + $U$ study
Authors:
Heung-Sik Kim,
Sang Hyeon Park,
Myung Joon Han
Abstract:
By employing first-principles electronic structure calculations we investigate nickelate superlattices [LaNiO$_3$]$_1$/[LaAlO$_3$]$_1$ and [LaNiO$_3$]$_1$/[SrTiO$_3$]$_1$ with (001) orientation under epitaxial tensile strain. Within density functional theory augmented by mean-field treatment of on-site electronic correlations, the ground states show remarkable dependence on the correlation strength and the strain. In the weakly and intermediately correlated regimes with small epitaxial strain, the charge-disproportionated insulating state with antiferromagnetic order is favored over the other orbital and spin ordered phases. On the other hand, in the strongly correlated regime or under the large tensile strain, ferromagnetic spin states with Jahn-Teller orbital order become most stable. The effect from polar interfaces in [LaNiO$_3$]$_1$/[SrTiO$_3$]$_1$ is found to be noticeable in our single-layered geometry. Detailed discussion is presented in comparison with previous experimental and theoretical studies.
Submitted 10 April, 2023; v1 submitted 5 February, 2023;
originally announced February 2023.
-
Search for the decay $B^0_s \rightarrow π^0 π^0$ at Belle
Authors:
Belle Collaboration,
J. Borah,
B. Bhuyan,
I. Adachi,
H. Aihara,
D. M. Asner,
V. Aulchenko,
T. Aushev,
R. Ayad,
V. Babu,
S. Bahinipati,
Sw. Banerjee,
P. Behera,
K. Belous,
J. Bennett,
M. Bessner,
V. Bhardwaj,
T. Bilka,
D. Biswas,
D. Bodrov,
A. Bozek,
M. Bračko,
P. Branchini,
T. E. Browder,
A. Budano
, et al. (189 additional authors not shown)
Abstract:
We report the results of the first search for the decay $B_s^0\rightarrowπ^0π^0$ using $121.4\ \rm fb^{-1}$ of data collected at the $Υ(5\rm S)$ resonance with the Belle detector at the KEKB asymmetric-energy $e^+e^-$ collider. We observe no signal and set a 90\% confidence level upper limit of $7.7\times 10^{-6}$ on the $B_s^0\rightarrowπ^0π^0$ decay branching fraction.
Submitted 20 January, 2023;
originally announced January 2023.
-
Generating Realistic Brain MRIs via a Conditional Diffusion Probabilistic Model
Authors:
Wei Peng,
Ehsan Adeli,
Tomas Bosschieter,
Sang Hyun Park,
Qingyu Zhao,
Kilian M. Pohl
Abstract:
As acquiring MRIs is expensive, neuroscience studies struggle to attain a sufficient number of them for properly training deep learning models. This challenge could be reduced by MRI synthesis, for which Generative Adversarial Networks (GANs) are popular. GANs, however, are commonly unstable and struggle with creating diverse and high-quality data. A more stable alternative is Diffusion Probabilistic Models (DPMs) with a fine-grained training strategy. To overcome their need for extensive computational resources, we propose a conditional DPM (cDPM) with a memory-efficient process that generates realistic-looking brain MRIs. To this end, we train a 2D cDPM to generate an MRI subvolume conditioned on another subset of slices from the same MRI. By generating slices using arbitrary combinations between condition and target slices, the model only requires limited computational resources to learn interdependencies between slices even if they are spatially far apart. After having learned these dependencies via an attention network, a new anatomy-consistent 3D brain MRI is generated by repeatedly applying the cDPM. Our experiments demonstrate that our method can generate high-quality 3D MRIs that share a similar distribution to real MRIs while still diversifying the training set. The code is available at https://github.com/xiaoiker/mask3DMRI_diffusion and also will be released as part of MONAI, at https://github.com/Project-MONAI/GenerativeModels.
Submitted 7 September, 2023; v1 submitted 15 December, 2022;
originally announced December 2022.
-
Gate Error Analysis of Tunable Coupling Architecture in the Large-scale Superconducting Quantum System
Authors:
Dowon Baek,
Seong Hyeon Park,
Suhwan Choi,
Chanwoo Yoo,
Seungyong Hahn
Abstract:
In this paper, we examine various software and hardware strategies for implementing a high-fidelity controlled-Z gate in a large-scale quantum system by solving the system's Hamiltonian with the Lindblad master equation. First, we show that the optimal single-parameter pulse achieves a gate error on the order of $10^{-4}$ for the 40 ns controlled-Z gate in the 4-qubit system. Second, we illustrate that the pulse optimized in the isolated 2-qubit system must be further optimized in the larger-scale system to achieve errors lower than the fault-tolerant threshold. Lastly, we explain that the hardware parameter regions with low gate fidelities are characterized by resonances in the large-scale quantum system. Our study provides software-oriented and hardware-level guidelines for building a large-scale fault-tolerant quantum system.
Submitted 8 December, 2022;
originally announced December 2022.
-
Shape optimization of superconducting transmon qubit for low surface dielectric loss
Authors:
Sungjun Eun,
Seong Hyeon Park,
Kyungsik Seo,
Kibum Choi,
Seungyong Hahn
Abstract:
Surface dielectric loss of superconducting transmon qubits is believed to be one of the dominant sources of decoherence. Reducing the surface dielectric loss of superconducting qubits is known to be a great challenge for achieving a high quality factor and a long relaxation time ($T_{1}$). Changing the geometry of the capacitor pads and junction wire of a transmon qubit makes it possible to engineer the surface dielectric loss. In this paper, we present a shape optimization approach for reducing surface dielectric loss in transmon qubits. The capacitor pad and junction wire of the transmon qubit are shaped as spline curves and optimized through a combination of the finite-element method and a global optimization algorithm. Then, we compare the surface participation ratio, which represents the portion of electric energy stored in each dielectric layer and is proportional to two-level system (TLS) loss, of the optimized structure and existing geometries to show the effectiveness of our approach. The results suggest that the participation ratios of the capacitor pad and junction wire can be reduced by 16% and 26%, respectively, compared to previous designs through shape optimization, while the overall footprint and anharmonicity maintain acceptable values. As a result, the TLS-limited quality factor and corresponding $T_{1}$ were increased by approximately 21.6%.
Submitted 25 November, 2022;
originally announced November 2022.
-
Perception-Oriented Single Image Super-Resolution using Optimal Objective Estimation
Authors:
Seung Ho Park,
Young Su Moon,
Nam Ik Cho
Abstract:
Single-image super-resolution (SISR) networks trained with perceptual and adversarial losses provide high-contrast outputs compared to those of networks trained with distortion-oriented losses, such as L1 or L2. However, it has been shown that using a single perceptual loss is insufficient for accurately restoring locally varying diverse shapes in images, often generating undesirable artifacts or unnatural details. For this reason, combinations of various losses, such as perceptual, adversarial, and distortion losses, have been attempted, yet it remains challenging to find optimal combinations. Hence, in this paper, we propose a new SISR framework that applies optimal objectives for each region to generate plausible results in overall areas of high-resolution outputs. Specifically, the framework comprises two models: a predictive model that infers an optimal objective map for a given low-resolution (LR) input and a generative model that applies a target objective map to produce the corresponding SR output. The generative model is trained over our proposed objective trajectory representing a set of essential objectives, which enables the single network to learn various SR results corresponding to combined losses on the trajectory. The predictive model is trained using pairs of LR images and corresponding optimal objective maps searched from the objective trajectory. Experimental results on five benchmarks show that the proposed method outperforms state-of-the-art perception-driven SR methods in LPIPS, DISTS, PSNR, and SSIM metrics. The visual results also demonstrate the superiority of our method in perception-oriented reconstruction. The code and models are available at https://github.com/seungho-snu/SROOE.
Submitted 11 March, 2023; v1 submitted 24 November, 2022;
originally announced November 2022.
-
A Formal CHERI-C Semantics for Verification
Authors:
Seung Hoon Park,
Rekha Pai,
Tom Melham
Abstract:
CHERI-C extends the C programming language by adding hardware capabilities, ensuring a certain degree of memory safety while remaining efficient. Capabilities can also be employed for higher-level security measures, such as software compartmentalization, that have to be used correctly to achieve the desired security guarantees. As the extension changes the semantics of C, new theories and tooling are required to reason about CHERI-C code and verify correctness. In this work, we present a formal memory model that provides a memory semantics for CHERI-C programs. We present a generalised theory with rich properties suitable for verification and potentially other types of analyses. Our theory is backed by an Isabelle/HOL formalisation that also generates an OCaml executable instance of the memory model. The verified and extracted code is then used to instantiate the parametric Gillian program analysis framework, with which we can perform concrete execution of CHERI-C programs. The tool can run a CHERI-C test suite, demonstrating the correctness of our tool, and catch a good class of safety violations that the CHERI hardware might miss.
Submitted 26 January, 2023; v1 submitted 14 November, 2022;
originally announced November 2022.
-
X-ray Free Electron Laser Studies of Electron and Phonon Dynamics of Graphene Adsorbed on Copper
Authors:
Hirohito Ogasawara,
Han Wang,
Jörgen Gladh,
Alessandro Gallo,
Ralph Page,
Johannes Voss,
Alan Luntz,
Elias Diesen,
Frank Abild-Pedersen,
Anders Nilsson,
Markus Soldemo,
Marc Zajac,
Andrew Attar,
Michelle E. Chen,
Sang Wan Cho,
Abhishek Katoch,
Ki-Jeong Kim,
Kyung Hwan Kim,
Minseok Kim,
Soonnam Kwon,
Sang Han Park,
Henrique Ribeiro,
Sami Sainio,
Hsin-Yi Wang,
Cheolhee Yang
, et al. (1 additional author not shown)
Abstract:
We report optical pumping and X-ray absorption spectroscopy experiments at the PAL free electron laser that directly probe the electron dynamics of a graphene monolayer adsorbed on copper in the femtosecond regime. By analyzing the results with ab-initio theory, we infer that the excitation of graphene is dominated by indirect excitation from hot electron-hole pairs created in the copper by the optical laser pulse. However, once the excitation is created in graphene, its decay follows a similar path as in many previous studies of graphene adsorbed on semiconductors, i.e., rapid excitation of SCOPS (Strongly Coupled Optical Phonons) and eventual thermalization. It is likely that the lifetime of the hot electron-hole pairs in copper governs the lifetime of the electronic excitation of the graphene.
Submitted 1 November, 2022;
originally announced November 2022.
-
Near-Infrared and Optical Observations of Type Ic SN 2021krf: Luminous Late-time Emission and Dust Formation
Authors:
Aravind P. Ravi,
Jeonghee Rho,
Sangwook Park,
Seong Hyun Park,
Sung-Chul Yoon,
T. R. Geballe,
Jozsef Vinko,
Samaporn Tinyanont,
K. Azalee Bostroem,
Jamison Burke,
Daichi Hiramatsu,
D. Andrew Howell,
Curtis McCully,
Megan Newsome,
Estefania Padilla Gonzalez,
Craig Pellegrino,
Regis Cartier,
Tyler Pritchard,
Morten Andersen,
Sergey Blinnikov,
Yize Dong,
Peter Blanchard,
Charles D. Kilpatrick,
Peter Hoeflich,
Stefano Valenti
, et al. (7 additional authors not shown)
Abstract:
We present near-infrared (NIR) and optical observations of the Type Ic supernova (SN Ic) SN 2021krf obtained between days 13 and 259 at several ground-based telescopes. The NIR spectrum at day 68 exhibits a rising $K$-band continuum flux density longward of $\sim$ 2.0 $μ$m, and a late-time optical spectrum at day 259 shows strong [O I] 6300 and 6364 Å emission-line asymmetry, both indicating the presence of dust, likely formed in the SN ejecta. We estimate a carbon-grain dust mass of $\sim$ 2 $\times$ 10$^{-5}$ M$_{\odot}$ and a dust temperature of $\sim$ 900 - 1200 K associated with this rising continuum and suggest the dust has formed in SN ejecta. Utilizing the one-dimensional multigroup radiation hydrodynamics code STELLA, we present two degenerate progenitor solutions for SN 2021krf, characterized by C-O star masses of 3.93 and 5.74 M$_{\odot}$, but with the same best-fit $^{56}$Ni mass of 0.11 M$_{\odot}$ for early times (0-70 days). At late times (70-300 days), optical light curves of SN 2021krf decline substantially more slowly than that expected from $^{56}$Co radioactive decay. Lack of H and He lines in the late-time SN spectrum suggests the absence of significant interaction of the ejecta with the circumstellar medium. We reproduce the entire bolometric light curve with a combination of radioactive decay and an additional powering source in the form of a central engine of a millisecond pulsar with a magnetic field smaller than that of a typical magnetar.
Submitted 19 April, 2023; v1 submitted 31 October, 2022;
originally announced November 2022.
-
Non-Hermitian chiral degeneracy of gated graphene metasurfaces
Authors:
Soojeong Baek,
Sang Hyun Park,
Donghak Oh,
Kanghee Lee,
Sangha Lee,
Hosub Lim,
Taewoo Ha,
Hyun-Sung Park,
Shuang Zhang,
Lan Yang,
Bumki Min,
Teun-Teun Kim
Abstract:
Non-Hermitian degeneracies, also known as exceptional points (EPs), have been the focus of much attention due to their singular eigenvalue surface structure. Nevertheless, as pertaining to a non-Hermitian metasurface platform, the reduction of an eigenspace dimensionality at the EP has been investigated mostly in a passive repetitive manner. Here, we propose an electrical and spectral way of resolving chiral EPs and clarifying the consequences of chiral mode collapsing of a non-Hermitian gated graphene metasurface. More specifically, the measured non-Hermitian Jones matrix in parameter space enables the quantification of nonorthogonality of polarisation eigenstates and half-integer topological charges associated with a chiral EP. Interestingly, the output polarisation state can be made orthogonal to the coalesced polarisation eigenstate of the metasurface, revealing the missing dimension at the chiral EP. In addition, the maximal nonorthogonality at the chiral EP leads to a blocking of one of the cross-polarised transmission pathways and, consequently, the observation of enhanced asymmetric polarisation conversion. We anticipate that electrically controllable non-Hermitian metasurface platforms can serve as an interesting framework for the investigation of rich non-Hermitian polarisation dynamics around chiral EPs.
Submitted 22 August, 2022;
originally announced August 2022.
-
Feature Re-calibration based Multiple Instance Learning for Whole Slide Image Classification
Authors:
Philip Chikontwe,
Soo Jeong Nam,
Heounjeong Go,
Meejeong Kim,
Hyun Jung Sung,
Sang Hyun Park
Abstract:
Whole slide image (WSI) classification is a fundamental task for the diagnosis and treatment of diseases; however, curation of accurate labels is time-consuming and limits the application of fully-supervised methods. To address this, multiple instance learning (MIL) is a popular method that poses classification as a weakly supervised learning task with slide-level labels only. While current MIL methods apply variants of the attention mechanism to re-weight instance features with stronger models, scant attention is paid to the properties of the data distribution. In this work, we propose to re-calibrate the distribution of a WSI bag (instances) by using the statistics of the max-instance (critical) feature. We assume that in binary MIL, positive bags have larger feature magnitudes than negatives; thus, we can enforce the model to maximize the discrepancy between bags with a metric feature loss that models positive bags as out-of-distribution. To achieve this, unlike existing MIL methods that use single-batch training modes, we propose balanced-batch sampling to effectively use the feature loss, i.e., with (+/-) bags simultaneously. Further, we employ a position encoding module (PEM) to model spatial/morphological information, and perform pooling by multi-head self-attention (PSMA) with a Transformer encoder. Experimental results on existing benchmark datasets show our approach is effective and improves over state-of-the-art MIL methods.
Submitted 21 July, 2022; v1 submitted 22 June, 2022;
originally announced June 2022.
-
Plasmonic gain in current biased tilted Dirac nodes
Authors:
Sang Hyun Park,
Michael Sammon,
Eugene Mele,
Tony Low
Abstract:
Surface plasmons, which allow extreme confinement of light, suffer from high intrinsic electronic losses. It has been shown that stimulated emission of electrons can transfer energy to plasmons and compensate for the high intrinsic losses. To date, these realizations have relied on introducing an external gain medium coupled to the surface plasmon. Here, we propose that plasmons in two-dimensional materials with closely located electron and hole Fermi pockets can experience gain, when an electrical current bias is applied along the displaced electron-hole pockets, without the need for an external gain medium. As a prototypical example, we consider WTe$_2$ from the family of 1T$'$-MX$_2$ materials, whose electronic structure can be described within a type-II tilted massive Dirac model. We find that the nonlocal plasmonic response experiences prominent gain for experimentally accessible currents on the order of mA$μ$m$^{-1}$. Furthermore, the group velocity of the plasmon found from the isofrequency curves implies that the amplified plasmons are highly collimated along a direction perpendicular to the Dirac node tilt when the electrical current is applied along it.
Submitted 8 June, 2022;
originally announced June 2022.
-
Hybrid Numerical Modeling of Ballistic Clay under Low-Speed Impact using Artificial Neural Networks
Authors:
YeonSu Kim,
Yoon A Kim,
Seo Hwee Park,
YunHo Kim
Abstract:
Roma Plastilina No. 1 clay has been widely used as a conservative boundary condition in bulletproof vests, namely to play the role of a human body. Interestingly, the effect of this boundary condition on the ballistic performance of the vests is indiscernible. Moreover, back face deformation should be characterized by measuring the indentation in the deformed clay, which is important for determining the lethality of gunshots. Therefore, several studies have focused on modeling not only bulletproof vests but also the clay backing material. Despite various attempts to develop a suitable numerical model, determining the appropriate physical parameters that can capture the high-strain-rate behavior of clay is still challenging. In this study, we predicted indentation depth in clay using an artificial neural network (ANN) and determined the optimal material parameters required for a finite element method (FEM)-based model using an inverse tracking method. Our ANN-FEM hybrid model successfully optimized high-strain-rate material parameters without the need for any independent mechanical tests. The proposed novel model achieved a high prediction accuracy of over 98% referring impact cases.
Submitted 30 May, 2022;
originally announced May 2022.
-
CAD: Co-Adapting Discriminative Features for Improved Few-Shot Classification
Authors:
Philip Chikontwe,
Soopil Kim,
Sang Hyun Park
Abstract:
Few-shot classification is a challenging problem that aims to learn a model that can adapt to unseen classes given a few labeled samples. Recent approaches pre-train a feature extractor, and then fine-tune for episodic meta-learning. Other methods leverage spatial features to learn pixel-level correspondence while jointly training a classifier. However, results using such approaches show marginal improvements. In this paper, inspired by the transformer style self-attention mechanism, we propose a strategy to cross-attend and re-weight discriminative features for few-shot classification. Given a base representation of support and query images after global pooling, we introduce a single shared module that projects features and cross-attends in two aspects: (i) query to support, and (ii) support to query. The module computes attention scores between features to produce an attention pooled representation of features in the same class that is later added to the original representation followed by a projection head. This effectively re-weights features in both aspects (i & ii) to produce features that better facilitate improved metric-based meta-learning. Extensive experiments on public benchmarks show our approach outperforms state-of-the-art methods by 3%~5%.
Submitted 25 March, 2022;
originally announced March 2022.
-
Ultrafast X-ray imaging of the light-induced phase transition in VO2
Authors:
Allan S. Johnson,
Daniel Pérez-Salinas,
Khalid M. Siddiqui,
Sungwon Kim,
Sungwook Choi,
Klara Volckaert,
Paulina E. Majchrzak,
Søren Ulstrup,
Naman Agarwal,
Kent Hallman,
Richard F. Haglund Jr.,
Christian M. Günther,
Bastian Pfau,
Stefan Eisebitt,
Dirk Backes,
Francesco Maccherozzi,
Ann Fitzpatrick,
Sarnjeet Dhesi,
Pierluigi Gargiani,
Manuel Valvidares,
Nongnuch Artrith,
Frank de Groot,
Hyeongi Choi,
Dogeun Jang,
Abhishek Katoch
, et al. (4 additional authors not shown)
Abstract:
Using light to control transient phases in quantum materials is an emerging route to engineer new properties and functionality, with both thermal and non-thermal phases observed out of equilibrium. Transient phases are expected to be heterogeneous, either through photo-generated domain growth or by generating topological defects, and this impacts the dynamics of the system. However, this nanoscale heterogeneity has not been directly observed. Here we use time- and spectrally resolved coherent X-ray imaging to track the prototypical light induced insulator-to-metal phase transition in vanadium dioxide on the nanoscale with femtosecond time resolution. We show that the early-time dynamics are independent of the initial spatial heterogeneity and observe a 200 fs switch to the metallic phase. A heterogeneous response emerges only after hundreds of picoseconds. Through spectroscopic imaging, we reveal that the transient metallic phase is a highly orthorhombically strained rutile metallic phase, an interpretation that is in contrast to those based on spatially averaged probes. Our results demonstrate the critical importance of spatially and spectrally resolved measurements for understanding and interpreting the transient phases of quantum materials.
Submitted 15 January, 2023; v1 submitted 17 February, 2022;
originally announced February 2022.
-
Flexible Style Image Super-Resolution using Conditional Objective
Authors:
Seung Ho Park,
Young Su Moon,
Nam Ik Cho
Abstract:
Recent studies have significantly enhanced the performance of single-image super-resolution (SR) using convolutional neural networks (CNNs). While there can be many high-resolution (HR) solutions for a given input, most existing CNN-based methods do not explore alternative solutions during the inference. A typical approach to obtaining alternative SR results is to train multiple SR models with different loss weightings and exploit the combination of these models. Instead of using multiple models, we present a more efficient method to train a single adjustable SR model on various combinations of losses by taking advantage of multi-task learning. Specifically, we optimize an SR model with a conditional objective during training, where the objective is a weighted sum of multiple perceptual losses at different feature levels. The weights vary according to given conditions, and the set of weights is defined as a style controller. Also, we present an architecture appropriate for this training scheme, which is the Residual-in-Residual Dense Block equipped with spatial feature transformation layers. At the inference phase, our trained model can generate locally different outputs conditioned on the style control map. Extensive experiments show that the proposed SR model produces various desirable reconstructions without artifacts and yields comparable quantitative performance to state-of-the-art SR methods.
Submitted 8 March, 2022; v1 submitted 13 January, 2022;
originally announced January 2022.
-
Measurements of the branching fractions of $Ξ_c^0 \to ΛK_S^0$, $Ξ_c^0 \to Σ^0 K_S^0$, and $Ξ_c^0 \to Σ^+ K^-$ decays at Belle
Authors:
Belle collaboration,
Y. Li,
J. X. Cui,
S. Jia,
C. P. Shen,
I. Adachi,
J. K. Ahn,
H. Aihara,
S. Al Said,
D. M. Asner,
H. Atmacan,
T. Aushev,
R. Ayad,
V. Babu,
S. Bahinipati,
P. Behera,
K. Belous,
J. Bennett,
M. Bessner,
V. Bhardwaj,
B. Bhuyan,
T. Bilka,
A. Bobrov,
D. Bodrov,
G. Bonvicini
, et al. (191 additional authors not shown)
Abstract:
Using the entire data sample of $980\mathrm{~fb}^{-1}$ collected with the Belle detector at the KEKB asymmetric-energy $e^+e^-$ collider, we present measurements of the branching fractions of the Cabibbo-favored decays $Ξ_c^0 \to ΛK_S^0$, $Ξ_c^0 \to Σ^0 K_S^0$, and $Ξ_c^0 \to Σ^+ K^-$. Taking the decay $Ξ_c^0 \to Ξ^- π^+$ as the normalization mode, we measure the branching fraction ratio ${\cal B}(Ξ_c^0 \to ΛK_S^0)/{\cal B}(Ξ_c^0 \to Ξ^- π^+) = 0.229\pm0.008\pm0.012$ with improved precision, and measure the branching fraction ratios ${\cal B}(Ξ_c^0 \to Σ^0 K_S^0)/{\cal B}(Ξ_c^0 \to Ξ^- π^+) = 0.038\pm0.006\pm0.004$ and ${\cal B}(Ξ_c^0 \to Σ^+ K^-)/{\cal B}(Ξ_c^0 \to Ξ^- π^+) = 0.123\pm0.007\pm0.010$ for the first time. Taking into account the branching fraction of the normalization mode, the absolute branching fractions are determined to be ${\cal B}(Ξ_c^0 \to ΛK_S^0) = (3.27\pm0.11\pm0.17\pm0.73) \times 10^{-3}$, ${\cal B}(Ξ_c^0 \to Σ^0 K_S^0) = (0.54\pm 0.09\pm 0.06\pm 0.12) \times 10^{-3}$, and ${\cal B}(Ξ_c^0 \to Σ^+ K^-) = (1.76\pm 0.10\pm0.14\pm 0.39) \times 10^{-3}$. The first and second uncertainties above are statistical and systematic, respectively, while the third ones arise from the uncertainty of the branching fraction of $Ξ_c^0 \to Ξ^- π^+$.
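The relation between the quoted ratios and the absolute branching fractions can be checked with simple arithmetic. The sketch below uses only the numbers stated in the abstract; the function names are illustrative, and the normalization branching fraction is not quoted in the abstract, so it is inferred here from the quoted values rather than taken from any reference.

```python
# Ratios to the normalization mode, as quoted in the abstract:
# (central value, statistical, systematic)
ratios = {
    "Lambda K_S0": (0.229, 0.008, 0.012),
    "Sigma0 K_S0": (0.038, 0.006, 0.004),
    "Sigma+ K-": (0.123, 0.007, 0.010),
}

# Absolute branching fractions in units of 1e-3, as quoted in the abstract:
# (central value, statistical, systematic, normalization-mode uncertainty)
absolute = {
    "Lambda K_S0": (3.27, 0.11, 0.17, 0.73),
    "Sigma0 K_S0": (0.54, 0.09, 0.06, 0.12),
    "Sigma+ K-": (1.76, 0.10, 0.14, 0.39),
}


def implied_normalization(mode):
    """Normalization branching fraction (units of 1e-3) implied by one mode."""
    return absolute[mode][0] / ratios[mode][0]


def relative_normalization_uncertainty(mode):
    """Third uncertainty divided by the central value for one mode."""
    return absolute[mode][3] / absolute[mode][0]


for mode in ratios:
    print(mode,
          round(implied_normalization(mode), 2),
          round(relative_normalization_uncertainty(mode), 3))
```

All three modes imply nearly the same normalization value and the same relative third uncertainty, which is what one expects if each absolute result is the ratio multiplied by a single normalization branching fraction.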
Submitted 14 December, 2021; v1 submitted 17 November, 2021;
originally announced November 2021.
-
Uncertainty-Aware Semi-Supervised Few Shot Segmentation
Authors:
Soopil Kim,
Philip Chikontwe,
Sang Hyun Park
Abstract:
Few shot segmentation (FSS) aims to learn pixel-level classification of a target object in a query image using only a few annotated support samples. This is challenging as it requires modeling appearance variations of target objects and the diverse visual cues between query and support images with limited information. To address this problem, we propose a semi-supervised FSS strategy that leverages additional prototypes from unlabeled images with uncertainty guided pseudo label refinement. To obtain reliable prototypes from unlabeled images, we meta-train a neural network to jointly predict segmentation and estimate the uncertainty of predictions. We employ the uncertainty estimates to exclude predictions with high degrees of uncertainty for pseudo label construction to obtain additional prototypes based on the refined pseudo labels. During inference, query segmentation is predicted using prototypes from both support and unlabeled images including low-level features of the query images. Our approach is end-to-end and can easily supplement existing approaches without the requirement of additional training to employ unlabeled samples. Extensive experiments on PASCAL-$5^i$ and COCO-$20^i$ demonstrate that our model can effectively remove unreliable predictions to refine pseudo labels and significantly improve upon state-of-the-art performances.
Submitted 17 October, 2021;
originally announced October 2021.
-
Content Preserving Image Translation with Texture Co-occurrence and Spatial Self-Similarity for Texture Debiasing and Domain Adaptation
Authors:
Myeongkyun Kang,
Dongkyu Won,
Miguel Luna,
Philip Chikontwe,
Kyung Soo Hong,
June Hong Ahn,
Sang Hyun Park
Abstract:
Models trained on datasets with texture bias usually perform poorly on out-of-distribution samples since biased representations are embedded into the model. Recently, various image translation and debiasing methods have attempted to disentangle texture biased representations for downstream tasks, but accurately discarding biased features without altering other relevant information is still challenging. In this paper, we propose a novel framework that leverages image translation to generate additional training images using the content of a source image and the texture of a target image with a different bias property to explicitly mitigate texture bias when training a model on a target task. Our model ensures texture similarity between the target and generated images via a texture co-occurrence loss while preserving content details from source images with a spatial self-similarity loss. Both the generated and original training images are combined to train improved classification or segmentation models robust to inconsistent texture bias. Evaluation on five classification- and two segmentation-datasets with known texture biases demonstrates the utility of our method, and reports significant improvements over recent state-of-the-art methods in all cases.
Submitted 3 January, 2023; v1 submitted 15 October, 2021;
originally announced October 2021.
-
First Demonstration of the Korean eLoran Accuracy in a Narrow Waterway Using Improved ASF Maps
Authors:
Woohyun Kim,
Pyo-Woong Son,
Sul Gee Park,
Sang Hyun Park,
Jiwon Seo
Abstract:
The vulnerabilities of global navigation satellite systems (GNSSs) to radio frequency jamming and spoofing have attracted significant research attention. In particular, the large-scale jamming incidents that occurred in South Korea substantiate the practical importance of implementing a complementary navigation system. This letter briefly summarizes the efforts of South Korea to deploy an enhanced long-range navigation (eLoran) system, which is a terrestrial low-frequency radio navigation system that can complement GNSSs. After four years of research and development, the Korean eLoran testbed system was recently deployed and has been operational since June 1, 2021. Although its initial performance at sea is satisfactory, navigation through a narrow waterway is still challenging because a complete survey of the additional secondary factor (ASF), which is the largest source of error for eLoran, is practically difficult in a narrow waterway. This letter proposes an alternative way to survey the ASF in a narrow waterway and improves the ASF map generation methods. Moreover, the performance of the proposed approach was validated experimentally.
Submitted 28 September, 2021; v1 submitted 18 September, 2021;
originally announced September 2021.
-
A note on degenerate generalized Laguerre polynomials and Lah numbers
Authors:
Taekyun Kim,
Dmitry V. Dolgy,
Dae san Kim,
Hye Kyung Kim,
Seong Ho Park
Abstract:
The aim of this paper is to introduce the degenerate generalized Laguerre polynomials as the degenerate version of the generalized Laguerre polynomials and to derive some properties related to those polynomials and Lah numbers, including an explicit expression, a Rodrigues' type formula and expressions for the derivatives.
The novelty of the present paper is that it is the first paper on degenerate versions of orthogonal polynomials.
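For readers unfamiliar with the Lah numbers mentioned above, the following minimal sketch computes the classical (non-degenerate) unsigned Lah numbers from their standard closed form and checks the identity that expresses a rising factorial in terms of falling factorials. It does not implement the degenerate polynomials of the paper; the function names are illustrative.

```python
import math


def lah(n, k):
    """Unsigned Lah number L(n, k) = C(n-1, k-1) * n! / k! for 1 <= k <= n."""
    if k < 1 or k > n:
        return 0
    return math.comb(n - 1, k - 1) * math.factorial(n) // math.factorial(k)


def rising_factorial(x, n):
    """x (x+1) ... (x+n-1)."""
    result = 1
    for i in range(n):
        result *= x + i
    return result


def falling_factorial(x, k):
    """x (x-1) ... (x-k+1)."""
    result = 1
    for i in range(k):
        result *= x - i
    return result


def rising_via_lah(x, n):
    """Rising factorial written as a Lah-weighted sum of falling factorials."""
    return sum(lah(n, k) * falling_factorial(x, k) for k in range(1, n + 1))


print([lah(4, k) for k in range(1, 5)])
print(rising_factorial(5, 3), rising_via_lah(5, 3))
```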
Submitted 6 July, 2021;
originally announced July 2021.
-
Rediscovery of $B^0\to J\mskip 1mu / ψ\mskip 2mu K^0_{\scriptscriptstyle L}$ at Belle II
Authors:
Belle II Collaboration,
F. Abudinén,
I. Adachi,
R. Adak,
K. Adamczyk,
P. Ahlburg,
J. K. Ahn,
H. Aihara,
N. Akopov,
A. Aloisio,
F. Ameli,
L. Andricek,
N. Anh Ky,
D. M. Asner,
H. Atmacan,
V. Aulchenko,
T. Aushev,
V. Aushev,
T. Aziz,
V. Babu,
S. Bacher,
S. Baehr,
S. Bahinipati,
A. M. Bakich,
P. Bambade
, et al. (523 additional authors not shown)
Abstract:
We present preliminary results on the reconstruction of the $B^0\to J\mskip 1mu / ψ\mskip 2mu K^0_{\scriptscriptstyle L}$ decay, where $J\mskip 1mu / ψ\mskip 2mu\toμ^+μ^-$ or $e^+e^-$. Using a dataset corresponding to a luminosity of $62.8\pm0.6\mbox{fb}^{-1}$ collected by the Belle II experiment at the SuperKEKB asymmetric energy $e^+e^-$ collider, we measure a total of $267\pm21$ candidates with $J\mskip 1mu / ψ\mskip 2mu\toμ^+μ^-$ and $226\pm20$ with $J\mskip 1mu / ψ\mskip 2mu\to e^+e^-$. The quoted errors are statistical only.
Submitted 25 June, 2021;
originally announced June 2021.
-
A study on properties of degenerate and zero-truncated degenerate Poisson random variables
Authors:
Taekyun Kim,
Dae san Kim,
Hyunseok Lee,
Seong Ho Park,
Jongkyum Kwon
Abstract:
Carlitz [2] initiated a study on degenerate versions of Bernoulli and Euler numbers, which has been extended recently to research on various degenerate versions of quite a few special numbers and polynomials. They have been explored using several different tools, including generating functions, combinatorial methods, $p$-adic analysis, umbral calculus, special functions, differential equations, and probability theory. The degenerate Poisson random variables are degenerate versions of the Poisson random variables. In [6], the degenerate binomial and degenerate Poisson random variables are studied in relation to the degenerate Lah-Bell polynomials. Among other things, it is shown that the rising factorial moments of the degenerate Poisson random variable are expressed by the degenerate Lah-Bell polynomials. Also, it is shown that the probability-generating function of the degenerate Poisson random variable is equal to the generating function of the degenerate Lah-Bell polynomials. The zero-truncated Poisson distributions (also called the conditional or the positive Poisson distributions) are certain discrete probability distributions whose supports are the set of positive integers. In [10], the zero-truncated degenerate Poisson random variables, whose probability mass functions are a natural extension of the zero-truncated Poisson random variables, are introduced and various properties of those random variables are investigated. Specifically, for those distributions, the expectation, the variance, the n-th moment, the cumulative distribution function, and certain expressions for the probability function of a finite sum of independent degenerate zero-truncated Poisson random variables with equal and unequal parameters are studied.
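As background for the zero-truncated distributions discussed above, the sketch below implements the classical (non-degenerate) zero-truncated Poisson distribution, whose support is the positive integers. It is not the degenerate version studied in the paper; the function names and the parameter name `lam` are illustrative.

```python
import math


def zero_truncated_poisson_pmf(k, lam):
    """P(X = k) for a Poisson(lam) variable conditioned on X >= 1."""
    if k < 1:
        return 0.0
    # -expm1(-lam) equals 1 - exp(-lam), the probability that X >= 1.
    return math.exp(-lam) * lam ** k / (math.factorial(k) * -math.expm1(-lam))


def zero_truncated_poisson_mean(lam):
    """Expectation lam / (1 - exp(-lam)) of the zero-truncated variable."""
    return lam / -math.expm1(-lam)


lam = 2.0
total = sum(zero_truncated_poisson_pmf(k, lam) for k in range(1, 60))
mean = sum(k * zero_truncated_poisson_pmf(k, lam) for k in range(1, 60))
print(total, mean, zero_truncated_poisson_mean(lam))
```

The probabilities sum to one over the positive integers, and the numerically summed mean agrees with the closed form.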
Submitted 25 June, 2021;
originally announced June 2021.
-
A Meta-Learning Approach for Medical Image Registration
Authors:
Heejung Park,
Gyeong Min Lee,
Soopil Kim,
Ga Hyung Ryu,
Areum Jeong,
Sang Hyun Park,
Min Sagong
Abstract:
Non-rigid registration is a necessary but challenging task in medical imaging studies. Recently, unsupervised registration models have shown good performance, but they often require a large-scale training dataset and long training times. Therefore, in real-world applications where only dozens to hundreds of image pairs are available, existing models cannot be practically used. To address these limitations, we propose a novel unsupervised registration model which is integrated with a gradient-based meta learning framework. In particular, we train a meta learner which finds an initialization point of parameters by utilizing a variety of existing registration datasets. To quickly adapt to various tasks, the meta learner is updated to get close to the center of parameters which are fine-tuned for each registration task. Thereby, our model can adapt to unseen domain tasks via a short fine-tuning process and perform accurate registration. To verify the superiority of our model, we train the model for various 2D medical image registration tasks such as retinal choroid Optical Coherence Tomography Angiography (OCTA), CT organs, and brain MRI scans and test on registration of retinal OCTA Superficial Capillary Plexus (SCP). In our experiments, the proposed model obtained significantly improved performance in terms of accuracy and training time compared to other registration models.
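The update described above, moving the meta learner toward the center of the task-specific fine-tuned parameters, can be sketched as follows. This is a minimal illustration that assumes a Reptile-style interpolation rule on flat parameter vectors; the abstract does not give the exact update, and `meta_update` and `step_size` are hypothetical names.

```python
def meta_update(theta, task_params, step_size):
    """Move theta toward the mean of the task-adapted parameter vectors.

    theta       : list of floats, current meta-learner parameters
    task_params : list of parameter lists, one per task after fine-tuning
    step_size   : float in (0, 1], how far to move toward the center
    """
    n_tasks = len(task_params)
    # Center of the fine-tuned parameters, computed coordinate by coordinate.
    center = [sum(p[i] for p in task_params) / n_tasks for i in range(len(theta))]
    # Interpolate between the current parameters and that center.
    return [t + step_size * (c - t) for t, c in zip(theta, center)]


theta = [0.0, 0.0]
adapted = [[1.0, 2.0], [3.0, 4.0]]  # stand-ins for two fine-tuned task models
print(meta_update(theta, adapted, step_size=0.5))
```

With `step_size=1.0` the meta learner jumps directly to the center; smaller values give a more conservative update.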
Submitted 21 April, 2021;
originally announced April 2021.
-
Measurement of the branching fractions of $B\toη' K$ decays using 2019/2020 Belle II data
Authors:
Belle II Collaboration,
F. Abudinén,
I. Adachi,
R. Adak,
K. Adamczyk,
P. Ahlburg,
J. K. Ahn,
H. Aihara,
N. Akopov,
A. Aloisio,
F. Ameli,
L. Andricek,
N. Anh Ky,
D. M. Asner,
H. Atmacan,
V. Aulchenko,
T. Aushev,
V. Aushev,
T. Aziz,
V. Babu,
S. Bacher,
S. Baehr,
S. Bahinipati,
A. M. Bakich,
P. Bambade
, et al. (523 additional authors not shown)
Abstract:
This note describes the rediscovery of $B\toη' K$ decays in Belle II data, in both the charged and neutral final states: $B^0\toη' K_S$ and $B^\pm\toη' K^\pm$. The $η'$ is searched for in two decay modes: $η'\toηπ^+π^-$ with $η\toγγ$, and $η'\toργ$. The analysis uses data collected in 2019 and 2020 at the SuperKEKB asymmetric $e^+e^-$ collider, with an integrated luminosity of $62.8~fb^{-1}$, corresponding to $68.2$ million $B\bar{B}$ pairs produced. The signal yield is obtained via an unbinned maximum likelihood fit to signal-sensitive variables, yielding the branching ratios:
$$\mathcal{B}\left(B^\pm\toη'K^\pm\right) = \left(63.4~^{+3.4}_{-3.3}\,(stat)\,\pm3.2\,(syst)\,\right) \times10^{-6} $$
$$\mathcal{B}\left(B^0\toη'K_S\right) = \left(59.9~^{+5.8}_{-5.5}\,(stat)\,\pm2.9\,(syst)\,\right) \times10^{-6} $$ which are consistent with the world average.
Submitted 12 May, 2021; v1 submitted 13 April, 2021;
originally announced April 2021.
-
Self-Supervised Learning based CT Denoising using Pseudo-CT Image Pairs
Authors:
Dongkyu Won,
Euijin Jung,
Sion An,
Philip Chikontwe,
Sang Hyun Park
Abstract:
Recently, self-supervised learning methods able to perform image denoising without ground truth labels have been proposed. These methods create low-quality images by adding random or Gaussian noise to images and then train a model for denoising. Ideally, it would be beneficial if one could generate high-quality CT images with only a few training samples via self-supervision. However, the performance of CT denoising is generally limited due to the complexity of CT noise. To address this problem, we propose a novel self-supervised learning-based CT denoising method. In particular, we pre-train CT denoising and noise models that can predict CT noise from Low-dose CT (LDCT) using available LDCT and Normal-dose CT (NDCT) pairs. For a given test LDCT, we generate Pseudo-LDCT and NDCT pairs using the pre-trained denoising and noise models and then update the parameters of the denoising model using these pairs to remove noise in the test LDCT. To make realistic Pseudo-LDCT, we train multiple noise models from individual images and generate the noise using the ensemble of noise models. We evaluate our method on the 2016 AAPM Low-Dose CT Grand Challenge dataset. The proposed ensemble noise model can generate realistic CT noise, and thus our method significantly improves the denoising performance of existing denoising models trained by supervised and self-supervised learning.
Submitted 6 April, 2021;
originally announced April 2021.
-
LaPred: Lane-Aware Prediction of Multi-Modal Future Trajectories of Dynamic Agents
Authors:
ByeoungDo Kim,
Seong Hyeon Park,
Seokhwan Lee,
Elbek Khoshimjonov,
Dongsuk Kum,
Junsoo Kim,
Jeong Soo Kim,
Jun Won Choi
Abstract:
In this paper, we address the problem of predicting the future motion of a dynamic agent (called a target agent) given its current and past states as well as the information on its environment. It is paramount to develop a prediction model that can exploit the contextual information in both static and dynamic environments surrounding the target agent and generate diverse trajectory samples that are meaningful in a traffic context. We propose a novel prediction model, referred to as the lane-aware prediction (LaPred) network, which uses the instance-level lane entities extracted from a semantic map to predict the multi-modal future trajectories. For each lane candidate found in the neighborhood of the target agent, LaPred extracts the joint features relating the lane and the trajectories of the neighboring agents. Then, the features for all lane candidates are fused with the attention weights learned through a self-supervised learning task that identifies the lane candidate likely to be followed by the target agent. Using the instance-level lane information, LaPred can produce the trajectories compliant with the surroundings better than 2D raster image-based methods and generate the diverse future trajectories given multiple lane candidates. The experiments conducted on the public nuScenes dataset and Argoverse dataset demonstrate that the proposed LaPred method significantly outperforms the existing prediction models, achieving state-of-the-art performance in the benchmarks.
Submitted 1 April, 2021;
originally announced April 2021.
-
Mixing-AdaSIN: Constructing a De-biased Dataset using Adaptive Structural Instance Normalization and Texture Mixing
Authors:
Myeongkyun Kang,
Philip Chikontwe,
Miguel Luna,
Kyung Soo Hong,
June Hong Ahn,
Sang Hyun Park
Abstract:
Following the pandemic outbreak, several works have proposed to diagnose COVID-19 with deep learning in computed tomography (CT), reporting performance on par with experts. However, models trained and tested on the same in-distribution data may rely on the inherent data biases for successful prediction, failing to generalize on out-of-distribution samples or CT with different scanning protocols. Early attempts have partly addressed bias mitigation and generalization through augmentation or re-sampling, but are still limited by collection costs and the difficulty of quantifying bias in medical images. In this work, we propose Mixing-AdaSIN, a bias mitigation method that uses a generative model to generate de-biased images by mixing texture information between different labeled CT scans with semantically similar features. Here, we use Adaptive Structural Instance Normalization (AdaSIN) to enhance de-biasing generation quality and guarantee structural consistency. Subsequently, a classifier trained with the generated images learns to correctly predict the label without bias and generalizes better. To demonstrate the efficacy of our method, we construct a biased COVID-19 vs. bacterial pneumonia dataset based on CT protocols and compare with existing state-of-the-art de-biasing methods. Our experiments show that classifiers trained with de-biased generated images report improved in-distribution performance and generalization on an external COVID-19 dataset.
Submitted 31 July, 2021; v1 submitted 26 March, 2021;
originally announced March 2021.
-
Measurement of the Resonant and Non-Resonant Branching Ratios in $Ξ_{c}^{0} \rightarrow Ξ^{0} K^+ K^-$
Authors:
J. T. McNeil,
J. Yelton,
J. Bennett,
I. Adachi,
K. Adamczyk,
J. K. Ahn,
H. Aihara,
S. Al Said,
D. M. Asner,
H. Atmacan,
V. Aulchenko,
T. Aushev,
R. Ayad,
V. Babu,
S. Bahinipati,
P. Behera,
M. Bessner,
T. Bilka,
J. Biswal,
A. Bobrov,
G. Bonvicini,
A. Bozek,
M. Bracko,
T. E. Browder,
M. Campajola
, et al. (170 additional authors not shown)
Abstract:
Using the entire data sample of $980$ $fb^{-1}$ integrated luminosity collected with the Belle detector at the KEKB asymmetric-energy $e^{+}e^{-}$ collider, we present an amplitude analysis measuring the branching fractions of the Cabibbo-allowed, $W$-exchange resonant decay $Ξ_{c}^{0} \rightarrow Ξ^{0} φ(\to K^+ K^-)$ with a polarized $φ$ and the non-resonant decay via a direct process $Ξ_{c}^{0} \rightarrow Ξ^{0} K^+ K^-$. We present these measurements, relative to the normalization mode $Ξ^{-}π^{+}$, and find branching ratios $\frac{\mathcal{B}(Ξ_{c}^{0} \rightarrow Ξ^{0} φ(\rightarrow K^{+}K^{-}))}{\mathcal{B}(Ξ_{c}^{0} \rightarrow Ξ^{-} π^{+})} = 0.036 \pm 0.004 (stat.) \pm 0.002 (syst.)$ and $\frac{\mathcal{B}(Ξ_{c}^{0} \rightarrow Ξ^{0} K^{+} K^{-})}{\mathcal{B}(Ξ_{c}^{0} \rightarrow Ξ^{-} π^{+})} = 0.039 \pm 0.004 (stat.) \pm 0.002 (syst.)$ which suggest that only minor cusping peaks occur in the combinatorial background of $Ω^{*-} \to Ξ^{0}K^{-}$ due to these $Ξ_{c}^{0}$ decays.
Submitted 10 May, 2021; v1 submitted 10 December, 2020;
originally announced December 2020.
-
Graphene plasmon-phonon coupled modes at the exceptional point
Authors:
Sang Hyun Park,
Shengxuan Xia,
Sang-Hyun Oh,
Phaedon Avouris,
Tony Low
Abstract:
Properties of graphene plasmons are greatly affected by their coupling to phonons. While such coupling has been routinely observed in both near-field and far-field graphene spectroscopy, the interplay between coupling strength and mode losses, and its exceptional point physics has not been discussed. By applying a non-Hermitian framework, we identify the transition point between strong and weak coupling as the exceptional point. Enhanced sensitivity to perturbations near the exceptional point is observed by varying the coupling strength and through gate modulation of the graphene Fermi level. Finally, we also show that the transition from strong to weak coupling is observable by changing the incident angle of radiation.
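The identification of the strong-to-weak coupling transition with an exceptional point can be illustrated with a generic two-mode non-Hermitian coupled-mode model. The symbols below ($ω_p$, $ω_{ph}$ for the uncoupled plasmon and phonon frequencies, $γ_p$, $γ_{ph}$ for their loss rates, and $g$ for the coupling strength) are assumptions of this sketch and are not taken from the paper.

```latex
H =
\begin{pmatrix}
  \omega_p - i\gamma_p & g \\
  g & \omega_{ph} - i\gamma_{ph}
\end{pmatrix},
\qquad
\omega_\pm = \frac{\omega_p + \omega_{ph}}{2}
  - i\,\frac{\gamma_p + \gamma_{ph}}{2}
  \pm \sqrt{g^2 + \frac{\bigl[(\omega_p - \omega_{ph}) - i(\gamma_p - \gamma_{ph})\bigr]^2}{4}} .

% At zero detuning, \omega_p = \omega_{ph}:
\omega_\pm = \omega_p - i\,\frac{\gamma_p + \gamma_{ph}}{2}
  \pm \sqrt{g^2 - \frac{(\gamma_p - \gamma_{ph})^2}{4}},
\qquad
g_{\mathrm{EP}} = \frac{|\gamma_p - \gamma_{ph}|}{2} .
```

In this model, for $g > g_{\mathrm{EP}}$ the square root is real and the two modes split in frequency (strong coupling); for $g < g_{\mathrm{EP}}$ it is imaginary, so the modes share one frequency but have different losses (weak coupling). At $g = g_{\mathrm{EP}}$ both the eigenvalues and the eigenvectors coalesce, which defines the exceptional point.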
Submitted 7 December, 2020;
originally announced December 2020.
-
Bidirectional RNN-based Few Shot Learning for 3D Medical Image Segmentation
Authors:
Soopil Kim,
Sion An,
Philip Chikontwe,
Sang Hyun Park
Abstract:
Segmentation of organs of interest in 3D medical images is necessary for accurate diagnosis and longitudinal studies. Though recent advances using deep learning have shown success for many segmentation tasks, large datasets are required for high performance and the annotation process is both time consuming and labor intensive. In this paper, we propose a 3D few shot segmentation framework for accurate organ segmentation using limited training samples of the target organ annotation. To achieve this, a U-Net like network is designed to predict segmentation by learning the relationship between 2D slices of support data and a query image, including a bidirectional gated recurrent unit (GRU) that learns consistency of encoded features between adjacent slices. Also, we introduce a transfer learning method to adapt the characteristics of the target image and organ by updating the model before testing with arbitrary support and query data sampled from the support data. We evaluate our proposed model using three 3D CT datasets with annotations of different organs. Our model yielded significantly improved performance over state-of-the-art few shot segmentation models and was comparable to a fully supervised model trained with more target training data.
Submitted 18 November, 2020;
originally announced November 2020.
-
Differential Emission Measure Evolution as a Precursor of Solar Flares
Authors:
C. Gontikakis,
I. Kontogiannis,
M. K. Georgoulis,
C. Guennou,
P. Syntelis,
S. H. Park,
E. Buchlin
Abstract:
We analyse the temporal evolution of the Differential Emission Measure (DEM) of solar active regions and explore its usage in solar flare prediction. The DEM maps are provided by the Gaussian Atmospheric Imaging Assembly (GAIA-DEM) archive, calculated assuming a Gaussian dependence of the DEM on the logarithmic temperature. We analyse time-series of sixteen solar active regions and a statistically significant sample of 9454 point-in-time observations corresponding to hundreds of regions observed during solar cycle 24. The time-series analysis shows that the temporal derivatives of the Emission Measure dEM/dt and the maximum DEM temperature dTmax/dt frequently exhibit high positive values a few hours before M- and X-class flares, indicating that flaring regions become brighter and hotter as the flare onset approaches. From the point-in-time observations we compute the conditional probabilities of flare occurrences using the distributions of positive values of dEM/dt and dTmax/dt and compare them with the corresponding flaring probabilities of the total unsigned magnetic flux, a conventionally used, standard flare predictor. For C-class flares, the conditional probabilities are lower than or similar to those derived for the unsigned magnetic flux, for 24- and 12-hour forecast windows. For M- and X-class flares, these probabilities are higher than those of the unsigned flux for higher parameter values. Shorter forecast windows improve the conditional probabilities of dEM/dt and dTmax/dt in comparison to those of the unsigned magnetic flux. We conclude that flare forerunner events such as preflare heating or small flare activity prior to major flares are reflected in the temporal evolution of EM and Tmax. Of these two, the temporal derivative of the EM could conceivably be used as a credible precursor, or short-term predictor, of an imminent flare.
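A conditional flaring probability of the kind compared above can be estimated from point-in-time observations as a simple frequency. The sketch below assumes a threshold-based estimate, the fraction of observations with a predictor value above a threshold that were followed by a flare within the forecast window; the abstract does not specify the exact estimator, and the function name, argument names, and sample values are hypothetical.

```python
def conditional_flare_probability(values, flared, threshold):
    """Estimate P(flare within window | predictor value > threshold).

    values    : predictor values (e.g. dEM/dt), one per observation
    flared    : booleans, True if a flare followed within the forecast window
    threshold : only observations with value > threshold are counted
    Returns None when no observation exceeds the threshold.
    """
    selected = [f for v, f in zip(values, flared) if v > threshold]
    if not selected:
        return None
    # True counts as 1, so the sum is the number of flaring observations.
    return sum(selected) / len(selected)


# Made-up illustrative values, not data from the paper.
values = [0.1, 0.5, 0.9, 1.2]
flared = [False, False, True, True]
print(conditional_flare_probability(values, flared, threshold=0.4))
```

Evaluating the function over a range of thresholds gives a probability curve that can be compared between predictors, such as a time derivative and the unsigned magnetic flux.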
Submitted 12 November, 2020;
originally announced November 2020.