-
First Measurement of Missing Energy Due to Nuclear Effects in Monoenergetic Neutrino Charged Current Interactions
Authors:
E. Marzec,
S. Ajimura,
A. Antonakis,
M. Botran,
M. K. Cheoun,
J. H. Choi,
J. W. Choi,
J. Y. Choi,
T. Dodo,
H. Furuta,
J. H. Goh,
K. Haga,
M. Harada,
S. Hasegawa,
Y. Hino,
T. Hiraiwa,
W. Hwang,
T. Iida,
E. Iwai,
S. Iwata,
H. I. Jang,
J. S. Jang,
M. C. Jang,
H. K. Jeon,
S. H. Jeon
, et al. (59 additional authors not shown)
Abstract:
We present the first measurement of the missing energy due to nuclear effects in monoenergetic, muon neutrino charged-current interactions on carbon, originating from $K^+ \rightarrow μ^+ ν_μ$ decay-at-rest ($E_{ν_μ}=235.5$ MeV), performed with the JSNS$^2$ liquid scintillator based experiment. Towards characterizing the neutrino interaction, ostensibly $ν_μn \rightarrow μ^- p$ or $ν_μ$…
▽ More
We present the first measurement of the missing energy due to nuclear effects in monoenergetic, muon neutrino charged-current interactions on carbon, originating from $K^+ \rightarrow μ^+ ν_μ$ decay-at-rest ($E_{ν_μ}=235.5$ MeV), performed with the JSNS$^2$ liquid scintillator based experiment. Towards characterizing the neutrino interaction, ostensibly $ν_μn \rightarrow μ^- p$ or $ν_μ$$^{12}\mathrm{C}$ $\rightarrow μ^-$$^{12}\mathrm{N}$, and in analogy to similar electron scattering based measurements, we define the missing energy as the energy transferred to the nucleus ($ω$) minus the kinetic energy of the outgoing proton(s), $E_{m} \equiv ω-\sum T_p$, and relate this to visible energy in the detector, $E_{m}=E_{ν_μ}~(235.5~\mathrm{MeV})-m_μ~(105.7~\mathrm{MeV}) - E_{vis}$. The missing energy, which is naively expected to be zero in the absence of nuclear effects (e.g. nucleon separation energy, Fermi momenta, and final-state interactions), is uniquely sensitive to many aspects of the interaction, and has previously been inaccessible with neutrinos. The shape-only, differential cross section measurement reported, based on a $(77\pm3)$% pure double-coincidence KDAR signal (621 total events), provides an important benchmark for models and event generators at 100s-of-MeV neutrino energies, characterized by the difficult-to-model transition region between neutrino-nucleus and neutrino-nucleon scattering, and relevant for applications in nuclear physics, neutrino oscillation measurements, and Type-II supernova studies.
△ Less
Submitted 2 September, 2024;
originally announced September 2024.
-
External Steering of Vine Robots via Magnetic Actuation
Authors:
Nam Gyun Kim,
Nikita J. Greenidge,
Joshua Davy,
Shinwoo Park,
James H. Chandler,
Jee-Hwan Ryu,
Pietro Valdastri
Abstract:
This paper explores the concept of external magnetic control for vine robots to enable their high curvature steering and navigation for use in endoluminal applications. Vine robots, inspired by natural growth and locomotion strategies, present unique shape adaptation capabilities that allow passive deformation around obstacles. However, without additional steering mechanisms, they lack the ability…
▽ More
This paper explores the concept of external magnetic control for vine robots to enable their high curvature steering and navigation for use in endoluminal applications. Vine robots, inspired by natural growth and locomotion strategies, present unique shape adaptation capabilities that allow passive deformation around obstacles. However, without additional steering mechanisms, they lack the ability to actively select the desired direction of growth. The principles of magnetically steered growing robots are discussed, and experimental results showcase the effectiveness of the proposed magnetic actuation approach. We present a 25 mm diameter vine robot with integrated magnetic tip capsule, including 6 Degrees of Freedom (DOF) localization and camera and demonstrate a minimum bending radius of 3.85 cm with an internal pressure of 30 kPa. Furthermore, we evaluate the robot's ability to form tight curvature through complex navigation tasks, with magnetic actuation allowing for extended free-space navigation without buckling. The suspension of the magnetic tip was also validated using the 6 DOF localization system to ensure that the shear-free nature of vine robots was preserved. Additionally, by exploiting the magnetic wrench at the tip, we showcase preliminary results of vine retraction. The findings contribute to the development of controllable vine robots for endoluminal applications, providing high tip force and shear-free navigation.
△ Less
Submitted 2 September, 2024;
originally announced September 2024.
-
Realization of geometric phase topology induced by multiple exceptional points
Authors:
Jung-Wan Ryu,
Jae-Ho Han,
Chang-Hwan Yi
Abstract:
Non-Hermitian systems have Riemann surface structures of complex eigenvalues that admit singularities known as exceptional points. Combining with geometric phases of eigenstates gives rise to unique properties of non-Hermitian systems, and their classifications have been studied recently. However, the physical realizations of classes of the classifications have been relatively limited because a sm…
▽ More
Non-Hermitian systems have Riemann surface structures of complex eigenvalues that admit singularities known as exceptional points. Combining with geometric phases of eigenstates gives rise to unique properties of non-Hermitian systems, and their classifications have been studied recently. However, the physical realizations of classes of the classifications have been relatively limited because a small number of modes and exceptional points are involved. In this work, we show in microcavities that all five classes [J.-W. Ryu, et al., Commun. Phys. 7, 109 (2024)] of three modes can emerge with three exceptional points. In demonstrations, we identified various combinations of exceptional points within a two-dimensional parameter space of a single microcavity and defined five distinct encircling loops based on three selected exceptional points. According to the classification, these loops facilitate different mode exchanges and the acquisition of additional geometric phases during the adiabatic encircling of exceptional points. Our results provide a broad description of the geometric phases-associated topology induced by multiple exceptional points in realistic physical systems.
△ Less
Submitted 29 August, 2024;
originally announced August 2024.
-
Designing elegant Bell inequalities
Authors:
Kwangil Bae,
Junghee Ryu,
Ilkwon Sohn,
Wonhyuk Lee
Abstract:
Elegant Bell inequality is well known for its much exploited property, being maximally violated by maximal entanglement, mutually unbiased bases, and symmetric informationally complete positive operator-valued measure elements. It is the only one with such property known so far. We present a method to construct Bell inequalities with violation feature analogous to original elegant Bell inequality…
▽ More
Elegant Bell inequality is well known for its much exploited property, being maximally violated by maximal entanglement, mutually unbiased bases, and symmetric informationally complete positive operator-valued measure elements. It is the only one with such property known so far. We present a method to construct Bell inequalities with violation feature analogous to original elegant Bell inequality in high dimension from a simple analytic quantum bound. A Bell inequality with such feature is derived in three dimension for the first time. It shows larger violation than existing Bell inequalities of similar classes while requiring arguably small number of measurements.
△ Less
Submitted 23 August, 2024; v1 submitted 21 August, 2024;
originally announced August 2024.
-
UV-Plane Beam Mapping for Non-Terrestrial Networks in 3GPP System-Level Simulations
Authors:
Dong-Hyun Jung,
Sucheol Kim,
Miyeon Lee,
Joon-Gyu Ryu,
Junil Choi
Abstract:
Due to the high altitudes and large beam sizes of satellites, the curvature of the Earth's surface can impact system-level performance. To consider this, 3GPP introduces the UV-plane beam mapping for system-level simulations of non-terrestrial networks (NTNs). This paper aims to provide a comprehensive understanding of how beams and user equipments (UEs) are placed on the UV-plane and subsequently…
▽ More
Due to the high altitudes and large beam sizes of satellites, the curvature of the Earth's surface can impact system-level performance. To consider this, 3GPP introduces the UV-plane beam mapping for system-level simulations of non-terrestrial networks (NTNs). This paper aims to provide a comprehensive understanding of how beams and user equipments (UEs) are placed on the UV-plane and subsequently mapped to the Earth's surface. We present a general process of projecting UEs on the UV-plane onto the Earth's surface. This process could offer a useful guideline for beam and UE deployment when evaluating the system-level performance of NTNs.
△ Less
Submitted 15 August, 2024;
originally announced August 2024.
-
Rate-Splitting Multiple Access for GEO-LEO Coexisting Satellite Systems: A Traffic-Aware Throughput Maximization Precoder Design
Authors:
Jaehak Ryu,
Aryan Kaushik,
Byungju Lee,
Wonjae Shin
Abstract:
The frequency coexistence between geostationary orbit (GEO) and low earth orbit (LEO) satellite systems is expected to be a promising approach for relieving spectrum scarcity. However, it is essential to manage mutual interference between GEO and LEO satellite systems for frequency coexistence. Specifically, \emph{in-line interference}, caused by LEO satellites moving near the line-of-sight path b…
▽ More
The frequency coexistence between geostationary orbit (GEO) and low earth orbit (LEO) satellite systems is expected to be a promising approach for relieving spectrum scarcity. However, it is essential to manage mutual interference between GEO and LEO satellite systems for frequency coexistence. Specifically, \emph{in-line interference}, caused by LEO satellites moving near the line-of-sight path between GEO satellite and GEO users (GUs), can significantly degrade GEO system throughput. This paper put forth a novel rate-splitting multiple access (RSMA) with a super-common message for GEO-LEO coexisting satellite systems (CSS). By employing a super-common message that GUs can decode, GUs can mitigate the in-line interference by successive interference cancellation (SIC). Moreover, we formulate a traffic-aware throughput maximization (TTM) problem to satisfy the heterogeneous traffic demands of users by minimizing total unmet throughput demands (or user dissatisfaction). By doing so, the TTM precoder can be flexibly adjusted according to the interference leakage from LEO satellites to GUs and target traffic demands. Numerical results confirm that our proposed method ensures seamless connectivity even in the GEO-LEO in-line interference regime under imperfect channel state information (CSI) at both the transmitter and receiver.
△ Less
Submitted 4 August, 2024;
originally announced August 2024.
-
Nested Music Transformer: Sequentially Decoding Compound Tokens in Symbolic Music and Audio Generation
Authors:
Jiwoo Ryu,
Hao-Wen Dong,
Jongmin Jung,
Dasaem Jeong
Abstract:
Representing symbolic music with compound tokens, where each token consists of several different sub-tokens representing a distinct musical feature or attribute, offers the advantage of reducing sequence length. While previous research has validated the efficacy of compound tokens in music sequence modeling, predicting all sub-tokens simultaneously can lead to suboptimal results as it may not full…
▽ More
Representing symbolic music with compound tokens, where each token consists of several different sub-tokens representing a distinct musical feature or attribute, offers the advantage of reducing sequence length. While previous research has validated the efficacy of compound tokens in music sequence modeling, predicting all sub-tokens simultaneously can lead to suboptimal results as it may not fully capture the interdependencies between them. We introduce the Nested Music Transformer (NMT), an architecture tailored for decoding compound tokens autoregressively, similar to processing flattened tokens, but with low memory usage. The NMT consists of two transformers: the main decoder that models a sequence of compound tokens and the sub-decoder for modeling sub-tokens of each compound token. The experiment results showed that applying the NMT to compound tokens can enhance the performance in terms of better perplexity in processing various symbolic music datasets and discrete audio tokens from the MAESTRO dataset.
△ Less
Submitted 2 August, 2024;
originally announced August 2024.
-
Generalizing AI-driven Assessment of Immunohistochemistry across Immunostains and Cancer Types: A Universal Immunohistochemistry Analyzer
Authors:
Biagio Brattoli,
Mohammad Mostafavi,
Taebum Lee,
Wonkyung Jung,
Jeongun Ryu,
Seonwook Park,
Jongchan Park,
Sergio Pereira,
Seunghwan Shin,
Sangjoon Choi,
Hyojin Kim,
Donggeun Yoo,
Siraj M. Ali,
Kyunghyun Paeng,
Chan-Young Ock,
Soo Ick Cho,
Seokhwi Kim
Abstract:
Despite advancements in methodologies, immunohistochemistry (IHC) remains the most utilized ancillary test for histopathologic and companion diagnostics in targeted therapies. However, objective IHC assessment poses challenges. Artificial intelligence (AI) has emerged as a potential solution, yet its development requires extensive training for each cancer and IHC type, limiting versatility. We dev…
▽ More
Despite advancements in methodologies, immunohistochemistry (IHC) remains the most utilized ancillary test for histopathologic and companion diagnostics in targeted therapies. However, objective IHC assessment poses challenges. Artificial intelligence (AI) has emerged as a potential solution, yet its development requires extensive training for each cancer and IHC type, limiting versatility. We developed a Universal IHC (UIHC) analyzer, an AI model for interpreting IHC images regardless of tumor or IHC types, using training datasets from various cancers stained for PD-L1 and/or HER2. This multi-cohort trained model outperforms conventional single-cohort models in interpreting unseen IHCs (Kappa score 0.578 vs. up to 0.509) and consistently shows superior performance across different positive staining cutoff values. Qualitative analysis reveals that UIHC effectively clusters patches based on expression levels. The UIHC model also quantitatively assesses c-MET expression with MET mutations, representing a significant advancement in AI application in the era of personalized medicine and accumulating novel biomarkers.
△ Less
Submitted 30 July, 2024;
originally announced July 2024.
-
Engineering high Chern number insulators
Authors:
Sungjong Woo,
Seungbum Woo,
Jung-Wan Ryu,
Hee Chul Park
Abstract:
The concept of Chern insulators is one of the most important buliding block of topological physics, enabling the quantum Hall effect without external magnetic fields. The construction of Chern insulators has been typically through an guess-and-confirm approach, which can be inefficient and unpredictable. In this paper, we introduce a systematic method to directly construct two-dimensional Chern in…
▽ More
The concept of Chern insulators is one of the most important buliding block of topological physics, enabling the quantum Hall effect without external magnetic fields. The construction of Chern insulators has been typically through an guess-and-confirm approach, which can be inefficient and unpredictable. In this paper, we introduce a systematic method to directly construct two-dimensional Chern insulators that can provide any nontrivial Chern number. Our method is built upon the one-dimensional Rice-Mele model, which is well known for its adjustable polarization properties, providing a reliable framework for manipulation. By extending this model into two dimensions, we are able to engineer lattice structures that demonstrate predetermined topological quantities effectively. This research not only contributes the development of Chern insulators but also paves the way for designing a variety of lattice structures with significant topological implications, potentially impacting quantum computing and materials science. With this approach, we are to shed light on the pathways for designing more complex and functional topological phases in synthetic materials.
△ Less
Submitted 23 July, 2024;
originally announced July 2024.
-
A Bistatic ISAC Framework for LEO Satellite Systems: A Rate-Splitting Approach
Authors:
Juha Park,
Jaehyup Seong,
Jaehak Ryu,
Yijie Mao,
Wonjae Shin
Abstract:
Aiming to achieve ubiquitous global connectivity and target detection on the same platform with improved spectral/energy efficiency and reduced onboard hardware cost, low Earth orbit (LEO) satellite systems capable of simultaneously performing communications and radar have attracted significant attention. Designing such a joint system should address not only the challenges of integrating two funct…
▽ More
Aiming to achieve ubiquitous global connectivity and target detection on the same platform with improved spectral/energy efficiency and reduced onboard hardware cost, low Earth orbit (LEO) satellite systems capable of simultaneously performing communications and radar have attracted significant attention. Designing such a joint system should address not only the challenges of integrating two functions but also the unique propagation characteristics of the satellites. To overcome severe echo signal path loss due to the high altitude of the satellite, we put forth a bistatic integrated sensing and communication (ISAC) framework with a radar receiver separated from the satellite. For robust and effective interference management, we employ rate-splitting multiple access (RSMA), which splits and encodes users messages into private and common streams. We optimize the dual-functional precoders to maximize the minimum rate among all users while satisfying the Cramer-Rao bound (CRB) constraints. Given the challenge of acquiring instantaneous channel state information (iCSI) for LEO satellites, we exploit the geometrical and statistical characteristics of the satellite channel. To develop an efficient optimization algorithm, semidefinite relaxation (SDR), sequential rank-1 constraint relaxation (SROCR), and successive convex approximation (SCA) are utilized. Numerical results show that the proposed framework efficiently performs both communication and radar, demonstrating superior interference control capabilities. Furthermore, it is validated that the common stream plays three vital roles: i) beamforming towards the radar target, ii) interference management between communications and radar, and iii) interference management among communication users.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
The Radiation Gauge: When is it Valid?
Authors:
Jie Zhu,
Christopher J. Ryu,
Dong-Yeop Na,
Weng Cho Chew
Abstract:
In this paper, we shall show that the vector-scalar potential ($\mathbf{A}$-$Φ$) formulation, for many problems, can be further simplified by ignoring the scalar potential contribution and setting it to zero.
In this paper, we shall show that the vector-scalar potential ($\mathbf{A}$-$Φ$) formulation, for many problems, can be further simplified by ignoring the scalar potential contribution and setting it to zero.
△ Less
Submitted 10 July, 2024;
originally announced July 2024.
-
Decoding with Limited Teacher Supervision Requires Understanding When to Trust the Teacher
Authors:
Hyunjong Ok,
Jegwang Ryu,
Jaeho Lee
Abstract:
How can sLLMs efficiently utilize the supervision of LLMs to improve their generative quality? This question has been well studied in scenarios where there is no restriction on the number of LLM supervisions one can use, giving birth to many decoding algorithms that utilize supervision without further training. However, it is still unclear what is an effective strategy under the limited supervisio…
▽ More
How can sLLMs efficiently utilize the supervision of LLMs to improve their generative quality? This question has been well studied in scenarios where there is no restriction on the number of LLM supervisions one can use, giving birth to many decoding algorithms that utilize supervision without further training. However, it is still unclear what is an effective strategy under the limited supervision scenario, where we assume that no more than a few tokens can be generated by LLMs. To this end, we develop an algorithm to effectively aggregate the sLLM and LLM predictions on initial tokens so that the generated tokens can more accurately condition the subsequent token generation by sLLM only. Critically, we find that it is essential to adaptively overtrust or disregard the LLM prediction based on the confidence of the sLLM. Through our experiments on a wide range of models and datasets, we demonstrate that our method provides a consistent improvement over conventional decoding strategies.
△ Less
Submitted 25 June, 2024;
originally announced June 2024.
-
Exponentially Enhanced Scheme for the Heralded Qudit GHZ State in Linear Optics
Authors:
Seungbeom Chin,
Junghee Ryu,
Yong-Su Kim
Abstract:
High-dimensional multipartite entanglement plays a crucial role in quantum information science. However, existing schemes for generating such entanglement become increasingly complex and costly as the dimension of quantum units increases. In this work, we overcome the limitation by proposing a significantly enhanced linear optical heralded scheme that generates the $d$-level $N$-partite GHZ state…
▽ More
High-dimensional multipartite entanglement plays a crucial role in quantum information science. However, existing schemes for generating such entanglement become increasingly complex and costly as the dimension of quantum units increases. In this work, we overcome the limitation by proposing a significantly enhanced linear optical heralded scheme that generates the $d$-level $N$-partite GHZ state with single-photon sources and their linear operations. Our scheme requires $dN$ photons to generate the target state with substantially improved success probability from previous schemes. It employs linear optical logic gates compatible with any qudit encoding system and can generate generalized GHZ states with installments of beamsplitters. With efficient generations of high-dimensional resource states, our work opens avenues for further exploration in high-dimensional quantum information processing.
△ Less
Submitted 16 June, 2024;
originally announced June 2024.
-
EHR-SeqSQL : A Sequential Text-to-SQL Dataset For Interactively Exploring Electronic Health Records
Authors:
Jaehee Ryu,
Seonhee Cho,
Gyubok Lee,
Edward Choi
Abstract:
In this paper, we introduce EHR-SeqSQL, a novel sequential text-to-SQL dataset for Electronic Health Record (EHR) databases. EHR-SeqSQL is designed to address critical yet underexplored aspects in text-to-SQL parsing: interactivity, compositionality, and efficiency. To the best of our knowledge, EHR-SeqSQL is not only the largest but also the first medical text-to-SQL dataset benchmark to include…
▽ More
In this paper, we introduce EHR-SeqSQL, a novel sequential text-to-SQL dataset for Electronic Health Record (EHR) databases. EHR-SeqSQL is designed to address critical yet underexplored aspects in text-to-SQL parsing: interactivity, compositionality, and efficiency. To the best of our knowledge, EHR-SeqSQL is not only the largest but also the first medical text-to-SQL dataset benchmark to include sequential and contextual questions. We provide a data split and the new test set designed to assess compositional generalization ability. Our experiments demonstrate the superiority of a multi-turn approach over a single-turn approach in learning compositionality. Additionally, our dataset integrates specially crafted tokens into SQL queries to improve execution efficiency. With EHR-SeqSQL, we aim to bridge the gap between practical needs and academic research in the text-to-SQL domain. EHR-SeqSQL is available at https://github.com/seonhee99/EHR-SeqSQL.
△ Less
Submitted 30 July, 2024; v1 submitted 23 May, 2024;
originally announced June 2024.
-
Pseudo-Hermitian Topology of Multiband Non-Hermitian Systems
Authors:
Jung-Wan Ryu,
Jae-Ho Han,
Chang-Hwan Yi,
Hee Chul Park,
Moon Jip Park
Abstract:
The complex eigenenergies and non-orthogonal eigenstates of non-Hermitian systems exhibit unique topological phenomena that cannot appear in Hermitian systems. Representative examples are the non-Hermitian skin effect and exceptional points. In a two-dimensional parameter space, topological classifications of non-separable bands in multiband non-Hermitian systems can be established by invoking a p…
▽ More
The complex eigenenergies and non-orthogonal eigenstates of non-Hermitian systems exhibit unique topological phenomena that cannot appear in Hermitian systems. Representative examples are the non-Hermitian skin effect and exceptional points. In a two-dimensional parameter space, topological classifications of non-separable bands in multiband non-Hermitian systems can be established by invoking a permutation group, where the product of the permutation represents state exchange due to exceptional points in the space. We unveil in this work the role of pseudo-Hermitian lines in non-Hermitian topology for multiple bands. Contrary to current understanding, the non-separability of non-Hermitian multibands can be topologically non-trivial without exceptional points in two-dimensional space. Our work builds on the fundamental and comprehensive understanding of non-Hermitian multiband systems and also offers versatile applications and realizations of non-Hermitian systems without the need to consider exceptional points.
△ Less
Submitted 27 May, 2024;
originally announced May 2024.
-
Generalized Einstein Relations between Absorption and Emission Spectra at Thermodynamic Equilibrium
Authors:
Jisu Ryu,
Sarang Yeola,
David M. Jonas
Abstract:
We present Einstein coefficient spectra and a detailed-balance derivation of generalized Einstein relations between them that is based on the connection between spontaneous and stimulated emission. If two broadened levels or bands overlap in energy, transitions between them need not be purely absorptive or emissive. Consequently, spontaneous emission can occur in both transition directions, and fo…
▽ More
We present Einstein coefficient spectra and a detailed-balance derivation of generalized Einstein relations between them that is based on the connection between spontaneous and stimulated emission. If two broadened levels or bands overlap in energy, transitions between them need not be purely absorptive or emissive. Consequently, spontaneous emission can occur in both transition directions, and four Einstein coefficient spectra replace the three Einstein coefficients for a line. At equilibrium, the four different spectra obey five pairwise relationships and one lineshape generates all four. These relationships are independent of molecular quantum statistics and predict the Stokes' shift between forward and reverse transitions required by equilibrium with blackbody radiation. For Boltzmann statistics, the relative strengths of forward and reverse transitions depend on the formal chemical potential difference between the initial and final bands, which becomes the standard chemical potential difference for ideal solutes. The formal chemical potential of a band replaces both the energy and degeneracy of a quantum level. Like the energies of quantum levels, the formal chemical potentials of bands obey the Rydberg-Ritz combination principle. Each stimulated Einstein coefficient spectrum gives a frequency-dependent transition cross section. Transition cross sections obey causality and a detailed-balance condition with spontaneous emission, but do not directly obey generalized Einstein relations. Even with an energetic width much less than the photon energy, an absorptive forward transition with an energetic width much greater than the thermal energy can have such an extreme Stokes' shift that its reverse transition cross section becomes predominantly absorptive rather than emissive.
△ Less
Submitted 20 August, 2024; v1 submitted 22 May, 2024;
originally announced May 2024.
-
Cross-modality Matching and Prediction of Perturbation Responses with Labeled Gromov-Wasserstein Optimal Transport
Authors:
Jayoung Ryu,
Romain Lopez,
Charlotte Bunne,
Aviv Regev
Abstract:
It is now possible to conduct large scale perturbation screens with complex readout modalities, such as different molecular profiles or high content cell images. While these open the way for systematic dissection of causal cell circuits, integrated such data across screens to maximize our ability to predict circuits poses substantial computational challenges, which have not been addressed. Here, w…
▽ More
It is now possible to conduct large scale perturbation screens with complex readout modalities, such as different molecular profiles or high content cell images. While these open the way for systematic dissection of causal cell circuits, integrated such data across screens to maximize our ability to predict circuits poses substantial computational challenges, which have not been addressed. Here, we extend two Gromov-Wasserstein Optimal Transport methods to incorporate the perturbation label for cross-modality alignment. The obtained alignment is then employed to train a predictive model that estimates cellular responses to perturbations observed with only one measurement modality. We validate our method for the tasks of cross-modality alignment and cross-modality prediction in a recent multi-modal single-cell perturbation dataset. Our approach opens the way to unified causal models of cell biology.
△ Less
Submitted 1 May, 2024;
originally announced May 2024.
-
Boundary effect and quantum phases in spin chains
Authors:
Jinhyeok Ryu,
Jaeyoon Cho
Abstract:
Boundary effect is a widespread idea in many-body theories. However, it is more of a conceptual notion than a rigorously defined physical quantity. One can quantify the boundary effect by comparing two ground states of the same physical model, which differ only slightly in system size. Here, we analyze the quantity, which we call a boundary effect function, for an XXZ spin-1/2 model using density…
▽ More
Boundary effect is a widespread idea in many-body theories. However, it is more of a conceptual notion than a rigorously defined physical quantity. One can quantify the boundary effect by comparing two ground states of the same physical model, which differ only slightly in system size. Here, we analyze the quantity, which we call a boundary effect function, for an XXZ spin-1/2 model using density matrix renormalization group calculations. We find that three quantum phases of the model manifest as different functional forms of the boundary effect function. As a result, the quantum phase transition of the model is associated with a nonanalytic change of the boundary effect function. This work thus provides and concretizes a novel perspective on the relationship between bulk and boundary properties of ground states.
△ Less
Submitted 30 April, 2024;
originally announced April 2024.
-
Spontaneous emission decay and excitation in photonic temporal crystals
Authors:
Jagang Park,
Kyungmin Lee,
Ruo-Yang Zhang,
Hee-Chul Park,
Jung-Wan Ryu,
Gil Young Cho,
Min Yeul Lee,
Zhaoqing Zhang,
Namkyoo Park,
Wonju Jeon,
Jonghwa Shin,
C. T. Chan,
Bumki Min
Abstract:
Over the last few decades, the prominent strategies for controlling spontaneous emission has been the use of resonant or space-periodic photonic structures. This approach, initially articulated by Purcell and later expanded upon by Yablonovitch in the context of photonic crystals, leverages the spatial surroundings to modify the spontaneous emission decay rate of atoms or quantum emitters. However…
▽ More
Over the last few decades, the prominent strategies for controlling spontaneous emission has been the use of resonant or space-periodic photonic structures. This approach, initially articulated by Purcell and later expanded upon by Yablonovitch in the context of photonic crystals, leverages the spatial surroundings to modify the spontaneous emission decay rate of atoms or quantum emitters. However, the rise of time-varying photonics has compelled a reevaluation of the spontaneous emission process within dynamically changing environments, especially concerning photonic temporal crystals where optical properties undergo time-periodic modulation. Here, we apply classical light-matter interaction theory along with Floquet analysis to reveal a substantial enhancement in the spontaneous emission decay rate at the momentum gap frequency in photonic temporal crystals. This enhancement is attributed to time-periodicity-induced loss and gain mechanisms, as well as the non-orthogonality of Floquet eigenstates that are inherent to photonic temporal crystals. Intriguingly, our findings also suggest that photonic temporal crystals enable the spontaneous excitation of an atom from its ground state to an excited state, accompanied by the concurrent emission of a photon.
△ Less
Submitted 20 April, 2024;
originally announced April 2024.
-
CAUS: A Dataset for Question Generation based on Human Cognition Leveraging Large Language Models
Authors:
Minjung Shin,
Donghyun Kim,
Jeh-Kwang Ryu
Abstract:
We introduce the Curious About Uncertain Scene (CAUS) dataset, designed to enable Large Language Models, specifically GPT-4, to emulate human cognitive processes for resolving uncertainties. Leveraging this dataset, we investigate the potential of LLMs to engage in questioning effectively. Our approach involves providing scene descriptions embedded with uncertainties to stimulate the generation of…
▽ More
We introduce the Curious About Uncertain Scene (CAUS) dataset, designed to enable Large Language Models, specifically GPT-4, to emulate human cognitive processes for resolving uncertainties. Leveraging this dataset, we investigate the potential of LLMs to engage in questioning effectively. Our approach involves providing scene descriptions embedded with uncertainties to stimulate the generation of reasoning and queries. The queries are then classified according to multi-dimensional criteria. All procedures are facilitated by a collaborative system involving both LLMs and human researchers. Our results demonstrate that GPT-4 can effectively generate pertinent questions and grasp their nuances, particularly when given appropriate context and instructions. The study suggests that incorporating human-like questioning into AI models improves their ability to manage uncertainties, paving the way for future advancements in Artificial Intelligence (AI).
△ Less
Submitted 19 May, 2024; v1 submitted 17 April, 2024;
originally announced April 2024.
-
Boundary effect and correlations in fermionic Gaussian states
Authors:
Jinhyeok Ryu,
Jaeyoon Cho
Abstract:
The effect of boundaries on the bulk properties of quantum many-body systems is an intriguing subject of study. One can define a boundary effect function, which quantifies the change in the ground state as a function of the distance from the boundary. This function serves as an upper bound for the correlation functions and the entanglement entropies in the thermodynamic limit. Here, we perform num…
▽ More
The effect of boundaries on the bulk properties of quantum many-body systems is an intriguing subject of study. One can define a boundary effect function, which quantifies the change in the ground state as a function of the distance from the boundary. This function serves as an upper bound for the correlation functions and the entanglement entropies in the thermodynamic limit. Here, we perform numerical analyses of the boundary effect function for one-dimensional free-fermion models. We find that the upper bound established by the boundary effect fuction is tight for the examined systems, providing a deep insight into how correlations and entanglement are developed in the ground state as the system size grows. As a by-product, we derive a general fidelity formula for fermionic Gaussian states in a self-contained manner, rendering the formula easier to apprehend.
△ Less
Submitted 18 June, 2024; v1 submitted 14 April, 2024;
originally announced April 2024.
-
HyperCLOVA X Technical Report
Authors:
Kang Min Yoo,
Jaegeun Han,
Sookyo In,
Heewon Jeon,
Jisu Jeong,
Jaewook Kang,
Hyunwook Kim,
Kyung-Min Kim,
Munhyong Kim,
Sungju Kim,
Donghyun Kwak,
Hanock Kwak,
Se Jung Kwon,
Bado Lee,
Dongsoo Lee,
Gichang Lee,
Jooho Lee,
Baeseong Park,
Seongjin Shin,
Joonsang Yu,
Seolki Baek,
Sumin Byeon,
Eungsup Cho,
Dooseok Choe,
Jeesung Han
, et al. (371 additional authors not shown)
Abstract:
We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t…
▽ More
We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment to responsible AI. The model is evaluated across various benchmarks, including comprehensive reasoning, knowledge, commonsense, factuality, coding, math, chatting, instruction-following, and harmlessness, in both Korean and English. HyperCLOVA X exhibits strong reasoning capabilities in Korean backed by a deep understanding of the language and cultural nuances. Further analysis of the inherent bilingual nature and its extension to multilingualism highlights the model's cross-lingual proficiency and strong generalization ability to untargeted languages, including machine translation between several language pairs and cross-lingual inference tasks. We believe that HyperCLOVA X can provide helpful guidance for regions or countries in developing their sovereign LLMs.
△ Less
Submitted 13 April, 2024; v1 submitted 2 April, 2024;
originally announced April 2024.
-
Entanglement-based quantum information protocols designed with silicon quantum dot platform
Authors:
Junghee Ryu,
Hoon Ryu
Abstract:
Electron spins in silicon quantum dot platform provide great potential for quantum information processing due to excellent physical properties and modern fabrication technologies. Spin-based quantum bit (qubit) operations are intensively studied to realize universal logic gates with a high fidelity, fast gating operations, and basic programmability. Although recent experimental achievements can be…
▽ More
Electron spins in silicon quantum dot platform provide great potential for quantum information processing due to excellent physical properties and modern fabrication technologies. Spin-based quantum bit (qubit) operations are intensively studied to realize universal logic gates with a high fidelity, fast gating operations, and basic programmability. Although recent experimental achievements can be considered as remarkable results for utilizing quantum computation, more advanced quantum information protocols should be demonstrated with a large number of qubit system to enable programmability of silicon devices. Here, we computationally explore entanglement-based quantum information protocols in electrically defined five silicon quantum dot system. To this end, device simulations are employed to demonstrate $1$-qubit gate and $2$-qubit gate operations. Additionally, we discuss the implementations of three applications: the generation of magic states, entanglement swapping, and quantum teleportation in our silicon device. All the results will secure the scalability of quantum information processing with electron spin qubits in silicon quantum dot system.
△ Less
Submitted 28 March, 2024;
originally announced March 2024.
-
Optimizing Quantum Convolutional Neural Network Architectures for Arbitrary Data Dimension
Authors:
Changwon Lee,
Israel F. Araujo,
Dongha Kim,
Junghan Lee,
Siheon Park,
Ju-Young Ryu,
Daniel K. Park
Abstract:
Quantum convolutional neural networks (QCNNs) represent a promising approach in quantum machine learning, paving new directions for both quantum and classical data analysis. This approach is particularly attractive due to the absence of the barren plateau problem, a fundamental challenge in training quantum neural networks (QNNs), and its feasibility. However, a limitation arises when applying QCN…
▽ More
Quantum convolutional neural networks (QCNNs) represent a promising approach in quantum machine learning, paving new directions for both quantum and classical data analysis. This approach is particularly attractive due to the absence of the barren plateau problem, a fundamental challenge in training quantum neural networks (QNNs), and its feasibility. However, a limitation arises when applying QCNNs to classical data. The network architecture is most natural when the number of input qubits is a power of two, as this number is reduced by a factor of two in each pooling layer. The number of input qubits determines the dimensions (i.e. the number of features) of the input data that can be processed, restricting the applicability of QCNN algorithms to real-world data. To address this issue, we propose a QCNN architecture capable of handling arbitrary input data dimensions while optimizing the allocation of quantum resources such as ancillary qubits and quantum gates. This optimization is not only important for minimizing computational resources, but also essential in noisy intermediate-scale quantum (NISQ) computing, as the size of the quantum circuits that can be executed reliably is limited. Through numerical simulations, we benchmarked the classification performance of various QCNN architectures when handling arbitrary input data dimensions on the MNIST and Breast Cancer datasets. The results validate that the proposed QCNN architecture achieves excellent classification performance while utilizing a minimal resource overhead, providing an optimal solution when reliable quantum computation is constrained by noise and imperfections.
△ Less
Submitted 27 March, 2024;
originally announced March 2024.
-
LR-FHSS Transceiver for Direct-to-Satellite IoT Communications: Design, Implementation, and Verification
Authors:
Sooyeob Jung,
Seongah Jeong,
Jinkyu Kang,
Gyeongrae Im,
Sangjae Lee,
Mi-Kyung Oh,
Joon Gyu Ryu,
Joonhyuk Kang
Abstract:
This paper proposes a long range-frequency hopping spread spectrum (LR-FHSS) transceiver design for the Direct-to-Satellite Internet of Things (DtS-IoT) communication system. The DtS-IoT system has recently attracted attention as a promising nonterrestrial network (NTN) solution to provide high-traffic and low-latency data transfer services to IoT devices in global coverage. In particular, this st…
▽ More
This paper proposes a long range-frequency hopping spread spectrum (LR-FHSS) transceiver design for the Direct-to-Satellite Internet of Things (DtS-IoT) communication system. The DtS-IoT system has recently attracted attention as a promising nonterrestrial network (NTN) solution to provide high-traffic and low-latency data transfer services to IoT devices in global coverage. In particular, this study provides guidelines for the overall DtS-IoT system architecture and design details that conform to the Long Range Wide-Area Network (LoRaWAN). Furthermore, we also detail various DtS-IoT use cases. Considering the multiple low-Earth orbit (LEO) satellites, we developed the LR-FHSS transceiver to improve system efficiency, which is the first attempt in real satellite communication systems using LR-FHSS. Moreover, as an extension of our previous work with perfect synchronization, we applied a robust synchronization scheme against the Doppler effect and co-channel interference (CCI) caused by LEO satellite channel environments, including signal detection for the simultaneous reception of numerous frequency hopping signals and an enhanced soft-output-Viterbi-algorithm (SOVA) for the header and payload receptions. Lastly, we present proof-of-concept implementation and testbeds using an application-specific integrated circuit (ASIC) chipset and a field-programmable gate array (FPGA) that verify the performance of the proposed LR-FHSS transceiver design of DtS-IoT communication systems. The laboratory test results reveal that the proposed LR-FHSS-based framework with the robust synchronization technique can provide wide coverage, seamless connectivity, and high throughput communication links for the realization of future sixth-generation (6G) networks.
△ Less
Submitted 21 March, 2024;
originally announced March 2024.
-
Interactive Multi-Head Self-Attention with Linear Complexity
Authors:
Hankyul Kang,
Ming-Hsuan Yang,
Jongbin Ryu
Abstract:
We propose an efficient interactive method for multi-head self-attention via decomposition. For existing methods using multi-head self-attention, the attention operation of each head is computed independently. However, we show that the interactions between cross-heads of the attention matrix enhance the information flow of the attention operation. Considering that the attention matrix of each head…
▽ More
We propose an efficient interactive method for multi-head self-attention via decomposition. For existing methods using multi-head self-attention, the attention operation of each head is computed independently. However, we show that the interactions between cross-heads of the attention matrix enhance the information flow of the attention operation. Considering that the attention matrix of each head can be seen as a feature of networks, it is beneficial to establish connectivity between them to capture interactions better. However, a straightforward approach to capture the interactions between the cross-heads is computationally prohibitive as the complexity grows substantially with the high dimension of an attention matrix. In this work, we propose an effective method to decompose the attention operation into query- and key-less components. This will result in a more manageable size for the attention matrix, specifically for the cross-head interactions. Expensive experimental results show that the proposed cross-head interaction approach performs favorably against existing efficient attention methods and state-of-the-art backbone models.
△ Less
Submitted 27 February, 2024;
originally announced February 2024.
-
Sobolev Training for Operator Learning
Authors:
Namkyeong Cho,
Junseung Ryu,
Hyung Ju Hwang
Abstract:
This study investigates the impact of Sobolev Training on operator learning frameworks for improving model performance. Our research reveals that integrating derivative information into the loss function enhances the training process, and we propose a novel framework to approximate derivatives on irregular meshes in operator learning. Our findings are supported by both experimental evidence and th…
▽ More
This study investigates the impact of Sobolev Training on operator learning frameworks for improving model performance. Our research reveals that integrating derivative information into the loss function enhances the training process, and we propose a novel framework to approximate derivatives on irregular meshes in operator learning. Our findings are supported by both experimental evidence and theoretical analysis. This demonstrates the effectiveness of Sobolev Training in approximating the solution operators between infinite-dimensional spaces.
△ Less
Submitted 14 February, 2024;
originally announced February 2024.
-
Are Uncertainty Quantification Capabilities of Evidential Deep Learning a Mirage?
Authors:
Maohao Shen,
J. Jon Ryu,
Soumya Ghosh,
Yuheng Bu,
Prasanna Sattigeri,
Subhro Das,
Gregory W. Wornell
Abstract:
This paper questions the effectiveness of a modern predictive uncertainty quantification approach, called \emph{evidential deep learning} (EDL), in which a single neural network model is trained to learn a meta distribution over the predictive distribution by minimizing a specific objective function. Despite their perceived strong empirical performance on downstream tasks, a line of recent studies…
▽ More
This paper questions the effectiveness of a modern predictive uncertainty quantification approach, called \emph{evidential deep learning} (EDL), in which a single neural network model is trained to learn a meta distribution over the predictive distribution by minimizing a specific objective function. Despite their perceived strong empirical performance on downstream tasks, a line of recent studies by Bengs et al. identify limitations of the existing methods to conclude their learned epistemic uncertainties are unreliable, e.g., in that they are non-vanishing even with infinite data. Building on and sharpening such analysis, we 1) provide a sharper understanding of the asymptotic behavior of a wide class of EDL methods by unifying various objective functions; 2) reveal that the EDL methods can be better interpreted as an out-of-distribution detection algorithm based on energy-based-models; and 3) conduct extensive ablation studies to better assess their empirical effectiveness with real-world datasets. Through all these analyses, we conclude that even when EDL methods are empirically effective on downstream tasks, this occurs despite their poor uncertainty quantification capabilities. Our investigation suggests that incorporating model uncertainty can help EDL methods faithfully quantify uncertainties and further improve performance on representative downstream tasks, albeit at the cost of additional computational complexity.
△ Less
Submitted 12 June, 2024; v1 submitted 8 February, 2024;
originally announced February 2024.
-
Gambling-Based Confidence Sequences for Bounded Random Vectors
Authors:
J. Jon Ryu,
Gregory W. Wornell
Abstract:
A confidence sequence (CS) is a sequence of confidence sets that contains a target parameter of an underlying stochastic process at any time step with high probability. This paper proposes a new approach to constructing CSs for means of bounded multivariate stochastic processes using a general gambling framework, extending the recently established coin toss framework for bounded random processes.…
▽ More
A confidence sequence (CS) is a sequence of confidence sets that contains a target parameter of an underlying stochastic process at any time step with high probability. This paper proposes a new approach to constructing CSs for means of bounded multivariate stochastic processes using a general gambling framework, extending the recently established coin toss framework for bounded random processes. The proposed gambling framework provides a general recipe for constructing CSs for categorical and probability-vector-valued observations, as well as for general bounded multidimensional observations through a simple reduction. This paper specifically explores the use of the mixture portfolio, akin to Cover's universal portfolio, in the proposed framework and investigates the properties of the resulting CSs. Simulations demonstrate the tightness of these confidence sequences compared to existing methods. When applied to the sampling without-replacement setting for finite categorical data, it is shown that the resulting CS based on a universal gambling strategy is provably tighter than that of the posterior-prior ratio martingale proposed by Waudby-Smith and Ramdas.
△ Less
Submitted 21 August, 2024; v1 submitted 5 February, 2024;
originally announced February 2024.
-
Operator SVD with Neural Networks via Nested Low-Rank Approximation
Authors:
J. Jon Ryu,
Xiangxiang Xu,
H. S. Melihcan Erol,
Yuheng Bu,
Lizhong Zheng,
Gregory W. Wornell
Abstract:
Computing eigenvalue decomposition (EVD) of a given linear operator, or finding its leading eigenvalues and eigenfunctions, is a fundamental task in many machine learning and scientific computing problems. For high-dimensional eigenvalue problems, training neural networks to parameterize the eigenfunctions is considered as a promising alternative to the classical numerical linear algebra technique…
▽ More
Computing eigenvalue decomposition (EVD) of a given linear operator, or finding its leading eigenvalues and eigenfunctions, is a fundamental task in many machine learning and scientific computing problems. For high-dimensional eigenvalue problems, training neural networks to parameterize the eigenfunctions is considered as a promising alternative to the classical numerical linear algebra techniques. This paper proposes a new optimization framework based on the low-rank approximation characterization of a truncated singular value decomposition, accompanied by new techniques called \emph{nesting} for learning the top-$L$ singular values and singular functions in the correct order. The proposed method promotes the desired orthogonality in the learned functions implicitly and efficiently via an unconstrained optimization formulation, which is easy to solve with off-the-shelf gradient-based optimization algorithms. We demonstrate the effectiveness of the proposed optimization framework for use cases in computational physics and machine learning.
△ Less
Submitted 21 August, 2024; v1 submitted 5 February, 2024;
originally announced February 2024.
-
Semiclassical $L^p$ quasimode restriction estimates in two dimensions
Authors:
Sewook Oh,
Jaehyeon Ryu
Abstract:
We establish the $L^p$ restriction estimates for quasimodes on a smooth curve in two dimensions. Our estimates are sharp for all smooth curves. As an application, we address $L^p$ eigenfunction restriction estimates for Laplace-Beltrami eigenfunctions on $2$-dimensional compact Riemannian manifolds without boundary and Hermite functions on $\mathbb R^2$. Our method involves a geometric analysis of…
▽ More
We establish the $L^p$ restriction estimates for quasimodes on a smooth curve in two dimensions. Our estimates are sharp for all smooth curves. As an application, we address $L^p$ eigenfunction restriction estimates for Laplace-Beltrami eigenfunctions on $2$-dimensional compact Riemannian manifolds without boundary and Hermite functions on $\mathbb R^2$. Our method involves a geometric analysis of the contact order between the curve and the bicharacteristic flow of the semiclassical pseudodifferential operator.
△ Less
Submitted 24 February, 2024; v1 submitted 30 January, 2024;
originally announced January 2024.
-
Long-Term Typhoon Trajectory Prediction: A Physics-Conditioned Approach Without Reanalysis Data
Authors:
Young-Jae Park,
Minseok Seo,
Doyi Kim,
Hyeri Kim,
Sanghoon Choi,
Beomkyu Choi,
Jeongwon Ryu,
Sohee Son,
Hae-Gon Jeon,
Yeji Choi
Abstract:
In the face of escalating climate changes, typhoon intensities and their ensuing damage have surged. Accurate trajectory prediction is crucial for effective damage control. Traditional physics-based models, while comprehensive, are computationally intensive and rely heavily on the expertise of forecasters. Contemporary data-driven methods often rely on reanalysis data, which can be considered to b…
▽ More
In the face of escalating climate changes, typhoon intensities and their ensuing damage have surged. Accurate trajectory prediction is crucial for effective damage control. Traditional physics-based models, while comprehensive, are computationally intensive and rely heavily on the expertise of forecasters. Contemporary data-driven methods often rely on reanalysis data, which can be considered to be the closest to the true representation of weather conditions. However, reanalysis data is not produced in real-time and requires time for adjustment because prediction models are calibrated with observational data. This reanalysis data, such as ERA5, falls short in challenging real-world situations. Optimal preparedness necessitates predictions at least 72 hours in advance, beyond the capabilities of standard physics models. In response to these constraints, we present an approach that harnesses real-time Unified Model (UM) data, sidestepping the limitations of reanalysis data. Our model provides predictions at 6-hour intervals for up to 72 hours in advance and outperforms both state-of-the-art data-driven methods and numerical weather prediction models. In line with our efforts to mitigate adversities inflicted by \rthree{typhoons}, we release our preprocessed \textit{PHYSICS TRACK} dataset, which includes ERA5 reanalysis data, typhoon best-track, and UM forecast data.
△ Less
Submitted 28 January, 2024;
originally announced January 2024.
-
Seamless monolithic three-dimensional integration of single-crystalline films by growth
Authors:
Ki Seok Kim,
Seunghwan Seo,
Junyoung Kwon,
Doyoon Lee,
Changhyun Kim,
Jung-El Ryu,
Jekyung Kim,
Min-Kyu Song,
Jun Min Suh,
Hang-Gyo Jung,
Youhwan Jo,
Hogeun Ahn,
Sangho Lee,
Kyeongjae Cho,
Jongwook Jeon,
Minsu Seol,
Jin-Hong Park,
Sang Won Kim,
Jeehwan Kim
Abstract:
The demand for the three-dimensional (3D) integration of electronic components is on a steady rise. The through-silicon-via (TSV) technique emerges as the only viable method for integrating single-crystalline device components in a 3D format, despite encountering significant processing challenges. While monolithic 3D (M3D) integration schemes show promise, the seamless connection of single-crystal…
▽ More
The demand for the three-dimensional (3D) integration of electronic components is on a steady rise. The through-silicon-via (TSV) technique emerges as the only viable method for integrating single-crystalline device components in a 3D format, despite encountering significant processing challenges. While monolithic 3D (M3D) integration schemes show promise, the seamless connection of single-crystalline semiconductors without intervening wafers has yet to be demonstrated. This challenge arises from the inherent difficulty of growing single crystals on amorphous or polycrystalline surfaces post the back-end-of-the-line process at low temperatures to preserve the underlying circuitry. Consequently, a practical growth-based solution for M3D of single crystals remains elusive. Here, we present a method for growing single-crystalline channel materials, specifically composed of transition metal dichalcogenides, on amorphous and polycrystalline surfaces at temperatures lower than 400 °C. Building on this developed technique, we demonstrate the seamless monolithic integration of vertical single-crystalline logic transistor arrays. This accomplishment leads to the development of unprecedented vertical CMOS arrays, thereby constructing vertical inverters. Ultimately, this achievement sets the stage to pave the way for M3D integration of various electronic and optoelectronic hardware in the form of single crystals.
△ Less
Submitted 6 December, 2023; v1 submitted 5 December, 2023;
originally announced December 2023.
-
Robustness measures for quantifying nonlocality
Authors:
Kyunghyun Baek,
Junghee Ryu,
Jinhyoung Lee
Abstract:
We suggest generalized robustness for quantifying nonlocality and investigate its properties by comparing it with white-noise and standard robustness measures. As a result, we show that white-noise robustness does not fulfill monotonicity under local operation and shared randomness, whereas the other measures do. To compare the standard and generalized robustness measures, we introduce the concept…
▽ More
We suggest generalized robustness for quantifying nonlocality and investigate its properties by comparing it with white-noise and standard robustness measures. As a result, we show that white-noise robustness does not fulfill monotonicity under local operation and shared randomness, whereas the other measures do. To compare the standard and generalized robustness measures, we introduce the concept of inequivalence, which indicates a reversal in the order relationship depending on the choice of monotones. From an operational perspective, the inequivalence of monotones for resourceful objects implies the absence of free operations that connect them. Applying this concept, we find that standard and generalized robustness measures are inequivalent between even- and odd-dimensional cases up to eight dimensions. This is obtained using randomly performed CGLMP measurement settings in a maximally entangled state. This study contributes to the resource theory of nonlocality and sheds light on comparing monotones by using the concept of inequivalence valid for all resource theories.
△ Less
Submitted 12 November, 2023;
originally announced November 2023.
-
Analysis of the User Perception of Chatbots in Education Using A Partial Least Squares Structural Equation Modeling Approach
Authors:
Md Rabiul Hasan,
Nahian Ismail Chowdhury,
Md Hadisur Rahman,
Md Asif Bin Syed,
JuHyeong Ryu
Abstract:
The integration of Artificial Intelligence (AI) into education is a recent development, with chatbots emerging as a noteworthy addition to this transformative landscape. As online learning platforms rapidly advance, students need to adapt swiftly to excel in this dynamic environment. Consequently, understanding the acceptance of chatbots, particularly those employing Large Language Model (LLM) suc…
▽ More
The integration of Artificial Intelligence (AI) into education is a recent development, with chatbots emerging as a noteworthy addition to this transformative landscape. As online learning platforms rapidly advance, students need to adapt swiftly to excel in this dynamic environment. Consequently, understanding the acceptance of chatbots, particularly those employing Large Language Model (LLM) such as Chat Generative Pretrained Transformer (ChatGPT), Google Bard, and other interactive AI technologies, is of paramount importance. However, existing research on chatbots in education has overlooked key behavior-related aspects, such as Optimism, Innovativeness, Discomfort, Insecurity, Transparency, Ethics, Interaction, Engagement, and Accuracy, creating a significant literature gap. To address this gap, this study employs Partial Least Squares Structural Equation Modeling (PLS-SEM) to investigate the determinant of chatbots adoption in education among students, considering the Technology Readiness Index (TRI) and Technology Acceptance Model (TAM). Utilizing a five-point Likert scale for data collection, we gathered a total of 185 responses, which were analyzed using R-Studio software. We established 12 hypotheses to achieve its objectives. The results showed that Optimism and Innovativeness are positively associated with Perceived Ease of Use (PEOU) and Perceived Usefulness (PU). Conversely, Discomfort and Insecurity negatively impact PEOU, with only Insecurity negatively affecting PU. These findings provide insights for future technology designers, elucidating critical user behavior factors influencing chatbots adoption and utilization in educational contexts.
△ Less
Submitted 6 November, 2023;
originally announced November 2023.
-
Network Contention-Aware Cluster Scheduling with Reinforcement Learning
Authors:
Junyeol Ryu,
Jeongyoon Eo
Abstract:
With continuous advances in deep learning, distributed training is becoming common in GPU clusters. Specifically, for emerging workloads with diverse amounts, ratios, and patterns of communication, we observe that network contention can significantly degrade training throughput. However, widely used scheduling policies often face limitations as they are agnostic to network contention between jobs.…
▽ More
With continuous advances in deep learning, distributed training is becoming common in GPU clusters. Specifically, for emerging workloads with diverse amounts, ratios, and patterns of communication, we observe that network contention can significantly degrade training throughput. However, widely used scheduling policies often face limitations as they are agnostic to network contention between jobs. In this paper, we present a new approach to mitigate network contention in GPU clusters using reinforcement learning. We formulate GPU cluster scheduling as a reinforcement learning problem and opt to learn a network contention-aware scheduling policy that efficiently captures contention sensitivities and dynamically adapts scheduling decisions through continuous evaluation and improvement. We show that compared to widely used scheduling policies, our approach reduces average job completion time by up to 18.2\% and effectively cuts the tail job completion time by up to 20.7\% while allowing a preferable trade-off between average job completion time and resource utilization.
△ Less
Submitted 31 October, 2023;
originally announced October 2023.
-
EHRXQA: A Multi-Modal Question Answering Dataset for Electronic Health Records with Chest X-ray Images
Authors:
Seongsu Bae,
Daeun Kyung,
Jaehee Ryu,
Eunbyeol Cho,
Gyubok Lee,
Sunjun Kweon,
Jungwoo Oh,
Lei Ji,
Eric I-Chao Chang,
Tackeun Kim,
Edward Choi
Abstract:
Electronic Health Records (EHRs), which contain patients' medical histories in various multi-modal formats, often overlook the potential for joint reasoning across imaging and table modalities underexplored in current EHR Question Answering (QA) systems. In this paper, we introduce EHRXQA, a novel multi-modal question answering dataset combining structured EHRs and chest X-ray images. To develop o…
▽ More
Electronic Health Records (EHRs), which contain patients' medical histories in various multi-modal formats, often overlook the potential for joint reasoning across imaging and table modalities underexplored in current EHR Question Answering (QA) systems. In this paper, we introduce EHRXQA, a novel multi-modal question answering dataset combining structured EHRs and chest X-ray images. To develop our dataset, we first construct two uni-modal resources: 1) The MIMIC-CXR-VQA dataset, our newly created medical visual question answering (VQA) benchmark, specifically designed to augment the imaging modality in EHR QA, and 2) EHRSQL (MIMIC-IV), a refashioned version of a previously established table-based EHR QA dataset. By integrating these two uni-modal resources, we successfully construct a multi-modal EHR QA dataset that necessitates both uni-modal and cross-modal reasoning. To address the unique challenges of multi-modal questions within EHRs, we propose a NeuralSQL-based strategy equipped with an external VQA API. This pioneering endeavor enhances engagement with multi-modal EHR sources and we believe that our dataset can catalyze advances in real-world medical scenarios such as clinical decision-making and research. EHRXQA is available at https://github.com/baeseongsu/ehrxqa.
△ Less
Submitted 25 December, 2023; v1 submitted 28 October, 2023;
originally announced October 2023.
-
Gramian Attention Heads are Strong yet Efficient Vision Learners
Authors:
Jongbin Ryu,
Dongyoon Han,
Jongwoo Lim
Abstract:
We introduce a novel architecture design that enhances expressiveness by incorporating multiple head classifiers (\ie, classification heads) instead of relying on channel expansion or additional building blocks. Our approach employs attention-based aggregation, utilizing pairwise feature similarity to enhance multiple lightweight heads with minimal resource overhead. We compute the Gramian matrice…
▽ More
We introduce a novel architecture design that enhances expressiveness by incorporating multiple head classifiers (\ie, classification heads) instead of relying on channel expansion or additional building blocks. Our approach employs attention-based aggregation, utilizing pairwise feature similarity to enhance multiple lightweight heads with minimal resource overhead. We compute the Gramian matrices to reinforce class tokens in an attention layer for each head. This enables the heads to learn more discriminative representations, enhancing their aggregation capabilities. Furthermore, we propose a learning algorithm that encourages heads to complement each other by reducing correlation for aggregation. Our models eventually surpass state-of-the-art CNNs and ViTs regarding the accuracy-throughput trade-off on ImageNet-1K and deliver remarkable performance across various downstream tasks, such as COCO object instance segmentation, ADE20k semantic segmentation, and fine-grained visual classification datasets. The effectiveness of our framework is substantiated by practical experimental results and further underpinned by generalization error bound. We release the code publicly at: https://github.com/Lab-LVM/imagenet-models.
△ Less
Submitted 25 October, 2023;
originally announced October 2023.
-
Towards long-tailed, multi-label disease classification from chest X-ray: Overview of the CXR-LT challenge
Authors:
Gregory Holste,
Yiliang Zhou,
Song Wang,
Ajay Jaiswal,
Mingquan Lin,
Sherry Zhuge,
Yuzhe Yang,
Dongkyun Kim,
Trong-Hieu Nguyen-Mau,
Minh-Triet Tran,
Jaehyup Jeong,
Wongi Park,
Jongbin Ryu,
Feng Hong,
Arsh Verma,
Yosuke Yamagishi,
Changhyun Kim,
Hyeryeong Seo,
Myungjoo Kang,
Leo Anthony Celi,
Zhiyong Lu,
Ronald M. Summers,
George Shih,
Zhangyang Wang,
Yifan Peng
Abstract:
Many real-world image recognition problems, such as diagnostic medical imaging exams, are "long-tailed" $\unicode{x2013}$ there are a few common findings followed by many more relatively rare conditions. In chest radiography, diagnosis is both a long-tailed and multi-label problem, as patients often present with multiple findings simultaneously. While researchers have begun to study the problem of…
▽ More
Many real-world image recognition problems, such as diagnostic medical imaging exams, are "long-tailed" $\unicode{x2013}$ there are a few common findings followed by many more relatively rare conditions. In chest radiography, diagnosis is both a long-tailed and multi-label problem, as patients often present with multiple findings simultaneously. While researchers have begun to study the problem of long-tailed learning in medical image recognition, few have studied the interaction of label imbalance and label co-occurrence posed by long-tailed, multi-label disease classification. To engage with the research community on this emerging topic, we conducted an open challenge, CXR-LT, on long-tailed, multi-label thorax disease classification from chest X-rays (CXRs). We publicly release a large-scale benchmark dataset of over 350,000 CXRs, each labeled with at least one of 26 clinical findings following a long-tailed distribution. We synthesize common themes of top-performing solutions, providing practical recommendations for long-tailed, multi-label medical image classification. Finally, we use these insights to propose a path forward involving vision-language foundation models for few- and zero-shot disease classification.
△ Less
Submitted 1 April, 2024; v1 submitted 24 October, 2023;
originally announced October 2023.
-
Bochner-Riesz mean for the twisted Laplacian in $\mathbb R^2$
Authors:
Eunhee Jeong,
Sanghyuk Lee,
Jaehyeon Ryu
Abstract:
We study the Bochner-Riesz problem for the twisted Laplacian $\mathcal L$ on $\mathbb R^2$. For $p\in [1, \infty]\setminus\{2\}$, it has been conjectured that the Bochner-Riesz means $S_λ^δ(\mathcal L) f$ of order $δ$ converges in $L^p$ for every $f\in L^p$ if and only if $δ> \max(0,|(p-2)/p|-1/2)$. We prove the conjecture by obtaining uniform $L^p$ bounds on $S_λ^δ(\mathcal L)$ up to the sharp su…
▽ More
We study the Bochner-Riesz problem for the twisted Laplacian $\mathcal L$ on $\mathbb R^2$. For $p\in [1, \infty]\setminus\{2\}$, it has been conjectured that the Bochner-Riesz means $S_λ^δ(\mathcal L) f$ of order $δ$ converges in $L^p$ for every $f\in L^p$ if and only if $δ> \max(0,|(p-2)/p|-1/2)$. We prove the conjecture by obtaining uniform $L^p$ bounds on $S_λ^δ(\mathcal L)$ up to the sharp summability indices.
△ Less
Submitted 19 October, 2023;
originally announced October 2023.
-
Spontaneous Unidirectional Loop Extrusion Emerges from Symmetry Breaking of SMC Extension
Authors:
Andrea Bonato,
Jae-Won Jang,
Kyoung-Wook Moon,
Davide Michieletto,
Je-Kyung Ryu
Abstract:
DNA loop extrusion is arguably one of the most important players in genome organization. The precise mechanism by which loop extruding factors (LEFs) work is still unresolved and much debated. One of the major open questions in this field is how do LEFs establish and maintain unidirectional motion along DNA. In this paper, we use High-Speed AFM data to show that condensin hinge domain displays a s…
▽ More
DNA loop extrusion is arguably one of the most important players in genome organization. The precise mechanism by which loop extruding factors (LEFs) work is still unresolved and much debated. One of the major open questions in this field is how do LEFs establish and maintain unidirectional motion along DNA. In this paper, we use High-Speed AFM data to show that condensin hinge domain displays a structural, geometric constraint on the angle within which it can extend with respect to the DNA-bound domains. Using computer simulations, we then show that such a geometrical constraint results in a local symmetry breaking and is enough to rectify the extrusion process, yielding unidirectional loop extrusion along DNA. Our work highlights an overlooked geometric aspect of the loop extrusion process that may have a universal impact on SMC function across organisms.
△ Less
Submitted 15 September, 2023;
originally announced September 2023.
-
Spherical maximal functions on two step nilpotent Lie groups
Authors:
Jaehyeon Ryu,
Andreas Seeger
Abstract:
Consider $\mathbb R^d\times \mathbb R^m$ with the group structure of a two-step nilpotent Lie group and natural parabolic dilations. The maximal function originally introduced by Nevo and Thangavelu in the setting of the Heisenberg group deals with noncommutative convolutions associated to measures on spheres or generalized spheres in $\mathbb R^d$. We drop the nondegeneracy condition in the known…
▽ More
Consider $\mathbb R^d\times \mathbb R^m$ with the group structure of a two-step nilpotent Lie group and natural parabolic dilations. The maximal function originally introduced by Nevo and Thangavelu in the setting of the Heisenberg group deals with noncommutative convolutions associated to measures on spheres or generalized spheres in $\mathbb R^d$. We drop the nondegeneracy condition in the known results on Métivier groups and prove the sharp $L^p$ boundedness result for all two step nilpotent Lie groups with $d\ge 3$.
△ Less
Submitted 28 June, 2024; v1 submitted 14 September, 2023;
originally announced September 2023.
-
Nonlocal elliptic and parabolic equations with general stable operators in weighted Sobolev spaces
Authors:
Hongjie Dong,
Junhee Ryu
Abstract:
We study nonlocal elliptic and parabolic equations on $C^{1,τ}$ open sets in weighted Sobolev spaces, where $τ\in (0,1)$. The operators we consider are infinitesimal generators of symmetric stable Lévy processes, whose Lévy measures are allowed to be very singular. Additionally, for parabolic equations, the measures are assumed to be merely measurable in the time variable.
We study nonlocal elliptic and parabolic equations on $C^{1,τ}$ open sets in weighted Sobolev spaces, where $τ\in (0,1)$. The operators we consider are infinitesimal generators of symmetric stable Lévy processes, whose Lévy measures are allowed to be very singular. Additionally, for parabolic equations, the measures are assumed to be merely measurable in the time variable.
△ Less
Submitted 30 March, 2024; v1 submitted 10 September, 2023;
originally announced September 2023.
-
Symmetry-protected flatband condition for Hamiltonians with local symmetry
Authors:
Jung-Wan Ryu,
Alexei Andreanov,
Hee Chul Park,
Jae-Ho Han
Abstract:
We derive symmetry-based conditions for tight-binding Hamiltonians with flatbands to have compact localized eigenstates occupying a single unit cell. The conditions are based on unitary operators commuting with the Hamiltonian and associated with local symmetries that guarantee compact localized states and a flatband. We illustrate the conditions for compact localized states and flatbands with sim…
▽ More
We derive symmetry-based conditions for tight-binding Hamiltonians with flatbands to have compact localized eigenstates occupying a single unit cell. The conditions are based on unitary operators commuting with the Hamiltonian and associated with local symmetries that guarantee compact localized states and a flatband. We illustrate the conditions for compact localized states and flatbands with simple Hamiltonians with given symmetries. We also apply these results to general cases such as the Hamiltonian with long-range hoppings and higher-dimensional Hamiltonian.
△ Less
Submitted 28 August, 2023;
originally announced August 2023.
-
Computational Synthesis of Wearable Robot Mechanisms: Application to Hip-Joint Mechanisms
Authors:
Seok Won Kang,
Jegyeong Ryu,
Suh In Kim,
Youngsoo Kim,
Yoon Young Kim
Abstract:
Since wearable linkage mechanisms could control the moment transmission from actuator(s) to wearers, they can help ensure that even low-cost wearable systems provide advanced functionality tailored to users' needs. For example, if a hip mechanism transforms an input torque into a spatially-varying moment, a wearer can get effective assistance both in the sagittal and frontal planes during walking,…
▽ More
Since wearable linkage mechanisms could control the moment transmission from actuator(s) to wearers, they can help ensure that even low-cost wearable systems provide advanced functionality tailored to users' needs. For example, if a hip mechanism transforms an input torque into a spatially-varying moment, a wearer can get effective assistance both in the sagittal and frontal planes during walking, even with an affordable single-actuator system. However, due to the combinatorial nature of the linkage mechanism design space, the topologies of such nonlinear-moment-generating mechanisms are challenging to determine, even with significant computational resources and numerical data. Furthermore, on-premise production development and interactive design are nearly impossible in conventional synthesis approaches. Here, we propose an innovative autonomous computational approach for synthesizing such wearable robot mechanisms, eliminating the need for exhaustive searches or numerous data sets. Our method transforms the synthesis problem into a gradient-based optimization problem with sophisticated objective and constraint functions while ensuring the desired degree of freedom, range of motion, and force transmission characteristics. To generate arbitrary mechanism topologies and dimensions, we employed a unified ground model. By applying the proposed method for the design of hip joint mechanisms, the topologies and dimensions of non-series-type hip joint mechanisms were obtained. Biomechanical simulations validated its multi-moment assistance capability, and its wearability was verified via prototype fabrication. The proposed design strategy can open a new way to design various wearable robot mechanisms, such as shoulders, knees, and ankles.
△ Less
Submitted 21 August, 2023;
originally announced August 2023.
-
Fine-Grained Self-Supervised Learning with Jigsaw Puzzles for Medical Image Classification
Authors:
Wongi Park,
Jongbin Ryu
Abstract:
Classifying fine-grained lesions is challenging due to minor and subtle differences in medical images. This is because learning features of fine-grained lesions with highly minor differences is very difficult in training deep neural networks. Therefore, in this paper, we introduce Fine-Grained Self-Supervised Learning(FG-SSL) method for classifying subtle lesions in medical images. The proposed me…
▽ More
Classifying fine-grained lesions is challenging due to minor and subtle differences in medical images. This is because learning features of fine-grained lesions with highly minor differences is very difficult in training deep neural networks. Therefore, in this paper, we introduce Fine-Grained Self-Supervised Learning(FG-SSL) method for classifying subtle lesions in medical images. The proposed method progressively learns the model through hierarchical block such that the cross-correlation between the fine-grained Jigsaw puzzle and regularized original images is close to the identity matrix. We also apply hierarchical block for progressive fine-grained learning, which extracts different information in each step, to supervised learning for discovering subtle differences. Our method does not require an asymmetric model, nor does a negative sampling strategy, and is not sensitive to batch size. We evaluate the proposed fine-grained self-supervised learning method on comprehensive experiments using various medical image recognition datasets. In our experiments, the proposed method performs favorably compared to existing state-of-the-art approaches on the widely-used ISIC2018, APTOS2019, and ISIC2017 datasets.
△ Less
Submitted 9 August, 2023;
originally announced August 2023.
-
Robust Asymmetric Loss for Multi-Label Long-Tailed Learning
Authors:
Wongi Park,
Inhyuk Park,
Sungeun Kim,
Jongbin Ryu
Abstract:
In real medical data, training samples typically show long-tailed distributions with multiple labels. Class distribution of the medical data has a long-tailed shape, in which the incidence of different diseases is quite varied, and at the same time, it is not unusual for images taken from symptomatic patients to be multi-label diseases. Therefore, in this paper, we concurrently address these two i…
▽ More
In real medical data, training samples typically show long-tailed distributions with multiple labels. Class distribution of the medical data has a long-tailed shape, in which the incidence of different diseases is quite varied, and at the same time, it is not unusual for images taken from symptomatic patients to be multi-label diseases. Therefore, in this paper, we concurrently address these two issues by putting forth a robust asymmetric loss on the polynomial function. Since our loss tackles both long-tailed and multi-label classification problems simultaneously, it leads to a complex design of the loss function with a large number of hyper-parameters. Although a model can be highly fine-tuned due to a large number of hyper-parameters, it is difficult to optimize all hyper-parameters at the same time, and there might be a risk of overfitting a model. Therefore, we regularize the loss function using the Hill loss approach, which is beneficial to be less sensitive against the numerous hyper-parameters so that it reduces the risk of overfitting the model. For this reason, the proposed loss is a generic method that can be applied to most medical image classification tasks and does not make the training process more time-consuming. We demonstrate that the proposed robust asymmetric loss performs favorably against the long-tailed with multi-label medical image classification in addition to the various long-tailed single-label datasets. Notably, our method achieves Top-5 results on the CXR-LT dataset of the ICCV CVAMD 2023 competition. We opensource our implementation of the robust asymmetric loss in the public repository: https://github.com/kalelpark/RAL.
△ Less
Submitted 10 August, 2023;
originally announced August 2023.
-
BubbleML: A Multi-Physics Dataset and Benchmarks for Machine Learning
Authors:
Sheikh Md Shakeel Hassan,
Arthur Feeney,
Akash Dhruv,
Jihoon Kim,
Youngjoon Suh,
Jaiyoung Ryu,
Yoonjin Won,
Aparna Chandramowlishwaran
Abstract:
In the field of phase change phenomena, the lack of accessible and diverse datasets suitable for machine learning (ML) training poses a significant challenge. Existing experimental datasets are often restricted, with limited availability and sparse ground truth data, impeding our understanding of this complex multiphysics phenomena. To bridge this gap, we present the BubbleML Dataset \footnote{\la…
▽ More
In the field of phase change phenomena, the lack of accessible and diverse datasets suitable for machine learning (ML) training poses a significant challenge. Existing experimental datasets are often restricted, with limited availability and sparse ground truth data, impeding our understanding of this complex multiphysics phenomena. To bridge this gap, we present the BubbleML Dataset \footnote{\label{git_dataset}\url{https://github.com/HPCForge/BubbleML}} which leverages physics-driven simulations to provide accurate ground truth information for various boiling scenarios, encompassing nucleate pool boiling, flow boiling, and sub-cooled boiling. This extensive dataset covers a wide range of parameters, including varying gravity conditions, flow rates, sub-cooling levels, and wall superheat, comprising 79 simulations. BubbleML is validated against experimental observations and trends, establishing it as an invaluable resource for ML research. Furthermore, we showcase its potential to facilitate exploration of diverse downstream tasks by introducing two benchmarks: (a) optical flow analysis to capture bubble dynamics, and (b) operator networks for learning temperature dynamics. The BubbleML dataset and its benchmarks serve as a catalyst for advancements in ML-driven research on multiphysics phase change phenomena, enabling the development and comparison of state-of-the-art techniques and models.
△ Less
Submitted 24 August, 2023; v1 submitted 27 July, 2023;
originally announced July 2023.
-
Dynamics in non-Hermitian systems with nonreciprocal coupling
Authors:
Jung-Wan Ryu
Abstract:
We reveal that non-Hermitian Hamiltonians with nonreciprocal coupling can achieve amplification of initial states without external gain due to a kind of inherent source. We discuss the source and its effect on time evolution in terms of complex eigenenergies and non-orthogonal eigenstates. Demonstrating two extreme cases of Hamiltonians, namely one having complex eigenenergies with orthogonal eige…
▽ More
We reveal that non-Hermitian Hamiltonians with nonreciprocal coupling can achieve amplification of initial states without external gain due to a kind of inherent source. We discuss the source and its effect on time evolution in terms of complex eigenenergies and non-orthogonal eigenstates. Demonstrating two extreme cases of Hamiltonians, namely one having complex eigenenergies with orthogonal eigenstates and one having real eigenenergies with non-orthogonal eigenstates, we elucidate the differences between the amplifications from complex eigenenergies and from non-orthogonal eigenstates.
△ Less
Submitted 22 July, 2023;
originally announced July 2023.
-
PromptCrafter: Crafting Text-to-Image Prompt through Mixed-Initiative Dialogue with LLM
Authors:
Seungho Baek,
Hyerin Im,
Jiseung Ryu,
Juhyeong Park,
Takyeon Lee
Abstract:
Text-to-image generation model is able to generate images across a diverse range of subjects and styles based on a single prompt. Recent works have proposed a variety of interaction methods that help users understand the capabilities of models and utilize them. However, how to support users to efficiently explore the model's capability and to create effective prompts are still open-ended research…
▽ More
Text-to-image generation model is able to generate images across a diverse range of subjects and styles based on a single prompt. Recent works have proposed a variety of interaction methods that help users understand the capabilities of models and utilize them. However, how to support users to efficiently explore the model's capability and to create effective prompts are still open-ended research questions. In this paper, we present PromptCrafter, a novel mixed-initiative system that allows step-by-step crafting of text-to-image prompt. Through the iterative process, users can efficiently explore the model's capability, and clarify their intent. PromptCrafter also supports users to refine prompts by answering various responses to clarifying questions generated by a Large Language Model. Lastly, users can revert to a desired step by reviewing the work history. In this workshop paper, we discuss the design process of PromptCrafter and our plans for follow-up studies.
△ Less
Submitted 18 July, 2023;
originally announced July 2023.