Search | arXiv e-print repository

Sparse $L^1$-Autoencoders for Scientific Data Compression

Authors: Matthias Chung, Rick Archibald, Paul Atzberger, Jack Michael Solomon

Abstract: Scientific datasets present unique challenges for machine learning-driven compression methods, including more stringent requirements on accuracy and mitigation of potential invalidating artifacts. Drawing on results from compressed sensing and rate-distortion theory, we introduce effective data compression methods by developing autoencoders using high dimensional latent spaces that are $L^1$-regularized to obtain sparse low dimensional representations. We show how these information-rich latent spaces can be used to mitigate blurring and other artifacts to obtain highly effective data compression methods for scientific data. We demonstrate our methods for short angle scattering (SAS) datasets showing they can achieve compression ratios around two orders of magnitude and in some cases better. Our compression methods show promise for use in addressing current bottlenecks in transmission, storage, and analysis in high-performance distributed computing environments. This is central to processing the large volume of SAS data being generated at shared experimental facilities around the world to support scientific investigations. Our approaches provide general ways for obtaining specialized compression methods for targeted scientific datasets.

Submitted 23 May, 2024; originally announced May 2024.

Comments: 11 pages, 6 figures
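For the sparse $L^1$-autoencoder entry above, the idea of an $L^1$-regularized latent space can be sketched as a training objective: a reconstruction error plus a penalty on the absolute values of the latent code. This is only an illustrative sketch, not the authors' implementation; the function names, the mean-squared-error choice, and the weight `lam` are assumptions introduced here.

```python
def l1_autoencoder_loss(x, x_hat, z, lam):
    """Reconstruction error plus an L1 penalty on the latent code.

    x      : original signal (list of floats)
    x_hat  : decoder output, same length as x
    z      : latent code produced by the encoder
    lam    : assumed regularization weight (hypothetical parameter)
    """
    mse = sum((a - b) ** 2 for a, b in zip(x, x_hat)) / len(x)
    l1_penalty = sum(abs(v) for v in z)
    return mse + lam * l1_penalty


def count_nonzeros(z, tol=1e-8):
    """Number of latent entries that would actually need to be stored."""
    return sum(1 for v in z if abs(v) > tol)


if __name__ == "__main__":
    x = [1.0, 2.0, 3.0, 4.0]
    x_hat = [1.0, 2.0, 3.0, 4.0]      # perfect reconstruction
    z = [0.0, 0.0, 3.0, 0.0, -1.0]    # sparse, high dimensional code
    print(l1_autoencoder_loss(x, x_hat, z, lam=0.1))
    print(count_nonzeros(z))
```

The penalty rewards codes with few nonzero entries, so a high dimensional latent vector can still be stored compactly by keeping only the nonzero values and their positions.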

arXiv:2405.13220 [pdf, other]

Paired Autoencoders for Inverse Problems

Authors: Matthias Chung, Emma Hart, Julianne Chung, Bas Peters, Eldad Haber

Abstract: We consider the solution of nonlinear inverse problems where the forward problem is a discretization of a partial differential equation. Such problems are notoriously difficult to solve in practice and require minimizing a combination of a data-fit term and a regularization term. The main computational bottleneck of typical algorithms is the direct estimation of the data misfit. Therefore, likelihood-free approaches have become appealing alternatives. Nonetheless, difficulties in generalization and limitations in accuracy have hindered their broader utility and applicability. In this work, we use a paired autoencoder framework as a likelihood-free estimator for inverse problems. We show that the use of such an architecture allows us to construct a solution efficiently and to overcome some known open problems when using likelihood-free estimators. In particular, our framework can assess the quality of the solution and improve on it if needed. We demonstrate the viability of our approach using examples from full waveform inversion and inverse electromagnetic imaging.

Submitted 21 May, 2024; originally announced May 2024.

Comments: 18 pages, 6 figures

arXiv:2405.07835 [pdf, other]

Topological Embedding of Human Brain Networks with Applications to Dynamics of Temporal Lobe Epilepsy

Authors: Moo K. Chung, Ji Bi Che, Veena A. Nair, Camille Garcia Ramos, Jedidiah Ray Mathis, Vivek Prabhakaran, Elizabeth Meyerand, Bruce P. Hermann, Jeffrey R. Binder, Aaron F. Struck

Abstract: We introduce a novel, data-driven topological data analysis (TDA) approach for embedding brain networks into a lower-dimensional space in quantifying the dynamics of temporal lobe epilepsy (TLE) obtained from resting-state functional magnetic resonance imaging (rs-fMRI). This embedding facilitates the orthogonal projection of 0D and 1D topological features, allowing for the visualization and modeling of the dynamics of functional human brain networks in a resting state. We then quantify the topological disparities between networks to determine the coordinates for embedding. This framework enables us to conduct a coherent statistical inference within the embedded space. Our results indicate that brain network topology in TLE patients exhibits increased rigidity in 0D topology but more rapid flections compared to that of normal controls in 1D topology.

Submitted 13 May, 2024; originally announced May 2024.

arXiv:2405.07440 [pdf, other]

Maximizing Information Gain in Privacy-Aware Active Learning of Email Anomalies

Authors: Mu-Huan Miles Chung, Sharon Li, Jaturong Kongmanee, Lu Wang, Yuhong Yang, Calvin Giang, Khilan Jerath, Abhay Raman, David Lie, Mark Chignell

Abstract: Redacted emails satisfy most privacy requirements but they make it more difficult to detect anomalous emails that may be indicative of data exfiltration. In this paper we develop an enhanced method of Active Learning using an information gain maximizing heuristic, and we evaluate its effectiveness in a real world setting where only redacted versions of email could be labeled by human analysts due to privacy concerns. In the first case study we examined how Active Learning should be carried out. We found that model performance was best when a single highly skilled (in terms of the labelling task) analyst provided the labels. In the second case study we used confidence ratings to estimate the labeling uncertainty of analysts and then prioritized instances for labeling based on the expected information gain (the difference between model uncertainty and analyst uncertainty) that would be provided by labelling each instance. We found that the information gain maximizing heuristic improved model performance over existing sampling methods for Active Learning. Based on the results obtained, we recommend that analysts should be screened, and possibly trained, prior to implementation of Active Learning in cybersecurity applications. We also recommend that the information gain maximizing sample method (based on expert confidence) should be used in early stages of Active Learning, provided that well-calibrated confidence can be obtained.
We also note that the expertise of analysts should be assessed prior to Active Learning, as we found that analysts with lower labelling skill had poorly calibrated (over-) confidence in their labels.

Submitted 12 May, 2024; originally announced May 2024.

Comments: arXiv admin note: substantial text overlap with arXiv:2303.00870
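For the active learning entry above, the prioritization rule described in the second case study (model uncertainty minus analyst uncertainty) can be sketched as follows. The sketch assumes binary labels, uses binary entropy as the uncertainty measure for both model and analyst, and treats an analyst's confidence rating as a probability of labeling correctly. These choices and all names in the code are illustrative assumptions, not the authors' method.

```python
import math


def binary_entropy(p: float) -> float:
    """Shannon entropy (in bits) of a Bernoulli(p) outcome."""
    if p <= 0.0 or p >= 1.0:
        return 0.0
    return -(p * math.log2(p) + (1.0 - p) * math.log2(1.0 - p))


def expected_information_gain(model_prob: float, analyst_confidence: float) -> float:
    """Model uncertainty minus analyst uncertainty (hypothetical scoring).

    model_prob         : model's predicted probability that an email is anomalous
    analyst_confidence : assumed probability that the analyst's label is correct
    """
    return binary_entropy(model_prob) - binary_entropy(analyst_confidence)


def rank_for_labeling(candidates):
    """Sort (email_id, model_prob, analyst_confidence) tuples, highest gain first."""
    return sorted(
        candidates,
        key=lambda c: expected_information_gain(c[1], c[2]),
        reverse=True,
    )


if __name__ == "__main__":
    pool = [
        ("email-a", 0.50, 0.95),  # model unsure, analyst confident
        ("email-b", 0.50, 0.60),  # model unsure, analyst also unsure
        ("email-c", 0.99, 0.95),  # model already confident
    ]
    for email_id, p, c in rank_for_labeling(pool):
        print(email_id, round(expected_information_gain(p, c), 3))
```

Under these assumptions, instances where the model is unsure but the analyst is confident are labeled first, which is also why the abstract stresses well-calibrated analyst confidence.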

arXiv:2403.19892 [pdf]

Heat Transfer Coefficients of Moving Particle Beds from Flow-Dependent Particle Bed Thermal Conductivity and Near-Wall Resistance

Authors: Sarath R. Adapa, Xintong Zhang, Tianshi Feng, Ka Man Chung, Kevin J. Albrecht, Clifford K. Ho, Dimitri A. Madden, Renkun Chen

Abstract: Determination of heat transfer coefficients for flowing packed particle beds is essential to the design of particle heat exchangers, and other thermal processes. While such dense granular flows fall into the well-known plug-flow regime, the discrete nature of granular materials alters the thermal transport processes in both the near-wall and bulk regions of flowing particle beds from their stationary counterparts. As a result, heat transfer correlations based on the stationary particle bed thermal conductivity could be inadequate for flowing particles in a heat exchanger. Earlier works have achieved reasonable agreement with experiments by treating granular media as a plug-flow continuum with a near-wall thermal resistance in series. However, the properties of the continuum were often obtained from measurements on stationary beds owing to the difficulty of flowing bed measurements. In this work, it was found that the properties of a stationary bed are highly sensitive to the method of particle packing and there is a decrease in the particle bed thermal conductivity and increase in the near-wall thermal resistance, measured as an effective air gap thickness, on the onset of particle flow. These variations in the thermophysical properties of stationary and flowing particle beds can lead to errors in heat transfer coefficient calculations.
Therefore, the heat transfer coefficients for granular flows were calculated using experimentally determined flowing particle bed thermal conductivity and near-wall air gap for ceramic particles CARBOCP40/100 (275 µm), HSP40/70 (404 µm) and HSP16/30 (956 µm), at velocities of 5-15 mm s$^{-1}$ and temperatures of 300-650 °C. The thermal conductivity and air gap values for CP40/100 and HSP40/70 were further used to calculate heat transfer coefficients across different particle bed temperatures and velocities for different parallel-plate heat exchanger dimensions.

Submitted 27 May, 2024; v1 submitted 28 March, 2024; originally announced March 2024.
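For the moving particle bed entry above, the picture of a plug-flow continuum with a near-wall thermal resistance in series can be written compactly. This is a sketch under stated assumptions: $h_{\text{bulk}}$ denotes the heat transfer coefficient of the plug-flow continuum alone, $\delta_{\text{gap}}$ the effective air gap thickness, and $k_{\text{air}}$ the thermal conductivity of air. The symbols are introduced here for illustration and do not appear in the abstract.

$$\frac{1}{h} = \frac{\delta_{\text{gap}}}{k_{\text{air}}} + \frac{1}{h_{\text{bulk}}}$$

Read this way, a lower flowing-bed thermal conductivity (which reduces $h_{\text{bulk}}$) and a thicker effective air gap both lower the overall coefficient $h$, so using stationary-bed values for either term would tend to overestimate it.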

arXiv:2403.10774 [pdf, other]

Detecting Bias in Large Language Models: Fine-tuned KcBERT

Authors: J. K. Lee, T. M. Chung

Abstract: The rapid advancement of large language models (LLMs) has enabled natural language processing capabilities similar to those of humans, and LLMs are being widely utilized across various societal domains such as education and healthcare. While the versatility of these models has increased, they have the potential to generate subjective and normative language, leading to discriminatory treatment or outcomes among social groups, especially due to online offensive language. In this paper, we define such harm as societal bias and assess ethnic, gender, and racial biases in a model fine-tuned with Korean comments using Bidirectional Encoder Representations from Transformers (KcBERT) and KOLD data through template-based Masked Language Modeling (MLM). To quantitatively evaluate biases, we employ LPBS and CBS metrics. Compared to KcBERT, the fine-tuned model shows a reduction in ethnic bias but demonstrates significant changes in gender and racial biases. Based on these results, we propose two methods to mitigate societal bias. Firstly, a data balancing approach during the pre-training phase adjusts the uniformity of data by aligning the distribution of the occurrences of specific words and converting surrounding harmful words into non-harmful words. Secondly, during the in-training phase, we apply Debiasing Regularization by adjusting dropout and regularization, confirming a decrease in training loss. Our contribution lies in demonstrating that societal bias exists in Korean language models due to language-dependent characteristics.

Submitted 15 March, 2024; originally announced March 2024.

Comments: 14 pages, 5 figures

arXiv:2403.10764 [pdf, other]

ECRC: Emotion-Causality Recognition in Korean Conversation for GCN

Authors: J. K. Lee, T. M. Chung

Abstract: In this multi-task learning study on simultaneous analysis of emotions and their underlying causes in conversational contexts, deep neural network methods were employed to effectively process and train large labeled datasets. However, these approaches are typically limited to conducting context analyses across the entire corpus because they rely on one of the two methods: word- or sentence-level embedding. The former struggles with polysemy and homonyms, whereas the latter causes information loss when processing long sentences. In this study, we overcome the limitations of previous embeddings by utilizing both word- and sentence-level embeddings. Furthermore, we propose the emotion-causality recognition in conversation (ECRC) model, which is based on a novel graph structure, thereby leveraging the strengths of both embedding methods. This model uniquely integrates the bidirectional long short-term memory (Bi-LSTM) and graph neural network (GCN) models for Korean conversation analysis. Compared with models that rely solely on one embedding method, the proposed model effectively structures abstract concepts, such as language features and relationships, thereby minimizing information loss. To assess model performance, we compared the multi-task learning results of three deep neural network models with varying graph structures. Additionally, we evaluated the proposed model using Korean and English datasets.
The experimental results show that the proposed model performs better in emotion and causality multi-task learning (74.62% and 75.30%, respectively) when node and edge characteristics are incorporated into the graph structure. Similar results were recorded for the Korean ECC and Wellness datasets (74.62% and 73.44%, respectively) with 71.35% on the IEMOCAP English dataset.

Submitted 15 March, 2024; originally announced March 2024.

Comments: 10 pages, 5 figures

arXiv:2403.06687 [pdf, other]

Advancing Graph Neural Networks with HL-HGAT: A Hodge-Laplacian and Attention Mechanism Approach for Heterogeneous Graph-Structured Data

Authors: Jinghan Huang, Qiufeng Chen, Yijun Bian, Pengli Zhu, Nanguang Chen, Moo K. Chung, Anqi Qiu

Abstract: Graph neural networks (GNNs) have proven effective in capturing relationships among nodes in a graph. This study introduces a novel perspective by considering a graph as a simplicial complex, encompassing nodes, edges, triangles, and $k$-simplices, enabling the definition of graph-structured data on any $k$-simplices. Our contribution is the Hodge-Laplacian heterogeneous graph attention network (HL-HGAT), designed to learn heterogeneous signal representations across $k$-simplices. The HL-HGAT incorporates three key components: HL convolutional filters (HL-filters), simplicial projection (SP), and simplicial attention pooling (SAP) operators, applied to $k$-simplices. HL-filters leverage the unique topology of $k$-simplices encoded by the Hodge-Laplacian (HL) operator, operating within the spectral domain of the $k$-th HL operator. To address computation challenges, we introduce a polynomial approximation for HL-filters, exhibiting spatial localization properties. Additionally, we propose a pooling operator to coarsen $k$-simplices, combining features through simplicial attention mechanisms of self-attention and cross-attention via transformers and SP operators, capturing topological interconnections across multiple dimensions of simplices. The HL-HGAT is comprehensively evaluated across diverse graph applications, including NP-hard problems, graph multi-label and classification challenges, and graph regression tasks in logistics, computer vision, biology, chemistry, and neuroscience.
The results demonstrate the model's efficacy and versatility in handling a wide range of graph-based scenarios.

Submitted 22 April, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

arXiv:2403.03212 [pdf, other]

Performance of a modular ton-scale pixel-readout liquid argon time projection chamber

Authors: DUNE Collaboration, A. Abed Abud, B. Abi, R. Acciarri, M. A. Acero, M. R. Adames, G. Adamov, M. Adamowski, D. Adams, M. Adinolfi, C. Adriano, A. Aduszkiewicz, J. Aguilar, B. Aimard, F. Akbar, K. Allison, S. Alonso Monsalve, M. Alrashed, A. Alton, R. Alvarez, T. Alves, H. Amar, P. Amedo, J. Anderson, D. A. Andrade, et al. (1340 additional authors not shown)

Abstract: The Module-0 Demonstrator is a single-phase 600 kg liquid argon time projection chamber operated as a prototype for the DUNE liquid argon near detector. Based on the ArgonCube design concept, Module-0 features a novel 80k-channel pixelated charge readout and advanced high-coverage photon detection system. In this paper, we present an analysis of an eight-day data set consisting of 25 million cosmic ray events collected in the spring of 2021. We use this sample to demonstrate the imaging performance of the charge and light readout systems as well as the signal correlations between the two. We also report argon purity and detector uniformity measurements, and provide comparisons to detector simulations.

Submitted 5 March, 2024; originally announced March 2024.

Comments: 47 pages, 41 figures

Report number: FERMILAB-PUB-24-0073-LBNF

arXiv:2402.15539 [pdf, ps, other]

Speech Corpus for Korean Children with Autism Spectrum Disorder: Towards Automatic Assessment Systems

Authors: Seonwoo Lee, Jihyun Mun, Sunhee Kim, Minhwa Chung

Abstract: Despite the growing demand for digital therapeutics for children with Autism Spectrum Disorder (ASD), there is currently no speech corpus available for Korean children with ASD. This paper introduces a speech corpus specifically designed for Korean children with ASD, aiming to advance speech technologies such as pronunciation and severity evaluation. Speech recordings from speech and language evaluation sessions were transcribed, and annotated for articulatory and linguistic characteristics. Three speech and language pathologists rated these recordings for social communication severity (SCS) and pronunciation proficiency (PP) using a 3-point Likert scale. The total number of participants will be 300 for children with ASD and 50 for typically developing (TD) children. The paper also analyzes acoustic and linguistic features extracted from speech data collected and completed for annotation from 73 children with ASD and 9 TD children to investigate the characteristics of children with ASD and identify significant features that correlate with the clinical scores. The results reveal some speech and linguistic characteristics in children with ASD that differ from those in TD children or another subgroup of ASD categorized by clinical scores, demonstrating the potential for developing automatic assessment systems for SCS and PP.

Submitted 23 February, 2024; originally announced February 2024.

Comments: 11 pages, Accepted for LREC-COLING 2024

arXiv:2402.01568 [pdf, other]

Doping Liquid Argon with Xenon in ProtoDUNE Single-Phase: Effects on Scintillation Light

Authors: DUNE Collaboration, A. Abed Abud, B. Abi, R. Acciarri, M. A. Acero, M. R. Adames, G. Adamov, M. Adamowski, D. Adams, M. Adinolfi, C. Adriano, A. Aduszkiewicz, J. Aguilar, B. Aimard, F. Akbar, K. Allison, S. Alonso Monsalve, M. Alrashed, A. Alton, R. Alvarez, H. Amar Es-sghir, P. Amedo, J. Anderson, D. A. Andrade, C. Andreopoulos, et al. (1300 additional authors not shown)

Abstract: Doping of liquid argon TPCs (LArTPCs) with a small concentration of xenon is a technique for light-shifting and facilitates the detection of the liquid argon scintillation light. In this paper, we present the results of the first doping test ever performed in a kiloton-scale LArTPC. From February to May 2020, we carried out this special run in the single-phase DUNE Far Detector prototype (ProtoDUNE-SP) at CERN, featuring 770 t of total liquid argon mass with 410 t of fiducial mass. The goal of the run was to measure the light and charge response of the detector to the addition of xenon, up to a concentration of 18.8 ppm. The main purpose was to test the possibility for reduction of non-uniformities in light collection, caused by deployment of photon detectors only within the anode planes. Light collection was analysed as a function of the xenon concentration, by using the pre-existing photon detection system (PDS) of ProtoDUNE-SP and an additional smaller set-up installed specifically for this run. In this paper we first summarize our current understanding of the argon-xenon energy transfer process and the impact of the presence of nitrogen in argon with and without xenon dopant. We then describe the key elements of ProtoDUNE-SP and the injection method deployed. Two dedicated photon detectors were able to collect the light produced by xenon and the total light. The ratio of these components was measured to be about 0.65 as 18.8 ppm of xenon were injected.
We performed studies of the collection efficiency as a function of the distance between tracks and light detectors, demonstrating enhanced uniformity of response for the anode-mounted PDS. We also show that xenon doping can substantially recover light losses due to contamination of the liquid argon by nitrogen.

Submitted 9 February, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

Comments: 35 pages, 20 figures

Report number: CERN-EP-2024-024; FERMILAB-PUB-23-0819-LBNF

arXiv:2401.11083 [pdf]

New Beam Dynamics Code for Cyclotron Analysis

Authors: G-H. Kim, H-J. Cho, B-H. Oh, G-R. Hahn, M. Chung, S. Park, S. Shin

Abstract: This paper describes beam dynamics simulation with the transfer matrix method for cyclotrons. Starting from a description of the equation of motion in the cyclotron, lattice functions were determined with the transfer matrix method, and the solutions for the 2nd-order nonlinear Hamiltonian were introduced and used in phase space particle tracking. Based on this description of beam dynamics in the cyclotron, a simulation code was also developed for cyclotron design.

Submitted 19 January, 2024; originally announced January 2024.

Comments: arXiv admin note: substantial text overlap with arXiv:1807.01397

arXiv:2401.05343 [pdf, other]

Spectral Topological Data Analysis of Brain Signals

Authors: Anass B. El-Yaagoubi, Shuhao Jiao, Moo K. Chung, Hernando Ombao

Abstract: Topological data analysis (TDA) has become a powerful approach over the last twenty years, mainly due to its ability to capture the shape and the geometry inherent in the data. Persistence homology, which is a particular tool in TDA, has been demonstrated to be successful in analyzing functional brain connectivity. One limitation of standard approaches is that they use arbitrarily chosen threshold values for analyzing connectivity matrices. To overcome this weakness, TDA provides a filtration of the weighted brain network across a range of threshold values. However, current analyses of the topological structure of functional brain connectivity primarily rely on overly simplistic connectivity measures, such as the Pearson correlation. These measures do not provide information about the specific oscillators that drive dependence within the brain network. Here, we develop a frequency-specific approach that utilizes coherence, a measure of dependence in the spectral domain, to evaluate the functional connectivity of the brain. Our approach, the spectral TDA (STDA), has the ability to capture more nuanced and detailed information about the underlying brain networks. The proposed STDA method leads to a novel topological summary, the spectral landscape, which is a 2D-generalization of the persistence landscape.
Using the novel spectral landscape, we analyze the EEG brain connectivity of patients with attention deficit hyperactivity disorder (ADHD) and shed light on the frequency-specific differences in the topology of brain connectivity between the controls and ADHD patients.

Submitted 1 December, 2023; originally announced January 2024.

Comments: 28 pages, 23 figures

arXiv:2312.17505 [pdf, other]

Leveraging Open-Vocabulary Diffusion to Camouflaged Instance Segmentation

Authors: Tuan-Anh Vu, Duc Thanh Nguyen, Qing Guo, Binh-Son Hua, Nhat Minh Chung, Ivor W. Tsang, Sai-Kit Yeung

Abstract: Text-to-image diffusion techniques have shown exceptional capability of producing high-quality images from text descriptions. This indicates that there exists a strong correlation between the visual and textual domains. In addition, text-image discriminative models such as CLIP excel in image labelling from text prompts, thanks to the rich and diverse information available from open concepts. In this paper, we leverage these technical advances to solve a challenging problem in computer vision: camouflaged instance segmentation. Specifically, we propose a method built upon a state-of-the-art diffusion model, empowered by open-vocabulary to learn multi-scale textual-visual features for camouflaged object representations. Such cross-domain representations are desirable in segmenting camouflaged objects where visual cues are subtle to distinguish the objects from the background, especially in segmenting novel objects which are not seen in training. We also develop technically supportive components to effectively fuse cross-domain features and engage relevant features towards respective foreground objects. We validate our method and compare it with existing ones on several benchmark datasets of camouflaged instance segmentation and generic open-vocabulary instance segmentation. Experimental results confirm the advances of our method over existing ones. We will publish our code and pre-trained models to support future research.

Submitted 29 December, 2023; originally announced December 2023.

Comments: This work is under review

arXiv:2312.06279 [pdf, other]

Regional Correlation Aided Mobile Traffic Prediction with Spatiotemporal Deep Learning

Authors: JeongJun Park, Lusungu J. Mwasinga, Huigyu Yang, Syed M. Raza, Duc-Tai Le, Moonseong Kim, Min Young Chung, Hyunseung Choo

Abstract: Mobile traffic data in urban regions shows differentiated patterns during different hours of the day. The exploitation of these patterns enables highly accurate mobile traffic prediction for proactive network management. However, recent Deep Learning (DL) driven studies have only exploited spatiotemporal features and have ignored the geographical correlations, causing high complexity and erroneous mobile traffic predictions. This paper addresses these limitations by proposing an enhanced mobile traffic prediction scheme that combines the clustering strategy of daily mobile traffic peak time and novel multi Temporal Convolutional Network with a Long Short Term Memory (multi TCN-LSTM) model. The mobile network cells that exhibit peak traffic during the same hour of the day are clustered together. Our experiments on large-scale real-world mobile traffic data show up to 28% performance improvement compared to state-of-the-art studies, which confirms the efficacy and viability of the proposed approach.

Submitted 11 December, 2023; originally announced December 2023.

Comments: 4 pages, 5 figures, 1 table. This paper has been accepted at the IEEE Consumer Communications & Networking Conference (CCNC) 2024
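For the mobile traffic prediction entry above, the clustering step (grouping cells that peak in the same hour of the day) can be sketched in a few lines. The data layout, a dictionary from cell id to a list of hourly traffic values, and the function names are assumptions made for illustration; the multi TCN-LSTM model itself is not shown.

```python
from collections import defaultdict


def peak_hour(hourly_traffic):
    """Index of the hour with the largest traffic volume (first one on ties)."""
    return max(range(len(hourly_traffic)), key=lambda h: hourly_traffic[h])


def cluster_by_peak_hour(cells):
    """Group cell ids whose daily traffic peaks in the same hour.

    cells: dict mapping a cell id to a list of hourly traffic values
           (24 values per day in practice; hypothetical layout).
    """
    clusters = defaultdict(list)
    for cell_id, profile in cells.items():
        clusters[peak_hour(profile)].append(cell_id)
    return dict(clusters)


if __name__ == "__main__":
    # Toy profiles with 4 "hours" instead of 24 to keep the example short.
    cells = {
        "cell-1": [5, 9, 3, 1],
        "cell-2": [2, 8, 4, 0],
        "cell-3": [1, 2, 3, 7],
    }
    print(cluster_by_peak_hour(cells))
```

Each resulting cluster could then be given its own predictor, which is one plausible reading of the "multi" in the model name.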

arXiv:2312.03180 [pdf, other]

Image reconstructions using sparse dictionary representations and implicit, non-negative mappings

Authors: Elizabeth Newman, Jack Michael Solomon, Matthias Chung

Abstract: Many imaging science tasks can be modeled as a discrete linear inverse problem. Solving linear inverse problems is often challenging, with ill-conditioned operators and potentially non-unique solutions. Embedding prior knowledge, such as smoothness, into the solution can overcome these challenges. In this work, we encode prior knowledge using a non-negative patch dictionary, which effectively learns a basis from a training set of natural images. In this dictionary basis, we desire solutions that are non-negative and sparse (i.e., contain many zero entries). With these constraints, standard methods for solving discrete linear inverse problems are not directly applicable. One such approach is the modified residual norm steepest descent (MRNSD), which produces non-negative solutions but does not induce sparsity. In this paper, we provide two methods based on MRNSD that promote sparsity. In our first method, we add an $\ell_1$-regularization term with a new, optimal step size. In our second method, we propose a new non-negative, sparsity-promoting mapping of the solution. We compare the performance of our proposed methods on a number of numerical experiments, including deblurring, image completion, computer tomography, and superresolution. Our results show that these methods effectively solve discrete linear inverse problems with non-negativity and sparsity constraints.

Submitted 5 December, 2023; originally announced December 2023.

Comments: 22 pages, 15 figures

MSC Class: 65F10; 65F22 ACM Class: G.1.3

arXiv:2312.03130 [pdf, other]

The DUNE Far Detector Vertical Drift Technology, Technical Design Report

Authors: DUNE Collaboration, A. Abed Abud, B. Abi, R. Acciarri, M. A. Acero, M. R. Adames, G. Adamov, M. Adamowski, D. Adams, M. Adinolfi, C. Adriano, A. Aduszkiewicz, J. Aguilar, B. Aimard, F. Akbar, K. Allison, S. Alonso Monsalve, M. Alrashed, A. Alton, R. Alvarez, H. Amar, P. Amedo, J. Anderson, D. A. Andrade, C. Andreopoulos, et al. (1304 additional authors not shown)

Abstract: DUNE is an international experiment dedicated to addressing some of the questions at the forefront of particle physics and astrophysics, including the mystifying preponderance of matter over antimatter in the early universe. The dual-site experiment will employ an intense neutrino beam focused on a near and a far detector as it aims to determine the neutrino mass hierarchy and to make high-precision measurements of the PMNS matrix parameters, including the CP-violating phase. It will also stand ready to observe supernova neutrino bursts, and seeks to observe nucleon decay as a signature of a grand unified theory underlying the standard model. The DUNE far detector implements liquid argon time-projection chamber (LArTPC) technology, and combines the many tens-of-kiloton fiducial mass necessary for rare event searches with the sub-centimeter spatial resolution required to image those events with high precision. The addition of a photon detection system enhances physics capabilities for all DUNE physics drivers and opens prospects for further physics explorations. Given its size, the far detector will be implemented as a set of modules, with LArTPC designs that differ from one another as newer technologies arise. In the vertical drift LArTPC design, a horizontal cathode bisects the detector, creating two stacked drift volumes in which ionization charges drift towards anodes at either the top or bottom. The anodes are composed of perforated PCB layers with conductive strips, enabling reconstruction in 3D. Light-trap-style photon detection modules are placed both on the cryostat's side walls and on the central cathode where they are optically powered. This Technical Design Report describes in detail the technical implementations of each subsystem of this LArTPC that, together with the other far detector modules and the near detector, will enable DUNE to achieve its physics goals.

Submitted 5 December, 2023; originally announced December 2023.

Comments: 425 pages; 281 figures. Central editing team: A. Heavey, S. Kettell, A. Marchionni, S. Palestini, S. Rajogopalan, R. J. Wilson

Report number: Fermilab Report no: TM-2813-LBNF

arXiv:2311.16646 [pdf, other]

Rethinking Backdoor Attacks on Dataset Distillation: A Kernel Method Perspective

Authors: Ming-Yu Chung, Sheng-Yen Chou, Chia-Mu Yu, Pin-Yu Chen, Sy-Yen Kuo, Tsung-Yi Ho

Abstract: Dataset distillation offers a potential means to enhance data efficiency in deep learning. Recent studies have shown its ability to counteract backdoor risks present in original training samples. In this study, we delve into the theoretical aspects of backdoor attacks and dataset distillation based on kernel methods. We introduce two new theory-driven trigger pattern generation methods specialized for dataset distillation. Following a comprehensive set of analyses and experiments, we show that our optimization-based trigger design framework informs effective backdoor attacks on dataset distillation. Notably, datasets poisoned by our designed trigger prove resilient against conventional backdoor attack detection and mitigation methods. Our empirical results validate that the triggers developed using our approaches are proficient at executing resilient backdoor attacks.

Submitted 28 November, 2023; originally announced November 2023.

Comments: 19 pages, 4 figures

arXiv:2311.11244 [pdf]

Micromechanical Origin of Heat Transfer to Granular Flow

Authors: Xintong Zhang, Sarath Adapa, Tianshi Feng, Jian Zeng, Ka Man Chung, Clifford Ho, Kevin Albrecht, Renkun Chen

Abstract: Heat transfer to a granular flow is comprised of two resistances in series: near the wall and within the bulk particle bed, neither of which is well understood due to the lack of experimental probes to separate their respective contribution. Here, we use a frequency modulated photothermal technique to separately quantify the thermal resistances in the near-wall and the bulk bed regions of particles in flowing states. Compared to the stationary state, the flowing leads to a higher near-wall resistance and a lower thermal conductivity of bulk beds. Coupled with discrete element method simulation, we show that the near-wall resistance can be explained by particle diffusion in granular flows.

Submitted 28 May, 2024; v1 submitted 19 November, 2023; originally announced November 2023.

Comments: 15 pages, 5 figures

arXiv:2310.05669 [pdf, other]

Transverse Emittance Reduction in Muon Beams by Ionization Cooling

Authors: The MICE Collaboration, M. Bogomilov, R. Tsenov, G. Vankova-Kirilova, Y. P. Song, J. Y. Tang, Z. H. Li, R. Bertoni, M. Bonesini, F. Chignoli, R. Mazza, A. de Bari, D. Orestano, L. Tortora, Y. Kuno, H. Sakamoto, A. Sato, S. Ishimoto, M. Chung, C. K. Sung, F. Filthaut, M. Fedorov, D. Jokovic, D. Maletic, M. Savic, et al. (112 additional authors not shown)

Abstract: Accelerated muon beams have been considered for next-generation studies of high-energy lepton-antilepton collisions and neutrino oscillations. However, high-brightness muon beams have not yet been produced. The main challenge for muon acceleration and storage stems from the large phase-space volume occupied by the beam, derived from the muon production mechanism through the decay of pions from proton collisions. Ionization cooling is the technique proposed to decrease the muon beam phase-space volume. Here we demonstrate a clear signal of ionization cooling through the observation of transverse emittance reduction in beams that traverse lithium hydride or liquid hydrogen absorbers in the Muon Ionization Cooling Experiment (MICE). The measurement is well reproduced by the simulation of the experiment and the theoretical model. The results shown here represent a substantial advance towards the realization of muon-based facilities that could operate at the energy and intensity frontiers.

Submitted 13 October, 2023; v1 submitted 9 October, 2023; originally announced October 2023.

Comments: 23 pages and 5 figures

Report number: STFC-P-2023-004

arXiv:2309.15121 [pdf]

Thermal Conductivity Measurement Using Modulated Photothermal Radiometry for Nitrate and Chloride Molten Salts

Authors: Ka Man Chung, Tianshi Feng, Jian Zeng, Sarath Reddy Adapa, Xintong Zhang, Andrew Z. Zhao, Ye Zhang, Peiwen Li, Youyang Zhao, Javier E. Garay, Renkun Chen

Abstract: Molten salts are being used or explored for thermal energy storage and conversion systems in concentrating solar power and nuclear power plants. Thermal conductivity of molten salts is an important thermophysical property dictating the performance and cost of these systems, but its accurate measurement has been challenging, as evidenced by wide scattering of existing data in literature. The corrosive and conducting nature of these fluids also leads to time-consuming sample preparation processes for many contact-based measurements. Here, we report the measurement of thermal conductivity of molten salts using a modulated photothermal radiometry (MPR) technique, which is a laser-based, non-contact, frequency-domain method adopted for molten salts for the first time. By utilizing the advantages of front side sensing of frequency-domain measurements and the vertical holder orientation, the technique can minimize the natural convection and salt creeping effects, thus yielding accurate molten salt thermal conductivity. The MPR technique is first calibrated using standard molten materials including paraffin wax and sulfur. It is then applied to measuring pure nitrate salts ($NaNO_3$ and $KNO_3$), solar salt ($NaNO_3-KNO_3$ mixture), and chloride salt ($NaCl-KCl-MgCl_2$). The measurement results are compared with data from literature, especially those obtained from laser flash analysis (LFA). Our results demonstrate that the MPR is a convenient and reliable technique for measuring thermal conductivity of molten salts. Accurate thermal conductivity data of molten salts will be valuable in developing the next-generation high-temperature thermal energy storage and conversion systems.

Submitted 31 August, 2023; originally announced September 2023.

arXiv:2309.00106 [pdf]

In-situ Thermophysical Measurement of Flowing Molten Chloride Salt Using Modulated Photothermal Radiometry

Authors: Ka Man Chung, Ye Zhang, Jian Zeng, Fouad Haddad, Sarath Reddy Adapa, Tianshi Feng, Peiwen Li, Renkun Chen

Abstract: Molten salts are a leading candidate for high-temperature heat transfer fluids (HTFs) for thermal energy storage and conversion systems in concentrated solar power (CSP) and nuclear energy power plants. The ability to probe molten salt thermal transport properties in both stationary and flowing status is important for the evaluation of their heat transfer performance under realistic operational conditions, including the temperature range and potential degradation due to corrosion and contamination. However, accurate thermal transport properties are usually challenging to obtain even for stagnant molten salts due to different sources of errors from convection, radiation, and corrosion, let alone flowing ones. To the best of the authors' knowledge, there is no available in-situ technique for measuring flowing molten salt thermal conductivity. Here, we report the first in-situ flowing molten salt thermal conductivity measurement using modulated photothermal radiometry (MPR). We could successfully perform the first in-situ thermal conductivity measurement of flowing molten $NaCl-KCl-MgCl_2$ in the typical operating temperature (520 and 580 $^\circ$C) with flow velocities ranging from around 0.3 to 1.0 m s$^{-1}$. The relative change of the molten salt thermal conductivity was measured. Gnielinski's correlation was also used to estimate the heat transfer coefficient h of the flowing $NaCl-KCl-MgCl_2$ in the given experimental condition. The work showed the potential of the MPR technique serving as an in-situ diagnostics tool to evaluate the heat transfer performance of flowing molten salts and other high-temperature HTFs.

Submitted 31 August, 2023; originally announced September 2023.

arXiv:2307.00385 [pdf, other]

Sulcal Pattern Matching with the Wasserstein Distance

Authors: Zijian Chen, Soumya Das, Moo K. Chung

Abstract: We present the unified computational framework for modeling the sulcal patterns of human brain obtained from the magnetic resonance images. The Wasserstein distance is used to align the sulcal patterns nonlinearly. These patterns are topologically different across subjects making the pattern matching a challenge. We work out the mathematical details and develop the gradient descent algorithms for estimating the deformation field. We further quantify the image registration performance. This method is applied in identifying the differences between male and female sulcal patterns.

Submitted 1 July, 2023; originally announced July 2023.

Comments: In press in IEEE ISBI

arXiv:2306.15801 [pdf, other]

doi 10.1140/epjc/s10052-023-12137-y

Production of antihydrogen atoms by 6 keV antiprotons through a positronium cloud

Authors: P. Adrich, P. Blumer, G. Caratsch, M. Chung, P. Cladé, P. Comini, P. Crivelli, O. Dalkarov, P. Debu, A. Douillet, D. Drapier, P. Froelich, N. Garroum, S. Guellati-Khelifa, J. Guyomard, P-A. Hervieux, L. Hilico, P. Indelicato, S. Jonsell, J-P. Karr, B. Kim, S. Kim, E-S. Kim, Y. J. Ko, T. Kosinski, et al. (39 additional authors not shown)

Abstract: We report on the first production of an antihydrogen beam by charge exchange of 6.1 keV antiprotons with a cloud of positronium in the GBAR experiment at CERN. The antiproton beam was delivered by the AD/ELENA facility. The positronium target was produced from a positron beam itself obtained from an electron linear accelerator. We observe an excess over background indicating antihydrogen production with a significance of 3-4 standard deviations.

Submitted 3 July, 2023; v1 submitted 27 June, 2023; originally announced June 2023.

Journal ref: European Physical Journal C 83, 1004 (2023)

arXiv:2306.10821 [pdf, other]

Comparison of L2 Korean pronunciation error patterns from five L1 backgrounds by using automatic phonetic transcription

Authors: Eun Jung Yeo, Hyungshin Ryu, Jooyoung Lee, Sunhee Kim, Minhwa Chung

Abstract: This paper presents a large-scale analysis of L2 Korean pronunciation error patterns from five different language backgrounds, Chinese, Vietnamese, Japanese, Thai, and English, by using automatic phonetic transcription. For the analysis, confusion matrices are generated for each L1, by aligning canonical phone sequences and automatically transcribed phone sequences obtained from fine-tuned Wav2Vec2 XLS-R phone recognizer. Each value in the confusion matrices is compared to capture frequent common error patterns and to specify patterns unique to a certain language background. Using the Foreign Speakers' Voice Data of Korean for Artificial Intelligence Learning dataset, common error pattern types are found to be (1) substitutions of aspirated or tense consonants with plain consonants, (2) deletions of syllable-final consonants, and (3) substitutions of diphthongs with monophthongs. On the other hand, thirty-nine patterns including (1) syllable-final /l/ substitutions with /n/ for Vietnamese and (2) /\textturnm/ insertions for Japanese are discovered as language-dependent.

Submitted 19 June, 2023; originally announced June 2023.

Comments: 5 pages, 2 figures, accepted to ICPhS 2023

arXiv:2306.06590 [pdf]

Mean-Variance Efficient Collaborative Filtering for Stock Recommendation

Authors: Munki Chung, Yongjae Lee, Woo Chang Kim

Abstract: The rise of FinTech has transformed financial services onto online platforms, yet stock investment recommender systems have received limited attention compared to other industries. Personalized stock recommendations can significantly impact customer engagement and satisfaction within the industry. However, traditional investment recommendations focus on high-return stocks or highly diversified portfolios based on the modern portfolio theory, often neglecting user preferences. On the other hand, collaborative filtering (CF) methods also may not be directly applicable to stock recommendations, because it is inappropriate to just recommend stocks that users like. The key is to optimally blend users' preferences with the portfolio theory. However, research on stock recommendations within the recommender system domain remains comparatively limited, and no existing model considers both the preference of users and the risk-return characteristics of stocks. In this regard, we propose a mean-variance efficient collaborative filtering (MVECF) model for stock recommendations that considers both aspects. Our model is specifically designed to improve the Pareto optimality (mean-variance efficiency) in a trade-off between the risk (variance of return) and return (mean return) by systemically handling uncertainties in stock prices. Such improvements are incorporated into the MVECF model using regularization, and the model is restructured to fit into the ordinary matrix factorization scheme to boost computational efficiency. Experiments on real-world fund holdings data show that our model can increase the mean-variance efficiency of suggested portfolios while sacrificing just a small amount of mean average precision and recall. Finally, we further show MVECF is easily applicable to the state-of-the-art graph-based ranking models.

Submitted 11 June, 2023; originally announced June 2023.

Comments: 12 pages, 4 figures, preprint, under review

arXiv:2305.18392 [pdf, other]

Speech Intelligibility Assessment of Dysarthric Speech by using Goodness of Pronunciation with Uncertainty Quantification

Authors: Eun Jung Yeo, Kwanghee Choi, Sunhee Kim, Minhwa Chung

Abstract: This paper proposes an improved Goodness of Pronunciation (GoP) that utilizes Uncertainty Quantification (UQ) for automatic speech intelligibility assessment for dysarthric speech. Current GoP methods rely heavily on neural network-driven overconfident predictions, which is unsuitable for assessing dysarthric speech due to its significant acoustic differences from healthy speech. To alleviate the problem, UQ techniques were used on GoP by 1) normalizing the phoneme prediction (entropy, margin, maxlogit, logit-margin) and 2) modifying the scoring function (scaling, prior normalization). As a result, prior-normalized maxlogit GoP achieves the best performance, with a relative increase of 5.66%, 3.91%, and 23.65% compared to the baseline GoP for English, Korean, and Tamil, respectively. Furthermore, phoneme analysis is conducted to identify which phoneme scores significantly correlate with intelligibility scores in each language.

Submitted 28 May, 2023; originally announced May 2023.

Comments: Accepted to Interspeech 2023

arXiv:2305.18277 [pdf, other]

3DTeethSeg'22: 3D Teeth Scan Segmentation and Labeling Challenge

Authors: Achraf Ben-Hamadou, Oussama Smaoui, Ahmed Rekik, Sergi Pujades, Edmond Boyer, Hoyeon Lim, Minchang Kim, Minkyung Lee, Minyoung Chung, Yeong-Gil Shin, Mathieu Leclercq, Lucia Cevidanes, Juan Carlos Prieto, Shaojie Zhuang, Guangshun Wei, Zhiming Cui, Yuanfeng Zhou, Tudor Dascalu, Bulat Ibragimov, Tae-Hoon Yong, Hong-Gi Ahn, Wan Kim, Jae-Hwan Han, Byungsun Choi, Niels van Nistelrooij, et al. (7 additional authors not shown)

Abstract: Teeth localization, segmentation, and labeling from intra-oral 3D scans are essential tasks in modern dentistry to enhance dental diagnostics, treatment planning, and population-based studies on oral health. However, developing automated algorithms for teeth analysis presents significant challenges due to variations in dental anatomy, imaging protocols, and limited availability of publicly accessible data. To address these challenges, the 3DTeethSeg'22 challenge was organized in conjunction with the International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI) in 2022, with a call for algorithms tackling teeth localization, segmentation, and labeling from intraoral 3D scans. A dataset comprising a total of 1800 scans from 900 patients was prepared, and each tooth was individually annotated by a human-machine hybrid algorithm. A total of 6 algorithms were evaluated on this dataset. In this study, we present the evaluation results of the 3DTeethSeg'22 challenge. The 3DTeethSeg'22 challenge code can be accessed at: https://github.com/abenhamadou/3DTeethSeg22_challenge

Submitted 29 May, 2023; originally announced May 2023.

Comments: 29 pages, MICCAI 2022 Singapore, Satellite Event, Challenge

arXiv:2305.16942 [pdf, other]

Multipolar Pseudochirality Induced Optical Torque

Authors: Karim Achouri, Mintae Chung, Andrei Kiselev, Olivier J. F. Martin

Abstract: It has been observed that achiral nano-particles, such as flat helices, may be subjected to an optical torque even when illuminated by normally incident linearly polarized light. However, the origin of this fascinating phenomenon has so far remained mostly unexplained. We therefore propose an exhaustive discussion that provides a clear and rigorous explanation for the existence of such a torque. Using multipolar theory, and taking into account nonlocal interactions, we find that this torque stems from multipolar pseudochiral responses that generate both spin and orbital angular momenta. We also show that the nature of these peculiar responses makes them particularly dependent on the asymmetry of the particles. By elucidating the origin of this type of torque, this work may prove instrumental for the design of high-performance nano-rotors.

Submitted 26 May, 2023; originally announced May 2023.

arXiv:2305.13048 [pdf, other]

RWKV: Reinventing RNNs for the Transformer Era

Authors: Bo Peng, Eric Alcaide, Quentin Anthony, Alon Albalak, Samuel Arcadinho, Stella Biderman, Huanqi Cao, Xin Cheng, Michael Chung, Matteo Grella, Kranthi Kiran GV, Xuzheng He, Haowen Hou, Jiaju Lin, Przemyslaw Kazienko, Jan Kocon, Jiaming Kong, Bartlomiej Koptyra, Hayden Lau, Krishna Sri Ipsit Mantri, Ferdinand Mom, Atsushi Saito, Guangyu Song, Xiangru Tang, Bolun Wang, et al. (9 additional authors not shown)

Abstract: Transformers have revolutionized almost all natural language processing (NLP) tasks but suffer from memory and computational complexity that scales quadratically with sequence length. In contrast, recurrent neural networks (RNNs) exhibit linear scaling in memory and computational requirements but struggle to match the same performance as Transformers due to limitations in parallelization and scalability. We propose a novel model architecture, Receptance Weighted Key Value (RWKV), that combines the efficient parallelizable training of transformers with the efficient inference of RNNs. Our approach leverages a linear attention mechanism and allows us to formulate the model as either a Transformer or an RNN, thus parallelizing computations during training and maintaining constant computational and memory complexity during inference. We scale our models as large as 14 billion parameters, by far the largest dense RNN ever trained, and find RWKV performs on par with similarly sized Transformers, suggesting future work can leverage this architecture to create more efficient models. This work presents a significant step towards reconciling trade-offs between computational efficiency and model performance in sequence processing tasks.

Submitted 10 December, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

arXiv:2305.08002 [pdf, ps, other]

doi 10.1016/j.phycom.2023.102108

Proportional Fair Scheduling Using Water-Filling Technique for SC-FDMA Based D2D Communication

Authors: Syed Tariq Shah, Jaheon Gu, Syed Faraz Hasan, Min Young Chung

Abstract: The resource allocation in SC-FDMA is constrained by the condition that multiple subchannels should be allocated to a single user only if they are adjacent. Therefore, the scheduling scheme of a D2D-cellular system that uses SC-FDMA must also conform to the so-called adjacency constraint. This paper proposes a heuristic algorithm with low computational complexity that applies proportional fair (PF) scheduling in the D2D-cellular system. The proposed algorithm consists of two main phases: i) subchannel allocation and ii) adjustment of data rates, which are executed for both CUEs and DUEs. In the subchannel allocation phase for CUEs (or D2D pairs), the users' data rates are maximized via optimal power allocation to frequency-contiguous subchannels. In the second phase, a PF scheduling problem is solved to decide the modulation and coding scheme (MCS) of both CUEs and D2D pairs. Both phases of the proposed algorithm benefit from the Water-Filling (WF) technique. The simulation results suggest that the proposed scheme performs similarly to optimal PF scheduling from the perspective of users' data rate and their logarithmic sum. An additional benefit of the proposed scheme is its low computational overhead.

Submitted 2 June, 2023; v1 submitted 13 May, 2023; originally announced May 2023.

arXiv:2304.08324 [pdf, other]

Goal-oriented Uncertainty Quantification for Inverse Problems via Variational Encoder-Decoder Networks

Authors: Babak Maboudi Afkham, Julianne Chung, Matthias Chung

Abstract: In this work, we describe a new approach that uses variational encoder-decoder (VED) networks for efficient goal-oriented uncertainty quantification for inverse problems. Contrary to standard inverse problems, these approaches are goal-oriented in that the goal is to estimate some quantities of interest (QoI) that are functions of the solution of an inverse problem, rather than the solution itself. Moreover, we are interested in computing uncertainty metrics associated with the QoI, thus utilizing a Bayesian approach for inverse problems that incorporates the prediction operator and techniques for exploring the posterior. This may be particularly challenging, especially for nonlinear, possibly unknown, operators and nonstandard prior assumptions. We harness recent advances in machine learning, i.e., VED networks, to describe a data-driven approach to large-scale inverse problems. This enables a real-time goal-oriented uncertainty quantification for the QoI. One of the advantages of our approach is that we avoid the need to solve challenging inversion problems by training a network to approximate the mapping from observations to QoI. Another main benefit is that we enable uncertainty quantification for the QoI by leveraging probability distributions in the latent space. This allows us to efficiently generate QoI samples and circumvent complicated or even unknown forward models and prediction operators. Numerical results from medical tomography reconstruction and nonlinear hydraulic tomography demonstrate the potential and broad applicability of the approach.

Submitted 29 September, 2023; v1 submitted 17 April, 2023; originally announced April 2023.

Comments: 28 pages, 13 figures

MSC Class: 15A29; 6208; 68U07

arXiv:2304.05912 [pdf, other]

PH-STAT

Authors: Moo K. Chung

Abstract: We introduce PH-STAT, a comprehensive Matlab toolbox designed for performing a wide range of statistical inferences on persistent homology. Persistent homology is a prominent tool in topological data analysis (TDA) that captures the underlying topological features of complex data sets. The toolbox aims to provide users with an accessible and user-friendly interface for analyzing and interpreting topological data. The package is distributed in https://github.com/laplcebeltrami/PH-STAT.

Submitted 12 April, 2023; originally announced April 2023.

Comments: arXiv admin note: text overlap with arXiv:2302.06673

arXiv:2304.05908 [pdf, other]

Altered Topological Structure of the Brain White Matter in Maltreated Children through Topological Data Analysis

Authors: Moo K. Chung, Tahmineh Azizi, Jamie L. Hanson, Andrew L. Alexander, Richard J. Davidson, Seth D. Pollak

Abstract: Childhood maltreatment may adversely affect brain development and consequently influence behavioral, emotional, and psychological patterns during adulthood. In this study, we propose an analytical pipeline for modeling the altered topological structure of brain white matter in maltreated and typically developing children. We perform topological data analysis (TDA) to assess the alteration in the global topology of the brain white-matter structural covariance network among children. We use persistent homology, an algebraic technique in TDA, to analyze topological features in the brain covariance networks constructed from structural magnetic resonance imaging (MRI) and diffusion tensor imaging (DTI). We develop a novel framework for statistical inference based on the Wasserstein distance to assess the significance of the observed topological differences. Using these methods in comparing maltreated children to a typically developing control group, we find that maltreatment may increase homogeneity in white matter structures and thus induce higher correlations in the structural covariance; this is reflected in the topological profile. Our findings strongly suggest that TDA can be a valuable framework to model altered topological structures of the brain. The MATLAB codes and processed data used in this study can be found at https://github.com/laplcebeltrami/maltreated.

Submitted 14 November, 2023; v1 submitted 12 April, 2023; originally announced April 2023.

arXiv:2303.17007 [pdf]

doi 10.1103/PhysRevD.107.112012

Impact of cross-section uncertainties on supernova neutrino spectral parameter fitting in the Deep Underground Neutrino Experiment

Authors: DUNE Collaboration, A. Abed Abud, B. Abi, R. Acciarri, M. A. Acero, M. R. Adames, G. Adamov, M. Adamowski, D. Adams, M. Adinolfi, C. Adriano, A. Aduszkiewicz, J. Aguilar, Z. Ahmad, J. Ahmed, B. Aimard, F. Akbar, K. Allison, S. Alonso Monsalve, M. Alrashed, A. Alton, R. Alvarez, P. Amedo, J. Anderson, D. A. Andrade, et al. (1294 additional authors not shown)

Abstract: A primary goal of the upcoming Deep Underground Neutrino Experiment (DUNE) is to measure the $\mathcal{O}(10)$ MeV neutrinos produced by a Galactic core-collapse supernova if one should occur during the lifetime of the experiment. The liquid-argon-based detectors planned for DUNE are expected to be uniquely sensitive to the $ν_e$ component of the supernova flux, enabling a wide variety of physics and astrophysics measurements. A key requirement for a correct interpretation of these measurements is a good understanding of the energy-dependent total cross section $σ(E_ν)$ for charged-current $ν_e$ absorption on argon. In the context of a simulated extraction of supernova $ν_e$ spectral parameters from a toy analysis, we investigate the impact of $σ(E_ν)$ modeling uncertainties on DUNE's supernova neutrino physics sensitivity for the first time. We find that the currently large theoretical uncertainties on $σ(E_ν)$ must be substantially reduced before the $ν_e$ flux parameters can be extracted reliably: in the absence of external constraints, a measurement of the integrated neutrino luminosity with less than 10% bias with DUNE requires $σ(E_ν)$ to be known to about 5%. The neutrino spectral shape parameters can be known to better than 10% for a 20% uncertainty on the cross-section scale, although they will be sensitive to uncertainties on the shape of $σ(E_ν)$. A direct measurement of low-energy $ν_e$-argon scattering would be invaluable for improving the theoretical precision to the needed level.

Submitted 7 July, 2023; v1 submitted 29 March, 2023; originally announced March 2023.

Comments: 25 pages, 21 figures

Report number: FERMILAB-PUB-23-132-CSAID-LBNF-ND-T

Journal ref: Phys. Rev. D 107, 112012 (2023)

arXiv:2303.12053 [pdf]

Tomography Scan of Charge Density Wave in NbSe2

Authors: Jyun-Yu Wu, Yung-Ting Lee, Guan-Hao Chen, Zheng-Hong Li, Chang-Tsan Lee, Jie-Yu Hsu, Chia-Nung Kuo, Juhn-Jong Lin, Wen-Hao Chang, Chin-Shan Lue, Po-Tuan Cheng, Cheng-Tien Chiang, Chien-Cheng Kuo, Chien-Te Wu, Chi-Cheng Lee, Ming-Chiang Chung, Hung-Chung Hsueh, Chun-Liang Lin

Abstract: A charge density wave (CDW) resulting from a small distortion in the lattice can create new orders beyond the original lattice. In 2H-NbSe2, one of the layered transition metal dichalcogenides (TMD), the 3x3 charge order appears in two-dimensional (2D) layers. Although a CDW is usually described by a sine wave, its spatial distribution within a 2D layer has never been systematically visualized. Here, using scanning tunneling microscopy (STM) and density functional theory (DFT), we have monitored the evolution of the 3x3 CDW along the c-axis and realized a nearly tomographic scan of the CDW of the topmost layer. The results show that the strength of the 3x3 charge order varies as the tunneling current increases. The 3x3 charge order is relatively strong at the outermost Se level and decreases when probing between the Se and Nb levels. Interestingly, the 3x3 charge order becomes strong again on reaching the Nb level, but with a phase shift. We further calculated the orbital charge distributions and found that both the CDW intensity modulation and the phase shift are strongly correlated with the distribution of Se p orbitals and Nb d orbitals.

Submitted 21 March, 2023; originally announced March 2023.

Comments: 12 pages, 4 figures

arXiv:2303.11279 [pdf, other]

doi 10.1109/ICRA46639.2022.9811762

Distributed Timed Elastic Band (DTEB) Planner: Trajectory Sharing and Collision Prediction for Multi-Robot Systems

Authors: Yiu Ming Chung, Hazem Youssef, Moritz Roidl

Abstract: Autonomous navigation of mobile robots is a well studied problem in robotics. However, the navigation task becomes challenging when multi-robot systems have to cooperatively navigate dynamic environments with deadlock-prone layouts. We present a Distributed Timed Elastic Band (DTEB) Planner that combines Prioritized Planning with the online TEB trajectory Planner, in order to extend the capabilities of the latter to multi-robot systems. The proposed planner is able to reactively avoid imminent collisions as well as predictively resolve potential deadlocks among a team of robots, while navigating in a complex environment. The results of our simulation demonstrate the reliable performance and the versatility of the planner in different environment settings. The code and tests for our approach are available online.

Submitted 20 March, 2023; originally announced March 2023.

Comments: Published in the International Conference on Robotics and Automation (ICRA) - 2022 https://ieeexplore.ieee.org/document/9811762

Journal ref: ICRA (2022) pp. 10702-10708

arXiv:2303.00870 [pdf, other]

Implementing Active Learning in Cybersecurity: Detecting Anomalies in Redacted Emails

Authors: Mu-Huan Chung, Lu Wang, Sharon Li, Yuhong Yang, Calvin Giang, Khilan Jerath, Abhay Raman, David Lie, Mark Chignell

Abstract: Research on email anomaly detection has typically relied on specially prepared datasets that may not adequately reflect the type of data that occurs in industry settings. In our research, at a major financial services company, privacy concerns prevented inspection of the bodies of emails and attachment details (although subject headings and attachment filenames were available). This made labeling possible anomalies in the resulting redacted emails more difficult. Another source of difficulty is the high volume of emails combined with the scarcity of resources making machine learning (ML) a necessity, but also creating a need for more efficient human training of ML models. Active learning (AL) has been proposed as a way to make human training of ML models more efficient. However, the implementation of Active Learning methods is a human-centered AI challenge due to potential human analyst uncertainty, and the labeling task can be further complicated in domains such as the cybersecurity domain (or healthcare, aviation, etc.) where mistakes in labeling can have highly adverse consequences. In this paper we present research results concerning the application of Active Learning to anomaly detection in redacted emails, comparing the utility of different methods for implementing active learning in this context. We evaluate different AL strategies and their impact on resulting model performance. We also examine how ratings of confidence that experts have in their labels can inform AL. The results obtained are discussed in terms of their implications for AL methodology and for the role of experts in model-assisted email anomaly screening.

Submitted 2 March, 2023; v1 submitted 1 March, 2023; originally announced March 2023.

arXiv:2302.09323 [pdf, other]

Heterogeneous Graph Convolutional Neural Network via Hodge-Laplacian for Brain Functional Data

Authors: Jinghan Huang, Moo K. Chung, Anqi Qiu

Abstract: This study proposes a novel heterogeneous graph convolutional neural network (HGCNN) to handle complex brain fMRI data at regional and across-region levels. We introduce a generic formulation of spectral filters on heterogeneous graphs by introducing the $k$-th Hodge-Laplacian (HL) operator. In particular, we propose Laguerre polynomial approximations of HL spectral filters and prove that their spatial localization on graphs is related to the polynomial order. Furthermore, based on the bijection property of boundary operators on simplex graphs, we introduce a generic topological graph pooling (TGPool) method that can be used at any dimensional simplices. This study designs HL-node, HL-edge, and HL-HGCNN neural networks to learn signal representation at a graph node, edge levels, and both, respectively. Our experiments employ fMRI from the Adolescent Brain Cognitive Development (ABCD; n=7693) to predict general intelligence. Our results demonstrate the advantage of the HL-edge network over the HL-node network when functional brain connectivity is considered as features. The HL-HGCNN outperforms the state-of-the-art graph neural networks (GNNs) approaches, such as GAT, BrainGNN, dGCN, BrainNetCNN, and Hypergraph NN. The functional connectivity features learned from the HL-HGCNN are meaningful in interpreting neural circuits related to general intelligence.

Submitted 18 February, 2023; originally announced February 2023.

Journal ref: IPMI 2023

arXiv:2302.06673 [pdf, other]

Unified Topological Inference for Brain Networks in Temporal Lobe Epilepsy Using the Wasserstein Distance

Authors: Moo K. Chung, Camille Garcia Ramos, Felipe Branco De Paiva, Jedidiah Mathis, Vivek Prabharakaren, Veena A. Nair, Elizabeth Meyerand, Bruce P. Hermann, Jeffrey R. Binder, Aaron F. Struck

Abstract: Persistent homology offers a powerful tool for extracting hidden topological signals from brain networks. It captures the evolution of topological structures across multiple scales, known as filtrations, thereby revealing topological features that persist over these scales. These features are summarized in persistence diagrams, and their dissimilarity is quantified using the Wasserstein distance. However, the Wasserstein distance does not follow a known distribution, posing challenges for the application of existing parametric statistical models. To tackle this issue, we introduce a unified topological inference framework centered on the Wasserstein distance. Our approach has no explicit model and distributional assumptions. The inference is performed in a completely data-driven fashion. We apply this method to resting-state functional magnetic resonance images (rs-fMRI) of temporal lobe epilepsy patients collected from two different sites: the University of Wisconsin-Madison and the Medical College of Wisconsin. Importantly, our topological method is robust to variations due to sex and image acquisition, obviating the need to account for these variables as nuisance covariates. We successfully localize the brain regions that contribute the most to topological differences. A MATLAB package used for all analyses in this study is available at https://github.com/laplcebeltrami/PH-STAT.

Submitted 20 September, 2023; v1 submitted 13 February, 2023; originally announced February 2023.

Comments: arXiv admin note: text overlap with arXiv:2201.00087

arXiv:2301.06669 [pdf, other]

doi 10.1103/PhysRevB.107.214451

Deep Learning of Phase Transitions for Quantum Spin Chains from Correlation Aspects

Authors: Ming-Chiang Chung, Guang-Yu Huang, Ian P. McCulloch, Yuan-Hong Tsai

Abstract: Using machine learning (ML) to recognize different phases of matter and to infer the entire phase diagram has proven to be an effective tool given a large dataset. In our previous proposals, we have successfully explored phase transitions for topological phases of matter at low dimensions in either a supervised or an unsupervised learning protocol with the assistance of quantum-information-related quantities. In this work, we adapt our previous ML procedures to study quantum phase transitions of magnetic systems such as the XY and XXZ spin chains by using spin-spin correlation functions as the input data. We find that our proposed approach not only maps out the phase diagrams with accurate phase boundaries, but also indicates some new features that have not been observed before. In particular, we define so-called relevant correlation functions for some corresponding phases that can always distinguish between those phases and their neighbors. Based on the unsupervised learning protocol we proposed [Phys. Rev. B 104, 165108 (2021)], the reduced latent representations of the inputs combined with the clustering algorithm show the connectedness or disconnectedness between neighboring clusters (phases), corresponding to a continuous or abrupt quantum phase transition, respectively.

Submitted 9 May, 2023; v1 submitted 16 January, 2023; originally announced January 2023.

Comments: 18 pages, 21 figures

arXiv:2212.09807 [pdf, other]

Highly-parallelized simulation of a pixelated LArTPC on a GPU

Authors: DUNE Collaboration, A. Abed Abud, B. Abi, R. Acciarri, M. A. Acero, M. R. Adames, G. Adamov, M. Adamowski, D. Adams, M. Adinolfi, C. Adriano, A. Aduszkiewicz, J. Aguilar, Z. Ahmad, J. Ahmed, B. Aimard, F. Akbar, K. Allison, S. Alonso Monsalve, M. Alrashed, C. Alt, A. Alton, R. Alvarez, P. Amedo, J. Anderson, et al. (1282 additional authors not shown)

Abstract: The rapid development of general-purpose computing on graphics processing units (GPGPU) is allowing the implementation of highly-parallelized Monte Carlo simulation chains for particle physics experiments. This technique is particularly suitable for the simulation of a pixelated charge readout for time projection chambers, given the large number of channels that this technology employs. Here we present the first implementation of a full microphysical simulator of a liquid argon time projection chamber (LArTPC) equipped with light readout and pixelated charge readout, developed for the DUNE Near Detector. The software is implemented with an end-to-end set of GPU-optimized algorithms. The algorithms have been written in Python and translated into CUDA kernels using Numba, a just-in-time compiler for a subset of Python and NumPy instructions. The GPU implementation achieves a speed up of four orders of magnitude compared with the equivalent CPU version. The simulation of the current induced on $10^3$ pixels takes around 1 ms on the GPU, compared with approximately 10 s on the CPU. The results of the simulation are compared against data from a pixel-readout LArTPC prototype.

Submitted 28 February, 2023; v1 submitted 19 December, 2022; originally announced December 2022.

Comments: 26 pages, 15 figures

Report number: FERMILAB-PUB-22-926-LBNF

arXiv:2211.15950 [pdf, other]

Enhanced artificial intelligence-based diagnosis using CBCT with internal denoising: Clinical validation for discrimination of fungal ball, sinusitis, and normal cases in the maxillary sinus

Authors: Kyungsu Kim, Chae Yeon Lim, Joong Bo Shin, Myung Jin Chung, Yong Gi Jung

Abstract: The cone-beam computed tomography (CBCT) provides 3D volumetric imaging of a target with low radiation dose and cost compared with conventional computed tomography, and it is widely used in the detection of paranasal sinus disease. However, it lacks the sensitivity to detect soft tissue lesions owing to reconstruction constraints. Consequently, only physicians with expertise in CBCT reading can distinguish between inherent artifacts or noise and diseases, restricting the use of this imaging modality. The development of artificial intelligence (AI)-based computer-aided diagnosis methods for CBCT to overcome the shortage of experienced physicians has attracted substantial attention. However, advanced AI-based diagnosis addressing intrinsic noise in CBCT has not been devised, discouraging the practical use of AI solutions for CBCT. To address this issue, we propose an AI-based computer-aided diagnosis method using CBCT with a denoising module. This module is implemented before diagnosis to reconstruct the internal ground-truth full-dose scan corresponding to an input CBCT image and thereby improve the diagnostic performance. The external validation results for the unified diagnosis of sinus fungal ball, chronic rhinosinusitis, and normal cases show that the proposed method improves the micro-, macro-average AUC, and accuracy by 7.4, 5.6, and 9.6% (from 86.2, 87.0, and 73.4 to 93.6, 92.6, and 83.0%), respectively, compared with a baseline while improving human diagnosis accuracy by 11% (from 71.7 to 83.0%), demonstrating technical differentiation and clinical effectiveness. This pioneering study on AI-based diagnosis using CBCT indicates denoising can improve diagnostic performance and reader interpretability in images from the sinonasal area, thereby providing a new approach and direction to radiographic image reconstruction regarding the development of AI-based diagnostic solutions.

Submitted 29 November, 2022; originally announced November 2022.

arXiv:2211.15653 [pdf, other]

doi 10.1007/s11214-023-00984-w

Energetic electron precipitation driven by electromagnetic ion cyclotron waves from ELFIN's low altitude perspective

Authors: V. Angelopoulos, X.-J. Zhang, A. V. Artemyev, D. Mourenas, E. Tsai, C. Wilkins, A. Runov, J. Liu, D. L. Turner, W. Li, K. Khurana, R. E. Wirz, V. A. Sergeev, X. Meng, J. Wu, M. D. Hartinger, T. Raita, Y. Shen, X. An, X. Shi, M. F. Bashir, X. Shen, L. Gan, M. Qin, L. Capannolo, et al. (61 additional authors not shown)

Abstract: We review comprehensive observations of electromagnetic ion cyclotron (EMIC) wave-driven energetic electron precipitation using data from the energetic electron detector on the Electron Losses and Fields InvestigatioN (ELFIN) mission, two polar-orbiting low-altitude spinning CubeSats, measuring 50-5000 keV electrons with good pitch-angle and energy resolution. EMIC wave-driven precipitation exhibits a distinct signature in energy-spectrograms of the precipitating-to-trapped flux ratio: peaks at 0.5 MeV which are abrupt (bursty) with significant substructure (occasionally down to sub-second timescale). Multiple ELFIN passes over the same MLT sector allow us to study the spatial and temporal evolution of the EMIC wave - electron interaction region. Using two years of ELFIN data, we assemble a statistical database of 50 events of strong EMIC wave-driven precipitation. Most reside at L=5-7 at dusk, while a smaller subset exists at L=8-12 at post-midnight. The energies of the peak-precipitation ratio and of the half-peak precipitation ratio (our proxy for the minimum resonance energy) exhibit an L-shell dependence in good agreement with theoretical estimates based on prior statistical observations of EMIC wave power spectra. The precipitation ratio's spectral shape for the most intense events has an exponential falloff away from the peak (i.e., on either side of 1.45 MeV). It too agrees well with quasi-linear diffusion theory based on prior statistics of wave spectra. Sub-MeV electron precipitation observed concurrently with strong EMIC wave-driven 1MeV precipitation has a spectral shape that is consistent with efficient pitch-angle scattering down to 200-300 keV by much less intense higher frequency EMIC waves. These results confirm the critical role of EMIC waves in driving relativistic electron losses. Nonlinear effects may abound and require further investigation.

Submitted 28 November, 2022; originally announced November 2022.

arXiv:2211.10542 [pdf, other]

Hodge-Decomposition of Brain Networks

Authors: D. Vijay Anand, Moo K. Chung

Abstract: We analyze brain networks by decomposing them into three orthogonal components: gradient, curl, and harmonic flows, through the Hodge decomposition, a technique advantageous for capturing complex topological features. A Wasserstein distance based topological inference is developed to determine the statistical significance of each component. The Hodge decomposition is applied to human brain networks obtained from a resting-state fMRI study. Our results indicate statistically significant differences in the topological features between male and female brain networks.

Submitted 1 April, 2024; v1 submitted 18 November, 2022; originally announced November 2022.

Comments: Will be published in ISBI 2024

arXiv:2211.01705 [pdf]

A speech corpus for chronic kidney disease

Authors: Jihyun Mun, Sunhee Kim, Myeong Ju Kim, Jiwon Ryu, Sejoong Kim, Minhwa Chung

Abstract: In this study, we present a speech corpus of patients with chronic kidney disease (CKD) that will be used for research on pathological voice analysis, automatic illness identification, and severity prediction. This paper introduces the steps involved in creating this corpus, including the choice of speech-related parameters and speech lists as well as the recording technique. The speakers in this corpus, 289 CKD patients with varying degrees of severity who were categorized based on estimated glomerular filtration rate (eGFR), delivered sustained vowels, sentence, and paragraph stimuli. This study compared and analyzed the voice characteristics of CKD patients with those of the control group; the results revealed differences in voice quality, phoneme-level pronunciation, prosody, glottal source, and aerodynamic parameters.

Submitted 3 November, 2022; originally announced November 2022.

arXiv:2211.01203 [pdf]

Revealing the Charge Density Wave caused by Peierls instability in two-dimensional NbSe$_{2}$

Authors: Yung-Ting Lee, Po-Tuan Chen, Zheng-Hong Li, Jyun-Yu Wu, Chia-Nung Kuo, Chin-Shan Lue, Chien-Te Wu, Chien-Cheng Kuo, Cheng-Tien Chiang, Chun-Liang Lin, Chi-Cheng Lee, Hung-Chung Hsueh, Ming-Chiang Chung

Abstract: The formation of a charge density wave (CDW) in two-dimensional (2D) materials caused by Peierls instability is a controversial topic. This study investigates the extensively debated role of Fermi surface nesting in causing the CDW state in 2H-NbSe$_{2}$ materials. Four NbSe$_{2}$ structures (i.e., normal, stripe, filled, and hollow structures) are identified on the basis of the characteristics in scanning tunneling microscopy images and first-principles simulations. The calculations reveal that the filled phase corresponds to Peierls' description; that is, it exhibits fully opened gaps at the CDW Brillouin zone boundary, resulting in a drop at the Fermi level in the density of states and the scanning tunneling spectroscopy spectra. The electronic susceptibility and phonon instability in the normal phase indicate that the Fermi surface nesting is triggered by two nesting vectors, whereas the involvement of only one nesting vector leads to the stripe phase. This comprehensive study demonstrates that the filled phase of NbSe$_{2}$ can be categorized as a Peierls-instability-induced CDW in 2D systems.

Submitted 14 November, 2022; v1 submitted 2 November, 2022; originally announced November 2022.

Comments: 4 figures

arXiv:2211.01166 [pdf, other]

Identification and reconstruction of low-energy electrons in the ProtoDUNE-SP detector

Authors: DUNE Collaboration, A. Abed Abud, B. Abi, R. Acciarri, M. A. Acero, M. R. Adames, G. Adamov, M. Adamowski, D. Adams, M. Adinolfi, C. Adriano, A. Aduszkiewicz, J. Aguilar, Z. Ahmad, J. Ahmed, B. Aimard, F. Akbar, K. Allison, S. Alonso Monsalve, M. Alrashed, C. Alt, A. Alton, R. Alvarez, P. Amedo, J. Anderson , et al. (1235 additional authors not shown)

Abstract: Measurements of electrons from $ν_e$ interactions are crucial for the Deep Underground Neutrino Experiment (DUNE) neutrino oscillation program, as well as searches for physics beyond the standard model, supernova neutrino detection, and solar neutrino measurements. This article describes the selection and reconstruction of low-energy (Michel) electrons in the ProtoDUNE-SP detector. ProtoDUNE-SP is one of the prototypes for the DUNE far detector, built and operated at CERN as a charged particle test beam experiment. A sample of low-energy electrons produced by the decay of cosmic muons is selected with a purity of 95%. This sample is used to calibrate the low-energy electron energy scale with two techniques. An electron energy calibration based on a cosmic ray muon sample uses calibration constants derived from measured and simulated cosmic ray muon events. Another calibration technique makes use of the theoretically well-understood Michel electron energy spectrum to convert reconstructed charge to electron energy. In addition, the effects of detector response to low-energy electron energy scale and its resolution including readout electronics threshold effects are quantified. Finally, the relation between the theoretical and reconstructed low-energy electron energy spectrum is derived and the energy resolution is characterized. The low-energy electron selection presented here accounts for about 75% of the total electron deposited energy. After the addition of lost energy using a Monte Carlo simulation, the energy resolution improves from about 40% to 25% at 50 MeV. These results are used to validate the expected capabilities of the DUNE far detector to reconstruct low-energy electrons.

Submitted 31 May, 2023; v1 submitted 2 November, 2022; originally announced November 2022.

Comments: 19 pages, 10 figures

Report number: FERMILAB-PUB-22-784, CERN-EP-DRAFT-MISC-2022-008

Journal ref: Phys. Rev. D 107, 092012 (2023)

arXiv:2210.15387 [pdf, other]

Automatic Severity Classification of Dysarthric speech by using Self-supervised Model with Multi-task Learning

Authors: Eun Jung Yeo, Kwanghee Choi, Sunhee Kim, Minhwa Chung

Abstract: Automatic assessment of dysarthric speech is essential for sustained treatments and rehabilitation. However, obtaining atypical speech is challenging, often leading to data scarcity issues. To tackle the problem, we propose a novel automatic severity assessment method for dysarthric speech, using the self-supervised model in conjunction with multi-task learning. Wav2vec 2.0 XLS-R is jointly trained for two different tasks: severity classification and auxiliary automatic speech recognition (ASR). For the baseline experiments, we employ hand-crafted acoustic features and machine learning classifiers such as SVM, MLP, and XGBoost. Explored on the Korean dysarthric speech QoLT database, our model outperforms the traditional baseline methods, with a relative percentage increase of 1.25% for F1-score. In addition, the proposed model surpasses the model trained without ASR head, achieving 10.61% relative percentage improvements. Furthermore, we present how multi-task learning affects the severity classification performance by analyzing the latent representations and regularization effect.

Submitted 28 April, 2023; v1 submitted 27 October, 2022; originally announced October 2022.

Comments: Accepted to ICASSP 2023

arXiv:2210.09092 [pdf, other]

Dynamic Topological Data Analysis of Functional Human Brain Networks

Authors: Moo K. Chung, Soumya Das, Hernando Ombao

Abstract: Developing reliable methods to discriminate different transient brain states that change over time is a key neuroscientific challenge in brain imaging studies. Topological data analysis (TDA), a novel framework based on algebraic topology, can handle such a challenge. However, existing TDA has been somewhat limited to capturing the static summary of dynamically changing brain networks. We propose a novel dynamic-TDA framework that builds persistent homology over a time series of brain networks. We construct a Wasserstein distance based inference procedure to discriminate between time series of networks. The method is applied to the resting-state functional magnetic resonance images of human brain. We demonstrate that our proposed dynamic-TDA approach can distinctly discriminate between the topological patterns of male and female brain networks. MATLAB code for implementing this method is available at https://github.com/laplcebeltrami/PH-STAT.

Submitted 18 December, 2023; v1 submitted 17 October, 2022; originally announced October 2022.

Comments: In press in journal Foundations of Data Science
