-
Kappa-tail technique: Modeling and application to Solar Energetic Particles observed by Parker Solar Probe
Authors:
G. Livadiotis,
A. T. Cummings,
M. E. Cuesta,
R. Bandyopadhyay,
H. A. Farooki,
L. Y. Khoo,
D. J. McComas,
J. S. Rankin,
T. Sharma,
M. M. Shen,
C. M. S. Cohen,
G. D. Muro,
Z. Xu
Abstract:
We develop the kappa-tail fitting technique, which analyzes observations of power-law tails of distributions and energy-flux spectra and connects them to theoretical modeling of kappa distributions, to determine the thermodynamics of the examined space plasma. In particular, we (i) construct the associated mathematical formulation, (ii) prove its decisive lead for determining whether the observed…
▽ More
We develop the kappa-tail fitting technique, which analyzes observations of power-law tails of distributions and energy-flux spectra and connects them to theoretical modeling of kappa distributions, to determine the thermodynamics of the examined space plasma. In particular, we (i) construct the associated mathematical formulation, (ii) prove its decisive lead for determining whether the observed power-law is associated with kappa distributions; and (iii) provide a validation of the technique using pseudo-observations of typical input plasma parameters. Then, we apply this technique to a case-study by determining the thermodynamics of solar energetic particle (SEP) protons, for a SEP event observed on April 17, 2021, by the PSP/ISOIS instrument suite onboard PSP. The results show SEP temperatures and densities of the order of $\sim 1$ MeV and $ \sim 5 \cdot 10^{-7} $ cm$^{-3}$, respectively.
△ Less
Submitted 4 July, 2024;
originally announced July 2024.
-
Temporal label recovery from noisy dynamical data
Authors:
Yuehaw Khoo,
Xin T. Tong,
Wanjie Wang,
Yuguan Wang
Abstract:
Analyzing dynamical data often requires information of the temporal labels, but such information is unavailable in many applications. Recovery of these temporal labels, closely related to the seriation or sequencing problem, becomes crucial in the study. However, challenges arise due to the nonlinear nature of the data and the complexity of the underlying dynamical system, which may be periodic or…
▽ More
Analyzing dynamical data often requires information of the temporal labels, but such information is unavailable in many applications. Recovery of these temporal labels, closely related to the seriation or sequencing problem, becomes crucial in the study. However, challenges arise due to the nonlinear nature of the data and the complexity of the underlying dynamical system, which may be periodic or non-periodic. Additionally, noise within the feature space complicates the theoretical analysis. Our work develops spectral algorithms that leverage manifold learning concepts to recover temporal labels from noisy data. We first construct the graph Laplacian of the data, and then employ the second (and the third) Fiedler vectors to recover temporal labels. This method can be applied to both periodic and aperiodic cases. It also does not require monotone properties on the similarity matrix, which are commonly assumed in existing spectral seriation algorithms. We develop the $\ell_{\infty}$ error of our estimators for the temporal labels and ranking, without assumptions on the eigen-gap. In numerical analysis, our method outperforms spectral seriation algorithms based on a similarity matrix. The performance of our algorithms is further demonstrated on a synthetic biomolecule data example.
△ Less
Submitted 19 June, 2024;
originally announced June 2024.
-
S-SOS: Stochastic Sum-Of-Squares for Parametric Polynomial Optimization
Authors:
Richard L. Zhu,
Mathias Oster,
Yuehaw Khoo
Abstract:
Global polynomial optimization is an important tool across applied mathematics, with many applications in operations research, engineering, and physical sciences. In various settings, the polynomials depend on external parameters that may be random. We discuss a stochastic sum-of-squares (S-SOS) algorithm based on the sum-of squares hierarchy that constructs a series of semidefinite programs to jo…
▽ More
Global polynomial optimization is an important tool across applied mathematics, with many applications in operations research, engineering, and physical sciences. In various settings, the polynomials depend on external parameters that may be random. We discuss a stochastic sum-of-squares (S-SOS) algorithm based on the sum-of squares hierarchy that constructs a series of semidefinite programs to jointly find strict lower bounds on the global minimum and extract candidates for parameterized global minimizers. We prove quantitative convergence of the hierarchy as the degree increases and use it to solve unconstrained and constrained polynomial optimization problems parameterized by random variables. By employing $n$-body priors from condensed matter physics to induce sparsity, we can use S-SOS to produce solutions and uncertainty intervals for sensor network localization problems containing up to 40 variables and semidefinite matrix sizes surpassing $800 \times 800$.
△ Less
Submitted 13 June, 2024;
originally announced June 2024.
-
Solving Fractional Differential Equations on a Quantum Computer: A Variational Approach
Authors:
Fong Yew Leong,
Dax Enshan Koh,
Jian Feng Kong,
Siong Thye Goh,
Jun Yong Khoo,
Wei-Bin Ewe,
Hongying Li,
Jayne Thompson,
Dario Poletti
Abstract:
We introduce an efficient variational hybrid quantum-classical algorithm designed for solving Caputo time-fractional partial differential equations. Our method employs an iterable cost function incorporating a linear combination of overlap history states. The proposed algorithm is not only efficient in time complexity, but has lower memory costs compared to classical methods. Our results indicate…
▽ More
We introduce an efficient variational hybrid quantum-classical algorithm designed for solving Caputo time-fractional partial differential equations. Our method employs an iterable cost function incorporating a linear combination of overlap history states. The proposed algorithm is not only efficient in time complexity, but has lower memory costs compared to classical methods. Our results indicate that solution fidelity is insensitive to the fractional index and that gradient evaluation cost scales economically with the number of time steps. As a proof of concept, we apply our algorithm to solve a range of fractional partial differential equations commonly encountered in engineering applications, such as the sub-diffusion equation, the non-linear Burgers' equation and a coupled diffusive epidemic model. We assess quantum hardware performance under realistic noise conditions, further validating the practical utility of our algorithm.
△ Less
Submitted 12 June, 2024;
originally announced June 2024.
-
Multi-Frequency Progressive Refinement for Learned Inverse Scattering
Authors:
Owen Melia,
Olivia Tsang,
Vasileios Charisopoulos,
Yuehaw Khoo,
Jeremy Hoskins,
Rebecca Willett
Abstract:
Interpreting scattered acoustic and electromagnetic wave patterns is a computational task that enables remote imaging in a number of important applications, including medical imaging, geophysical exploration, sonar and radar detection, and nondestructive testing of materials. However, accurately and stably recovering an inhomogeneous medium from far-field scattered wave measurements is a computati…
▽ More
Interpreting scattered acoustic and electromagnetic wave patterns is a computational task that enables remote imaging in a number of important applications, including medical imaging, geophysical exploration, sonar and radar detection, and nondestructive testing of materials. However, accurately and stably recovering an inhomogeneous medium from far-field scattered wave measurements is a computationally difficult problem, due to the nonlinear and non-local nature of the forward scattering process. We design a neural network, called Multi-Frequency Inverse Scattering Network (MFISNet), and a training method to approximate the inverse map from far-field scattered wave measurements at multiple frequencies. We consider three variants of MFISNet, with the strongest performing variant inspired by the recursive linearization method -- a commonly used technique for stably inverting scattered wavefield data -- that progressively refines the estimate with higher frequency content.
△ Less
Submitted 21 May, 2024;
originally announced May 2024.
-
Augmented Lagrangian method for coupled-cluster
Authors:
Fabian M. Faulstich,
Yuehaw Khoo,
Kangbo Li
Abstract:
We propose to improve the convergence properties of the single-reference coupled cluster (CC) method through an augmented Lagrangian formalism. The conventional CC method changes a linear high-dimensional eigenvalue problem with exponential size into a problem of determining the roots of a nonlinear system of equations that has a manageable size. However, current numerical procedures for solving t…
▽ More
We propose to improve the convergence properties of the single-reference coupled cluster (CC) method through an augmented Lagrangian formalism. The conventional CC method changes a linear high-dimensional eigenvalue problem with exponential size into a problem of determining the roots of a nonlinear system of equations that has a manageable size. However, current numerical procedures for solving this system of equations to get the lowest eigenvalue suffer from two practical issues: First, solving the CC equations may not converge, and second, when converging, they may converge to other -- potentially unphysical -- states, which are stationary points of the CC energy expression. We show that both issues can be dealt with when a suitably defined energy is minimized in addition to solving the original CC equations. We further propose an augmented Lagrangian method for coupled cluster (alm-CC) to solve the resulting constrained optimization problem. We numerically investigate the proposed augmented Lagrangian formulation showing that the convergence towards the ground state is significantly more stable and that the optimization procedure is less susceptible to local minima. Furthermore, the computational cost of alm-CC is comparable to the conventional CC method.
△ Less
Submitted 24 March, 2024;
originally announced March 2024.
-
Correlation of Coronal Mass Ejection Shock Temperature with Solar Energetic Particle Intensity
Authors:
Manuel Enrique Cuesta,
D. J. McComas,
L. Y. Khoo,
R. Bandyopadhyay,
T. Sharma,
M. M. Shen,
J. S. Rankin,
A. T. Cummings,
J. R. Szalay,
C. M. S. Cohen,
N. A. Schwadron,
R. Chhiber,
F. Pecora,
W. H. Matthaeus,
R. A. Leske,
M. L. Stevens
Abstract:
Solar energetic particle (SEP) events have been observed by the Parker Solar Probe (PSP) spacecraft since its launch in 2018. These events include sources from solar flares and coronal mass ejections (CMEs). Onboard PSP is the IS\(\odot\)IS instrument suite measuring ions over energies from ~ 20 keV/nucleon to 200 MeV/nucleon and electrons from ~ 20 keV to 6 MeV. Previous studies sought to group C…
▽ More
Solar energetic particle (SEP) events have been observed by the Parker Solar Probe (PSP) spacecraft since its launch in 2018. These events include sources from solar flares and coronal mass ejections (CMEs). Onboard PSP is the IS\(\odot\)IS instrument suite measuring ions over energies from ~ 20 keV/nucleon to 200 MeV/nucleon and electrons from ~ 20 keV to 6 MeV. Previous studies sought to group CME characteristics based on their plasma conditions and arrived at general descriptions with large statistical errors, leaving open questions on how to properly group CMEs based solely on their plasma conditions. To help resolve these open questions, plasma properties of CMEs have been examined in relation to SEPs. Here we reexamine one plasma property, the solar wind proton temperature, and compare it to the proton SEP intensity in a region immediately downstream of a CME-driven shock for seven CMEs observed at radial distances within 1 au. We find a statistically strong correlation between proton SEP intensity and bulk proton temperature, indicating a clear relationship between SEPs and the conditions in the solar wind. Furthermore, we propose that an indirect coupling of SEP intensity to the level of turbulence and the amount of energy dissipation that results is mainly responsible for the observed correlation between SEP intensity and proton temperature. These results are key to understanding the interaction of SEPs with the bulk solar wind in CME-driven shocks and will improve our ability to model the interplay of shock evolution and particle acceleration.
△ Less
Submitted 31 January, 2024;
originally announced February 2024.
-
Nonparametric Density Estimation via Variance-Reduced Sketching
Authors:
Yifan Peng,
Yuehaw Khoo,
Daren Wang
Abstract:
Nonparametric density models are of great interest in various scientific and engineering disciplines. Classical density kernel methods, while numerically robust and statistically sound in low-dimensional settings, become inadequate even in moderate higher-dimensional settings due to the curse of dimensionality. In this paper, we introduce a new framework called Variance-Reduced Sketching (VRS), sp…
▽ More
Nonparametric density models are of great interest in various scientific and engineering disciplines. Classical density kernel methods, while numerically robust and statistically sound in low-dimensional settings, become inadequate even in moderate higher-dimensional settings due to the curse of dimensionality. In this paper, we introduce a new framework called Variance-Reduced Sketching (VRS), specifically designed to estimate multivariable density functions with a reduced curse of dimensionality. Our framework conceptualizes multivariable functions as infinite-size matrices, and facilitates a new sketching technique motivated by numerical linear algebra literature to reduce the variance in density estimation problems. We demonstrate the robust numerical performance of VRS through a series of simulated experiments and real-world data applications. Notably, VRS shows remarkable improvement over existing neural network estimators and classical kernel methods in numerous density models. Additionally, we offer theoretical justifications for VRS to support its ability to deliver nonparametric density estimation with a reduced curse of dimensionality.
△ Less
Submitted 7 July, 2024; v1 submitted 21 January, 2024;
originally announced January 2024.
-
On the Mesoscale Structure of CMEs at Mercury's Orbit: BepiColombo and Parker Solar Probe Observations
Authors:
Erika Palmerio,
Fernando Carcaboso,
Leng Ying Khoo,
Tarik M. Salman,
Beatriz Sánchez-Cano,
Benjamin J. Lynch,
Yeimy J. Rivera,
Sanchita Pal,
Teresa Nieves-Chinchilla,
Andreas J. Weiss,
David Lario,
Johannes Z. D. Mieth,
Daniel Heyner,
Michael L. Stevens,
Orlando M. Romeo,
Andrei N. Zhukov,
Luciano Rodriguez,
Christina O. Lee,
Christina M. S. Cohen,
Laura Rodríguez-García,
Phyllis L. Whittlesey,
Nina Dresing,
Philipp Oleynik,
Immanuel C. Jebaraj,
David Fischer
, et al. (5 additional authors not shown)
Abstract:
On 2022 February 15, an impressive filament eruption was observed off the solar eastern limb from three remote-sensing viewpoints, namely Earth, STEREO-A, and Solar Orbiter. In addition to representing the most-distant observed filament at extreme ultraviolet wavelengths -- captured by Solar Orbiter's field of view extending to above 6 $R_{\odot}$ -- this event was also associated with the release…
▽ More
On 2022 February 15, an impressive filament eruption was observed off the solar eastern limb from three remote-sensing viewpoints, namely Earth, STEREO-A, and Solar Orbiter. In addition to representing the most-distant observed filament at extreme ultraviolet wavelengths -- captured by Solar Orbiter's field of view extending to above 6 $R_{\odot}$ -- this event was also associated with the release of a fast ($\sim$2200 km$\cdot$s$^{-1}$) coronal mass ejection (CME) that was directed towards BepiColombo and Parker Solar Probe. These two probes were separated by 2$^{\circ}$ in latitude, 4$^{\circ}$ in longitude, and 0.03 au in radial distance around the time of the CME-driven shock arrival in situ. The relative proximity of the two probes to each other and to the Sun ($\sim$0.35 au) allows us to study the mesoscale structure of CMEs at Mercury's orbit for the first time. We analyse similarities and differences in the main CME-related structures measured at the two locations, namely the interplanetary shock, the sheath region, and the magnetic ejecta. We find that, despite the separation between the two spacecraft being well within the typical uncertainties associated with determination of CME geometric parameters from remote-sensing observations, the two sets of in-situ measurements display some profound differences that make understanding of the overall 3D CME structure particularly challenging. Finally, we discuss our findings within the context of space weather at Mercury's distances and in terms of the need to investigate solar transients via spacecraft constellations with small separations, which has been gaining significant attention during recent years.
△ Less
Submitted 3 January, 2024;
originally announced January 2024.
-
Robust Point Matching with Distance Profiles
Authors:
YoonHaeng Hur,
Yuehaw Khoo
Abstract:
While matching procedures based on pairwise distances are conceptually appealing and thus favored in practice, theoretical guarantees for such procedures are rarely found in the literature. We propose and analyze matching procedures based on distance profiles that are easily implementable in practice, showing these procedures are robust to outliers and noise. We demonstrate the performance of the…
▽ More
While matching procedures based on pairwise distances are conceptually appealing and thus favored in practice, theoretical guarantees for such procedures are rarely found in the literature. We propose and analyze matching procedures based on distance profiles that are easily implementable in practice, showing these procedures are robust to outliers and noise. We demonstrate the performance of the proposed method using a real data example and provide simulation studies to complement the theoretical findings.
△ Less
Submitted 15 May, 2024; v1 submitted 19 December, 2023;
originally announced December 2023.
-
Boosting Prompt-Based Self-Training With Mapping-Free Automatic Verbalizer for Multi-Class Classification
Authors:
Yookyung Kho,
Jaehee Kim,
Pilsung Kang
Abstract:
Recently, prompt-based fine-tuning has garnered considerable interest as a core technique for few-shot text classification task. This approach reformulates the fine-tuning objective to align with the Masked Language Modeling (MLM) objective. Leveraging unlabeled data, prompt-based self-training has shown greater effectiveness in binary and three-class classification. However, prompt-based self-tra…
▽ More
Recently, prompt-based fine-tuning has garnered considerable interest as a core technique for few-shot text classification task. This approach reformulates the fine-tuning objective to align with the Masked Language Modeling (MLM) objective. Leveraging unlabeled data, prompt-based self-training has shown greater effectiveness in binary and three-class classification. However, prompt-based self-training for multi-class classification has not been adequately investigated, despite its significant applicability to real-world scenarios. Moreover, extending current methods to multi-class classification suffers from the verbalizer that extracts the predicted value of manually pre-defined single label word for each class from MLM predictions. Consequently, we introduce a novel, efficient verbalizer structure, named Mapping-free Automatic Verbalizer (MAV). Comprising two fully connected layers, MAV serves as a trainable verbalizer that automatically extracts the requisite word features for classification by capitalizing on all available information from MLM predictions. Experimental results on five multi-class classification datasets indicate MAV's superior self-training efficacy.
△ Less
Submitted 8 December, 2023;
originally announced December 2023.
-
A quantum tug of war between randomness and symmetries on homogeneous spaces
Authors:
Rahul Arvind,
Kishor Bharti,
Jun Yong Khoo,
Dax Enshan Koh,
Jian Feng Kong
Abstract:
We explore the interplay between symmetry and randomness in quantum information. Adopting a geometric approach, we consider states as $H$-equivalent if related by a symmetry transformation characterized by the group $H$. We then introduce the Haar measure on the homogeneous space $\mathbb{U}/H$, characterizing true randomness for $H$-equivalent systems. While this mathematical machinery is well-st…
▽ More
We explore the interplay between symmetry and randomness in quantum information. Adopting a geometric approach, we consider states as $H$-equivalent if related by a symmetry transformation characterized by the group $H$. We then introduce the Haar measure on the homogeneous space $\mathbb{U}/H$, characterizing true randomness for $H$-equivalent systems. While this mathematical machinery is well-studied by mathematicians, it has seen limited application in quantum information: we believe our work to be the first instance of utilizing homogeneous spaces to characterize symmetry in quantum information. This is followed by a discussion of approximations of true randomness, commencing with $t$-wise independent approximations and defining $t$-designs on $\mathbb{U}/H$ and $H$-equivalent states. Transitioning further, we explore pseudorandomness, defining pseudorandom unitaries and states within homogeneous spaces. Finally, as a practical demonstration of our findings, we study the expressibility of quantum machine learning ansatze in homogeneous spaces. Our work provides a fresh perspective on the relationship between randomness and symmetry in the quantum world.
△ Less
Submitted 11 September, 2023;
originally announced September 2023.
-
Convex Relaxation for Fokker-Planck
Authors:
Yian Chen,
Yuehaw Khoo,
Lek-Heng Lim
Abstract:
We propose an approach to directly estimate the moments or marginals for a high-dimensional equilibrium distribution in statistical mechanics, via solving the high-dimensional Fokker-Planck equation in terms of low-order cluster moments or marginals. With this approach, we bypass the exponential complexity of estimating the full high-dimensional distribution and directly solve the simplified parti…
▽ More
We propose an approach to directly estimate the moments or marginals for a high-dimensional equilibrium distribution in statistical mechanics, via solving the high-dimensional Fokker-Planck equation in terms of low-order cluster moments or marginals. With this approach, we bypass the exponential complexity of estimating the full high-dimensional distribution and directly solve the simplified partial differential equations for low-order moments/marginals. Moreover, the proposed moment/marginal relaxation is fully convex and can be solved via off-the-shelf solvers. We further propose a time-dependent version of the convex programs to study non-equilibrium dynamics. We show the proposed method can recover the meanfield approximation of an equilibrium density. Numerical results are provided to demonstrate the performance of the proposed algorithm for high-dimensional systems.
△ Less
Submitted 3 December, 2023; v1 submitted 5 June, 2023;
originally announced June 2023.
-
Painsight: An Extendable Opinion Mining Framework for Detecting Pain Points Based on Online Customer Reviews
Authors:
Yukyung Lee,
Jaehee Kim,
Doyoon Kim,
Yookyung Kho,
Younsun Kim,
Pilsung Kang
Abstract:
As the e-commerce market continues to expand and online transactions proliferate, customer reviews have emerged as a critical element in shaping the purchasing decisions of prospective buyers. Previous studies have endeavored to identify key aspects of customer reviews through the development of sentiment analysis models and topic models. However, extracting specific dissatisfaction factors remain…
▽ More
As the e-commerce market continues to expand and online transactions proliferate, customer reviews have emerged as a critical element in shaping the purchasing decisions of prospective buyers. Previous studies have endeavored to identify key aspects of customer reviews through the development of sentiment analysis models and topic models. However, extracting specific dissatisfaction factors remains a challenging task. In this study, we delineate the pain point detection problem and propose Painsight, an unsupervised framework for automatically extracting distinct dissatisfaction factors from customer reviews without relying on ground truth labels. Painsight employs pre-trained language models to construct sentiment analysis and topic models, leveraging attribution scores derived from model gradients to extract dissatisfaction factors. Upon application of the proposed methodology to customer review data spanning five product categories, we successfully identified and categorized dissatisfaction factors within each group, as well as isolated factors for each type. Notably, Painsight outperformed benchmark methods, achieving substantial performance enhancements and exceptional results in human evaluations.
△ Less
Submitted 3 June, 2023;
originally announced June 2023.
-
Combining Monte Carlo and Tensor-network Methods for Partial Differential Equations via Sketching
Authors:
Yian Chen,
Yuehaw Khoo
Abstract:
In this paper, we propose a general framework for solving high-dimensional partial differential equations with tensor networks. Our approach uses Monte-Carlo simulations to update the solution and re-estimates the new solution from samples as a tensor-network using a recently proposed tensor train sketching technique. We showcase the versatility and flexibility of our approach by applying it to tw…
▽ More
In this paper, we propose a general framework for solving high-dimensional partial differential equations with tensor networks. Our approach uses Monte-Carlo simulations to update the solution and re-estimates the new solution from samples as a tensor-network using a recently proposed tensor train sketching technique. We showcase the versatility and flexibility of our approach by applying it to two specific scenarios: simulating the Fokker-Planck equation through Langevin dynamics and quantum imaginary time evolution via auxiliary-field quantum Monte Carlo. We also provide convergence guarantees and numerical experiments to demonstrate the efficacy of the proposed method.
△ Less
Submitted 10 October, 2023; v1 submitted 29 May, 2023;
originally announced May 2023.
-
Tensorizing flows: a tool for variational inference
Authors:
Yuehaw Khoo,
Michael Lindsey,
Hongli Zhao
Abstract:
Fueled by the expressive power of deep neural networks, normalizing flows have achieved spectacular success in generative modeling, or learning to draw new samples from a distribution given a finite dataset of training samples. Normalizing flows have also been applied successfully to variational inference, wherein one attempts to learn a sampler based on an expression for the log-likelihood or ene…
▽ More
Fueled by the expressive power of deep neural networks, normalizing flows have achieved spectacular success in generative modeling, or learning to draw new samples from a distribution given a finite dataset of training samples. Normalizing flows have also been applied successfully to variational inference, wherein one attempts to learn a sampler based on an expression for the log-likelihood or energy function of the distribution, rather than on data. In variational inference, the unimodality of the reference Gaussian distribution used within the normalizing flow can cause difficulties in learning multimodal distributions. We introduce an extension of normalizing flows in which the Gaussian reference is replaced with a reference distribution that is constructed via a tensor network, specifically a matrix product state or tensor train. We show that by combining flows with tensor networks on difficult variational inference tasks, we can improve on the results obtained by using either tool without the other.
△ Less
Submitted 3 May, 2023;
originally announced May 2023.
-
Deep Neural-network Prior for Orbit Recovery from Method of Moments
Authors:
Yuehaw Khoo,
Sounak Paul,
Nir Sharon
Abstract:
Orbit recovery problems are a class of problems that often arise in practice and various forms. In these problems, we aim to estimate an unknown function after being distorted by a group action and observed via a known operator. Typically, the observations are contaminated with a non-trivial level of noise. Two particular orbit recovery problems of interest in this paper are multireference alignme…
▽ More
Orbit recovery problems are a class of problems that often arise in practice and various forms. In these problems, we aim to estimate an unknown function after being distorted by a group action and observed via a known operator. Typically, the observations are contaminated with a non-trivial level of noise. Two particular orbit recovery problems of interest in this paper are multireference alignment and single-particle cryo-EM modelling. In order to suppress the noise, we suggest using the method of moments approach for both problems while introducing deep neural network priors. In particular, our neural networks should output the signals and the distribution of group elements, with moments being the input. In the multireference alignment case, we demonstrate the advantage of using the NN to accelerate the convergence for the reconstruction of signals from the moments. Finally, we use our method to reconstruct simulated and biological volumes in the cryo-EM setting.
△ Less
Submitted 30 January, 2024; v1 submitted 27 April, 2023;
originally announced April 2023.
-
Generative Modeling via Hierarchical Tensor Sketching
Authors:
Yifan Peng,
Yian Chen,
E. Miles Stoudenmire,
Yuehaw Khoo
Abstract:
We propose a hierarchical tensor-network approach for approximating high-dimensional probability density via empirical distribution. This leverages randomized singular value decomposition (SVD) techniques and involves solving linear equations for tensor cores in this tensor network. The complexity of the resulting algorithm scales linearly in the dimension of the high-dimensional density. An analy…
▽ More
We propose a hierarchical tensor-network approach for approximating high-dimensional probability density via empirical distribution. This leverages randomized singular value decomposition (SVD) techniques and involves solving linear equations for tensor cores in this tensor network. The complexity of the resulting algorithm scales linearly in the dimension of the high-dimensional density. An analysis of estimation error demonstrates the effectiveness of this method through several numerical experiments.
△ Less
Submitted 11 April, 2023;
originally announced April 2023.
-
High-dimensional density estimation with tensorizing flow
Authors:
Yinuo Ren,
Hongli Zhao,
Yuehaw Khoo,
Lexing Ying
Abstract:
We propose the tensorizing flow method for estimating high-dimensional probability density functions from the observed data. The method is based on tensor-train and flow-based generative modeling. Our method first efficiently constructs an approximate density in the tensor-train form via solving the tensor cores from a linear system based on the kernel density estimators of low-dimensional margina…
▽ More
We propose the tensorizing flow method for estimating high-dimensional probability density functions from the observed data. The method is based on tensor-train and flow-based generative modeling. Our method first efficiently constructs an approximate density in the tensor-train form via solving the tensor cores from a linear system based on the kernel density estimators of low-dimensional marginals. We then train a continuous-time flow model from this tensor-train density to the observed empirical distribution by performing a maximum likelihood estimation. The proposed method combines the optimization-less feature of the tensor-train with the flexibility of the flow-based generative models. Numerical results are included to demonstrate the performance of the proposed method.
△ Less
Submitted 1 December, 2022;
originally announced December 2022.
-
POGD: Gradient Descent with New Stochastic Rules
Authors:
Feihu Han,
Sida Xing,
Sui Yang Khoo
Abstract:
There introduce Particle Optimized Gradient Descent (POGD), an algorithm based on the gradient descent but integrates the particle swarm optimization (PSO) principle to achieve the iteration. From the experiments, this algorithm has adaptive learning ability. The experiments in this paper mainly focus on the training speed to reach the target value and the ability to prevent the local minimum. The…
▽ More
There introduce Particle Optimized Gradient Descent (POGD), an algorithm based on the gradient descent but integrates the particle swarm optimization (PSO) principle to achieve the iteration. From the experiments, this algorithm has adaptive learning ability. The experiments in this paper mainly focus on the training speed to reach the target value and the ability to prevent the local minimum. The experiments in this paper are achieved by the convolutional neural network (CNN) image classification on the MNIST and cifar-10 datasets.
△ Less
Submitted 15 October, 2022;
originally announced October 2022.
-
Autocorrelation analysis for cryo-EM with sparsity constraints: Improved sample complexity and projection-based algorithms
Authors:
Tamir Bendory,
Yuehaw Khoo,
Joe Kileel,
Oscar Mickelin,
Amit Singer
Abstract:
The number of noisy images required for molecular reconstruction in single-particle cryo-electron microscopy (cryo-EM) is governed by the autocorrelations of the observed, randomly-oriented, noisy projection images. In this work, we consider the effect of imposing sparsity priors on the molecule. We use techniques from signal processing, optimization, and applied algebraic geometry to obtain new t…
▽ More
The number of noisy images required for molecular reconstruction in single-particle cryo-electron microscopy (cryo-EM) is governed by the autocorrelations of the observed, randomly-oriented, noisy projection images. In this work, we consider the effect of imposing sparsity priors on the molecule. We use techniques from signal processing, optimization, and applied algebraic geometry to obtain new theoretical and computational contributions for this challenging non-linear inverse problem with sparsity constraints. We prove that molecular structures modeled as sums of Gaussians are uniquely determined by the second-order autocorrelation of their projection images, implying that the sample complexity is proportional to the square of the variance of the noise. This theory improves upon the non-sparse case, where the third-order autocorrelation is required for uniformly-oriented particle images and the sample complexity scales with the cube of the noise variance. Furthermore, we build a computational framework to reconstruct molecular structures which are sparse in the wavelet basis. This method combines the sparse representation for the molecule with projection-based techniques used for phase retrieval in X-ray crystallography.
△ Less
Submitted 1 May, 2023; v1 submitted 21 September, 2022;
originally announced September 2022.
-
Generative Modeling via Tree Tensor Network States
Authors:
Xun Tang,
Yoonhaeng Hur,
Yuehaw Khoo,
Lexing Ying
Abstract:
In this paper, we present a density estimation framework based on tree tensor-network states. The proposed method consists of determining the tree topology with Chow-Liu algorithm, and obtaining a linear system of equations that defines the tensor-network components via sketching techniques. Novel choices of sketch functions are developed in order to consider graphical models that contain loops. S…
▽ More
In this paper, we present a density estimation framework based on tree tensor-network states. The proposed method consists of determining the tree topology with Chow-Liu algorithm, and obtaining a linear system of equations that defines the tensor-network components via sketching techniques. Novel choices of sketch functions are developed in order to consider graphical models that contain loops. Sample complexity guarantees are provided and further corroborated by numerical experiments.
△ Less
Submitted 3 September, 2022;
originally announced September 2022.
-
Quantitatively visualizing bipartite datasets
Authors:
Tal Einav,
Yuehaw Khoo,
Amit Singer
Abstract:
As experiments continue to increase in size and scope, a fundamental challenge of subsequent analyses is to recast the wealth of information into an intuitive and readily-interpretable form. Often, each measurement only conveys the relationship between a pair of entries, and it is difficult to integrate these local interactions across a dataset to form a cohesive global picture. The classic locali…
▽ More
As experiments continue to increase in size and scope, a fundamental challenge of subsequent analyses is to recast the wealth of information into an intuitive and readily-interpretable form. Often, each measurement only conveys the relationship between a pair of entries, and it is difficult to integrate these local interactions across a dataset to form a cohesive global picture. The classic localization problem tackles this question, transforming local measurements into a global map that reveals the underlying structure of a system. Here, we examine the more challenging bipartite localization problem, where pairwise distances are only available for bipartite data comprising two classes of entries (such as antibody-virus interactions, drug-cell potency, or user-rating profiles). We modify previous algorithms to solve bipartite localization and examine how each method behaves in the presence of noise, outliers, and partially-observed data. As a proof of concept, we apply these algorithms to antibody-virus neutralization measurements to create a basis set of antibody behaviors, formalize how potently inhibiting some viruses necessitates weakly inhibiting other viruses, and quantify how often combinations of antibodies exhibit degenerate behavior.
△ Less
Submitted 26 July, 2022;
originally announced July 2022.
-
Reinforced Inverse Scattering
Authors:
Hanyang Jiang,
Yuehaw Khoo,
Haizhao Yang
Abstract:
Inverse wave scattering aims at determining the properties of an object using data on how the object scatters incoming waves. In order to collect information, sensors are put in different locations to send and receive waves from each other. The choice of sensor positions and incident wave frequencies determines the reconstruction quality of scatterer properties. This paper introduces reinforcement…
▽ More
Inverse wave scattering aims at determining the properties of an object using data on how the object scatters incoming waves. In order to collect information, sensors are put in different locations to send and receive waves from each other. The choice of sensor positions and incident wave frequencies determines the reconstruction quality of scatterer properties. This paper introduces reinforcement learning to develop precision imaging that decides sensor positions and wave frequencies adaptive to different scatterers in an intelligent way, thus obtaining a significant improvement in reconstruction quality with limited imaging resources. Extensive numerical results will be provided to demonstrate the superiority of the proposed method over existing methods.
△ Less
Submitted 2 November, 2022; v1 submitted 8 June, 2022;
originally announced June 2022.
-
Probing the Quantum Noise of the Spinon Fermi Surface with NV Centers
Authors:
Jun Yong Khoo,
Falko Pientka,
Patrick A. Lee,
Inti Sodemann Villadiego
Abstract:
We study the transverse electrical conductivity and the corresponding magnetic noise of a two-dimensional U(1) spin liquid state with a spinon Fermi surface. We show that in the quasi-static regime these responses have the same wave-vector dependence as that of a metal but are reduced by a dimensionless pre-factor controlled by the ratio of orbital diamagnetic susceptibilities of the spinons and c…
▽ More
We study the transverse electrical conductivity and the corresponding magnetic noise of a two-dimensional U(1) spin liquid state with a spinon Fermi surface. We show that in the quasi-static regime these responses have the same wave-vector dependence as that of a metal but are reduced by a dimensionless pre-factor controlled by the ratio of orbital diamagnetic susceptibilities of the spinons and chargons, correcting previous work. We estimate that this quasi-static regime is comfortably accessed by the typical NV center splittings of a few GHz and estimate that the expected T1 times for an NV center placed above candidate materials, such as the organic dmit and ET salts, monolayer 1T-TaS2/Se2, would range from several tens to a few hundred milliseconds.
△ Less
Submitted 13 May, 2022;
originally announced May 2022.
-
Generative modeling via tensor train sketching
Authors:
YH. Hur,
J. G. Hoskins,
M. Lindsey,
E. M. Stoudenmire,
Y. Khoo
Abstract:
In this paper, we introduce a sketching algorithm for constructing a tensor train representation of a probability density from its samples. Our method deviates from the standard recursive SVD-based procedure for constructing a tensor train. Instead, we formulate and solve a sequence of small linear systems for the individual tensor train cores. This approach can avoid the curse of dimensionality t…
▽ More
In this paper, we introduce a sketching algorithm for constructing a tensor train representation of a probability density from its samples. Our method deviates from the standard recursive SVD-based procedure for constructing a tensor train. Instead, we formulate and solve a sequence of small linear systems for the individual tensor train cores. This approach can avoid the curse of dimensionality that threatens both the algorithmic and sample complexities of the recovery problem. Specifically, for Markov models under natural conditions, we prove that the tensor cores can be recovered with a sample complexity that scales logarithmically in the dimensionality. Finally, we illustrate the performance of the method with several numerical experiments.
△ Less
Submitted 23 June, 2023; v1 submitted 23 February, 2022;
originally announced February 2022.
-
A Spectral Method for Joint Community Detection and Orthogonal Group Synchronization
Authors:
Yifeng Fan,
Yuehaw Khoo,
Zhizhen Zhao
Abstract:
Community detection and orthogonal group synchronization are both fundamental problems with a variety of important applications in science and engineering. In this work, we consider the joint problem of community detection and orthogonal group synchronization which aims to recover the communities and perform synchronization simultaneously. To this end, we propose a simple algorithm that consists o…
▽ More
Community detection and orthogonal group synchronization are both fundamental problems with a variety of important applications in science and engineering. In this work, we consider the joint problem of community detection and orthogonal group synchronization which aims to recover the communities and perform synchronization simultaneously. To this end, we propose a simple algorithm that consists of a spectral decomposition step followed by a blockwise column pivoted QR factorization (CPQR). The proposed algorithm is efficient and scales linearly with the number of edges in the graph. We also leverage the recently developed `leave-one-out' technique to establish a near-optimal guarantee for exact recovery of the cluster memberships and stable recovery of the orthogonal transforms. Numerical experiments demonstrate the efficiency and efficacy of our algorithm and confirm our theoretical characterization of it.
△ Less
Submitted 15 September, 2022; v1 submitted 25 December, 2021;
originally announced December 2021.
-
Piecewise Linear Units Improve Deep Neural Networks
Authors:
Jordan Inturrisi,
Sui Yang Khoo,
Abbas Kouzani,
Riccardo Pagliarella
Abstract:
The activation function is at the heart of a deep neural networks nonlinearity; the choice of the function has great impact on the success of training. Currently, many practitioners prefer the Rectified Linear Unit (ReLU) due to its simplicity and reliability, despite its few drawbacks. While most previous functions proposed to supplant ReLU have been hand-designed, recent work on learning the fun…
▽ More
The activation function is at the heart of a deep neural networks nonlinearity; the choice of the function has great impact on the success of training. Currently, many practitioners prefer the Rectified Linear Unit (ReLU) due to its simplicity and reliability, despite its few drawbacks. While most previous functions proposed to supplant ReLU have been hand-designed, recent work on learning the function during training has shown promising results. In this paper we propose an adaptive piecewise linear activation function, the Piecewise Linear Unit (PiLU), which can be learned independently for each dimension of the neural network. We demonstrate how PiLU is a generalised rectifier unit and note its similarities with the Adaptive Piecewise Linear Units, namely adaptive and piecewise linear. Across a distribution of 30 experiments, we show that for the same model architecture, hyperparameters, and pre-processing, PiLU significantly outperforms ReLU: reducing classification error by 18.53% on CIFAR-10 and 13.13% on CIFAR-100, for a minor increase in the number of neurons. Further work should be dedicated to exploring generalised piecewise linear units, as well as verifying these results across other challenging domains and larger problems.
△ Less
Submitted 22 August, 2021; v1 submitted 2 August, 2021;
originally announced August 2021.
-
Committor functions via tensor networks
Authors:
Yian Chen,
Jeremy Hoskins,
Yuehaw Khoo,
Michael Lindsey
Abstract:
We propose a novel approach for computing committor functions, which describe transitions of a stochastic process between metastable states. The committor function satisfies a backward Kolmogorov equation, and in typical high-dimensional settings of interest, it is intractable to compute and store the solution with traditional numerical methods. By parametrizing the committor function in a matrix…
▽ More
We propose a novel approach for computing committor functions, which describe transitions of a stochastic process between metastable states. The committor function satisfies a backward Kolmogorov equation, and in typical high-dimensional settings of interest, it is intractable to compute and store the solution with traditional numerical methods. By parametrizing the committor function in a matrix product state/tensor train format and using a similar representation for the equilibrium probability density, we solve the variational formulation of the backward Kolmogorov equation with linear time and memory complexity in the number of dimensions. This approach bypasses the need for sampling the equilibrium distribution, which can be difficult when the distribution has multiple modes. Numerical results demonstrate the effectiveness of the proposed method for high-dimensional problems.
△ Less
Submitted 2 August, 2021; v1 submitted 23 June, 2021;
originally announced June 2021.
-
Scalable semidefinite programming approach to variational embedding for quantum many-body problems
Authors:
Yuehaw Khoo,
Michael Lindsey
Abstract:
In quantum embedding theories, a quantum many-body system is divided into localized clusters of sites which are treated with an accurate `high-level' theory and glued together self-consistently by a less accurate `low-level' theory at the global scale. The recently introduced variational embedding approach for quantum many-body problems combines the insights of semidefinite relaxation and quantum…
▽ More
In quantum embedding theories, a quantum many-body system is divided into localized clusters of sites which are treated with an accurate `high-level' theory and glued together self-consistently by a less accurate `low-level' theory at the global scale. The recently introduced variational embedding approach for quantum many-body problems combines the insights of semidefinite relaxation and quantum embedding theory to provide a lower bound on the ground-state energy that improves as the cluster size is increased. The variational embedding method is formulated as a semidefinite program (SDP), which can suffer from poor computational scaling when treated with black-box solvers. We exploit the interpretation of this SDP as an embedding method to develop an algorithm which alternates parallelizable local updates of the high-level quantities with updates that enforce the low-level global constraints. Moreover, we show how translation invariance in lattice systems can be exploited to reduce the complexity of projecting a key matrix to the positive semidefinite cone.
△ Less
Submitted 4 June, 2021;
originally announced June 2021.
-
Joint Community Detection and Rotational Synchronization via Semidefinite Programming
Authors:
Yifeng Fan,
Yuehaw Khoo,
Zhizhen Zhao
Abstract:
In the presence of heterogeneous data, where randomly rotated objects fall into multiple underlying categories, it is challenging to simultaneously classify them into clusters and synchronize them based on pairwise relations. This gives rise to the joint problem of community detection and synchronization. We propose a series of semidefinite relaxations, and prove their exact recovery when extendin…
▽ More
In the presence of heterogeneous data, where randomly rotated objects fall into multiple underlying categories, it is challenging to simultaneously classify them into clusters and synchronize them based on pairwise relations. This gives rise to the joint problem of community detection and synchronization. We propose a series of semidefinite relaxations, and prove their exact recovery when extending the celebrated stochastic block model to this new setting where both rotations and cluster identities are to be determined. Numerical experiments demonstrate the efficacy of our proposed algorithms and confirm our theoretical result which indicates a sharp phase transition for exact recovery.
△ Less
Submitted 14 September, 2023; v1 submitted 12 May, 2021;
originally announced May 2021.
-
The universal shear conductivity of Fermi liquids and spinon Fermi surface states and its detection via spin qubit noise magnetometry
Authors:
Jun Yong Khoo,
Falko Pientka,
Inti Sodemann
Abstract:
We demonstrate a remarkable property of metallic Fermi liquids: the transverse conductivity assumes a universal value in the quasi-static ($ω\rightarrow 0$) limit for wavevectors $q$ in the regime $l_{\rm mfp}^{-1} \ll q \ll p_{\rm F}$, where $l_{\rm mfp}$ is the mean free path and $p_{\rm F}$ is the Fermi momentum. This value is $(e^2/h) \mathcal{R}_{\rm FS}/q$ in two dimensions (2D), where…
▽ More
We demonstrate a remarkable property of metallic Fermi liquids: the transverse conductivity assumes a universal value in the quasi-static ($ω\rightarrow 0$) limit for wavevectors $q$ in the regime $l_{\rm mfp}^{-1} \ll q \ll p_{\rm F}$, where $l_{\rm mfp}$ is the mean free path and $p_{\rm F}$ is the Fermi momentum. This value is $(e^2/h) \mathcal{R}_{\rm FS}/q$ in two dimensions (2D), where $\mathcal{R}_{\rm FS}$ measures the local radius of curvature of the Fermi surface in momentum space. Even more surprisingly, we find that U(1) spin liquids with a spinon Fermi surface have the same universal transverse conductivity. This means such spin liquids behave effectively as metals in this regime, even though they appear insulating in standard transport experiments. Moreover, we show that transverse current fluctuations result in a universal low-frequency magnetic noise that can be directly probed by a spin qubit, such as a nitrogen-vacancy center in diamond, placed at a distance $z$ above of the 2D metal or spin liquid. Specifically the magnetic noise is given by $Cω\mathcal{P}_{\rm FS}/z$, where $\mathcal{P}_{\rm FS}$ is the perimeter of the Fermi surface in momentum space and $C$ is a combination of fundamental constants of nature. Therefore these observables are controlled purely by the geometry of the Fermi surface and are independent of kinematic details of the quasi-particles, such as their effective mass and interactions. This behavior can be used as a new technique to measure the size of the Fermi surface of metals and as a smoking gun probe to pinpoint the presence of the elusive spinon Fermi surface in two-dimensional systems. We estimate that this universal regime is within reach of current nitrogen-vacancy center spectroscopic techniques for several spinon Fermi surface candidate materials.
△ Less
Submitted 7 May, 2021; v1 submitted 8 March, 2021;
originally announced March 2021.
-
Multiscale semidefinite programming approach to positioning problems with pairwise structure
Authors:
Yian Chen,
Yuehaw Khoo,
Michael Lindsey
Abstract:
We consider the optimization of pairwise objective functions, i.e., objective functions of the form $H(\mathbf{x}) = H(x_1,\ldots,x_N) = \sum_{1\leq i<j \leq N} H_{ij}(x_i,x_j)$ for $x_i$ in some continuous state spaces $\mathcal{X}_i$. Global optimization in this setting is generally confounded by the possible existence of spurious local minima and the impossibility of global search due to the cu…
▽ More
We consider the optimization of pairwise objective functions, i.e., objective functions of the form $H(\mathbf{x}) = H(x_1,\ldots,x_N) = \sum_{1\leq i<j \leq N} H_{ij}(x_i,x_j)$ for $x_i$ in some continuous state spaces $\mathcal{X}_i$. Global optimization in this setting is generally confounded by the possible existence of spurious local minima and the impossibility of global search due to the curse of dimensionality. In this paper, we approach such problems via convex relaxation of the marginal polytope considered in graphical modeling, proceeding in a multiscale fashion which exploits the smoothness of the cost function. We show theoretically that, compared with existing methods, such an approach is advantageous even in simple settings for sensor network localization (SNL). We successfully apply our method to SNL problems, particularly difficult instances with high noise. We also validate performance on the optimization of the Lennard-Jones potential, which is plagued by the existence of many near-optimal configurations. We demonstrate that in MMR allows us to effectively explore these configurations.
△ Less
Submitted 17 December, 2020;
originally announced December 2020.
-
A semigroup method for high dimensional committor functions based on neural network
Authors:
Haoya Li,
Yuehaw Khoo,
Yinuo Ren,
Lexing Ying
Abstract:
This paper proposes a new method based on neural networks for computing the high-dimensional committor functions that satisfy Fokker-Planck equations. Instead of working with partial differential equations, the new method works with an integral formulation based on the semigroup of the differential operator. The variational form of the new formulation is then solved by parameterizing the committor…
▽ More
This paper proposes a new method based on neural networks for computing the high-dimensional committor functions that satisfy Fokker-Planck equations. Instead of working with partial differential equations, the new method works with an integral formulation based on the semigroup of the differential operator. The variational form of the new formulation is then solved by parameterizing the committor function as a neural network. There are two major benefits of this new approach. First, stochastic gradient descent type algorithms can be applied in the training of the committor function without the need of computing any mixed second-order derivatives. Moreover, unlike the previous methods that enforce the boundary conditions through penalty terms, the new method takes into account the boundary conditions automatically. Numerical results are provided to demonstrate the performance of the proposed method.
△ Less
Submitted 5 May, 2021; v1 submitted 12 December, 2020;
originally announced December 2020.
-
NMR Assignment through Linear Programming
Authors:
Jose F. S. Bravo-Ferreira,
David Cowburn,
Yuehaw Khoo,
Amit Singer
Abstract:
Nuclear Magnetic Resonance (NMR) Spectroscopy is the second most used technique (after X-ray crystallography) for structural determination of proteins. A computational challenge in this technique involves solving a discrete optimization problem that assigns the resonance frequency to each atom in the protein. This paper introduces LIAN (LInear programming Assignment for NMR), a novel linear progra…
▽ More
Nuclear Magnetic Resonance (NMR) Spectroscopy is the second most used technique (after X-ray crystallography) for structural determination of proteins. A computational challenge in this technique involves solving a discrete optimization problem that assigns the resonance frequency to each atom in the protein. This paper introduces LIAN (LInear programming Assignment for NMR), a novel linear programming formulation of the problem which yields state-of-the-art results in simulated and experimental datasets.
△ Less
Submitted 7 September, 2021; v1 submitted 8 August, 2020;
originally announced August 2020.
-
Quantum entanglement recognition
Authors:
Jun Yong Khoo,
Markus Heyl
Abstract:
Entanglement constitutes a key characteristic feature of quantum matter. Its detection, however, still faces major challenges. In this letter, we formulate a framework for probing entanglement based on machine learning techniques. The central element is a protocol for the generation of statistical images from quantum many-body states, with which we perform image classification by means of convolut…
▽ More
Entanglement constitutes a key characteristic feature of quantum matter. Its detection, however, still faces major challenges. In this letter, we formulate a framework for probing entanglement based on machine learning techniques. The central element is a protocol for the generation of statistical images from quantum many-body states, with which we perform image classification by means of convolutional neural networks. We show that the resulting quantum entanglement recognition task is accurate and can be assigned a well-controlled error across a wide range of quantum states. We discuss the potential use of our scheme to quantify quantum entanglement in experiments. Our developed scheme provides a generally applicable strategy for quantum entanglement recognition in both equilibrium and nonequilibrium quantum matter.
△ Less
Submitted 12 April, 2021; v1 submitted 28 July, 2020;
originally announced July 2020.
-
Long-term variations of quasi-trapped and trapped electrons in the inner radiation belt observed by DEMETER and SAMPEX
Authors:
Kun Zhang,
Xinlin Li,
Zheng Xiang,
Leng Ying Khoo,
Hong Zhao,
Mark D. Looper,
Michael A. Temerin,
Jean-André Sauvaud
Abstract:
Electrons in the Earth's radiation belts can be categorized into three populations: precipitating, quasi-trapped and trapped. We use data from the DEMETER and SAMPEX missions and from ground-based neutron monitors (NM) and sunspot observations to investigate the long-term variation of quasi-trapped and trapped sub-MeV electrons on different L shells in the inner belt. DEMETER and SAMPEX measuremen…
▽ More
Electrons in the Earth's radiation belts can be categorized into three populations: precipitating, quasi-trapped and trapped. We use data from the DEMETER and SAMPEX missions and from ground-based neutron monitors (NM) and sunspot observations to investigate the long-term variation of quasi-trapped and trapped sub-MeV electrons on different L shells in the inner belt. DEMETER and SAMPEX measurements span over 17 years and show that at $L \leq 1.14$ the electron flux is anti-correlated with sunspot number, but proportional to the cosmic ray intensity represented by NM count rates, which suggests that electrons at the inner edge of the inner belt are produced by Cosmic Ray Albedo Neutron Decay (CRAND). The solar cycle variation of cosmic rays increased the electron flux at $L \leq 1.14$ by a factor of two from solar maximum at 2001 to solar minimum at 2009. At $L \ge 1.2$, both quasi-trapped and trapped electrons are enhanced during geomagnetic storms and decay to a background level during extended quiet times. At $L>2$, quasi-trapped electrons resemble trapped electrons, with correlation coefficients as high as 0.97, indicating that pitch angle scattering is the dominant process in this region.
△ Less
Submitted 26 August, 2020; v1 submitted 10 June, 2020;
originally announced June 2020.
-
Maximizing robustness of point-set registration by leveraging non-convexity
Authors:
Cindy Orozco Bohorquez,
Yuehaw Khoo,
Lexing Ying
Abstract:
Point-set registration is a classical image processing problem that looks for the optimal transformation between two sets of points. In this work, we analyze the impact of outliers when finding the optimal rotation between two point clouds. The presence of outliers motivates the use of least unsquared deviation, which is a non-smooth minimization problem over non-convex domain. We compare approach…
▽ More
Point-set registration is a classical image processing problem that looks for the optimal transformation between two sets of points. In this work, we analyze the impact of outliers when finding the optimal rotation between two point clouds. The presence of outliers motivates the use of least unsquared deviation, which is a non-smooth minimization problem over non-convex domain. We compare approaches based on non-convex optimization over special orthogonal group and convex relaxations. We show that if the fraction of outliers is larger than a certain threshold, any naive convex relaxation fails to recover the ground truth rotation regardless of the sample size and dimension. In contrast, minimizing the least unsquared deviation directly over the special orthogonal group exactly recovers the ground truth rotation for any level of corruption as long as the sample size is large enough. These theoretical findings are supported by numerical simulations.
△ Less
Submitted 7 October, 2020; v1 submitted 19 April, 2020;
originally announced April 2020.
-
Quantum Paracrystalline Shear Modes of the Electron Liquid
Authors:
Jun Yong Khoo,
Po-Yao Chang,
Falko Pientka,
Inti Sodemann
Abstract:
Unlike classical fluids, a quantum Fermi liquid can support a long-lived and propagating shear sound wave at arbitrarily small wave vectors and frequencies, reminiscent of the transverse sound in crystals, despite lacking any form of long-range crystalline order. This mode is expected to be present in moderately interacting metals where the quasiparticle mass is renormalized to be more than twice…
▽ More
Unlike classical fluids, a quantum Fermi liquid can support a long-lived and propagating shear sound wave at arbitrarily small wave vectors and frequencies, reminiscent of the transverse sound in crystals, despite lacking any form of long-range crystalline order. This mode is expected to be present in moderately interacting metals where the quasiparticle mass is renormalized to be more than twice the bare mass in two dimensions (2D), but it has remained undetected because it is hard to excite since it does not involve charge density fluctuations, in contrast to the conventional plasma mode. In this work we propose a strategy to excite and detect this unconventional mode in clean metallic channels. We show that the shear sound is responsible for the appearance of sharp dips in the ac conductance of narrow channels at resonant frequencies matching its dispersion. The liquid resonates while minimizing its dissipation in an analogous fashion to a sliding crystal. Ultra-clean 2D materials that can be tuned towards the Wigner crystallization transition such as silicon metal-oxide-semiconductor field-effect transistors, MgZnO/ZnO, p-GaAs, and AlAs quantum wells are promising platforms to experimentally discover the shear sound.
△ Less
Submitted 9 September, 2020; v1 submitted 17 January, 2020;
originally announced January 2020.
-
Method of moments for 3-D single particle ab initio modeling with non-uniform distribution of viewing angles
Authors:
Nir Sharon,
Joe Kileel,
Yuehaw Khoo,
Boris Landa,
Amit Singer
Abstract:
Single-particle reconstruction in cryo-electron microscopy (cryo-EM) is an increasingly popular technique for determining the 3-D structure of a molecule from several noisy 2-D projections images taken at unknown viewing angles. Most reconstruction algorithms require a low-resolution initialization for the 3-D structure, which is the goal of ab initio modeling. Suggested by Zvi Kam in 1980, the me…
▽ More
Single-particle reconstruction in cryo-electron microscopy (cryo-EM) is an increasingly popular technique for determining the 3-D structure of a molecule from several noisy 2-D projections images taken at unknown viewing angles. Most reconstruction algorithms require a low-resolution initialization for the 3-D structure, which is the goal of ab initio modeling. Suggested by Zvi Kam in 1980, the method of moments (MoM) offers one approach, wherein low-order statistics of the 2-D images are computed and a 3-D structure is estimated by solving a system of polynomial equations. Unfortunately, Kam's method suffers from restrictive assumptions, most notably that viewing angles should be distributed uniformly. Often unrealistic, uniformity entails the computation of higher-order correlations, as in this case first and second moments fail to determine the 3-D structure. In the present paper, we remove this hypothesis, by permitting an unknown, non-uniform distribution of viewing angles in MoM. Perhaps surprisingly, we show that this case is statistically easier than the uniform case, as now first and second moments generically suffice to determine low-resolution expansions of the molecule. In the idealized setting of a known, non-uniform distribution, we find an efficient provable algorithm inverting first and second moments. For unknown, non-uniform distributions, we use non-convex optimization methods to solve for both the molecule and distribution.
△ Less
Submitted 23 November, 2019; v1 submitted 11 July, 2019;
originally announced July 2019.
-
Semidefinite relaxation of multi-marginal optimal transport for strictly correlated electrons in second quantization
Authors:
Yuehaw Khoo,
Lin Lin,
Michael Lindsey,
Lexing Ying
Abstract:
We consider the strictly correlated electron (SCE) limit of the fermionic quantum many-body problem in the second-quantized formalism. This limit gives rise to a multi-marginal optimal transport (MMOT) problem. Here the marginal state space for our MMOT problem is the binary set $\{0,1\}$, and the number of marginals is the number $L$ of sites in the model. The costs of storing and computing the e…
▽ More
We consider the strictly correlated electron (SCE) limit of the fermionic quantum many-body problem in the second-quantized formalism. This limit gives rise to a multi-marginal optimal transport (MMOT) problem. Here the marginal state space for our MMOT problem is the binary set $\{0,1\}$, and the number of marginals is the number $L$ of sites in the model. The costs of storing and computing the exact solution of the MMOT problem both scale exponentially with respect to $L$. We propose an efficient convex relaxation which can be solved by semidefinite programming (SDP). In particular, the semidefinite constraint is only of size $2L\times 2L$. Moreover, the SDP-based method yields an approximation of the dual potential needed to the perform self-consistent field iteration in the so-called Kohn-Sham SCE framework, which, once converged, yields a lower bound for the total energy of the system. We demonstrate the effectiveness of our methods on spinless and spinful Hubbard-type models. Numerical results indicate that our relaxation methods yield tight lower bounds for the optimal cost, in the sense that the error due to the semidefinite relaxation is much smaller than the intrinsic modeling error of the Kohn-Sham SCE method. We also describe how our relaxation methods generalize to arbitrary MMOT problems with pairwise cost functions.
△ Less
Submitted 15 September, 2020; v1 submitted 20 May, 2019;
originally announced May 2019.
-
Spin-orbit driven band inversion in bilayer graphene by van der Waals proximity effect
Authors:
J. O. Island,
X. Cui,
C. Lewandowski,
J. Y. Khoo,
E. M. Spanton,
H. Zhou,
D. Rhodes,
J. C. Hone,
T. Taniguchi,
K. Watanabe,
L. S. Levitov,
M. P. Zaletel,
A. F. Young
Abstract:
Spin orbit coupling (SOC) is the key to realizing time-reversal invariant topological phases of matter. Famously, SOC was predicted by Kane and Mele to stabilize a quantum spin Hall insulator; however, the weak intrinsic SOC in monolayer graphene has precluded experimental observation. Here, we exploit a layer-selective proximity effect---achieved via van der Waals contact to a semiconducting tran…
▽ More
Spin orbit coupling (SOC) is the key to realizing time-reversal invariant topological phases of matter. Famously, SOC was predicted by Kane and Mele to stabilize a quantum spin Hall insulator; however, the weak intrinsic SOC in monolayer graphene has precluded experimental observation. Here, we exploit a layer-selective proximity effect---achieved via van der Waals contact to a semiconducting transition metal dichalcogenide--to engineer Kane-Mele SOC in ultra-clean \textit{bilayer} graphene. Using high-resolution capacitance measurements to probe the bulk electronic compressibility, we find that SOC leads to the formation of a distinct incompressible, gapped phase at charge neutrality. The experimental data agrees quantitatively with a simple theoretical model in which the new phase results from SOC-driven band inversion. In contrast to Kane-Mele SOC in monolayer graphene, the inverted phase is not expected to be a time reversal invariant topological insulator, despite being separated from conventional band insulators by electric field tuned phase transitions where crystal symmetry mandates that the bulk gap must close. Electrical transport measurements, conspicuously, reveal that the inverted phase has a conductivity $\sim e^2/h$, which is suppressed by exceptionally small in-plane magnetic fields. The high conductivity and anomalous magnetoresistance are consistent with theoretical models that predict helical edge states within the inversted phase, that are protected from backscattering by an emergent spin symmetry that remains robust even for large Rashba SOC. Our results pave the way for proximity engineering of strong topological insulators as well as correlated quantum phases in the strong spin-orbit regime in graphene heterostructures.
△ Less
Submitted 8 January, 2019; v1 submitted 4 January, 2019;
originally announced January 2019.
-
The gate-tunable strong and fragile topology of multilayer-graphene on a transition metal dichalcogenide
Authors:
Michael P. Zaletel,
Jun Yong Khoo
Abstract:
We analyze the phase diagram of multilayer-graphene sandwiched between identical transition metal dichalcogenides. Recently realized in all van-der-Wall heterostructures, these sandwiches induce sizable (1-15 meV) spin orbit coupling in the graphene, offering a way to engineer topological band-structures in a pristine and gate-tunable platform. We find a rich phase diagram that depends on the numb…
▽ More
We analyze the phase diagram of multilayer-graphene sandwiched between identical transition metal dichalcogenides. Recently realized in all van-der-Wall heterostructures, these sandwiches induce sizable (1-15 meV) spin orbit coupling in the graphene, offering a way to engineer topological band-structures in a pristine and gate-tunable platform. We find a rich phase diagram that depends on the number of layers $N$ and the gate-tunable perpendicular electric field. For $N > 1$ and odd, the system is a strong 2D topological insulator with a gap equal to the strength of proximity-induced Ising spin-orbit coupling, which reverts to a trivial phase at moderate electric fields. For $N$-even, the low energy bands exhibit a recently proposed form of "fragile" crystalline topology, as well as electric-field tuned symmetry-protected phase transitions between distinct atomic insulators. Hence AB-stacked bilayer and ABC-stacked trilayer graphene are predicted to provide controllable experimental realizations of fragile and strong topology.
△ Less
Submitted 8 January, 2019; v1 submitted 4 January, 2019;
originally announced January 2019.
-
Drop-Activation: Implicit Parameter Reduction and Harmonic Regularization
Authors:
Senwei Liang,
Yuehaw Khoo,
Haizhao Yang
Abstract:
Overfitting frequently occurs in deep learning. In this paper, we propose a novel regularization method called Drop-Activation to reduce overfitting and improve generalization. The key idea is to drop nonlinear activation functions by setting them to be identity functions randomly during training time. During testing, we use a deterministic network with a new activation function to encode the aver…
▽ More
Overfitting frequently occurs in deep learning. In this paper, we propose a novel regularization method called Drop-Activation to reduce overfitting and improve generalization. The key idea is to drop nonlinear activation functions by setting them to be identity functions randomly during training time. During testing, we use a deterministic network with a new activation function to encode the average effect of dropping activations randomly. Our theoretical analyses support the regularization effect of Drop-Activation as implicit parameter reduction and verify its capability to be used together with Batch Normalization (Ioffe and Szegedy 2015). The experimental results on CIFAR-10, CIFAR-100, SVHN, EMNIST, and ImageNet show that Drop-Activation generally improves the performance of popular neural network architectures for the image classification task. Furthermore, as a regularizer Drop-Activation can be used in harmony with standard training and regularization techniques such as Batch Normalization and Auto Augment (Cubuk et al. 2019). The code is available at \url{https://github.com/LeungSamWai/Drop-Activation}.
△ Less
Submitted 28 March, 2020; v1 submitted 14 November, 2018;
originally announced November 2018.
-
SwitchNet: a neural network model for forward and inverse scattering problems
Authors:
Yuehaw Khoo,
Lexing Ying
Abstract:
We propose a novel neural network architecture, SwitchNet, for solving the wave equation based inverse scattering problems via providing maps between the scatterers and the scattered field (and vice versa). The main difficulty of using a neural network for this problem is that a scatterer has a global impact on the scattered wave field, rendering typical convolutional neural network with local con…
▽ More
We propose a novel neural network architecture, SwitchNet, for solving the wave equation based inverse scattering problems via providing maps between the scatterers and the scattered field (and vice versa). The main difficulty of using a neural network for this problem is that a scatterer has a global impact on the scattered wave field, rendering typical convolutional neural network with local connections inapplicable. While it is possible to deal with such a problem using a fully connected network, the number of parameters grows quadratically with the size of the input and output data. By leveraging the inherent low-rank structure of the scattering problems and introducing a novel switching layer with sparse connections, the SwitchNet architecture uses much fewer parameters and facilitates the training process. Numerical experiments show promising accuracy in learning the forward and inverse maps between the scatterers and the scattered wave field.
△ Less
Submitted 23 October, 2018;
originally announced October 2018.
-
Tunable Quantum Hall Edge Conduction in Bilayer Graphene through Spin-Orbit Interaction
Authors:
Jun Yong Khoo,
Leonid Levitov
Abstract:
Bilayer graphene, in the presence of a one-sided spin-orbit interaction (SOI) induced by a suitably chosen substrate, is predicted to exhibit unconventional Quantum Hall states. The new states arise due to strong SOI-induced splittings of the eight zeroth Landau levels, which are strongly layer-polarized, residing fully or partially on one of the two graphene layers. In particular, an Ising SOI in…
▽ More
Bilayer graphene, in the presence of a one-sided spin-orbit interaction (SOI) induced by a suitably chosen substrate, is predicted to exhibit unconventional Quantum Hall states. The new states arise due to strong SOI-induced splittings of the eight zeroth Landau levels, which are strongly layer-polarized, residing fully or partially on one of the two graphene layers. In particular, an Ising SOI in the meV scale is sufficient to invert the Landau level order between the $n=0$ and $n=1$ orbital levels under moderately weak magnetic fields $B \lesssim 10$T. Furthermore, when the Ising field opposes the $B$ field, the order of the spin-polarized levels can also be inverted. We show that, under these conditions, three different compensated electron-hole phases, with equal concentrations of electrons and holes, can occur at $ν= 0$ filling. The three phases have distinct edge conductivity values. One of the phases is especially interesting, since its edge conduction can be turned on and off by switching the sign of the interlayer bias.
△ Less
Submitted 5 September, 2018;
originally announced September 2018.
-
Convex relaxation approaches for strictly correlated density functional theory
Authors:
Yuehaw Khoo,
Lexing Ying
Abstract:
In this paper, we introduce methods from convex optimization to solve the multimarginal transport type problems arise in the context of density functional theory. Convex relaxations are used to provide outer approximation to the set of $N$-representable 2-marginals and 3-marginals, which in turn provide lower bounds to the energy. We further propose rounding schemes to obtain upper bound to the en…
▽ More
In this paper, we introduce methods from convex optimization to solve the multimarginal transport type problems arise in the context of density functional theory. Convex relaxations are used to provide outer approximation to the set of $N$-representable 2-marginals and 3-marginals, which in turn provide lower bounds to the energy. We further propose rounding schemes to obtain upper bound to the energy. Numerical experiments demonstrate a gap of the order of $10^{-3}$ to $10^{-2}$ between the upper and lower bounds. The Kantorovich potential of the multi-marginal transport problem is also approximated with a similar accuracy.
△ Less
Submitted 13 August, 2018;
originally announced August 2018.
-
Shear sound of two-dimensional Fermi liquids
Authors:
Jun Yong Khoo,
Inti Sodemann Villadiego
Abstract:
We study the appearance of a sharp collective mode which features transverse current fluctuations within the bosonization approach to interacting two-dimensional Fermi liquids. This mode is analogous to the shear sound modes in elastic media, and, unlike the conventional zero sound mode, it is damped in weakly interacting Fermi liquids and only separates away from the particle-hole continuum when…
▽ More
We study the appearance of a sharp collective mode which features transverse current fluctuations within the bosonization approach to interacting two-dimensional Fermi liquids. This mode is analogous to the shear sound modes in elastic media, and, unlike the conventional zero sound mode, it is damped in weakly interacting Fermi liquids and only separates away from the particle-hole continuum when the quasiparticle mass becomes twice the transport mass $m^* \gtrsim 2 m$. The shear sound should be present in a large class of interacting charged and neutral Fermi liquids especially those proximate to critical points where the quasiparticle mass diverges. In metals this mode remains linearly dispersing in the presence of the long-ranged Coulomb force, unlike the conventional zero sound mode which becomes the plasma mode. We also detail a quick path between bosonization and classical Landau's Fermi liquid theory by constructing a mapping between the solutions of the classical kinetic equation and the quantized bosonic eigenmodes. By further mapping the kinetic equation into a 1D tight-binding model we solve for the entire spectrum of collective and incoherent particle-hole excitations of Fermi liquids with non-zero $F_0$ and $F_1$ Landau parameters.
△ Less
Submitted 12 March, 2019; v1 submitted 11 June, 2018;
originally announced June 2018.
-
Solving for high dimensional committor functions using artificial neural networks
Authors:
Yuehaw Khoo,
Jianfeng Lu,
Lexing Ying
Abstract:
In this note we propose a method based on artificial neural network to study the transition between states governed by stochastic processes. In particular, we aim for numerical schemes for the committor function, the central object of transition path theory, which satisfies a high-dimensional Fokker-Planck equation. By working with the variational formulation of such partial differential equation…
▽ More
In this note we propose a method based on artificial neural network to study the transition between states governed by stochastic processes. In particular, we aim for numerical schemes for the committor function, the central object of transition path theory, which satisfies a high-dimensional Fokker-Planck equation. By working with the variational formulation of such partial differential equation and parameterizing the committor function in terms of a neural network, approximations can be obtained via optimizing the neural network weights using stochastic algorithms. The numerical examples show that moderate accuracy can be achieved for high-dimensional problems.
△ Less
Submitted 28 February, 2018;
originally announced February 2018.
-
On-Demand Spin-Orbit Interaction from Which-Layer Tunability in Bilayer Graphene
Authors:
Jun Yong Khoo,
Alberto F. Morpurgo,
Leonid Levitov
Abstract:
Spin-orbit interaction (SOI) that is gate-tunable over a broad range is essential to exploiting novel spin phenomena. Achieving this regime has remained elusive because of the weakness of the underlying relativistic coupling and lack of its tunability in solids. Here we outline a general strategy that enables exceptionally high tunability of SOI through creating a which-layer spin-orbit field inho…
▽ More
Spin-orbit interaction (SOI) that is gate-tunable over a broad range is essential to exploiting novel spin phenomena. Achieving this regime has remained elusive because of the weakness of the underlying relativistic coupling and lack of its tunability in solids. Here we outline a general strategy that enables exceptionally high tunability of SOI through creating a which-layer spin-orbit field inhomogeneity in graphene multilayers. An external transverse electric field is applied to shift carriers between the layers with strong and weak SOI. Because graphene layers are separated by sub-nm scales, exceptionally high tunability of SOI can be achieved through a minute carrier displacement. A detailed analysis of the experimentally relevant case of bilayer graphene on a semiconducting transition metal dichalchogenide substrate is presented. In this system, a complete tunability of SOI amounting to its ON/OFF switching can be achieved. New opportunities for spin control are exemplified with electrically driven spin resonance and topological phases with different quantized intrinsic valley Hall conductivities.
△ Less
Submitted 12 November, 2017;
originally announced November 2017.