doi 10.1109/TFUZZ.2024.3420963

Cascaded two-stage feature clustering and selection via separability and consistency in fuzzy decision systems

Authors: Yuepeng Chen, Weiping Ding, Hengrong Ju, Jiashuang Huang, Tao Yin

Abstract: Feature selection is a vital technique in machine learning, as it can reduce computational complexity, improve model performance, and mitigate the risk of overfitting. However, the increasing complexity and dimensionality of datasets pose significant challenges in the selection of features. Focusing on these challenges, this paper proposes a cascaded two-stage feature clustering and selection algorithm for fuzzy decision systems. In the first stage, we reduce the search space by clustering relevant features and addressing inter-feature redundancy. In the second stage, a clustering-based sequentially forward selection method that explores the global and local structure of data is presented. We propose a novel metric for assessing the significance of features, which considers both global separability and local consistency. Global separability measures the degree of intra-class cohesion and inter-class separation based on fuzzy membership, providing a comprehensive understanding of data separability. Meanwhile, local consistency leverages the fuzzy neighborhood rough set model to capture uncertainty and fuzziness in the data. The effectiveness of our proposed algorithm is evaluated through experiments conducted on 18 public datasets and a real-world schizophrenia dataset. The experiment results demonstrate our algorithm's superiority over benchmarking algorithms in both classification accuracy and the number of selected features.

Submitted 21 July, 2024; originally announced July 2024.

Comments: This paper has been accepted by IEEE Transactions on Fuzzy Systems for publication. Permission from IEEE must be obtained for all other uses, in any current or future media. The final version is available at [10.1109/TFUZZ.2024.3420963]

Journal ref: IEEE Transactions on Fuzzy Systems 2024

arXiv:2407.15756 [pdf, other]

Model editing for distribution shifts in uranium oxide morphological analysis

Authors: Davis Brown, Cody Nizinski, Madelyn Shapiro, Corey Fallon, Tianzhixi Yin, Henry Kvinge, Jonathan H. Tu

Abstract: Deep learning still struggles with certain kinds of scientific data. Notably, pretraining data may not provide coverage of relevant distribution shifts (e.g., shifts induced via the use of different measurement instruments). We consider deep learning models trained to classify the synthesis conditions of uranium ore concentrates (UOCs) and show that model editing is particularly effective for improving generalization to distribution shifts common in this domain. In particular, model editing outperforms finetuning on two curated datasets comprising micrographs taken of U$_{3}$O$_{8}$ aged in humidity chambers and micrographs acquired with different scanning electron microscopes, respectively.

Submitted 22 July, 2024; originally announced July 2024.

Comments: Presented at CV4MS @ CVPR 2024

arXiv:2406.18019 [pdf, other]

Continuous Execution of High-Level Collaborative Tasks for Heterogeneous Robot Teams

Authors: Amy Fang, Tenny Yin, Jiawei Lin, Hadas Kress-Gazit

Abstract: We propose a control synthesis framework for a heterogeneous multi-robot system to satisfy collaborative tasks, where actions may take varying durations of time to complete. We encode tasks using the discrete logic LTL^ψ, which uses the concept of bindings to interleave robot actions and express information about the relationship between specific task requirements and robot assignments. We present a synthesis approach to automatically generate a teaming assignment and corresponding discrete behavior that is correct-by-construction for continuous execution, while also implementing synchronization policies to ensure collaborative portions of the task are satisfied. We demonstrate our approach on a physical multi-robot system.

Submitted 25 June, 2024; originally announced June 2024.

Comments: Under review in IEEE Transactions on Robotics

arXiv:2405.14867 [pdf, other]

Improved Distribution Matching Distillation for Fast Image Synthesis

Authors: Tianwei Yin, Michaël Gharbi, Taesung Park, Richard Zhang, Eli Shechtman, Fredo Durand, William T. Freeman

Abstract: Recent approaches have shown promise in distilling diffusion models into efficient one-step generators. Among them, Distribution Matching Distillation (DMD) produces one-step generators that match their teacher in distribution, without enforcing a one-to-one correspondence with the sampling trajectories of their teachers. However, to ensure stable training, DMD requires an additional regression loss computed using a large set of noise-image pairs generated by the teacher with many steps of a deterministic sampler. This is costly for large-scale text-to-image synthesis and limits the student's quality, tying it too closely to the teacher's original sampling paths. We introduce DMD2, a set of techniques that lift this limitation and improve DMD training. First, we eliminate the regression loss and the need for expensive dataset construction. We show that the resulting instability is due to the fake critic not estimating the distribution of generated samples accurately and propose a two time-scale update rule as a remedy. Second, we integrate a GAN loss into the distillation procedure, discriminating between generated samples and real images. This lets us train the student model on real data, mitigating the imperfect real score estimation from the teacher model, and enhancing quality. Lastly, we modify the training procedure to enable multi-step sampling. We identify and address the training-inference input mismatch problem in this setting, by simulating inference-time generator samples during training time. Taken together, our improvements set new benchmarks in one-step image generation, with FID scores of 1.28 on ImageNet-64x64 and 8.35 on zero-shot COCO 2014, surpassing the original teacher despite a 500X reduction in inference cost. Further, we show our approach can generate megapixel images by distilling SDXL, demonstrating exceptional visual quality among few-step methods.

Submitted 24 May, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

Comments: Code, model, and dataset are available at https://tianweiy.github.io/dmd2

arXiv:2401.15862 [pdf, other]

PML-based boundary integral equation method for electromagnetic scattering problems in a layered-medium

Authors: Gang Bao, Wangtao Lu, Tao Yin, Lu Zhang

Abstract: This paper proposes a new boundary integral equation (BIE) methodology based on the perfectly matched layer (PML) truncation technique for solving the electromagnetic scattering problems in a multi-layered medium. Instead of using the original PML stretched fields, artificial fields which are also equivalent to the solutions in the physical region are introduced. This significantly simplifies the study of the proposed methodology to derive the PML problem. Then some PML transformed layer potentials and the associated boundary integral operators (BIOs) are defined and the corresponding jump relations are shown. Under the assumption that the fields vanish on the PML boundary, the solution representations, as well as the related BIEs and regularization of the hyper-singular operators, in terms of the current density functions on the truncated interface, are derived. Numerical experiments are presented to demonstrate the efficiency and accuracy of the method.

Submitted 28 January, 2024; originally announced January 2024.

Comments: 26 pages, 13 figures

arXiv:2312.15460 [pdf, other]

On a Robin-type non-singular coupling scheme for solving the wave scattering problems

Authors: Xiaojuan Liu, Maojun Li, Tao Yin

Abstract: This paper studies a non-singular coupling scheme for solving the acoustic and elastic wave scattering problems. Its extension to the problems of Laplace and Lamé equations and to the problem with a compactly supported inhomogeneity is also briefly discussed. Relying on the solution representation of the wave scattering problem, a Robin-type artificial boundary condition in terms of layer potentials whose kernels are non-singular is introduced to obtain a reduced problem on a bounded domain. The well-posedness of the reduced problems and the a priori error estimates of the corresponding finite element discretization are proved. Numerical examples are presented to demonstrate the accuracy and efficiency of the proposed method.

Submitted 24 December, 2023; originally announced December 2023.

arXiv:2312.15189 [pdf, other]

Helmholtz decomposition based windowed Green function methods for elastic scattering problems on a half-space

Authors: Tao Yin, Lu Zhang, Weiying Zheng, Xiaopeng Zhu

Abstract: This paper proposes a new Helmholtz decomposition based windowed Green function (HD-WGF) method for solving the time-harmonic elastic scattering problems on a half-space with Dirichlet boundary conditions in both 2D and 3D. The Helmholtz decomposition is applied to separate the pressure and shear waves, which satisfy the Helmholtz and Helmholtz/Maxwell equations, respectively, and the corresponding boundary integral equations of type $(\mathbb{I}+\mathbb{T})\boldsymbol{\phi}=\boldsymbol{f}$, that couple these two waves on the unbounded surface, are derived based on the free-space fundamental solution of Helmholtz equation. This approach avoids the treatment of the complex elastic displacement tensor and traction operator that are involved in the classical integral equation method for elastic problems. Then a smooth "slow-rise" windowing function is introduced to truncate the boundary integral equations and a "correction" strategy is proposed to ensure the uniformly fast convergence for all incident angles of plane incidence. Numerical experiments for both two and three dimensional problems are presented to demonstrate the accuracy and efficiency of the proposed method.

Submitted 23 December, 2023; originally announced December 2023.

Comments: 20 pages, 5 figures

arXiv:2312.06302 [pdf]

Non-iterative Methods in Inhomogeneous Background Inverse Scattering Imaging Problem Assisted by Swin Transformer Network

Authors: Naike Du, Tiantian Yin, Jing Wang, Rencheng Song, Kuiwen Xu, Bingyuan Liang, Sheng Sun, Xiuzhu Ye

Abstract: A deep learning-assisted inversion method is proposed to solve the inhomogeneous background imaging problem. Three non-iterative methods, namely the distorted-Born (DB) major current coefficients method, the DB modified Born approximation method, and the DB connection method, are introduced to address the inhomogeneous background inverse scattering problem. These methods retain the multiple scattering information by utilizing the major current obtained through singular value decomposition of the Green's function and the scattered field, without resorting to optimization techniques. As a result, the proposed methods offer improved reconstruction resolution and accuracy for unknown objects embedded in inhomogeneous backgrounds, surpassing the backpropagation scheme (BPS) and Born approximation (BA) method that disregard the multiple scattering effect. To further enhance the resolution and accuracy of the reconstruction, a Shifted-Window (Swin) transformer network is employed for capturing super-resolution information in the images. The attention mechanism incorporated in the shifted window facilitates global interactions between objects, thereby enhancing the performance of the inhomogeneous background imaging algorithm while reducing computational complexity. Moreover, an adaptive training method is proposed to enhance the generalization ability of the network. The effectiveness of the proposed methods is demonstrated through both synthetic data and experimental data. Notably, super-resolution imaging is achieved with quasi real-time speed, indicating promising application potential for the proposed algorithms.

Submitted 11 December, 2023; originally announced December 2023.

Comments: We have submitted this paper to TGRS (IEEE Transactions on Geoscience and Remote Sensing) on 29-Jan-2023; and resubmitted on 12-Jul-2023

arXiv:2311.18828 [pdf, other]

One-step Diffusion with Distribution Matching Distillation

Authors: Tianwei Yin, Michaël Gharbi, Richard Zhang, Eli Shechtman, Fredo Durand, William T. Freeman, Taesung Park

Abstract: Diffusion models generate high-quality images but require dozens of forward passes. We introduce Distribution Matching Distillation (DMD), a procedure to transform a diffusion model into a one-step image generator with minimal impact on image quality. We enforce that the one-step image generator matches the diffusion model at the distribution level, by minimizing an approximate KL divergence whose gradient can be expressed as the difference between 2 score functions, one of the target distribution and the other of the synthetic distribution being produced by our one-step generator. The score functions are parameterized as two diffusion models trained separately on each distribution. Combined with a simple regression loss matching the large-scale structure of the multi-step diffusion outputs, our method outperforms all published few-step diffusion approaches, reaching 2.62 FID on ImageNet 64x64 and 11.49 FID on zero-shot COCO-30k, comparable to Stable Diffusion but orders of magnitude faster. Utilizing FP16 inference, our model generates images at 20 FPS on modern hardware.

Submitted 5 December, 2023; v1 submitted 30 November, 2023; originally announced November 2023.

Comments: Project page: https://tianweiy.github.io/dmd/

Journal ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024

arXiv:2311.14138 [pdf, ps, other]

A symmetric Gauss-Seidel method for the steady-state Boltzmann equation

Authors: Tianai Yin, Zhenning Cai, Yanli Wang

Abstract: We introduce numerical solvers for the steady-state Boltzmann equation based on the symmetric Gauss-Seidel (SGS) method. Due to the quadratic collision operator in the Boltzmann equation, the SGS method requires solving a nonlinear system on each grid cell, and we consider two methods, namely Newton's method and the fixed-point iteration, in our numerical tests. For small Knudsen numbers, our method has an efficiency between the classical source iteration and the modern generalized synthetic iterative scheme, and the complexity of its implementation is closer to the source iteration. A variety of numerical tests are carried out to demonstrate its performance, and it is concluded that the proposed method is suitable for applications with moderate to large Knudsen numbers.

Submitted 23 November, 2023; originally announced November 2023.

arXiv:2311.12264 [pdf, other]

Resilient Control of Networked Microgrids using Vertical Federated Reinforcement Learning: Designs and Real-Time Test-Bed Validations

Authors: Sayak Mukherjee, Ramij R. Hossain, Sheik M. Mohiuddin, Yuan Liu, Wei Du, Veronica Adetola, Rohit A. Jinsiwale, Qiuhua Huang, Tianzhixi Yin, Ankit Singhal

Abstract: Improving system-level resiliency of networked microgrids is an important aspect with increased population of inverter-based resources (IBRs). This paper (1) presents resilient control design in presence of adversarial cyber-events, and proposes a novel federated reinforcement learning (Fed-RL) approach to tackle (a) model complexities, unknown dynamical behaviors of IBR devices, (b) privacy issues regarding data sharing in multi-party-owned networked grids, and (2) transfers learned controls from simulation to hardware-in-the-loop test-bed, thereby bridging the gap between simulation and real world. With these multi-prong objectives, first, we formulate a reinforcement learning (RL) training setup generating episodic trajectories with adversaries (attack signal) injected at the primary controllers of the grid forming (GFM) inverters where RL agents (or controllers) are being trained to mitigate the injected attacks. For networked microgrids, the horizontal Fed-RL method involving distinct independent environments is not appropriate, leading us to develop vertical variant Federated Soft Actor-Critic (FedSAC) algorithm to grasp the interconnected dynamics of networked microgrid. Next, utilizing OpenAI Gym interface, we built a custom simulation set-up in GridLAB-D/HELICS co-simulation platform, named Resilient RL Co-simulation (ResRLCoSIM), to train the RL agents with IEEE 123-bus benchmark test systems comprising 3 interconnected microgrids. Finally, the learned policies in simulation world are transferred to the real-time hardware-in-the-loop test-bed set-up developed using high-fidelity Hypersim platform. Experiments show that the simulator-trained RL controllers produce convincing results with the real-time test-bed set-up, validating the minimization of sim-to-real gap.

Submitted 20 November, 2023; originally announced November 2023.

Comments: 10 pages, 7 figures

arXiv:2310.06341 [pdf, other]

Federated Learning with Reduced Information Leakage and Computation

Authors: Tongxin Yin, Xueru Zhang, Mohammad Mahdi Khalili, Mingyan Liu

Abstract: Federated learning (FL) is a distributed learning paradigm that allows multiple decentralized clients to collaboratively learn a common model without sharing local data. Although local data is not exposed directly, privacy concerns nonetheless exist as clients' sensitive information can be inferred from intermediate computations. Moreover, such information leakage accumulates substantially over time as the same data is repeatedly used during the iterative learning process. As a result, it can be particularly difficult to balance the privacy-accuracy trade-off when designing privacy-preserving FL algorithms. In this paper, we introduce Upcycled-FL, a novel federated learning framework with first-order approximation applied at every even iteration. Under this framework, half of the FL updates incur no information leakage and require much less computation. We first conduct the theoretical analysis on the convergence (rate) of Upcycled-FL, and then apply perturbation mechanisms to preserve privacy. Experiments on real-world data show that Upcycled-FL consistently outperforms existing methods over heterogeneous data, and significantly improves privacy-accuracy trade-off while reducing 48% of the training time on average.

Submitted 10 October, 2023; originally announced October 2023.

arXiv:2310.06205 [pdf, other]

Fair Classifiers that Abstain without Harm

Authors: Tongxin Yin, Jean-François Ton, Ruocheng Guo, Yuanshun Yao, Mingyan Liu, Yang Liu

Abstract: In critical applications, it is vital for classifiers to defer decision-making to humans. We propose a post-hoc method that makes existing classifiers selectively abstain from predicting certain samples. Our abstaining classifier is incentivized to maintain the original accuracy for each sub-population (i.e. no harm) while achieving a set of group fairness definitions to a user specified degree. To this end, we design an Integer Programming (IP) procedure that assigns abstention decisions for each training sample to satisfy a set of constraints. To generalize the abstaining decisions to test samples, we then train a surrogate model to learn the abstaining decisions based on the IP solutions in an end-to-end manner. We analyze the feasibility of the IP procedure to determine the possible abstention rate for different levels of unfairness tolerance and accuracy constraint for achieving no harm. To the best of our knowledge, this work is the first to identify the theoretical relationships between the constraint parameters and the required abstention rate. Our theoretical results are important since a high abstention rate is often infeasible in practice due to a lack of human resources. Our framework outperforms existing methods in terms of fairness disparity without sacrificing accuracy at similar abstention rates.

Submitted 9 October, 2023; originally announced October 2023.

arXiv:2310.05021 [pdf, other]

Toward Intelligent Emergency Control for Large-scale Power Systems: Convergence of Learning, Physics, Computing and Control

Authors: Qiuhua Huang, Renke Huang, Tianzhixi Yin, Sohom Datta, Xueqing Sun, Jason Hou, Jie Tan, Wenhao Yu, Yuan Liu, Xinya Li, Bruce Palmer, Ang Li, Xinda Ke, Marianna Vaiman, Song Wang, Yousu Chen

Abstract: This paper has delved into the pressing need for intelligent emergency control in large-scale power systems, which are experiencing significant transformations and are operating closer to their limits with more uncertainties. Learning-based control methods are promising and have shown effectiveness for intelligent power system control. However, when they are applied to large-scale power systems, there are multifaceted challenges such as scalability, adaptiveness, and security posed by the complex power system landscape, which demand comprehensive solutions. The paper first proposes and instantiates a convergence framework for integrating power systems physics, machine learning, advanced computing, and grid control to realize intelligent grid control at a large scale. Our developed methods and platform based on the convergence framework have been applied to a large (more than 3000 buses) Texas power system, and tested with 56000 scenarios. Our work achieved a 26% reduction in load shedding on average and outperformed existing rule-based control in 99.7% of the test scenarios. The results demonstrated the potential of the proposed convergence framework and DRL-based intelligent control for the future grid.

Submitted 8 October, 2023; originally announced October 2023.

Comments: submitted to PSCC 2024

arXiv:2307.02072 [pdf, other]

Mathematical and numerical study of an inverse source problem for the biharmonic wave equation

Authors: Yan Chang, Yukun Guo, Tao Yin, Yue Zhao

Abstract: In this paper, we study the inverse source problem for the biharmonic wave equation. Mathematically, we characterize the radiating sources and non-radiating sources at a fixed wavenumber. We show that a general source can be decomposed into a radiating source and a non-radiating source. The radiating source can be uniquely determined by Dirichlet boundary measurements at a fixed wavenumber. Moreover, we derive a Lipschitz stability estimate for determining the radiating source. On the other hand, the non-radiating source does not produce any scattered fields outside the support of the source function. Numerically, we propose a novel source reconstruction method based on Fourier series expansion by multi-wavenumber boundary measurements. Numerical experiments are presented to verify the accuracy and efficiency of the proposed method.

Submitted 5 July, 2023; originally announced July 2023.

Comments: 18 pages, 10 figures

arXiv:2306.11719 [pdf, other]

Diffusion with Forward Models: Solving Stochastic Inverse Problems Without Direct Supervision

Authors: Ayush Tewari, Tianwei Yin, George Cazenavette, Semon Rezchikov, Joshua B. Tenenbaum, Frédo Durand, William T. Freeman, Vincent Sitzmann

Abstract: Denoising diffusion models are a powerful type of generative models used to capture complex distributions of real-world signals. However, their applicability is limited to scenarios where training samples are readily available, which is not always the case in real-world applications. For example, in inverse graphics, the goal is to generate samples from a distribution of 3D scenes that align with a given image, but ground-truth 3D scenes are unavailable and only 2D images are accessible. To address this limitation, we propose a novel class of denoising diffusion probabilistic models that learn to sample from distributions of signals that are never directly observed. Instead, these signals are measured indirectly through a known differentiable forward model, which produces partial observations of the unknown signal. Our approach involves integrating the forward model directly into the denoising process. This integration effectively connects the generative modeling of observations with the generative modeling of the underlying signals, allowing for end-to-end training of a conditional generative model over signals. During inference, our approach enables sampling from the distribution of underlying signals that are consistent with a given partial observation. We demonstrate the effectiveness of our method on three challenging computer vision tasks. For instance, in the context of inverse graphics, our model enables direct sampling from the distribution of 3D scenes that align with a single 2D input image.

Submitted 16 November, 2023; v1 submitted 20 June, 2023; originally announced June 2023.

Comments: Project page: https://diffusion-with-forward-models.github.io/

arXiv:2306.07150 [pdf]

Near-Unity Emitting, Widely Tailorable and Stable Exciton Concentrators Built from Doubly Gradient 2D Semiconductor Nanoplatelets

Authors: Xiao Liang, Emek G. Durmusoglu, Maria Lunina, Pedro Ludwig Hernandez-Martinez, Vytautas Valuckas, Fei Yan, Yulia Lekina, Vijay Kumar Sharma, Tingting Yin, Son Tung Ha, Ze Xiang Shen, Handong Sun, Arseniy Kuznetsov, Hilmi Volkan Demir

Abstract: The strength of electrostatic interactions (EI) between electrons and holes within semiconductor nanocrystals profoundly impacts the performance of their optoelectronic systems, and different optoelectronic devices demand distinct EI strength of the active medium. However, achieving a broad range, fine-tuning of the EI strength for specific optoelectronic applications is a daunting challenge, especially in quasi 2-dimensional core-shell semiconductor nanoplatelets (NPLs), as the epitaxial growth of the inorganic shell along the direction of the thickness that solely contributes to the quantum confined effect significantly undermines the strength of the EI. Herein we propose and demonstrate a novel doubly-gradient (DG) core-shell architecture of semiconductor NPLs for on-demand tailoring of the EI strength by controlling the localized exciton concentration via in-plane architectural modulation, demonstrated by a wide tuning of radiative recombination rate and exciton binding energy. Moreover, these exciton-concentration-engineered DG NPLs also exhibit a near-unity quantum yield, remarkable thermal and photo stability, as well as considerably suppressed self-absorption. As proof-of-concept demonstrations, highly efficient color converters and high-performance light-emitting diodes (external quantum efficiency: 16.9%, maximum luminance: 43,000 cd/m²) have been achieved based on the DG NPLs. This work thus opens up new avenues for developing high-performance colloidal optoelectronic device applications.

Submitted 12 June, 2023; originally announced June 2023.

arXiv:2305.10431 [pdf, other]

FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention

Authors: Guangxuan Xiao, Tianwei Yin, William T. Freeman, Frédo Durand, Song Han

Abstract: Diffusion models excel at text-to-image generation, especially in subject-driven generation for personalized images. However, existing methods are inefficient due to the subject-specific fine-tuning, which is computationally intensive and hampers efficient deployment. Moreover, existing methods struggle with multi-subject generation as they often blend features among subjects. We present FastComposer which enables efficient, personalized, multi-subject text-to-image generation without fine-tuning. FastComposer uses subject embeddings extracted by an image encoder to augment the generic text conditioning in diffusion models, enabling personalized image generation based on subject images and textual instructions with only forward passes. To address the identity blending problem in the multi-subject generation, FastComposer proposes cross-attention localization supervision during training, enforcing the attention of reference subjects localized to the correct regions in the target images. Naively conditioning on subject embeddings results in subject overfitting. FastComposer proposes delayed subject conditioning in the denoising step to maintain both identity and editability in subject-driven image generation. FastComposer generates images of multiple unseen individuals with different styles, actions, and contexts. It achieves 300$\times$-2500$\times$ speedup compared to fine-tuning-based methods and requires zero extra storage for new subjects. FastComposer paves the way for efficient, personalized, and high-quality multi-subject image creation. Code, model, and dataset are available at https://github.com/mit-han-lab/fastcomposer.

Submitted 21 May, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

Comments: The first two authors contributed equally to this work

arXiv:2305.05090 [pdf, other]

Performative Federated Learning: A Solution to Model-Dependent and Heterogeneous Distribution Shifts

Authors: Kun Jin, Tongxin Yin, Zhongzhu Chen, Zeyu Sun, Xueru Zhang, Yang Liu, Mingyan Liu

Abstract: We consider a federated learning (FL) system consisting of multiple clients and a server, where the clients aim to collaboratively learn a common decision model from their distributed data. Unlike the conventional FL framework that assumes the client's data is static, we consider scenarios where the clients' data distributions may be reshaped by the deployed decision model. In this work, we leverage the idea of distribution shift mappings in performative prediction to formalize this model-dependent data distribution shift and propose a performative federated learning framework. We first introduce necessary and sufficient conditions for the existence of a unique performative stable solution and characterize its distance to the performative optimal solution. Then we propose the performative FedAvg algorithm and show that it converges to the performative stable solution at a rate of O(1/T) under both full and partial participation schemes. In particular, we use novel proof techniques and show how the clients' heterogeneity influences the convergence. Numerical results validate our analysis and provide valuable insights into real-world applications.

Submitted 8 May, 2023; originally announced May 2023.

arXiv:2304.12507 [pdf, other]

Learning Task-Specific Strategies for Accelerated MRI

Authors: Zihui Wu, Tianwei Yin, Yu Sun, Robert Frost, Andre van der Kouwe, Adrian V. Dalca, Katherine L. Bouman

Abstract: Compressed sensing magnetic resonance imaging (CS-MRI) seeks to recover visual information from subsampled measurements for diagnostic tasks. Traditional CS-MRI methods often separately address measurement subsampling, image reconstruction, and task prediction, resulting in a suboptimal end-to-end performance. In this work, we propose TACKLE as a unified co-design framework for jointly optimizing subsampling, reconstruction, and prediction strategies for the performance on downstream tasks. The naïve approach of simply appending a task prediction module and training with a task-specific loss leads to suboptimal downstream performance. Instead, we develop a training procedure where a backbone architecture is first trained for a generic pre-training task (image reconstruction in our case), and then fine-tuned for different downstream tasks with a prediction head. Experimental results on multiple public MRI datasets show that TACKLE achieves an improved performance on various tasks over traditional CS-MRI methods. We also demonstrate that TACKLE is robust to distribution shifts by showing that it generalizes to a new dataset we experimentally collected using different acquisition setups from the training data. Without additional fine-tuning, TACKLE leads to both numerical and visual improvements compared to existing baselines. We have further implemented a learned 4$\times$-accelerated sequence on a Siemens 3T MRI Skyra scanner. Compared to the fully-sampling scan that takes 335 seconds, our optimized sequence only takes 84 seconds, achieving a four-fold time reduction as desired, while maintaining high performance.

Submitted 5 December, 2023; v1 submitted 24 April, 2023; originally announced April 2023.

arXiv:2304.09362 [pdf, other]

Long-Term Fairness with Unknown Dynamics

Authors: Tongxin Yin, Reilly Raab, Mingyan Liu, Yang Liu

Abstract: While machine learning can myopically reinforce social inequalities, it may also be used to dynamically seek equitable outcomes. In this paper, we formalize long-term fairness in the context of online reinforcement learning. This formulation can accommodate dynamical control objectives, such as driving equity inherent in the state of a population, that cannot be incorporated into static formulations of fairness. We demonstrate that this framing allows an algorithm to adapt to unknown dynamics by sacrificing short-term incentives to drive a classifier-population system towards more desirable equilibria. For the proposed setting, we develop an algorithm that adapts recent work in online learning. We prove that this algorithm achieves simultaneous probabilistic bounds on cumulative loss and cumulative violations of fairness (as statistical regularities between demographic groups). We compare our proposed algorithm to the repeated retraining of myopic classifiers, as a baseline, and to a deep reinforcement learning algorithm that lacks safety guarantees. Our experiments model human populations according to evolutionary game theory and integrate real-world datasets.

Submitted 7 June, 2023; v1 submitted 18 April, 2023; originally announced April 2023.

Comments: Best paper runner-up at ICLR 2023 Workshop on Trustworthy and Reliable Large-Scale Machine Learning Models (Non Archival)

arXiv:2212.13833 [pdf, other]

A PML method for signal-propagation problems in axon

Authors: Xue Jiang, Maohui Lyu, Tao Yin, Weiying Zheng

Abstract: This work is focused on the modelling of signal propagation in myelinated axons to characterize the functions of the myelin sheath in the neural structure. Based on reasonable assumptions on the medium properties, we derive a two-dimensional neural-signaling model in cylindrical coordinates from the time-harmonic Maxwell's equations. The well-posedness of the model is established upon Dirichlet boundary conditions at the two ends of the neural structure and the radiative condition in the radial direction of the structure. Using the perfectly matched layer (PML) method, we truncate the unbounded background medium and propose an approximate problem on the truncated domain. The well-posedness of the PML problem and the exponential convergence of the approximate solution to the exact solution are established. Numerical experiments based on finite element discretization are presented to demonstrate the theoretical results and the efficiency of our methods to simulate the signal propagation in axons.

Submitted 28 December, 2022; originally announced December 2022.

arXiv:2212.08973 [pdf, other]

Enhancing Cyber Resilience of Networked Microgrids using Vertical Federated Reinforcement Learning

Authors: Sayak Mukherjee, Ramij R. Hossain, Yuan Liu, Wei Du, Veronica Adetola, Sheik M. Mohiuddin, Qiuhua Huang, Tianzhixi Yin, Ankit Singhal

Abstract: This paper presents a novel federated reinforcement learning (Fed-RL) methodology to enhance the cyber resiliency of networked microgrids. We formulate a resilient reinforcement learning (RL) training setup which (a) generates episodic trajectories injecting adversarial actions at primary control reference signals of the grid forming (GFM) inverters and (b) trains the RL agents (or controllers) to alleviate the impact of the injected adversaries. To circumvent data-sharing issues and concerns for proprietary privacy in multi-party-owned networked grids, we bring in the aspects of federated machine learning and propose a novel Fed-RL algorithm to train the RL agents. To this end, the conventional horizontal Fed-RL approaches using decoupled independent environments fail to capture the coupled dynamics in a networked microgrid, which leads us to propose a multi-agent vertically federated variation of actor-critic algorithms, namely federated soft actor-critic (FedSAC) algorithm. We created a customized simulation setup encapsulating microgrid dynamics in the GridLAB-D/HELICS co-simulation platform compatible with the OpenAI Gym interface for training RL agents. Finally, the proposed methodology is validated with numerical examples of modified IEEE 123-bus benchmark test systems consisting of three coupled microgrids.

Submitted 17 December, 2022; originally announced December 2022.

Comments: 13 pages, 5 figures

arXiv:2212.02715 [pdf, other]

Efficient Learning of Voltage Control Strategies via Model-based Deep Reinforcement Learning

Authors: Ramij R. Hossain, Tianzhixi Yin, Yan Du, Renke Huang, Jie Tan, Wenhao Yu, Yuan Liu, Qiuhua Huang

Abstract: This article proposes a model-based deep reinforcement learning (DRL) method to design emergency control strategies for short-term voltage stability problems in power systems. Recent advances show promising results in model-free DRL-based methods for power systems, but model-free methods suffer from poor sample efficiency and training time, both critical for making state-of-the-art DRL algorithms practically applicable. A DRL agent learns an optimal policy via a trial-and-error method while interacting with the real-world environment, and it is desirable to minimize the direct interaction of the DRL agent with the real-world power grid due to its safety-critical nature. Additionally, state-of-the-art DRL-based policies are mostly trained using a physics-based grid simulator where dynamic simulation is computationally intensive, lowering the training efficiency. We propose a novel model-based DRL framework where a deep neural network (DNN)-based dynamic surrogate model, instead of a real-world power grid or physics-based simulation, is utilized with the policy learning framework, making the process faster and sample efficient. However, stabilizing model-based DRL is challenging because of the complex system dynamics of large-scale power systems. We solved these issues by incorporating imitation learning to have a warm start in policy learning, reward-shaping, and multi-step surrogate loss. Finally, we achieved 97.5% sample efficiency and 87.7% training efficiency for an application to the IEEE 300-bus test system.

Submitted 5 December, 2022; originally announced December 2022.

arXiv:2211.01412 [pdf, other]

doi 10.1109/JBHI.2024.3354712

CAMANet: Class Activation Map Guided Attention Network for Radiology Report Generation

Authors: Jun Wang, Abhir Bhalerao, Terry Yin, Simon See, Yulan He

Abstract: Radiology report generation (RRG) has gained increasing research attention because of its huge potential to mitigate medical resource shortages and aid the process of disease decision making by radiologists. Recent advancements in RRG are largely driven by improving a model's capabilities in encoding single-modal feature representations, while few studies explicitly explore the cross-modal alignment between image regions and words. Radiologists typically focus first on abnormal image regions before composing the corresponding text descriptions, thus cross-modal alignment is of great importance to learn an RRG model which is aware of abnormalities in the image. Motivated by this, we propose a Class Activation Map guided Attention Network (CAMANet) which explicitly promotes cross-modal alignment by employing aggregated class activation maps to supervise cross-modal attention learning, and simultaneously enrich the discriminative information. CAMANet contains three complementary modules: a Visual Discriminative Map Generation module to generate the importance/contribution of each visual token; a Visual Discriminative Map Assisted Encoder to learn the discriminative representation and enrich the discriminative information; and a Visual Textual Attention Consistency module to ensure the attention consistency between the visual and textual tokens, to achieve the cross-modal alignment. Experimental results demonstrate that CAMANet outperforms previous SOTA methods on two commonly used RRG benchmarks.

Submitted 3 March, 2024; v1 submitted 2 November, 2022; originally announced November 2022.

Comments: Accepted to IEEE Journal of Biomedical and Health Informatics (IJBHI). 13 pages, 8 figures

arXiv:2211.00892 [pdf, other]

A highly accurate perfectly-matched-layer boundary integral equation solver for acoustic layered-medium problems

Authors: Wangtao Lu, Liwei Xu, Tao Yin, Lu Zhang

Abstract: Based on the perfectly matched layer (PML) technique, this paper develops a high-accuracy boundary integral equation (BIE) solver for acoustic scattering problems in locally defected layered media in both two and three dimensions. The original scattering problem is truncated onto a bounded domain by the PML. Assuming the vanishing of the scattered field on the PML boundary, we derive BIEs on local defects only in terms of using PML-transformed free-space Green's function, and the four standard integral operators: single-layer, double-layer, transpose of double-layer, and hyper-singular boundary integral operators. The hyper-singular integral operator is transformed into a combination of weakly-singular integral operators and tangential derivatives. We develop a high-order Chebyshev-based rectangular-polar singular-integration solver to discretize all weakly-singular integrals. Numerical experiments for both two- and three-dimensional problems are carried out to demonstrate the accuracy and efficiency of the proposed solver.

Submitted 2 November, 2022; originally announced November 2022.

Comments: 19 pages, 16 figures

arXiv:2209.12498 [pdf, ps, other]

doi 10.1103/PhysRevE.106.054119

Quantum enhancement of a single quantum battery by repeated interactions with large spins

Authors: P. Chen, T. S. Yin, Z. Q. Jiang, G. R. Jin

Abstract: A generalized collision model is developed to investigate coherent charging of a single quantum battery by repeated interactions with many-atom large spins, where collective atom operators are adopted and the battery is modeled by a uniform energy ladder. For an initially empty battery, we derive analytical results of the average number of excitations and hence the charging power in the short-time limit. Our analytical results show that a faster charging and an increased amount of the power in the coherent protocol uniquely arise from the phase coherence of the atoms. Finally, we show that the charging power defined by the so-called ergotropy almost follows our analytical result, due to a nearly pure state of the battery in the short-time limit.

Submitted 26 September, 2022; originally announced September 2022.

Comments: 9 pages, 5 figures

Journal ref: Phys. Rev. E 106, 054119 (2022)

arXiv:2209.01269 [pdf, ps, other]

A Two-step Metropolis Hastings Method for Bayesian Empirical Likelihood Computation with Application to Bayesian Model Selection

Authors: Sanjay Chaudhuri, Teng Yin

Abstract: In recent times empirical likelihood has been widely applied under the Bayesian framework. Markov chain Monte Carlo (MCMC) methods are frequently employed to sample from the posterior distribution of the parameters of interest. However, the complex, especially non-convex, nature of the likelihood support erects enormous hindrances in choosing an appropriate MCMC algorithm. Such difficulties have restricted the use of Bayesian empirical likelihood (BayesEL) based methods in many applications. In this article, we propose a two-step Metropolis Hastings algorithm to sample from the BayesEL posteriors. Our proposal is specified hierarchically, where the estimating equations determining the empirical likelihood are used to propose values of a set of parameters depending on the proposed values of the remaining parameters. Furthermore, we discuss Bayesian model selection using empirical likelihood and extend our two-step Metropolis Hastings algorithm to a reversible jump Markov chain Monte Carlo procedure to sample from the resulting posterior. Finally, several applications of our proposed methods are presented.

Submitted 2 September, 2022; originally announced September 2022.

arXiv:2206.01869 [pdf, other]

Multiple-scattering frequency-time hybrid solver for the wave equation in interior domains

Authors: Oscar P. Bruno, Tao Yin

Abstract: This paper proposes a frequency-time hybrid solver for the time-dependent wave equation in two-dimensional interior spatial domains. The approach relies on four main elements, namely, 1) A multiple scattering strategy that decomposes a given interior time-domain problem into a sequence of limited-duration time-domain problems of scattering by overlapping open arcs, each one of which is reduced (by means of the Fourier transform) to a sequence of Helmholtz frequency-domain problems; 2) Boundary integral equations on overlapping boundary patches for the solution of the frequency-domain problems in point 1); 3) A smooth "Time-windowing and recentering" methodology that enables both treatment of incident signals of long duration and long time simulation; and, 4) A Fourier transform algorithm that delivers numerically dispersionless, spectrally-accurate time evolution for given incident fields. By recasting the interior time-domain problem in terms of a sequence of open-arc multiple scattering events, the proposed approach regularizes the full interior frequency domain problem, which, if obtained by either Fourier or Laplace transformation of the corresponding interior time-domain problem, must encapsulate infinitely many scattering events, giving rise to non-uniqueness and eigenfunctions in the Fourier case, and ill conditioning in the Laplace case. Numerical examples are included which demonstrate the accuracy and efficiency of the proposed methodology.

Submitted 6 February, 2023; v1 submitted 3 June, 2022; originally announced June 2022.

Comments: 34 pages, 17 figures, 3 tables

MSC Class: 35L05; 65M80; 65T99; 65R20

arXiv:2205.12907 [pdf, other]

doi 10.1016/j.jcp.2022.111863

Highly efficient energy-conserving moment method for the multi-dimensional Vlasov-Maxwell system

Authors: Tianai Yin, Xinghui Zhong, Yanli Wang

Abstract: We present an energy-conserving numerical scheme to solve the Vlasov-Maxwell (VM) system based on the regularized moment method proposed in [Z. Cai, Y. Fan, and R. Li. CPAM, 2014]. The globally hyperbolic moment system is deduced for the multi-dimensional VM system under the framework of the Hermite expansions, where the expansion center and the scaling factor are set as the macroscopic velocity and local temperature, respectively. Thus, the effect of the Lorentz force term could be reduced into several ODEs about the macroscopic velocity and the moment coefficients of higher order, which could significantly reduce the computational cost of the whole system. An energy-conserving numerical scheme is proposed to solve the moment equations and the Maxwell equations, where only a linear equation system needs to be solved. Several numerical examples such as the two-stream instability, Weibel instability, and the two-dimensional Orszag-Tang vortex problem are studied to validate the efficiency and excellent energy-preserving property of the numerical scheme.

Submitted 14 June, 2022; v1 submitted 25 May, 2022; originally announced May 2022.

arXiv:2205.04842 [pdf, other]

Spectral Galerkin method for solving elastic wave scattering problems with multiple open arcs

Authors: Carlos Jerez-Hanckes, Jose Pinto, Tao Yin

Abstract: We study the elastic time-harmonic wave scattering problems on unbounded domains with boundaries composed of finite collections of disjoint finite open arcs (or cracks) in two dimensions. Specifically, we present a fast spectral Galerkin method for solving the associated weakly- and hyper-singular boundary integral equations (BIEs) arising from Dirichlet and Neumann boundary conditions, respectively. Discretization bases of the resulting BIEs employ weighted Chebyshev polynomials that capture the solutions' edge behavior. We show that these bases guarantee exponential convergence in the polynomial degree when assuming analyticity of sources and arc geometries. Numerical examples demonstrate the accuracy and robustness of the proposed method with respect to the number of arcs and the wavenumber.

Submitted 10 May, 2022; originally announced May 2022.

Comments: 28

arXiv:2203.13250 [pdf, other]

Global Tracking Transformers

Authors: Xingyi Zhou, Tianwei Yin, Vladlen Koltun, Philipp Krähenbühl

Abstract: We present a novel transformer-based architecture for global multi-object tracking. Our network takes a short sequence of frames as input and produces global trajectories for all objects. The core component is a global tracking transformer that operates on objects from all frames in the sequence. The transformer encodes object features from all frames, and uses trajectory queries to group them into trajectories. The trajectory queries are object features from a single frame and naturally produce unique trajectories. Our global tracking transformer does not require intermediate pairwise grouping or combinatorial association, and can be jointly trained with an object detector. It achieves competitive performance on the popular MOT17 benchmark, with 75.3 MOTA and 59.1 HOTA. More importantly, our framework seamlessly integrates into state-of-the-art large-vocabulary detectors to track any objects. Experiments on the challenging TAO dataset show that our framework consistently improves upon baselines that are based on pairwise association, outperforming published works by a significant 7.7 tracking mAP. Code is available at https://github.com/xingyizhou/GTR.

Submitted 25 April, 2022; v1 submitted 24 March, 2022; originally announced March 2022.

Comments: CVPR 2022. Code is available at https://github.com/xingyizhou/GTR

arXiv:2202.04257 [pdf, other]

On the hyper-singular boundary integral equation methods for dynamic poroelasticity: three dimensional case

Authors: Lu Zhang, Liwei Xu, Tao Yin

Abstract: In our previous work [SIAM J. Sci. Comput. 43(3) (2021) B784-B810], an accurate hyper-singular boundary integral equation method for dynamic poroelasticity in two dimensions has been developed. This work is devoted to studying the more complex and difficult three-dimensional problems with Neumann boundary condition and both the direct and indirect methods are adopted to construct combined boundary integral equations. The strongly-singular and hyper-singular integral operators are reformulated into compositions of weakly-singular integral operators and tangential-derivative operators, which allow us to prove the jump relations associated with the poroelastic layer potentials and boundary integral operators in a simple manner. Relying on both the investigated spectral properties of the strongly-singular operators, which indicate that the corresponding eigenvalues accumulate at three points whose values are only dependent on two Lamé constants, and the spectral properties of the Calderón relations of the poroelasticity, we propose low-GMRES-iteration regularized integral equations. Numerical examples are presented to demonstrate the accuracy and efficiency of the proposed methodology by means of a Chebyshev-based rectangular-polar solver.

Submitted 8 February, 2022; originally announced February 2022.

arXiv:2112.12113 [pdf]

doi 10.1016/j.ab.2022.114606

PET CMR$_{glc}$ mapping and $^{1}$H MRS show altered glucose uptake and neurometabolic profiles in BDL rats

Authors: Jessie Mosso, Ting Yin, Carole Poitry-Yamate, Dunja Simicic, Mario Lepore, Valérie A. McLin, Olivier Braissant, Cristina Cudalbu, Bernard Lanz

Abstract: Type C hepatic encephalopathy (HE) is a complex neuropsychiatric disorder occurring as a consequence of chronic liver disease. Alterations in energy metabolism have been suggested in type C HE, but $\textit{in vivo}$ studies on this matter remain sparse and have reported conflicting results. Here, we propose a novel preclinical $^{18}$F-FDG PET methodology to compute quantitative 3D maps of the regional cerebral metabolic rate of glucose (CMR$_{glc}$) from a labelling steady-state PET image of the brain and an image-derived input function. This quantitative approach shows its strength when comparing groups of animals with divergent physiology, such as HE animals. PET CMR$_{glc}$ maps were registered to an atlas and the mean CMR$_{glc}$ from the hippocampus and the cerebellum were associated with the corresponding localized $^{1}$H MR spectroscopy acquisitions. This study provides for the first time local and quantitative information on both brain glucose uptake and neurometabolic profile alterations in a rat model of type C HE. A 2-fold lower brain glucose uptake, concomitant with an increase in brain glutamine and a decrease in the main osmolytes, was observed in the hippocampus and in the cerebellum. These novel findings are an important step towards new insights into energy metabolism in the pathophysiology of HE.

Submitted 22 December, 2021; originally announced December 2021.

Comments: 30 pages, 6 figures

Journal ref: Anal Biochem 647 (2022) 114606

arXiv:2111.14352 [pdf, other]

Physics-informed Evolutionary Strategy based Control for Mitigating Delayed Voltage Recovery

Authors: Yan Du, Qiuhua Huang, Renke Huang, Tianzhixi Yin, Jie Tan, Wenhao Yu, Xinya Li

Abstract: In this work we propose a novel data-driven, real-time power system voltage control method based on the physics-informed guided meta evolutionary strategy (ES). The main objective is to quickly provide an adaptive control strategy to mitigate the fault-induced delayed voltage recovery (FIDVR) problem. Reinforcement learning methods have been developed for the same or similar challenging control problems, but they suffer from training inefficiency and lack of robustness for "corner or unseen" scenarios. On the other hand, extensive physical knowledge has been developed in power systems but little has been leveraged in learning-based approaches. To address these challenges, we introduce the trainable action mask technique for flexibly embedding physical knowledge into RL models to rule out unnecessary or unfavorable actions, and achieve notable improvements in sample efficiency, control performance and robustness. Furthermore, our method leverages past learning experience to derive a surrogate gradient that guides and accelerates the exploration process in training. Case studies on the IEEE 300-bus system and comparisons with other state-of-the-art benchmark methods demonstrate the effectiveness and advantages of our method.

Submitted 29 November, 2021; originally announced November 2021.

arXiv:2111.06881 [pdf, other]

Multimodal Virtual Point 3D Detection

Authors: Tianwei Yin, Xingyi Zhou, Philipp Krähenbühl

Abstract: Lidar-based sensing drives current autonomous vehicles. Despite rapid progress, current Lidar sensors still lag two decades behind traditional color cameras in terms of resolution and cost. For autonomous driving, this means that large objects close to the sensors are easily visible, but far-away or small objects comprise only one measurement or two. This is an issue, especially when these objects turn out to be driving hazards. On the other hand, these same objects are clearly visible in onboard RGB sensors. In this work, we present an approach to seamlessly fuse RGB sensors into Lidar-based 3D recognition. Our approach takes a set of 2D detections to generate dense 3D virtual points to augment an otherwise sparse 3D point cloud. These virtual points naturally integrate into any standard Lidar-based 3D detectors along with regular Lidar measurements. The resulting multi-modal detector is simple and effective. Experimental results on the large-scale nuScenes dataset show that our framework improves a strong CenterPoint baseline by a significant 6.6 mAP, and outperforms competing fusion approaches. Code and more visualizations are available at https://tianweiy.github.io/mvp/

Submitted 12 November, 2021; originally announced November 2021.

Comments: NeurIPS 2021, code available at https://tianweiy.github.io/mvp/

arXiv:2108.11121 [pdf, other]

On the generalized Calderón formulas for closed- and open-surface elastic scattering problems

Authors: Liwei Xu, Tao Yin

Abstract: The Calderón formulas (i.e., the combination of single-layer and hyper-singular boundary integral operators) have been widely utilized in constructing valid boundary integral equation systems that possess highly favorable spectral properties. This work is devoted to studying the theoretical properties of elastodynamic Calderón formulas, which provide a solid basis for the design of fast boundary integral equation methods for solving elastic wave problems defined on a closed surface or an open surface in two dimensions. For the closed-surface case, it is proved that the Calderón formula is a Fredholm operator of the second kind except for certain circumstances. Regarding the open-surface case, we investigate weighted integral operators instead of the original integral operators, which result from dealing with edge singularities of potentials corresponding to the elastic scattering problems by open surfaces, and show that the Calderón formula is a compact perturbation of a bounded and invertible operator. To complete the proof, we need to use the well-posedness result of the elastic scattering problem, the analysis of the zero-frequency integral operators defined on the straight arc, the singularity decompositions of the kernels of integral operators, and a new representation formula of the hyper-singular operator. Moreover, it can be demonstrated that the accumulation point of the spectrum of the invertible operator is the same as that of the eigenvalues of the Calderón formula in the closed-surface case.

Submitted 25 August, 2021; originally announced August 2021.

arXiv:2105.06460 [pdf, other]

End-to-End Sequential Sampling and Reconstruction for MRI

Authors: Tianwei Yin, Zihui Wu, He Sun, Adrian V. Dalca, Yisong Yue, Katherine L. Bouman

Abstract: Accelerated MRI shortens acquisition time by subsampling in the measurement $κ$-space. Recovering a high-fidelity anatomical image from subsampled measurements requires close cooperation between two components: (1) a sampler that chooses the subsampling pattern and (2) a reconstructor that recovers images from incomplete measurements. In this paper, we leverage the sequential nature of MRI measurements, and propose a fully differentiable framework that jointly learns a sequential sampling policy simultaneously with a reconstruction strategy. This co-designed framework is able to adapt during acquisition in order to capture the most informative measurements for a particular target. Experimental results on the fastMRI knee dataset demonstrate that the proposed approach successfully utilizes intermediate information during the sampling process to boost reconstruction performance. In particular, our proposed method can outperform the current state-of-the-art learned $κ$-space sampling baseline on over 96% of test samples. We also investigate the individual and collective benefits of the sequential sampling and co-design strategies.

Submitted 16 July, 2022; v1 submitted 13 May, 2021; originally announced May 2021.

Comments: Code and supplementary materials are available at http://imaging.cms.caltech.edu/seq-mri

Journal ref: Proceedings of Machine Learning for Health, PMLR 158:261-281, 2021

arXiv:2102.00077 [pdf, other]

Scalable Voltage Control using Structure-Driven Hierarchical Deep Reinforcement Learning

Authors: Sayak Mukherjee, Renke Huang, Qiuhua Huang, Thanh Long Vu, Tianzhixi Yin

Abstract: This paper presents a novel hierarchical deep reinforcement learning (DRL) based design for the voltage control of power grids. DRL agents are trained for fast and adaptive selection of control actions such that the voltage recovery criterion can be met following disturbances. Existing voltage control techniques suffer from issues of speed of operation, optimal coordination between different locations, and scalability. We exploit the area-wise division structure of the power system to propose a hierarchical DRL design that can be scaled to larger grid models. We employ an enhanced augmented random search algorithm that is tailored for the voltage control problem in a two-level architecture. We train area-wise decentralized RL agents to compute lower-level policies for the individual areas, and concurrently train a higher-level DRL agent that uses the updates of the lower-level policies to efficiently coordinate the control actions taken by the lower-level agents. Numerical experiments on the IEEE benchmark 39-bus model with 3 areas demonstrate the advantages and various intricacies of the proposed hierarchical approach.

Submitted 29 January, 2021; originally announced February 2021.

Comments: 8 pages, 13 figures

arXiv:2101.05317 [pdf, other]

Learning and Fast Adaptation for Grid Emergency Control via Deep Meta Reinforcement Learning

Authors: Renke Huang, Yujiao Chen, Tianzhixi Yin, Qiuhua Huang, Jie Tan, Wenhao Yu, Xinya Li, Ang Li, Yan Du

Abstract: As power systems undergo a significant transformation, with more uncertainties, less inertia, and operation closer to their limits, there is an increasing risk of large outages. Thus, there is an imperative need to enhance grid emergency control to maintain system reliability and security. Towards this end, great progress has been made in developing deep reinforcement learning (DRL) based grid control solutions in recent years. However, existing DRL-based solutions have two main limitations: 1) they cannot cope well with a wide range of grid operation conditions, system parameters, and contingencies; 2) they generally lack the ability to adapt quickly to new grid operation conditions, system parameters, and contingencies, limiting their applicability to real-world applications. In this paper, we mitigate these limitations by developing a novel deep meta reinforcement learning (DMRL) algorithm. The DMRL combines meta strategy optimization with DRL, and trains policies modulated by a latent space that can quickly adapt to new scenarios. We test the developed DMRL algorithm on the IEEE 300-bus system. We demonstrate fast adaptation of the meta-trained DRL policies with latent variables to new operating conditions and scenarios using the proposed method and achieve superior performance compared to the state-of-the-art DRL and model predictive control (MPC) methods.

Submitted 5 February, 2022; v1 submitted 13 January, 2021; originally announced January 2021.

arXiv:2012.14696 [pdf]

doi 10.1063/5.0033516

Universal Silicon Microwave Photonic Spectral Shaper

Authors: Xin Guo, Yang Liu, Tangman Yin, Blair Morrison, Mattia Pagani, Okky Daulay, Wim Bogaerts, Benjamin J. Eggleton, Alvaro Casas-Bedoya, David Marpaung

Abstract: Optical modulation plays arguably the most important role in microwave photonic (MWP) systems. Precise synthesis of modulated optical spectra dictates virtually all aspects of MWP system quality including loss, noise figure, linearity, and the types of functionality that can be executed. But for such a critical function, the versatility to generate and transform analog optical modulation is severely lacking, blocking the pathways to truly unique MWP functions including ultra-linear links and low-loss high rejection filters. Here we demonstrate versatile RF photonic spectrum synthesis in an all-integrated silicon photonic circuit, enabling electrically-tailorable universal analog modulation transformation. We show a series of unprecedented RF filtering experiments through monolithic integration of the spectrum-synthesis circuit with a network of reconfigurable ring resonators.

Submitted 29 December, 2020; originally announced December 2020.

arXiv:2011.09664 [pdf, other]

Safe Reinforcement Learning for Emergency LoadShedding of Power Systems

Authors: Thanh Long Vu, Sayak Mukherjee, Tim Yin, Renke Huang, and Jie Tan, Qiuhua Huang

Abstract: The paradigm shift in the electric power grid necessitates a revisit of existing control methods to ensure the grid's security and resilience. In particular, the increased uncertainties and rapidly changing operational conditions in power systems have revealed outstanding issues in terms of either speed, adaptiveness, or scalability of the existing control methods for power systems. On the other hand, the availability of massive real-time data can provide a clearer picture of what is happening in the grid. Recently, deep reinforcement learning (RL) has been regarded and adopted as a promising approach leveraging massive data for fast and adaptive grid control. However, like most existing machine learning (ML)-based control techniques, RL control usually cannot guarantee the safety of the systems under control. In this paper, we introduce a novel method for safe RL-based load shedding of power systems that can enhance the safe voltage recovery of the electric power grid after experiencing faults. Numerical simulations on the 39-bus IEEE benchmark are performed to demonstrate the effectiveness of the proposed safe RL emergency control, as well as its adaptive capability to faults not seen in the training.

Submitted 17 November, 2020; originally announced November 2020.

Comments: arXiv admin note: text overlap with arXiv:2006.12667

arXiv:2009.13477 [pdf]

doi 10.1088/1361-6560/abef45

Super-Resolution Ultrasound Localization Microscopy Based on a High Frame-rate Clinical Ultrasound Scanner: An In-human Feasibility Study

Authors: Chengwu Huang, Wei Zhang, Ping Gong, U-Wai Lok, Shanshan Tang, Tinghui Yin, Xirui Zhang, Lei Zhu, Maodong Sang, Pengfei Song, Rongqin Zheng, Shigao Chen

Abstract: Non-invasive detection of microvascular alterations in deep tissues in vivo provides critical information for clinical diagnosis and evaluation of a broad spectrum of pathologies. Recently, the emergence of super-resolution ultrasound localization microscopy (ULM) offers new possibilities for clinical imaging of microvasculature at capillary level. Currently, the clinical utility of ULM on clinical ultrasound scanners is hindered by technical limitations, such as long data acquisition time and compromised tracking performance associated with low imaging frame-rate. Here we present an in-human ULM on a high frame-rate (HFR) clinical ultrasound scanner to achieve super-resolution microvessel imaging using a short acquisition time (<10s). Ultrasound microbubble (MB) data were acquired from different human tissues (liver, kidney, pancreatic, and breast tumor) using an HFR clinical scanner. By leveraging the HFR and advanced processing techniques including sub-pixel motion registration, MB signal separation, and Kalman filter-based tracking, MBs can be robustly localized and tracked for successful ULM under the circumstances of relatively high MB concentration and limited data acquisition time in humans. Subtle morphological and hemodynamic information was demonstrated on data acquired with single breath-hold and free-hand scanning. Compared with contrast-enhanced power Doppler generated based on the same MB dataset, ULM showed a 5.7-fold resolution improvement in a vessel, and provided a wide-range flow speed measurement that is Doppler angle-independent.
This study demonstrated the feasibility of ultrafast in-human ULM in various human tissues based on a clinical scanner that supports HFR imaging, and showed great potential for the implementation of super-resolution ultrasound microvessel imaging in a myriad of clinical applications involving microvascular abnormalities and pathologies.

Submitted 28 September, 2020; originally announced September 2020.

Comments: 41 pages, 5 figures, 4 supplemental figures

arXiv:2008.09014 [pdf, other]

doi 10.1103/PhysRevA.103.012413

Hybrid quantum-classical algorithms for solving quantum chemistry in Hamiltonian-wavefunction space

Authors: Zhan-Hao Yuan, Tao Yin, Dan-Bo Zhang

Abstract: The variational quantum eigensolver (VQE) typically optimizes variational parameters in a quantum circuit to prepare eigenstates for a quantum system. Its applications to many problems may involve a group of Hamiltonians, e.g., the Hamiltonian of a molecule is a function of nuclear configurations. In this paper, we incorporate derivatives of the Hamiltonian into VQE and develop some hybrid quantum-classical algorithms, which explore both Hamiltonian and wavefunction spaces for optimization. Aiming to solve quantum chemistry problems more efficiently, we first propose a mutual gradient descent algorithm for geometry optimization by updating parameters of the Hamiltonian and wavefunction alternately, which shows a rapid convergence towards equilibrium structures of molecules. We then establish differential equations that govern how optimized variational parameters of the wavefunction change with intrinsic parameters of the Hamiltonian, which can speed up calculation of the potential energy surface. Our studies suggest a direction of hybrid quantum-classical algorithms for solving quantum systems more efficiently by considering spaces of both Hamiltonian and wavefunction.

Submitted 20 August, 2020; originally announced August 2020.

Comments: 8 pages, 3 figures. Comments are welcome

Journal ref: Phys. Rev. A 103, 012413 (2021)

arXiv:2008.07115 [pdf, other]

An accurate hyper-singular boundary integral equation method for dynamic poroelasticity in two dimensions

Authors: Lu Zhang, Liwei Xu, Tao Yin

Abstract: This paper is concerned with the boundary integral equation method for solving the exterior Neumann boundary value problem of dynamic poroelasticity in two dimensions. The main contribution of this work consists of two aspects: the proposal of a novel regularized boundary integral equation, and the presentation of new regularized formulations of the strongly-singular and hyper-singular boundary integral operators. Firstly, turning to the spectral properties of the double-layer operator and the corresponding Calderón relation of the poroelasticity, we propose the novel low-GMRES-iteration integral equation whose eigenvalues are bounded away from zero and infinity. Secondly, with the help of the Günter derivatives, we reformulate the strongly-singular and hyper-singular integral operators into combinations of the weakly-singular operators and the tangential derivatives. The accuracy and efficiency of the proposed methodology are demonstrated through several numerical examples.

Submitted 17 August, 2020; originally announced August 2020.

Comments: 22 pages, 6 figures, 4 tables

arXiv:2006.15781 [pdf, other]

Variational quantum eigensolvers by variance minimization

Authors: Dan-Bo Zhang, Zhan-Hao Yuan, Tao Yin

Abstract: The variational quantum eigensolver (VQE) typically minimizes energy with hybrid quantum-classical optimization, which aims to find the ground state. Here, we propose a VQE that minimizes energy variance, called variance-VQE (VVQE). The VVQE can be viewed as a self-verifying eigensolver for arbitrary eigenstates by design, since an eigenstate of a Hamiltonian should have zero energy variance. We demonstrate properties and advantages of VVQE for solving a set of excited states in quantum chemistry problems. Remarkably, we show that optimizing a combination of energy and variance may be more efficient for finding low-energy excited states than minimizing energy or variance alone. We further reveal that the optimization can be boosted with stochastic gradient descent by Hamiltonian sampling, which uses only a few terms of the Hamiltonian and thus significantly reduces the quantum resources for evaluating variance and its gradients.

Submitted 28 June, 2020; originally announced June 2020.

Comments: 9 pages, 5 figures. Comments are welcome
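
The VVQE abstract above rests on the fact that an eigenstate of a Hamiltonian has zero energy variance. As a rough classical illustration only (not the authors' quantum algorithm), the following sketch evaluates $\langle H^2 \rangle - \langle H \rangle^2$ with NumPy. The function name `energy_variance` and the 2x2 toy Hamiltonian are assumptions made for this example.

```python
import numpy as np

def energy_variance(hamiltonian: np.ndarray, state: np.ndarray) -> float:
    """Return <H^2> - <H>^2 for a normalized copy of `state` (H assumed Hermitian)."""
    psi = state / np.linalg.norm(state)
    h_psi = hamiltonian @ psi
    mean_energy = np.vdot(psi, h_psi).real
    # For Hermitian H, <psi|H^2|psi> equals the squared norm of H|psi>.
    mean_energy_sq = np.vdot(h_psi, h_psi).real
    return mean_energy_sq - mean_energy**2

# Toy Hamiltonian (Pauli-X), chosen only for illustration.
H = np.array([[0.0, 1.0], [1.0, 0.0]])
eigenstate = np.array([1.0, 1.0]) / np.sqrt(2.0)  # eigenvector of H with eigenvalue +1
superposition = np.array([1.0, 0.0])              # not an eigenvector of H

var_eigen = energy_variance(H, eigenstate)
var_super = energy_variance(H, superposition)
```

Here `var_eigen` vanishes up to floating-point error while `var_super` does not, which is the property that lets the variance serve as a self-check on a candidate eigenstate.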

arXiv:2006.12667 [pdf, other]

Accelerated Deep Reinforcement Learning Based Load Shedding for Emergency Voltage Control

Authors: Renke Huang, Yujiao Chen, Tianzhixi Yin, Xinya Li, Ang Li, Jie Tan, Wenhao Yu, Yuan Liu, Qiuhua Huang

Abstract: Load shedding has been one of the most widely used and effective emergency control approaches against voltage instability. With increased uncertainties and rapidly changing operational conditions in power systems, existing methods have outstanding issues in terms of either speed, adaptiveness, or scalability. Deep reinforcement learning (DRL) has been regarded and adopted as a promising approach for fast and adaptive grid stability control in recent years. However, existing DRL algorithms show two outstanding issues when applied to power system control problems: 1) computational inefficiency that requires extensive training and tuning time; and 2) poor scalability, making it difficult to scale to high dimensional control problems. To overcome these issues, an accelerated DRL algorithm named PARS was developed and tailored for power system voltage stability control via load shedding. PARS features high scalability and is easy to tune with only five main hyperparameters. The method was tested on both the IEEE 39-bus and IEEE 300-bus systems, and the latter is by far the largest scale for such a study. Test results show that, compared to other methods including model-predictive control (MPC) and proximal policy optimization (PPO) methods, PARS shows better computational efficiency (faster convergence), more robustness in learning, excellent scalability and generalization capability.

Submitted 5 December, 2020; v1 submitted 22 June, 2020; originally announced June 2020.

arXiv:2006.11275 [pdf, other]

Center-based 3D Object Detection and Tracking

Authors: Tianwei Yin, Xingyi Zhou, Philipp Krähenbühl

Abstract: Three-dimensional objects are commonly represented as 3D boxes in a point-cloud. This representation mimics the well-studied image-based 2D bounding-box detection but comes with additional challenges. Objects in a 3D world do not follow any particular orientation, and box-based detectors have difficulties enumerating all orientations or fitting an axis-aligned bounding box to rotated objects. In this paper, we instead propose to represent, detect, and track 3D objects as points. Our framework, CenterPoint, first detects centers of objects using a keypoint detector and regresses to other attributes, including 3D size, 3D orientation, and velocity. In a second stage, it refines these estimates using additional point features on the object. In CenterPoint, 3D object tracking simplifies to greedy closest-point matching. The resulting detection and tracking algorithm is simple, efficient, and effective. CenterPoint achieved state-of-the-art performance on the nuScenes benchmark for both 3D detection and tracking, with 65.5 NDS and 63.8 AMOTA for a single model. On the Waymo Open Dataset, CenterPoint outperforms all previous single-model methods by a large margin and ranks first among all Lidar-only submissions. The code and pretrained models are available at https://github.com/tianweiy/CenterPoint.

Submitted 6 January, 2021; v1 submitted 19 June, 2020; originally announced June 2020.

Comments: update nuScenes and Waymo results

Journal ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021
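
The CenterPoint abstract above states that tracking reduces to greedy closest-point matching. A minimal sketch of that general idea follows; it is not taken from the CenterPoint code base. The function name `greedy_match`, the `max_dist` cutoff, and the choice to process pairs in ascending distance order are all assumptions made for illustration.

```python
import numpy as np

def greedy_match(track_centers, det_centers, max_dist):
    """Greedily pair tracks with detections by ascending center distance.

    Returns (track_index, detection_index) pairs; each index is used at most once,
    and pairs farther apart than `max_dist` (a hypothetical cutoff) are left unmatched.
    """
    tracks = np.asarray(track_centers, dtype=float)
    dets = np.asarray(det_centers, dtype=float)
    if len(tracks) == 0 or len(dets) == 0:
        return []
    # Pairwise Euclidean distances, shape (num_tracks, num_detections).
    dist = np.linalg.norm(tracks[:, None, :] - dets[None, :, :], axis=-1)
    used_tracks, used_dets, matches = set(), set(), []
    for flat in np.argsort(dist, axis=None):  # closest pairs first
        t, d = (int(i) for i in np.unravel_index(flat, dist.shape))
        if dist[t, d] > max_dist:
            break  # every remaining pair is even farther apart
        if t in used_tracks or d in used_dets:
            continue
        used_tracks.add(t)
        used_dets.add(d)
        matches.append((t, d))
    return matches

# Two existing tracks and three new detections (made-up coordinates).
matches = greedy_match([[0.0, 0.0], [10.0, 0.0]],
                       [[9.5, 0.0], [0.4, 0.0], [50.0, 50.0]],
                       max_dist=2.0)
```

In this toy case track 0 pairs with detection 1 and track 1 with detection 0, while the far-away detection stays unmatched and would start a new track in a full tracker.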

arXiv:2006.00124 [pdf, other]

doi 10.1016/j.cma.2020.113651

A Windowed Green Function method for elastic scattering problems on a half-space

Authors: Oscar P. Bruno, Tao Yin

Abstract: This paper presents a windowed Green function (WGF) method for the numerical solution of problems of elastic scattering by "locally-rough surfaces" (i.e., local perturbations of a half space), under either Dirichlet or Neumann boundary conditions, and in both two and three spatial dimensions. The proposed WGF method relies on an integral-equation formulation based on the free-space Green function, together with smooth operator windowing (based on a "slow-rise" windowing function) and efficient high-order singular-integration methods. The approach avoids the evaluation of the expensive layer Green function for elastic problems on a half-space, and it yields uniformly fast convergence for all incident angles. Numerical experiments for both two and three dimensional problems are presented, demonstrating the accuracy and super-algebraically fast convergence of the proposed method as the window-size grows.

Submitted 29 May, 2020; originally announced June 2020.

Comments: 21 pages, 4 tables, 11 figures

arXiv:2003.13638 [pdf, other]

Inverse conductivity problem with internal data

Authors: Faouzi Triki, Tao Yin

Abstract: This paper concerns the reconstruction of a scalar coefficient of a second-order elliptic equation in divergence form posed on a bounded domain from internal data. This theory finds applications in multi-wave imaging, greedy methods to approximate parameter-dependent elliptic problems, and image treatment with partial differential equations. We first show that the inverse problem for smooth coefficients can be rewritten as a linear transport equation. Assuming that the coefficient is known near the boundary, we study the well-posedness of the associated transport equation as well as its numerical resolution using a discontinuous Galerkin method. We propose a regularized transport equation that allows us to derive rigorous convergence rates of the numerical method in terms of the order of the polynomial approximation as well as the regularization parameter. We finally provide numerical examples for the inversion assuming a lower regularity of the coefficient, and using synthetic data.

Submitted 17 May, 2020; v1 submitted 30 March, 2020; originally announced March 2020.

MSC Class: 35R30; 65N21

Showing 1–50 of 75 results for author: Yin, T