-
Deep Probability Aggregation Clustering
Authors:
Yuxuan Yan,
Na Lu,
Ruofan Yan
Abstract:
Combining machine clustering with deep models has shown remarkable superiority in deep clustering. It modifies the data processing pipeline into two alternating phases: feature clustering and model training. However, such alternating schedule may lead to instability and computational burden issues. We propose a centerless clustering algorithm called Probability Aggregation Clustering (PAC) to proa…
▽ More
Combining machine clustering with deep models has shown remarkable superiority in deep clustering. It modifies the data processing pipeline into two alternating phases: feature clustering and model training. However, such alternating schedule may lead to instability and computational burden issues. We propose a centerless clustering algorithm called Probability Aggregation Clustering (PAC) to proactively adapt deep learning technologies, enabling easy deployment in online deep clustering. PAC circumvents the cluster center and aligns the probability space and distribution space by formulating clustering as an optimization problem with a novel objective function. Based on the computation mechanism of the PAC, we propose a general online probability aggregation module to perform stable and flexible feature clustering over mini-batch data and further construct a deep visual clustering framework deep PAC (DPAC). Extensive experiments demonstrate that PAC has superior clustering robustness and performance and DPAC remarkably outperforms the state-of-the-art deep clustering methods.
△ Less
Submitted 6 July, 2024;
originally announced July 2024.
-
New $^{63}$Ga(p,$γ$)$^{64}$Ge and $^{64}$Ge(p,$γ$)$^{65}$As reaction rates corresponding to the temperature regime of thermonuclear X-ray bursts
Authors:
Ning Lu,
Yi Hua Lam,
Alexander Heger,
Zi Xin Liu,
Hidetoshi Yamaguchi
Abstract:
We compute the $^{63}$Ga(p,$γ$)$^{64}$Ge and $^{64}$Ge(p,$γ$)$^{65}$As thermonuclear reaction rates using the latest experimental input supplemented with theoretical nuclear spectroscopic information. The experimental input consists of the latest proton thresholds of $^{64}$Ge and $^{65}$As, and the nuclear spectroscopic information of $^{65}$As, whereas the theoretical nuclear spectroscopic infor…
▽ More
We compute the $^{63}$Ga(p,$γ$)$^{64}$Ge and $^{64}$Ge(p,$γ$)$^{65}$As thermonuclear reaction rates using the latest experimental input supplemented with theoretical nuclear spectroscopic information. The experimental input consists of the latest proton thresholds of $^{64}$Ge and $^{65}$As, and the nuclear spectroscopic information of $^{65}$As, whereas the theoretical nuclear spectroscopic information for $^{64}$Ge and $^{65}$As are deduced from the full pf-shell space configuration-interaction shell-model calculations with the GXPF1A Hamiltonian. Both thermonuclear reaction rates are determined with known uncertainties at the energies that correspond to the Gamow windows of the temperature regime relevant to Type I X-ray bursts, covering the typical temperature range of the thermonuclear runaway of the GS 1826$-$24 periodic bursts and SAX J1808.4$-$3658 photospheric radius expansion bursts.
△ Less
Submitted 20 June, 2024;
originally announced June 2024.
-
A Multi-module Robust Method for Transient Stability Assessment against False Label Injection Cyberattacks
Authors:
Hanxuan Wang,
Na Lu,
Yinhong Liu,
Zhuqing Wang,
Zixuan Wang
Abstract:
The success of deep learning in transient stability assessment (TSA) heavily relies on high-quality training data. However, the label information in TSA datasets is vulnerable to contamination through false label injection (FLI) cyberattacks, resulting in degraded performance of deep TSA models. To address this challenge, a Multi-Module Robust TSA method (MMR) is proposed to rectify the supervised…
▽ More
The success of deep learning in transient stability assessment (TSA) heavily relies on high-quality training data. However, the label information in TSA datasets is vulnerable to contamination through false label injection (FLI) cyberattacks, resulting in degraded performance of deep TSA models. To address this challenge, a Multi-Module Robust TSA method (MMR) is proposed to rectify the supervised training process misguided by FLI in an unsupervised manner. In MMR, a supervised classification module and an unsupervised clustering module are alternatively trained to improve the clustering friendliness of representation leaning, thereby achieving accurate clustering assignments. Leveraging the clustering assignments, we construct a training label corrector to rectify the injected false labels and progressively enhance robustness and resilience against FLI. However, there is still a gap on accuracy and convergence speed between MMR and FLI-free deep TSA models. To narrow this gap, we further propose a human-in-the-loop training strategy, named MMR-HIL. In MMR-HIL, potential false samples can be detected by modeling the training loss with a Gaussian distribution. From these samples, the most likely false samples and most ambiguous samples are re-labeled by a TSA experts guided bi-directional annotator and then subjected to penalized optimization, aimed at improving accuracy and convergence speed. Extensive experiments indicate that MMR and MMR-HIL both exhibit powerful robustness against FLI in TSA performance. Moreover, the contaminated labels can also be effectively corrected, demonstrating superior resilience of the proposed methods.
△ Less
Submitted 10 June, 2024;
originally announced June 2024.
-
Applying Fine-Tuned LLMs for Reducing Data Needs in Load Profile Analysis
Authors:
Yi Hu,
Hyeonjin Kim,
Kai Ye,
Ning Lu
Abstract:
This paper presents a novel method for utilizing fine-tuned Large Language Models (LLMs) to minimize data requirements in load profile analysis, demonstrated through the restoration of missing data in power system load profiles. A two-stage fine-tuning strategy is proposed to adapt a pre-trained LLMs, i.e., GPT-3.5, for missing data restoration tasks. Through empirical evaluation, we demonstrate t…
▽ More
This paper presents a novel method for utilizing fine-tuned Large Language Models (LLMs) to minimize data requirements in load profile analysis, demonstrated through the restoration of missing data in power system load profiles. A two-stage fine-tuning strategy is proposed to adapt a pre-trained LLMs, i.e., GPT-3.5, for missing data restoration tasks. Through empirical evaluation, we demonstrate the effectiveness of the fine-tuned model in accurately restoring missing data, achieving comparable performance to state-of-the-art specifically designed models such as BERT-PIN. Key findings include the importance of prompt engineering and the optimal utilization of fine-tuning samples, highlighting the efficiency of few-shot learning in transferring knowledge from general user cases to specific target users. Furthermore, the proposed approach demonstrates notable cost-effectiveness and time efficiency compared to training models from scratch, making it a practical solution for scenarios with limited data availability and computing resources. This research has significant potential for application to other power system load profile analysis tasks. Consequently, it advances the use of LLMs in power system analytics, offering promising implications for enhancing the resilience and efficiency of power distribution systems.
△ Less
Submitted 2 June, 2024;
originally announced June 2024.
-
A Novel Vision Transformer based Load Profile Analysis using Load Images as Inputs
Authors:
Hyeonjin Kim,
Yi Hu,
Kai Ye,
Ning Lu
Abstract:
This paper introduces ViT4LPA, an innovative Vision Transformer (ViT) based approach for Load Profile Analysis (LPA). We transform time-series load profiles into load images. This allows us to leverage the ViT architecture, originally designed for image processing, as a pre-trained image encoder to uncover latent patterns within load data. ViT is pre-trained using an extensive load image dataset,…
▽ More
This paper introduces ViT4LPA, an innovative Vision Transformer (ViT) based approach for Load Profile Analysis (LPA). We transform time-series load profiles into load images. This allows us to leverage the ViT architecture, originally designed for image processing, as a pre-trained image encoder to uncover latent patterns within load data. ViT is pre-trained using an extensive load image dataset, comprising 1M load images derived from smart meter data collected over a two-year period from 2,000 residential users. The training methodology is self-supervised, masked image modeling, wherein masked load images are restored to reveal hidden relationships among image patches. The pre-trained ViT encoder is then applied to various downstream tasks, including the identification of electric vehicle (EV) charging loads and behind-the-meter solar photovoltaic (PV) systems and load disaggregation. Simulation results illustrate ViT4LPA's superior performance compared to existing neural network models in downstream tasks. Additionally, we conduct an in-depth analysis of the attention weights within the ViT4LPA model to gain insights into its information flow mechanisms.
△ Less
Submitted 11 April, 2024;
originally announced April 2024.
-
An Interpretable Power System Transient Stability Assessment Method with Expert Guiding Neural-Regression-Tree
Authors:
Hanxuan Wang,
Na Lu,
Zixuan Wang,
Jiacheng Liu,
Jun Liu
Abstract:
Deep learning based transient stability assessment (TSA) has achieved great success, yet the lack of interpretability hinders its industrial application. Although a great number of studies have tried to explore the interpretability of network solutions, many problems still remain unsolved: (1) the difference between the widely accepted power system knowledge and the generated interpretive rules is…
▽ More
Deep learning based transient stability assessment (TSA) has achieved great success, yet the lack of interpretability hinders its industrial application. Although a great number of studies have tried to explore the interpretability of network solutions, many problems still remain unsolved: (1) the difference between the widely accepted power system knowledge and the generated interpretive rules is large, (2) the probability characteristics of the neural network have not been fully considered during generating the interpretive rules, (3) the cost of the trade-off between accuracy and interpretability is too heavy to take. To address these issues, an interpretable power system Transient Stability Assessment method with Expert guiding Neural-Regression-Tree (TSA-ENRT) is proposed. TSA-ENRT utilizes an expert guiding nonlinear regression tree to approximate the neural network prediction and the neural network can be explained by the interpretive rules generated by the tree model. The nonlinearity of the expert guiding nonlinear regression tree is endowed with the extracted knowledge from a simple two-machine three-bus power system, which forms an expert knowledge base and thus the generated interpretive rules are more consistent with human cognition. Besides, the expert guiding tree model can build a bridge between the interpretive rules and the probability prediction of neural network in a regression way. By regularizing the neural network with the average decision length of ENRT, the association of the neural network and tree model is constructed in the model training level which provides a better trade-off between accuracy and interpretability. Extensive experiments indicate the interpretive rules generated by the proposed TSA-ENRT are highly consistent with the neural network prediction and more agreed with human expert cognition.
△ Less
Submitted 3 April, 2024;
originally announced April 2024.
-
Human Stress Response and Perceived Safety during Encounters with Quadruped Robots
Authors:
Ryan Gupta,
Hyonyoung Shin,
Emily Norman,
Keri K. Stephens,
Nanshu Lu,
Luis Sentis
Abstract:
Despite the rise of mobile robot deployments in home and work settings, perceived safety of users and bystanders is understudied in the human-robot interaction (HRI) literature. To address this, we present a study designed to identify elements of a human-robot encounter that correlate with observed stress response. Stress is a key component of perceived safety and is strongly associated with human…
▽ More
Despite the rise of mobile robot deployments in home and work settings, perceived safety of users and bystanders is understudied in the human-robot interaction (HRI) literature. To address this, we present a study designed to identify elements of a human-robot encounter that correlate with observed stress response. Stress is a key component of perceived safety and is strongly associated with human physiological response. In this study a Boston Dynamics Spot and a Unitree Go1 navigate autonomously through a shared environment occupied by human participants wearing multimodal physiological sensors to track their electrocardiography (ECG) and electrodermal activity (EDA). The encounters are varied through several trials and participants self-rate their stress levels after each encounter. The study resulted in a multidimensional dataset archiving various objective and subjective aspects of a human-robot encounter, containing insights for understanding perceived safety in such encounters. To this end, acute stress responses were decoded from the human participants' ECG and EDA and compared across different human-robot encounter conditions. Statistical analysis of data indicate that on average (1) participants feel more stress during encounters compared to baselines, (2) participants feel more stress encountering multiple robots compared to a single robot and (3) participants stress increases during navigation behavior compared with search behavior.
△ Less
Submitted 6 June, 2024; v1 submitted 25 March, 2024;
originally announced March 2024.
-
Electrical Load Forecasting Model Using Hybrid LSTM Neural Networks with Online Correction
Authors:
Nan Lu,
Quan Ouyang,
Yang Li,
Changfu Zou
Abstract:
Accurate electrical load forecasting is of great importance for the efficient operation and control of modern power systems. In this work, a hybrid long short-term memory (LSTM)-based model with online correction is developed for day-ahead electrical load forecasting. Firstly, four types of features are extracted from the original electrical load dataset, including the historical time series, time…
▽ More
Accurate electrical load forecasting is of great importance for the efficient operation and control of modern power systems. In this work, a hybrid long short-term memory (LSTM)-based model with online correction is developed for day-ahead electrical load forecasting. Firstly, four types of features are extracted from the original electrical load dataset, including the historical time series, time index features, historical statistical features, and similarity features. Then, a hybrid LSTM-based electrical load forecasting model is designed, where an LSTM neural network block and a fully-connected neural network block are integrated that can model both temporal features (historical time series) and non-temporal features (the rest features). A gradient regularization-based offline training algorithm and an output layer parameter fine-tuning-based online model correction method are developed to enhance the model's capabilities to defend against disturbance and adapt to the latest load data distribution, thus improving the forecasting accuracy. At last, extensive experiments are carried out to validate the effectiveness of the proposed electrical load forecasting strategy with superior accuracy compared with commonly used forecasting models.
△ Less
Submitted 6 March, 2024;
originally announced March 2024.
-
B-LSTM-MIONet: Bayesian LSTM-based Neural Operators for Learning the Response of Complex Dynamical Systems to Length-Variant Multiple Input Functions
Authors:
Zhihao Kong,
Amirhossein Mollaali,
Christian Moya,
Na Lu,
Guang Lin
Abstract:
Deep Operator Network (DeepONet) is a neural network framework for learning nonlinear operators such as those from ordinary differential equations (ODEs) describing complex systems. Multiple-input deep neural operators (MIONet) extended DeepONet to allow multiple input functions in different Banach spaces. MIONet offers flexibility in training dataset grid spacing, without constraints on output lo…
▽ More
Deep Operator Network (DeepONet) is a neural network framework for learning nonlinear operators such as those from ordinary differential equations (ODEs) describing complex systems. Multiple-input deep neural operators (MIONet) extended DeepONet to allow multiple input functions in different Banach spaces. MIONet offers flexibility in training dataset grid spacing, without constraints on output location. However, it requires offline inputs and cannot handle varying sequence lengths in testing datasets, limiting its real-time application in dynamic complex systems. This work redesigns MIONet, integrating Long Short Term Memory (LSTM) to learn neural operators from time-dependent data. This approach overcomes data discretization constraints and harnesses LSTM's capability with variable-length, real-time data. Factors affecting learning performance, like algorithm extrapolation ability are presented. The framework is enhanced with uncertainty quantification through a novel Bayesian method, sampling from MIONet parameter distributions. Consequently, we develop the B-LSTM-MIONet, incorporating LSTM's temporal strengths with Bayesian robustness, resulting in a more precise and reliable model for noisy datasets.
△ Less
Submitted 29 November, 2023; v1 submitted 27 November, 2023;
originally announced November 2023.
-
Multi-Feeder Restoration using Multi-Microgrid Formation and Management
Authors:
Valliappan Muthukaruppan,
Rongxing Hu,
Ashwin Shirsat,
Mesut Baran,
Ning Lu,
Wenyuan Tang,
David Lubkeman
Abstract:
This papers highlights the benefit of coordinating resources on mulitple active distribution feeders during severe long duration outages through multi-microgrid formation. A graph-theory based multi-microgrid formation algorithm is developed which is agnostic of the underlying energy management scheme of the microgrids and solved in a rolling horizon fashion. The algorithm is then enhanced to hand…
▽ More
This papers highlights the benefit of coordinating resources on mulitple active distribution feeders during severe long duration outages through multi-microgrid formation. A graph-theory based multi-microgrid formation algorithm is developed which is agnostic of the underlying energy management scheme of the microgrids and solved in a rolling horizon fashion. The algorithm is then enhanced to handle multiple feeders where formation of long laterals needs to be avoided due to potential voltage control issues in distribution systems. The algorithm is evaluated on a synthetic two feeder system derived from interconnecting two IEEE 123 node system. The results indicate increased service to loads in the system and better utilization of renewable resources.
△ Less
Submitted 26 November, 2023;
originally announced November 2023.
-
Assessment of Transmission-level Fault Impacts on 3-phase and 1-phase Distribution IBR Operation
Authors:
Qi Xiao,
Jongha Woo,
Lidong Song,
Bei Xu,
David Lubkeman,
Ning Lu,
Abdul Shafae Mohammed,
Johan Enslin,
Cara De Coste Chacko,
Kat Sico,
Steven G. Whisenant
Abstract:
The widespread deployment of inverter-based resources (IBRs) renders distribution systems susceptible to transmission-level faults. This paper presents a comprehensive analysis of the impact of transmission-level faults on 3-phase and 1-phase distribution IBR operation. To evaluate distributed IBR tripping across various phases and locations on a distribution feeder, we conduct simulations of both…
▽ More
The widespread deployment of inverter-based resources (IBRs) renders distribution systems susceptible to transmission-level faults. This paper presents a comprehensive analysis of the impact of transmission-level faults on 3-phase and 1-phase distribution IBR operation. To evaluate distributed IBR tripping across various phases and locations on a distribution feeder, we conduct simulations of both symmetrical and unsymmetrical transmission faults at progressively greater electrical distances on a real-time transmission and distribution (T&D) co-simulation platform. The IBR power-to-load ratios (PLRs) at 50%, 100%, and 300% are considered to emulate low, medium, and high IBR conditions. Our results indicate that, while 1-phase and 2-phase faults typically trigger fewer IBR trips when compared to 3-phase faults, a significant power imbalance arises from the tripping of 1-phase IBRs on the affected phases. The imbalance can result in significant power quality problems and unintended equipment tripping. It may be necessary to design fault-ride-through mechanisms specifically tailored to 1-phase IBRs to help mitigate the power imbalances caused by unbalanced faults.
△ Less
Submitted 1 April, 2024; v1 submitted 19 November, 2023;
originally announced November 2023.
-
BERT-PIN: A BERT-based Framework for Recovering Missing Data Segments in Time-series Load Profiles
Authors:
Yi Hu,
Kai Ye,
Hyeonjin Kim,
Ning Lu
Abstract:
Inspired by the success of the Transformer model in natural language processing and computer vision, this paper introduces BERT-PIN, a Bidirectional Encoder Representations from Transformers (BERT) powered Profile Inpainting Network. BERT-PIN recovers multiple missing data segments (MDSs) using load and temperature time-series profiles as inputs. To adopt a standard Transformer model structure for…
▽ More
Inspired by the success of the Transformer model in natural language processing and computer vision, this paper introduces BERT-PIN, a Bidirectional Encoder Representations from Transformers (BERT) powered Profile Inpainting Network. BERT-PIN recovers multiple missing data segments (MDSs) using load and temperature time-series profiles as inputs. To adopt a standard Transformer model structure for profile inpainting, we segment the load and temperature profiles into line segments, treating each segment as a word and the entire profile as a sentence. We incorporate a top candidates selection process in BERT-PIN, enabling it to produce a sequence of probability distributions, based on which users can generate multiple plausible imputed data sets, each reflecting different confidence levels. We develop and evaluate BERT-PIN using real-world dataset for two applications: multiple MDSs recovery and demand response baseline estimation. Simulation results show that BERT-PIN outperforms the existing methods in accuracy while is capable of restoring multiple MDSs within a longer window. BERT-PIN, served as a pre-trained model, can be fine-tuned for conducting many downstream tasks, such as classification and super resolution.
△ Less
Submitted 26 October, 2023;
originally announced October 2023.
-
Imperfect Digital Twin Assisted Low Cost Reinforcement Training for Multi-UAV Networks
Authors:
Xiucheng Wang,
Nan Cheng,
Longfei Ma,
Zhisheng Yin,
Tom. Luan,
Ning Lu
Abstract:
Deep Reinforcement Learning (DRL) is widely used to optimize the performance of multi-UAV networks. However, the training of DRL relies on the frequent interactions between the UAVs and the environment, which consumes lots of energy due to the flying and communication of UAVs in practical experiments. Inspired by the growing digital twin (DT) technology, which can simulate the performance of algor…
▽ More
Deep Reinforcement Learning (DRL) is widely used to optimize the performance of multi-UAV networks. However, the training of DRL relies on the frequent interactions between the UAVs and the environment, which consumes lots of energy due to the flying and communication of UAVs in practical experiments. Inspired by the growing digital twin (DT) technology, which can simulate the performance of algorithms in the digital space constructed by coping features of the physical space, the DT is introduced to reduce the costs of practical training, e.g., energy and hardware purchases. Different from previous DT-assisted works with an assumption of perfect reflecting real physics by virtual digital, we consider an imperfect DT model with deviations for assisting the training of multi-UAV networks. Remarkably, to trade off the training cost, DT construction cost, and the impact of deviations of DT on training, the natural and virtually generated UAV mixing deployment method is proposed. Two cascade neural networks (NN) are used to optimize the joint number of virtually generated UAVs, the DT construction cost, and the performance of multi-UAV networks. These two NNs are trained by unsupervised and reinforcement learning, both low-cost label-free training methods. Simulation results show the training cost can significantly decrease while guaranteeing the training performance. This implies that an efficient decision can be made with imperfect DTs in multi-UAV networks.
△ Less
Submitted 24 October, 2023;
originally announced October 2023.
-
Phase Synchrony Component Self-Organization in Brain Computer Interface
Authors:
Xu Niu,
Na Lu,
Huan Luo,
Ruofan Yan
Abstract:
Phase synchrony information plays a crucial role in analyzing functional brain connectivity and identifying brain activities. A widely adopted feature extraction pipeline, composed of preprocessing, selection of EEG acquisition channels, and phase locking value (PLV) calculation, has achieved success in motor imagery classification (MI). However, this pipeline is manual and reliant on expert knowl…
▽ More
Phase synchrony information plays a crucial role in analyzing functional brain connectivity and identifying brain activities. A widely adopted feature extraction pipeline, composed of preprocessing, selection of EEG acquisition channels, and phase locking value (PLV) calculation, has achieved success in motor imagery classification (MI). However, this pipeline is manual and reliant on expert knowledge, limiting its convenience and adaptability to different application scenarios. Moreover, most studies have employed mediocre data-independent spatial filters to suppress noise, impeding the exploration of more significant phase synchronization phenomena. To address the issues, we propose the concept of phase synchrony component self-organization, which enables the adaptive learning of data-dependent spatial filters for automating both the preprocessing and channel selection procedures. Based on this concept, the first deep learning end-to-end network is developed, which directly extracts phase synchrony-based features from raw EEG signals and perform classification. The network learns optimal filters during training, which are obtained when the network achieves peak classification results. Extensive experiments have demonstrated that our network outperforms state-of-the-art methods. Remarkably, through the learned optimal filters, significant phase synchronization phenomena can be observed. Specifically, by calculating the PLV between a pair of signals extracted from each sample using two of the learned spatial filters, we have obtained an average PLV exceeding 0.87 across all tongue MI samples. This high PLV indicates a groundbreaking discovery in the synchrony pattern of tongue MI.
△ Less
Submitted 11 October, 2023; v1 submitted 21 September, 2023;
originally announced October 2023.
-
Nuclear ground-state properties probed by the relativistic Hartree-Bogoliubov approach
Authors:
Zi Xin Liu,
Yi Hua Lam,
Ning Lu,
Peter Ring
Abstract:
Using the relativistic Hartree-Bogoliubov framework with separable pairing force coupled with the latest covariant density functionals, i.e., PC-L3R, PC-X, DD-PCX, and DD-MEX, we systematically explore the ground-state properties of all isotopes of Z=8-110. These properties consist of the binding energies, one- and two-neutron separation energies ($S_\mathrm{n}$ and $S_\mathrm{2n}$), root-mean-squ…
▽ More
Using the relativistic Hartree-Bogoliubov framework with separable pairing force coupled with the latest covariant density functionals, i.e., PC-L3R, PC-X, DD-PCX, and DD-MEX, we systematically explore the ground-state properties of all isotopes of Z=8-110. These properties consist of the binding energies, one- and two-neutron separation energies ($S_\mathrm{n}$ and $S_\mathrm{2n}$), root-mean-square radius of matter, of neutron, of proton, and of charge distributions, Fermi surfaces, ground-state spins and parities. We then predict the edges of nuclear landscape and bound nuclei for the isotopic chains from oxygen (Z=8) to darmstadtium (Z=110) based on these latest covariant density functionals. The number of bound nuclei predicted by PC-L3R, PC-X, DD-PCX, and DD-MEX, are 9004, 9162, 6799, and 7112, respectively. The root-mean-square deviations of $S_\mathrm{n}$ ($S_\mathrm{2n}$) yielded from PC-L3R, PCX, DD-PCX, and DD-MEX are 0.962 (1.300) MeV, 0.920 (1.483) MeV, 0.993 (1.753) MeV, and 1.010 (1.544) MeV, respectively. The root-mean-square deviations of charge radius distributions of comparing the available experimental values with the theoretical counterparts resulted from PC-L3R, PC-X, DD-PCX, and DD-MEX are 0.035 fm, 0.037 fm, 0.035 fm, and 0.034 fm, respectively. We notice pronounced differences between the empirical and theoretical root-mean-square radii of neutron at nuclei near the neutron drip line of the Mg, Ca, and Kr isotopic chains, suggesting the possible existence of the halo or giant halo phenomena.
△ Less
Submitted 18 January, 2024; v1 submitted 12 September, 2023;
originally announced September 2023.
-
Adopting Dynamic VAR Compensators to Mitigate PV Impacts on Unbalanced Distribution Systems
Authors:
Han Pyo Lee,
Keith DSouza,
Ke Chen,
Ning Lu,
Mesut Baran
Abstract:
The growing integration of distributed energy resources into distribution systems poses challenges for voltage regulation. Dynamic VAR Compensators (DVCs) are a new generation of power electronics-based Volt/VAR compensation devices designed to address voltage issues in distribution systems with a high penetration of renewable generation resources. Currently, the IEEE Std. 1547-based Volt/VAR Curv…
▽ More
The growing integration of distributed energy resources into distribution systems poses challenges for voltage regulation. Dynamic VAR Compensators (DVCs) are a new generation of power electronics-based Volt/VAR compensation devices designed to address voltage issues in distribution systems with a high penetration of renewable generation resources. Currently, the IEEE Std. 1547-based Volt/VAR Curve (VV-C) is widely used as the local control scheme for controlling a DVC. However, the effectiveness of this scheme is not well documented, and there is limited literature on alternative control and placement schemes that can maximize the effective use of a DVC. In this paper, we propose an optimal dispatch and control mechanism to enhance the conventional VV-C based localized DVC control. First, we establish a multi-objective optimization framework to identify the optimal dispatch strategy and suitable placement for the DVC. Next, we introduce two supervisory control strategies to determine the appropriate instances for adjusting the VV-C when the operating condition changes. The outlined scheme comprises two primary stages: time segmentation and VV-C fitting. Within this framework, each time segment aims to produce optimized Q-V trajectories. The proposed method is tested on a modified IEEE 123-bus test system using OpenDSS for a wide range of operating scenarios, including sunny and cloudy days. Simulation results demonstrate that the proposed scheme effectively reduces voltage variations compared to the standard VV-C specified in IEEE Std. 1547.
△ Less
Submitted 12 September, 2023;
originally announced September 2023.
-
Under-frequency Load Shedding for Power Reserve Management in Islanded Microgrids
Authors:
Bei Xu,
Victor Paduani,
Qi Xiao,
Lidong Song,
David Lubkeman,
Ning Lu
Abstract:
This paper introduces under-frequency load shedding (UFLS) schemes specially designed to fulfill the power reserve requirements in islanded microgrids (MGs), where only one grid-forming resource is available for frequency regulation. When the power consumption of the MG exceeds a pre-defined threshold, the MG frequency will be lowered to various setpoints, thereby triggering UFLS for different lev…
▽ More
This paper introduces under-frequency load shedding (UFLS) schemes specially designed to fulfill the power reserve requirements in islanded microgrids (MGs), where only one grid-forming resource is available for frequency regulation. When the power consumption of the MG exceeds a pre-defined threshold, the MG frequency will be lowered to various setpoints, thereby triggering UFLS for different levels of load reduction. Three types of controllable devices are considered for executing UFLS: sectionalizers, smart meters, and controllable appliances. To avoid unnecessary UFLS activation, various time delay settings are analyzed, allowing short-lived power spikes caused by events like motor startups or cold-load pickups to be disregarded. We tested the proposed UFLS schemes on a modified IEEE 123-bus system on the OPAL-RT eMEGASIM platform. Simulation results verify the efficacy of the proposed approaches in restoring power reserves, maintaining phase power balance, and effectively handling short-lived power fluctuations. Furthermore, in comparison to sectionalizer-based UFLS, using smart meters or controllable loads for UFLS allows for a more accurate per-phase load shedding in a progressive manner. As a result, it leads to better balanced three-phase voltage and serves more loads.
△ Less
Submitted 6 September, 2023; v1 submitted 3 September, 2023;
originally announced September 2023.
-
PBFormer: Capturing Complex Scene Text Shape with Polynomial Band Transformer
Authors:
Ruijin Liu,
Ning Lu,
Dapeng Chen,
Cheng Li,
Zejian Yuan,
Wei Peng
Abstract:
We present PBFormer, an efficient yet powerful scene text detector that unifies the transformer with a novel text shape representation Polynomial Band (PB). The representation has four polynomial curves to fit a text's top, bottom, left, and right sides, which can capture a text with a complex shape by varying polynomial coefficients. PB has appealing features compared with conventional representa…
▽ More
We present PBFormer, an efficient yet powerful scene text detector that unifies the transformer with a novel text shape representation Polynomial Band (PB). The representation has four polynomial curves to fit a text's top, bottom, left, and right sides, which can capture a text with a complex shape by varying polynomial coefficients. PB has appealing features compared with conventional representations: 1) It can model different curvatures with a fixed number of parameters, while polygon-points-based methods need to utilize a different number of points. 2) It can distinguish adjacent or overlapping texts as they have apparent different curve coefficients, while segmentation-based or points-based methods suffer from adhesive spatial positions. PBFormer combines the PB with the transformer, which can directly generate smooth text contours sampled from predicted curves without interpolation. A parameter-free cross-scale pixel attention (CPA) module is employed to highlight the feature map of a suitable scale while suppressing the other feature maps. The simple operation can help detect small-scale texts and is compatible with the one-stage DETR framework, where no postprocessing exists for NMS. Furthermore, PBFormer is trained with a shape-contained loss, which not only enforces the piecewise alignment between the ground truth and the predicted curves but also makes curves' positions and shapes consistent with each other. Without bells and whistles about text pre-training, our method is superior to the previous state-of-the-art text detectors on the arbitrary-shaped text datasets.
△ Less
Submitted 28 August, 2023;
originally announced August 2023.
-
Self-distillation Regularized Connectionist Temporal Classification Loss for Text Recognition: A Simple Yet Effective Approach
Authors:
Ziyin Zhang,
Ning Lu,
Minghui Liao,
Yongshuai Huang,
Cheng Li,
Min Wang,
Wei Peng
Abstract:
Text recognition methods are gaining rapid development. Some advanced techniques, e.g., powerful modules, language models, and un- and semi-supervised learning schemes, consecutively push the performance on public benchmarks forward. However, the problem of how to better optimize a text recognition model from the perspective of loss functions is largely overlooked. CTC-based methods, widely used i…
▽ More
Text recognition methods are gaining rapid development. Some advanced techniques, e.g., powerful modules, language models, and un- and semi-supervised learning schemes, consecutively push the performance on public benchmarks forward. However, the problem of how to better optimize a text recognition model from the perspective of loss functions is largely overlooked. CTC-based methods, widely used in practice due to their good balance between performance and inference speed, still grapple with accuracy degradation. This is because CTC loss emphasizes the optimization of the entire sequence target while neglecting to learn individual characters. We propose a self-distillation scheme for CTC-based model to address this issue. It incorporates a framewise regularization term in CTC loss to emphasize individual supervision, and leverages the maximizing-a-posteriori of latent alignment to solve the inconsistency problem that arises in distillation between CTC-based models. We refer to the regularized CTC loss as Distillation Connectionist Temporal Classification (DCTC) loss. DCTC loss is module-free, requiring no extra parameters, longer inference lag, or additional training data or phases. Extensive experiments on public benchmarks demonstrate that DCTC can boost text recognition model accuracy by up to 2.6%, without any of these drawbacks.
△ Less
Submitted 29 December, 2023; v1 submitted 17 August, 2023;
originally announced August 2023.
-
Corruptions of Supervised Learning Problems: Typology and Mitigations
Authors:
Laura Iacovissi,
Nan Lu,
Robert C. Williamson
Abstract:
Corruption is notoriously widespread in data collection. Despite extensive research, the existing literature on corruption predominantly focuses on specific settings and learning scenarios, lacking a unified view. There is still a limited understanding of how to effectively model and mitigate corruption in machine learning problems. In this work, we develop a general theory of corruption from an i…
▽ More
Corruption is notoriously widespread in data collection. Despite extensive research, the existing literature on corruption predominantly focuses on specific settings and learning scenarios, lacking a unified view. There is still a limited understanding of how to effectively model and mitigate corruption in machine learning problems. In this work, we develop a general theory of corruption from an information-theoretic perspective - with Markov kernels as a foundational mathematical tool. We generalize the definition of corruption beyond the concept of distributional shift: corruption includes all modifications of a learning problem, including changes in model class and loss function. We will focus here on changes in probability distributions. First, we construct a provably exhaustive framework for pairwise Markovian corruptions. The framework not only allows us to study corruption types based on their input space, but also serves to unify prior works on specific corruption models and establish a consistent nomenclature. Second, we systematically analyze the consequences of corruption on learning tasks by comparing Bayes risks in the clean and corrupted scenarios. This examination sheds light on complexities arising from joint and dependent corruptions on both labels and attributes. Notably, while label corruptions affect only the loss function, more intricate cases involving attribute corruptions extend the influence beyond the loss to affect the hypothesis class. Third, building upon these results, we investigate mitigations for various corruption types. We expand the existing loss-correction results for label corruption, and identify the necessity to generalize the classical corruption-corrected learning framework to a new paradigm with weaker requirements. Within the latter setting, we provide a negative result for loss correction in the attribute and the joint corruption case.
△ Less
Submitted 2 May, 2024; v1 submitted 17 July, 2023;
originally announced July 2023.
-
Algorithms for Multiple Drone-Delivery Scheduling Problem (MDSP)
Authors:
Sagnik Anupam,
Nicole Lu,
John Sragow
Abstract:
The Multiple Drone-Delivery Scheduling Problem (MDSP) is a scheduling problem that optimizes the maximum reward earned by a set of $m$ drones executing a sequence of deliveries on a truck delivery route. The current best-known approximation algorithm for the problem is a $\frac{1}{4}$-approximation algorithm developed by Jana and Mandal (2022). In this paper, we propose exact and approximation alg…
▽ More
The Multiple Drone-Delivery Scheduling Problem (MDSP) is a scheduling problem that optimizes the maximum reward earned by a set of $m$ drones executing a sequence of deliveries on a truck delivery route. The current best-known approximation algorithm for the problem is a $\frac{1}{4}$-approximation algorithm developed by Jana and Mandal (2022). In this paper, we propose exact and approximation algorithms for the general MDSP, as well as a unit-cost variant. We first propose a greedy algorithm which we show to be a $\frac{1}{3}$-approximation algorithm for the general MDSP problem formulation, provided the number of conflicting intervals is less than the number of drones. We then introduce a unit-cost variant of MDSP and we devise an exact dynamic programming algorithm that runs in polynomial time when the number of drones $m$ can be assumed to be a constant.
△ Less
Submitted 17 June, 2023;
originally announced June 2023.
-
Generalizing Importance Weighting to A Universal Solver for Distribution Shift Problems
Authors:
Tongtong Fang,
Nan Lu,
Gang Niu,
Masashi Sugiyama
Abstract:
Distribution shift (DS) may have two levels: the distribution itself changes, and the support (i.e., the set where the probability density is non-zero) also changes. When considering the support change between the training and test distributions, there can be four cases: (i) they exactly match; (ii) the training support is wider (and thus covers the test support); (iii) the test support is wider;…
▽ More
Distribution shift (DS) may have two levels: the distribution itself changes, and the support (i.e., the set where the probability density is non-zero) also changes. When considering the support change between the training and test distributions, there can be four cases: (i) they exactly match; (ii) the training support is wider (and thus covers the test support); (iii) the test support is wider; (iv) they partially overlap. Existing methods are good at cases (i) and (ii), while cases (iii) and (iv) are more common nowadays but still under-explored. In this paper, we generalize importance weighting (IW), a golden solver for cases (i) and (ii), to a universal solver for all cases. Specifically, we first investigate why IW might fail in cases (iii) and (iv); based on the findings, we propose generalized IW (GIW) that could handle cases (iii) and (iv) and would reduce to IW in cases (i) and (ii). In GIW, the test support is split into an in-training (IT) part and an out-of-training (OOT) part, and the expected risk is decomposed into a weighted classification term over the IT part and a standard classification term over the OOT part, which guarantees the risk consistency of GIW. Then, the implementation of GIW consists of three components: (a) the split of validation data is carried out by the one-class support vector machine, (b) the first term of the empirical risk can be handled by any IW algorithm given training data and IT validation data, and (c) the second term just involves OOT validation data. Experiments demonstrate that GIW is a universal solver for DS problems, outperforming IW methods in cases (iii) and (iv).
△ Less
Submitted 1 November, 2023; v1 submitted 23 May, 2023;
originally announced May 2023.
-
Large Language Models can be Guided to Evade AI-Generated Text Detection
Authors:
Ning Lu,
Shengcai Liu,
Rui He,
Qi Wang,
Yew-Soon Ong,
Ke Tang
Abstract:
Large language models (LLMs) have shown remarkable performance in various tasks and have been extensively utilized by the public. However, the increasing concerns regarding the misuse of LLMs, such as plagiarism and spamming, have led to the development of multiple detectors, including fine-tuned classifiers and statistical methods. In this study, we equip LLMs with prompts, rather than relying on…
▽ More
Large language models (LLMs) have shown remarkable performance in various tasks and have been extensively utilized by the public. However, the increasing concerns regarding the misuse of LLMs, such as plagiarism and spamming, have led to the development of multiple detectors, including fine-tuned classifiers and statistical methods. In this study, we equip LLMs with prompts, rather than relying on an external paraphraser, to evaluate the vulnerability of these detectors. We propose a novel Substitution-based In-Context example Optimization method (SICO) to automatically construct prompts for evading the detectors. SICO is cost-efficient as it requires only 40 human-written examples and a limited number of LLM inferences to generate a prompt. Moreover, once a task-specific prompt has been constructed, it can be universally used against a wide range of detectors. Extensive experiments across three real-world tasks demonstrate that SICO significantly outperforms the paraphraser baselines and enables GPT-3.5 to successfully evade six detectors, decreasing their AUC by 0.5 on average. Furthermore, a comprehensive human evaluation show that the SICO-generated text achieves human-level readability and task completion rates, while preserving high imperceptibility. Finally, we propose an ensemble approach to enhance the robustness of detectors against SICO attack. The code is publicly available at https://github.com/ColinLu50/Evade-GPT-Detector.
△ Less
Submitted 15 May, 2024; v1 submitted 18 May, 2023;
originally announced May 2023.
-
What are neutron stars made of? Gravitational waves may reveal the answer
Authors:
Neil Lu,
Susan M. Scott,
Karl Wette
Abstract:
Neutron stars are one of the most mysterious wonders in the Universe. Their extreme densities hint at new and exotic physics at work within. Gravitational waves could be the key to unlocking their secrets. In particular, a first detection of gravitational waves from rapidly-spinning, deformed neutron stars could yield new insights into the physics of matter at extreme densities and under strong gr…
▽ More
Neutron stars are one of the most mysterious wonders in the Universe. Their extreme densities hint at new and exotic physics at work within. Gravitational waves could be the key to unlocking their secrets. In particular, a first detection of gravitational waves from rapidly-spinning, deformed neutron stars could yield new insights into the physics of matter at extreme densities and under strong gravity. Once a first detection is made, a critical challenge will be to robustly extract physically interesting information from the detected signals. In this essay, we describe initial research towards answering this challenge, and thereby unleashing the full power of gravitational waves as an engine for the discovery of new physics.
△ Less
Submitted 11 May, 2023;
originally announced May 2023.
-
ICDAR 2023 Competition on Reading the Seal Title
Authors:
Wenwen Yu,
Mingyu Liu,
Mingrui Chen,
Ning Lu,
Yinlong Wen,
Yuliang Liu,
Dimosthenis Karatzas,
Xiang Bai
Abstract:
Reading seal title text is a challenging task due to the variable shapes of seals, curved text, background noise, and overlapped text. However, this important element is commonly found in official and financial scenarios, and has not received the attention it deserves in the field of OCR technology. To promote research in this area, we organized ICDAR 2023 competition on reading the seal title (Re…
▽ More
Reading seal title text is a challenging task due to the variable shapes of seals, curved text, background noise, and overlapped text. However, this important element is commonly found in official and financial scenarios, and has not received the attention it deserves in the field of OCR technology. To promote research in this area, we organized ICDAR 2023 competition on reading the seal title (ReST), which included two tasks: seal title text detection (Task 1) and end-to-end seal title recognition (Task 2). We constructed a dataset of 10,000 real seal data, covering the most common classes of seals, and labeled all seal title texts with text polygons and text contents. The competition opened on 30th December, 2022 and closed on 20th March, 2023. The competition attracted 53 participants from academia and industry including 28 submissions for Task 1 and 25 submissions for Task 2, which demonstrated significant interest in this challenging task. In this report, we present an overview of the competition, including the organization, challenges, and results. We describe the dataset and tasks, and summarize the submissions and evaluation results. The results show that significant progress has been made in the field of seal title text reading, and we hope that this competition will inspire further research and development in this important area of OCR technology.
△ Less
Submitted 5 June, 2023; v1 submitted 24 April, 2023;
originally announced April 2023.
-
STCF Conceptual Design Report: Volume 1 -- Physics & Detector
Authors:
M. Achasov,
X. C. Ai,
R. Aliberti,
L. P. An,
Q. An,
X. Z. Bai,
Y. Bai,
O. Bakina,
A. Barnyakov,
V. Blinov,
V. Bobrovnikov,
D. Bodrov,
A. Bogomyagkov,
A. Bondar,
I. Boyko,
Z. H. Bu,
F. M. Cai,
H. Cai,
J. J. Cao,
Q. H. Cao,
Z. Cao,
Q. Chang,
K. T. Chao,
D. Y. Chen,
H. Chen
, et al. (413 additional authors not shown)
Abstract:
The Super $τ$-Charm facility (STCF) is an electron-positron collider proposed by the Chinese particle physics community. It is designed to operate in a center-of-mass energy range from 2 to 7 GeV with a peak luminosity of $0.5\times 10^{35}{\rm cm}^{-2}{\rm s}^{-1}$ or higher. The STCF will produce a data sample about a factor of 100 larger than that by the present $τ$-Charm factory -- the BEPCII,…
▽ More
The Super $τ$-Charm facility (STCF) is an electron-positron collider proposed by the Chinese particle physics community. It is designed to operate in a center-of-mass energy range from 2 to 7 GeV with a peak luminosity of $0.5\times 10^{35}{\rm cm}^{-2}{\rm s}^{-1}$ or higher. The STCF will produce a data sample about a factor of 100 larger than that by the present $τ$-Charm factory -- the BEPCII, providing a unique platform for exploring the asymmetry of matter-antimatter (charge-parity violation), in-depth studies of the internal structure of hadrons and the nature of non-perturbative strong interactions, as well as searching for exotic hadrons and physics beyond the Standard Model. The STCF project in China is under development with an extensive R\&D program. This document presents the physics opportunities at the STCF, describes conceptual designs of the STCF detector system, and discusses future plans for detector R\&D and physics case studies.
△ Less
Submitted 5 October, 2023; v1 submitted 28 March, 2023;
originally announced March 2023.
-
Improving Table Structure Recognition with Visual-Alignment Sequential Coordinate Modeling
Authors:
Yongshuai Huang,
Ning Lu,
Dapeng Chen,
Yibo Li,
Zecheng Xie,
Shenggao Zhu,
Liangcai Gao,
Wei Peng
Abstract:
Table structure recognition aims to extract the logical and physical structure of unstructured table images into a machine-readable format. The latest end-to-end image-to-text approaches simultaneously predict the two structures by two decoders, where the prediction of the physical structure (the bounding boxes of the cells) is based on the representation of the logical structure. However, the pre…
▽ More
Table structure recognition aims to extract the logical and physical structure of unstructured table images into a machine-readable format. The latest end-to-end image-to-text approaches simultaneously predict the two structures by two decoders, where the prediction of the physical structure (the bounding boxes of the cells) is based on the representation of the logical structure. However, the previous methods struggle with imprecise bounding boxes as the logical representation lacks local visual information. To address this issue, we propose an end-to-end sequential modeling framework for table structure recognition called VAST. It contains a novel coordinate sequence decoder triggered by the representation of the non-empty cell from the logical structure decoder. In the coordinate sequence decoder, we model the bounding box coordinates as a language sequence, where the left, top, right and bottom coordinates are decoded sequentially to leverage the inter-coordinate dependency. Furthermore, we propose an auxiliary visual-alignment loss to enforce the logical representation of the non-empty cells to contain more local visual details, which helps produce better cell bounding boxes. Extensive experiments demonstrate that our proposed method can achieve state-of-the-art results in both logical and physical structure recognition. The ablation study also validates that the proposed coordinate sequence decoder and the visual-alignment loss are the keys to the success of our method.
△ Less
Submitted 19 March, 2023; v1 submitted 13 March, 2023;
originally announced March 2023.
-
Digital Twin-Assisted Knowledge Distillation Framework for Heterogeneous Federated Learning
Authors:
Xiucheng Wang,
Nan Cheng,
Longfei Ma,
Ruijin Sun,
Rong Chai,
Ning Lu
Abstract:
In this paper, to deal with the heterogeneity in federated learning (FL) systems, a knowledge distillation (KD) driven training framework for FL is proposed, where each user can select its neural network model on demand and distill knowledge from a big teacher model using its own private dataset. To overcome the challenge of train the big teacher model in resource limited user devices, the digital…
▽ More
In this paper, to deal with the heterogeneity in federated learning (FL) systems, a knowledge distillation (KD) driven training framework for FL is proposed, where each user can select its neural network model on demand and distill knowledge from a big teacher model using its own private dataset. To overcome the challenge of train the big teacher model in resource limited user devices, the digital twin (DT) is exploit in the way that the teacher model can be trained at DT located in the server with enough computing resources. Then, during model distillation, each user can update the parameters of its model at either the physical entity or the digital agent. The joint problem of model selection and training offloading and resource allocation for users is formulated as a mixed integer programming (MIP) problem. To solve the problem, Q-learning and optimization are jointly used, where Q-learning selects models for users and determines whether to train locally or on the server, and optimization is used to allocate resources for users based on the output of Q-learning. Simulation results show the proposed DT-assisted KD framework and joint optimization method can significantly improve the average accuracy of users while reducing the total delay.
△ Less
Submitted 10 March, 2023;
originally announced March 2023.
-
Less is More: Understanding Word-level Textual Adversarial Attack via n-gram Frequency Descend
Authors:
Ning Lu,
Shengcai Liu,
Zhirui Zhang,
Qi Wang,
Haifeng Liu,
Ke Tang
Abstract:
Word-level textual adversarial attacks have demonstrated notable efficacy in misleading Natural Language Processing (NLP) models. Despite their success, the underlying reasons for their effectiveness and the fundamental characteristics of adversarial examples (AEs) remain obscure. This work aims to interpret word-level attacks by examining their $n$-gram frequency patterns. Our comprehensive exper…
▽ More
Word-level textual adversarial attacks have demonstrated notable efficacy in misleading Natural Language Processing (NLP) models. Despite their success, the underlying reasons for their effectiveness and the fundamental characteristics of adversarial examples (AEs) remain obscure. This work aims to interpret word-level attacks by examining their $n$-gram frequency patterns. Our comprehensive experiments reveal that in approximately 90\% of cases, word-level attacks lead to the generation of examples where the frequency of $n$-grams decreases, a tendency we term as the $n$-gram Frequency Descend ($n$-FD). This finding suggests a straightforward strategy to enhance model robustness: training models using examples with $n$-FD. To examine the feasibility of this strategy, we employed the $n$-gram frequency information, as an alternative to conventional loss gradients, to generate perturbed examples in adversarial training. The experiment results indicate that the frequency-based approach performs comparably with the gradient-based approach in improving model robustness. Our research offers a novel and more intuitive perspective for understanding word-level textual adversarial attacks and proposes a new direction to improve model robustness.
△ Less
Submitted 15 April, 2024; v1 submitted 6 February, 2023;
originally announced February 2023.
-
A Novel Feeder-level Microgrid Unit Commitment Algorithm Considering Cold-load Pickup, Phase Balancing, and Reconfiguration
Authors:
Rongxing Hu,
Ashwin Shirsat,
Valliappan Muthukaruppan,
Si Zhang,
Yiyan Li,
Lidong Song,
Bei Xu,
Victor Paduani,
Ning Lu,
Mesut Baran,
Wenyuan Tang
Abstract:
This paper presents a novel 2-stage microgrid unit commitment (Microgrid-UC) algorithm considering cold-load pickup (CLPU) effects, three-phase load balancing requirements, and feasible reconfiguration options. Microgrid-UC schedules the operation of switches, generators, battery energy storage systems, and demand response resources to supply 3-phase unbalanced loads in an islanded microgrid for m…
▽ More
This paper presents a novel 2-stage microgrid unit commitment (Microgrid-UC) algorithm considering cold-load pickup (CLPU) effects, three-phase load balancing requirements, and feasible reconfiguration options. Microgrid-UC schedules the operation of switches, generators, battery energy storage systems, and demand response resources to supply 3-phase unbalanced loads in an islanded microgrid for multiple days. A performance-based CLPU model is developed to estimate additional energy needs of CLPU so that CLPU can be formulated into the traditional 2-stage UC scheduling process. A per-phase demand response budget term is added to the 1st stage UC objective function to meet 3-phase load unbalance limits. To reduce computational complexity in the 1st stage UC, we replace the spanning tree method with a feasible reconfiguration topology list method. The proposed algorithm is developed on a modified IEEE 123-bus system and tested on the real-time simulation testbed using actual load and PV data. Simulation results show that Microgrid-UC successfully accounts for CLPU, phase imbalance, and feeder reconfiguration requirements.
△ Less
Submitted 19 January, 2023;
originally announced January 2023.
-
Design Considerations of a Coordinative Demand Charge Mitigation Strategy
Authors:
Rongxing Hu,
Kai Ye,
Hyeonjin Kim,
Hanpyo Lee,
Ning Lu,
Di Wu,
PJ Rehm
Abstract:
This paper presents a coordinative demand charge mitigation (DCM) strategy for reducing electricity consumption during system peak periods. Available DCM resources include batteries, diesel generators, controllable loads, and conservation voltage reduction. All resources are directly controlled by load serving entities. A mixed integer linear programming based energy management algorithm is develo…
▽ More
This paper presents a coordinative demand charge mitigation (DCM) strategy for reducing electricity consumption during system peak periods. Available DCM resources include batteries, diesel generators, controllable loads, and conservation voltage reduction. All resources are directly controlled by load serving entities. A mixed integer linear programming based energy management algorithm is developed to optimally coordinate of DCM resources considering the load payback effect. To better capture system peak periods, two different kinds of load forecast are used: the day-ahead load forecast and the peak-hour probability forecast. Five DCM strategies are compared for reconciling the discrepancy between the two forecasting results. The DCM strategies are tested using actual utility data. Simulation results show that the proposed algorithm can effectively mitigate the demand charge while preventing the system peak from being shifted to the payback hours. We also identify the diminishing return effect, which can help load serving entities optimize the size of their DCM resources.
△ Less
Submitted 1 February, 2023; v1 submitted 16 December, 2022;
originally announced December 2022.
-
A Modified Sequence-to-point HVAC Load Disaggregation Algorithm
Authors:
Kai Ye,
Hyeonjin Kim,
Yi Hu,
Ning Lu,
Di Wu,
PJ Rehm
Abstract:
This paper presents a modified sequence-to-point (S2P) algorithm for disaggregating the heat, ventilation, and air conditioning (HVAC) load from the total building electricity consumption. The original S2P model is convolutional neural network (CNN) based, which uses load profiles as inputs. We propose three modifications. First, the input convolution layer is changed from 1D to 2D so that normali…
▽ More
This paper presents a modified sequence-to-point (S2P) algorithm for disaggregating the heat, ventilation, and air conditioning (HVAC) load from the total building electricity consumption. The original S2P model is convolutional neural network (CNN) based, which uses load profiles as inputs. We propose three modifications. First, the input convolution layer is changed from 1D to 2D so that normalized temperature profiles are also used as inputs to the S2P model. Second, a drop-out layer is added to improve adaptability and generalizability so that the model trained in one area can be transferred to other geographical areas without labelled HVAC data. Third, a fine-tuning process is proposed for areas with a small amount of labelled HVAC data so that the pre-trained S2P model can be fine-tuned to achieve higher disaggregation accuracy (i.e., better transferability) in other areas. The model is first trained and tested using smart meter and sub-metered HVAC data collected in Austin, Texas. Then, the trained model is tested on two other areas: Boulder, Colorado and San Diego, California. Simulation results show that the proposed modified S2P algorithm outperforms the original S2P model and the support-vector machine based approach in accuracy, adaptability, and transferability.
△ Less
Submitted 24 February, 2023; v1 submitted 9 December, 2022;
originally announced December 2022.
-
Optimal Control Design for Operating a Hybrid PV Plant with Robust Power Reserves for Fast Frequency Regulation Services
Authors:
Victor Paduani,
Qi Xiao,
Bei Xu,
David Lubkeman,
Ning Lu
Abstract:
This paper presents an optimal control strategy for operating a solar hybrid system consisting of solar photovoltaic (PV) and a high-power, low-storage battery energy storage system (BESS). A state-space model of the hybrid PV plant is first derived, based on which an adaptive model predictive controller is designed. The controller's objective is to control the PV and BESS to follow power setpoint…
▽ More
This paper presents an optimal control strategy for operating a solar hybrid system consisting of solar photovoltaic (PV) and a high-power, low-storage battery energy storage system (BESS). A state-space model of the hybrid PV plant is first derived, based on which an adaptive model predictive controller is designed. The controller's objective is to control the PV and BESS to follow power setpoints sent to the the hybrid system while maintaining desired power reserves and meeting system operational constraints. Furthermore, an extended Kalman filter (EKF) is implemented for estimating the battery SOC, and an error sensitivity is executed to assess its limitations. To validate the proposed strategy, detailed EMT models of the hybrid system are developed so that losses and control limits can be quantified accurately. Day-long simulations are performed in an OPAL-RT real-time simulator using second-by-second actual PV farm data as inputs. Results verify that the proposed method can follow power setpoints while maintaining power reserves in days of high irradiance intermittency even with a small BESS storage.
△ Less
Submitted 7 December, 2022;
originally announced December 2022.
-
Load Profile Inpainting for Missing Load Data Restoration and Baseline Estimation
Authors:
Yiyan Li,
Lidong Song,
Yi Hu,
Hanpyo Lee,
Di Wu,
PJ Rehm,
Ning Lu
Abstract:
This paper introduces a Generative Adversarial Nets (GAN) based, Load Profile Inpainting Network (Load-PIN) for restoring missing load data segments and estimating the baseline for a demand response event. The inputs are time series load data before and after the inpainting period together with explanatory variables (e.g., weather data). We propose a Generator structure consisting of a coarse netw…
▽ More
This paper introduces a Generative Adversarial Nets (GAN) based, Load Profile Inpainting Network (Load-PIN) for restoring missing load data segments and estimating the baseline for a demand response event. The inputs are time series load data before and after the inpainting period together with explanatory variables (e.g., weather data). We propose a Generator structure consisting of a coarse network and a fine-tuning network. The coarse network provides an initial estimation of the data segment in the inpainting period. The fine-tuning network consists of self-attention blocks and gated convolution layers for adjusting the initial estimations. Loss functions are specially designed for the fine-tuning and the discriminator networks to enhance both the point-to-point accuracy and realisticness of the results. We test the Load-PIN on three real-world data sets for two applications: patching missing data and deriving baselines of conservation voltage reduction (CVR) events. We benchmark the performance of Load-PIN with five existing deep-learning methods. Our simulation results show that, compared with the state-of-the-art methods, Load-PIN can handle varying-length missing data events and achieve 15-30% accuracy improvement.
△ Less
Submitted 29 November, 2022;
originally announced November 2022.
-
SigT: An Efficient End-to-End MIMO-OFDM Receiver Framework Based on Transformer
Authors:
Ziyou Ren,
Nan Cheng,
Ruijin Sun,
Xiucheng Wang,
Ning Lu,
Wenchao Xu
Abstract:
Multiple-input multiple-output and orthogonal frequency-division multiplexing (MIMO-OFDM) are the key technologies in 4G and subsequent wireless communication systems. Conventionally, the MIMO-OFDM receiver is performed by multiple cascaded blocks with different functions and the algorithm in each block is designed based on ideal assumptions of wireless channel distributions. However, these assump…
▽ More
Multiple-input multiple-output and orthogonal frequency-division multiplexing (MIMO-OFDM) are the key technologies in 4G and subsequent wireless communication systems. Conventionally, the MIMO-OFDM receiver is performed by multiple cascaded blocks with different functions and the algorithm in each block is designed based on ideal assumptions of wireless channel distributions. However, these assumptions may fail in practical complex wireless environments. The deep learning (DL) method has the ability to capture key features from complex and huge data. In this paper, a novel end-to-end MIMO-OFDM receiver framework based on \textit{transformer}, named SigT, is proposed. By regarding the signal received from each antenna as a token of the transformer, the spatial correlation of different antennas can be learned and the critical zero-shot problem can be mitigated. Furthermore, the proposed SigT framework can work well without the inserted pilots, which improves the useful data transmission efficiency. Experiment results show that SigT achieves much higher performance in terms of signal recovery accuracy than benchmark methods, even in a low SNR environment or with a small number of training samples. Code is available at https://github.com/SigTransformer/SigT.
△ Less
Submitted 2 November, 2022;
originally announced November 2022.
-
An Iterative Bidirectional Gradient Boosting Approach for CVR Baseline Estimation
Authors:
Han Pyo Lee,
Yiyan Li,
Lidong Song,
Di Wu,
Ning Lu
Abstract:
This paper presents a novel Iterative Bidirectional Gradient Boosting Model (IBi-GBM) for estimating the baseline of Conservation Voltage Reduction (CVR) programs. In contrast to many existing methods, we treat CVR baseline estimation as a missing data retrieval problem. The approach involves dividing the load and its corresponding temperature profiles into three periods: pre-CVR, CVR, and post-CV…
▽ More
This paper presents a novel Iterative Bidirectional Gradient Boosting Model (IBi-GBM) for estimating the baseline of Conservation Voltage Reduction (CVR) programs. In contrast to many existing methods, we treat CVR baseline estimation as a missing data retrieval problem. The approach involves dividing the load and its corresponding temperature profiles into three periods: pre-CVR, CVR, and post-CVR. To restore the missing load profile during the CVR period, the method employs a three-step process. First, a forward-pass GBM is executed using data from the pre-CVR period as inputs. Subsequently, a backward-pass GBM is applied using data from the post-CVR period. The two restored load profiles are reconciled, considering pre-calculated weights derived from forecasting accuracy, and only the leftmost and rightmost points are retained. The newly restored points are then included as inputs for the subsequent iteration. This iterative procedure continues until the original load data in the CVR period is fully restored. We develop IBi-GBM using actual smart meter and Supervisory Control and Data Acquisition (SCADA) data. Our results demonstrate that IBi-GBM exhibits robust performance across various data resolutions and in different seasons and outperforms existing methods by achieving a 1-2% reduction in normalized Root Mean Square Error (nRMSE).
△ Less
Submitted 14 December, 2023; v1 submitted 7 November, 2022;
originally announced November 2022.
-
MultiLoad-GAN: A GAN-Based Synthetic Load Group Generation Method Considering Spatial-Temporal Correlations
Authors:
Yi Hu,
Yiyan Li,
Lidong Song,
Han Pyo Lee,
PJ Rehm,
Matthew Makdad,
Edmond Miller,
Ning Lu
Abstract:
This paper presents a deep-learning framework, Multi-load Generative Adversarial Network (MultiLoad-GAN), for generating a group of synthetic load profiles (SLPs) simultaneously. The main contribution of MultiLoad-GAN is the capture of spatial-temporal correlations among a group of loads that are served by the same distribution transformer. This enables the generation of a large amount of correlat…
▽ More
This paper presents a deep-learning framework, Multi-load Generative Adversarial Network (MultiLoad-GAN), for generating a group of synthetic load profiles (SLPs) simultaneously. The main contribution of MultiLoad-GAN is the capture of spatial-temporal correlations among a group of loads that are served by the same distribution transformer. This enables the generation of a large amount of correlated SLPs required for microgrid and distribution system studies. The novelty and uniqueness of the MultiLoad-GAN framework are three-fold. First, to the best of our knowledge, this is the first method for generating a group of load profiles bearing realistic spatial-temporal correlations simultaneously. Second, two complementary realisticness metrics for evaluating generated load profiles are developed: computing statistics based on domain knowledge and comparing high-level features via a deep-learning classifier. Third, to tackle data scarcity, a novel iterative data augmentation mechanism is developed to generate training samples for enhancing the training of both the classifier and the MultiLoad-GAN model. Simulation results show that MultiLoad-GAN can generate more realistic load profiles than existing approaches, especially in group level characteristics. With little finetuning, MultiLoad-GAN can be readily extended to generate a group of load or PV profiles for a feeder or a service area.
△ Less
Submitted 23 August, 2023; v1 submitted 3 October, 2022;
originally announced October 2022.
-
A Novel Power-Band based Data Segmentation Method for Enhancing Meter Phase and Transformer-Meter Pairing Identification
Authors:
Han Pyo Lee,
PJ Rehm,
Matthew Makdad,
Edmond Miller,
Ning Lu
Abstract:
This paper presents a novel power-band-based data segmentation (PBDS) method to enhance the identification of meter phase and meter-transformer pairing. Meters that share the same transformer or are on the same phase typically exhibit strongly correlated voltage profiles. However, under high power consumption, there can be significant voltage drops along the line connecting a customer to the distr…
▽ More
This paper presents a novel power-band-based data segmentation (PBDS) method to enhance the identification of meter phase and meter-transformer pairing. Meters that share the same transformer or are on the same phase typically exhibit strongly correlated voltage profiles. However, under high power consumption, there can be significant voltage drops along the line connecting a customer to the distribution transformer. These voltage drops significantly decrease the correlations among meters on the same phase or supplied by the same transformer, resulting in high misidentification rates. To address this issue, we propose using power bands to select highly correlated voltage segments for computing correlations, rather than relying solely on correlations computed from the entire voltage waveforms. The algorithm's performance is assessed by conducting tests using data gathered from 13 utility feeders. To ensure the credibility of the identification results, utility engineers conduct field verification for all 13 feeders. The verification results unequivocally demonstrate that the proposed algorithm surpasses existing methods in both accuracy and robustness.
△ Less
Submitted 14 September, 2023; v1 submitted 30 September, 2022;
originally announced October 2022.
-
Inferring neutron star properties with continuous gravitational waves
Authors:
Neil Lu,
Karl Wette,
Susan M. Scott,
Andrew Melatos
Abstract:
Detection of continuous gravitational waves from rapidly-spinning neutron stars opens up the possibility of examining their internal physics. We develop a framework that leverages a future continuous gravitational wave detection to infer a neutron star's moment of inertia, equatorial ellipticity, and the component of the magnetic dipole moment perpendicular to its rotation axis. We assume that the…
▽ More
Detection of continuous gravitational waves from rapidly-spinning neutron stars opens up the possibility of examining their internal physics. We develop a framework that leverages a future continuous gravitational wave detection to infer a neutron star's moment of inertia, equatorial ellipticity, and the component of the magnetic dipole moment perpendicular to its rotation axis. We assume that the neutron star loses rotational kinetic energy through both gravitational wave and electromagnetic radiation, and that the distance to the neutron star can be measured, but do not assume electromagnetic pulsations are observable or a particular neutron star equation of state. We use the Fisher information matrix and Monte Carlo simulations to estimate errors in the inferred parameters, assuming a population of gravitational-wave-emitting neutron stars consistent with the typical parameter domains of continuous gravitational wave searches. After an observation time of one year, the inferred errors for many neutron stars are limited chiefly by the error in the distance to the star. The techniques developed here will be useful if continuous gravitational waves are detected from a radio, X-ray, or gamma-ray pulsar, or else from a compact object with known distance, such as a supernova remnant.
△ Less
Submitted 23 January, 2023; v1 submitted 22 September, 2022;
originally announced September 2022.
-
An ICA-Based HVAC Load Disaggregation Method Using Smart Meter Data
Authors:
Hyeonjin Kim,
Kai Ye,
Han Pyo Lee,
Rongxing Hu,
Ning Lu,
Di Wu,
PJ Rehm
Abstract:
This paper presents an independent component analysis (ICA) based unsupervised-learning method for heat, ventilation, and air-conditioning (HVAC) load disaggregation using low-resolution (e.g., 15 minutes) smart meter data. We first demonstrate that electricity consumption profiles on mild-temperature days can be used to estimate the non-HVAC base load on hot days. A residual load profile can then…
▽ More
This paper presents an independent component analysis (ICA) based unsupervised-learning method for heat, ventilation, and air-conditioning (HVAC) load disaggregation using low-resolution (e.g., 15 minutes) smart meter data. We first demonstrate that electricity consumption profiles on mild-temperature days can be used to estimate the non-HVAC base load on hot days. A residual load profile can then be calculated by subtracting the mild-day load profile from the hot-day load profile. The residual load profiles are processed using ICA for HVAC load extraction. An optimization-based algorithm is proposed for post-adjustment of the ICA results, considering two bounding factors for enhancing the robustness of the ICA algorithm. First, we use the hourly HVAC energy bounds computed based on the relationship between HVAC load and temperature to remove unrealistic HVAC load spikes. Second, we exploit the dependency between the daily nocturnal and diurnal loads extracted from historical meter data to smooth the base load profile. Pecan Street data with sub-metered HVAC data were used to test and validate the proposed methods.Simulation results demonstrated that the proposed method is computationally efficient and robust across multiple customers.
△ Less
Submitted 19 September, 2022;
originally announced September 2022.
-
All-electrical valley filtering in graphene systems (II): Numerical study of electron transport in valley valves
Authors:
Jia-Huei Jiang,
Ning-Yuan Lue,
Feng-Wu Chen,
Yu-Shu G. Wu
Abstract:
This work performs a numerical study of electron transport through the fundamental logic gate in valleytronics - a valley valve consisting of two or increasing number of valley filters. Various typical effects on the transport are investigated, such as those due to interface scattering, long- and short- range impurity scattering, edge roughness, strain, inter-filter spacing, or increasing number o…
▽ More
This work performs a numerical study of electron transport through the fundamental logic gate in valleytronics - a valley valve consisting of two or increasing number of valley filters. Various typical effects on the transport are investigated, such as those due to interface scattering, long- and short- range impurity scattering, edge roughness, strain, inter-filter spacing, or increasing number of valley filters. For illustration, we consider the class of specific valves built from graphene quantum wire valley filters in single layer or bilayer graphene, with the filters subject to separate control of in-plane, transverse electric fields. The nearest-neighbor tight-binding model of graphene is used to formulate the corresponding transport problem, and the algorithm of recursive Green's function method is applied to solve for the corresponding transmission coefficient. In the case of two-filter valves, the result explicitly demonstrates the existence of a pronounced on-off contrast in electron transmission between the two configurations of valves, namely, one with identical and the other with opposite valley polarities in the two constituent filters. The contrast is shown to be enhanced when increasing the number of filters in valves. Signatures of Fano-Fabry-Perot type resonances in association with interface scattering and inter-filter spacing are illustrated. Electron backscattering due to impurities is found to be sizably suppressed, with the valve performance showing considerable robustness against edge roughness scattering. On the other hand, the presence of a uniaxial strain modifies the electron transmission and results in an interesting quasi-periodic modulation of transmission as we vary the strain strength.
△ Less
Submitted 5 September, 2022;
originally announced September 2022.
-
All-electrical valley filtering in graphene systems (I): A path to integrated electro-valleytronics
Authors:
Feng-Wu Chen,
Nin-Yuan Lue,
Mei-Yin Chou,
Yu-Shu G. Wu
Abstract:
Probing and controlling the valley degree of freedom in graphene systems by transport measurements has been a major challenge to fully exploit the unique properties of this two-dimensional material. In this theoretical work, we show that this goal can be achieved by a quantum-wire geometry made of gapped graphene that acts as a valley filter with the following favorable features: i) all electrical…
▽ More
Probing and controlling the valley degree of freedom in graphene systems by transport measurements has been a major challenge to fully exploit the unique properties of this two-dimensional material. In this theoretical work, we show that this goal can be achieved by a quantum-wire geometry made of gapped graphene that acts as a valley filter with the following favorable features: i) all electrical gate control, ii) electrically switchable valley polarity, iii) robustness against configuration fluctuation, and iv) potential for room temperature operation. This valley filtering is accomplished by a combination of gap opening in either bilayer graphene with a vertical electrical field or single layer graphene on h-BN, valley splitting with a horizontal electric field, and intervalley mixing by defect scattering. In addition to functioning as a building block for valleytronics, the proposed configuration makes it possible to convert signals between electrical and valleytronic forms, thus allowing for the integration of electronic and valleytronic components for the realization of electro-valleytronics.
△ Less
Submitted 5 September, 2022;
originally announced September 2022.
-
Feeder Microgrid Management on an Active Distribution System during a Severe Outage
Authors:
Valliappan Muthukaruppan,
Ashwin Shirsat,
Rongxing Hu,
Victor Paduani,
Bei Xu,
Yiyan Li,
Mesut Baran,
Ning Lu,
David Lubkeman,
Wenyuan Tang
Abstract:
Forming a microgrid on a distribution system with large scale outage after a severe weather event is emerging as a viable solution to improve resiliency at the distribution level. This option becomes more attractive when the distribution system has high levels of distributed PV. The management of such feeder-level microgrid has however many challenges, such as limited resources that can be deploye…
▽ More
Forming a microgrid on a distribution system with large scale outage after a severe weather event is emerging as a viable solution to improve resiliency at the distribution level. This option becomes more attractive when the distribution system has high levels of distributed PV. The management of such feeder-level microgrid has however many challenges, such as limited resources that can be deployed on the feeder quickly, and the limited real-time monitoring and control on the distribution system. Effective use of the distributed PV is also challenging as they are not monitored and controlled. To handle these challenges, the paper proposes a 2-stage hierarchical energy management scheme to securely operate these feeder level micorgrids. The first stage of the scheme solves a sequential rolling optimization problem to optimally schedule the main resources (such as a mobile diesel generator and battery storage unit). The second stage adopts a dispatching scheme for the main resources to adjust the stage-1 set-points closer to real- time. The proposed scheme has unique features to assure that the scheme is robust under highly varying operating conditions with limited system observability: (i) an innovative PV forecast error adjustment and a dynamic reserve adjustment scheme to handle the extreme uncertainty on PV power output, and (ii) an intelligent fuel management scheme to assure that the resources are utilized optimally over the multiple days of the restoration period. The proposed algorithm is tested on sample system with real-time data. The results show that the proposed scheme performs well in maximizing service to loads by effective use of all the resources and by properly taking into account the challenging operating conditions.
△ Less
Submitted 22 August, 2022;
originally announced August 2022.
-
On-Demand Resource Management for 6G Wireless Networks Using Knowledge-Assisted Dynamic Neural Networks
Authors:
Longfei Ma,
Nan Cheng,
Xiucheng Wang,
Ruijin Sun,
Ning Lu
Abstract:
On-demand service provisioning is a critical yet challenging issue in 6G wireless communication networks, since emerging services have significantly diverse requirements and the network resources become increasingly heterogeneous and dynamic. In this paper, we study the on-demand wireless resource orchestration problem with the focus on the computing delay in orchestration decision-making process.…
▽ More
On-demand service provisioning is a critical yet challenging issue in 6G wireless communication networks, since emerging services have significantly diverse requirements and the network resources become increasingly heterogeneous and dynamic. In this paper, we study the on-demand wireless resource orchestration problem with the focus on the computing delay in orchestration decision-making process. Specifically, we take the decision-making delay into the optimization problem. Then, a dynamic neural network (DyNN)-based method is proposed, where the model complexity can be adjusted according to the service requirements. We further build a knowledge base representing the relationship among the service requirements, available computing resources, and the resource allocation performance. By exploiting the knowledge, the width of DyNN can be selected in a timely manner, further improving the performance of orchestration. Simulation results show that the proposed scheme significantly outperforms the traditional static neural network, and also shows sufficient flexibility in on-demand service provisioning.
△ Less
Submitted 2 August, 2022;
originally announced August 2022.
-
Multi-class Classification from Multiple Unlabeled Datasets with Partial Risk Regularization
Authors:
Yuting Tang,
Nan Lu,
Tianyi Zhang,
Masashi Sugiyama
Abstract:
Recent years have witnessed a great success of supervised deep learning, where predictive models were trained from a large amount of fully labeled data. However, in practice, labeling such big data can be very costly and may not even be possible for privacy reasons. Therefore, in this paper, we aim to learn an accurate classifier without any class labels. More specifically, we consider the case wh…
▽ More
Recent years have witnessed a great success of supervised deep learning, where predictive models were trained from a large amount of fully labeled data. However, in practice, labeling such big data can be very costly and may not even be possible for privacy reasons. Therefore, in this paper, we aim to learn an accurate classifier without any class labels. More specifically, we consider the case where multiple sets of unlabeled data and only their class priors, i.e., the proportions of each class, are available. Under this problem setup, we first derive an unbiased estimator of the classification risk that can be estimated from the given unlabeled sets and theoretically analyze the generalization error of the learned classifier. We then find that the classifier obtained as such tends to cause overfitting as its empirical risks go negative during training. To prevent overfitting, we further propose a partial risk regularization that maintains the partial risks with respect to unlabeled datasets and classes to certain levels. Experiments demonstrate that our method effectively mitigates overfitting and outperforms state-of-the-art methods for learning from multiple unlabeled sets.
△ Less
Submitted 15 October, 2022; v1 submitted 4 July, 2022;
originally announced July 2022.
-
The optimized point-coupling interaction for the relativistic energy density functional of Hartree-Bogoliubov approach quantifying the nuclear bulk properties
Authors:
Zi Xin Liu,
Yi Hua Lam,
Ning Lu,
Peter Ring
Abstract:
We propose a newly optimized nonlinear point-coupling parameterized interaction, PC-L3R, for the relativistic Hartree-Bogoliubov framework with a further optimized separable pairing force by fitting to observables, i.e., the binding energies of 91 spherical nuclei, charge radii of 63 nuclei, and 12 sets of mean pairing gaps consisting of 54 nuclei in total. The separable pairing force strengths of…
▽ More
We propose a newly optimized nonlinear point-coupling parameterized interaction, PC-L3R, for the relativistic Hartree-Bogoliubov framework with a further optimized separable pairing force by fitting to observables, i.e., the binding energies of 91 spherical nuclei, charge radii of 63 nuclei, and 12 sets of mean pairing gaps consisting of 54 nuclei in total. The separable pairing force strengths of proton and neutron are optimized together with the point-coupling constants, and are justified in satisfactory reproducing the empirical pairing gaps. The comparison of experimental binding energies compiled in AME2020 for 91 nuclei with the ones generated from the present and other commonly used point-coupling interactions indicates that the implementation of PC-L3R in relativistic Hartree-Bogoliubov yields the lowest root-mean-square deviation. The charge radii satisfactory agree with experiment. Meanwhile, PC-L3R is capable of estimating the saturation properties of the symmetric nuclear matter and of appropriately predicting the isospin and mass dependence of binding energy. The experimental odd-even staggering of single nucleon separation energies is well reproduced. The comparison of the estimated binding energies for 7,373 nuclei based on the PC-L3R and other point-coupling interactions is also presented.
△ Less
Submitted 8 May, 2023; v1 submitted 11 April, 2022;
originally announced April 2022.
-
Federated Learning from Only Unlabeled Data with Class-Conditional-Sharing Clients
Authors:
Nan Lu,
Zhao Wang,
Xiaoxiao Li,
Gang Niu,
Qi Dou,
Masashi Sugiyama
Abstract:
Supervised federated learning (FL) enables multiple clients to share the trained model without sharing their labeled data. However, potential clients might even be reluctant to label their own data, which could limit the applicability of FL in practice. In this paper, we show the possibility of unsupervised FL whose model is still a classifier for predicting class labels, if the class-prior probab…
▽ More
Supervised federated learning (FL) enables multiple clients to share the trained model without sharing their labeled data. However, potential clients might even be reluctant to label their own data, which could limit the applicability of FL in practice. In this paper, we show the possibility of unsupervised FL whose model is still a classifier for predicting class labels, if the class-prior probabilities are shifted while the class-conditional distributions are shared among the unlabeled data owned by the clients. We propose federation of unsupervised learning (FedUL), where the unlabeled data are transformed into surrogate labeled data for each of the clients, a modified model is trained by supervised FL, and the wanted model is recovered from the modified model. FedUL is a very general solution to unsupervised FL: it is compatible with many supervised FL methods, and the recovery of the wanted model can be theoretically guaranteed as if the data have been labeled. Experiments on benchmark and real-world datasets demonstrate the effectiveness of FedUL. Code is available at https://github.com/lunanbit/FedUL.
△ Less
Submitted 11 May, 2022; v1 submitted 7 April, 2022;
originally announced April 2022.
-
Improving Di-Higgs Sensitivity at Future Colliders in Hadronic Final States with Machine Learning
Authors:
Artur Apresyan,
Daniel Diaz,
Javier Duarte,
Sanmay Ganguly,
Raghav Kansal,
Nan Lu,
Cristina Mantilla Suarez,
Samadrita Mukherjee,
Cristían Peña,
Brian Sheldon,
Si Xie
Abstract:
One of the central goals of the physics program at the future colliders is to elucidate the origin of electroweak symmetry breaking, including precision measurements of the Higgs sector. This includes a detailed study of Higgs boson (H) pair production, which can reveal the H self-coupling. Since the discovery of the Higgs boson, a large campaign of measurements of the properties of the Higgs boso…
▽ More
One of the central goals of the physics program at the future colliders is to elucidate the origin of electroweak symmetry breaking, including precision measurements of the Higgs sector. This includes a detailed study of Higgs boson (H) pair production, which can reveal the H self-coupling. Since the discovery of the Higgs boson, a large campaign of measurements of the properties of the Higgs boson has begun and many new ideas have emerged during the completion of this program. One such idea is the use of highly boosted and merged hadronic decays of the Higgs boson ($\mathrm{H}\to\mathrm{b}\bar{\mathrm{b}}$, $\mathrm{H}\to\mathrm{W}\mathrm{W}\to\mathrm{q}\bar{\mathrm{q}}\mathrm{q}\bar{\mathrm{q}}$) with machine learning methods to improve the signal-to-background discrimination. In this white paper, we champion the use of these modes to boost the sensitivity of future collider physics programs to Higgs boson pair production, the Higgs self-coupling, and Higgs-vector boson couplings. We demonstrate the potential improvement possible at the Future Circular Collider in hadron mode, especially with the use of graph neural networks.
△ Less
Submitted 4 April, 2022; v1 submitted 14 March, 2022;
originally announced March 2022.
-
Ultra-High Lithium Storage Capacity of Al2C Monolayer under Restricted Multilayered Growth Mechanism
Authors:
Ning Lu,
Kai Wang,
Jiaxin Jiang,
Hongyan Guo,
Gui Zhong Zuo,
Zhiwen Zhuo,
Xiaojun Wu,
Xiao Cheng Zeng
Abstract:
Designing anode materials with high lithium specific capacity is crucial to the development of high energy-density lithium ion batteries. Herein, a distinctive lithium growth mechanism, namely, the restricted multilayered growth for lithium, and a strategy for lithium storage are proposed to achieve the balance between the ultra-high specific capacity and the need to avert uncontrolled dendritic g…
▽ More
Designing anode materials with high lithium specific capacity is crucial to the development of high energy-density lithium ion batteries. Herein, a distinctive lithium growth mechanism, namely, the restricted multilayered growth for lithium, and a strategy for lithium storage are proposed to achieve the balance between the ultra-high specific capacity and the need to avert uncontrolled dendritic growth of lithium. In particular, based on first-principles computation, we show that the Al2C monolayer with planar tetracoordinate carbon structure can be an ideal platform for realizing the restricted multilayered growth mechanism as a 2D anode material. Furthermore, the Al2C monolayer exhibits ultra-high specific capacity of lithium of 4059 mAh/g, yet with a low dif-fusion barrier of 0.039-0.17 eV as well as low open circuit voltage in the range of 0.002-0.34 V. These novel properties endow the Al2C monolayer a promising anode material for future lithium ion batteries. Our study offers a new way to design promising 2D anode materials with high specific capacity, fast lithium-ion diffusion, and safe lithium storage mechanism.
△ Less
Submitted 13 March, 2022;
originally announced March 2022.
-
Anisotropic Electrene T'-Ca2P with Electron Gas Magnetic Coupling as Anode Material for Na/K Ion Batteries
Authors:
Jiaxin Jiang,
Kai Wang,
Hongyan Guo,
Guizhong Zuo,
Zhiwen Zhuo,
Ning Lu
Abstract:
There is an urgently need for the high-performance rechargeable electrical storage devices as supplement or substitutions of lithium ion batteries due to the shortage of lithium in nature. Herein we propose a stable 2D electrene T'-Ca2P as anode material for Na/K ion batteries by first-principle calculations. Our calculated results show that T'-Ca2P monolayer is an antiferromagnetic semiconducting…
▽ More
There is an urgently need for the high-performance rechargeable electrical storage devices as supplement or substitutions of lithium ion batteries due to the shortage of lithium in nature. Herein we propose a stable 2D electrene T'-Ca2P as anode material for Na/K ion batteries by first-principle calculations. Our calculated results show that T'-Ca2P monolayer is an antiferromagnetic semiconducting electrene with spin-polarized electron gas. It exhibits suitable adsorption for both Na and K atoms, and its anisotropic migration energy barriers are 0.050/0.101 eV and 0.037/0.091 eV in b/a direction, respectively. The theoretical capacities for Na and K are both 482 mAh/g, while the average working voltage platforms are 0.171-0.226 V and 0.013-0.267 V, respectively. All the results reveal that the T'-Ca2P monolayer has promised application prospects as anode materials for Na/K ion batteries.
△ Less
Submitted 13 March, 2022;
originally announced March 2022.