-
Efficient Sampling for Data-Driven Frequency Stability Constraint via Forward-Mode Automatic Differentiation
Authors:
Wangkun Xu,
Qian Chen,
Pudong Ge,
Zhongda Chu,
Fei Teng
Abstract:
Encoding frequency stability constraints in the operation problem is challenging due to its complex dynamics. Recently, data-driven approaches have been proposed to learn the stability criteria offline with the trained model embedded as a constraint of online optimization. However, random sampling of stationary operation points is less efficient in generating balanced stable and unstable samples.…
▽ More
Encoding frequency stability constraints in the operation problem is challenging due to its complex dynamics. Recently, data-driven approaches have been proposed to learn the stability criteria offline with the trained model embedded as a constraint of online optimization. However, random sampling of stationary operation points is less efficient in generating balanced stable and unstable samples. Meanwhile, the performance of such a model is strongly dependent on the quality of the training dataset. Observing this research gap, we propose a gradient-based data generation method via forward-mode automatic differentiation. In this method, the original dynamic system is augmented with new states that represent the dynamic of sensitivities of the original states, which can be solved by invoking any ODE solver for a single time. To compensate for the contradiction between the gradient of various frequency stability criteria, gradient surgery is proposed by projecting the gradient on the normal plane of the other. In the end, we demonstrate the superior performance of the proposed sampling algorithm, compared with the unrolling differentiation and finite difference. All codes are available at https://github.com/xuwkk/frequency_sample_ad.
△ Less
Submitted 20 July, 2024;
originally announced July 2024.
-
Conservative Spin Magnitude Change in Orbital Evolution in General Relativity
Authors:
Mark Alaverdian,
Zvi Bern,
Dimitrios Kosmopoulos,
Andres Luna,
Radu Roiban,
Trevor Scheopner,
Fei Teng
Abstract:
We show that physical scattering observables for compact spinning objects in general relativity can depend on additional degrees of freedom in the spin tensor beyond those described by the spin vector alone. The impulse, spin kick, and leading-order waveforms exhibit such a nontrivial dependence. A signal of this additional structure is the change in the magnitude of the spin vector under conserva…
▽ More
We show that physical scattering observables for compact spinning objects in general relativity can depend on additional degrees of freedom in the spin tensor beyond those described by the spin vector alone. The impulse, spin kick, and leading-order waveforms exhibit such a nontrivial dependence. A signal of this additional structure is the change in the magnitude of the spin vector under conservative Hamiltonian evolution, similar to our previous studies in electrodynamics. These additional degrees of freedom describe dynamical mass multipoles of compact objects and decouple for black holes. We also show that the conservative impulse, spin kick and change of the additional degrees of freedom are encoded in the eikonal phase.
△ Less
Submitted 15 July, 2024;
originally announced July 2024.
-
Difference Equations and Integral Families for Witten Diagrams
Authors:
Mark Alaverdian,
Aidan Herderschee,
Radu Roiban,
Fei Teng
Abstract:
We show that tree-level and one-loop Mellin space correlators in anti-de Sitter space obey certain difference equations, which are the direct analog to the differential equations for Feynman loop integrals in the flat space. Finite-difference relations, which we refer to as ``summation-by-parts relations'', in parallel with the integration-by-parts relations for Feynman loop integrals, are derived…
▽ More
We show that tree-level and one-loop Mellin space correlators in anti-de Sitter space obey certain difference equations, which are the direct analog to the differential equations for Feynman loop integrals in the flat space. Finite-difference relations, which we refer to as ``summation-by-parts relations'', in parallel with the integration-by-parts relations for Feynman loop integrals, are derived to reduce the integrals to a basis. We illustrate the general methodology by explicitly deriving the difference equations and summation-by-parts relations for various tree-level and one-loop Witten diagrams up to the four-point bubble level.
△ Less
Submitted 6 June, 2024;
originally announced June 2024.
-
Cardinality Estimation on Hyper-relational Knowledge Graphs
Authors:
Fei Teng,
Haoyang Li,
Shimin Di,
Lei Chen
Abstract:
Cardinality Estimation (CE) for query is to estimate the number of results without execution, which is an effective index in query optimization. Recently, CE over has achieved great success in knowledge graphs (KGs) that consist of triple facts. To more precisely represent facts, current researchers propose hyper-relational KGs (HKGs) to represent a triple fact with qualifiers, where qualifiers pr…
▽ More
Cardinality Estimation (CE) for query is to estimate the number of results without execution, which is an effective index in query optimization. Recently, CE over has achieved great success in knowledge graphs (KGs) that consist of triple facts. To more precisely represent facts, current researchers propose hyper-relational KGs (HKGs) to represent a triple fact with qualifiers, where qualifiers provide additional context to the fact. However, existing CE methods over KGs achieve unsatisfying performance on HKGs due to the complexity of qualifiers in HKGs. Also, there is only one dataset for HKG query cardinality estimation, i.e., WD50K-QE, which is not comprehensive and only covers limited patterns. The lack of querysets over HKG also becomes a bottleneck to comprehensively investigate CE problems on HKGs. In this work, we first construct diverse and unbiased hyper-relational querysets over three popular HKGs for investigating CE. Besides, we also propose a novel qualifier-attached graph neural network (GNN) model that effectively incorporates qualifier information and adaptively combines outputs from multiple GNN layers, to accurately predict the cardinality. Our experiments illustrate that the proposed hyper-relational query encoder outperforms all state-of-the-art CE methods over three popular HKGs on the diverse and unbiased benchmark.
△ Less
Submitted 24 May, 2024;
originally announced May 2024.
-
Coordinated Planning for Stability Enhancement in High IBR-Penetrated Systems
Authors:
Zhongda Chu,
Fei Teng
Abstract:
Security and stability challenges in future power systems with high penetration Inverter-Based Resources (IBR) have been anticipated as the main barrier to decolonization. Grid-following IBRs may become unstable under small disturbances in weak grids, while, during transient processes, system stability and protection may be jeopardized due to the lack of sufficient Short-Circuit Current (SCC). To…
▽ More
Security and stability challenges in future power systems with high penetration Inverter-Based Resources (IBR) have been anticipated as the main barrier to decolonization. Grid-following IBRs may become unstable under small disturbances in weak grids, while, during transient processes, system stability and protection may be jeopardized due to the lack of sufficient Short-Circuit Current (SCC). To solve these challenges and achieve decarbonization, the future system has to be carefully planned. However, it remains unclear how both small-signal and transient processes can be considered during the system planning stage. In this context, this paper proposes a coordinated planning model of different resources to enhance system-level stability. The system strength and SCC constraints are analytically derived by considering the different characteristics of synchronous units and IBRs, which are further effectively linearized through a novel data-driven approach, where an active sampling method is proposed to generate a representative data set. The significant economic value of the proposed coordinated planning framework in both system asset investment and system operation is demonstrated through detailed case studies.
△ Less
Submitted 22 April, 2024;
originally announced April 2024.
-
Continual Learning for Smart City: A Survey
Authors:
Li Yang,
Zhipeng Luo,
Shiming Zhang,
Fei Teng,
Tianrui Li
Abstract:
With the digitization of modern cities, large data volumes and powerful computational resources facilitate the rapid update of intelligent models deployed in smart cities. Continual learning (CL) is a novel machine learning paradigm that constantly updates models to adapt to changing environments, where the learning tasks, data, and distributions can vary over time. Our survey provides a comprehen…
▽ More
With the digitization of modern cities, large data volumes and powerful computational resources facilitate the rapid update of intelligent models deployed in smart cities. Continual learning (CL) is a novel machine learning paradigm that constantly updates models to adapt to changing environments, where the learning tasks, data, and distributions can vary over time. Our survey provides a comprehensive review of continual learning methods that are widely used in smart city development. The content consists of three parts: 1) Methodology-wise. We categorize a large number of basic CL methods and advanced CL frameworks in combination with other learning paradigms including graph learning, spatial-temporal learning, multi-modal learning, and federated learning. 2) Application-wise. We present numerous CL applications covering transportation, environment, public health, safety, networks, and associated datasets related to urban computing. 3) Challenges. We discuss current problems and challenges and envision several promising research directions. We believe this survey can help relevant researchers quickly familiarize themselves with the current state of continual learning research used in smart city development and direct them to future research trends.
△ Less
Submitted 1 April, 2024;
originally announced April 2024.
-
Can LLMs Learn from Previous Mistakes? Investigating LLMs' Errors to Boost for Reasoning
Authors:
Yongqi Tong,
Dawei Li,
Sizhe Wang,
Yujia Wang,
Fei Teng,
Jingbo Shang
Abstract:
Recent works have shown the benefits to LLMs from fine-tuning golden-standard Chain-of-Thought (CoT) rationales or using them as correct examples in few-shot prompting. While humans can indeed imitate correct examples, learning from our mistakes is another vital aspect of human cognition. Hence, a question naturally arises: \textit{can LLMs learn and benefit from their mistakes, especially for the…
▽ More
Recent works have shown the benefits to LLMs from fine-tuning golden-standard Chain-of-Thought (CoT) rationales or using them as correct examples in few-shot prompting. While humans can indeed imitate correct examples, learning from our mistakes is another vital aspect of human cognition. Hence, a question naturally arises: \textit{can LLMs learn and benefit from their mistakes, especially for their reasoning? } This study investigates this problem from both the prompting and model-tuning perspectives. We begin by introducing \textsc{CoTErrorSet}, a new benchmark with 609,432 questions, each designed with both correct and error references, and demonstrating the types and reasons for making such mistakes. To explore the effectiveness of those mistakes, we design two methods: (1) \textbf{Self-rethinking} prompting guides LLMs to rethink whether they have made similar previous mistakes; and (2) \textbf{Mistake tuning} involves finetuning models in both correct and incorrect reasoning domains, rather than only tuning models to learn ground truth in traditional methodology. We conduct a series of experiments to prove LLMs can obtain benefits from mistakes in both directions. Our two methods offer potentially cost-effective strategies by leveraging errors to enhance reasoning capabilities, which costs significantly less than creating meticulously hand-crafted golden references. We ultimately make a thorough analysis of the reasons behind LLMs' errors, which provides directions that future research needs to overcome. \textsc{CoTErrorSet} will be published soon on \texttt{\url{https://github.com/YookiTong/Learn-from-Mistakes-CotErrorSet}}.
△ Less
Submitted 7 June, 2024; v1 submitted 29 March, 2024;
originally announced March 2024.
-
Self-supervised Contrastive Learning for Implicit Collaborative Filtering
Authors:
Shipeng Song,
Bin Liu,
Fei Teng,
Tianrui Li
Abstract:
Contrastive learning-based recommendation algorithms have significantly advanced the field of self-supervised recommendation, particularly with BPR as a representative ranking prediction task that dominates implicit collaborative filtering. However, the presence of false-positive and false-negative examples in recommendation systems hampers accurate preference learning. In this study, we propose a…
▽ More
Contrastive learning-based recommendation algorithms have significantly advanced the field of self-supervised recommendation, particularly with BPR as a representative ranking prediction task that dominates implicit collaborative filtering. However, the presence of false-positive and false-negative examples in recommendation systems hampers accurate preference learning. In this study, we propose a simple self-supervised contrastive learning framework that leverages positive feature augmentation and negative label augmentation to improve the self-supervisory signal. Theoretical analysis demonstrates that our learning method is equivalent to maximizing the likelihood estimation with latent variables representing user interest centers. Additionally, we establish an efficient negative label augmentation technique that samples unlabeled examples with a probability linearly dependent on their relative ranking positions, enabling efficient augmentation in constant time complexity. Through validation on multiple datasets, we illustrate the significant improvements our method achieves over the widely used BPR optimization objective while maintaining comparable runtime.
△ Less
Submitted 11 March, 2024;
originally announced March 2024.
-
A Survey of Route Recommendations: Methods, Applications, and Opportunities
Authors:
Shiming Zhang,
Zhipeng Luo,
Li Yang,
Fei Teng,
Tianrui Li
Abstract:
Nowadays, with advanced information technologies deployed citywide, large data volumes and powerful computational resources are intelligentizing modern city development. As an important part of intelligent transportation, route recommendation and its applications are widely used, directly influencing citizens` travel habits. Developing smart and efficient travel routes based on big data (possibly…
▽ More
Nowadays, with advanced information technologies deployed citywide, large data volumes and powerful computational resources are intelligentizing modern city development. As an important part of intelligent transportation, route recommendation and its applications are widely used, directly influencing citizens` travel habits. Developing smart and efficient travel routes based on big data (possibly multi-modal) has become a central challenge in route recommendation research. Our survey offers a comprehensive review of route recommendation work based on urban computing. It is organized by the following three parts: 1) Methodology-wise. We categorize a large volume of traditional machine learning and modern deep learning methods. Also, we discuss their historical relations and reveal the edge-cutting progress. 2) Application\-wise. We present numerous novel applications related to route commendation within urban computing scenarios. 3) We discuss current problems and challenges and envision several promising research directions. We believe that this survey can help relevant researchers quickly familiarize themselves with the current state of route recommendation research and then direct them to future research trends.
△ Less
Submitted 6 April, 2024; v1 submitted 1 March, 2024;
originally announced March 2024.
-
Uncertainty-Aware Transient Stability-Constrained Preventive Redispatch: A Distributional Reinforcement Learning Approach
Authors:
Zhengcheng Wang,
Fei Teng,
Yanzhen Zhou,
Qinglai Guo,
Hongbin Sun
Abstract:
Transient stability-constrained preventive redispatch plays a crucial role in ensuring power system security and stability. Since redispatch strategies need to simultaneously satisfy complex transient constraints and the economic need, model-based formulation and optimization become extremely challenging. In addition, the increasing uncertainty and variability introduced by renewable sources start…
▽ More
Transient stability-constrained preventive redispatch plays a crucial role in ensuring power system security and stability. Since redispatch strategies need to simultaneously satisfy complex transient constraints and the economic need, model-based formulation and optimization become extremely challenging. In addition, the increasing uncertainty and variability introduced by renewable sources start to drive the system stability consideration from deterministic to probabilistic, which further exaggerates the complexity. In this paper, a Graph neural network guided Distributional Deep Reinforcement Learning (GD2RL) method is proposed, for the first time, to solve the uncertainty-aware transient stability-constrained preventive redispatch problem. First, a graph neural network-based transient simulator is trained by supervised learning to efficiently generate post-contingency rotor angle curves with the steady-state and contingency as inputs, which serves as a feature extractor for operating states and a surrogate time-domain simulator during the environment interaction for reinforcement learning. Distributional deep reinforcement learning with explicit uncertainty distribution of system operational conditions is then applied to generate the redispatch strategy to balance the user-specified probabilistic stability performance and economy preferences. The full distribution of the post-redispatch transient stability index is directly provided as the output. Case studies on the modified New England 39-bus system validate the proposed method.
△ Less
Submitted 29 June, 2024; v1 submitted 14 February, 2024;
originally announced February 2024.
-
Gravitational Waveform: A Tale of Two Formalisms
Authors:
Donato Bini,
Thibault Damour,
Stefano De Angelis,
Andrea Geralico,
Aidan Herderschee,
Radu Roiban,
Fei Teng
Abstract:
We revisit the quantum-amplitude-based derivation of the gravitational waveform emitted by the scattering of two spinless massive bodies at the third order in Newton's constant, $h \sim G+G^2+G^3$ (one-loop level), and correspondingly update its comparison with its classically-derived multipolar-post-Minkowskian counterpart. A spurious-pole-free reorganization of the one-loop five-point amplitude…
▽ More
We revisit the quantum-amplitude-based derivation of the gravitational waveform emitted by the scattering of two spinless massive bodies at the third order in Newton's constant, $h \sim G+G^2+G^3$ (one-loop level), and correspondingly update its comparison with its classically-derived multipolar-post-Minkowskian counterpart. A spurious-pole-free reorganization of the one-loop five-point amplitude substantially simplifies the post-Newtonian expansion. We find complete agreement between the two results up to the fifth order in the small velocity expansion after taking into account three subtle aspects of the amplitude derivation: (1) in agreement with [arXiv:2312.07452 [hep-th]], the term quadratic in the amplitude in the observable-based formalism [JHEP 02, 137 (2019)] generates a frame rotation by half the classical scattering angle; (2) the dimensional regularization of the infrared divergences of the amplitude introduces an additional $(d-4)/(d-4)$ finite term; and (3) zero-frequency gravitons are found to contribute additional terms both at order $h \sim G^1$ and at order $h \sim G^3$ when including disconnected diagrams in the observable-based formalism.
△ Less
Submitted 21 June, 2024; v1 submitted 9 February, 2024;
originally announced February 2024.
-
Learning Contrastive Feature Representations for Facial Action Unit Detection
Authors:
Ziqiao Shang,
Bin Liu,
Fengmao Lv,
Fei Teng,
Tianrui Li
Abstract:
Facial action unit (AU) detection has long encountered the challenge of detecting subtle feature differences when AUs activate. Existing methods often rely on encoding pixel-level information of AUs, which not only encodes additional redundant information but also leads to increased model complexity and limited generalizability. Additionally, the accuracy of AU detection is negatively impacted by…
▽ More
Facial action unit (AU) detection has long encountered the challenge of detecting subtle feature differences when AUs activate. Existing methods often rely on encoding pixel-level information of AUs, which not only encodes additional redundant information but also leads to increased model complexity and limited generalizability. Additionally, the accuracy of AU detection is negatively impacted by the class imbalance issue of each AU type, and the presence of noisy and false AU labels. In this paper, we introduce a novel contrastive learning framework aimed for AU detection that incorporates both self-supervised and supervised signals, thereby enhancing the learning of discriminative features for accurate AU detection. To tackle the class imbalance issue, we employ a negative sample re-weighting strategy that adjusts the step size of updating parameters for minority and majority class samples. Moreover, to address the challenges posed by noisy and false AU labels, we employ a sampling technique that encompasses three distinct types of positive sample pairs. This enables us to inject self-supervised signals into the supervised signal, effectively mitigating the adverse effects of noisy labels. Our experimental assessments, conducted on four widely-utilized benchmark datasets (BP4D, DISFA, GFT and Aff-Wild2), underscore the superior performance of our approach compared to state-of-the-art methods of AU detection. Our code is available at \url{https://github.com/Ziqiao-Shang/AUNCE}.
△ Less
Submitted 12 July, 2024; v1 submitted 8 February, 2024;
originally announced February 2024.
-
LF Tracy: A Unified Single-Pipeline Approach for Salient Object Detection in Light Field Cameras
Authors:
Fei Teng,
Jiaming Zhang,
Jiawei Liu,
Kunyu Peng,
Xina Cheng,
Zhiyong Li,
Kailun Yang
Abstract:
Leveraging the rich information extracted from light field (LF) cameras is instrumental for dense prediction tasks. However, adapting light field data to enhance Salient Object Detection (SOD) still follows the traditional RGB methods and remains under-explored in the community. Previous approaches predominantly employ a custom two-stream design to discover the implicit angular feature within ligh…
▽ More
Leveraging the rich information extracted from light field (LF) cameras is instrumental for dense prediction tasks. However, adapting light field data to enhance Salient Object Detection (SOD) still follows the traditional RGB methods and remains under-explored in the community. Previous approaches predominantly employ a custom two-stream design to discover the implicit angular feature within light field cameras, leading to significant information isolation between different LF representations. In this study, we propose an efficient paradigm (LF Tracy) to address this limitation. We eschew the conventional specialized fusion and decoder architecture for a dual-stream backbone in favor of a unified, single-pipeline approach. This comprises firstly a simple yet effective data augmentation strategy called MixLD to bridge the connection of spatial, depth, and implicit angular information under different LF representations. A highly efficient information aggregation (IA) module is then introduced to boost asymmetric feature-wise information fusion. Owing to this innovative approach, our model surpasses the existing state-of-the-art methods, particularly demonstrating a 23% improvement over previous results on the latest large-scale PKU dataset. By utilizing only 28.9M parameters, the model achieves a 10% increase in accuracy with 3M additional parameters compared to its backbone using RGB images and an 86% rise to its backbone using LF images. The source code will be made publicly available at https://github.com/FeiBryantkit/LF-Tracy.
△ Less
Submitted 29 January, 2024;
originally announced January 2024.
-
Pricing of Short Circuit Current in High IBR-Penetrated System
Authors:
Zhongda Chu,
Jingyi Wu,
Fei Teng
Abstract:
With the growing penetration of Inverter-Based Resources (IBRs) in power systems, stability service markets have emerged to incentivize technologies that ensure power system stability and reliability. Among the various challenges faced in power system operation and stability, a prominent issue raised from the increasing integration of large-scale IBRs is the significant reduction of the Short-Circ…
▽ More
With the growing penetration of Inverter-Based Resources (IBRs) in power systems, stability service markets have emerged to incentivize technologies that ensure power system stability and reliability. Among the various challenges faced in power system operation and stability, a prominent issue raised from the increasing integration of large-scale IBRs is the significant reduction of the Short-Circuit Current (SCC) level in the system, which poses a considerable threat to system voltage stability and protection. Thus, a proper market mechanism to incentivize the provision of SCC as a stability service is desired. However, the pricing of this service within the future stability market has not yet been fully developed, due to the nonconvex nature of SCC constraints and the locational property of SCC. To address these problems, this work aims to explore, for the first time, a pricing model for SCC service by incorporating a linearized SCC constraint into the Unit Commitment (UC) problem, to achieve the desired SCC level and extract the shadow price for SCC through different pricing methods.
△ Less
Submitted 24 January, 2024;
originally announced January 2024.
-
Decision-Oriented Learning for Future Power System Decision-Making under Uncertainty
Authors:
Ran Li,
Haipeng Zhang,
Mingyang Sun,
Fei Teng,
Can Wan,
Salvador Pineda,
Georges Kariniotakis
Abstract:
Better forecasts may not lead to better decision-making. To address this challenge, decision-oriented learning (DOL) has been proposed as a new branch of machine learning that replaces traditional statistical loss with a decision loss to form an end-to-end model. Applications of DOL in power systems have been developed in recent years. For renewable-rich power systems, uncertainties propagate thro…
▽ More
Better forecasts may not lead to better decision-making. To address this challenge, decision-oriented learning (DOL) has been proposed as a new branch of machine learning that replaces traditional statistical loss with a decision loss to form an end-to-end model. Applications of DOL in power systems have been developed in recent years. For renewable-rich power systems, uncertainties propagate through sequential tasks, where traditional statistical-based approaches focus on minimizing statistical errors at intermediate stages but may fail to provide optimal decisions at the final stage. This paper first elaborates on the mismatch between more accurate forecasts and more optimal decisions in the power system caused by statistical-based learning (SBL) and explains how DOL resolves this problem. Secondly, this paper extensively reviews DOL techniques and their applications in power systems while highlighting their pros and cons in relation to SBL. Finally, this paper identifies the challenges to adopt DOL in the energy sector and presents future research directions.
△ Less
Submitted 7 April, 2024; v1 submitted 8 January, 2024;
originally announced January 2024.
-
Learning Time-aware Graph Structures for Spatially Correlated Time Series Forecasting
Authors:
Minbo Ma,
Jilin Hu,
Christian S. Jensen,
Fei Teng,
Peng Han,
Zhiqiang Xu,
Tianrui Li
Abstract:
Spatio-temporal forecasting of future values of spatially correlated time series is important across many cyber-physical systems (CPS). Recent studies offer evidence that the use of graph neural networks to capture latent correlations between time series holds a potential for enhanced forecasting. However, most existing methods rely on pre-defined or self-learning graphs, which are either static o…
▽ More
Spatio-temporal forecasting of future values of spatially correlated time series is important across many cyber-physical systems (CPS). Recent studies offer evidence that the use of graph neural networks to capture latent correlations between time series holds a potential for enhanced forecasting. However, most existing methods rely on pre-defined or self-learning graphs, which are either static or unintentionally dynamic, and thus cannot model the time-varying correlations that exhibit trends and periodicities caused by the regularity of the underlying processes in CPS. To tackle such limitation, we propose Time-aware Graph Structure Learning (TagSL), which extracts time-aware correlations among time series by measuring the interaction of node and time representations in high-dimensional spaces. Notably, we introduce time discrepancy learning that utilizes contrastive learning with distance-based regularization terms to constrain learned spatial correlations to a trend sequence. Additionally, we propose a periodic discriminant function to enable the capture of periodic changes from the state of nodes. Next, we present a Graph Convolution-based Gated Recurrent Unit (GCGRU) that jointly captures spatial and temporal dependencies while learning time-aware and node-specific patterns. Finally, we introduce a unified framework named Time-aware Graph Convolutional Recurrent Network (TGCRN), combining TagSL, and GCGRU in an encoder-decoder architecture for multi-step spatio-temporal forecasting. We report on experiments with TGCRN and popular existing approaches on five real-world datasets, thus providing evidence that TGCRN is capable of advancing the state-of-the-art. We also cover a detailed ablation study and visualization analysis, offering detailed insight into the effectiveness of time-aware structure learning.
△ Less
Submitted 26 December, 2023;
originally announced December 2023.
-
E2E-AT: A Unified Framework for Tackling Uncertainty in Task-aware End-to-end Learning
Authors:
Wangkun Xu,
Jianhong Wang,
Fei Teng
Abstract:
Successful machine learning involves a complete pipeline of data, model, and downstream applications. Instead of treating them separately, there has been a prominent increase of attention within the constrained optimization (CO) and machine learning (ML) communities towards combining prediction and optimization models. The so-called end-to-end (E2E) learning captures the task-based objective for w…
▽ More
Successful machine learning involves a complete pipeline of data, model, and downstream applications. Instead of treating them separately, there has been a prominent increase of attention within the constrained optimization (CO) and machine learning (ML) communities towards combining prediction and optimization models. The so-called end-to-end (E2E) learning captures the task-based objective for which they will be used for decision making. Although a large variety of E2E algorithms have been presented, it has not been fully investigated how to systematically address uncertainties involved in such models. Most of the existing work considers the uncertainties of ML in the input space and improves robustness through adversarial training. We extend this idea to E2E learning and prove that there is a robustness certification procedure by solving augmented integer programming. Furthermore, we show that neglecting the uncertainty of COs during training causes a new trigger for generalization errors. To include all these components, we propose a unified framework that covers the uncertainties emerging in both the input feature space of the ML models and the COs. The framework is described as a robust optimization problem and is practically solved via end-to-end adversarial training (E2E-AT). Finally, the performance of E2E-AT is evaluated by a real-world end-to-end power system operation problem, including load forecasting and sequential scheduling tasks.
△ Less
Submitted 23 December, 2023; v1 submitted 16 December, 2023;
originally announced December 2023.
-
ULMA: Unified Language Model Alignment with Human Demonstration and Point-wise Preference
Authors:
Tianchi Cai,
Xierui Song,
Jiyan Jiang,
Fei Teng,
Jinjie Gu,
Guannan Zhang
Abstract:
Aligning language models to human expectations, e.g., being helpful and harmless, has become a pressing challenge for large language models. A typical alignment procedure consists of supervised fine-tuning and preference learning. Most preference learning methods, such as RLHF and DPO, depend on pairwise preference data, which inadequately address scenarios where human feedback is point-wise, lead…
▽ More
Aligning language models to human expectations, e.g., being helpful and harmless, has become a pressing challenge for large language models. A typical alignment procedure consists of supervised fine-tuning and preference learning. Most preference learning methods, such as RLHF and DPO, depend on pairwise preference data, which inadequately address scenarios where human feedback is point-wise, leading to potential information loss and suboptimal performance. Addressing this gap, we introduce Point-wise Direct Preference Optimization, a novel preference learning method designed to harness point-wise feedback effectively. Our work also uncovers a novel connection between supervised fine-tuning and point-wise preference learning, culminating in Unified Language Model Alignment, a single-step method that unifies the alignment with human demonstrations and point-wise preferences. Extensive experiments on point-wise preference datasets with binary or continuous labels validate the effectiveness of our methods. Our code and a new dataset with high-quality demonstration samples on harmlessness are released.
△ Less
Submitted 26 February, 2024; v1 submitted 5 December, 2023;
originally announced December 2023.
-
Out-of-Distribution Generalized Dynamic Graph Neural Network for Human Albumin Prediction
Authors:
Zeyang Zhang,
Xingwang Li,
Fei Teng,
Ning Lin,
Xueling Zhu,
Xin Wang,
Wenwu Zhu
Abstract:
Human albumin is essential for indicating the body's overall health. Accurately predicting plasma albumin levels and determining appropriate doses are urgent clinical challenges, particularly in critically ill patients, to maintain optimal blood levels. However, human albumin prediction is non-trivial that has to leverage the dynamics of biochemical markers as well as the experience of treating pa…
▽ More
Human albumin is essential for indicating the body's overall health. Accurately predicting plasma albumin levels and determining appropriate doses are urgent clinical challenges, particularly in critically ill patients, to maintain optimal blood levels. However, human albumin prediction is non-trivial that has to leverage the dynamics of biochemical markers as well as the experience of treating patients. Moreover, the problem of distribution shift is often encountered in real clinical data, which may lead to a decline in the model prediction performance and reduce the reliability of the model's application. In this paper, we propose a framework named Out-of-Distribution Generalized Dynamic Graph Neural Network for Human Albumin Prediction (DyG-HAP), which is able to provide accurate albumin predictions for Intensity Care Unit (ICU) patients during hospitalization. We first model human albumin prediction as a dynamic graph regression problem to model the dynamics and patient relationship. Then, we propose a disentangled dynamic graph attention mechanism to capture and disentangle the patterns whose relationship to labels under distribution shifts is invariant and variant respectively. Last, we propose an invariant dynamic graph regression method to encourage the model to rely on invariant patterns to make predictions. Moreover, we propose a dataset named Albumin level testing and nutritional dosing data for Intensive Care (ANIC) for evaluation. Extensive experiments demonstrate the superiority of our method compared to several baseline methods in human albumin prediction.
△ Less
Submitted 7 March, 2024; v1 submitted 27 November, 2023;
originally announced November 2023.
-
Managing the Uncertainty in System Dynamics Through Distributionally Robust Stability-Constrained Optimization
Authors:
Zhongda Chu,
Fei Teng
Abstract:
With the increasing penetration of Inverter-Based Resources (IBRs) and their impact on power system stability and operation, the concept of stability-constrained optimization has drawn significant attention from researchers. In order to manage the parametric uncertainty due to inaccurate modeling that influences the system dynamics, this work proposes a distributionally robust stability constraint…
▽ More
With the increasing penetration of Inverter-Based Resources (IBRs) and their impact on power system stability and operation, the concept of stability-constrained optimization has drawn significant attention from researchers. In order to manage the parametric uncertainty due to inaccurate modeling that influences the system dynamics, this work proposes a distributionally robust stability constraint formulation. However, the uncertainty of system dynamic parameters influences the stability constraints indirectly through a nonlinear and implicit relationship. To address this issue, a propagation mechanism from the uncertainty of the system dynamic parameters to the stability constraint coefficients is established. Since these coefficients are connected to the uncertain parameters through highly nonlinear and implicit functions, an approximation approach utilizing Taylor expansion and the Delta method is developed to estimate the statistical moments of the stability constraint coefficients based on the first and second-order derivatives, with which an ambiguity set for the distributionally robust optimization can be formulated. The accuracy of the uncertainty propagation as well as the effectiveness of the distributionally robust stability constraints are demonstrated through detailed case studies in the modified IEEE 39-bus system.
△ Less
Submitted 22 April, 2024; v1 submitted 7 September, 2023;
originally announced September 2023.
-
Cyber Recovery from Dynamic Load Altering Attacks: Linking Electricity, Transportation, and Cyber Networks
Authors:
Mengxiang Liu,
Zhongda Chu,
Fei Teng
Abstract:
To address the increasing vulnerability of power grids, significant attention has been focused on the attack detection and impact mitigation. However, it is still unclear how to effectively and quickly recover the cyber and physical networks from a cyberattack. In this context, this paper presents the first investigation of the Cyber Recovery from Dynamic load altering Attack (CRDA). Considering t…
▽ More
To address the increasing vulnerability of power grids, significant attention has been focused on the attack detection and impact mitigation. However, it is still unclear how to effectively and quickly recover the cyber and physical networks from a cyberattack. In this context, this paper presents the first investigation of the Cyber Recovery from Dynamic load altering Attack (CRDA). Considering the interconnection among electricity, transportation, and cyber networks, two essential sub-tasks are formulated for the CRDA: i) Optimal design of repair crew routes to remove installed malware and ii) Adaptive adjustment of system operation to eliminate the mitigation costs while guaranteeing stability. To achieve this, linear stability constraints are obtained by estimating the related eigenvalues under the variation of multiple IBR droop gains based on the sensitivity information of strategically selected sampling points. Moreover, to obtain the robust recovery strategy, the potential counter-measures from the adversary during the recovery process are modeled as maximizing the attack impact of remaining compromised resources in each step. A Mixed-Integer Linear Programming (MILP) problem can be finally formulated for the CRDA with the primary objective to reset involved droop gains and secondarily to repair all compromised loads. Case studies are performed in the modified IEEE 39-bus power system to illustrate the effectiveness of the proposed CRDA compared to the benchmark case.
△ Less
Submitted 6 September, 2023;
originally announced September 2023.
-
Task-Aware Machine Unlearning and Its Application in Load Forecasting
Authors:
Wangkun Xu,
Fei Teng
Abstract:
Data privacy and security have become a non-negligible factor in load forecasting. Previous researches mainly focus on training stage enhancement. However, once the model is trained and deployed, it may need to `forget' (i.e., remove the impact of) part of training data if the these data are found to be malicious or as requested by the data owner. This paper introduces the concept of machine unlea…
▽ More
Data privacy and security have become a non-negligible factor in load forecasting. Previous researches mainly focus on training stage enhancement. However, once the model is trained and deployed, it may need to `forget' (i.e., remove the impact of) part of training data if the these data are found to be malicious or as requested by the data owner. This paper introduces the concept of machine unlearning which is specifically designed to remove the influence of part of the dataset on an already trained forecaster. However, direct unlearning inevitably degrades the model generalization ability. To balance between unlearning completeness and model performance, a performance-aware algorithm is proposed by evaluating the sensitivity of local model parameter change using influence function and sample re-weighting. Furthermore, we observe that the statistical criterion such as mean squared error, cannot fully reflect the operation cost of the downstream tasks in power system. Therefore, a task-aware machine unlearning is proposed whose objective is a trilevel optimization with dispatch and redispatch problems considered. We theoretically prove the existence of the gradient of such an objective, which is key to re-weighting the remaining samples. We tested the unlearning algorithms on linear, CNN, and MLP-Mixer based load forecasters with a realistic load dataset. The simulation demonstrates the balance between unlearning completeness and operational cost. All codes can be found at https://github.com/xuwkk/task_aware_machine_unlearning.
△ Less
Submitted 11 March, 2024; v1 submitted 28 August, 2023;
originally announced August 2023.
-
Quantum Field Theory, Worldline Theory, and Spin Magnitude Change in Orbital Evolution
Authors:
Zvi Bern,
Dimitrios Kosmopoulos,
Andres Luna,
Radu Roiban,
Trevor Scheopner,
Fei Teng,
Justin Vines
Abstract:
A previous paper~\cite{Bern:2022kto} identified a puzzle stemming from the amplitudes-based approach to spinning bodies in general relativity: additional Wilson coefficients appear compared to current worldline approaches to conservative dynamics of generic astrophysical objects, including neutron stars. In this paper we clarify the nature of analogous Wilson coefficients in the simpler theory of…
▽ More
A previous paper~\cite{Bern:2022kto} identified a puzzle stemming from the amplitudes-based approach to spinning bodies in general relativity: additional Wilson coefficients appear compared to current worldline approaches to conservative dynamics of generic astrophysical objects, including neutron stars. In this paper we clarify the nature of analogous Wilson coefficients in the simpler theory of electrodynamics. We analyze the original field-theory construction, identifying definite-spin states some of which have negative norms, and relating the additional Wilson coefficients in the classical theory to transitions between different quantum spin states. We produce a new version of the theory which also has additional Wilson coefficients, but no negative-norm states. We match, through $\mathcal O(α^2)$ and $\mathcal O(S^2)$, the Compton amplitudes of these field theories with those of a modified worldline theory with extra degrees of freedom introduced by releasing the spin supplementary condition. We build an effective two-body Hamiltonian that matches the impulse and spin kick of the modified field theory and of the worldline theory, displaying additional Wilson coefficients compared to standard worldline approaches. The results are then compactly expressed in terms of an eikonal formula. Our key conclusion is that, contrary to standard approaches, while the magnitude of the spin tensor is still conserved, the magnitude of the spin vector can change under conserved Hamiltonian dynamics and this change is governed by the additional Wilson coefficients. For specific values of Wilson coefficients the results are equivalent to those from a definite spin obeying the spin supplementary condition, but for generic values they are physically inequivalent. These results warrant detailed studies of the corresponding issues in general relativity.
△ Less
Submitted 1 March, 2024; v1 submitted 27 August, 2023;
originally announced August 2023.
-
Semi-Supervised Dual-Stream Self-Attentive Adversarial Graph Contrastive Learning for Cross-Subject EEG-based Emotion Recognition
Authors:
Weishan Ye,
Zhiguo Zhang,
Min Zhang,
Fei Teng,
Li Zhang,
Linling Li,
Gan Huang,
Jianhong Wang,
Dong Ni,
Zhen Liang
Abstract:
Electroencephalography (EEG) is an objective tool for emotion recognition with promising applications. However, the scarcity of labeled data remains a major challenge in this field, limiting the widespread use of EEG-based emotion recognition. In this paper, a semi-supervised Dual-stream Self-Attentive Adversarial Graph Contrastive learning framework (termed as DS-AGC) is proposed to tackle the ch…
▽ More
Electroencephalography (EEG) is an objective tool for emotion recognition with promising applications. However, the scarcity of labeled data remains a major challenge in this field, limiting the widespread use of EEG-based emotion recognition. In this paper, a semi-supervised Dual-stream Self-Attentive Adversarial Graph Contrastive learning framework (termed as DS-AGC) is proposed to tackle the challenge of limited labeled data in cross-subject EEG-based emotion recognition. The DS-AGC framework includes two parallel streams for extracting non-structural and structural EEG features. The non-structural stream incorporates a semi-supervised multi-domain adaptation method to alleviate distribution discrepancy among labeled source domain, unlabeled source domain, and unknown target domain. The structural stream develops a graph contrastive learning method to extract effective graph-based feature representation from multiple EEG channels in a semi-supervised manner. Further, a self-attentive fusion module is developed for feature fusion, sample selection, and emotion recognition, which highlights EEG features more relevant to emotions and data samples in the labeled source domain that are closer to the target domain. Extensive experiments conducted on two benchmark databases (SEED and SEED-IV) using a semi-supervised cross-subject leave-one-subject-out cross-validation evaluation scheme show that the proposed model outperforms existing methods under different incomplete label conditions (with an average improvement of 5.83% on SEED and 6.99% on SEED-IV), demonstrating its effectiveness in addressing the label scarcity problem in cross-subject EEG-based emotion recognition.
△ Less
Submitted 13 August, 2023;
originally announced August 2023.
-
Control-mode as a Grid Service in Software-defined Power Grids: GFL vs GFM
Authors:
Guoxuan Cui,
Zhongda Chu,
Fei Teng
Abstract:
In power systems with high penetration of power electronics, grid-forming control is proposed to replace traditional Grid-Following Converter (GFL) in order to improve the overall system strength and resist small-signal instability in weak grids by directly forming the terminal voltage. However, sufficient headroom of both active and reactive power must be made available for Grid-Forming Converter…
▽ More
In power systems with high penetration of power electronics, grid-forming control is proposed to replace traditional Grid-Following Converter (GFL) in order to improve the overall system strength and resist small-signal instability in weak grids by directly forming the terminal voltage. However, sufficient headroom of both active and reactive power must be made available for Grid-Forming Converter (GFM) to operate, potentially leading to sub-optimal operation in steady states. This presents a new research problem to optimally allocate between GFM and GFL to balance the ability of GFMs to improve the grid strength and the potential economic loss resulting from reserved headroom. An optimization framework under software-defined grids is proposed, for the first time, to dynamically determine the optimal allocation of GFMs and GFLs in power systems at each time step of system scheduling according to system conditions, which ensures both system stability and minimum operational cost. To achieve this, the system scheduling model is expanded to simultaneously consider the constraints related to active and reactive power reserves for GFMs, as well as the system level stability. Case studies conducted on the modified IEEE 30-bus system demonstrate significant economic benefits in that the optimal proportion of GFMs in the power system can be dynamically determined while ensuring power reserve and grid stability constraints.
△ Less
Submitted 28 July, 2023;
originally announced July 2023.
-
OAFuser: Towards Omni-Aperture Fusion for Light Field Semantic Segmentation
Authors:
Fei Teng,
Jiaming Zhang,
Kunyu Peng,
Yaonan Wang,
Rainer Stiefelhagen,
Kailun Yang
Abstract:
Light field cameras, by harnessing the power of micro-lens array, are capable of capturing intricate angular and spatial details. This allows for acquiring complex light patterns and details from multiple angles, significantly enhancing the precision of image semantic segmentation, a critical aspect of scene interpretation in vision intelligence. However, the extensive angular information of light…
▽ More
Light field cameras, by harnessing the power of micro-lens array, are capable of capturing intricate angular and spatial details. This allows for acquiring complex light patterns and details from multiple angles, significantly enhancing the precision of image semantic segmentation, a critical aspect of scene interpretation in vision intelligence. However, the extensive angular information of light field cameras contains a large amount of redundant data, which is overwhelming for the limited hardware resources of intelligent vehicles. Besides, inappropriate compression leads to information corruption and data loss. To excavate representative information, we propose a new paradigm, Omni-Aperture Fusion model (OAFuser), which leverages dense context from the central view and discovers the angular information from sub-aperture images to generate a semantically consistent result. To avoid feature loss during network propagation and simultaneously streamline the redundant information from the light field camera, we present a simple yet very effective Sub-Aperture Fusion Module (SAFM) to embed sub-aperture images into angular features without any additional memory cost. Furthermore, to address the mismatched spatial information across viewpoints, we present a Center Angular Rectification Module (CARM) to realize feature resorting and prevent feature occlusion caused by asymmetric information. Our proposed OAFuser achieves state-of-the-art performance on the UrbanLF-Real and -Syn datasets and sets a new record of 84.93% in mIoU on the UrbanLF-Real Extended dataset, with a gain of +4.53%. The source code of OAFuser will be available at https://github.com/FeiBryantkit/OAFuser.
△ Less
Submitted 21 December, 2023; v1 submitted 28 July, 2023;
originally announced July 2023.
-
Stability Constrained Optimization in High IBR-Penetrated Power Systems-Part II: Constraint Validation and Applications
Authors:
Zhongda Chu,
Fei Teng
Abstract:
Multiple operational constraints of power system stability are derived analytically and reformulated into Second-Order Cone (SOC) form through a unification method in Part I of this paper. The accuracy and conservativeness of the proposed methods are illustrated in the second part. The validity of the developed constraints is tested against dynamic simulations carried out based on the modified IEE…
▽ More
Multiple operational constraints of power system stability are derived analytically and reformulated into Second-Order Cone (SOC) form through a unification method in Part I of this paper. The accuracy and conservativeness of the proposed methods are illustrated in the second part. The validity of the developed constraints is tested against dynamic simulations carried out based on the modified IEEE 39-bus system. Furthermore, the developed power system stability constraints are applied to the optimal system scheduling model. The resulting stability-constrained system scheduling problem aims to achieve most economic system operation while ensuring different stability in power systems with high Inverter-Based Resources (IBR) penetration. Moreover, based on the stability-constrained optimization model, a novel marginal unit pricing scheme is proposed to quantify the stability services of different units appropriately according to their economic value in maintaining system stability, thus providing rational incentives to the stability service provider and insightful information for the stability market development.
△ Less
Submitted 14 February, 2024; v1 submitted 22 July, 2023;
originally announced July 2023.
-
Stability Constrained Optimization in High IBR-Penetrated Power Systems-Part I: Constraint Development and Unification
Authors:
Zhongda Chu,
Fei Teng
Abstract:
Maintaining power system stability is becoming more and more challenging due to the ever-increasing inverter-interfaced renewable penetration in power systems. To ensure system stability during system operation and to provide appropriate incentives in the future market-based stability maintenance framework, it is essential to develop a comprehensive set of power system stability constraints which…
▽ More
Maintaining power system stability is becoming more and more challenging due to the ever-increasing inverter-interfaced renewable penetration in power systems. To ensure system stability during system operation and to provide appropriate incentives in the future market-based stability maintenance framework, it is essential to develop a comprehensive set of power system stability constraints which can be incorporated into system operation, market design and planning problems. In this paper, different system stability issues, including synchronization, voltage and frequency stability, are investigated and the corresponding stability conditions are analytically formulated as system operation constraints. A unified framework is further proposed to represent the stability constraints in a general form and enables effective reformulation of the impedance-based stability metrics. All the constraints are converted into linear or Second-Order-Cone (SOC) form, which can be readily implemented in any optimisation-based applications, such as system scheduling, planning and market design, thus providing significant value for multiple system stability enhancement and studies.
△ Less
Submitted 22 July, 2023;
originally announced July 2023.
-
Vehicle-to-grid plug-in forecasting for participation in ancillary services markets
Authors:
Jemima Graham,
Fei Teng
Abstract:
Electric vehicle (EV) charge points (CPs) can be used by aggregators to provide frequency response (FR) services. Aggregators must have day-ahead half-hourly forecasts of minimum aggregate vehicle-to-grid (V2G) plug-in to produce meaningful bids for the day-ahead ancillary services market. However, there is a lack of understanding on what features should be considered and how complex the forecasti…
▽ More
Electric vehicle (EV) charge points (CPs) can be used by aggregators to provide frequency response (FR) services. Aggregators must have day-ahead half-hourly forecasts of minimum aggregate vehicle-to-grid (V2G) plug-in to produce meaningful bids for the day-ahead ancillary services market. However, there is a lack of understanding on what features should be considered and how complex the forecasting model should be. This paper explores the dependency of aggregate V2G plug-in on historic plug-in levels, calendar variables, and weather conditions. These investigations are used to develop three day-ahead forecasts of minimum aggregate V2G plug-in during 30-minute window. A neural network that considers previous V2G plug-in values the day before, three days before, and seven days before, in addition to day of the week, month, and hour, is found to be the most accurate.
△ Less
Submitted 16 August, 2023; v1 submitted 14 July, 2023;
originally announced July 2023.
-
Benchmarking Explanatory Models for Inertia Forecasting using Public Data of the Nordic Area
Authors:
Jemima Graham,
Evelyn Heylen,
Yuankai Bian,
Fei Teng
Abstract:
This paper investigates the performance of a day-ahead explanatory model for inertia forecasting based on field data in the Nordic system, which achieves a 43% reduction in mean absolute percentage error (MAPE) against a state-of-the-art time-series forecast model. The generalizability of the explanatory model is verified by its consistent performance on Nordic and Great Britain datasets. Also, it…
▽ More
This paper investigates the performance of a day-ahead explanatory model for inertia forecasting based on field data in the Nordic system, which achieves a 43% reduction in mean absolute percentage error (MAPE) against a state-of-the-art time-series forecast model. The generalizability of the explanatory model is verified by its consistent performance on Nordic and Great Britain datasets. Also, it appears that a long duration of training data is not required to obtain accurate results with this model, but taking a more spatially granular approach reduces the MAPE by 3.6%. Finally, two further model enhancements are studied considering the specific features in Nordic system: (i) a monthly interaction variable applied to the day-ahead national demand forecast feature, reducing the MAPE by up to 18%; and (ii) a feature based on the inertia from hydropower, although this has a negligible impact. The field dataset used for benchmarking is also made publicly available.
△ Less
Submitted 14 July, 2023;
originally announced July 2023.
-
Enhancing Cyber-Resiliency of DER-based SmartGrid: A Survey
Authors:
Mengxiang Liu,
Fei Teng,
Zhenyong Zhang,
Pudong Ge,
Ruilong Deng,
Mingyang Sun,
Peng Cheng,
Jiming Chen
Abstract:
The rapid development of information and communications technology has enabled the use of digital-controlled and software-driven distributed energy resources (DERs) to improve the flexibility and efficiency of power supply, and support grid operations. However, this evolution also exposes geographically-dispersed DERs to cyber threats, including hardware and software vulnerabilities, communication…
▽ More
The rapid development of information and communications technology has enabled the use of digital-controlled and software-driven distributed energy resources (DERs) to improve the flexibility and efficiency of power supply, and support grid operations. However, this evolution also exposes geographically-dispersed DERs to cyber threats, including hardware and software vulnerabilities, communication issues, and personnel errors, etc. Therefore, enhancing the cyber-resiliency of DER-based smart grid - the ability to survive successful cyber intrusions - is becoming increasingly vital and has garnered significant attention from both industry and academia. In this survey, we aim to provide a systematical and comprehensive review regarding the cyber-resiliency enhancement (CRE) of DER-based smart grid. Firstly, an integrated threat modeling method is tailored for the hierarchical DER-based smart grid with special emphasis on vulnerability identification and impact analysis. Then, the defense-in-depth strategies encompassing prevention, detection, mitigation, and recovery are comprehensively surveyed, systematically classified, and rigorously compared. A CRE framework is subsequently proposed to incorporate the five key resiliency enablers. Finally, challenges and future directions are discussed in details. The overall aim of this survey is to demonstrate the development trend of CRE methods and motivate further efforts to improve the cyber-resiliency of DER-based smart grid.
△ Less
Submitted 5 March, 2024; v1 submitted 9 May, 2023;
originally announced May 2023.
-
On the 1-Wasserstein Distance between Location-Scale Distributions and the Effect of Differential Privacy
Authors:
Saurab Chhachhi,
Fei Teng
Abstract:
We provide an exact expressions for the 1-Wasserstein distance between independent location-scale distributions. The expressions are represented using location and scale parameters and special functions such as the standard Gaussian CDF or the Gamma function. Specifically, we find that the 1-Wasserstein distance between independent univariate location-scale distributions is equivalent to the mean…
▽ More
We provide an exact expressions for the 1-Wasserstein distance between independent location-scale distributions. The expressions are represented using location and scale parameters and special functions such as the standard Gaussian CDF or the Gamma function. Specifically, we find that the 1-Wasserstein distance between independent univariate location-scale distributions is equivalent to the mean of a folded distribution within the same family whose underlying location and scale are equal to the difference of the locations and scales of the original distributions. A new linear upper bound on the 1-Wasserstein distance is presented and the asymptotic bounds of the 1-Wasserstein distance are detailed in the Gaussian case. The effect of differential privacy using the Laplace and Gaussian mechanisms on the 1-Wasserstein distance is studied using the closed-form expressions and bounds.
△ Less
Submitted 28 April, 2023;
originally announced April 2023.
-
Optimal Design of Neural Network Structure for Power System Frequency Security Constraints
Authors:
Zhuoxuan Li,
Zhongda Chu,
Fei Teng
Abstract:
Recently, frequency security is challenged by high uncertainty and low inertia in power system with high penetration of Renewable Energy Sources (RES). In the context of Unit Commitment (UC) problems, frequency security constraints represented by neural networks have been developed and embedded into the optimization problem to represent complicated frequency dynamics. However, there are two major…
▽ More
Recently, frequency security is challenged by high uncertainty and low inertia in power system with high penetration of Renewable Energy Sources (RES). In the context of Unit Commitment (UC) problems, frequency security constraints represented by neural networks have been developed and embedded into the optimization problem to represent complicated frequency dynamics. However, there are two major disadvantages related to this technique: the risk of overconfident prediction and poor computational efficiency. To handle these disadvantages, novel methodologies are proposed to optimally design the neural network structure, including the use of asymmetric loss function during the training stage and scientifically selecting neural network size and topology. The effectiveness of the proposed methodologies are validated by case study which reveals the improvement of conservativeness and mitigation of computation performance issues.
△ Less
Submitted 20 August, 2023; v1 submitted 24 April, 2023;
originally announced April 2023.
-
Comparison of post-Minkowskian and self-force expansions: Scattering in a scalar charge toy model
Authors:
Leor Barack,
Zvi Bern,
Enrico Herrmann,
Oliver Long,
Julio Parra-Martinez,
Radu Roiban,
Michael S. Ruf,
Chia-Hsien Shen,
Mikhail P. Solon,
Fei Teng,
Mao Zeng
Abstract:
We compare numerical self-force results and analytical fourth-order post-Minkowskian (PM) calculations for hyperbolic-type scattering of a point-like particle carrying a scalar charge $Q$ off a Schwarzschild black hole, showing a remarkably good agreement. Specifically, we numerically compute the scattering angle including the full $O(Q^2)$ scalar-field self-force term (but ignoring the gravitatio…
▽ More
We compare numerical self-force results and analytical fourth-order post-Minkowskian (PM) calculations for hyperbolic-type scattering of a point-like particle carrying a scalar charge $Q$ off a Schwarzschild black hole, showing a remarkably good agreement. Specifically, we numerically compute the scattering angle including the full $O(Q^2)$ scalar-field self-force term (but ignoring the gravitational self-force), and compare with analytical expressions obtained in a PM framework using scattering-amplitude methods. This example provides a nontrivial, high-precision test of both calculation methods, and illustrates the complementarity of the two approaches in the context of the program to provide high-precision models of gravitational two-body dynamics. Our PM calculation is carried out through 4PM order, i.e., including all terms through $O(Q^2 G^3)$. At the fourth post-Minkowskian order the point-particle description involves two a-priori undetermined coefficients, due to contributions from tidal effects in the model under consideration. These coefficients are chosen to align the post-Minkowskian results with the self-force ones.
△ Less
Submitted 12 July, 2023; v1 submitted 18 April, 2023;
originally announced April 2023.
-
Always Strengthen Your Strengths: A Drift-Aware Incremental Learning Framework for CTR Prediction
Authors:
Congcong Liu,
Fei Teng,
Xiwei Zhao,
Zhangang Lin,
Jinghe Hu,
Jingping Shao
Abstract:
Click-through rate (CTR) prediction is of great importance in recommendation systems and online advertising platforms. When served in industrial scenarios, the user-generated data observed by the CTR model typically arrives as a stream. Streaming data has the characteristic that the underlying distribution drifts over time and may recur. This can lead to catastrophic forgetting if the model simply…
▽ More
Click-through rate (CTR) prediction is of great importance in recommendation systems and online advertising platforms. When served in industrial scenarios, the user-generated data observed by the CTR model typically arrives as a stream. Streaming data has the characteristic that the underlying distribution drifts over time and may recur. This can lead to catastrophic forgetting if the model simply adapts to new data distribution all the time. Also, it's inefficient to relearn distribution that has been occurred. Due to memory constraints and diversity of data distributions in large-scale industrial applications, conventional strategies for catastrophic forgetting such as replay, parameter isolation, and knowledge distillation are difficult to be deployed. In this work, we design a novel drift-aware incremental learning framework based on ensemble learning to address catastrophic forgetting in CTR prediction. With explicit error-based drift detection on streaming data, the framework further strengthens well-adapted ensembles and freezes ensembles that do not match the input distribution avoiding catastrophic interference. Both evaluations on offline experiments and A/B test shows that our method outperforms all baselines considered.
△ Less
Submitted 17 April, 2023;
originally announced April 2023.
-
Risk-Aware Objective-Based Forecasting in Inertia Management
Authors:
Haipeng Zhang,
Ran Li,
Yan Chen,
Zhongda Chu,
Mingyang Sun,
Fei Teng
Abstract:
The objective-based forecasting considers the asymmetric and non-linear impacts of forecasting errors on decision objectives, thus improving the effectiveness of its downstream decision-making process. However, existing objective-based forecasting methods are risk-neutral and not suitable for tasks like power system inertia management and unit commitment, of which decision-makers are usually biase…
▽ More
The objective-based forecasting considers the asymmetric and non-linear impacts of forecasting errors on decision objectives, thus improving the effectiveness of its downstream decision-making process. However, existing objective-based forecasting methods are risk-neutral and not suitable for tasks like power system inertia management and unit commitment, of which decision-makers are usually biased toward risk aversion in practice. To tackle this problem, this paper proposes a generic risk-aware objective-based forecasting method. It enables decision-makers to customize their forecasting with different risk preferences. The equivalence between the proposed method and optimization under uncertainty (stochastic/robust optimization) is established for the first time. Case studies are carried out on a Great Britain 2030 power system with system operational data from National Grid. The results show that the proposed model with deterministic optimization can approximate the performance of stochastic programming or robust optimization at only a fraction of their computational cost.
△ Less
Submitted 21 March, 2023;
originally announced March 2023.
-
The Sub-Leading Scattering Waveform from Amplitudes
Authors:
Aidan Herderschee,
Radu Roiban,
Fei Teng
Abstract:
We compute the next-to-leading order term in the scattering waveform of uncharged black holes in classical general relativity and of half-BPS black holes in $\mathcal{N}=8$ supergravity. We propose criteria, generalizing explicit calculations at next-to-leading order, for determining the terms in amplitudes that contribute to local observables. For general relativity, we construct the relevant cla…
▽ More
We compute the next-to-leading order term in the scattering waveform of uncharged black holes in classical general relativity and of half-BPS black holes in $\mathcal{N}=8$ supergravity. We propose criteria, generalizing explicit calculations at next-to-leading order, for determining the terms in amplitudes that contribute to local observables. For general relativity, we construct the relevant classical integrand through generalized unitarity in two distinct ways, (1) in a heavy-particle effective theory and (2) in general relativity minimally-coupled to scalar fields. With a suitable prescription for the matter propagator in the former, we find agreement between the two methods, thus demonstrating the absence of interference of quantum and classically-singular contributions. The classical $\mathcal{N}=8$ integrand for massive scalar fields is constructed through dimensional reduction of the known five-point one-loop integrand. Our calculation exhibits novel features compared to conservative calculations and inclusive observables, such as the appearance of master integrals with intersecting matter lines and the appearance of a classical infrared divergence whose absence from classical observables requires a suitable definition of the retarded time.
△ Less
Submitted 22 December, 2023; v1 submitted 10 March, 2023;
originally announced March 2023.
-
Preventive-Corrective Cyber-Defense: Attack-Induced Region Minimization and Cybersecurity Margin Maximization
Authors:
Jiazuo Hou,
Fei Teng,
Wenqian Yin,
Yue Song,
Yunhe Hou
Abstract:
False data injection (FDI) cyber-attacks on power systems can be prevented by strategically selecting and protecting a sufficiently large measurement subset, which, however, requires adequate cyber-defense resources for measurement protection. With any given cyber-defense resource, this paper proposes a preventive-corrective cyber-defense strategy, which minimizes the FDI attack-induced region in…
▽ More
False data injection (FDI) cyber-attacks on power systems can be prevented by strategically selecting and protecting a sufficiently large measurement subset, which, however, requires adequate cyber-defense resources for measurement protection. With any given cyber-defense resource, this paper proposes a preventive-corrective cyber-defense strategy, which minimizes the FDI attack-induced region in a preventive manner, followed by maximizing the cybersecurity margin in a corrective manner. First, this paper proposes a preventive cyber-defense strategy that minimizes the volume of the FDI attack-induced region via preventive allocation of any given measurement protection resource. Particularly, a sufficient condition for constructing the FDI unattackable lines is proposed, indicating that the FDI cyber-attack could be locally rather than globally prevented. Then, given a non-empty FDI attack-induced region, this paper proposes a corrective cyber-defense strategy that maximizes the cybersecurity margin, leading to a trade-off between the safest-but-expensive operation point (i.e., Euclidean Chebyshev center) and the cheapest-but-dangerous operation point. Simulation results on a modified IEEE 14 bus system verify the effectiveness and cost-effectiveness of the proposed preventive-corrective cyber-defense strategy.
△ Less
Submitted 13 November, 2023; v1 submitted 15 February, 2023;
originally announced February 2023.
-
Availability Adversarial Attack and Countermeasures for Deep Learning-based Load Forecasting
Authors:
Wangkun Xu,
Fei Teng
Abstract:
The forecast of electrical loads is essential for the planning and operation of the power system. Recently, advances in deep learning have enabled more accurate forecasts. However, deep neural networks are prone to adversarial attacks. Although most of the literature focuses on integrity-based attacks, this paper proposes availability-based adversarial attacks, which can be more easily implemented…
▽ More
The forecast of electrical loads is essential for the planning and operation of the power system. Recently, advances in deep learning have enabled more accurate forecasts. However, deep neural networks are prone to adversarial attacks. Although most of the literature focuses on integrity-based attacks, this paper proposes availability-based adversarial attacks, which can be more easily implemented by attackers. For each forecast instance, the availability attack position is optimally solved by mixed-integer reformulation of the artificial neural network. To tackle this attack, an adversarial training algorithm is proposed. In simulation, a realistic load forecasting dataset is considered and the attack performance is compared to the integrity-based attack. Meanwhile, the adversarial training algorithm is shown to significantly improve robustness against availability attacks. All codes are available at https://github.com/xuwkk/AAA_Load_Forecast.
△ Less
Submitted 4 January, 2023;
originally announced January 2023.
-
Scheduling of Software-Defined Microgrids for Optimal Frequency Regulation
Authors:
Zhongda Chu,
Guoxuan Cui,
Fei Teng
Abstract:
Integrated with a high share of Inverter-Based Resources (IBRs), microgrids face increasing complexity of frequency dynamics, especially after unintentional islanding from the maingrid. These IBRs, on the other hand, provide more control flexibility to shape the frequency dynamics of microgrid and together with advanced communication infrastructure offer new opportunities in the future software-de…
▽ More
Integrated with a high share of Inverter-Based Resources (IBRs), microgrids face increasing complexity of frequency dynamics, especially after unintentional islanding from the maingrid. These IBRs, on the other hand, provide more control flexibility to shape the frequency dynamics of microgrid and together with advanced communication infrastructure offer new opportunities in the future software-defined microgrids. To enhance the frequency stability of microgrids with high IBR penetration, this paper proposes an optimal scheduling framework for software-defined microgrids to maintain frequency stability by utilizing the non-essential load shedding and dynamical optimization of the virtual inertia and virtual damping from IBRs. Moreover, side effects of these services, namely, the time delay associated with non-essential load shedding and potential IBR control parameter update failure are explicitly modeled to avoid underestimations of frequency deviation and over-optimistic results. The effectiveness and significant economic value of the proposed simultaneous and dynamic virtual inertia and damping provision strategy are demonstrated based on case studies in the modified IEEE 33-bus system.
△ Less
Submitted 21 February, 2024; v1 submitted 29 December, 2022;
originally announced December 2022.
-
Design and Planning of Flexible Mobile Micro-Grids Using Deep Reinforcement Learning
Authors:
Cesare Caputo,
Michel-Alexandre Cardin,
Pudong Ge,
Fei Teng,
Anna Korre,
Ehecatl Antonio del Rio Chanona
Abstract:
Ongoing risks from climate change have impacted the livelihood of global nomadic communities, and are likely to lead to increased migratory movements in coming years. As a result, mobility considerations are becoming increasingly important in energy systems planning, particularly to achieve energy access in developing countries. Advanced Plug and Play control strategies have been recently develope…
▽ More
Ongoing risks from climate change have impacted the livelihood of global nomadic communities, and are likely to lead to increased migratory movements in coming years. As a result, mobility considerations are becoming increasingly important in energy systems planning, particularly to achieve energy access in developing countries. Advanced Plug and Play control strategies have been recently developed with such a decentralized framework in mind, more easily allowing for the interconnection of nomadic communities, both to each other and to the main grid. In light of the above, the design and planning strategy of a mobile multi-energy supply system for a nomadic community is investigated in this work. Motivated by the scale and dimensionality of the associated uncertainties, impacting all major design and decision variables over the 30-year planning horizon, Deep Reinforcement Learning (DRL) is implemented for the design and planning problem tackled. DRL based solutions are benchmarked against several rigid baseline design options to compare expected performance under uncertainty. The results on a case study for ger communities in Mongolia suggest that mobile nomadic energy systems can be both technically and economically feasible, particularly when considering flexibility, although the degree of spatial dispersion among households is an important limiting factor. Key economic, sustainability and resilience indicators such as Cost, Equivalent Emissions and Total Unmet Load are measured, suggesting potential improvements compared to available baselines of up to 25%, 67% and 76%, respectively. Finally, the decomposition of values of flexibility and plug and play operation is presented using a variation of real options theory, with important implications for both nomadic communities and policymakers focused on enabling their energy access.
△ Less
Submitted 8 December, 2022;
originally announced December 2022.
-
Cyber-physical interdependent restoration scheduling for active distribution network via ad hoc wireless communication
Authors:
Chongyu Wang,
Mingyu Yan,
Kaiyuan Pang,
Fushuan Wen,
Fei Teng
Abstract:
This paper proposes a post-disaster cyber-physical interdependent restoration scheduling (CPIRS) framework for active distribution networks (ADN) where the simultaneous damages on cyber and physical networks are considered. The ad hoc wireless device-to-device (D2D) communication is leveraged, for the first time, to establish cyber networks instantly after the disaster to support ADN restoration.…
▽ More
This paper proposes a post-disaster cyber-physical interdependent restoration scheduling (CPIRS) framework for active distribution networks (ADN) where the simultaneous damages on cyber and physical networks are considered. The ad hoc wireless device-to-device (D2D) communication is leveraged, for the first time, to establish cyber networks instantly after the disaster to support ADN restoration. The repair and operation crew dispatching, the remote-controlled network reconfiguration and the system operation with DERs can be effectively coordinated under the cyber-physical interactions. The uncertain outputs of renewable energy resources (RESs) are represented by budget-constrained polyhedral uncertainty sets. Through implementing linearization techniques on disjunctive expressions, a monolithic mixed-integer linear programming (MILP) based two-stage robust optimization model is formulated and subsequently solved by a customized column-and-constraint generation (C&CG) algorithm. Numerical results on the IEEE 123-node distribution system demonstrate the effectiveness and superiorities of the proposed CPIRS method for ADN.
△ Less
Submitted 5 November, 2022;
originally announced November 2022.
-
Perfecting one-loop BCJ numerators in SYM and supergravity
Authors:
Alex Edison,
Song He,
Henrik Johansson,
Oliver Schlotterer,
Fei Teng,
Yong Zhang
Abstract:
We take a major step towards computing $D$-dimensional one-loop amplitudes in general gauge theories, compatible with the principles of unitarity and the color-kinematics duality. For $n$-point amplitudes with either supersymmetry multiplets or generic non-supersymmetric matter in the loop, simple all-multiplicity expressions are obtained for the maximal cuts of kinematic numerators of $n$-gon dia…
▽ More
We take a major step towards computing $D$-dimensional one-loop amplitudes in general gauge theories, compatible with the principles of unitarity and the color-kinematics duality. For $n$-point amplitudes with either supersymmetry multiplets or generic non-supersymmetric matter in the loop, simple all-multiplicity expressions are obtained for the maximal cuts of kinematic numerators of $n$-gon diagrams. At $n=6,7$ points with maximal supersymmetry, we extend the cubic-diagram numerators to encode all contact terms, and thus solve the long-standing problem of \emph{simultaneously} realizing the following properties: color-kinematics duality, manifest locality, optimal power counting of loop momenta, quadratic rather than linearized Feynman propagators, compatibility with double copy as well as all graph symmetries. Color-kinematics dual representations with similar properties are presented in the half-maximally supersymmetric case at $n=4,5$ points. The resulting gauge-theory integrands and their supergravity counterparts obtained from the double copy are checked to reproduce the expected ultraviolet divergences.
△ Less
Submitted 17 February, 2023; v1 submitted 1 November, 2022;
originally announced November 2022.
-
V-Cloak: Intelligibility-, Naturalness- & Timbre-Preserving Real-Time Voice Anonymization
Authors:
Jiangyi Deng,
Fei Teng,
Yanjiao Chen,
Xiaofu Chen,
Zhaohui Wang,
Wenyuan Xu
Abstract:
Voice data generated on instant messaging or social media applications contains unique user voiceprints that may be abused by malicious adversaries for identity inference or identity theft. Existing voice anonymization techniques, e.g., signal processing and voice conversion/synthesis, suffer from degradation of perceptual quality. In this paper, we develop a voice anonymization system, named V-Cl…
▽ More
Voice data generated on instant messaging or social media applications contains unique user voiceprints that may be abused by malicious adversaries for identity inference or identity theft. Existing voice anonymization techniques, e.g., signal processing and voice conversion/synthesis, suffer from degradation of perceptual quality. In this paper, we develop a voice anonymization system, named V-Cloak, which attains real-time voice anonymization while preserving the intelligibility, naturalness and timbre of the audio. Our designed anonymizer features a one-shot generative model that modulates the features of the original audio at different frequency levels. We train the anonymizer with a carefully-designed loss function. Apart from the anonymity loss, we further incorporate the intelligibility loss and the psychoacoustics-based naturalness loss. The anonymizer can realize untargeted and targeted anonymization to achieve the anonymity goals of unidentifiability and unlinkability.
We have conducted extensive experiments on four datasets, i.e., LibriSpeech (English), AISHELL (Chinese), CommonVoice (French) and CommonVoice (Italian), five Automatic Speaker Verification (ASV) systems (including two DNN-based, two statistical and one commercial ASV), and eleven Automatic Speech Recognition (ASR) systems (for different languages). Experiment results confirm that V-Cloak outperforms five baselines in terms of anonymity performance. We also demonstrate that V-Cloak trained only on the VoxCeleb1 dataset against ECAPA-TDNN ASV and DeepSpeech2 ASR has transferable anonymity against other ASVs and cross-language intelligibility for other ASRs. Furthermore, we verify the robustness of V-Cloak against various de-noising techniques and adaptive attacks. Hopefully, V-Cloak may provide a cloak for us in a prism world.
△ Less
Submitted 26 October, 2022;
originally announced October 2022.
-
Towards Joint Electricity and Data Trading: A Scalable Cooperative Game Theoretic Approach
Authors:
Mingyu Yan,
Fei Teng
Abstract:
This paper, for the first time, proposes a joint electricity and data trading mechanism based on cooperative game theory. All prosumers first submit the parameters associated with both electricity and data to the market operator. The operator utilizes the public and prosumers' private data to forecast the distributed renewable generators (DRGs) and quantify the improvement driven by prosumers' pri…
▽ More
This paper, for the first time, proposes a joint electricity and data trading mechanism based on cooperative game theory. All prosumers first submit the parameters associated with both electricity and data to the market operator. The operator utilizes the public and prosumers' private data to forecast the distributed renewable generators (DRGs) and quantify the improvement driven by prosumers' private data in terms of reduced uncertainty set. Then, the operator maximizes the grand coalition's total payoff considering the uncertain generation of DRGs and imputes the payoff to each prosumer based on their contribution to electricity and data sharing. The mathematical formulation of the grand coalition is developed and converted into a second order cone programming problem by using an affinepolicy based robust approach. The stability of such a grand coalition is mathematically proved, i.e., all prosumers are willing to cooperate. Furthermore, to address the scalability challenge of existing payoff imputation methods in the cooperative game, a two stage optimization based approach is proposed, which is converted into a mixed integer second order cone programming and solved by the Benders decomposition. Case studies illustrate all prosumers are motivated to trade electricity and data under the joint trading framework and the proposed imputation method significantly enhances the scalability.
△ Less
Submitted 8 October, 2022;
originally announced October 2022.
-
Federated Graph-based Networks with Shared Embedding
Authors:
Tianyi Yu,
Pei Lai,
Fei Teng
Abstract:
Nowadays, user privacy is becoming an issue that cannot be bypassed for system developers, especially for that of web applications where data can be easily transferred through internet. Thankfully, federated learning proposes an innovative method to train models with distributed devices while data are kept in local storage. However, unlike general neural networks, although graph-based networks hav…
▽ More
Nowadays, user privacy is becoming an issue that cannot be bypassed for system developers, especially for that of web applications where data can be easily transferred through internet. Thankfully, federated learning proposes an innovative method to train models with distributed devices while data are kept in local storage. However, unlike general neural networks, although graph-based networks have achieved great success in classification tasks and advanced recommendation system, its high performance relies on the rich context provided by a graph structure, which is vulnerable when data attributes are incomplete. Therefore, the latter becomes a realistic problem when implementing federated learning for graph-based networks. Knowing that data embedding is a representation in a different space, we propose our Federated Graph-based Networks with Shared Embedding (Feras), which uses shared embedding data to train the network and avoids the direct sharing of original data. A solid theoretical proof of the convergence of Feras is given in this work. Experiments on different datasets (PPI, Flickr, Reddit) are conducted to show the efficiency of Feras for centralized learning. Finally, Feras enables the training of current graph-based models in the federated learning framework for privacy concern.
△ Less
Submitted 3 October, 2022;
originally announced October 2022.
-
A Wireless-Assisted Hierarchical Framework to Accommodate Mobile Energy Resources
Authors:
Pudong Ge,
Cesare Caputo,
Michel-Alexandre Cardin,
Anna Korre,
Fei Teng
Abstract:
The societal decarbonisation fosters the installation of massive renewable inverter-based resources (IBRs) in replacing fossil fuel based traditional energy supply. The efficient and reliable operation of distributed IBRs requires advanced Information and Communication Technologies (ICT) , which may lead to a huge infrastructure investment and long construction time for remote communities. Therefo…
▽ More
The societal decarbonisation fosters the installation of massive renewable inverter-based resources (IBRs) in replacing fossil fuel based traditional energy supply. The efficient and reliable operation of distributed IBRs requires advanced Information and Communication Technologies (ICT) , which may lead to a huge infrastructure investment and long construction time for remote communities. Therefore, to efficiently coordinate IBRs, we propose a low-cost hierarchical structure, especially for remote communities without existing strong ICT connections, that combines the advantages of centralised and distributed frameworks via advanced wireless communication technologies. More specifically, in each region covered by a single cellular network, dispatchable resources are controlled via a regional aggregated controller, and the corresponding regional information flow is enabled by a device-to-device (D2D) communication assisted wireless network. The wireless network can fully reuse the bandwidth to improve data flow efficiency, leading to a flexible information structure that can accommodate the plug-and-play operation of mobile IBRs. Simulation results demonstrate that the proposed wireless communication scheme significantly improves the utilization of existing bandwidth, and the dynamically allocated wireless system ensures the flexible operation of mobile IBRs.
△ Less
Submitted 19 September, 2022;
originally announced September 2022.
-
A Missing Value Filling Model Based on Feature Fusion Enhanced Autoencoder
Authors:
Xinyao Liu,
Shengdong Du,
Tianrui Li,
Fei Teng,
Yan Yang
Abstract:
With the advent of the big data era, the data quality problem is becoming more critical. Among many factors, data with missing values is one primary issue, and thus developing effective imputation models is a key topic in the research community. Recently, a major research direction is to employ neural network models such as self-organizing mappings or automatic encoders for filling missing values.…
▽ More
With the advent of the big data era, the data quality problem is becoming more critical. Among many factors, data with missing values is one primary issue, and thus developing effective imputation models is a key topic in the research community. Recently, a major research direction is to employ neural network models such as self-organizing mappings or automatic encoders for filling missing values. However, these classical methods can hardly discover interrelated features and common features simultaneously among data attributes. Especially, it is a very typical problem for classical autoencoders that they often learn invalid constant mappings, which dramatically hurts the filling performance. To solve the above-mentioned problems, we propose a missing-value-filling model based on a feature-fusion-enhanced autoencoder. We first incorporate into an autoencoder a hidden layer that consists of de-tracking neurons and radial basis function neurons, which can enhance the ability of learning interrelated features and common features. Besides, we develop a missing value filling strategy based on dynamic clustering that is incorporated into an iterative optimization process. This design can enhance the multi-dimensional feature fusion ability and thus improves the dynamic collaborative missing-value-filling performance. The effectiveness of the proposed model is validated by extensive experiments compared to a variety of baseline methods on thirteen data sets.
△ Less
Submitted 3 August, 2023; v1 submitted 29 August, 2022;
originally announced August 2022.
-
Localization of Coordinated Cyber-Physical Attacks in Power Grids Using Moving Target Defense and Deep Learning
Authors:
Yexiang Chen,
Subhash Lakshminarayana,
Fei Teng
Abstract:
As one of the most sophisticated attacks against power grids, coordinated cyber-physical attacks (CCPAs) damage the power grid's physical infrastructure and use a simultaneous cyber attack to mask its effect. This work proposes a novel approach to detect such attacks and identify the location of the line outages (due to the physical attack). The proposed approach consists of three parts. Firstly,…
▽ More
As one of the most sophisticated attacks against power grids, coordinated cyber-physical attacks (CCPAs) damage the power grid's physical infrastructure and use a simultaneous cyber attack to mask its effect. This work proposes a novel approach to detect such attacks and identify the location of the line outages (due to the physical attack). The proposed approach consists of three parts. Firstly, moving target defense (MTD) is applied to expose the physical attack by actively perturbing transmission line reactance via distributed flexible AC transmission system (D-FACTS) devices. MTD invalidates the attackers' knowledge required to mask their physical attack. Secondly, convolution neural networks (CNNs) are applied to localize line outage position from the compromised measurements. Finally, model agnostic meta-learning (MAML) is used to accelerate the training speed of CNN following the topology reconfigurations (due to MTD) and reduce the data/retraining time requirements. Simulations are carried out using IEEE test systems. The experimental results demonstrate that the proposed approach can effectively localize line outages in stealthy CCPAs.
△ Less
Submitted 25 July, 2022;
originally announced July 2022.
-
Mitigating Load-Altering Attacks Against Power Grids Using Cyber-Resilient Economic Dispatch
Authors:
Zhongda Chu,
Subhash Lakshminarayana,
Balarko Chaudhuri,
Fei Teng
Abstract:
Large-scale Load-Altering Attacks (LAAs) against Internet-of-Things (IoT) enabled high-wattage electrical appliances (e.g., wifi-enabled air-conditioners, electric vehicles, etc.) pose a serious threat to power systems' security and stability. In this work, a Cyber-Resilient Economic Dispatch (CRED) framework is presented to mitigate the destabilizing effect of LAAs while minimizing the overall op…
▽ More
Large-scale Load-Altering Attacks (LAAs) against Internet-of-Things (IoT) enabled high-wattage electrical appliances (e.g., wifi-enabled air-conditioners, electric vehicles, etc.) pose a serious threat to power systems' security and stability. In this work, a Cyber-Resilient Economic Dispatch (CRED) framework is presented to mitigate the destabilizing effect of LAAs while minimizing the overall operational cost by dynamically optimizing the frequency droop control gains of Inverter-Based Resources (IBRs). The system frequency dynamics incorporating both LAAs and the IBR droop control are modeled. The system stability constraints are explicitly derived based on parametric sensitivities. To incorporate them into the CRED model and minimize the error of the sensitivity analysis, a recursive linearization method is further proposed. A distributionally robust approach is applied to account for the uncertainty associated with the LAA detection/parameter estimation. The overall performance of the proposed CRED model is demonstrated through simulations in a modified IEEE reliability test system.
△ Less
Submitted 27 June, 2022;
originally announced June 2022.