-
A Wolf in Sheep's Clothing: Practical Black-box Adversarial Attacks for Evading Learning-based Windows Malware Detection in the Wild
Authors:
Xiang Ling,
Zhiyu Wu,
Bin Wang,
Wei Deng,
Jingzheng Wu,
Shouling Ji,
Tianyue Luo,
Yanjun Wu
Abstract:
Given the remarkable achievements of existing learning-based malware detection in both academia and industry, this paper presents MalGuise, a practical black-box adversarial attack framework that evaluates the security risks of existing learning-based Windows malware detection systems under the black-box setting. MalGuise first employs a novel semantics-preserving transformation of call-based redividing to concurrently manipulate both nodes and edges of malware's control-flow graph, making it less noticeable. By employing a Monte-Carlo-tree-search-based optimization, MalGuise then searches for an optimized sequence of call-based redividing transformations to apply to the input Windows malware for evasions. Finally, it reconstructs the adversarial malware file based on the optimized transformation sequence while adhering to Windows executable format constraints, thereby maintaining the same semantics as the original. MalGuise is systematically evaluated against three state-of-the-art learning-based Windows malware detection systems under the black-box setting. Evaluation results demonstrate that MalGuise achieves a remarkably high attack success rate, mostly exceeding 95%, with over 91% of the generated adversarial malware files maintaining the same semantics. Furthermore, MalGuise achieves up to a 74.97% attack success rate against five anti-virus products, highlighting potential tangible security concerns to real-world users.
Submitted 3 July, 2024;
originally announced July 2024.
-
Revealing the Electronic Structure of NiPS$_3$ through Synchrotron-Based ARPES and Alkali Metal Dosing
Authors:
Yifeng Cao,
Qishuo Tan,
Yucheng Guo,
Clóvis Guerim Vieira,
Mário S. C. Mazzon,
Jude Laverock,
Nicholas Russo,
Hongze Gao,
Chris Jozwiak,
Aaron Bostwick,
Eli Rotenberg,
Jinghua Guo,
Ming Yi,
Matheus J. S. Matos,
Xi Ling,
Kevin E. Smith
Abstract:
This study presents a comprehensive analysis of the band structure in NiPS$_3$, a van der Waals layered antiferromagnet, utilizing high-resolution synchrotron-based angle-resolved photoemission spectroscopy (ARPES) and corroborative density functional theory (DFT) calculations. By tuning the parameters of the light source, we obtained a very clear band structure of NiPS$_3$ over a wide energy range. Comparison with DFT calculations allows for the identification of the orbital character of the observed bands. Our DFT calculations perfectly match the experimental results, and no adaptations were made to the calculations based on the experimental outcomes. The appearance of novel electronic structure upon alkali metal dosing (AMD) was also observed in this ARPES study. Above the valence band maximum, the structure of conduction bands and bands from defect states were observed for the first time in NiPS$_3$. We provide a direct determination of the band gap of NiPS$_3$ as 1.3 eV from the band structure by AMD. In addition, detailed temperature-dependent ARPES spectra were obtained across a range that spans both below and above the Néel transition temperature of NiPS$_3$. We found that the paramagnetic and antiferromagnetic states have almost identical spectra, indicating the highly localized nature of Ni $d$ states.
Submitted 2 July, 2024;
originally announced July 2024.
-
Spectral evidence for NiPS3 as a Mott-Hubbard insulator
Authors:
Yifeng Cao,
Nicholas Russo,
Qishuo Tan,
Xi Ling,
Jinghua Guo,
Yi-de Chuang,
Kevin E. Smith
Abstract:
The layered van der Waals trichalcogenide NiPS3 has attracted widespread attention due to its unique optical, magnetic, and electronic properties. The complexity of NiPS3 itself, however, has also led to ongoing debates regarding its characteristics such as the existence of self-doped ligand holes. In this study, X-ray absorption spectroscopy and resonant inelastic X-ray scattering have been applied to investigate the electronic structure of NiPS3. With the aid of theoretical calculations using the charge-transfer multiplet model, we provide experimental evidence for NiPS3 being a Mott-Hubbard insulator rather than a charge-transfer insulator. Moreover, we explain why some previous XAS studies have concluded that NiPS3 is a charge-transfer insulator by comparing surface and bulk sensitive spectra.
Submitted 1 July, 2024;
originally announced July 2024.
-
Analysis of Channel Uncertainty in Trusted Wireless Services via Repeated Interactions
Authors:
Bingwen Chen,
Xintong Ling,
Weihang Cao,
Jiaheng Wang,
Zhi Ding
Abstract:
The coexistence of heterogeneous sub-networks in 6G poses new security and trust concerns and thus calls for a perimeterless-security model. Blockchain radio access network (B-RAN) provides a trust-building approach via repeated interactions rather than relying on pre-established trust or central authentication. Such a trust-building process naturally supports dynamic trusted services across various service providers (SP) without the need for perimeter-based authentications; however, it remains vulnerable to environmental and system unreliability such as wireless channel uncertainty. In this study, we investigate channel unreliability in the trust-building framework based on repeated interactions for secure wireless services. We derive specific requirements for achieving cooperation between SP and client via a repeated game model and illustrate the implications of channel unreliability on sustaining trusted access services. We consider the framework optimization to guarantee SP-client cooperation, given a worst-case channel condition. Furthermore, we introduce the concept of cooperation region to represent the robustness of the trust-building process and explore the maximum cooperation area to enhance service resilience. Finally, we present simulations to demonstrate the system performance over fading channels and verify our results.
Submitted 2 July, 2024; v1 submitted 26 June, 2024;
originally announced June 2024.
-
Learning Translations via Matrix Completion
Authors:
Derry Wijaya,
Brendan Callahan,
John Hewitt,
Jie Gao,
Xiao Ling,
Marianna Apidianaki,
Chris Callison-Burch
Abstract:
Bilingual Lexicon Induction is the task of learning word translations without bilingual parallel corpora. We model this task as a matrix completion problem, and present an effective and extendable framework for completing the matrix. This method harnesses diverse bilingual and monolingual signals, each of which may be incomplete or noisy. Our model achieves state-of-the-art performance for both high and low resource languages.
Submitted 19 June, 2024;
originally announced June 2024.
-
A Dual-functional Blockchain Framework for Solving Distributed Optimization
Authors:
Weihang Cao,
Xintong Ling,
Jiaheng Wang,
Xiqi Gao,
Zhi Ding
Abstract:
Proof of Work (PoW) has been extensively utilized as the foundation of blockchain's security, consistency, and tamper-resistance. However, it has long been criticized for its tremendous and inefficient utilization of computational power and energy. In this work, we design a dual-functional blockchain framework that uses solving optimization problems to reach consensus as an alternative to PoW, channeling wasted resources into useful work. We model and analyze our framework by developing discrete Markov chains, and derive the security conditions to ensure that selfish miners behave honestly. Based on the security conditions, we derive a lower bound for the security overhead and analyze the trade-off between useful work efficiency and PoW safeguard. We further examine the reward function design for the proposed dual-functional blockchain and provide practical design guidelines for reward functions assuming concavity and linearity, respectively. Finally, simulation results are presented to validate and illustrate our analytical results.
Submitted 29 May, 2024;
originally announced May 2024.
-
JUNO Sensitivity to Invisible Decay Modes of Neutrons
Authors:
JUNO Collaboration,
Angel Abusleme,
Thomas Adam,
Kai Adamowicz,
Shakeel Ahmad,
Rizwan Ahmed,
Sebastiano Aiello,
Fengpeng An,
Qi An,
Giuseppe Andronico,
Nikolay Anfimov,
Vito Antonelli,
Tatiana Antoshkina,
João Pedro Athayde Marcondes de André,
Didier Auguste,
Weidong Bai,
Nikita Balashov,
Wander Baldini,
Andrea Barresi,
Davide Basilico,
Eric Baussan,
Marco Bellato,
Marco Beretta,
Antonio Bergnoli,
Daniel Bick
, et al. (635 additional authors not shown)
Abstract:
We explore the decay of bound neutrons into invisible particles (e.g., $n\rightarrow 3 ν$ or $nn \rightarrow 2 ν$) in the JUNO liquid scintillator detector. The invisible decay includes two decay modes: $ n \rightarrow { inv} $ and $ nn \rightarrow { inv} $. The invisible decays of $s$-shell neutrons in $^{12}{\rm C}$ will leave a highly excited residual nucleus. Subsequently, some de-excitation modes of the excited residual nuclei can produce a time- and space-correlated triple coincidence signal in the JUNO detector. Based on a full Monte Carlo simulation informed with the latest available data, we estimate all backgrounds, including inverse beta decay events of the reactor antineutrino $\bar{ν}_e$, natural radioactivity, cosmogenic isotopes and neutral current interactions of atmospheric neutrinos. Pulse shape discrimination and multivariate analysis techniques are employed to further suppress backgrounds. With two years of exposure, JUNO is expected to give an order of magnitude improvement compared to the current best limits. After 10 years of data taking, the JUNO expected sensitivities at a 90% confidence level are $τ/B( n \rightarrow { inv} ) > 5.0 \times 10^{31} \, {\rm yr}$ and $τ/B( nn \rightarrow { inv} ) > 1.4 \times 10^{32} \, {\rm yr}$.
Submitted 27 May, 2024;
originally announced May 2024.
-
Differentially Private Federated Learning: A Systematic Review
Authors:
Jie Fu,
Yuan Hong,
Xinpeng Ling,
Leixia Wang,
Xun Ran,
Zhiyu Sun,
Wendy Hui Wang,
Zhili Chen,
Yang Cao
Abstract:
In recent years, privacy and security concerns in machine learning have promoted trusted federated learning to the forefront of research. Differential privacy has emerged as the de facto standard for privacy protection in federated learning due to its rigorous mathematical foundation and provable guarantee. Despite extensive research on algorithms that incorporate differential privacy within federated learning, there remains an evident deficiency in systematic reviews that categorize and synthesize these studies.
Our work presents a systematic overview of differentially private federated learning. Existing taxonomies have not adequately considered the objects and level of privacy protection provided by various differential privacy models in federated learning. To rectify this gap, we propose a new taxonomy of differentially private federated learning based on the definition and guarantee of various differential privacy models and federated scenarios. Our classification allows for a clear delineation of the protected objects across various differential privacy models and their respective neighborhood levels within federated learning environments. Furthermore, we explore the applications of differential privacy in federated learning scenarios. Our work provides valuable insights into privacy-preserving federated learning and suggests practical directions for future research.
Submitted 19 May, 2024; v1 submitted 13 May, 2024;
originally announced May 2024.
-
Physics-Informed Neural Networks: Minimizing Residual Loss with Wide Networks and Effective Activations
Authors:
Nima Hosseini Dashtbayaz,
Ghazal Farhani,
Boyu Wang,
Charles X. Ling
Abstract:
The residual loss in Physics-Informed Neural Networks (PINNs) alters the simple recursive relation of layers in a feed-forward neural network by applying a differential operator, resulting in a loss landscape that is inherently different from those of common supervised problems. Therefore, relying on the existing theory leads to unjustified design choices and suboptimal performance. In this work, we analyze the residual loss by studying its characteristics at critical points to find the conditions that result in effective training of PINNs. Specifically, we first show that under certain conditions, the residual loss of PINNs can be globally minimized by a wide neural network. Furthermore, our analysis also reveals that an activation function with well-behaved high-order derivatives plays a crucial role in minimizing the residual loss. In particular, to solve a $k$-th order PDE, the $k$-th derivative of the activation function should be bijective. The established theory paves the way for designing and choosing effective activation functions for PINNs and explains why periodic activations have shown promising performance in certain cases. Finally, we verify our findings by conducting a set of experiments on several PDEs. Our code is publicly available at https://github.com/nimahsn/pinns_tf2.
Submitted 12 June, 2024; v1 submitted 2 May, 2024;
originally announced May 2024.
-
Layout2Rendering: AI-aided Greenspace design
Authors:
Ran Chen,
Zeke Lian,
Yueheng He,
Xiao Ling,
Fuyu Yang,
Xueqi Yao,
Xingjian Yi,
Jing Zhao
Abstract:
In traditional human living environment landscape design, the establishment of three-dimensional models is an essential step for designers to intuitively present the spatial relationships of design elements, as well as a foundation for conducting landscape analysis on the site. Rapidly and effectively generating beautiful and realistic landscape spaces is a significant challenge faced by designers. Although generative design has been widely applied in related fields, existing approaches mostly generate three-dimensional models through the restriction of indicator parameters. However, the elements of landscape design are complex and have unique requirements, making it difficult to generate designs from the perspective of indicator limitations. To address these issues, this study proposes a park space generative design system based on deep learning technology. This system generates design plans based on the topological relationships of landscape elements, then vectorizes the plan element information, and uses Grasshopper to generate three-dimensional models while synchronously fine-tuning parameters, rapidly completing the entire process from basic site conditions to model effect analysis. Experimental results show that: (1) the system, with the aid of AI-assisted technology, can rapidly generate green space schemes that meet the designer's perspective based on site conditions; (2) this study has vectorized and three-dimensionalized various types of landscape design elements based on semantic information; (3) the analysis and visualization module constructed in this study can perform landscape analysis on the generated three-dimensional models and produce node effect diagrams, allowing users to modify the design in real time based on the effects, thus enhancing the system's interactivity.
Submitted 21 April, 2024;
originally announced April 2024.
-
Thermal conversion of ultrathin nickel hydroxide for wide bandgap 2D nickel oxides
Authors:
Lu Ping,
Nicholas Russo,
Zifan Wang,
Ching-Hsiang Yao,
Kevin E. Smith,
Xi Ling
Abstract:
Wide bandgap (WBG) semiconductors (Eg > 2.0 eV) are integral to the advancement of next generation electronics, optoelectronics, and power industries, owing to their capability for high temperature operation, high breakdown voltage and efficient light emission. Enhanced power efficiency and functional performance can be attained through miniaturization, specifically via the integration of device fabrication into two-dimensional (2D) structure enabled by WBG 2D semiconductors. However, as an essential subgroup of WBG semiconductors, 2D transition metal oxides (TMOs) remain largely underexplored in terms of physical properties and applications in 2D opto-electronic devices, primarily due to the scarcity of sufficiently large 2D crystals. Thus, our goal is to develop synthesis pathways for 2D TMOs possessing large crystal domain (e.g. >10 nm), expanding the 2D TMOs family and providing insights for future engineering of 2D TMOs. Here, we demonstrate the synthesis of WBG 2D nickel oxide (NiO) (Eg > 2.7 eV) thermally converted from 2D nickel hydroxide (Ni(OH)2) with the lateral domain size larger than 10 μm. Moreover, the conversion process is investigated using various microscopic techniques such as atomic force microscopy (AFM), Raman spectroscopy, transmission electron microscopy (TEM) and X-ray photoelectron spectroscopy (XPS), providing significant insights on the morphology and structure variation under different oxidative conditions. The electronic structure of the converted NixOy is further investigated using multiple soft X-ray spectroscopies, such as X-ray absorption (XAS) and emission spectroscopies (XES).
Submitted 15 April, 2024;
originally announced April 2024.
-
Productions of $X(3872)$/$Z_c(3900)$ and $X_2(4013)$/$Z_c(4020)$ in $Y(4220)$ and $Y(4360)$ decays
Authors:
Ming-Zhu Liu,
Xi-Zhe Ling,
Li-Sheng Geng
Abstract:
The two excited vector charmonium states $Y(4220)$ and $Y(4360)$ are difficult to understand as pure $c\bar{c}$ charmonium states. Since they are located close to the mass thresholds of $\bar{D}D_{1}$ and $\bar{D}^*D_{1}$, they can be viewed as $\bar{D}D_{1}$ and $\bar{D}^*D_{1}$ molecules. Furthermore, recent studies indicated that the exotic states $X(3872)$/$Z_c(3900)$ and $X_2(4013)$/$Z_c(4020)$ are the isoscalar/isovector $\bar{D}D^{*}$ and isoscalar/isovector $\bar{D}^*D^{*}$ molecules, respectively. In this work, in the molecular picture, we employ the triangle diagram mechanism to study the productions of $Z_{c}(3900)$ and $X(3872)$ in the pionic and radiative decays of $Y(4220)$, as well as their heavy-quark spin symmetry (HQSS) partners, i.e., the productions of $Z_{c}(4020)$ and $X_2(4013)$ in the pionic and radiative decays of $Y(4360)$. Using the effective Lagrangian approach, we obtain the ratios of the branching fractions $\mathcal{B}[Y(4360) \to Z_c(4020)π]/\mathcal{B}[Y(4220)\to Z_c(3900)π]=1.2$ and $\mathcal{B}[Y(4360)\to X_2(4013) γ]/\mathcal{B}[Y(4220)\to X(3872)γ]=0.5$, almost independent of model parameters, which indicate that the productions of $X_2(4013)$ and $Z_c(4020)$ in the radiative and pionic decays of $Y(4360)$ are likely to be measured in the future. The experimental studies of the predicted decay modes will help verify the molecular nature of $X(3872)$, $Z_c(3900)$, and $Y(4220)$. We hope the present work can stimulate experimental and further theoretical studies on these decay modes.
Submitted 11 April, 2024;
originally announced April 2024.
-
When Large Language Models Confront Repository-Level Automatic Program Repair: How Well They Done?
Authors:
Yuxiao Chen,
Jingzheng Wu,
Xiang Ling,
Changjiang Li,
Zhiqing Rui,
Tianyue Luo,
Yanjun Wu
Abstract:
In recent years, large language models (LLMs) have demonstrated substantial potential in addressing automatic program repair (APR) tasks. However, the current evaluation of these models for APR tasks focuses solely on the limited context of the single function or file where the bug is located, overlooking the valuable information in the repository-level context. This paper investigates the performance of popular LLMs in handling repository-level repair tasks. We introduce RepoBugs, a new benchmark comprising 124 typical repository-level bugs from open-source repositories. Preliminary experiments using GPT3.5, given only the function where the error is located, reveal that the repair rate on RepoBugs is only 22.58%, significantly diverging from the performance of GPT3.5 on function-level bugs in related studies. This underscores the importance of providing repository-level context when addressing bugs at this level. However, the repository-level context offered by the preliminary method often proves redundant and imprecise and easily exceeds the prompt length limit of LLMs. To solve the problem, we propose a simple and universal repository-level context extraction method (RLCE) designed to provide more precise context for repository-level code repair tasks. Evaluations of three mainstream LLMs show that RLCE significantly enhances the ability to repair repository-level bugs. The improvement reaches a maximum of 160% compared to the preliminary method. Additionally, we conduct a comprehensive analysis of the effectiveness and limitations of RLCE, along with the capacity of LLMs to address repository-level bugs, offering valuable insights for future research.
Submitted 1 March, 2024;
originally announced March 2024.
-
l1-norm regularized l1-norm best-fit lines
Authors:
Xiao Ling,
Paul Brooks
Abstract:
In this work, we propose an optimization framework for estimating a sparse robust one-dimensional subspace. Our objective is to minimize both the representation error and the penalty, in terms of the l1-norm criterion. Given that the problem is NP-hard, we introduce a linear relaxation-based approach. Additionally, we present a novel fitting procedure, utilizing simple ratios and sorting techniques. The proposed algorithm demonstrates a worst-case time complexity of $O(n^2 m \log n)$ and, in certain instances, achieves global optimality for the sparse robust subspace, thereby exhibiting polynomial time efficiency. Compared to extant methodologies, the proposed algorithm finds the subspace with the lowest discordance, offering a smoother trade-off between sparsity and fit. Its architecture affords scalability, evidenced by a 16-fold improvement in computational speeds for matrices of 2000x2000 over the CPU version. Furthermore, this method is distinguished by several advantages, including its independence from initialization and deterministic and replicable procedures. The real-world example demonstrates the effectiveness of the algorithm in achieving meaningful sparsity, underscoring its precise and useful application across various domains.
Submitted 6 March, 2024; v1 submitted 26 February, 2024;
originally announced February 2024.
-
FedFDP: Fairness-Aware Federated Learning with Differential Privacy
Authors:
Xinpeng Ling,
Jie Fu,
Kuncan Wang,
Huifa Li,
Tong Cheng,
Zhili Chen
Abstract:
Federated learning (FL) is a new machine learning paradigm to overcome the challenge of data silos and has garnered significant attention. However, through our observations, a globally effective trained model may exhibit performance disparities across different clients. This implies that the models jointly trained by clients may lead to unfair outcomes. On the other hand, relevant studies indicate that the transmission of gradients or models in federated learning can also give rise to privacy leakage issues, such as membership inference attacks.
To address the first issue mentioned above, we propose a fairness-aware federated learning algorithm, termed FedFair. Building upon FedFair, we introduce privacy protection to form the FedFDP algorithm to address the second issue mentioned above. In FedFDP, we devise a fairness-aware clipping strategy to achieve differential privacy while adjusting fairness. Additionally, for the extra uploaded loss values, we present an adaptive clipping approach to maximize utility. Furthermore, we theoretically prove that our algorithm converges and ensures differential privacy. Lastly, extensive experimental results demonstrate that FedFair and FedFDP significantly outperform state-of-the-art solutions in terms of model performance and fairness. Code and data are accessible at https://anonymous.4open.science/r/FedFDP-5607.
Submitted 20 May, 2024; v1 submitted 25 February, 2024;
originally announced February 2024.
-
SRNDiff: Short-term Rainfall Nowcasting with Condition Diffusion Model
Authors:
Xudong Ling,
Chaorong Li,
Fengqing Qin,
Peng Yang,
Yuanyuan Huang
Abstract:
Diffusion models are widely used in image generation because they can generate high-quality and realistic samples. This is in contrast to generative adversarial networks (GANs) and variational autoencoders (VAEs), which have some limitations in terms of image quality. We introduce the diffusion model to the precipitation forecasting task and propose a short-term precipitation nowcasting method with a condition diffusion model based on historical observational data, which is referred to as SRNDiff. By incorporating an additional conditional decoder module in the denoising process, SRNDiff achieves end-to-end conditional rainfall prediction. SRNDiff is composed of two networks: a denoising network and a conditional Encoder network. The conditional network is composed of multiple independent UNet networks. These networks extract conditional feature maps at different resolutions, providing accurate conditional information that guides the diffusion model for conditional generation. SRNDiff surpasses GANs in terms of prediction accuracy, although it requires more computational resources. The SRNDiff model exhibits higher stability and efficiency during training than GAN-based approaches, and generates high-quality precipitation distribution samples that better reflect future actual precipitation conditions. This fully validates the advantages and potential of diffusion models in precipitation forecasting, providing new insights for enhancing rainfall prediction.
Submitted 21 February, 2024;
originally announced February 2024.
-
Two-stage Rainfall-Forecasting Diffusion Model
Authors:
XuDong Ling,
ChaoRong Li,
FengQing Qin,
LiHong Zhu,
Yuanyuan Huang
Abstract:
Deep neural networks have made great achievements in rainfall prediction. However, the current forecasting methods have certain limitations, such as blurry generated images and incorrect spatial positions. To overcome these challenges, we propose a Two-stage Rainfall-Forecasting Diffusion Model (TRDM) aimed at improving the accuracy of long-term rainfall forecasts and addressing the imbalance in performance between temporal and spatial modeling. TRDM is a two-stage method for rainfall prediction tasks. The task of the first stage is to capture robust temporal information while preserving spatial information under low-resolution conditions. The task of the second stage is to reconstruct the low-resolution images generated in the first stage into high-resolution images. We demonstrate state-of-the-art results on the MRMS and Swedish radar datasets. Our project is open source and available on GitHub at: https://github.com/clearlyzerolxd/TRDM.
Submitted 20 February, 2024;
originally announced February 2024.
-
eCeLLM: Generalizing Large Language Models for E-commerce from Large-scale, High-quality Instruction Data
Authors:
Bo Peng,
Xinyi Ling,
Ziru Chen,
Huan Sun,
Xia Ning
Abstract:
With tremendous efforts on developing effective e-commerce models, conventional e-commerce models show limited success in generalist e-commerce modeling, and suffer from unsatisfactory performance on new users and new products - a typical out-of-domain generalization challenge. Meanwhile, large language models (LLMs) demonstrate outstanding performance in generalist modeling and out-of-domain generalizability in many fields. Toward fully unleashing their power for e-commerce, in this paper, we construct ECInstruct, the first open-sourced, large-scale, and high-quality benchmark instruction dataset for e-commerce. Leveraging ECInstruct, we develop eCeLLM, a series of e-commerce LLMs, by instruction-tuning general-purpose LLMs. Our comprehensive experiments and evaluation demonstrate that eCeLLM models substantially outperform baseline models, including the most advanced GPT-4, and the state-of-the-art task-specific models in in-domain evaluation. Moreover, eCeLLM exhibits excellent generalizability to out-of-domain settings, including unseen products and unseen instructions, highlighting its superiority as a generalist e-commerce model. Both the ECInstruct dataset and the eCeLLM models show great potential in empowering versatile and effective LLMs for e-commerce. ECInstruct and eCeLLM models are publicly accessible through https://ninglab.github.io/eCeLLM.
Submitted 13 February, 2024;
originally announced February 2024.
-
Generalizing across Temporal Domains with Koopman Operators
Authors:
Qiuhao Zeng,
Wei Wang,
Fan Zhou,
Gezheng Xu,
Ruizhi Pu,
Changjian Shui,
Christian Gagne,
Shichun Yang,
Boyu Wang,
Charles X. Ling
Abstract:
In the field of domain generalization, the task of constructing a predictive model capable of generalizing to a target domain without access to target data remains challenging. This problem becomes further complicated when considering evolving dynamics between domains. While various approaches have been proposed to address this issue, a comprehensive understanding of the underlying generalization theory is still lacking. In this study, we contribute novel theoretic results that aligning conditional distribution leads to the reduction of generalization bounds. Our analysis serves as a key motivation for solving the Temporal Domain Generalization (TDG) problem through the application of Koopman Neural Operators, resulting in Temporal Koopman Networks (TKNets). By employing Koopman Operators, we effectively address the time-evolving distributions encountered in TDG using the principles of Koopman theory, where measurement functions are sought to establish linear transition relations between evolving domains. Through empirical evaluations conducted on synthetic and real-world datasets, we validate the effectiveness of our proposed approach.
Submitted 15 February, 2024; v1 submitted 12 February, 2024;
originally announced February 2024.
-
AdvSQLi: Generating Adversarial SQL Injections against Real-world WAF-as-a-service
Authors:
Zhenqing Qu,
Xiang Ling,
Ting Wang,
Xiang Chen,
Shouling Ji,
Chunming Wu
Abstract:
As the first defensive layer that attacks would hit, the web application firewall (WAF) plays an indispensable role in defending against malicious web attacks like SQL injection (SQLi). With the development of cloud computing, WAF-as-a-service, as one kind of Security-as-a-service, has been proposed to facilitate the deployment, configuration, and update of WAFs in the cloud. Despite its tremendous popularity, the security vulnerabilities of WAF-as-a-service are still largely unknown, which is highly concerning given its massive usage. In this paper, we propose a general and extendable attack framework, namely AdvSQLi, in which a minimal series of transformations are performed on the hierarchical tree representation of the original SQLi payload, such that the generated SQLi payloads can not only bypass WAF-as-a-service under black-box settings but also keep the same functionality and maliciousness as the original payload. With AdvSQLi, we make it feasible to inspect and understand the security vulnerabilities of WAFs automatically, helping vendors make products more secure. To evaluate the attack effectiveness and efficiency of AdvSQLi, we first employ two public datasets to generate adversarial SQLi payloads, leading to a maximum attack success rate of 100% against state-of-the-art ML-based SQLi detectors. Furthermore, to demonstrate the immediate security threats caused by AdvSQLi, we evaluate the attack effectiveness against 7 WAF-as-a-service solutions from mainstream vendors and find all of them are vulnerable to AdvSQLi. For instance, AdvSQLi achieves an attack success rate of over 79% against the F5 WAF. Through in-depth analysis of the evaluation results, we further condense out several general yet severe flaws of these vendors that cannot be easily patched.
Submitted 9 January, 2024; v1 submitted 4 January, 2024;
originally announced January 2024.
-
Trading Off Scalability, Privacy, and Performance in Data Synthesis
Authors:
Xiao Ling,
Tim Menzies,
Christopher Hazard,
Jack Shu,
Jacob Beel
Abstract:
Synthetic data has been widely applied in the real world recently. One typical example is the creation of synthetic data for privacy-sensitive datasets. In this scenario, synthetic data substitutes for the real data, which contains private information, and is used for public testing of machine learning models. Another typical example is over-sampling of imbalanced data, in which synthetic data is generated in the region of minority samples to balance the ratio of positive to negative samples when training machine learning models. In this study, we concentrate on the first example, and introduce (a) the Howso engine, and (b) our proposed random projection based synthetic data generation framework. We evaluate these two algorithms on the aspects of privacy preservation and accuracy, and compare them to two state-of-the-art synthetic data generation algorithms, DataSynthesizer and Synthetic Data Vault. We show that the synthetic data generated by the Howso engine has good privacy and accuracy, which results in the best overall score. On the other hand, our proposed random projection based framework can generate synthetic data with the highest accuracy score, and has the fastest scalability.
Submitted 8 December, 2023;
originally announced December 2023.
-
Productions of $D^*_{s0}(2317)$ and $D_{s1}(2460)$ in $B_{(s)}$ and $Λ_b(Ξ_b)$ decays
Authors:
Ming-Zhu Liu,
Xi-Zhe Ling,
Li-Sheng Geng
Abstract:
Recent studies show that $D_{s0}^{\ast}(2317)$ and $D_{s1}(2460)$ contain large molecular components. In this work, we employ the naive factorization approach to calculate the production rates of $D_{s0}^{\ast}(2317)$ and $D_{s1}(2460)$ as hadronic molecules in $B_{(s)}$ and $Λ_b(Ξ_b)$ decays, where their decay constants are estimated in the effective Lagrangian approach. With the so-obtained decay constants $f_{D_{s0}^{\ast}(2317)}$ and $f_{D_{s1}(2460)}$, we calculate the branching fractions of the $b$-meson decays $B_{(s)}\to \bar{D}_{(s)}^{(*)}D_{s0}^*$ and $B_{(s)}\to \bar{D}_{(s)}^{(*)}D_{s1}$ and the $b$-baryon decays $Λ_b(Ξ_{b}) \to Λ_c(Ξ_{c}) D_{s0}^*$ and $Λ_b(Ξ_{b}) \to Λ_c(Ξ_c) D_{s1}$. Our results show that the production rates of $D_{s0}^{\ast}(2317)$ and $D_{s1}(2460)$ in the $B_s$, $Λ_b$ and $Ξ_b$ decays are large enough that future experiments could observe them. In particular, we demonstrate that one can extract the decay constants of hadronic molecules via the triangle mechanism because of the equivalence of the triangle mechanism to the tree diagram established in calculating the decays $B \to \bar{D}^{(*)}D_{s0}^{\ast}(2317)$ and $B \to \bar{D}^{(*)}D_{s1}(2460)$.
Submitted 17 March, 2024; v1 submitted 3 December, 2023;
originally announced December 2023.
-
Optimizing and Fine-tuning Large Language Model for Urban Renewal
Authors:
Xi Wang,
Xianyao Ling,
Tom Zhang,
Xuecao Li,
Shaolan Wang,
Zhixing Li,
Liang Zhang,
Peng Gong
Abstract:
This study aims to innovatively explore adaptive applications of large language models (LLM) in urban renewal. It also aims to improve its performance and text generation quality for knowledge question-answering (QA) tasks. Based on the ChatGLM, we automatically generate QA datasets using urban renewal scientific literature corpora in a self-instruct manner and then conduct joint fine-tuning training on the model using the Prefix and LoRA fine-tuning methods to create an LLM for urban renewal. By guiding the LLM to automatically generate QA data based on prompt words and given text, it is possible to quickly obtain datasets in the urban renewal field and provide data support for the fine-tuning training of LLMs. The experimental results show that the joint fine-tuning training method proposed in this study can significantly improve the performance of LLM on the QA tasks. Compared with LoRA fine-tuning, the method improves the Bleu and Rouge metrics on the test by about 5%; compared with the model before fine-tuning, the method improves the Bleu and Rouge metrics by about 15%-20%. This study demonstrates the effectiveness and superiority of the joint fine-tuning method using Prefix and LoRA for ChatGLM in the urban renewal knowledge QA tasks. It provides a new approach for fine-tuning LLMs on urban renewal-related tasks.
Submitted 26 November, 2023;
originally announced November 2023.
-
Hessian Aware Low-Rank Perturbation for Order-Robust Continual Learning
Authors:
Jiaqi Li,
Yuanhao Lai,
Rui Wang,
Changjian Shui,
Sabyasachi Sahoo,
Charles X. Ling,
Shichun Yang,
Boyu Wang,
Christian Gagné,
Fan Zhou
Abstract:
Continual learning aims to learn a series of tasks sequentially without forgetting the knowledge acquired from the previous ones. In this work, we propose the Hessian Aware Low-Rank Perturbation algorithm for continual learning. By modeling the parameter transitions along the sequential tasks with the weight matrix transformation, we propose to apply the low-rank approximation on the task-adaptive parameters in each layer of the neural networks. Specifically, we theoretically demonstrate the quantitative relationship between the Hessian and the proposed low-rank approximation. The approximation ranks are then globally determined according to the marginal increment of the empirical loss estimated by the layer-specific gradient and low-rank approximation error. Furthermore, we control the model capacity by pruning less important parameters to diminish the parameter growth. We conduct extensive experiments on various benchmarks, including a dataset with large-scale tasks, and compare our method against some recent state-of-the-art methods to demonstrate the effectiveness and scalability of our proposed method. Empirical results show that our method performs better on different benchmarks, especially in achieving task order robustness and handling the forgetting issue. The source code is at https://github.com/lijiaqi/HALRP.
Submitted 7 July, 2024; v1 submitted 25 November, 2023;
originally announced November 2023.
-
Observation of three-state nematicity and domain evolution in atomically-thin antiferromagnetic NiPS3
Authors:
Qishuo Tan,
Connor A. Occhialini,
Hongze Gao,
Jiaruo Li,
Hikari Kitadai,
Riccardo Comin,
Xi Ling
Abstract:
Nickel phosphorus trisulfide (NiPS3), a van der Waals (vdW) 2D antiferromagnet, has captivated enormous attention for its intriguing physics in recent years. However, despite its fundamental importance in the physics of magnetism and promising potential for technological applications, the study of magnetic domains in NiPS3 down to the atomically thin limit is still lacking. Here, we report the layer-dependent magnetic characteristics and magnetic domains within antiferromagnetic NiPS3 by employing linear dichroism (LD) combined with polarized microscopy, spin-correlated photoluminescence (PL), and Raman spectroscopy. Our results reveal the existence of the paramagnetic-to-antiferromagnetic phase transition in bulk to bilayer NiPS3 with stronger spin fluctuation in thinner NiPS3. Furthermore, our study identifies three distinct antiferromagnetic domains within atomically thin NiPS3 and captures the thermally-activated domain evolution. Our findings provide crucial insights for the development of antiferromagnetic spintronics and related technologies.
Submitted 27 February, 2024; v1 submitted 20 November, 2023;
originally announced November 2023.
-
DP-DCAN: Differentially Private Deep Contrastive Autoencoder Network for Single-cell Clustering
Authors:
Huifa Li,
Jie Fu,
Zhili Chen,
Xiaomin Yang,
Haitao Liu,
Xinpeng Ling
Abstract:
Single-cell RNA sequencing (scRNA-seq) is important to transcriptomic analysis of gene expression. Recently, deep learning has facilitated the analysis of high-dimensional single-cell data. Unfortunately, deep learning models may leak sensitive information about users. As a result, Differential Privacy (DP) is increasingly used to protect privacy. However, existing DP methods usually perturb whole neural networks to achieve differential privacy, and hence result in great performance overheads. To address this challenge, in this paper, we take advantage of the unique property of the autoencoder that it outputs only the dimension-reduced vector in the middle of the network, and design a Differentially Private Deep Contrastive Autoencoder Network (DP-DCAN) by partial network perturbation for single-cell clustering. Since noise is added to only part of the network, the performance improvement is obvious and twofold: one part of the network is trained with less noise due to a bigger privacy budget, and the other part is trained without any noise. Experimental results on six datasets have verified that DP-DCAN is superior to the traditional DP scheme with whole network perturbation. Moreover, DP-DCAN demonstrates strong robustness to adversarial attacks.
Submitted 13 May, 2024; v1 submitted 6 November, 2023;
originally announced November 2023.
-
Robust Backdoor Attacks on Object Detection in Real World
Authors:
Yaguan Qian,
Boyuan Ji,
Shuke He,
Shenhui Huang,
Xiang Ling,
Bin Wang,
Wei Wang
Abstract:
Deep learning models are widely deployed in many applications, such as object detection in various security fields. However, these models are vulnerable to backdoor attacks. Backdoor attacks have been intensively studied on classification models, but little on object detection. Previous works mainly focused on backdoor attacks in the digital world and neglected the real world. In particular, the effect of a backdoor attack in the real world is easily influenced by physical factors like distance and illumination. In this paper, we propose a variable-size backdoor trigger that adapts to the different sizes of attacked objects, overcoming the disturbance caused by the distance between the viewing point and the attacked object. In addition, we propose a backdoor training method named malicious adversarial training, enabling the backdoor object detector to learn the features of the trigger with physical noise. The experimental results show that this robust backdoor attack (RBA) can enhance the attack success rate in the real world.
Submitted 16 September, 2023;
originally announced September 2023.
-
Real-time Monitoring for the Next Core-Collapse Supernova in JUNO
Authors:
Angel Abusleme,
Thomas Adam,
Shakeel Ahmad,
Rizwan Ahmed,
Sebastiano Aiello,
Muhammad Akram,
Abid Aleem,
Fengpeng An,
Qi An,
Giuseppe Andronico,
Nikolay Anfimov,
Vito Antonelli,
Tatiana Antoshkina,
Burin Asavapibhop,
João Pedro Athayde Marcondes de André,
Didier Auguste,
Weidong Bai,
Nikita Balashov,
Wander Baldini,
Andrea Barresi,
Davide Basilico,
Eric Baussan,
Marco Bellato,
Marco Beretta,
Antonio Bergnoli
, et al. (606 additional authors not shown)
Abstract:
The core-collapse supernova (CCSN) is considered one of the most energetic astrophysical events in the universe. The early and prompt detection of neutrinos before (pre-SN) and during the supernova (SN) burst presents a unique opportunity for multi-messenger observations of CCSN events. In this study, we describe the monitoring concept and present the sensitivity of the system to pre-SN and SN neutrinos at the Jiangmen Underground Neutrino Observatory (JUNO), a 20 kton liquid scintillator detector currently under construction in South China. The real-time monitoring system is designed to ensure both prompt alert speed and comprehensive coverage of progenitor stars. It incorporates prompt monitors on the electronic board as well as online monitors at the data acquisition stage. Assuming a false alert rate of 1 per year, this monitoring system exhibits sensitivity to pre-SN neutrinos up to a distance of approximately 1.6 (0.9) kiloparsecs and SN neutrinos up to about 370 (360) kiloparsecs for a progenitor mass of 30 solar masses, considering both normal and inverted mass ordering scenarios. The pointing ability of the CCSN is evaluated by analyzing the accumulated event anisotropy of inverse beta decay interactions from pre-SN or SN neutrinos. This, along with the early alert, can play a crucial role in facilitating follow-up multi-messenger observations of the next galactic or nearby extragalactic CCSN.
Submitted 4 December, 2023; v1 submitted 13 September, 2023;
originally announced September 2023.
-
Latency Analysis of LEO Satellite Relay Communication: An Application of Conditional Contact Angle Distribution
Authors:
Sixi Cheng,
Xiang Ling
Abstract:
This article investigates the transmission delay of a Low Earth Orbit (LEO) satellite communication system in a bent pipe structure. By employing a stochastic geometry framework, satellites are modeled as spherical binomial point processes (BPP). A suboptimal satellite relay selection strategy is proposed, which achieves optimal conditions through theoretical analysis and numerical exploration. We derive the distance distributions for the uplink and downlink, and provide corresponding analytical expressions for the transmission delays.
Submitted 11 September, 2023;
originally announced September 2023.
-
ALI-DPFL: Differentially Private Federated Learning with Adaptive Local Iterations
Authors:
Xinpeng Ling,
Jie Fu,
Kuncan Wang,
Haitao Liu,
Zhili Chen
Abstract:
Federated Learning (FL) is a distributed machine learning technique that allows model training among multiple devices or organizations by sharing training parameters instead of raw data. However, adversaries can still infer individual information through inference attacks (e.g. differential attacks) on these training parameters. As a result, Differential Privacy (DP) has been widely used in FL to prevent such attacks.
We consider differentially private federated learning in a resource-constrained scenario, where both privacy budget and communication rounds are constrained. By theoretically analyzing the convergence, we can find the optimal number of local DPSGD iterations for clients between any two sequential global updates. Based on this, we design an algorithm of Differentially Private Federated Learning with Adaptive Local Iterations (ALI-DPFL). We evaluate our algorithm on the MNIST, FashionMNIST and CIFAR-10 datasets, and demonstrate significantly better performance than previous work in the resource-constrained scenario. Code is available at https://github.com/cheng-t/ALI-DPFL.
Submitted 22 May, 2024; v1 submitted 21 August, 2023;
originally announced August 2023.
-
The Lobster Eye Imager for Astronomy Onboard the SATech-01 Satellite
Authors:
Z. X. Ling,
X. J. Sun,
C. Zhang,
S. L. Sun,
G. Jin,
S. N. Zhang,
X. F. Zhang,
J. B. Chang,
F. S. Chen,
Y. F. Chen,
Z. W. Cheng,
W. Fu,
Y. X. Han,
H. Li,
J. F. Li,
Y. Li,
Z. D. Li,
P. R. Liu,
Y. H. Lv,
X. H. Ma,
Y. J. Tang,
C. B. Wang,
R. J. Xie,
Y. L. Xue,
A. L. Yan
, et al. (101 additional authors not shown)
Abstract:
The Lobster Eye Imager for Astronomy (LEIA), a pathfinder of the Wide-field X-ray Telescope of the Einstein Probe (EP) mission, was successfully launched onboard the SATech-01 satellite of the Chinese Academy of Sciences on 27 July 2022. In this paper, we introduce the design and on-ground test results of the LEIA instrument. Using state-of-the-art Micro-Pore Optics (MPO), a wide field-of-view (FoV) of 346 square degrees (18.6 degrees $\times$ 18.6 degrees) of the X-ray imager is realized. An optical assembly composed of 36 MPO chips is used to focus incident X-ray photons, and four large-format complementary metal-oxide semiconductor (CMOS) sensors, each of 6 cm $\times$ 6 cm, are used as the focal plane detectors. The instrument has an angular resolution of 4 - 8 arcmin (in FWHM) for the central focal spot of the point spread function, and an effective area of 2 - 3 cm$^2$ at 1 keV in essentially all the directions within the field of view. The detection passband is 0.5 - 4 keV in the soft X-rays and the sensitivity is 2 - 3 $\times$ $10^{-11}$ erg s$^{-1}$ cm$^{-2}$ (about 1 mini-Crab) for a 1,000-second observation. The total weight of LEIA is 56 kg and the power is 85 W. The satellite, with a design lifetime of 2 years, operates in a Sun-synchronous orbit of 500 km with an orbital period of 95 minutes. LEIA is paving the way for future missions by verifying in flight the technologies of both novel focusing imaging optics and CMOS sensors for X-ray observation, and by optimizing the working setups of the instrumental parameters. In addition, LEIA is able to carry out scientific observations to find new transients and to monitor known sources in the soft X-ray band, albeit with limited useful observing time available.
Submitted 24 May, 2023;
originally announced May 2023.
-
Large-scale and Efficient Texture Mapping Algorithm via Loopy Belief Propagation
Authors:
Xiao Ling,
Rongjun Qin
Abstract:
Texture mapping as a fundamental task in 3D modeling has been well established for well-acquired aerial assets under consistent illumination, yet it remains a challenge when it is scaled to large datasets with images under varying views and illuminations. A well-performing texture mapping algorithm must be able to efficiently select views, fuse and map textures from these views to mesh models, and at the same time achieve consistent radiometry over the entire model. Existing approaches achieve efficiency either by limiting the number of images to one view per face, or by simplifying global inferences to only achieve local color consistency. In this paper, we break this tie by proposing a novel and efficient texture mapping framework that allows the use of multiple views of texture per face while achieving global color consistency. The proposed method leverages a loopy belief propagation algorithm to perform efficient, global-level probabilistic inference to rank candidate views per face, which enables face-level multi-view texture fusion and blending. The texture fusion algorithm, being non-parametric, brings another advantage over typical parametric post color correction methods, due to its improved robustness to non-linear illumination differences. The experiments on three different types of datasets (i.e. satellite dataset, unmanned-aerial vehicle dataset and close-range dataset) show that the proposed method produces visually pleasant and texturally consistent results in all scenarios, with an added advantage of consuming less running time as compared to the state of the art methods, especially for large-scale datasets such as satellite-derived models.
Submitted 8 May, 2023;
originally announced May 2023.
-
On the Benefits of Semi-Supervised Test Case Generation for Simulation Models
Authors:
Xiao Ling,
Tim Menzies
Abstract:
Testing complex simulation models can be expensive and time-consuming. Current state-of-the-art methods that explore this problem are fully-supervised; i.e. they require that all examples are labeled. On the other hand, the GenClu system (introduced in this paper) takes a semi-supervised approach; i.e. (a) only a small subset of information is actually labeled (via simulation) and (b) those labels are then spread across the rest of the data. When applied to five open-source simulation models of cyber-physical systems, GenClu's test generation can be multiple orders of magnitude faster than the prior state of the art. Further, when assessed via mutation testing, tests generated by GenClu were as good as or better than anything else tested here. Hence, we recommend semi-supervised methods over prior methods (evolutionary search and fully-supervised learning).
Submitted 1 December, 2023; v1 submitted 5 May, 2023;
originally announced May 2023.
-
Imaging Strain-Localized Single-Photon Emitters in Layered GaSe below the Diffraction Limit
Authors:
Weijun Luo,
Benjamin Lawrie,
Alexander Puretzky,
Qishuo Tan,
Gage Eichman,
Edward Mcgee,
Anna Swan,
Liangbo Liang,
Xi Ling
Abstract:
Nanoscale strain control of exciton funneling is an increasingly critical tool for the scalable production of single photon emitters (SPEs) in two-dimensional materials. However, conventional far-field optical microscopies remain constrained in spatial resolution by the diffraction limit and thus can only provide a limited description of nanoscale strain localization of SPEs. Here, we quantify the effects of nanoscale heterogeneous strain on the energy and brightness of GaSe SPEs on nanopillars with correlative cathodoluminescence, photoluminescence, and atomic force microscopies supported by density functional theory simulations. We report that the strain-localized SPEs have a broad range of emission wavelengths from 620 nm to 900 nm. We reveal substantial strain-controlled SPE wavelength tunability over a ~100 nm spectral range and a two-orders-of-magnitude enhancement in the SPE brightness at the pillar center due to Type-I exciton funneling. In addition, we show that radiative biexciton cascade processes contribute to the observed CL photon superbunching. Also, the measured GaSe SPE photophysics after electron beam exposure shows the excellent stability of these SPEs. We anticipate this insight into nanoscale strain control of two-dimensional SPEs will guide the development of truly deterministic quantum photonics.
Submitted 4 May, 2023;
originally announced May 2023.
-
CornerFormer: Boosting Corner Representation for Fine-Grained Structured Reconstruction
Authors:
Hongbo Tian,
Yulong Li,
Linzhi Huang,
Xu Ling,
Yue Yang,
Jiani Hu
Abstract:
Structured reconstruction is a non-trivial dense prediction problem, which extracts structural information (e.g., building corners and edges) from a raster image, then reconstructs it to a 2D planar graph accordingly. Compared with common segmentation or detection problems, it relies significantly on the capability to leverage holistic geometric information for structural reasoning. Current transformer-based approaches tackle this challenging problem in a two-stage manner, detecting corners in the first model and classifying the proposed edges (corner-pairs) in the second model. However, they separate the two stages into different models and only share the backbone encoder. Unlike the existing modeling strategies, we present an enhanced corner representation method: 1) It fuses knowledge between the corner detection and edge prediction by sharing features at different granularities; 2) Corner candidates are proposed in four heatmap channels w.r.t. their direction. Both qualitative and quantitative evaluations demonstrate that our proposed method can better reconstruct fine-grained structures, such as adjacent corners and tiny edges. Consequently, it outperforms the state-of-the-art model by +1.9%@F-1 on Corner and +3.0%@F-1 on Edge.
Submitted 12 December, 2023; v1 submitted 14 April, 2023;
originally announced April 2023.
-
Predictions for feed-down enhancements at the $Λ_c \bar{D}$ and $Λ_c \bar{D}^*$ thresholds via the triangle and box singularities
Authors:
Ming-Xiao Duan,
Lin Qiu,
Xi-Zhe Ling,
Qiang Zhao
Abstract:
We demonstrate that triangle singularity (TS) and box singularity (BS) mechanisms can produce unique narrow enhancements at the $Λ_c\bar{D}$ and $Λ_c\bar{D}^*$ thresholds in the invariant mass spectra of $J/ψp$ and $J/ψpπ$, respectively. Taking into account that such mechanisms only depend on the initial $Σ_c^{(*)}\bar{D}^{(*)}$ interactions near threshold within the TS or BS kinematic regimes, the $Λ_c\bar{D}$ and $Λ_c\bar{D}^*$ threshold enhancements can be regarded as a feed-down phenomenon originating from both the heavier pentaquark decays and the $Σ_c^{(*)}\bar{D}^{(*)}$ scatterings from the continuum. A search for these structures in the $J/ψp$ and $J/ψpπ$ spectra in both exclusive and semi-inclusive processes will provide smoking-gun evidence for the hadronic molecule nature of those observed pentaquarks and clarify the role played by the TS and BS in the near-threshold dynamics.
Submitted 23 March, 2023;
originally announced March 2023.
-
JUNO sensitivity to $^7$Be, $pep$, and CNO solar neutrinos
Authors:
Angel Abusleme,
Thomas Adam,
Shakeel Ahmad,
Rizwan Ahmed,
Sebastiano Aiello,
Muhammad Akram,
Abid Aleem,
Tsagkarakis Alexandros,
Fengpeng An,
Qi An,
Giuseppe Andronico,
Nikolay Anfimov,
Vito Antonelli,
Tatiana Antoshkina,
Burin Asavapibhop,
João Pedro Athayde Marcondes de André,
Didier Auguste,
Weidong Bai,
Nikita Balashov,
Wander Baldini,
Andrea Barresi,
Davide Basilico,
Eric Baussan,
Marco Bellato,
Marco Beretta
, et al. (592 additional authors not shown)
Abstract:
The Jiangmen Underground Neutrino Observatory (JUNO), the first multi-kton liquid scintillator detector, which is under construction in China, will have a unique potential to perform a real-time measurement of solar neutrinos well below the few MeV threshold typical for Water Cherenkov detectors. JUNO's large target mass and excellent energy resolution are prerequisites for reaching unprecedented levels of precision. In this paper, we provide an estimation of the JUNO sensitivity to $^7$Be, $pep$, and CNO solar neutrinos that can be obtained via a spectral analysis above the 0.45 MeV threshold. This study is performed assuming different scenarios of the liquid scintillator radiopurity, ranging from the most optimistic one corresponding to the radiopurity levels obtained by the Borexino experiment, up to the minimum requirements needed to perform the neutrino mass ordering determination with reactor antineutrinos - the main goal of JUNO. Our study shows that in most scenarios, JUNO will be able to improve the current best measurements on $^7$Be, $pep$, and CNO solar neutrino fluxes. We also perform a study on the JUNO capability to detect periodical time variations in the solar neutrino flux, such as the day-night modulation induced by neutrino flavor regeneration in Earth, and the modulations induced by temperature changes driven by helioseismic waves.
Submitted 7 March, 2023;
originally announced March 2023.
-
Van der Waals device integration beyond the limits of van der Waals forces via adhesive matrix transfer
Authors:
Peter F. Satterthwaite,
Weikun Zhu,
Patricia Jastrzebska-Perfect,
Melbourne Tang,
Hongze Gao,
Hikari Kitadai,
Ang-Yu Lu,
Qishuo Tan,
Shin-Yi Tang,
Yu-Lun Chueh,
Chia-Nung Kuo,
Chin Shan Lue,
Jing Kong,
Xi Ling,
Farnaz Niroui
Abstract:
Pristine van der Waals (vdW) interfaces between two-dimensional (2D) and other materials are core to emerging optical and electronic devices. Their direct fabrication is, however, challenged as the vdW forces are weak and cannot be tuned to accommodate integration of arbitrary layers without solvents, sacrificial-layers or high-temperatures, steps that can introduce damage. To address these limitations, we introduce a single-step 2D material-to-device integration approach in which forces promoting transfer are decoupled from the vdW forces at the interface of interest. We use this adhesive matrix transfer to demonstrate conventionally-forbidden direct integration of diverse 2D materials (MoS2, WSe2, PtS2, GaS) with dielectrics (SiO2, Al2O3), and scalable, aligned heterostructure formation, both foundational to device development. We then demonstrate a single-step integration of monolayer-MoS2 into arrays of transistors. With no exposure to polymers or solvents, clean interfaces and pristine surfaces are preserved, which can be further engineered to demonstrate both n- and p-type behavior. Beyond serving as a platform to probe the intrinsic properties of sensitive nanomaterials without the influence of processing steps, our technique allows efficient formation of unconventional device form-factors, with an example of flexible transistors demonstrated.
Submitted 12 February, 2023;
originally announced February 2023.
-
Dive into the Resolution Augmentations and Metrics in Low Resolution Face Recognition: A Plain yet Effective New Baseline
Authors:
Xu Ling,
Yichen Lu,
Wenqi Xu,
Weihong Deng,
Yingjie Zhang,
Xingchen Cui,
Hongzhi Shi,
Dongchao Wen
Abstract:
Although deep learning has significantly improved Face Recognition (FR), dramatic performance deterioration may occur when processing Low Resolution (LR) faces. To alleviate this, approaches based on a unified feature space have been proposed, at the cost of performance under High Resolution (HR) circumstances. To deal with the huge domain gap between HR and LR domains and achieve the best results on both domains, we first take a closer look at the impacts of several resolution augmentations and then analyze the difficulty of LR samples from the perspective of the model gradient produced by different resolution samples. We also find that the introduction of some resolutions could help the learning of lower resolutions. Based on these, we divide the LR samples into three difficulties according to the resolution and propose a more effective Multi-Resolution Augmentation. Then, due to the rapidly increasing domain gap as the resolution decreases, we carefully design a novel and effective metric loss based on a LogExp distance function that provides decent gradients to prevent oscillation near the convergence point or tolerance to small distance errors; it could also dynamically adjust the penalty for errors in different dimensions, allowing for more optimization of dimensions with large errors. Combining these two insights, our model could learn more general knowledge in a wide resolution range of images, and balanced results can be achieved by our extremely simple framework. Moreover, the augmentations and metrics are the cornerstones of LRFR, so our method could be considered a new baseline for the LRFR task. Experiments on the LRFR datasets SCface and XQLFW and the large-scale LRFR dataset TinyFace demonstrate the effectiveness of our methods, while the degradation on HRFR datasets is significantly reduced.
Submitted 11 February, 2023;
originally announced February 2023.
-
Design and test results of different aluminum coating layers on the sCMOS sensors for soft X-ray detection
Authors:
W. X. Wang,
Z. X. Ling,
C. Zhang,
W. M. Yuan,
S. N. Zhang
Abstract:
In recent years, tremendous progress has been made on complementary metal-oxide-semiconductor (CMOS) sensors for applications as X-ray detectors. To shield the visible light in X-ray detection, a blocking filter of aluminum is commonly employed. We designed three types of aluminum coating layers, which are deposited directly on the surface of back-illuminated sCMOS sensors during fabrication. A commercial 2k × 2k sCMOS sensor is used to realize these designs. In this work, we report their performance by comparison with that of an uncoated sCMOS sensor. The optical transmissions at 660 nm and 850 nm are measured, and the results show that the optical transmission reaches a level of about $10^{-9}$ for the 200 nm aluminum layer and about $10^{-4}$ for the 100 nm aluminum layer. Light leakage is found around the four sides of the sensor. The readout noise, fixed-pattern noise and energy resolution of these Al-coated sCMOS sensors do not show significant changes. The dark currents of these Al-coated sCMOS sensors show a noticeable increase compared with that of the uncoated sCMOS sensor at room temperature, while no significant difference is found when the sCMOS sensors are cooled down to about -15 degrees. The aluminum coatings show no visible cracks after the thermal cycle and aging tests. Based on these results, an aluminum coating of a larger area on larger sCMOS sensors is proposed for future work.
Submitted 30 November, 2022; v1 submitted 28 November, 2022;
originally announced November 2022.
-
Spin-state Directed Synthesis of >20 micrometers 2D Layered Transition Metal Hydroxides via Edge-on Condensation
Authors:
Lu Ping,
Gillian E. Minarik,
Hongze Gao,
Jun Cao,
Tianshu Li,
Hikari Kitadai,
Xi Ling
Abstract:
Layered transition metal hydroxides (LTMHs) with transition metal centers sandwiched between layers of coordinating hydroxide anions have attracted considerable interest for their potential in developing clean energy sources and storage technologies. However, two-dimensional (2D) LTMHs remain largely unstudied in terms of their physical properties and their applications in electronic devices. Here, directed by the relationship between the spin state of 3d transition metal (TM) ions such as Ni, Co, Cu, and the corresponding geometry of the crystal field, we discover that Ni2+ with perfect Oh symmetry is ideal for intraplanar growth, leading to the achievement of >20 μm α-Ni(OH)2 2D crystals with high yield, which are the largest 2D domains reported so far. We also report the successful synthesis of 2D Co(OH)2 crystals (>40 μm) with lower yield due to the slight geometry distortion resulting from an uneven number of electrons. Moreover, detailed structural characterization of the synthesized α-Ni(OH)2 is performed; the optical band gap energy is extrapolated as 2.54 eV from optical absorption measurements and is measured as 2.50 eV from reflected electrons energy loss spectroscopy (REELS), suggesting its potential as an insulating 2D dielectric material for electronic devices. Furthermore, key parameters of the hydrothermal reaction, including soaking temperature, starting pH and cooling rate, are systematically tuned to understand their effects from morphological and crystallographic perspectives, allowing the establishment of a 2D growth mechanism. This work demonstrates a scalable pathway to synthesize large 2D LTMHs from simple methods, paving the way for the study of fundamental physical properties and device applications of 2D LTMHs.
Submitted 25 November, 2022;
originally announced November 2022.
-
First wide field-of-view X-ray observations by a lobster eye focusing telescope in orbit
Authors:
C. Zhang,
Z. X. Ling,
X. J. Sun,
S. L. Sun,
Y. Liu,
Z. D. Li,
Y. L. Xue,
Y. F. Chen,
Y. F. Dai,
Z. Q. Jia,
H. Y. Liu,
X. F. Zhang,
Y. H. Zhang,
S. N. Zhang,
F. S. Chen,
Z. W. Cheng,
W. Fu,
Y. X. Han,
H. Li,
J. F. Li,
Y. Li,
P. R. Liu,
X. H. Ma,
Y. J. Tang,
C. B. Wang
, et al. (53 additional authors not shown)
Abstract:
As a novel X-ray focusing technology, lobster eye micro-pore optics (MPO) feature both a wide observing field of view and true imaging capability, promising sky monitoring with significantly improved sensitivity and spatial resolution in soft X-rays. Since first proposed by Angel (1979), the optics have been extensively studied, developed and trialed over the past decades. In this Letter, we report on the first-light results from a flight experiment of the Lobster Eye Imager for Astronomy ($LEIA$), a pathfinder of the wide-field X-ray telescope of the Einstein Probe mission. The piggyback imager, launched in July 2022, has a mostly un-vignetted field of view of $18.6^\circ \times 18.6^\circ $. Its spatial resolution is in the range of 4$-$7 arcmin in FWHM and the focal spot effective area is 2$-$3 cm$^2$, both showing only mild fluctuations across the field of view. We present images of the Galactic center region, Sco X-1 and the diffuse Cygnus Loop nebula taken in snapshot observations over 0.5$-$4 keV. These are truly wide-field X-ray images of celestial bodies observed, for the first time, by a focusing imaging telescope. Initial analyses of the in-flight data show excellent agreement between the observed images and the on-ground calibration and simulations. The instrument and its characterization are briefly described, as well as the flight experiment. The results provide a solid basis for the development of the present and proposed wide-field X-ray missions using lobster eye MPO.
Submitted 17 November, 2022;
originally announced November 2022.
-
SA-DPSGD: Differentially Private Stochastic Gradient Descent based on Simulated Annealing
Authors:
Jie Fu,
Zhili Chen,
XinPeng Ling
Abstract:
Differential privacy (DP) provides a formal privacy guarantee that prevents adversaries with access to machine learning models from extracting information about individual training points. Differentially private stochastic gradient descent (DPSGD) is the most popular training method with differential privacy in image recognition. However, existing DPSGD schemes lead to significant performance degradation, which hinders the application of differential privacy. In this paper, we propose a simulated annealing-based differentially private stochastic gradient descent scheme (SA-DPSGD) which accepts a candidate update with a probability that depends both on the update quality and on the number of iterations. Through this random update screening, we make the differentially private gradient descent proceed in the right direction in each iteration, ultimately resulting in a more accurate model. In our experiments, under the same hyperparameters, our scheme achieves test accuracies of 98.35%, 87.41% and 60.92% on the datasets MNIST, FashionMNIST and CIFAR10, respectively, compared to the state-of-the-art results of 98.12%, 86.33% and 59.34%. Under freely adjusted hyperparameters, our scheme achieves even higher accuracies: 98.89%, 88.50% and 64.17%. We believe that our method contributes substantially to closing the accuracy gap between private and non-private image classification.
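The update-screening idea in this abstract (keep a candidate update with a probability that depends on its quality and on the iteration count) can be illustrated with a generic Metropolis-style acceptance rule. The sketch below is not the paper's formula: the cooling schedule and the names `acceptance_probability`, `accept_update`, `t0`, and `decay` are assumptions made for illustration, and gradient clipping, noise injection, and privacy accounting are all omitted.

```python
import math
import random


def acceptance_probability(delta_loss, iteration, t0=1.0, decay=0.05):
    """Metropolis-style probability of keeping a candidate update.

    delta_loss: candidate loss minus current loss (negative = improvement).
    iteration:  zero-based step index; the temperature shrinks as it grows.
    t0, decay:  hypothetical hyperparameters, not taken from the paper.
    """
    if delta_loss <= 0:
        return 1.0  # improvements are always kept
    temperature = t0 / (1.0 + decay * iteration)  # assumed cooling schedule
    return math.exp(-delta_loss / temperature)


def accept_update(delta_loss, iteration, rng, **kwargs):
    """Randomly decide whether to keep the candidate update."""
    return rng.random() < acceptance_probability(delta_loss, iteration, **kwargs)


if __name__ == "__main__":
    rng = random.Random(0)
    for step in (0, 10, 100):
        p = acceptance_probability(0.5, step)
        print(f"step={step:3d}  p_accept(worse by 0.5)={p:.4f}  "
              f"kept={accept_update(0.5, step, rng)}")
```

Under this assumed schedule, a worsening update is tolerated early in training and rejected more often later, which matches the abstract's description of a probability that depends on both update quality and iteration number.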
Submitted 13 December, 2022; v1 submitted 14 November, 2022;
originally announced November 2022.
-
Deterministic Localization of Strain-induced Single-photon Emitters in Multilayer GaSe
Authors:
Weijun Luo,
Alexander Puretzky,
Benjamin Lawrie,
Qishuo Tan,
Hongze Gao,
Zhuofa Chen,
Alexander Sergienko,
Anna Swan,
Liangbo Liang,
Xi Ling
Abstract:
Nanoscale strain has emerged as a powerful tool for controlling single-photon emitters (SPEs) in atomically thin transition metal dichalcogenides (TMDCs)(1, 2). However, quantum emitters in monolayer TMDCs are typically unstable in ambient conditions. Multilayer two-dimensional (2D) TMDCs could be a solution, but they suffer from low quantum efficiency, resulting in low brightness of the SPEs. Here, we report the deterministic spatial localization of strain-induced single-photon emitters in multilayer GaSe by nanopillar arrays. The strain-controlled quantum confinement effect introduces well-isolated sub-bandgap photoluminescence and corresponding suppression of the broad band edge photoluminescence. Clear photon-antibunching behavior is observed from the quantum dot-like GaSe sub-bandgap exciton emission at 3.5 Kelvin. The strain-dependent confinement potential and the brightness are found to be strongly correlated, suggesting a promising route for tuning and controlling SPEs. The comprehensive investigations of strain-engineered GaSe SPEs provide a solid foundation for the development of 2D devices for quantum photonic technologies.
Submitted 27 October, 2022;
originally announced October 2022.
-
Heterogeneous earning responses to inheritance: new event-study evidence from Norway
Authors:
Xiaoguang Ling
Abstract:
It has long been assumed that inheritances, particularly large ones, have a negative effect on the labor supply of inheritors. Using Norwegian registry data, I examine the inheritance-induced decline in inheritors' wages and occupational income. In contrast to prior research, my estimates allow the dynamic effect of inheritances on labor supply to vary among inheritor cohorts. The estimation approach adopted and the 25-year long panel data make it possible to trace the dynamics of the effect for at least 20 years, which is twice as long as the study period in previous studies. Since all observations in the sample are inheritors, I avoid the selection problem arising in studies employing non-inheritors as controls. I find that large parental inheritances (more than one million Norwegian kroner) reduce annual wage and occupational income by, at most, 4.3%, which is about half the decrease previously identified. The magnitude of the effect increases with the size of the inheritance. Large inheritances also increase the probability of being self-employed by more than 1%, although entrepreneurship may be dampened by inheritances that are excessively large. The inheritance effect lasts for up to 10 years and is heterogeneous across sexes and age groups. Male heirs are more likely to reduce their labor supply after receiving the transfer. Young heirs are more likely to be self-employed, and their annual occupational income is, therefore, less affected by inheritances in the long run; for the very young inheriting large amounts of wealth from their grandparents, the probability of their attaining a post-secondary education declines by 2%.
Submitted 4 November, 2022; v1 submitted 21 September, 2022;
originally announced September 2022.
-
Production of $D^*_{s0}(2317)$ and $D_{s1}(2460)$ in $B$ decays as $D^{(*)}K$ and $D^{(*)}_sη$ molecules
Authors:
Ming-Zhu Liu,
Xi-Zhe Ling,
Li-Sheng Geng,
En-Wang,
Ju-Jun Xie
Abstract:
The molecular nature of $D_{s0}^{\ast}(2317)$ and $D_{s1}(2460)$ has been extensively studied from the perspective of their masses, decay properties, and production rates. In this work, we study the weak decays of $B \to \bar{D}^{(\ast)}D_{s0}^{*}(2317)$ and $B \to \bar{D}^{(\ast)}D_{s1}(2460)$ by invoking triangle diagrams where the $B$ meson first decays weakly into $\bar{D}^{(\ast)}D_{s}^{(\ast)}$ and $J/ψK$($η_{c}K$), and then the $D_{s0}^{\ast}(2317)$ and $D_{s1}(2460)$ are dynamically generated by the final-state interactions of $D_{s}^{(\ast)}η$ and $D^{(\ast)}K$ via exchanges of $η$ and $D^{(\ast)}$ mesons. The obtained absolute branching fractions of Br$[B \to \bar{D}^{(\ast)}D_{s0}^{*}(2317)]$ are in reasonable agreement with the experimental data, while the branching fractions of Br$[B \to \bar{D}^{(\ast)}D_{s1}(2460)]$ are smaller than the experimental central values by almost a factor of two to three. We tentatively attribute such a discrepancy to either reaction mechanisms missing in the present work or the likely existence of a relatively larger $c\bar{s}$ component in the $D_{s1}(2460)$ wave function.
Submitted 6 December, 2022; v1 submitted 2 September, 2022;
originally announced September 2022.
-
The effect of ambient air pollution on birth outcomes in Norway
Authors:
Xiaoguang Ling
Abstract:
Ambient air pollution is harmful to the fetus even in countries with relatively low levels of pollution. In this paper, I examine the effects of ambient air pollution on birth outcomes in Norway. I find that prenatal exposure to ambient nitric oxide in the last trimester causes significant birth weight and birth length loss under the same sub-postcode fixed effects and calendar month fixed effects, whereas other ambient air pollutants such as nitrogen dioxide and sulfur dioxide appear to be at safe levels for the fetus in Norway. In addition, the marginal adverse effect of ambient nitric oxide is larger for newborns with disadvantaged parents. Both average concentrations of nitric oxide and occasional high concentration events can adversely affect birth outcomes. The contributions of my work include: first, my finding that prenatal exposure to environmental nitric oxide has an adverse effect on birth outcomes fills a long-standing knowledge gap. Second, with the large sample size and geographic division of sub-postal codes in Norway, I can control for a rich set of spatio-temporal fixed effects to overcome most of the endogeneity problems caused by the choice of residential area and date of delivery. In addition, I study ambient air pollution in a low-pollution setting, which provides new evidence on the health effects of low ambient air pollution.
Submitted 25 May, 2023; v1 submitted 12 August, 2022;
originally announced August 2022.
-
Learn From All: Erasing Attention Consistency for Noisy Label Facial Expression Recognition
Authors:
Yuhang Zhang,
Chengrui Wang,
Xu Ling,
Weihong Deng
Abstract:
Noisy label Facial Expression Recognition (FER) is more challenging than traditional noisy label classification tasks due to the inter-class similarity and the annotation ambiguity. Recent works mainly tackle this problem by filtering out large-loss samples. In this paper, we explore dealing with noisy labels from a new feature-learning perspective. We find that FER models remember noisy samples by focusing on a part of the features that can be considered related to the noisy labels instead of learning from the whole features that lead to the latent truth. Inspired by that, we propose a novel Erasing Attention Consistency (EAC) method to suppress the noisy samples during the training process automatically. Specifically, we first utilize the flip semantic consistency of facial images to design an imbalanced framework. We then randomly erase input images and use flip attention consistency to prevent the model from focusing on a part of the features. EAC significantly outperforms state-of-the-art noisy label FER methods and generalizes well to other tasks with a large number of classes like CIFAR100 and Tiny-ImageNet. The code is available at https://github.com/zyh-uaiaaaa/Erasing-Attention-Consistency.
Submitted 20 September, 2022; v1 submitted 21 July, 2022;
originally announced July 2022.
-
Towards the Desirable Decision Boundary by Moderate-Margin Adversarial Training
Authors:
Xiaoyu Liang,
Yaguan Qian,
Jianchang Huang,
Xiang Ling,
Bin Wang,
Chunming Wu,
Wassim Swaileh
Abstract:
Adversarial training, as one of the most effective defense methods against adversarial attacks, tends to learn an inclusive decision boundary to increase the robustness of deep learning models. However, due to the large and unnecessary increase in the margin along adversarial directions, adversarial training causes heavy cross-over between natural examples and adversarial examples, which is not conducive to balancing the trade-off between robustness and natural accuracy. In this paper, we propose a novel adversarial training scheme to achieve a better trade-off between robustness and natural accuracy. It aims to learn a moderate-inclusive decision boundary, which means that the margins of natural examples under the decision boundary are moderate. We call this scheme Moderate-Margin Adversarial Training (MMAT), which generates finer-grained adversarial examples to mitigate the cross-over problem. We also take advantage of logits from a teacher model that has been well-trained to guide the learning of our model. Finally, MMAT achieves high natural accuracy and robustness under both black-box and white-box attacks. On SVHN, for example, state-of-the-art robustness and natural accuracy are achieved.
Submitted 15 July, 2022;
originally announced July 2022.
-
Search for hidden-charm pentaquark states in three-body final states
Authors:
Jia-Ming Xie,
Xi-Zhe Ling,
Ming-Zhu Liu,
Li-Sheng Geng
Abstract:
The three pentaquark states, $P_c(4312)$, $P_c(4440)$ and $P_c(4457)$, discovered by the LHCb Collaboration in 2019, are widely recognized as $\bar{D}^{(\ast)}\Sigma_{c}$ hadronic molecules. Together with their four $\bar{D}^{(\ast)}\Sigma_{c}^{\ast}$ partners dictated by heavy quark spin symmetry, they form a complete multiplet of $\bar{D}^{(\ast)}\Sigma_{c}^{(\ast)}$ hadronic molecules. It is widely recognized that other discovery channels play an important role in understanding their nature. In this work, we investigate two three-body decay modes of the $\bar{D}^{(\ast)}\Sigma_{c}^{(\ast)}$ molecules. The tree-level modes proceed via off-shell $\Sigma_{c}^{(\ast)}$ baryons, $\bar{D}^{(\ast)}\Sigma_{c}^{(\ast)} \to \bar{D}^{(\ast)}\left(\Sigma_{c}^{(\ast)}\to \Lambda_{c}\pi\right)\to\bar{D}^{(\ast)}\Lambda_{c}\pi$, while the triangle-loop modes proceed through $\bar{D}^{\ast}\Sigma_{c}^{(\ast)}\to J/\psi N\pi$, $\eta_{c}N\pi$ via $\bar{D}\Sigma_{c}^{(\ast)}$ rescattering to $J/\psi N$ and $\eta_{c}N$. Our results indicate that the decay widths of the $P_{c}(4457)$ and $\bar{D}^{(\ast)}\Sigma_{c}^{\ast}$ states into $\bar{D}^{(\ast)}\Lambda_{c}\pi$ are several MeV and, as a result, can be observed in the upcoming Run 3 and Run 4 of the LHC. The partial decay widths into $\bar{D}^{(\ast)}\Lambda_{c}\pi$ of the $P_{c}(4312)$ and $P_{c}(4440)$ states range from tens to hundreds of keV. In addition, the partial decay widths of $\bar{D}^{\ast}\Sigma_{c}$ molecules into $J/\psi N\pi$ and $\eta_{c}N\pi$ are several keV and tens of keV, respectively, and the partial decay widths of $\bar{D}^{\ast}\Sigma_{c}^{\ast}$ molecules into $J/\psi N\pi$ vary from several keV to tens of keV. These three-body decay modes are of great value for further observations of the pentaquark states and for a better understanding of their nature.
Submitted 23 November, 2022; v1 submitted 26 April, 2022;
originally announced April 2022.