Search | arXiv e-print repository

arXiv:2407.16791 [pdf, other]

Towards understanding interactions between the AO system and segment co-phasing with the vector-Zernike wavefront sensor on Keck

Authors: Maïssa Salama, Charlotte Guthery, Vincent Chambouleyron, Rebecca Jensen-Clem, J. Kent Wallace, Mitchell Troy, Jacques-Robert Delorme, Daren Dillon, Daniel Echeverri, Yeyuan, Xin, Wen Hao, Xuan, Nemanja Jovanovic, Dimitri Mawet, Peter L. Wizinowich, Rachel Bowens-Rubin

Abstract: We extend our previous demonstration of the first on-sky primary mirror segment closed-loop control on Keck using a vector-Zernike wavefront sensor (vZWFS), which improved the Strehl ratio on the NIRC2 science camera by up to 10 percentage points. Segment co-phasing errors contribute to Keck contrast limits and will be necessary to correct for the segmented Extremely Large Telescopes and future sp… ▽ More We extend our previous demonstration of the first on-sky primary mirror segment closed-loop control on Keck using a vector-Zernike wavefront sensor (vZWFS), which improved the Strehl ratio on the NIRC2 science camera by up to 10 percentage points. Segment co-phasing errors contribute to Keck contrast limits and will be necessary to correct for the segmented Extremely Large Telescopes and future space missions. The goal of the post-AO vZWFS on Keck is to monitor and correct segment co-phasing errors in parallel with science observations. The ZWFS is ideal for measuring phase discontinuities and is one of the most sensitive WFSs, but has limited dynamic range. The Keck vZWFS consists of a metasurface mask imposing two different phase shifts to orthogonal polarizations, split into two pupil images, extending its dynamic range. We report on the vZWFS closed-loop co-phasing performance and early work towards understanding the interactions between the AO system and segment phasing. We discuss a comparison of the AO performance when co-phasing by aligning segment edges, as is currently done at Keck, compared with aligning to the average phase over the segments, as is done by the vZWFS. △ Less

Submitted 23 July, 2024; originally announced July 2024.

Comments: Proceedings of SPIE, 13097-61, 8 pages, 3 figures, 3 tables

arXiv:2407.10408 [pdf, other]

Latency Minimization for IRS-enhanced Wideband MEC Networks with Practical Reflection Model

Authors: N. Li, W. Hao, X. Li, Z. Zhu, Z. Tang, S. Yang

Abstract: Intelligent reflecting surface (IRS) has been considered as an efficient way to boost the computation capability of mobile edge computing (MEC) system, especially when the communication links is blocked or the communication signal is weak. However, most existing works are restricted to narrow-band channel and ideal IRS reflection model, which is not practical and may lead to significant performanc… ▽ More Intelligent reflecting surface (IRS) has been considered as an efficient way to boost the computation capability of mobile edge computing (MEC) system, especially when the communication links is blocked or the communication signal is weak. However, most existing works are restricted to narrow-band channel and ideal IRS reflection model, which is not practical and may lead to significant performance degradation in realistic systems. To further exploit the benefits of IRS in MEC system, we consider an IRS-enhanced wideband MEC system with practical IRS reflection model. With the aim of minimizing the weighted latency of all devices, the offloading data volume, edge computing resource, BS's receiving vector, and IRS passive beamforming are jointly optimized. Since the formulated problem is non-convex, we employ the block coordinate descent (BCD) technique to decouple it into two subproblems for alternatively optimizing computing and communication settings. The effectiveness and convergence of the proposed algorithm are validate via numerical analyses. In addition, simulation results demonstrate that the proposed algorithm can achieve lower latency compared to that based on the ideal IRS reflection model, which confirms the necessary of considering practical model when designing an IRS-enhanced wideband MEC system. △ Less

Submitted 14 July, 2024; originally announced July 2024.

Comments: 13 pages, 9 figures

arXiv:2407.05368 [pdf, other]

Music Era Recognition Using Supervised Contrastive Learning and Artist Information

Authors: Qiqi He, Xuchen Song, Weituo Hao, Ju-Chiang Wang, Wei-Tsung Lu, Wei Li

Abstract: Does popular music from the 60s sound different than that of the 90s? Prior study has shown that there would exist some variations of patterns and regularities related to instrumentation changes and growing loudness across multi-decadal trends. This indicates that perceiving the era of a song from musical features such as audio and artist information is possible. Music era information can be an im… ▽ More Does popular music from the 60s sound different than that of the 90s? Prior study has shown that there would exist some variations of patterns and regularities related to instrumentation changes and growing loudness across multi-decadal trends. This indicates that perceiving the era of a song from musical features such as audio and artist information is possible. Music era information can be an important feature for playlist generation and recommendation. However, the release year of a song can be inaccessible in many circumstances. This paper addresses a novel task of music era recognition. We formulate the task as a music classification problem and propose solutions based on supervised contrastive learning. An audio-based model is developed to predict the era from audio. For the case where the artist information is available, we extend the audio-based model to take multimodal inputs and develop a framework, called MultiModal Contrastive (MMC) learning, to enhance the training. Experimental result on Million Song Dataset demonstrates that the audio-based model achieves 54% in accuracy with a tolerance of 3-years range; incorporating the artist information with the MMC framework for training leads to 9% improvement further. △ Less

Submitted 7 July, 2024; originally announced July 2024.

arXiv:2406.18393 [pdf, other]

Stability and Robustness of Time-discretization Schemes for the Allen-Cahn Equation via Bifurcation and Perturbation Analysis

Authors: Wenrui Hao, Sun Lee, Xiaofeng Xu, Zhiliang Xu

Abstract: The Allen-Cahn equation is a fundamental model for phase transitions, offering critical insights into the dynamics of interface evolution in various physical systems. This paper investigates the stability and robustness of frequently utilized time-discretization numerical schemes for solving the Allen-Cahn equation, with focuses on the Backward Euler, Crank-Nicolson (CN), convex splitting of modif… ▽ More The Allen-Cahn equation is a fundamental model for phase transitions, offering critical insights into the dynamics of interface evolution in various physical systems. This paper investigates the stability and robustness of frequently utilized time-discretization numerical schemes for solving the Allen-Cahn equation, with focuses on the Backward Euler, Crank-Nicolson (CN), convex splitting of modified CN, and Diagonally Implicit Runge-Kutta (DIRK) methods. Our stability analysis reveals that the Convex Splitting of the Modified CN scheme exhibits unconditional stability, allowing greater flexibility in time step selection, while the other schemes are conditionally stable. Additionally, our robustness analysis highlights that the Backward Euler method converges to correct physical solutions regardless of initial conditions. In contrast, the other methods studied in this work show sensitivity to initial conditions and may converge to incorrect physical solutions if the initial conditions are not carefully chosen. This study introduces a comprehensive approach to assessing stability and robustness in numerical methods for solving the Allen-Cahn equation, providing a new perspective for evaluating numerical techniques for general nonlinear differential equations. △ Less

Submitted 26 June, 2024; originally announced June 2024.

MSC Class: 65M12; 35Q99; 35A35

arXiv:2406.03705 [pdf, other]

Coherent control of a triangular exchange-only spin qubit

Authors: Edwin Acuna, Joseph D. Broz, Kaushal Shyamsundar, Antonio B. Mei, Colin P. Feeney, Valerie Smetanka, Tiffany Davis, Kangmu Lee, Maxwell D. Choi, Brydon Boyd, June Suh, Wonill D. Ha, Cameron Jennings, Andrew S. Pan, Daniel S. Sanchez, Matthew D. Reed, Jason R. Petta

Abstract: We demonstrate coherent control of a three-electron exchange-only spin qubit with the quantum dots arranged in a close-packed triangular geometry. The device is tuned to confine one electron in each quantum dot, as evidenced by pairwise charge stability diagrams. Time-domain control of the exchange coupling is demonstrated and qubit performance is characterized using blind randomized benchmarking,… ▽ More We demonstrate coherent control of a three-electron exchange-only spin qubit with the quantum dots arranged in a close-packed triangular geometry. The device is tuned to confine one electron in each quantum dot, as evidenced by pairwise charge stability diagrams. Time-domain control of the exchange coupling is demonstrated and qubit performance is characterized using blind randomized benchmarking, with an average single-qubit gate fidelity F = 99.84%. The compact triangular device geometry can be readily scaled to larger two-dimensional quantum dot arrays with high connectivity. △ Less

Submitted 5 June, 2024; originally announced June 2024.

arXiv:2405.16649 [pdf, other]

Deep Koopman Learning using the Noisy Data

Authors: Wenjian Hao, Devesh Upadhyay, Shaoshuai Mou

Abstract: This paper proposes a data-driven framework to learn a finite-dimensional approximation of a Koopman operator for approximating the state evolution of a dynamical system under noisy observations. To this end, our proposed solution has two main advantages. First, the proposed method only requires the measurement noise to be bounded. Second, the proposed method modifies the existing deep Koopman ope… ▽ More This paper proposes a data-driven framework to learn a finite-dimensional approximation of a Koopman operator for approximating the state evolution of a dynamical system under noisy observations. To this end, our proposed solution has two main advantages. First, the proposed method only requires the measurement noise to be bounded. Second, the proposed method modifies the existing deep Koopman operator formulations by characterizing the effect of the measurement noise on the Koopman operator learning and then mitigating it by updating the tunable parameter of the observable functions of the Koopman operator, making it easy to implement. The performance of the proposed method is demonstrated on several standard benchmarks. We further compare the presented method with similar methods proposed in the latest literature on Koopman learning. △ Less

Submitted 2 June, 2024; v1 submitted 26 May, 2024; originally announced May 2024.

arXiv:2405.14099 [pdf, other]

Automatic Differentiation is Essential in Training Neural Networks for Solving Differential Equations

Authors: Chuqi Chen, Yahong Yang, Yang Xiang, Wenrui Hao

Abstract: Neural network-based approaches have recently shown significant promise in solving partial differential equations (PDEs) in science and engineering, especially in scenarios featuring complex domains or the incorporation of empirical data. One advantage of the neural network method for PDEs lies in its automatic differentiation (AD), which necessitates only the sample points themselves, unlike trad… ▽ More Neural network-based approaches have recently shown significant promise in solving partial differential equations (PDEs) in science and engineering, especially in scenarios featuring complex domains or the incorporation of empirical data. One advantage of the neural network method for PDEs lies in its automatic differentiation (AD), which necessitates only the sample points themselves, unlike traditional finite difference (FD) approximations that require nearby local points to compute derivatives. In this paper, we quantitatively demonstrate the advantage of AD in training neural networks. The concept of truncated entropy is introduced to characterize the training property. Specifically, through comprehensive experimental and theoretical analyses conducted on random feature models and two-layer neural networks, we discover that the defined truncated entropy serves as a reliable metric for quantifying the residual loss of random feature models and the training speed of neural networks for both AD and FD methods. Our experimental and theoretical analyses demonstrate that, from a training perspective, AD outperforms FD in solving partial differential equations. △ Less

Submitted 22 May, 2024; originally announced May 2024.

arXiv:2405.14096 [pdf, other]

Newton Informed Neural Operator for Computing Multiple Solutions of Nonlinear Partials Differential Equations

Authors: Wenrui Hao, Xinliang Liu, Yahong Yang

Abstract: Solving nonlinear partial differential equations (PDEs) with multiple solutions using neural networks has found widespread applications in various fields such as physics, biology, and engineering. However, classical neural network methods for solving nonlinear PDEs, such as Physics-Informed Neural Networks (PINN), Deep Ritz methods, and DeepONet, often encounter challenges when confronted with the… ▽ More Solving nonlinear partial differential equations (PDEs) with multiple solutions using neural networks has found widespread applications in various fields such as physics, biology, and engineering. However, classical neural network methods for solving nonlinear PDEs, such as Physics-Informed Neural Networks (PINN), Deep Ritz methods, and DeepONet, often encounter challenges when confronted with the presence of multiple solutions inherent in the nonlinear problem. These methods may encounter ill-posedness issues. In this paper, we propose a novel approach called the Newton Informed Neural Operator, which builds upon existing neural network techniques to tackle nonlinearities. Our method combines classical Newton methods, addressing well-posed problems, and efficiently learns multiple solutions in a single learning process while requiring fewer supervised data points compared to existing neural network methods. △ Less

Submitted 22 May, 2024; originally announced May 2024.

arXiv:2405.01258 [pdf, other]

Towards Consistent Object Detection via LiDAR-Camera Synergy

Authors: Kai Luo, Hao Wu, Kefu Yi, Kailun Yang, Wei Hao, Rongdong Hu

Abstract: As human-machine interaction continues to evolve, the capacity for environmental perception is becoming increasingly crucial. Integrating the two most common types of sensory data, images, and point clouds, can enhance detection accuracy. However, currently, no model exists that can simultaneously detect an object's position in both point clouds and images and ascertain their corresponding relatio… ▽ More As human-machine interaction continues to evolve, the capacity for environmental perception is becoming increasingly crucial. Integrating the two most common types of sensory data, images, and point clouds, can enhance detection accuracy. However, currently, no model exists that can simultaneously detect an object's position in both point clouds and images and ascertain their corresponding relationship. This information is invaluable for human-machine interactions, offering new possibilities for their enhancement. In light of this, this paper introduces an end-to-end Consistency Object Detection (COD) algorithm framework that requires only a single forward inference to simultaneously obtain an object's position in both point clouds and images and establish their correlation. Furthermore, to assess the accuracy of the object correlation between point clouds and images, this paper proposes a new evaluation metric, Consistency Precision (CP). To verify the effectiveness of the proposed framework, an extensive set of experiments has been conducted on the KITTI and DAIR-V2X datasets. The study also explored how the proposed consistency detection method performs on images when the calibration parameters between images and point clouds are disturbed, compared to existing post-processing methods. The experimental results demonstrate that the proposed method exhibits excellent detection performance and robustness, achieving end-to-end consistency detection. The source code will be made publicly available at https://github.com/xifen523/COD. △ Less

Submitted 2 May, 2024; originally announced May 2024.

Comments: The source code will be made publicly available at https://github.com/xifen523/COD

arXiv:2404.14248 [pdf, other]

NTIRE 2024 Challenge on Low Light Image Enhancement: Methods and Results

Authors: Xiaoning Liu, Zongwei Wu, Ao Li, Florin-Alexandru Vasluianu, Yulun Zhang, Shuhang Gu, Le Zhang, Ce Zhu, Radu Timofte, Zhi Jin, Hongjun Wu, Chenxi Wang, Haitao Ling, Yuanhao Cai, Hao Bian, Yuxin Zheng, Jing Lin, Alan Yuille, Ben Shao, Jin Guo, Tianli Liu, Mohao Wu, Yixu Feng, Shuo Hou, Haotian Lin , et al. (87 additional authors not shown)

Abstract: This paper reviews the NTIRE 2024 low light image enhancement challenge, highlighting the proposed solutions and results. The aim of this challenge is to discover an effective network design or solution capable of generating brighter, clearer, and visually appealing results when dealing with a variety of conditions, including ultra-high resolution (4K and beyond), non-uniform illumination, backlig… ▽ More This paper reviews the NTIRE 2024 low light image enhancement challenge, highlighting the proposed solutions and results. The aim of this challenge is to discover an effective network design or solution capable of generating brighter, clearer, and visually appealing results when dealing with a variety of conditions, including ultra-high resolution (4K and beyond), non-uniform illumination, backlighting, extreme darkness, and night scenes. A notable total of 428 participants registered for the challenge, with 22 teams ultimately making valid submissions. This paper meticulously evaluates the state-of-the-art advancements in enhancing low-light images, reflecting the significant progress and creativity in this field. △ Less

Submitted 22 April, 2024; originally announced April 2024.

Comments: NTIRE 2024 Challenge Report

arXiv:2404.09790 [pdf, other]

NTIRE 2024 Challenge on Image Super-Resolution ($\times$4): Methods and Results

Authors: Zheng Chen, Zongwei Wu, Eduard Zamfir, Kai Zhang, Yulun Zhang, Radu Timofte, Xiaokang Yang, Hongyuan Yu, Cheng Wan, Yuxin Hong, Zhijuan Huang, Yajun Zou, Yuan Huang, Jiamin Lin, Bingnan Han, Xianyu Guan, Yongsheng Yu, Daoan Zhang, Xuanwu Yin, Kunlong Zuo, Jinhua Hao, Kai Zhao, Kun Yuan, Ming Sun, Chao Zhou , et al. (63 additional authors not shown)

Abstract: This paper reviews the NTIRE 2024 challenge on image super-resolution ($\times$4), highlighting the solutions proposed and the outcomes obtained. The challenge involves generating corresponding high-resolution (HR) images, magnified by a factor of four, from low-resolution (LR) inputs using prior information. The LR images originate from bicubic downsampling degradation. The aim of the challenge i… ▽ More This paper reviews the NTIRE 2024 challenge on image super-resolution ($\times$4), highlighting the solutions proposed and the outcomes obtained. The challenge involves generating corresponding high-resolution (HR) images, magnified by a factor of four, from low-resolution (LR) inputs using prior information. The LR images originate from bicubic downsampling degradation. The aim of the challenge is to obtain designs/solutions with the most advanced SR performance, with no constraints on computational resources (e.g., model size and FLOPs) or training data. The track of this challenge assesses performance with the PSNR metric on the DIV2K testing dataset. The competition attracted 199 registrants, with 20 teams submitting valid entries. This collective endeavour not only pushes the boundaries of performance in single-image SR but also offers a comprehensive overview of current trends in this field. △ Less

Submitted 15 April, 2024; originally announced April 2024.

Comments: NTIRE 2024 webpage: https://cvlai.net/ntire/2024. Code: https://github.com/zhengchen1999/NTIRE2024_ImageSR_x4

arXiv:2404.08080 [pdf, other]

Variance-reduced Zeroth-Order Methods for Fine-Tuning Language Models

Authors: Tanmay Gautam, Youngsuk Park, Hao Zhou, Parameswaran Raman, Wooseok Ha

Abstract: Fine-tuning language models (LMs) has demonstrated success in a wide array of downstream tasks. However, as LMs are scaled up, the memory requirements for backpropagation become prohibitively high. Zeroth-order (ZO) optimization methods can leverage memory-efficient forward passes to estimate gradients. More recently, MeZO, an adaptation of ZO-SGD, has been shown to consistently outperform zero-sh… ▽ More Fine-tuning language models (LMs) has demonstrated success in a wide array of downstream tasks. However, as LMs are scaled up, the memory requirements for backpropagation become prohibitively high. Zeroth-order (ZO) optimization methods can leverage memory-efficient forward passes to estimate gradients. More recently, MeZO, an adaptation of ZO-SGD, has been shown to consistently outperform zero-shot and in-context learning when combined with suitable task prompts. In this work, we couple ZO methods with variance reduction techniques to enhance stability and convergence for inference-based LM fine-tuning. We introduce Memory-Efficient Zeroth-Order Stochastic Variance-Reduced Gradient (MeZO-SVRG) and demonstrate its efficacy across multiple LM fine-tuning tasks, eliminating the reliance on task-specific prompts. Evaluated across a range of both masked and autoregressive LMs on benchmark GLUE tasks, MeZO-SVRG outperforms MeZO with up to 20% increase in test accuracies in both full- and partial-parameter fine-tuning settings. MeZO-SVRG benefits from reduced computation time as it often surpasses MeZO's peak test accuracy with a $2\times$ reduction in GPU-hours. MeZO-SVRG significantly reduces the required memory footprint compared to first-order SGD, i.e. by $2\times$ for autoregressive models. Our experiments highlight that MeZO-SVRG's memory savings progressively improve compared to SGD with larger batch sizes. △ Less

Submitted 11 April, 2024; originally announced April 2024.

Comments: 29 pages, 25 tables, 9 figures

arXiv:2403.05972 [pdf, other]

C3D: Cascade Control with Change Point Detection and Deep Koopman Learning for Autonomous Surface Vehicles

Authors: Jianwen Li, Hyunsang Park, Wenjian Hao, Lei Xin, Jalil Chavez-Galaviz, Ajinkya Chaudhary, Meredith Bloss, Kyle Pattison, Christopher Vo, Devesh Upadhyay, Shreyas Sundaram, Shaoshuai Mou, Nina Mahmoudian

Abstract: In this paper, we discuss the development and deployment of a robust autonomous system capable of performing various tasks in the maritime domain under unknown dynamic conditions. We investigate a data-driven approach based on modular design for ease of transfer of autonomy across different maritime surface vessel platforms. The data-driven approach alleviates issues related to a priori identifica… ▽ More In this paper, we discuss the development and deployment of a robust autonomous system capable of performing various tasks in the maritime domain under unknown dynamic conditions. We investigate a data-driven approach based on modular design for ease of transfer of autonomy across different maritime surface vessel platforms. The data-driven approach alleviates issues related to a priori identification of system models that may become deficient under evolving system behaviors or shifting, unanticipated, environmental influences. Our proposed learning-based platform comprises a deep Koopman system model and a change point detector that provides guidance on domain shifts prompting relearning under severe exogenous and endogenous perturbations. Motion control of the autonomous system is achieved via an optimal controller design. The Koopman linearized model naturally lends itself to a linear-quadratic regulator (LQR) control design. We propose the C3D control architecture Cascade Control with Change Point Detection and Deep Koopman Learning. The framework is verified in station keeping task on an ASV in both simulation and real experiments. The approach achieved at least 13.9 percent improvement in mean distance error in all test cases compared to the methods that do not consider system changes. △ Less

Submitted 25 March, 2024; v1 submitted 9 March, 2024; originally announced March 2024.

Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

arXiv:2403.05008 [pdf, ps, other]

Clunie lemma in several complex variables and application in PDEs

Authors: Wenjie Hao, Qingcai Zhang

Abstract: Two purposes will be shown in this paper. The first one is to extend the classic Tumura-Clunie type theorem for meromorphic functions of one complex variable to meromorphic functions of several complex variables by using Clunie lemma. The second one is to characterize entire solutions of certain partial differential equations in $\mathbb{C}^{m}$. Our results are extensions and generalizations of t… ▽ More Two purposes will be shown in this paper. The first one is to extend the classic Tumura-Clunie type theorem for meromorphic functions of one complex variable to meromorphic functions of several complex variables by using Clunie lemma. The second one is to characterize entire solutions of certain partial differential equations in $\mathbb{C}^{m}$. Our results are extensions and generalizations of the previous theorems by Liao-Ye \cite{Liao-Ye} and Li \cite{Li11}. △ Less

Submitted 7 March, 2024; originally announced March 2024.

arXiv:2402.18898 [pdf, ps, other]

Beauty-charm Meson Family with Coupled Channel Effects and Their Strong Decays

Authors: Wei Hao, Ruilin Zhu

Abstract: We systematically study the mass spectra and their two-body hadronic decays of the beauty-charm meson family considering the coupled channel effects. Our results can good explain the observed $B_c$ meson spectrum and the prediction of the mass spectrum for unobserved beauty-charm mesons can be tested in future experiments. For the coupled channel components, we predicted the $1S$ state in beauty-c… ▽ More We systematically study the mass spectra and their two-body hadronic decays of the beauty-charm meson family considering the coupled channel effects. Our results can good explain the observed $B_c$ meson spectrum and the prediction of the mass spectrum for unobserved beauty-charm mesons can be tested in future experiments. For the coupled channel components, we predicted the $1S$ state in beauty-charm meson family is about $4\%$, while the $2S$, $1P$, $2P$, $1D$, and $2D$ states are about $14\%$, $10\%$, $33\%$, and $17\%$ respectively. For the $3S$, $2P$ and $2D$ states, the strong decay is allowed, The two-body hadronic decay widths of the $3^1S_0$, $3^3S_1$, $2^3P_2$ states are about 110 MeV, 69 MeV, and 3 MeV, respectively. While the two-body decay widths of the $2^3D_1$, $2D$, $2D^\prime$, and $2^3D_2$ states are 60 MeV, 149 MeV, 65 MeV, and 72 MeV, respectively. △ Less

Submitted 3 March, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

arXiv:2402.03949 [pdf, other]

Joint Beamforming Design for the STAR-RIS-Enabled ISAC Systems with Multiple Targets and Multiple Users

Authors: Shuang Zhang, Wanming Hao, Gangcan Sun, Zhengyu Zhu, Xingwang Li, Qingqing Wu

Abstract: In this paper, the sensing beam pattern gain under simultaneously transmitting and reflecting reconfigurable intelligent surfaces (STAR-RIS)-enabled integrated sensing and communications (ISAC) systems is investigated, in which multiple targets and multiple users exist. However, multiple targets detection introduces new challenges, since the STAR-RIS cannot directly send sensing beams and detect t… ▽ More In this paper, the sensing beam pattern gain under simultaneously transmitting and reflecting reconfigurable intelligent surfaces (STAR-RIS)-enabled integrated sensing and communications (ISAC) systems is investigated, in which multiple targets and multiple users exist. However, multiple targets detection introduces new challenges, since the STAR-RIS cannot directly send sensing beams and detect targets, the dual-functional base station (DFBS) is required to analyze the echoes of the targets. While the echoes reflected by different targets through STAR-RIS come from the same direction for the DFBS, making it impossible to distinguish them. To address the issue, we first introduce the signature sequence (SS) modulation scheme to the ISAC system, and thus, the DFBS can detect different targets by the SS-modulated sensing beams. Next, via the joint beamforming design of DFBS and STAR-RIS, we develop a maxmin sensing beam pattern gain problem, and meanwhile, considering the communication quality requirements, the interference limitations of other targets and users, the passive nature constraint of STAR-RIS, and the total transmit power limitation. Then, to tackle the complex non-convex problem, we propose an alternating optimization method to divide it into two quadratic semidefinite program subproblems and decouple the coupled variables. Drawing on mathematical transformation, semidefinite programming, as well as semidefinite relaxation techniques, these two subproblems are iteratively sloved until convergence, and the ultimate solutions are obtained. Finally, simulation results are conducted to validate the benefits and efficiency of our proposed scheme. △ Less

Submitted 5 February, 2024; originally announced February 2024.

arXiv:2401.08981 [pdf, other]

Real-time generative design of diverse, "truly" optimized structures with controllable structural complexities

Authors: Zongliang Du, Xinyu Ma, Wenyu Hao, Yuan Liang, Xiaoyu Zhang, Hongzhi Luo, Xu Guo

Abstract: Compared with traditional design methods, generative design significantly attracts engineers in various disciplines. In thiswork, howto achieve the real-time generative design of optimized structures with various diversities and controllable structural complexities is investigated. To this end, a modified Moving Morphable Component (MMC) method together with novel strategies are adopted to generat… ▽ More Compared with traditional design methods, generative design significantly attracts engineers in various disciplines. In thiswork, howto achieve the real-time generative design of optimized structures with various diversities and controllable structural complexities is investigated. To this end, a modified Moving Morphable Component (MMC) method together with novel strategies are adopted to generate high-quality dataset. The complexity level of optimized structures is categorized by the topological invariant. By improving the cost function, the WGAN is trained to produce optimized designs with the input of loading position and complexity level in real time. It is found that, diverse designs with a clear load transmission path and crisp boundary, even not requiring further optimization and different from any reference in the dataset, can be generated by the proposed model. This method holds great potential for future applications of machine learning enhanced intelligent design. △ Less

Submitted 20 January, 2024; v1 submitted 17 January, 2024; originally announced January 2024.

arXiv:2312.16986 [pdf, ps, other]

Analysis of Kozai Cycles in Equal-Mass Hierarchical Triple Supermassive Black Hole Mergers in the Presence of a Stellar Cluster

Authors: Wei Hao, M. B. N. Kouwenhoven, Rainer Spurzem, Pau Amaro Seoane, Rosemary A. Mardling, Xiuming Xu

Abstract: Supermassive black holes (SMBHs) play an important role in galaxy evolution. Binary and triple SMBHs can form after galaxy mergers. A third SMBH may accelerate the SMBH merging process, possibly through the Kozai mechanism. We use N -body simulations to analyze oscillations in the orbital elements of hierarchical triple SMBHs with surrounding star clusters in galaxy centers. We find that SMBH trip… ▽ More Supermassive black holes (SMBHs) play an important role in galaxy evolution. Binary and triple SMBHs can form after galaxy mergers. A third SMBH may accelerate the SMBH merging process, possibly through the Kozai mechanism. We use N -body simulations to analyze oscillations in the orbital elements of hierarchical triple SMBHs with surrounding star clusters in galaxy centers. We find that SMBH triples spend only a small fraction of time in the hierarchical merger phase (i.e., a binary SMBH with a distant third SMBH perturber). Most of the time, the enclosed stellar mass within the orbits of the innermost or the outermost SMBH is comparable to the SMBH masses, indicating that the influence of the surrounding stellar population cannot be ignored. We search for Eccentric Kozai-Lidov (EKL) oscillations for which (i) the eccentricity of the inner binary and inclination are both oscillate and are anti-phase or in-phase and (ii) the oscillation period is consistent with EKL timescale. We find that EKL oscillations are short-lived and rare: the triple SMBH spends around 3% of its time in this phase over the ensemble of simulations, reaching around 8% in the best-case scenario. This suggests that the role of the EKL mechanism in accelerating the SMBH merger process may have been overestimated in previous studies. We follow-up with three-body simulations, using initial conditions extracted from the simulation, and the result can to some extent repeat the observed EKL-like oscillations. This comparison provides clues about why those EKL oscillations with perturbing stars are short-lived. △ Less

Submitted 28 December, 2023; originally announced December 2023.

Comments: 24 pages, 16 figures, Published: 20 December 2023 on MNRAS https://doi.org/10.1093/mnras/stad3908

arXiv:2312.08952 [pdf, other]

UCMCTrack: Multi-Object Tracking with Uniform Camera Motion Compensation

Authors: Kefu Yi, Kai Luo, Xiaolei Luo, Jiangui Huang, Hao Wu, Rongdong Hu, Wei Hao

Abstract: Multi-object tracking (MOT) in video sequences remains a challenging task, especially in scenarios with significant camera movements. This is because targets can drift considerably on the image plane, leading to erroneous tracking outcomes. Addressing such challenges typically requires supplementary appearance cues or Camera Motion Compensation (CMC). While these strategies are effective, they als… ▽ More Multi-object tracking (MOT) in video sequences remains a challenging task, especially in scenarios with significant camera movements. This is because targets can drift considerably on the image plane, leading to erroneous tracking outcomes. Addressing such challenges typically requires supplementary appearance cues or Camera Motion Compensation (CMC). While these strategies are effective, they also introduce a considerable computational burden, posing challenges for real-time MOT. In response to this, we introduce UCMCTrack, a novel motion model-based tracker robust to camera movements. Unlike conventional CMC that computes compensation parameters frame-by-frame, UCMCTrack consistently applies the same compensation parameters throughout a video sequence. It employs a Kalman filter on the ground plane and introduces the Mapped Mahalanobis Distance (MMD) as an alternative to the traditional Intersection over Union (IoU) distance measure. By leveraging projected probability distributions on the ground plane, our approach efficiently captures motion patterns and adeptly manages uncertainties introduced by homography projections. Remarkably, UCMCTrack, relying solely on motion cues, achieves state-of-the-art performance across a variety of challenging datasets, including MOT17, MOT20, DanceTrack and KITTI. More details and code are available at https://github.com/corfyi/UCMCTrack △ Less

Submitted 11 January, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

Comments: Accepted to AAAI 2024

arXiv:2311.17867 [pdf, other]

A Class of Directed Acyclic Graphs with Mixed Data Types in Mediation Analysis

Authors: Wei Hao, Canyi Chen, Peter X. -K. Song

Abstract: We propose a unified class of generalized structural equation models (GSEMs) with data of mixed types in mediation analysis, including continuous, categorical, and count variables. Such models extend substantially the classical linear structural equation model to accommodate many data types arising from the application of mediation analysis. Invoking the hierarchical modeling approach, we specify… ▽ More We propose a unified class of generalized structural equation models (GSEMs) with data of mixed types in mediation analysis, including continuous, categorical, and count variables. Such models extend substantially the classical linear structural equation model to accommodate many data types arising from the application of mediation analysis. Invoking the hierarchical modeling approach, we specify GSEMs by a copula joint distribution of outcome variable, mediator and exposure variable, in which marginal distributions are built upon generalized linear models (GLMs) with confounding factors. We discuss the identifiability conditions for the causal mediation effects in the counterfactual paradigm as well as the issue of mediation leakage, and develop an asymptotically efficient profile maximum likelihood estimation and inference for two key mediation estimands, natural direct effect and natural indirect effect, in different scenarios of mixed data types. The proposed new methodology is illustrated by a motivating epidemiological study that aims to investigate whether the tempo of reaching infancy BMI peak (delay or on time), an important early life growth milestone, may mediate the association between prenatal exposure to phthalates and pubertal health outcomes. △ Less

Submitted 4 December, 2023; v1 submitted 29 November, 2023; originally announced November 2023.

Comments: 33 pages, 3 figures, 3 tables

arXiv:2311.16628 [pdf, other]

Symmetry-regularized neural ordinary differential equations

Authors: Wenbo Hao

Abstract: Neural ordinary differential equations (Neural ODEs) is a class of machine learning models that approximate the time derivative of hidden states using a neural network. They are powerful tools for modeling continuous-time dynamical systems, enabling the analysis and prediction of complex temporal behaviors. However, how to improve the model's stability and physical interpretability remains a chall… ▽ More Neural ordinary differential equations (Neural ODEs) is a class of machine learning models that approximate the time derivative of hidden states using a neural network. They are powerful tools for modeling continuous-time dynamical systems, enabling the analysis and prediction of complex temporal behaviors. However, how to improve the model's stability and physical interpretability remains a challenge. This paper introduces new conservation relations in Neural ODEs using Lie symmetries in both the hidden state dynamics and the back propagation dynamics. These conservation laws are then incorporated into the loss function as additional regularization terms, potentially enhancing the physical interpretability and generalizability of the model. To illustrate this method, the paper derives Lie symmetries and conservation laws in a simple Neural ODE designed to monitor charged particles in a sinusoidal electric field. New loss functions are constructed from these conservation relations, demonstrating the applicability symmetry-regularized Neural ODE in typical modeling tasks, such as data-driven discovery of dynamical systems. △ Less

Submitted 12 July, 2024; v1 submitted 28 November, 2023; originally announced November 2023.

arXiv:2311.09866 [pdf, other]

A numerical method for solving elliptic equations on real closed algebraic curves and surfaces

Authors: Wenrui Hao, Jonathan D. Hauenstein, Margaret H. Regan, Tingting Tang

Abstract: There are many numerical methods for solving partial different equations (PDEs) on manifolds such as classical implicit, finite difference, finite element, and isogeometric analysis methods which aim at improving the interoperability between finite element method and computer aided design (CAD) software. However, these approaches have difficulty when the domain has singularities since the solution… ▽ More There are many numerical methods for solving partial different equations (PDEs) on manifolds such as classical implicit, finite difference, finite element, and isogeometric analysis methods which aim at improving the interoperability between finite element method and computer aided design (CAD) software. However, these approaches have difficulty when the domain has singularities since the solution at the singularity may be multivalued. This paper develops a novel numerical approach to solve elliptic PDEs on real, closed, connected, orientable, and almost smooth algebraic curves and surfaces. Our method integrates numerical algebraic geometry, differential geometry, and a finite difference scheme which is demonstrated on several examples. △ Less

Submitted 16 November, 2023; originally announced November 2023.

arXiv:2311.07487 [pdf, other]

Vertiport Navigation Requirements and Multisensor Architecture Considerations for Urban Air Mobility

Authors: Omar Garcia Crespillo, Chen Zhu, Maximilian Simonetti, Daniel Gerbeth, Young-Hee Lee, Wenhan Hao

Abstract: Communication, Navigation and Surveillance (CNS) technologies are key enablers for future safe operation of drones in urban environments. However, the design of navigation technologies for these new applications is more challenging compared to e.g., civil aviation. On the one hand, the use cases and operations in urban environments are expected to have stringent requirements in terms of accuracy,… ▽ More Communication, Navigation and Surveillance (CNS) technologies are key enablers for future safe operation of drones in urban environments. However, the design of navigation technologies for these new applications is more challenging compared to e.g., civil aviation. On the one hand, the use cases and operations in urban environments are expected to have stringent requirements in terms of accuracy, integrity, continuity and availability. On the other hand, airborne sensors may not be based on high-quality equipment as in civil aviation and solutions need to rely on tighter multisensor solutions, whose safety is difficult to assess. In this work, we first provide some initial navigation requirements related to precision approach operations based on recently proposed vertiport designs. Then, we provide an overview of a possible multisensor navigation architecture solution able to support these types of operations and we comment on the challenges of each of the subsystems. Finally, initial proof of concept for some navigation sensor subsystems is presented based on flight trials performed during the German Aerospace Center (DLR) project HorizonUAM. △ Less

Submitted 13 November, 2023; originally announced November 2023.

arXiv:2311.03282 [pdf, ps, other]

Resource Allocation for RIS-Empowered Wireless Communications: Low-Complexity and Robust Designs

Authors: Ming Zeng, Wanming Hao, Zhangjie Peng, Zheng Chu, Xingwang Li, Changsheng You, Cunhua Pan

Abstract: This article delves into advancements in resource allocation techniques tailored for systems utilizing reconfigurable intelligent surfaces (RIS), with a primary focus on achieving low-complexity and resilient solutions. The investigation of low-complexity approaches for RIS holds significant relevance, primarily owing to the intricate characteristics inherent in RIS-based systems and the need of d… ▽ More This article delves into advancements in resource allocation techniques tailored for systems utilizing reconfigurable intelligent surfaces (RIS), with a primary focus on achieving low-complexity and resilient solutions. The investigation of low-complexity approaches for RIS holds significant relevance, primarily owing to the intricate characteristics inherent in RIS-based systems and the need of deploying large-scale RIS arrays. Concurrently, the exploration of robust solutions aims to address the issue of hardware impairments occurring at both the transceivers and RIS components in practical RIS-assisted systems. In the realm of both low-complexity and robust resource allocation, this article not only elucidates the fundamental techniques underpinning these methodologies but also offers comprehensive numerical results for illustrative purposes. The necessity of adopting resource allocation strategies that are both low in complexity and resilient is thoroughly established. Ultimately, this article provides prospective research avenues in the domain of low-complexity and robust resource allocation techniques tailored for RIS-assisted systems. △ Less

Submitted 6 November, 2023; originally announced November 2023.

Comments: submitted to IEEE WCM

arXiv:2310.13933 [pdf, other]

Wideband Beamforming for STAR-RIS-assisted THz Communications with Three-Side Beam Split

Authors: Wencai Yan, Wanming Hao, Gangcan Sun, Chongwen Huang, Qingqing Wu

Abstract: In this paper, we consider the simultaneously transmitting and reflecting reconfigurable intelligent surface (STAR-RIS)-assisted THz communications with three-side beam split. Except for the beam split at the base station (BS), we analyze the double-side beam split at the STAR-RIS for the first time. To relieve the double-side beam split effect, we propose a time delayer (TD)-based fully-connected… ▽ More In this paper, we consider the simultaneously transmitting and reflecting reconfigurable intelligent surface (STAR-RIS)-assisted THz communications with three-side beam split. Except for the beam split at the base station (BS), we analyze the double-side beam split at the STAR-RIS for the first time. To relieve the double-side beam split effect, we propose a time delayer (TD)-based fully-connected structure at the STAR-RIS. As a further advance, a low-hardware complexity and low-power consumption sub-connected structure is developed, where multiple STAR-RIS elements share one TD. Meanwhile, considering the practical scenario, we investigate a multi-STAR-RIS and multi-user communication system, and a sum rate maximization problem is formulated by jointly optimizing the hybrid analog/digital beamforming, time delays at the BS as well as the double-layer phase-shift coefficients, time delays and amplitude coefficients at the STAR-RISs. Based on this, we first allocate users for each STAR-RIS, and then derive the analog beamforming, time delays at the BS, and the double-layer phase-shift coefficients, time delays at each STAR-RIS. Next, we develop an alternative optimization algorithm to calculate the digital beamforming at the BS and amplitude coefficients at the STAR-RISs. Finally, the numerical results verify the effectiveness of the proposed schemes. △ Less

Submitted 21 October, 2023; originally announced October 2023.

arXiv:2310.13917 [pdf, other]

Beamforming Design for the Distributed RISs-aided THz Communications with Double-Layer True Time Delays

Authors: Gangcan Sun, Wencai Yan, Wanming Hao, Chongwen Huang, Chau Yuen

Abstract: In this paper, we investigate the reconfigurable intelligent surface (RIS)-aided terahertz (THz) communication system with the sparse radio frequency chains antenna structure at the base station (BS). To overcome the beam split of the BS, different from the conventional single-layer true-time-delay (TTD) scheme, we propose a double-layer TTD scheme that can effectively reduce the number of large-r… ▽ More In this paper, we investigate the reconfigurable intelligent surface (RIS)-aided terahertz (THz) communication system with the sparse radio frequency chains antenna structure at the base station (BS). To overcome the beam split of the BS, different from the conventional single-layer true-time-delay (TTD) scheme, we propose a double-layer TTD scheme that can effectively reduce the number of large-range delay devices, which involve additional insertion loss and amplification circuitry. Next, we analyze the system performance under the proposed double-layer TTD scheme. To relieve the beam split of the RIS, we consider multiple distributed RISs to replace an ultra-large size RIS. Based on this, we formulate an achievable rate maximization problem for the distributed RISs-aided THz communications via jointly optimizing the hybrid analog/digital beamforming, time delays of the double-layer TTD network and reflection coefficients of RISs. Considering the practical hardware limitation, the finite-resolution phase shift, time delay and reflection phase are constrained. To solve the formulated problem, we first design an analog beamforming scheme including optimizing phase shift and time delay based on the RISs' locations. Then, an alternatively optimization algorithm is proposed to obtain the digital beamforming and reflection coefficients based on the minimum mean square error and coordinate update techniques. Finally, simulation results show the effectiveness of the proposed scheme. △ Less

Submitted 21 October, 2023; originally announced October 2023.

arXiv:2310.01605 [pdf, other]

Primal-dual hybrid gradient algorithms for computing time-implicit Hamilton-Jacobi equations

Authors: Tingwei Meng, Wenbo Hao, Siting Liu, Stanley J. Osher, Wuchen Li

Abstract: Hamilton-Jacobi (HJ) partial differential equations (PDEs) have diverse applications spanning physics, optimal control, game theory, and imaging sciences. This research introduces a first-order optimization-based technique for HJ PDEs, which formulates the time-implicit update of HJ PDEs as saddle point problems. We remark that the saddle point formulation for HJ equations is aligned with the prim… ▽ More Hamilton-Jacobi (HJ) partial differential equations (PDEs) have diverse applications spanning physics, optimal control, game theory, and imaging sciences. This research introduces a first-order optimization-based technique for HJ PDEs, which formulates the time-implicit update of HJ PDEs as saddle point problems. We remark that the saddle point formulation for HJ equations is aligned with the primal-dual formulation of optimal transport and potential mean-field games (MFGs). This connection enables us to extend MFG techniques and design numerical schemes for solving HJ PDEs. We employ the primal-dual hybrid gradient (PDHG) method to solve the saddle point problems, benefiting from the simple structures that enable fast computations in updates. Remarkably, the method caters to a broader range of Hamiltonians, encompassing non-smooth and spatiotemporally dependent cases. The approach's effectiveness is verified through various numerical examples in both one-dimensional and two-dimensional examples, such as quadratic and $L^1$ Hamiltonians with spatial and time dependence. △ Less

Submitted 2 October, 2023; originally announced October 2023.

arXiv:2309.15244 [pdf, other]

Homotopy Relaxation Training Algorithms for Infinite-Width Two-Layer ReLU Neural Networks

Authors: Yahong Yang, Qipin Chen, Wenrui Hao

Abstract: In this paper, we present a novel training approach called the Homotopy Relaxation Training Algorithm (HRTA), aimed at accelerating the training process in contrast to traditional methods. Our algorithm incorporates two key mechanisms: one involves building a homotopy activation function that seamlessly connects the linear activation function with the ReLU activation function; the other technique… ▽ More In this paper, we present a novel training approach called the Homotopy Relaxation Training Algorithm (HRTA), aimed at accelerating the training process in contrast to traditional methods. Our algorithm incorporates two key mechanisms: one involves building a homotopy activation function that seamlessly connects the linear activation function with the ReLU activation function; the other technique entails relaxing the homotopy parameter to enhance the training refinement process. We have conducted an in-depth analysis of this novel method within the context of the neural tangent kernel (NTK), revealing significantly improved convergence rates. Our experimental results, especially when considering networks with larger widths, validate the theoretical conclusions. This proposed HRTA exhibits the potential for other activation functions and deep neural networks. △ Less

Submitted 8 January, 2024; v1 submitted 26 September, 2023; originally announced September 2023.

arXiv:2309.10301 [pdf, other]

Prominent Roles of Conditionally Invariant Components in Domain Adaptation: Theory and Algorithms

Authors: Keru Wu, Yuansi Chen, Wooseok Ha, Bin Yu

Abstract: Domain adaptation (DA) is a statistical learning problem that arises when the distribution of the source data used to train a model differs from that of the target data used to evaluate the model. While many DA algorithms have demonstrated considerable empirical success, blindly applying these algorithms can often lead to worse performance on new datasets. To address this, it is crucial to clarify… ▽ More Domain adaptation (DA) is a statistical learning problem that arises when the distribution of the source data used to train a model differs from that of the target data used to evaluate the model. While many DA algorithms have demonstrated considerable empirical success, blindly applying these algorithms can often lead to worse performance on new datasets. To address this, it is crucial to clarify the assumptions under which a DA algorithm has good target performance. In this work, we focus on the assumption of the presence of conditionally invariant components (CICs), which are relevant for prediction and remain conditionally invariant across the source and target data. We demonstrate that CICs, which can be estimated through conditional invariant penalty (CIP), play three prominent roles in providing target risk guarantees in DA. First, we propose a new algorithm based on CICs, importance-weighted conditional invariant penalty (IW-CIP), which has target risk guarantees beyond simple settings such as covariate shift and label shift. Second, we show that CICs help identify large discrepancies between source and target risks of other DA algorithms. Finally, we demonstrate that incorporating CICs into the domain invariant projection (DIP) algorithm can address its failure scenario caused by label-flipping features. We support our new algorithms and theoretical findings via numerical experiments on synthetic data, MNIST, CelebA, Camelyon17, and DomainNet datasets. △ Less

Submitted 8 July, 2024; v1 submitted 19 September, 2023; originally announced September 2023.

arXiv:2309.03471 [pdf, other]

Resource Management for IRS-assisted WP-MEC Networks with Practical Phase Shift Model

Authors: Nana Li, Wanming Hao, Fuhui Zhou, Zheng Chu, Shouyi Yang, Pei Xiao

Abstract: Wireless powered mobile edge computing (WP-MEC) has been recognized as a promising solution to enhance the computational capability and sustainable energy supply for low-power wireless devices (WDs). However, when the communication links between the hybrid access point (HAP) and WDs are hostile, the energy transfer efficiency and task offloading rate are compromised. To tackle this problem, we pro… ▽ More Wireless powered mobile edge computing (WP-MEC) has been recognized as a promising solution to enhance the computational capability and sustainable energy supply for low-power wireless devices (WDs). However, when the communication links between the hybrid access point (HAP) and WDs are hostile, the energy transfer efficiency and task offloading rate are compromised. To tackle this problem, we propose to employ multiple intelligent reflecting surfaces (IRSs) to WP-MEC networks. Based on the practical IRS phase shift model, we formulate a total computation rate maximization problem by jointly optimizing downlink/uplink IRSs passive beamforming, downlink energy beamforming and uplink multi-user detection (MUD) vector at HAPs, task offloading power and local computing frequency of WDs, and the time slot allocation. Specifically, we first derive the optimal time allocation for downlink wireless energy transmission (WET) to IRSs and the corresponding energy beamforming. Next, with fixed time allocation for the downlink WET to WDs, the original optimization problem can be divided into two independent subproblems. For the WD charging subproblem, the optimal IRSs passive beamforming is derived by utilizing the successive convex approximation (SCA) method and the penalty-based optimization technique, and for the offloading computing subproblem, we propose a joint optimization framework based on the fractional programming (FP) method. Finally, simulation results validate that our proposed optimization method based on the practical phase shift model can achieve a higher total computation rate compared to the baseline schemes. △ Less

Submitted 7 September, 2023; originally announced September 2023.

Comments: 15 pages, 14 figures

arXiv:2308.14360 [pdf, other]

InstructME: An Instruction Guided Music Edit And Remix Framework with Latent Diffusion Models

Authors: Bing Han, Junyu Dai, Weituo Hao, Xinyan He, Dong Guo, Jitong Chen, Yuxuan Wang, Yanmin Qian, Xuchen Song

Abstract: Music editing primarily entails the modification of instrument tracks or remixing in the whole, which offers a novel reinterpretation of the original piece through a series of operations. These music processing methods hold immense potential across various applications but demand substantial expertise. Prior methodologies, although effective for image and audio modifications, falter when directly… ▽ More Music editing primarily entails the modification of instrument tracks or remixing in the whole, which offers a novel reinterpretation of the original piece through a series of operations. These music processing methods hold immense potential across various applications but demand substantial expertise. Prior methodologies, although effective for image and audio modifications, falter when directly applied to music. This is attributed to music's distinctive data nature, where such methods can inadvertently compromise the intrinsic harmony and coherence of music. In this paper, we develop InstructME, an Instruction guided Music Editing and remixing framework based on latent diffusion models. Our framework fortifies the U-Net with multi-scale aggregation in order to maintain consistency before and after editing. In addition, we introduce chord progression matrix as condition information and incorporate it in the semantic space to improve melodic harmony while editing. For accommodating extended musical pieces, InstructME employs a chunk transformer, enabling it to discern long-term temporal dependencies within music sequences. We tested InstructME in instrument-editing, remixing, and multi-round editing. Both subjective and objective evaluations indicate that our proposed method significantly surpasses preceding systems in music quality, text relevance and harmony. Demo samples are available at https://musicedit.github.io/ △ Less

Submitted 12 December, 2023; v1 submitted 28 August, 2023; originally announced August 2023.

Comments: Demo samples are available at https://musicedit.github.io/

arXiv:2308.08514 [pdf]

doi 10.1038/s41467-023-40735-7

Towards Layer-Selective Quantum Spin Hall Channels in Weak Topological Insulator Bi4Br2I2

Authors: Jingyuan Zhong, Ming Yang, Zhijian Shi, Yaqi Li, Dan Mu, Yundan Liu, Ningyan Cheng, Wenxuan Zhao, Weichang Hao, Jianfeng Wang, Lexian Yang, Jincheng Zhuang, Yi Du

Abstract: Weak topological insulators, constructed by stacking quantum spin Hall insulators with weak interlayer coupling, offer promising quantum electronic applications through topologically nontrivial edge channels. However, the currently available weak topological insulators are stacks of the same quantum spin Hall layer with translational symmetry in the out-of-plane direction, leading to the absence o… ▽ More Weak topological insulators, constructed by stacking quantum spin Hall insulators with weak interlayer coupling, offer promising quantum electronic applications through topologically nontrivial edge channels. However, the currently available weak topological insulators are stacks of the same quantum spin Hall layer with translational symmetry in the out-of-plane direction, leading to the absence of the channel degree of freedom for edge states. Here, we study a candidate weak topological insulator, Bi4Br2I2, which is alternately stacked by three different quantum spin Hall insulators, each with tunable topologically non-trivial edge states. Our angle-resolved photoemission spectroscopy and first-principles calculations show that an energy gap opens at the crossing points of different Dirac cones correlated with different layers due to the interlayer interaction. This is essential to achieve the tunability of topological edge states as controlled by varying the chemical potential. Our work offers a perspective for the construction of tunable quantized conductance devices for future spintronic applications. △ Less

Submitted 16 August, 2023; originally announced August 2023.

Journal ref: Nature Communications 14, 4964 (2023)

arXiv:2308.06064 [pdf, other]

Joint Beamforming Optimization for Active STAR-RIS Assisted ISAC systems

Authors: Shuang Zhang, Wanming Hao, Gangcan Sun, Chongwen Huang, Zhengyu Zhu, Xingwang Li, Chau Yuen

Abstract: In this paper, we investigate an active simultaneously transmitting and reflecting reconfigurable intelligent surface (STAR-RIS) assisted integrated sensing and communications (ISAC) system, in which a dual-function base station (DFBS) equipped with multiple antennas provides communication services for multiple users with the assistance of an active STARRIS and performs target sensing simultaneous… ▽ More In this paper, we investigate an active simultaneously transmitting and reflecting reconfigurable intelligent surface (STAR-RIS) assisted integrated sensing and communications (ISAC) system, in which a dual-function base station (DFBS) equipped with multiple antennas provides communication services for multiple users with the assistance of an active STARRIS and performs target sensing simultaneously. Through optimizing both the DFBS and STAR-RIS beamforming jointly under different work modes, our purpose is to achieve the maximized communication sum rate, subject to the minimum radar signal-to-noise ratio (SNR) constraint, active STAR-RIS hardware constraints, and total power constraint of DFBS and active STAR-RIS. To solve the non-convex optimization problem formulated, an efficient alternating optimization algorithm is proposed. Specifically, the fractional programming scheme is first leveraged to turn the original problem into a structure with more tractable, and subsequently the transformed problem is decomposed into multiple sub-problems. Next, we develop a derivation method to obtain the closed expression of the radar receiving beamforming, and then the DFBS transmit beamforming is optimized under the radar SNR requirement and total power constraint. After that, the active STAR-RIS reflection and transmission beamforming are optimized by majorization minimiation, complex circle manifold and convex optimization techniques. Finally, the proposed schemes are conducted through numerical simulations to show their benefits and efficiency. △ Less

Submitted 11 August, 2023; originally announced August 2023.

arXiv:2308.05813 [pdf, other]

doi 10.1109/JIOT.2023.3296319

Physical Layer Security for NOMA Systems: Requirements, Issues, and Recommendations

Authors: Saeid Pakravan, Jean-Yves Chouinard, Xingwang Li, Ming Zeng, Wanming Hao, Quoc-Viet Pham, Octavia A. Dobre

Abstract: Non-orthogonal multiple access (NOMA) has been viewed as a potential candidate for the upcoming generation of wireless communication systems. Comparing to traditional orthogonal multiple access (OMA), multiplexing users in the same time-frequency resource block can increase the number of served users and improve the efficiency of the systems in terms of spectral efficiency. Nevertheless, from a se… ▽ More Non-orthogonal multiple access (NOMA) has been viewed as a potential candidate for the upcoming generation of wireless communication systems. Comparing to traditional orthogonal multiple access (OMA), multiplexing users in the same time-frequency resource block can increase the number of served users and improve the efficiency of the systems in terms of spectral efficiency. Nevertheless, from a security view-point, when multiple users are utilizing the same time-frequency resource, there may be concerns regarding keeping information confidential. In this context, physical layer security (PLS) has been introduced as a supplement of protection to conventional encryption techniques by making use of the random nature of wireless transmission media for ensuring communication secrecy. The recent years have seen significant interests in PLS being applied to NOMA networks. Numerous scenarios have been investigated to assess the security of NOMA systems, including when active and passive eavesdroppers are present, as well as when these systems are combined with relay and reconfigurable intelligent surfaces (RIS). Additionally, the security of the ambient backscatter (AmB)-NOMA systems are other issues that have lately drawn a lot of attention. In this paper, a thorough analysis of the PLS-assisted NOMA systems research state-of-the-art is presented. In this regard, we begin by outlining the foundations of NOMA and PLS, respectively. Following that, we discuss the PLS performances for NOMA systems in four categories depending on the type of the eavesdropper, the existence of relay, RIS, and AmB systems in different conditions. Finally, a thorough explanation of the most recent PLS-assisted NOMA systems is given. △ Less

Submitted 10 August, 2023; originally announced August 2023.

Comments: 17 pages, 4 figures

Journal ref: IEEE Internet of Things Journal

arXiv:2308.03215 [pdf, other]

The Effect of SGD Batch Size on Autoencoder Learning: Sparsity, Sharpness, and Feature Learning

Authors: Nikhil Ghosh, Spencer Frei, Wooseok Ha, Bin Yu

Abstract: In this work, we investigate the dynamics of stochastic gradient descent (SGD) when training a single-neuron autoencoder with linear or ReLU activation on orthogonal data. We show that for this non-convex problem, randomly initialized SGD with a constant step size successfully finds a global minimum for any batch size choice. However, the particular global minimum found depends upon the batch size… ▽ More In this work, we investigate the dynamics of stochastic gradient descent (SGD) when training a single-neuron autoencoder with linear or ReLU activation on orthogonal data. We show that for this non-convex problem, randomly initialized SGD with a constant step size successfully finds a global minimum for any batch size choice. However, the particular global minimum found depends upon the batch size. In the full-batch setting, we show that the solution is dense (i.e., not sparse) and is highly aligned with its initialized direction, showing that relatively little feature learning occurs. On the other hand, for any batch size strictly smaller than the number of samples, SGD finds a global minimum which is sparse and nearly orthogonal to its initialization, showing that the randomness of stochastic gradients induces a qualitatively different type of "feature selection" in this setting. Moreover, if we measure the sharpness of the minimum by the trace of the Hessian, the minima found with full batch gradient descent are flatter than those found with strictly smaller batch sizes, in contrast to previous works which suggest that large batches lead to sharper minima. To prove convergence of SGD with a constant step size, we introduce a powerful tool from the theory of non-homogeneous random walks which may be of independent interest. △ Less

Submitted 6 August, 2023; originally announced August 2023.

arXiv:2308.01551 [pdf, other]

Avoidance Navigation Based on Offline Pre-Training Reinforcement Learning

Authors: Yang Wenkai Ji Ruihang Zhang Yuxiang Lei Hao, Zhao Zijie

Abstract: This paper presents a Pre-Training Deep Reinforcement Learning(DRL) for avoidance navigation without map for mobile robots which map raw sensor data to control variable and navigate in an unknown environment. The efficient offline training strategy is proposed to speed up the inefficient random explorations in early stage and we also collect a universal dataset including expert experience for offl… ▽ More This paper presents a Pre-Training Deep Reinforcement Learning(DRL) for avoidance navigation without map for mobile robots which map raw sensor data to control variable and navigate in an unknown environment. The efficient offline training strategy is proposed to speed up the inefficient random explorations in early stage and we also collect a universal dataset including expert experience for offline training, which is of some significance for other navigation training work. The pre-training and prioritized expert experience are proposed to reduce 80\% training time and has been verified to improve the 2 times reward of DRL. The advanced simulation gazebo with real physical modelling and dynamic equations reduce the gap between sim-to-real. We train our model a corridor environment, and evaluate the model in different environment getting the same effect. Compared to traditional method navigation, we can confirm the trained model can be directly applied into different scenarios and have the ability to no collision navigate. It was demonstrated that our DRL model have universal general capacity in different environment. △ Less

Submitted 3 August, 2023; originally announced August 2023.

arXiv:2307.07507 [pdf, other]

MGit: A Model Versioning and Management System

Authors: Wei Hao, Daniel Mendoza, Rafael da Silva, Deepak Narayanan, Amar Phanishaye

Abstract: Models derived from other models are extremely common in machine learning (ML) today. For example, transfer learning is used to create task-specific models from "pre-trained" models through finetuning. This has led to an ecosystem where models are related to each other, sharing structure and often even parameter values. However, it is hard to manage these model derivatives: the storage overhead of… ▽ More Models derived from other models are extremely common in machine learning (ML) today. For example, transfer learning is used to create task-specific models from "pre-trained" models through finetuning. This has led to an ecosystem where models are related to each other, sharing structure and often even parameter values. However, it is hard to manage these model derivatives: the storage overhead of storing all derived models quickly becomes onerous, prompting users to get rid of intermediate models that might be useful for further analysis. Additionally, undesired behaviors in models are hard to track down (e.g., is a bug inherited from an upstream model?). In this paper, we propose a model versioning and management system called MGit that makes it easier to store, test, update, and collaborate on model derivatives. MGit introduces a lineage graph that records provenance and versioning information between models, optimizations to efficiently store model parameters, as well as abstractions over this lineage graph that facilitate relevant testing, updating and collaboration functionality. MGit is able to reduce the lineage graph's storage footprint by up to 7x and automatically update downstream models in response to updates to upstream models. △ Less

Submitted 14 July, 2023; originally announced July 2023.

arXiv:2306.17347 [pdf, other]

Mediation with External Summary Statistic Information (MESSI)

Authors: Jonathan Boss, Wei Hao, Amber Cathey, Barrett M. Welch, Kelly K. Ferguson, John D. Meeker, Jian Kang, Bhramar Mukherjee

Abstract: Environmental health studies are increasingly measuring endogenous omics data ($\boldsymbol{M}$) to study intermediary biological pathways by which an exogenous exposure ($\boldsymbol{A}$) affects a health outcome ($\boldsymbol{Y}$), given confounders ($\boldsymbol{C}$). Mediation analysis is frequently carried out to understand such mechanisms. If intermediary pathways are of interest, then there… ▽ More Environmental health studies are increasingly measuring endogenous omics data ($\boldsymbol{M}$) to study intermediary biological pathways by which an exogenous exposure ($\boldsymbol{A}$) affects a health outcome ($\boldsymbol{Y}$), given confounders ($\boldsymbol{C}$). Mediation analysis is frequently carried out to understand such mechanisms. If intermediary pathways are of interest, then there is likely literature establishing statistical and biological significance of the total effect, defined as the effect of $\boldsymbol{A}$ on $\boldsymbol{Y}$ given $\boldsymbol{C}$. For mediation models with continuous outcomes and mediators, we show that leveraging external summary-level information on the total effect improves estimation efficiency of the natural direct and indirect effects. Moreover, the efficiency gain depends on the asymptotic partial $R^2$ between the outcome ($\boldsymbol{Y}\mid\boldsymbol{M},\boldsymbol{A},\boldsymbol{C}$) and total effect ($\boldsymbol{Y}\mid\boldsymbol{A},\boldsymbol{C}$) models, with smaller (larger) values benefiting direct (indirect) effect estimation. We robustify our estimation procedure to incongenial external information by assuming the total effect follows a random distribution. This framework allows shrinkage towards the external information if the total effects in the internal and external populations agree. We illustrate our methodology using data from the Puerto Rico Testsite for Exploring Contamination Threats, where Cytochrome p450 metabolites are hypothesized to mediate the effect of phthalate exposure on gestational age at delivery. External information on the total effect comes from a recently published pooled analysis of 16 studies. The proposed framework blends mediation analysis with emerging data integration techniques. △ Less

Submitted 28 June, 2023; originally announced June 2023.

Comments: 32 pages, 6 figures

arXiv:2306.08727 [pdf, other]

Gauss Newton method for solving variational problems of PDEs with neural network discretizaitons

Authors: Wenrui Hao, Qingguo Hong, Xianlin Jin

Abstract: The numerical solution of differential equations using machine learning-based approaches has gained significant popularity. Neural network-based discretization has emerged as a powerful tool for solving differential equations by parameterizing a set of functions. Various approaches, such as the deep Ritz method and physics-informed neural networks, have been developed for numerical solutions. Trai… ▽ More The numerical solution of differential equations using machine learning-based approaches has gained significant popularity. Neural network-based discretization has emerged as a powerful tool for solving differential equations by parameterizing a set of functions. Various approaches, such as the deep Ritz method and physics-informed neural networks, have been developed for numerical solutions. Training algorithms, including gradient descent and greedy algorithms, have been proposed to solve the resulting optimization problems. In this paper, we focus on the variational formulation of the problem and propose a Gauss- Newton method for computing the numerical solution. We provide a comprehensive analysis of the superlinear convergence properties of this method, along with a discussion on semi-regular zeros of the vanishing gradient. Numerical examples are presented to demonstrate the efficiency of the proposed Gauss-Newton method. △ Less

Submitted 21 January, 2024; v1 submitted 14 June, 2023; originally announced June 2023.

arXiv:2305.19613 [pdf]

High-Entropy Enhanced Negative Thermal Expansion Perfomance in Antiperovkites

Authors: Xiuliang Yuan, Bing Wang, Ying Sun, Huaiming Guo, Kewen Shi, Sihao Deng, Lunhua He, Huiqing Lu, Hong Zhang, Shengdi Xu, Yi Du, Weichang Hao, Shengqi Chu, Zhijie Ma, Shihai An, Jin Cui, Dongmei Hu, Huiming Han, Cong Wang

Abstract: The negative thermal expansion (NTE) materials, which can act as thermal-expansion compensators to counteract the positive thermal expansion, have great applications merit in precision engineering. However, the exploration of NTE behavior with a wide temperature range has reached its upper ceiling through traditional doping strategies due to composition limitations. The unique sluggish characteris… ▽ More The negative thermal expansion (NTE) materials, which can act as thermal-expansion compensators to counteract the positive thermal expansion, have great applications merit in precision engineering. However, the exploration of NTE behavior with a wide temperature range has reached its upper ceiling through traditional doping strategies due to composition limitations. The unique sluggish characteristic in phase transition and extended optimization space in recent high entropy systems has great potential to broaden the temperature range in electronic transitions-induced NTE materials. Mn-based anti-perovskites offer an ideal platform for the exploration of high entropy NTE material due to their abundant element selection and controllable NTE performance. In this paper, the high entropy strategy is first introduced to broaden the NTE temperature range by relaxing the abrupt phase transition in Mn-based anti-perovskite nitride. We propose an empirical screening method to synthesize the high-entropy anti-perovskite (HEAP). it is found that magnetic phase separation from anti-ferromagnetic CII to paramagnetic CI surviving in an ultra-wide temperature range of 5K<=T<=350K (Delta_T=345K), revealing a unique sluggish characteristic. Consequently, a remarkable NTE behavior (up to Delta_T=235K, 5K<=T<=240K) with a coefficient of thermal expansion of -4.7x10-6/K, has been obtained in HEAP. It is worth noting that the temperature range is two/three times wider than that of low-entropy systems. The sluggish characteristic has been further experimentally proved to come from disturbed phase transition dynamics due to distortion in atomic spacing and chemical environmental fluctuation observed by the spherical aberration-corrected electron microscope. Our demonstration provides a unique paradigm for broadening the temperature range of NTE materials induced by phase transition through entropy engineering. △ Less

Submitted 4 March, 2024; v1 submitted 31 May, 2023; originally announced May 2023.

Comments: 34 pages

arXiv:2305.19329 [pdf, other]

Mitigating Test-Time Bias for Fair Image Retrieval

Authors: Fanjie Kong, Shuai Yuan, Weituo Hao, Ricardo Henao

Abstract: We address the challenge of generating fair and unbiased image retrieval results given neutral textual queries (with no explicit gender or race connotations), while maintaining the utility (performance) of the underlying vision-language (VL) model. Previous methods aim to disentangle learned representations of images and text queries from gender and racial characteristics. However, we show these a… ▽ More We address the challenge of generating fair and unbiased image retrieval results given neutral textual queries (with no explicit gender or race connotations), while maintaining the utility (performance) of the underlying vision-language (VL) model. Previous methods aim to disentangle learned representations of images and text queries from gender and racial characteristics. However, we show these are inadequate at alleviating bias for the desired equal representation result, as there usually exists test-time bias in the target retrieval set. So motivated, we introduce a straightforward technique, Post-hoc Bias Mitigation (PBM), that post-processes the outputs from the pre-trained vision-language model. We evaluate our algorithm on real-world image search datasets, Occupation 1 and 2, as well as two large-scale image-text datasets, MS-COCO and Flickr30k. Our approach achieves the lowest bias, compared with various existing bias-mitigation methods, in text-based image retrieval result while maintaining satisfactory retrieval performance. The source code is publicly available at \url{https://anonymous.4open.science/r/Fair_Text_based_Image_Retrieval-D8B2}. △ Less

Submitted 23 May, 2023; originally announced May 2023.

arXiv:2305.15193 [pdf, other]

Adaptive Policy Learning to Additional Tasks

Authors: Wenjian Hao, Zehui Lu, Zihao Liang, Tianyu Zhou, Shaoshuai Mou

Abstract: This paper develops a policy learning method for tuning a pre-trained policy to adapt to additional tasks without altering the original task. A method named Adaptive Policy Gradient (APG) is proposed in this paper, which combines Bellman's principle of optimality with the policy gradient approach to improve the convergence rate. This paper provides theoretical analysis which guarantees the converg… ▽ More This paper develops a policy learning method for tuning a pre-trained policy to adapt to additional tasks without altering the original task. A method named Adaptive Policy Gradient (APG) is proposed in this paper, which combines Bellman's principle of optimality with the policy gradient approach to improve the convergence rate. This paper provides theoretical analysis which guarantees the convergence rate and sample complexity of $\mathcal{O}(1/T)$ and $\mathcal{O}(1/ε)$, respectively, where $T$ denotes the number of iterations and $ε$ denotes the accuracy of the resulting stationary policy. Furthermore, several challenging numerical simulations, including cartpole, lunar lander, and robot arm, are provided to show that APG obtains similar performance compared to existing deterministic policy gradient methods while utilizing much less data and converging at a faster rate. △ Less

Submitted 24 May, 2023; originally announced May 2023.

arXiv:2305.15188 [pdf, other]

Policy Learning based on Deep Koopman Representation

Authors: Wenjian Hao, Paulo C. Heredia, Bowen Huang, Zehui Lu, Zihao Liang, Shaoshuai Mou

Abstract: This paper proposes a policy learning algorithm based on the Koopman operator theory and policy gradient approach, which seeks to approximate an unknown dynamical system and search for optimal policy simultaneously, using the observations gathered through interaction with the environment. The proposed algorithm has two innovations: first, it introduces the so-called deep Koopman representation int… ▽ More This paper proposes a policy learning algorithm based on the Koopman operator theory and policy gradient approach, which seeks to approximate an unknown dynamical system and search for optimal policy simultaneously, using the observations gathered through interaction with the environment. The proposed algorithm has two innovations: first, it introduces the so-called deep Koopman representation into the policy gradient to achieve a linear approximation of the unknown dynamical system, all with the purpose of improving data efficiency; second, the accumulated errors for long-term tasks induced by approximating system dynamics are avoided by applying Bellman's principle of optimality. Furthermore, a theoretical analysis is provided to prove the asymptotic convergence of the proposed algorithm and characterize the corresponding sampling complexity. These conclusions are also supported by simulations on several challenging benchmark environments. △ Less

Submitted 24 May, 2023; originally announced May 2023.

arXiv:2305.07772 [pdf, other]

Monitoring and Adapting ML Models on Mobile Devices

Authors: Wei Hao, Zixi Wang, Lauren Hong, Lingxiao Li, Nader Karayanni, Chengzhi Mao, Junfeng Yang, Asaf Cidon

Abstract: ML models are increasingly being pushed to mobile devices, for low-latency inference and offline operation. However, once the models are deployed, it is hard for ML operators to track their accuracy, which can degrade unpredictably (e.g., due to data drift). We design the first end-to-end system for continuously monitoring and adapting models on mobile devices without requiring feedback from users… ▽ More ML models are increasingly being pushed to mobile devices, for low-latency inference and offline operation. However, once the models are deployed, it is hard for ML operators to track their accuracy, which can degrade unpredictably (e.g., due to data drift). We design the first end-to-end system for continuously monitoring and adapting models on mobile devices without requiring feedback from users. Our key observation is that often model degradation is due to a specific root cause, which may affect a large group of devices. Therefore, once the system detects a consistent degradation across a large number of devices, it employs a root cause analysis to determine the origin of the problem and applies a cause-specific adaptation. We evaluate the system on two computer vision datasets, and show it consistently boosts accuracy compared to existing approaches. On a dataset containing photos collected from driving cars, our system improves the accuracy on average by 15%. △ Less

Submitted 17 May, 2023; v1 submitted 12 May, 2023; originally announced May 2023.

arXiv:2305.04162 [pdf, other]

Companion-Based Multi-Level Finite Element Method for Computing Multiple Solutions of Nonlinear Differential Equations

Authors: Wenrui Hao, Sun Lee, Young Ju Lee

Abstract: The use of nonlinear PDEs has led to significant advancements in various fields, such as physics, biology, ecology, and quantum mechanics. However, finding multiple solutions for nonlinear PDEs can be a challenging task, especially when suitable initial guesses are difficult to obtain. In this paper, we introduce a novel approach called the Companion-Based Multilevel finite element method (CBMFEM)… ▽ More The use of nonlinear PDEs has led to significant advancements in various fields, such as physics, biology, ecology, and quantum mechanics. However, finding multiple solutions for nonlinear PDEs can be a challenging task, especially when suitable initial guesses are difficult to obtain. In this paper, we introduce a novel approach called the Companion-Based Multilevel finite element method (CBMFEM), which can efficiently and accurately generate multiple initial guesses for solving nonlinear elliptic semi-linear equations with polynomial nonlinear terms using finite element methods with conforming elements. We provide a theoretical analysis of the error estimate of finite element methods using an appropriate notion of isolated solutions, for the nonlinear elliptic equation with multiple solutions and present numerical results obtained using CBMFEM which are consistent with the theoretical analysis. △ Less

Submitted 6 May, 2023; originally announced May 2023.

MSC Class: 49M37; 65N30; 90C99

arXiv:2304.02811 [pdf, other]

doi 10.1016/j.jcp.2023.112751

HomPINNs: homotopy physics-informed neural networks for solving the inverse problems of nonlinear differential equations with multiple solutions

Authors: Haoyang Zheng, Yao Huang, Ziyang Huang, Wenrui Hao, Guang Lin

Abstract: Due to the complex behavior arising from non-uniqueness, symmetry, and bifurcations in the solution space, solving inverse problems of nonlinear differential equations (DEs) with multiple solutions is a challenging task. To address this, we propose homotopy physics-informed neural networks (HomPINNs), a novel framework that leverages homotopy continuation and neural networks (NNs) to solve inverse… ▽ More Due to the complex behavior arising from non-uniqueness, symmetry, and bifurcations in the solution space, solving inverse problems of nonlinear differential equations (DEs) with multiple solutions is a challenging task. To address this, we propose homotopy physics-informed neural networks (HomPINNs), a novel framework that leverages homotopy continuation and neural networks (NNs) to solve inverse problems. The proposed framework begins with the use of NNs to simultaneously approximate unlabeled observations across diverse solutions while adhering to DE constraints. Through homotopy continuation, the proposed method solves the inverse problem by tracing the observations and identifying multiple solutions. The experiments involve testing the performance of the proposed method on one-dimensional DEs and applying it to solve a two-dimensional Gray-Scott simulation. Our findings demonstrate that the proposed method is scalable and adaptable, providing an effective solution for solving DEs with multiple solutions and unknown parameters. Moreover, it has significant potential for various applications in scientific computing, such as modeling complex systems and solving inverse problems in physics, chemistry, biology, etc. △ Less

Submitted 17 January, 2024; v1 submitted 5 April, 2023; originally announced April 2023.

Comments: 20 pages, 15 figures, 7 tables

Journal ref: Volume 500, 2024

arXiv:2304.00100 [pdf, other]

A Data-Driven Approach for Inverse Optimal Control

Authors: Zihao Liang, Wenjian Hao, Shaoshuai Mou

Abstract: This paper proposes a data-driven, iterative approach for inverse optimal control (IOC), which aims to learn the objective function of a nonlinear optimal control system given its states and inputs. The approach solves the IOC problem in a challenging situation when the system dynamics is unknown. The key idea of the proposed approach comes from the deep Koopman representation of the unknown syste… ▽ More This paper proposes a data-driven, iterative approach for inverse optimal control (IOC), which aims to learn the objective function of a nonlinear optimal control system given its states and inputs. The approach solves the IOC problem in a challenging situation when the system dynamics is unknown. The key idea of the proposed approach comes from the deep Koopman representation of the unknown system, which employs a deep neural network to represent observables for the Koopman operator. By assuming the objective function to be learned is parameterized as a linear combination of features with unknown weights, the proposed approach for IOC is able to achieve a Koopman representation of the unknown dynamics and the unknown weights in objective function together. Simulation is provided to verify the proposed approach. △ Less

Submitted 31 March, 2023; originally announced April 2023.

arXiv:2303.11815 [pdf, ps, other]

doi 10.1140/epjc/s10052-023-12275-3

The mass spectrum and strong decay properties of the charmed-strange mesons within Godfrey-Isgur model considering the coupled-channel effects

Authors: Jing-Jing Yang, Wei Hao, Xiaoyu Wang, De-Min Li, Yu-Xiao Li, En Wang

Abstract: Motivated by the recently observed $D_{s0}(2590)$ state by LHCb, we investigate the mass spectrum and the strong decay properties of the charmed-strange mesons within Godfrey-Isgur model considering the coupled-channel effects. Our results support that $D_{s0}^*(2317)$ and $D_{s1}(2460)$ can be interpreted as the $D_{s}(1^3P_0)$ and $D_{s}(1^3P_1)$ states with larger $DK$ and $D^*K$ components, re… ▽ More Motivated by the recently observed $D_{s0}(2590)$ state by LHCb, we investigate the mass spectrum and the strong decay properties of the charmed-strange mesons within Godfrey-Isgur model considering the coupled-channel effects. Our results support that $D_{s0}^*(2317)$ and $D_{s1}(2460)$ can be interpreted as the $D_{s}(1^3P_0)$ and $D_{s}(1^3P_1)$ states with larger $DK$ and $D^*K$ components, respectively, and $D_{s1}(2700)$, $D_{s1}(2536)$, $D^*_{s2}(2573)$, $D_{s1}^*(2860)$, $D_{s3}^*(2860)$, and $D_{sJ}^*(3040)$ can be well interpreted as the $D_s(2^3S_1)$, $D_s(1^1P_1)$, $D_s(1^3P_2)$, $D_s(1^3D_1)$, $D_s(1^3D_3)$, and $D_s(2^1P_1)$ states, respectively. Although, $D_{s0}(2590)$ mass is about 50 MeV less than our prediction for the $D_{s}(2^1S_0)$ state, its width is still in good agreement with the one of $D_{s}(2^1S_0)$. Therefore, $D_{s0}(2590)$ state needs to be further confirmed by the experimental measurements, and the more precise information about $D_{s0}(2590)$ will shed light on its assignment of $D_{s}(2^1S_0)$. Furthermore, we predict the masses and the strong decay properties of the charmed-strange mesons with masses around 3 GeV, which would be helpful to experimentally search for these states. △ Less

Submitted 4 December, 2023; v1 submitted 21 March, 2023; originally announced March 2023.

Comments: 8 pages

Journal ref: Eur. Phys. J. C 83, 1098 (2023)

arXiv:2302.12046 [pdf, ps, other]

Observation of Q-switched and continuous wave regimes with mode-hopping in Er-doped fiber lasers incorporating a dynamic population grating

Authors: Zengrun Wen, Xiulin Fan, Kaile Wang, Weiming Wang, Song Gao, Wenjing Hao, Yuanmei Gao, Yangjian Cai, Liren Zheng

Abstract: Dynamic population gratings (DPGs) in rare-earth doped fibers are prevalent devices in fiber lasers for the production of single-longitudinal-mode emission, Q-switched pulses, and wavelength self-sweeping regimes. This study presents a transition from Q-switched state to continuous wave (CW) state, accompanying irregular mode-hopping, in an erbium-doped fiber laser with a heavily-doped DPG centere… ▽ More Dynamic population gratings (DPGs) in rare-earth doped fibers are prevalent devices in fiber lasers for the production of single-longitudinal-mode emission, Q-switched pulses, and wavelength self-sweeping regimes. This study presents a transition from Q-switched state to continuous wave (CW) state, accompanying irregular mode-hopping, in an erbium-doped fiber laser with a heavily-doped DPG centered at 1549.95 nm. Our results demonstrate that the transition between these two states can be achieved by adjusting the pump power. The repetition frequency of the Q-switched pulse increases monotonically with the increasing pump power, while the pulse duration initially narrows and then expands because the reduced peak intensity weakens the nonlinear effect. Additionally, modulation peaks are evident on both the Q-switched pulse train and the CW background, which are induced by the irregular mode-hopping caused by the DPG. Furthermore, we observe that the central wavelength fluctuates within a range of 0.05 nm. These results provide valuable insight into the DPG effect in heavily-doped fibers. △ Less

Submitted 23 February, 2023; originally announced February 2023.

arXiv:2302.10424 [pdf, other]

Deep Learning via Neural Energy Descent

Authors: Wenrui Hao, Chunmei Wang, Xingjian Xu, Haizhao Yang

Abstract: This paper proposes the Nerual Energy Descent (NED) via neural network evolution equations for a wide class of deep learning problems. We show that deep learning can be reformulated as the evolution of network parameters in an evolution equation and the steady state solution of the partial differential equation (PDE) provides a solution to deep learning. This equation corresponds to a gradient des… ▽ More This paper proposes the Nerual Energy Descent (NED) via neural network evolution equations for a wide class of deep learning problems. We show that deep learning can be reformulated as the evolution of network parameters in an evolution equation and the steady state solution of the partial differential equation (PDE) provides a solution to deep learning. This equation corresponds to a gradient descent flow of a variational problem and hence the proposed time-dependent PDE solves an energy minimization problem to obtain a global minimizer of deep learning. This gives a novel interpretation and solution to deep learning optimization. The computational complexity of the proposed energy descent method can be enhanced by randomly sampling the spatial domain of the PDE leading to an efficient NED. Numerical examples are provided to demonstrate the numerical advantage of NED over stochastic gradient descent (SGD). △ Less

Submitted 20 February, 2023; originally announced February 2023.

Showing 1–50 of 181 results for author: Hao, W