Search | arXiv e-print repository

arXiv:2409.16637 [pdf, ps, other]

Deep-Learning Recognition of Scanning Transmission Electron Microscopy: Quantifying and Mitigating the Influence of Gaussian Noises

Authors: Hanlei Zhang, Jincheng Bai, Xiabo Chen, Can Li, Chuanjian Zhong, Jiye Fang, Guangwen Zhou

Abstract: Scanning transmission electron microscopy (STEM) is a powerful tool to reveal the morphologies and structures of materials, thereby attracting intensive interest from the scientific and industrial communities. The outstanding spatial (atomic level) and temporal (ms level) resolutions of the STEM techniques generate large amounts of high-definition data, thereby enabling the high-volume and high-speed analysis of materials. On the other hand, processing of the big dataset generated by STEM is time-consuming and beyond the capability of human-based manual work, which urgently calls for computer-based automation. In this work, we present a deep-learning mask region-based neural network (Mask R-CNN) for the recognition of nanoparticles imaged by STEM, as well as generating the associated dimensional analysis. The Mask R-CNN model was tested on simulated STEM-HAADF results with different Gaussian noises, particle shapes and particle sizes, and the results indicated that Gaussian noise has a determining influence on the accuracy of recognition. By applying Gaussian and Non-Local Means filters on the noise-containing STEM-HAADF results, the influences of noises are largely mitigated, and recognition accuracy is significantly improved. This filtering-recognition approach was further applied to experimental STEM-HAADF results, which yields satisfying accuracy compared with the traditional threshold methods. The deep-learning-based method developed in this work has great potential in analysis of the complicated structures and large data generated by STEM-HAADF.

Submitted 25 September, 2024; originally announced September 2024.

arXiv:2408.06597 [pdf, ps, other]

Line Spectral Estimation with Unlimited Sensing

Authors: Hongwei Wang, Jun Fang, Hongbin Li, Geert Leus

Abstract: In this paper, we consider the line spectral estimation problem in an unlimited sensing framework (USF), where a modulo analog-to-digital converter (ADC) is employed to fold the input signal back into a bounded interval before quantization. Such an operation is mathematically equivalent to taking the modulo of the input signal with respect to the interval. To overcome the noise sensitivity of higher-order difference-based methods, we explore the properties of the first-order difference of modulo samples, and develop two line spectral estimation algorithms based on first-order difference, which are robust against noise. Specifically, we show that, with a high probability, the first-order difference of the original samples is equivalent to that of the modulo samples. By utilizing this property, line spectral estimation is solved via a robust sparse signal recovery approach. The second algorithm is built on our finding that, with a sufficiently high sampling rate, the first-order difference of the original samples can be decomposed as a sum of the first-order difference of the modulo samples and a sequence whose elements are confined to three possible values. This decomposition enables us to formulate the line spectral estimation problem as a mixed integer linear program that can be efficiently solved. Simulation results show that both proposed methods are robust against noise and achieve a significant performance improvement over the higher-order difference-based method.

Submitted 12 August, 2024; originally announced August 2024.

arXiv:2408.04951 [pdf, ps, other]

CSI-Free Position Optimization for Movable Antenna Communication Systems: A Black-Box Optimization Approach

Authors: Xianlong Zeng, Jun Fang, Bin Wang, Boyu Ning, Hongbin Li

Abstract: Movable antenna (MA) is a new technology which leverages local movement of antennas to improve channel qualities and enhance the communication performance. Nevertheless, to fully realize the potential of MA systems, complete channel state information (CSI) between the transmitter-MA and the receiver-MA is required, which involves estimating a large number of channel parameters and incurs an excessive amount of training overhead. To address this challenge, in this paper, we propose a CSI-free MA position optimization method. The basic idea is to treat position optimization as a black-box optimization problem and calculate the gradient of the unknown objective function using zeroth-order (ZO) gradient approximation techniques. Simulation results show that the proposed ZO-based method, through adaptively adjusting the position of the MA, can achieve a favorable signal-to-noise-ratio (SNR) using a smaller number of position measurements than the CSI-based approach. Such a merit makes the proposed algorithm more adaptable to fast-changing propagation channels.

Submitted 9 August, 2024; originally announced August 2024.

Comments: 5 pages, 4 figures, submitted for possible IEEE publication

arXiv:2407.03026 [pdf, other]

Qifusion-Net: Layer-adapted Stream/Non-stream Model for End-to-End Multi-Accent Speech Recognition

Authors: Jinming Chen, Jingyi Fang, Yuanzhong Zheng, Yaoxuan Wang, Haojun Fei

Abstract: Currently, end-to-end (E2E) speech recognition methods have achieved promising performance. However, automatic speech recognition (ASR) models still face challenges in recognizing multi-accent speech accurately. We propose a layer-adapted fusion (LAF) model, called Qifusion-Net, which does not require any prior knowledge about the target accent. Based on a dynamic chunk strategy, our approach enables streaming decoding and can extract frame-level acoustic features, facilitating fine-grained information fusion. Experimental results demonstrate that our proposed methods outperform the baseline with relative reductions of 22.1% and 17.2% in character error rate (CER) across multi-accent test datasets on KeSpeech and MagicData-RMAC.

Submitted 3 July, 2024; originally announced July 2024.

Comments: accepted by Interspeech 2014, 5 pages, 1 figure

arXiv:2407.02160 [pdf, ps, other]

Intelligent Reflecting Surface-Assisted NLOS Sensing With OFDM Signals

Authors: Jilin Wang, Jun Fang, Hongbin Li, Lei Huang

Abstract: This work addresses the problem of intelligent reflecting surface (IRS) assisted target sensing in a non-line-of-sight (NLOS) scenario, where an IRS is employed to facilitate the radar/access point (AP) to sense the targets when the line-of-sight (LOS) path between the AP and the target is blocked by obstacles. To sense the targets, the AP transmits a train of uniformly-spaced orthogonal frequency division multiplexing (OFDM) pulses, and then perceives the targets based on the echoes from the AP-IRS-targets-IRS-AP channel. To resolve an inherent scaling ambiguity associated with IRS-assisted NLOS sensing, we propose a two-phase sensing scheme by exploiting the diversity in the illumination pattern of the IRS across two different phases. Specifically, the received echo signals from the two phases are formulated as third-order tensors. Then a canonical polyadic (CP) decomposition-based method is developed to estimate each target's parameters including the direction of arrival (DOA), Doppler shift and time delay. Our analysis reveals that the proposed method achieves reliable NLOS sensing using a modest quantity of pulse/subcarrier resources. Simulation results are provided to show the effectiveness of the proposed method under the challenging scenario where the degrees-of-freedom provided by the AP-IRS channel are not enough for resolving the scaling ambiguity.

Submitted 2 July, 2024; originally announced July 2024.

arXiv:2406.18935 [pdf]

Generalized Averaging Method for Power Electronics Modeling from DC to above Half the Switching Frequency

Authors: Hongchang Li, Kangping Wang, Jingyang Fang, Wenjie Chen, Xu Yang

Abstract: Modeling power electronic converters at frequencies close to or above half the switching frequency has been difficult due to the time-variant and discontinuous switching actions. This paper uses the properties of moving Fourier coefficients to develop the generalized averaging method, breaking through the limit of half the switching frequency. The paper also proposes the generalized average model for various switching signals, including pulse-width modulation (PWM), phase-shift modulation, pulse-frequency modulation (PFM), and state-dependent switching signals, so that circuits and modulators/controllers can be modeled separately and combined flexibly. Using the Laplace transform of moving Fourier coefficients, the coupling of signals and their sidebands at different frequencies is clearly described as the coupling of moving Fourier coefficients at the same frequency in a linear time-invariant system framework. The modeling method is applied to a PWM controlled boost converter, a V2 constant on-time controlled buck converter, and a PFM controlled LLC converter, for demonstration and validation. Experimental results of the converters in different operating modes show that the proposed models have higher accuracy than existing models, especially in the frequency range close to or above half the switching frequency. The developed method can be applied to almost all types of power electronic converters.

Submitted 27 June, 2024; originally announced June 2024.

arXiv:2406.16381 [pdf, other]

Polar-Coded Tensor-Based Unsourced Random Access with Soft Decoding

Authors: Jiaqi Fang, Yan Liang, Gangle Sun, Hongwei Hou, Yafei Wang, Li You, Wenjin Wang

Abstract: The unsourced random access (URA) has emerged as a viable scheme for supporting the massive machine-type communications (mMTC) in the sixth generation (6G) wireless networks. Notably, the tensor-based URA (TURA), with its inherent tensor structure, stands out by simultaneously enhancing performance and reducing computational complexity for the multi-user separation, especially in mMTC networks with a large number of active devices. However, the current TURA scheme lacks a soft decoder, thus precluding the incorporation of existing advanced coding techniques. In order to fully explore the potential of the TURA, this paper investigates the polar-coded TURA (PTURA) scheme and develops the corresponding iterative Bayesian receiver with feedback (IBR-FB). Specifically, in the IBR-FB, we propose the Grassmannian modulation-aided Bayesian tensor decomposition (GM-BTD) algorithm under the variational Bayesian learning (VBL) framework, which leverages the property of the Grassmannian modulation to facilitate the convergence of the VBL process, and has the ability to generate the required soft information without the knowledge of the number of active devices. Furthermore, based on the soft information produced by the GM-BTD, we design the soft Grassmannian demodulator in the IBR-FB. Extensive simulation results demonstrate that the proposed PTURA in conjunction with the IBR-FB surpasses the existing state-of-the-art unsourced random access scheme in terms of accuracy and computational complexity.

Submitted 24 June, 2024; originally announced June 2024.

Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

arXiv:2404.11304 [pdf]

Dynamic Phasor Modeling of Single-Phase Grid-Forming Converters

Authors: Wenjia Si, Chenming Liu, Steven Liu, Hongchang Li, Chenghui Zhang, Jingyang Fang

Abstract: In modern power systems, grid-forming power converters (GFMCs) have emerged as an enabling technology. However, the modeling of single-phase GFMCs faces new challenges. In particular, the nonlinear orthogonal signal generation unit, crucial for power measurement, still lacks an accurate model. To overcome the challenges, this letter proposes a dynamic phasor model of single-phase GFMCs. Moreover, we linearize the proposed model and perform stability analysis, which confirms that the proposed model is more accurate than existing models. Experimental results validate the improved accuracy of the proposed dynamic phasor model.

Submitted 17 April, 2024; originally announced April 2024.

arXiv:2404.05544 [pdf, other]

Near/Far-Field Channel Estimation For Terahertz Systems With ELAAs: A Block-Sparse-Aware Approach

Authors: Hongwei Wang, Jun Fang, Huiping Duan, Hongbin Li

Abstract: Millimeter wave/Terahertz (mmWave/THz) communication with extremely large-scale antenna arrays (ELAAs) offers a promising solution to meet the escalating demand for high data rates in next-generation communications. A large array aperture, along with the ever increasing carrier frequency within the mmWave/THz bands, leads to a large Rayleigh distance. As a result, the traditional plane-wave assumption may not hold for mmWave/THz systems featuring ELAAs. In this paper, we consider the problem of hybrid near/far-field channel estimation by taking spherical wave propagation into account. By analyzing the coherence properties of any two near-field steering vectors, we prove that the hybrid near/far-field channel admits a block-sparse representation on a specially designed orthogonal dictionary. Specifically, the percentage of nonzero elements of such a block-sparse representation decreases in the order of $1/\sqrt{N}$, which tends to zero as the number of antennas, $N$, grows. Such a block-sparse representation allows channel estimation to be converted into a block-sparse signal recovery problem. Simulation results are provided to verify our theoretical results and illustrate the performance of the proposed channel estimation approach in comparison with existing state-of-the-art methods.

Submitted 8 April, 2024; originally announced April 2024.

arXiv:2402.18018 [pdf, ps, other]

Communication Efficient ConFederated Learning: An Event-Triggered SAGA Approach

Authors: Bin Wang, Jun Fang, Hongbin Li, Yonina C. Eldar

Abstract: Federated learning (FL) is a machine learning paradigm that targets model training without gathering the local data dispersed over various data sources. Standard FL, which employs a single server, can only support a limited number of users, leading to degraded learning capability. In this work, we consider a multi-server FL framework, referred to as Confederated Learning (CFL), in order to accommodate a larger number of users. A CFL system is composed of multiple networked edge servers, with each server connected to an individual set of users. Decentralized collaboration among servers is leveraged to harness all users' data for model training. Due to the potentially massive number of users involved, it is crucial to reduce the communication overhead of the CFL system. We propose a stochastic gradient method for distributed learning in the CFL framework. The proposed method incorporates a conditionally-triggered user selection (CTUS) mechanism as the central component to effectively reduce communication overhead. Relying on a delicately designed triggering condition, the CTUS mechanism allows each server to select only a small number of users to upload their gradients, without significantly jeopardizing the convergence performance of the algorithm. Our theoretical analysis reveals that the proposed algorithm enjoys a linear convergence rate. Simulation results show that it achieves substantial improvement over state-of-the-art algorithms in terms of communication efficiency.

Submitted 27 February, 2024; originally announced February 2024.

arXiv:2401.17681 [pdf, ps, other]

Joint Transceiver Optimization for MmWave/THz MU-MIMO ISAC Systems

Authors: Peilan Wang, Jun Fang, Xianlong Zeng, Zhi Chen, Hongbin Li

Abstract: In this paper, we consider the problem of joint transceiver design for millimeter wave (mmWave)/Terahertz (THz) multi-user MIMO integrated sensing and communication (ISAC) systems. Such a problem is formulated into a nonconvex optimization problem, with the objective of maximizing a weighted sum of communication users' rates and the passive radar's signal-to-clutter-and-noise-ratio (SCNR). By exploring a low-dimensional subspace property of the optimal precoder, a low-complexity block-coordinate-descent (BCD)-based algorithm is proposed. Our analysis reveals that the hybrid analog/digital beamforming structure can attain the same performance as that of a fully digital precoder, provided that the number of radio frequency (RF) chains is no less than the number of resolvable signal paths. Also, through expressing the precoder as a sum of a communication-precoder and a sensing-precoder, we develop an analytical solution to the joint transceiver design problem by generalizing the idea of block-diagonalization (BD) to the ISAC system. Simulation results show that with a proper tradeoff parameter, the proposed methods can achieve a decent compromise between communication and sensing, where the performance of each communication/sensing task experiences only a mild performance loss as compared with the performance attained by optimizing exclusively for a single task.

Submitted 31 January, 2024; originally announced January 2024.

arXiv:2312.15741 [pdf]

Improving the Accuracy and Interpretability of Neural Networks for Wind Power Forecasting

Authors: Wenlong Liao, Fernando Porte-Agel, Jiannong Fang, Birgitte Bak-Jensen, Zhe Yang, Gonghao Zhang

Abstract: Deep neural networks (DNNs) are receiving increasing attention in wind power forecasting due to their ability to effectively capture complex patterns in wind data. However, their forecasted errors are severely limited by the local optimal weight issue in optimization algorithms, and their forecasted behavior also lacks interpretability. To address these two challenges, this paper firstly proposes simple but effective triple optimization strategies (TriOpts) to accelerate the training process and improve the model performance of DNNs in wind power forecasting. Then, permutation feature importance (PFI) and local interpretable model-agnostic explanation (LIME) techniques are innovatively presented to interpret forecasted behaviors of DNNs, from global and instance perspectives. Simulation results show that the proposed TriOpts not only drastically improve the model generalization of DNNs for both the deterministic and probabilistic wind power forecasting, but also accelerate the training process. Besides, the proposed PFI and LIME techniques can accurately estimate the contribution of each feature to wind power forecasting, which helps to construct feature engineering and understand how to obtain forecasted values for a given sample.

Submitted 25 December, 2023; originally announced December 2023.

Comments: 10 pages, 10 figures

arXiv:2310.18629 [pdf]

Explainable Modeling for Wind Power Forecasting: A Glass-Box Approach with High Accuracy

Authors: Wenlong Liao, Fernando Porte-Agel, Jiannong Fang, Birgitte Bak-Jensen, Guangchun Ruan, Zhe Yang

Abstract: Machine learning models (e.g., neural networks) achieve high accuracy in wind power forecasting, but they are usually regarded as black boxes that lack interpretability. To address this issue, the paper proposes a glass-box approach that combines high accuracy with transparency for wind power forecasting. Specifically, the core is to sum up the feature effects by constructing shape functions, which effectively map the intricate non-linear relationships between wind power output and input features. Furthermore, the forecasting model is enriched by incorporating interaction terms that adeptly capture interdependencies and synergies among the input features. The additive nature of the proposed glass-box approach ensures its interpretability. Simulation results show that the proposed glass-box approach effectively interprets the results of wind power forecasting from both global and instance perspectives. Besides, it outperforms most benchmark models and exhibits comparable performance to the best-performing neural networks. This dual strength of transparency and high accuracy positions the proposed glass-box approach as a compelling choice for reliable wind power forecasting.

Submitted 26 February, 2024; v1 submitted 28 October, 2023; originally announced October 2023.

arXiv:2309.02318 [pdf, other]

TiAVox: Time-aware Attenuation Voxels for Sparse-view 4D DSA Reconstruction

Authors: Zhenghong Zhou, Huangxuan Zhao, Jiemin Fang, Dongqiao Xiang, Lei Chen, Lingxia Wu, Feihong Wu, Wenyu Liu, Chuansheng Zheng, Xinggang Wang

Abstract: Four-dimensional Digital Subtraction Angiography (4D DSA) plays a critical role in the diagnosis of many medical diseases, such as Arteriovenous Malformations (AVM) and Arteriovenous Fistulas (AVF). Despite its significant application value, the reconstruction of 4D DSA demands numerous views to effectively model the intricate vessels and radiocontrast flow, thereby implying a significant radiation dose. To address this high radiation issue, we propose a Time-aware Attenuation Voxel (TiAVox) approach for sparse-view 4D DSA reconstruction, which paves the way for high-quality 4D imaging. Additionally, 2D and 3D DSA imaging results can be generated from the reconstructed 4D DSA images. TiAVox introduces 4D attenuation voxel grids, which reflect attenuation properties from both spatial and temporal dimensions. It is optimized by minimizing discrepancies between the rendered images and sparse 2D DSA images. Without any neural network involved, TiAVox enjoys specific physical interpretability. The parameters of each learnable voxel represent the attenuation coefficients. We validated the TiAVox approach on both clinical and simulated datasets, achieving a 31.23 Peak Signal-to-Noise Ratio (PSNR) for novel view synthesis using only 30 views on the clinically sourced dataset, whereas traditional Feldkamp-Davis-Kress methods required 133 views. Similarly, with merely 10 views from the synthetic dataset, TiAVox yielded a PSNR of 34.32 for novel view synthesis and 41.40 for 3D reconstruction. We also executed ablation studies to corroborate the essential components of TiAVox. The code will be publicly available.

Submitted 19 December, 2023; v1 submitted 5 September, 2023; originally announced September 2023.

Comments: 10 pages, 8 figures

arXiv:2308.04892 [pdf, other]

Transmission and Color-guided Network for Underwater Image Enhancement

Authors: Pan Mu, Jing Fang, Haotian Qian, Cong Bai

Abstract: In recent years, with the continuous development of the marine industry, underwater image enhancement has attracted plenty of attention. Unfortunately, light propagating in water is absorbed by water bodies and scattered by suspended particles, resulting in color deviation and low contrast. To solve these two problems, we propose an Adaptive Transmission and Dynamic Color guided network (named ATDCnet) for underwater image enhancement. In particular, to exploit the knowledge of physics, we design an Adaptive Transmission-directed Module (ATM) to better guide the network. To deal with the color deviation problem, we design a Dynamic Color-guided Module (DCM) to post-process the enhanced image color. Further, we design an Encoder-Decoder-based Compensation (EDC) structure with attention and a multi-stage feature fusion mechanism to perform color restoration and contrast enhancement simultaneously. Extensive experiments demonstrate the state-of-the-art performance of the ATDCnet on multiple benchmark datasets.

Submitted 9 August, 2023; originally announced August 2023.

Comments: 6 pages; Accepted at IEEE ICME

arXiv:2307.00491 [pdf, other]

Line Spectrum Estimation and Detection with Few-bit ADCs: Theoretical Analysis and Generalized NOMP Algorithm

Authors: Jiang Zhu, Hansheng Zhang, Ning Zhang, Jun Fang, Fengzhong Qu

Abstract: As radar systems will be equipped with thousands of antenna elements and wide bandwidth, the associated costs and power consumption become exceedingly high, and a potential solution is to adopt low-resolution quantization technology, which not only reduces data storage needs but also lowers power and hardware costs. This paper focuses on line spectral estimation and detection (LSE&D) with few-bit ADCs (typically 1-4 bits) by investigating the signal-to-noise ratio (SNR) loss, establishing a framework to understand the impact of intersinusoidal interference, the bit-depth of the quantizer, and the noise variance on weak signal detection in scenarios involving multiple sinusoids under low-resolution quantization. Additionally, a low-complexity, super-resolution, and constant false alarm rate (CFAR) algorithm, named generalized Newtonized orthogonal matching pursuit (GNOMP), is proposed. Extensive numerical simulations are conducted to validate the theoretical findings, particularly in terms of the detection probability bound. The effectiveness of GNOMP is demonstrated through comparisons with state-of-the-art algorithms, the Cramér-Rao bound, and the detection probability bound. Real data acquired by mmWave radar further substantiates the effectiveness of GNOMP in practical applications.

Submitted 3 August, 2024; v1 submitted 2 July, 2023; originally announced July 2023.

arXiv:2303.12249 [pdf, other]

State-of-the-art optical-based physical adversarial attacks for deep learning computer vision systems

Authors: Junbin Fang, You Jiang, Canjian Jiang, Zoe L. Jiang, Siu-Ming Yiu, Chuanyi Liu

Abstract: Adversarial attacks can mislead deep learning models into making false predictions by implanting small perturbations in the original input that are imperceptible to the human eye, which poses a huge security threat to computer vision systems based on deep learning. Compared with digital adversarial attacks, physical adversarial attacks are more realistic, as the perturbation is introduced to the input before it is captured and converted to a binary image inside the vision system. In this paper, we focus on physical adversarial attacks and further classify them into invasive and non-invasive. Optical-based physical adversarial attack techniques (e.g. using light irradiation) belong to the non-invasive category. Because the perturbations are very similar to the effects generated by a natural environment in the real world, they can be easily ignored by humans. They are highly invisible and executable and can pose a significant or even lethal threat to real systems. This paper focuses on optical-based physical adversarial attack techniques for computer vision systems, with emphasis on the introduction and discussion of these techniques.

Submitted 21 March, 2023; originally announced March 2023.

Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

arXiv:2303.07089

Range Resolution Enhanced Method with Spectral Properties for Hyperspectral Lidar

Authors: Yuhao Xia, Shilong Xu, Hui Shao, Ahui Hou, Jiajie Fang, Fei Han, Youlong Chen, Jiaqi Wen, Yuwei Chen, Yihua Hu

Abstract: Waveform decomposition is needed as a first step in the extraction of various types of geometric and spectral information from hyperspectral full-waveform LiDAR echoes. We present a new approach to deal with the "Pseudo-monopulse" waveform formed by the overlapped waveforms from multi-targets when they are very close. We first use a single skew-normal distribution (SND) model to fit the waveforms of all spectral channels and count the geometric center position distribution of the echoes to decide whether the echo contains multi-targets. The geometric center position distribution of the "Pseudo-monopulse" presents aggregation and asymmetry with the change of wavelength, while such an asymmetric phenomenon cannot be found in the echoes of a single target. Both theoretical and experimental data verify this point. Based on this observation, we further propose a hyperspectral waveform decomposition method utilizing the SND mixture model by: 1) initializing new waveform component parameters and their ranges based on the distinction of the three characteristics (geometric center position, pulse width, and skew-coefficient) between the echo and the fitted SND waveform; 2) conducting single-channel waveform decomposition for all channels; 3) setting thresholds to find outlier channels based on statistical parameters of all single-channel decomposition results (the standard deviation and the means of geometric center position); and 4) re-conducting single-channel waveform decomposition for these outlier channels. The proposed method significantly improves the range resolution from 60 cm to as fine as 5 cm for a 4 ns width laser pulse and represents the state-of-the-art in "Pseudo-monopulse" waveform decomposition.

Submitted 2 March, 2023; originally announced March 2023.

Comments: arXiv admin comment: This version has been removed by arXiv administrators as the submitter did not have the rights to agree to the license at the time of submission

arXiv:2301.11066 [pdf, other]

Channel Estimation for RIS-aided mmWave Massive MIMO System Using Few-bit ADCs

Authors: Ruizhe Wang, Hong Ren, Cunhua Pan, Jun Fang, Mianxiong Dong, Octavia A. Dobre

Abstract: Millimeter wave (mmWave) massive multiple-input multiple-output (massive MIMO) is one of the most promising technologies for fifth-generation and beyond wireless communication systems. However, a large number of antennas incurs high power consumption and hardware costs, and high-frequency communications place a heavy burden on the analog-to-digital converters (ADCs) at the base station (BS). Furthermore, it is too costly to equip each antenna with a high-precision ADC in a large antenna array system. It is promising to adopt low-resolution ADCs to address this problem. In this paper, we investigate cascaded channel estimation for a mmWave massive MIMO system aided by a reconfigurable intelligent surface (RIS), with the BS equipped with few-bit ADCs. Due to the low-rank property of the cascaded channel, the estimation of the cascaded channel can be formulated as a low-rank matrix completion problem. We introduce a Bayesian optimal estimation framework for estimating the user-RIS-BS cascaded channel to tackle the information loss caused by quantization. To implement the estimator and achieve the matrix completion, we use the efficient bilinear generalized approximate message passing (BiG-AMP) algorithm. Extensive simulation results verify that our proposed method can accurately estimate the cascaded channel for the RIS-aided mmWave massive MIMO system with low-resolution ADCs.

Submitted 26 January, 2023; originally announced January 2023.

arXiv:2301.09248 [pdf, ps, other]

Target-Mounted Intelligent Reflecting Surface for Joint Location and Orientation Estimation

Authors: Peilan Wang, Weidong Mei, Jun Fang, Rui Zhang

Abstract: Intelligent reflecting surface (IRS) has been widely recognized as an efficient technique to reconfigure the electromagnetic environment in favor of wireless communication performance. In this paper, we propose a new application of IRS for device-free target sensing via joint location and orientation estimation. In particular, different from existing works that use the IRS as an additional anchor node for localization/sensing, we consider mounting the IRS on the sensing target, so that the IRS's location and orientation can be estimated as those of the target by leveraging the IRS's controllable signal reflection. To this end, we first propose a tensor-based method to acquire essential angle information between the IRS and the sensing transmitter as well as a set of distributed sensing receivers. Next, based on the estimated angle information, we formulate two optimization problems to estimate the location and orientation of the IRS/target, respectively, and obtain locally optimal solutions to them by invoking two iterative algorithms, namely, the gradient descent method and manifold optimization. In particular, we show that the orientation estimation problem admits a closed-form solution in a special case that usually holds in practice. Furthermore, theoretical analysis is conducted to draw essential insights into the proposed sensing system design and performance. Simulation results verify our theoretical analysis and demonstrate that the proposed methods can achieve high estimation accuracy, close to the theoretical bound.

Submitted 22 January, 2023; originally announced January 2023.

Comments: 30 pages

arXiv:2212.04314 [pdf, other]

A Scale-Arbitrary Image Super-Resolution Network Using Frequency-domain Information

Authors: Jing Fang, Yinbo Yu, Zhongyuan Wang, Xin Ding, Ruimin Hu

Abstract: Image super-resolution (SR) is a technique to recover lost high-frequency information in low-resolution (LR) images. Spatial-domain information has been widely exploited to implement image SR, so a new trend is to involve frequency-domain information in SR tasks. Besides, image SR is typically application-oriented, and various computer vision tasks call for arbitrary image magnification. Therefore, in this paper, we study image features in the frequency domain to design a novel scale-arbitrary image SR network. First, we statistically analyze LR-HR image pairs of several datasets under different scale factors and find that the high-frequency spectra of different images under different scale factors suffer from different degrees of degradation, but the valid low-frequency spectra tend to be retained within a certain distribution range. Then, based on this finding, we devise an adaptive scale-aware feature division mechanism using deep reinforcement learning, which can accurately and adaptively divide the frequency spectrum into the low-frequency part to be retained and the high-frequency part to be recovered. Finally, we design a scale-aware feature recovery module to capture and fuse multi-level features for reconstructing the high-frequency spectrum at arbitrary scale factors. Extensive experiments on public datasets show the superiority of our method compared with state-of-the-art methods.

Submitted 8 December, 2022; originally announced December 2022.

arXiv:2210.10302 [pdf, other]

CFAR based NOMP for Line Spectral Estimation and Detection

Authors: Menghuai Xu, Jiang Zhu, Jun Fang, Ning Zhang, Zhiwei Xu

Abstract: The line spectrum estimation problem is considered in this paper. We propose a CFAR-based Newtonized OMP (NOMP-CFAR) method which can maintain a desired false alarm rate without knowledge of the noise variance. NOMP-CFAR consists of two steps, namely, an initialization step and a detection step. In the initialization step, NOMP is employed to obtain candidate sinusoidal components. In the detection step, a CFAR detector is applied to detect each candidate frequency and remove the most unlikely frequency component. Then, Newton refinements are used to refine the remaining parameters. The relationship between the false alarm rate and the required threshold is established. Compared with NOMP, NOMP-CFAR has only $1$ dB performance loss in the additive white Gaussian noise scenario with false alarm probability $10^{-2}$ and detection probability $0.8$, without knowledge of the noise variance. In the varied noise variance scenario, NOMP-CFAR still preserves its CFAR property, while NOMP violates it. Besides, real experiments are also conducted to demonstrate the detection performance of NOMP-CFAR, compared to CFAR and NOMP.
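As a rough illustration of how a false alarm rate can fix a detection threshold without knowing the noise variance, the sketch below uses the textbook cell-averaging CFAR rule for square-law (exponentially distributed) noise samples. This is an assumption chosen for illustration, not the threshold relationship derived in the paper above; the function names `ca_cfar_scale` and `ca_cfar_threshold` are hypothetical.

```python
def ca_cfar_scale(num_cells, pfa):
    """Scale factor alpha for cell-averaging CFAR.

    Assumes the reference cells hold i.i.d. exponentially distributed
    noise power samples (square-law detector), for which
    P_fa = (1 + alpha / N) ** (-N).
    """
    if num_cells < 1 or not (0.0 < pfa < 1.0):
        raise ValueError("need num_cells >= 1 and 0 < pfa < 1")
    return num_cells * (pfa ** (-1.0 / num_cells) - 1.0)


def ca_cfar_threshold(reference_cells, pfa):
    """Threshold = alpha * (mean noise power estimated from reference cells)."""
    n = len(reference_cells)
    noise_estimate = sum(reference_cells) / n
    return ca_cfar_scale(n, pfa) * noise_estimate


# Hypothetical usage: four reference cells of unit noise power, P_fa = 1e-2.
threshold = ca_cfar_threshold([1.0, 1.0, 1.0, 1.0], 1e-2)
```

Because the threshold scales with the noise level estimated from the data, the false alarm probability stays at the design value when the noise variance changes, which is the property the abstract refers to as CFAR.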

Submitted 19 October, 2022; originally announced October 2022.

arXiv:2210.01337 [pdf, ps, other]

Compressed CPD-Based Channel Estimation and Joint Beamforming for RIS-Assisted Millimeter Wave Communications

Authors: Xi Zheng, Jun Fang, Hongwei Wang, Peilan Wang, Hongbin Li

Abstract: We consider the problem of channel estimation and joint active and passive beamforming for reconfigurable intelligent surface (RIS) assisted millimeter wave (mmWave) multiple-input multiple-output (MIMO) orthogonal frequency division multiplexing (OFDM) systems. We show that, with a well-designed frame-based training protocol, the received pilot signal can be organized into a low-rank third-order tensor that admits a canonical polyadic decomposition (CPD). Based on this observation, we propose two CPD-based methods for estimating the cascade channels associated with different subcarriers. The proposed methods exploit the intrinsic low-rankness of the CPD formulation, which is a result of the sparse scattering characteristics of mmWave channels, and thus have the potential to achieve a significant training overhead reduction. Specifically, our analysis shows that the proposed methods have a sample complexity that scales quadratically with the sparsity of the cascade channel. Also, by utilizing the singular value decomposition-like structure of the effective channel, this paper develops a joint active and passive beamforming method based on the estimated cascade channels. Simulation results show that the proposed CPD-based channel estimation methods attain mean square errors that are close to the Cramér-Rao bound (CRB) and present a clear advantage over the compressed sensing-based method. In addition, the proposed joint beamforming method can effectively utilize the estimated channel parameters to achieve superior beamforming performance.

Submitted 3 October, 2022; originally announced October 2022.

Comments: arXiv admin note: text overlap with arXiv:2203.16164

arXiv:2209.08500 [pdf, other]

A Map-matching Algorithm with Extraction of Multi-group Information for Low-frequency Data

Authors: Jie Fang, Xiongwei Wu, Dianchao Lin, Mengyun Xu, Huahua Wu, Xuesong Wu, Ting Bi

Abstract: The growing use of probe vehicles generates a huge amount of GNSS data. Limited by satellite positioning technology, further improving the accuracy of map-matching is challenging work, especially for low-frequency trajectories. When matching a trajectory, the ego vehicle's spatial-temporal information of the present trip is the most useful with the least amount of data. In addition, there is a large amount of other data, e.g., other vehicles' states and past prediction results, but it is hard to extract useful information from it for matching maps and inferring paths. Most map-matching studies use only the ego vehicle's data and ignore other vehicles' data. Motivated by this, this paper designs a new map-matching method to make full use of "Big data". We first sort all data into four groups according to their spatial and temporal distance from the present matching probe, which allows us to sort them by usefulness. Then we design three different methods to extract valuable information (scores) from them: a score for speed and bearing, a score for historical usage, and a score for traffic state using the spectral graph Markov neural network. Finally, we use a modified top-K shortest-path method to search the candidate paths within an ellipse region and then use the fused score to infer the path (projected location). We test the proposed method against baseline algorithms using a real-world dataset in China. The results show that all scoring methods can enhance map-matching accuracy. Furthermore, our method outperforms the others, especially when the GNSS probing frequency is less than 0.01 Hz.

Submitted 18 September, 2022; originally announced September 2022.

Comments: 10 pages, 11 figures, 4 tables

arXiv:2206.15069 [pdf, other]

PVT-COV19D: Pyramid Vision Transformer for COVID-19 Diagnosis

Authors: Lilang Zheng, Jiaxuan Fang, Xiaorun Tang, Hanzhang Li, Jiaxin Fan, Tianyi Wang, Rui Zhou, Zhaoyan Yan

Abstract: With the outbreak of COVID-19, a large number of relevant studies have emerged in recent years. We propose an automatic COVID-19 diagnosis framework based on lung CT scan images, the PVT-COV19D. In order to accommodate the different dimensions of the image input, we first classified the images using Transformer models, then sampled the images in the dataset according to a normal distribution, and fed the sampling results into the modified PVTv2 model for training. A large number of experiments on the COV19-CT-DB dataset demonstrate the effectiveness of the proposed method.

Submitted 30 June, 2022; originally announced June 2022.

Comments: 8 pages, 1 figure

arXiv:2206.11458 [pdf, other]

Weighted Concordance Index Loss-based Multimodal Survival Modeling for Radiation Encephalopathy Assessment in Nasopharyngeal Carcinoma Radiotherapy

Authors: Jiansheng Fang, Anwei Li, Pu-Yun OuYang, Jiajian Li, Jingwen Wang, Hongbo Liu, Fang-Yun Xie, Jiang Liu

Abstract: Radiation encephalopathy (REP) is the most common complication of nasopharyngeal carcinoma (NPC) radiotherapy. It is highly desirable to assist clinicians in optimizing the NPC radiotherapy regimen to reduce radiotherapy-induced temporal lobe injury (RTLI) according to the probability of REP onset. To the best of our knowledge, this is the first exploration of predicting radiotherapy-induced REP by jointly exploiting image and non-image data in the NPC radiotherapy regimen. We cast REP prediction as a survival analysis task and evaluate the predictive accuracy in terms of the concordance index (CI). We design a deep multimodal survival network (MSN) with two feature extractors to learn discriminative features from multimodal data. One feature extractor imposes feature selection on non-image data, and the other learns visual features from images. Because the existing balanced CI (BCI) loss function, which directly maximizes the CI, is sensitive to uneven sampling per batch, we propose a novel weighted CI (WCI) loss function to leverage all REP samples effectively by assigning them different weights with a dual average operation. We further introduce a temperature hyper-parameter for our WCI to sharpen the risk difference of sample pairs to help model convergence. We extensively evaluate our WCI on a private dataset to demonstrate its favourability against its counterparts. The experimental results also show that multimodal data of NPC radiotherapy can bring more gains for REP risk prediction.
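For readers unfamiliar with the evaluation metric named above, the following is a minimal sketch of the standard (Harrell's) concordance index for right-censored survival data. It is not the BCI or WCI loss from the paper; the function name `concordance_index` and the toy data are assumptions made for illustration only.

```python
def concordance_index(times, events, risks):
    """Harrell's C-index for right-censored data.

    times  : observed time for each subject (event or censoring time)
    events : 1/True if the event was observed, 0/False if censored
    risks  : predicted risk scores (higher means earlier expected event)

    A pair (i, j) is comparable when subject i had an observed event
    strictly before subject j's observed time.
    """
    concordant = 0.0
    comparable = 0
    n = len(times)
    for i in range(n):
        if not events[i]:
            continue  # a censored subject cannot anchor a comparable pair
        for j in range(n):
            if times[i] < times[j]:
                comparable += 1
                if risks[i] > risks[j]:
                    concordant += 1.0   # correctly ordered pair
                elif risks[i] == risks[j]:
                    concordant += 0.5   # tied predictions count as half
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable


# Toy data (hypothetical): the third subject is censored at time 6.
c_index = concordance_index(
    times=[2, 4, 6, 8],
    events=[1, 1, 0, 1],
    risks=[0.9, 0.7, 0.8, 0.1],
)
```

In this toy case five pairs are comparable and four of them are ordered correctly, so the index is 0.8; a value of 0.5 corresponds to random ordering and 1.0 to perfect ordering.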

Submitted 22 June, 2022; originally announced June 2022.

Comments: 11 pages, 3 figures, MICCAI2022

arXiv:2206.03049 [pdf, other]

Siamese Encoder-based Spatial-Temporal Mixer for Growth Trend Prediction of Lung Nodules on CT Scans

Authors: Jiansheng Fang, Jingwen Wang, Anwei Li, Yuguang Yan, Yonghe Hou, Chao Song, Hongbo Liu, Jiang Liu

Abstract: In the management of lung nodules, it is desirable to predict nodule evolution in terms of its diameter variation on Computed Tomography (CT) scans and then provide a follow-up recommendation according to the predicted growth trend of the nodule. In order to improve the performance of growth trend prediction for lung nodules, it is vital to compare the changes of the same nodule in consecutive CT scans. Motivated by this, we screened out 4,666 subjects with more than two consecutive CT scans from the National Lung Screening Trial (NLST) dataset to organize a temporal dataset called NLSTt. Specifically, we first detect and pair regions of interest (ROIs) covering the same nodule based on registered CT scans. After that, we predict the texture category and diameter size of the nodules through models. Last, we annotate the evolution class of each nodule according to its changes in diameter. Based on the built NLSTt dataset, we propose a siamese encoder to simultaneously exploit the discriminative features of 3D ROIs detected from consecutive CT scans. Then we design a novel spatial-temporal mixer (STM) to leverage the interval changes of the same nodule in sequential 3D ROIs and capture spatial dependencies of nodule regions and the current 3D ROI. According to the clinical diagnosis routine, we employ hierarchical loss to pay more attention to growing nodules. Extensive experiments on our organized dataset demonstrate the advantage of our proposed method. We also conduct experiments on an in-house dataset to evaluate the clinical utility of our method by comparing it against skilled clinicians.

Submitted 7 June, 2022; originally announced June 2022.

Comments: MICCAI 2022

arXiv:2206.01435 [pdf]

Dual-Port Dynamically Reconfigurable Battery with Semi-Controlled and Fully-Controlled Outputs

Authors: N. Tashakor, J. Kacetl, J. Fang, Z. Li, S. Goetz

Abstract: Modular multilevel converters (MMC) and cascaded H-bridge (CHB) converters are an established concept in ultra-high voltage systems. In combination with batteries, these circuits allow dynamically changing the series or parallel configuration of subportions of the battery as so-called modular battery integrated converters or reconfigurable batteries, and are being discussed for grid-storage and electromobility applications. A large body of research focuses on such circuits for supplying a single load, such as a motor for electric drives. Modularity, failure tolerance, less dependence on the weakest element of a battery pack, higher controllability, and better efficiency are the main incentives behind this pursuit. However, most studies neglect the auxiliary loads which require isolation from the high-voltage battery. This paper proposes a simple topology and controller that can fork off a second (galvanically isolated) output of a reconfigurable dc battery. The proposed system provides a nonisolated semicontrolled port for the dc link to maintain the operating point of the main inverter(s) close to optimal, while fully controlling an isolated output for the auxiliaries per the safety regulations. The proposed system does not require additional active switches for the auxiliary port and can operate with a wide range of voltages. Simulation and experiments verify the developed analysis.

Submitted 3 June, 2022; originally announced June 2022.

arXiv:2205.05675 [pdf, other]

NTIRE 2022 Challenge on Efficient Super-Resolution: Methods and Results

Authors: Yawei Li, Kai Zhang, Radu Timofte, Luc Van Gool, Fangyuan Kong, Mingxi Li, Songwei Liu, Zongcai Du, Ding Liu, Chenhui Zhou, Jingyi Chen, Qingrui Han, Zheyuan Li, Yingqi Liu, Xiangyu Chen, Haoming Cai, Yu Qiao, Chao Dong, Long Sun, Jinshan Pan, Yi Zhu, Zhikai Zong, Xiaoxiao Liu, Zheng Hui, Tao Yang , et al. (86 additional authors not shown)

Abstract: This paper reviews the NTIRE 2022 challenge on efficient single image super-resolution with a focus on the proposed solutions and results. The task of the challenge was to super-resolve an input image with a magnification factor of $\times$4 based on pairs of low and corresponding high resolution images. The aim was to design a network for single image super-resolution that improved efficiency, measured according to several metrics including runtime, parameters, FLOPs, activations, and memory consumption, while at least maintaining a PSNR of 29.00 dB on the DIV2K validation set. IMDN is set as the baseline for efficiency measurement. The challenge had 3 tracks: the main track (runtime), sub-track one (model complexity), and sub-track two (overall performance). In the main track, the practical runtime performance of the submissions was evaluated. The ranks of the teams were determined directly by the absolute value of the average runtime on the validation set and test set. In sub-track one, the number of parameters and FLOPs were considered, and the individual rankings of the two metrics were summed up to determine a final ranking in this track. In sub-track two, all five metrics mentioned in the description of the challenge (runtime, parameter count, FLOPs, activations, and memory consumption) were considered. Similar to sub-track one, the rankings of the five metrics were summed up to determine a final ranking. The challenge had 303 registered participants, and 43 teams made valid submissions. They gauge the state of the art in efficient single image super-resolution.
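The rank-sum rule described for the sub-tracks can be sketched as follows. This is a minimal illustration assuming that lower is better for every metric and that ties do not occur; the team names, metric values, and the function name `rank_sum_order` are hypothetical and not taken from the challenge report.

```python
def rank_sum_order(scores):
    """Order teams by the sum of their per-metric ranks.

    scores: {team: {metric: value}}, where a lower value is better.
    Returns (teams ordered best to worst, {team: summed rank}).
    Ties within a metric are broken by insertion order here, which is a
    simplification; the challenge's own tie handling is not specified
    in the abstract.
    """
    teams = list(scores)
    metrics = list(next(iter(scores.values())))
    totals = {team: 0 for team in teams}
    for metric in metrics:
        ordered = sorted(teams, key=lambda team: scores[team][metric])
        for rank, team in enumerate(ordered, start=1):
            totals[team] += rank
    final_order = sorted(teams, key=lambda team: totals[team])
    return final_order, totals


# Hypothetical sub-track-one style input: parameters (millions) and FLOPs (G).
example_scores = {
    "team_a": {"params": 0.2, "flops": 10.0},
    "team_b": {"params": 0.3, "flops": 20.0},
    "team_c": {"params": 0.5, "flops": 30.0},
}
final_order, rank_totals = rank_sum_order(example_scores)
```

Summing ranks rather than raw values keeps metrics with very different units (parameter counts versus FLOPs) from dominating one another.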

Submitted 11 May, 2022; originally announced May 2022.

Comments: Validation code of the baseline model is available at https://github.com/ofsoundof/IMDN. Validation of all submitted models is available at https://github.com/ofsoundof/NTIRE2022_ESR

arXiv:2204.07894 [pdf, other]

Spatial Channel Covariance Estimation and Two-Timescale Beamforming for IRS-Assisted Millimeter Wave Systems

Authors: Hongwei Wang, Jun Fang, Huiping Duan, Hongbin Li

Abstract: We consider the problem of spatial channel covariance matrix (CCM) estimation for intelligent reflecting surface (IRS)-assisted millimeter wave (mmWave) communication systems. Spatial CCM is essential for two-timescale beamforming in IRS-assisted systems; however, estimating the spatial CCM is challenging due to the passive nature of reflecting elements and the large size of the CCM resulting from massive reflecting elements of the IRS. In this paper, we propose a CCM estimation method by exploiting the low-rankness as well as the positive semi-definite (PSD) 3-level Toeplitz structure of the CCM. Estimation of the CCM is formulated as a semidefinite programming (SDP) problem and an alternating direction method of multipliers (ADMM) algorithm is developed. Our analysis shows that the proposed method is theoretically guaranteed to attain a reliable CCM estimate with a sample complexity much smaller than the dimension of the CCM. Thus the proposed method can help achieve a significant training overhead reduction. Simulation results are presented to illustrate the effectiveness of our proposed method and the performance of two-timescale beamforming scheme based on the estimated CCM.

Submitted 16 April, 2022; originally announced April 2022.

Comments: submitted to IEEE Transactions on Wireless Communications

arXiv:2203.16164 [pdf, ps, other]

Compressed Channel Estimation for IRS-Assisted Millimeter Wave OFDM Systems: A Low-Rank Tensor Decomposition-Based Approach

Authors: Xi Zheng, Peilan Wang, Jun Fang, Hongbin Li

Abstract: We consider the problem of downlink channel estimation for intelligent reflecting surface (IRS)-assisted millimeter Wave (mmWave) orthogonal frequency division multiplexing (OFDM) systems. By exploring the inherent sparse scattering characteristics of mmWave channels, we show that the received signals can be expressed as a low-rank third-order tensor that admits a tensor rank decomposition, also known as canonical polyadic decomposition (CPD). A structured CPD-based method is then developed to estimate the channel parameters. Our analysis reveals that the training overhead required by our proposed method is as low as O(U^2), where U denotes the sparsity of the cascade channel. Simulation results are provided to illustrate the efficiency of the proposed method.

Submitted 30 March, 2022; originally announced March 2022.

Comments: Accepted by IEEE Wireless Communications Letters

arXiv:2203.14033 [pdf, other]

Aggressive Quadrotor Flight Using Curiosity-Driven Reinforcement Learning

Authors: Qiyu Sun, Jinbao Fang, Wei Xing Zheng, Yang Tang

Abstract: The ability to perform aggressive movements, which are called aggressive flights, is important for quadrotors during navigation. However, aggressive quadrotor flights are still a great challenge in practical applications. The existing solutions to aggressive flights rely heavily on a predefined trajectory, which is a time-consuming preprocessing step. To avoid such path planning, we propose a curiosity-driven reinforcement learning method for aggressive flight missions, and a similarity-based curiosity module is introduced to speed up the training procedure. A branch structure exploration (BSE) strategy is also applied to guarantee the robustness of the policy and to ensure the policy trained in simulations can be performed in real-world experiments directly. The experimental results in simulations demonstrate that our reinforcement learning algorithm performs well in aggressive flight tasks, speeds up the convergence process and improves the robustness of the policy. Besides, our algorithm shows satisfactory simulation-to-real transferability and performs well in real-world experiments.

Submitted 26 March, 2022; originally announced March 2022.

arXiv:2203.13399 [pdf, ps, other]

Beam Training and Alignment for RIS-Assisted Millimeter Wave Systems: State of the Art and Beyond

Authors: Peilan Wang, Jun Fang, Weizheng Zhang, Zhi Chen, Hongbin Li, Wei Zhang

Abstract: Reconfigurable intelligent surface (RIS) has recently emerged as a promising paradigm for future cellular networks. Specifically, due to its capability in reshaping the propagation environment, RIS was introduced to address the blockage issue in millimeter Wave (mmWave) or even Terahertz (THz) communications. The deployment of RIS, however, complicates the system architecture and poses a significant challenge for beam training (BT)/ beam alignment (BA), a process that is required to establish a reliable link between the transmitter and the receiver. In this article, we first review several state-of-the-art beam training solutions for RIS-assisted mmWave systems and discuss their respective advantages and limitations. We also present a new multi-directional BT method, which can achieve a decent BA performance with only a small amount of training overhead. Finally, we outline several important open issues in BT for RIS-assisted mmWave systems.

Submitted 24 March, 2022; originally announced March 2022.

Comments: Accepted by IEEE Wireless Communications

arXiv:2202.11757 [pdf]

Degradation-Reducing Control for Dynamically Reconfigurable Batteries

Authors: Tomas Kacetl, Jan Kacetl, Jinyang Fang, Malte Jaensch, Stefan Goetz

Abstract: Cascaded circuits such as modular multilevel converters (MMC) offer attractive qualities in reconfigurable battery applications. In contrast to conventional hard-wired dc battery packs, the MMC topology loads modules with ac current, which may lead to additional ageing of batteries. As recent studies reveal, such ageing of batteries occurs at low-frequency load ripple, and almost vanishes at high frequencies. State of the art in MMC battery control focuses on state of charge and temperature balancing of individual modules. Previous methods to suppress ripple rely on slow feedback loops and low dynamics, which tends to form low-frequency patterns in the module load that negatively contribute to their ageing. This paper presents a novel module-current-oriented high-bandwidth control technique which minimizes low-frequency components in the module load spectrum. The control method respects limitations related to module data acquisition and enhances the feedback bandwidth using observation techniques. We verify the proposed method experimentally on a laboratory setup and estimate the influence on the battery cells.

Submitted 23 February, 2022; originally announced February 2022.

Comments: 9 pages, 11 figures

arXiv:2109.12275 [pdf, ps, other]

doi 10.1109/TSP.2022.3140926

A Variational Bayesian Inference-Inspired Unrolled Deep Network for MIMO Detection

Authors: Qian Wan, Jun Fang, Yinsen Huang, Huiping Duan, Hongbin Li

Abstract: The great success of deep learning (DL) has inspired researchers to develop more accurate and efficient symbol detectors for multi-input multi-output (MIMO) systems. Existing DL-based MIMO detectors, however, suffer several drawbacks. To address these issues, in this paper, we develop a model-driven DL detector based on variational Bayesian inference. Specifically, the proposed unrolled DL architecture is inspired by an inverse-free variational Bayesian learning framework which circumvents matrix inversion via maximizing a relaxed evidence lower bound. Two networks are respectively developed for independent and identically distributed (i.i.d.) Gaussian channels and arbitrarily correlated channels. The proposed networks, referred to as VBINet, have only a few learnable parameters and thus can be efficiently trained with a moderate amount of training samples. The proposed VBINet-based detectors can work in both offline and online training modes. An important advantage of our proposed networks over state-of-the-art MIMO detection networks such as OAMPNet and MMNet is that the VBINet can automatically learn the noise variance from data, thus yielding a significant performance improvement over the OAMPNet and MMNet in the presence of noise variance uncertainty. Simulation results show that the proposed VBINet-based detectors achieve competitive performance for both i.i.d. Gaussian and realistic 3GPP MIMO channels.

Submitted 11 January, 2022; v1 submitted 25 September, 2021; originally announced September 2021.

Comments: This paper has been accepted by IEEE Transactions on Signal Processing for future publication

arXiv:2108.04095 [pdf, other]

doi 10.1109/LSP.2021.3134899

Joint Active and Passive Beamforming for IRS-Assisted Radar

Authors: Fangzhou Wang, Hongbin Li, Jun Fang

Abstract: Intelligent reflecting surface (IRS) is a promising technology being considered for future wireless communications due to its ability to control signal propagation. This paper considers the joint active and passive beamforming problem for an IRS-assisted radar, where multiple IRSs are deployed to assist the surveillance of multiple targets in cluttered environments. Specifically, we aim to maximize the minimum target illumination power at multiple target locations by jointly optimizing the active beamformer at the radar transmitter and the passive phase-shift matrices at the IRSs, subject to an upper bound on the clutter power at each clutter scatterer. The resulting optimization problem is nonconvex and solved with a sequential optimization procedure along with semidefinite relaxation (SDR). Simulation results show that IRSs can help create effective line-of-sight (LOS) paths and thus substantially improve the radar robustness against target blockage.

Submitted 9 August, 2021; originally announced August 2021.

arXiv:2107.12545 [pdf, other]

Double Deep Q-learning Based Real-Time Optimization Strategy for Microgrids

Authors: Hang Shuai, Xiaomeng Ai, Jiakun Fang, Wei Yao, Jinyu Wen

Abstract: The uncertainties from distributed energy resources (DERs) bring significant challenges to the real-time operation of microgrids. In addition, due to the nonlinear constraints in the AC power flow equation and the nonlinearity of the battery storage model, etc., the optimization of the microgrid is a mixed-integer nonlinear programming (MINLP) problem. It is challenging to solve this kind of stochastic nonlinear optimization problem. To address the challenge, this paper proposes a deep reinforcement learning (DRL) based optimization strategy for the real-time operation of the microgrid. Specifically, we construct the detailed operation model for the microgrid and formulate the real-time optimization problem as a Markov Decision Process (MDP). Then, a double deep Q network (DDQN) based architecture is designed to solve the MINLP problem. The proposed approach can learn a near-optimal strategy only from the historical data. The effectiveness of the proposed algorithm is validated by the simulations on a 10-bus microgrid system and a modified IEEE 69-bus microgrid system. The numerical simulation results demonstrate that the proposed approach outperforms several existing methods.

Submitted 26 July, 2021; originally announced July 2021.

Comments: 13 pages, 14 figures. Submitted to IEEE Transactions on Systems, Man, and Cybernetics in Aug. 2019

arXiv:2105.12430 [pdf, other]

Weighing Features of Lung and Heart Regions for Thoracic Disease Classification

Authors: Jiansheng Fang, Yanwu Xu, Yitian Zhao, Yuguang Yan, Junling Liu, Jiang Liu

Abstract: Chest X-rays are the most commonly available and affordable radiological examination for screening thoracic diseases. According to the domain knowledge of screening chest X-rays, the pathological information usually lies in the lung and heart regions. However, it is costly to acquire region-level annotation in practice, and model training mainly relies on image-level class labels in a weakly supervised manner, which is highly challenging for computer-aided chest X-ray screening. To address this issue, some methods have been proposed recently to identify local regions containing pathological information, which is vital for thoracic disease classification. Inspired by this, we propose a novel deep learning framework to explore discriminative information from lung and heart regions. We design a feature extractor equipped with a multi-scale attention module to learn global attention maps from global images. To exploit disease-specific cues effectively, we locate lung and heart regions containing pathological information by a well-trained pixel-wise segmentation model to generate binarization masks. By applying an element-wise logical AND operator to the learned global attention maps and the binarization masks, we obtain local attention maps in which pixels are $1$ for lung and heart regions and $0$ for other regions. By zeroing features of non-lung and heart regions in attention maps, we can effectively exploit their disease-specific cues in lung and heart regions. Compared to existing methods fusing global and local features, we adopt feature weighting to avoid weakening visual cues unique to lung and heart regions. Evaluated by the benchmark split on the publicly available chest X-ray14 dataset, the comprehensive experiments show that our method achieves superior performance compared to the state-of-the-art methods.

Submitted 26 May, 2021; originally announced May 2021.

Comments: 17 pages, 4 figures, BMC Medical Imaging

arXiv:2105.03029 [pdf, ps, other]

Recent Advances on Sub-Nyquist Sampling-Based Wideband Spectrum Sensing

Authors: Jun Fang, Bin Wang, Hongbin Li, Ying-Chang Liang

Abstract: Cognitive radio (CR) is a promising technology enabling efficient utilization of the spectrum resource for future wireless systems. As future CR networks are envisioned to operate over a wide frequency range, advanced wideband spectrum sensing (WBSS) capable of quickly and reliably detecting idle spectrum bands across a wide frequency span is essential. In this article, we provide an overview of recent advances on sub-Nyquist sampling-based WBSS techniques, including compressed sensing-based methods and compressive covariance sensing-based methods. An elaborate discussion of the pros and cons of each approach is presented, along with some challenging issues for future research. A comparative study suggests that the compressive covariance sensing-based approach offers a more competitive solution for reliable real-time WBSS.

Submitted 6 May, 2021; originally announced May 2021.

Comments: This paper has been accepted by IEEE Wireless Communications Magazine for future publication

arXiv:2104.02301 [pdf, other]

Hyperspectral and LiDAR data classification based on linear self-attention

Authors: Min Feng, Feng Gao, Jian Fang, Junyu Dong

Abstract: An efficient linear self-attention fusion model is proposed in this paper for the task of hyperspectral image (HSI) and LiDAR data joint classification. The proposed method is comprised of a feature extraction module, an attention module, and a fusion module. The attention module is a plug-and-play linear self-attention module that can be extensively used in any model. The proposed model has achieved an overall accuracy of 95.40% on the Houston dataset. The experimental results demonstrate the superiority of the proposed method over other state-of-the-art models.

Submitted 6 April, 2021; originally announced April 2021.

Comments: Accepted for publication in the International Geoscience and Remote Sensing Symposium (IGARSS 2021)

arXiv:2103.05812 [pdf, ps, other]

doi 10.1109/TWC.2021.3115152

Fast Beam Training and Alignment for IRS-Assisted Millimeter Wave/Terahertz Systems

Authors: Peilan Wang, Jun Fang, Wei Zhang, Hongbin Li

Abstract: Intelligent reflecting surface (IRS) has emerged as a competitive solution to address blockage issues in millimeter wave (mmWave) and Terahertz (THz) communications due to its capability of reshaping wireless transmission environments. Nevertheless, obtaining the channel state information of IRS-assisted systems is quite challenging because of the passive characteristics of the IRS. In this paper, we consider the problem of beam training/alignment for IRS-assisted downlink mmWave/THz systems, where a multi-antenna base station (BS) with a hybrid structure serves a single-antenna user aided by IRS. By exploiting the inherent sparse structure of the BS-IRS-user cascade channel, the beam training problem is formulated as a joint sparse sensing and phaseless estimation problem, which involves devising a sparse sensing matrix and developing an efficient estimation algorithm to identify the best beam alignment from compressive phaseless measurements. Theoretical analysis reveals that the proposed method can identify the best alignment with only a modest amount of training overhead. Simulation results show that, for both line-of-sight (LOS) and NLOS scenarios, the proposed method obtains a significant performance improvement over existing state-of-art methods. Notably, it can achieve performance close to that of the exhaustive beam search scheme, while reducing the training overhead by 95%.

Submitted 2 October, 2021; v1 submitted 9 March, 2021; originally announced March 2021.

arXiv:2102.10260 [pdf, other]

Wireless sensor network for in situ soil moisture monitoring

Authors: Jianing Fang, Chuheng Hu, Nour Smaoui, Doug Carlson, Jayant Gupchup, Razvan Musaloiu-E., Chieh-Jan Mike Liang, Marcus Chang, Omprakash Gnawali, Tamas Budavari, Andreas Terzis, Katalin Szlavecz, Alexander S. Szalay

Abstract: We discuss the history and lessons learned from a series of deployments of environmental sensors measuring soil parameters and CO2 fluxes over the last fifteen years, in an outdoor environment. We present the hardware and software architecture of our current Gen-3 system, and then discuss how we are simplifying the user facing part of the software, to make it easier and friendlier for the environmental scientist to be in full control of the system. Finally, we describe the current effort to build a large-scale Gen-4 sensing platform consisting of hundreds of nodes to track the environmental parameters for urban green spaces in Baltimore, Maryland.

Submitted 20 February, 2021; originally announced February 2021.

Comments: 12 pages, 16 figures, Sensornets 2021 Conference

arXiv:2101.05998 [pdf, other]

doi 10.1109/TVT.2021.3109800

A Vehicles Control Model to Alleviate Traffic Instability

Authors: Jiancheng Fang, Yu Xiang, Yu Huang, Yilong Cui, Wenyong Wang

Abstract: While bringing convenience to people, the growing number of vehicles on the road already causes inevitable traffic congestion. Some traffic congestion happens for observable reasons, but other congestion occurs without apparent reasons or bottlenecks; such events, referred to as phantom jams, are caused by the traditional vehicle-following model. In order to alleviate the traffic instability caused by phantom jams, several models have been proposed with the development of intelligent transportation systems (ITS). These have been proved to be able to suppress traffic instability in the ideal situation. But in road scenarios, uncertainties of vehicle state measurements and time delays caused by on-board sensors, inter-vehicle communications and the control systems of vehicles severely affect the performance of the existing models and cannot be ignored. In this paper, a novel predictable bilateral control model (PBCM), which consists of best estimation and state prediction, is proposed to determine accurate acceleration values of the host vehicle in traffic flow to alleviate traffic instability. Theoretical analysis and simulation results show that our model can effectively reduce the influence of the measurement errors and the delay caused by the communication and control systems, and accurately control the state of the vehicles in traffic flow, thus achieving the goal of restraining the instability of traffic flow.

Submitted 15 January, 2021; originally announced January 2021.

Comments: 13 pages, 35 figures

Report number: 9863-9876

Journal ref: IEEE Transactions on Vehicular Technology ( Volume: 70, Issue: 10, Oct. 2021)

arXiv:2012.04830 [pdf, other]

Machine Learning for Cataract Classification and Grading on Ophthalmic Imaging Modalities: A Survey

Authors: Xiaoqing Zhang, Yan Hu, Zunjie Xiao, Jiansheng Fang, Risa Higashita, Jiang Liu

Abstract: Cataracts are the leading cause of visual impairment and blindness globally. Over the years, researchers have achieved significant progress in developing state-of-the-art machine learning techniques for automatic cataract classification and grading, aiming to prevent cataracts early and improve clinicians' diagnosis efficiency. This paper provides a comprehensive survey of recent advances in machine learning techniques for cataract classification/grading based on ophthalmic images. We summarize existing literature from two research directions: conventional machine learning methods and deep learning methods. This survey also provides insights into both the merits and limitations of existing works. In addition, we discuss several challenges of automatic cataract classification/grading based on machine learning techniques and present possible solutions to these challenges for future research.

Submitted 1 April, 2022; v1 submitted 8 December, 2020; originally announced December 2020.

Comments: 26 pages, 13 figures

Journal ref: Machine Intelligence Research,2022

arXiv:2012.03687 [pdf, other]

Reconfigurable Intelligent Surface Aided Constant-Envelope Wireless Power Transfer

Authors: Huiyuan Yang, Xiaojun Yuan, Jun Fang, Ying-Chang Liang

Abstract: By reconfiguring the propagation environment of electromagnetic waves artificially, reconfigurable intelligent surfaces (RISs) have been regarded as a promising and revolutionary hardware technology to improve the energy and spectrum efficiency of wireless networks. In this paper, we study a RIS aided multiuser multiple-input multiple-output (MIMO) wireless power transfer (WPT) system, where the transmitter is equipped with a constant-envelope analog beamformer. First, we maximize the total received power of the users by jointly optimizing the beamformer at transmitter and the phase-shifts at the RIS, and propose two alternating optimization based suboptimal solutions by leveraging the semidefinite relaxation (SDR) and the successive convex approximation (SCA) techniques respectively. Then, considering the user fairness, we formulate another problem to maximize the total received power subject to the users' individual minimum received power constraints. A low complexity iterative algorithm based on both alternating direction method of multipliers (ADMM) and SCA techniques is proposed to solve this problem. In the case of multiple users, we further analyze the asymptotic performance as the number of RIS elements approaches infinity, and bound the performance loss caused by RIS phase quantization. Numerical results show the correctness of the analysis results and the effectiveness of the proposed algorithms.

Submitted 7 December, 2020; originally announced December 2020.

Comments: arXiv admin note: text overlap with arXiv:2006.01696

arXiv:2010.05188 [pdf, ps, other]

doi 10.1109/TWC.2020.3030570

Joint Transceiver and Large Intelligent Surface Design for Massive MIMO MmWave Systems

Authors: Peilan Wang, Jun Fang, Linglong Dai, Hongbin Li

Abstract: Large intelligent surface (LIS) has recently emerged as a potential low-cost solution to reshape the wireless propagation environment for improving the spectral efficiency. In this paper, we consider a downlink millimeter-wave (mmWave) multiple-input-multiple-output (MIMO) system, where an LIS is deployed to assist the downlink data transmission from a base station (BS) to a user equipment (UE). Both the BS and the UE are equipped with a large number of antennas, and a hybrid analog/digital precoding/combining structure is used to reduce the hardware cost and energy consumption. We aim to maximize the spectral efficiency by jointly optimizing the LIS's reflection coefficients and the hybrid precoder (combiner) at the BS (UE). To tackle this non-convex problem, we reformulate the complex optimization problem into a much more friendly optimization problem by exploiting the inherent structure of the effective (cascade) mmWave channel. A manifold optimization (MO)-based algorithm is then developed. Simulation results show that by carefully devising LIS's reflection coefficients, our proposed method can help realize a favorable propagation environment with a small channel matrix condition number. Besides, it can achieve a performance comparable to those of state-of-the-art algorithms, while at a much lower computational complexity.

Submitted 11 October, 2020; originally announced October 2020.

Comments: This paper has been accepted by IEEE Transactions on Wireless Communications for future publication

arXiv:2010.04928 [pdf, other]

Contrastive Rendering for Ultrasound Image Segmentation

Authors: Haoming Li, Xin Yang, Jiamin Liang, Wenlong Shi, Chaoyu Chen, Haoran Dou, Rui Li, Rui Gao, Guangquan Zhou, Jinghui Fang, Xiaowen Liang, Ruobing Huang, Alejandro Frangi, Zhiyi Chen, Dong Ni

Abstract: Ultrasound (US) image segmentation has improved significantly in the deep learning era. However, the lack of sharp boundaries in US images remains an inherent challenge for segmentation. Previous methods often resort to global context, multi-scale cues or auxiliary guidance to estimate the boundaries. It is hard for these methods to approach pixel-level learning for fine-grained boundary generation. In this paper, we propose a novel and effective framework to improve boundary estimation in US images. Our work has three highlights. First, we propose to formulate the boundary estimation as a rendering task, which can recognize ambiguous points (pixels/voxels) and calibrate the boundary prediction via enriched feature representation learning. Second, we introduce point-wise contrastive learning to enhance the similarity of points from the same class and contrastively decrease the similarity of points from different classes. Boundary ambiguities are therefore further addressed. Third, both rendering and contrastive learning tasks contribute to consistent improvement while reducing network parameters. As a proof-of-concept, we performed validation experiments on a challenging dataset of 86 ovarian US volumes. Results show that our proposed method outperforms state-of-the-art methods and has the potential to be used in clinical practice.

Submitted 10 October, 2020; originally announced October 2020.

Comments: 10 pages, 5 figures, 2 tables, 13 references

arXiv:2006.01696 [pdf, other]

Reconfigurable Intelligent Surface Aided Constant-Envelope Wireless Power Transfer

Authors: Huiyuan Yang, Xiaojun Yuan, Jun Fang, Ying-Chang Liang

Abstract: By reconfiguring the propagation environment of electromagnetic waves artificially, reconfigurable intelligent surfaces (RISs) have been regarded as a promising and revolutionary hardware technology to improve the energy and spectrum efficiency of wireless networks. In this paper, we study a RIS aided multiuser multiple-input single-output (MISO) wireless power transfer (WPT) system, where the transmitter is equipped with a constant-envelope analog beamformer. We formulate a novel problem to maximize the total received power of all the users by jointly optimizing the beamformer at transmitter and the phase shifts at the RISs, subject to the individual minimum received power constraints of users. We further solve the problem iteratively with a closed-form expression for each step. Numerical results show the performance gain of deploying RIS and the effectiveness of the proposed algorithm.

Submitted 2 June, 2020; originally announced June 2020.

arXiv:2004.00226 [pdf, other]

Synthesis and Edition of Ultrasound Images via Sketch Guided Progressive Growing GANs

Authors: Jiamin Liang, Xin Yang, Haoming Li, Yi Wang, Manh The Van, Haoran Dou, Chaoyu Chen, Jinghui Fang, Xiaowen Liang, Zixin Mai, Guowen Zhu, Zhiyi Chen, Dong Ni

Abstract: Ultrasound (US) is widely accepted in clinical practice for anatomical structure inspection. However, lacking resources to practice US scanning, novices often struggle to learn the operation skills. Also, in the deep learning era, automated US image analysis is limited by the lack of annotated samples. Efficiently synthesizing realistic, editable and high resolution US images can solve these problems. The task is challenging and previous methods can only partially complete it. In this paper, we devise a new framework for US image synthesis. In particular, we first adopt a sketch generative adversarial network (Sgan) to introduce background sketch upon object mask in a conditioned generative adversarial network. With enriched sketch cues, Sgan can generate realistic US images with editable and fine-grained structure details. Although effective, Sgan struggles to generate high resolution US images. To achieve this, we further implant the Sgan into a progressive growing scheme (PGSgan). By smoothly growing both generator and discriminator, PGSgan can gradually synthesize US images from low to high resolution. By synthesizing ovary and follicle US images, our extensive perceptual evaluation, user study and segmentation results prove the promising efficacy and efficiency of the proposed PGSgan.

Submitted 1 April, 2020; originally announced April 2020.

Comments: IEEE International Symposium on Biomedical Imaging (IEEE ISBI 2020)

arXiv:2002.12648 [pdf]

doi 10.1109/JLT.2020.3037905

Fast and Accurate Optical Fiber Channel Modeling Using Generative Adversarial Network

Authors: Hang Yang, Zekun Niu, Shilin Xiao, Jiafei Fang, Zhiyang Liu, David Faninsin, Lilin Yi

Abstract: In this work, a new data-driven fiber channel modeling method, generative adversarial network (GAN) is investigated to learn the distribution of fiber channel transfer function. Our investigation focuses on joint channel effects of attenuation, chromatic dispersion, self-phase modulation (SPM), and amplified spontaneous emission (ASE) noise. To achieve the success of GAN for channel modeling, we modify the loss function, design the condition vector of input and address the mode collapse for the long-haul transmission. The effective architecture, parameters, and training skills of GAN are also displayed in the paper. The results show that the proposed method can learn the accurate transfer function of the fiber channel. The transmission distance of modeling can be up to 1000 km and can be extended to arbitrary distance theoretically. Moreover, GAN shows robust generalization abilities under different optical launch powers, modulation formats, and input signal distributions. Comparing the complexity of GAN with the split-step Fourier method (SSFM), the total multiplication number is only 2% of SSFM and the running time is less than 0.1 seconds for 1000-km transmission, versus 400 seconds using the SSFM under the same hardware and software conditions, which highlights the remarkable reduction in complexity of the fiber channel modeling.

Submitted 17 January, 2022; v1 submitted 28 February, 2020; originally announced February 2020.

Journal ref: Journal of Lightwave Technology, vol. 39, no. 5, pp. 1322-1333, 1 March 2021

Showing 1–50 of 60 results for author: Fang, J