-
Restore-RWKV: Efficient and Effective Medical Image Restoration with RWKV
Authors:
Zhiwen Yang,
Hui Zhang,
Dan Zhao,
Bingzheng Wei,
Yan Xu
Abstract:
Transformers have revolutionized medical image restoration, but the quadratic complexity still poses limitations for their application to high-resolution medical images. The recent advent of RWKV in the NLP field has attracted much attention as it can process long sequences efficiently. To leverage its advanced design, we propose Restore-RWKV, the first RWKV-based model for medical image restorati…
▽ More
Transformers have revolutionized medical image restoration, but the quadratic complexity still poses limitations for their application to high-resolution medical images. The recent advent of RWKV in the NLP field has attracted much attention as it can process long sequences efficiently. To leverage its advanced design, we propose Restore-RWKV, the first RWKV-based model for medical image restoration. Since the original RWKV model is designed for 1D sequences, we make two necessary modifications for modeling spatial relations in 2D images. First, we present a recurrent WKV (Re-WKV) attention mechanism that captures global dependencies with linear computational complexity. Re-WKV incorporates bidirectional attention as basic for a global receptive field and recurrent attention to effectively model 2D dependencies from various scan directions. Second, we develop an omnidirectional token shift (Omni-Shift) layer that enhances local dependencies by shifting tokens from all directions and across a wide context range. These adaptations make the proposed Restore-RWKV an efficient and effective model for medical image restoration. Extensive experiments demonstrate that Restore-RWKV achieves superior performance across various medical image restoration tasks, including MRI image super-resolution, CT image denoising, PET image synthesis, and all-in-one medical image restoration. Code is available at: \href{https://github.com/Yaziwel/Restore-RWKV.git}{https://github.com/Yaziwel/Restore-RWKV}.
△ Less
Submitted 31 July, 2024; v1 submitted 14 July, 2024;
originally announced July 2024.
-
Region Attention Transformer for Medical Image Restoration
Authors:
Zhiwen Yang,
Haowei Chen,
Ziniu Qian,
Yang Zhou,
Hui Zhang,
Dan Zhao,
Bingzheng Wei,
Yan Xu
Abstract:
Transformer-based methods have demonstrated impressive results in medical image restoration, attributed to the multi-head self-attention (MSA) mechanism in the spatial dimension. However, the majority of existing Transformers conduct attention within fixed and coarsely partitioned regions (\text{e.g.} the entire image or fixed patches), resulting in interference from irrelevant regions and fragmen…
▽ More
Transformer-based methods have demonstrated impressive results in medical image restoration, attributed to the multi-head self-attention (MSA) mechanism in the spatial dimension. However, the majority of existing Transformers conduct attention within fixed and coarsely partitioned regions (\text{e.g.} the entire image or fixed patches), resulting in interference from irrelevant regions and fragmentation of continuous image content. To overcome these challenges, we introduce a novel Region Attention Transformer (RAT) that utilizes a region-based multi-head self-attention mechanism (R-MSA). The R-MSA dynamically partitions the input image into non-overlapping semantic regions using the robust Segment Anything Model (SAM) and then performs self-attention within these regions. This region partitioning is more flexible and interpretable, ensuring that only pixels from similar semantic regions complement each other, thereby eliminating interference from irrelevant regions. Moreover, we introduce a focal region loss to guide our model to adaptively focus on recovering high-difficulty regions. Extensive experiments demonstrate the effectiveness of RAT in various medical image restoration tasks, including PET image synthesis, CT image denoising, and pathological image super-resolution. Code is available at \href{https://github.com/Yaziwel/Region-Attention-Transformer-for-Medical-Image-Restoration.git}{https://github.com/RAT}.
△ Less
Submitted 12 July, 2024;
originally announced July 2024.
-
Small Distance Increment Method for Measuring Complex Permittivity With mmWave Radar
Authors:
Hang Song,
Hyun Joon Kim,
Mingxia Wan,
Bo Wei,
Takamaro Kikkawa,
Jun-ichi Takada
Abstract:
Measuring the complex permittivity of material is essential in many scenarios such as quality check and component analysis. Generally, measurement methods for characterizing the material are based on the usage of vector network analyzer, which is large and not easy for on-site measurement, especially in high frequency range such as millimeter wave (mmWave). In addition, some measurement methods re…
▽ More
Measuring the complex permittivity of material is essential in many scenarios such as quality check and component analysis. Generally, measurement methods for characterizing the material are based on the usage of vector network analyzer, which is large and not easy for on-site measurement, especially in high frequency range such as millimeter wave (mmWave). In addition, some measurement methods require the destruction of samples, which is not suitable for non-destructive inspection. In this work, a small distance increment (SDI) method is proposed to non-destructively measure the complex permittivity of material. In SDI, the transmitter and receiver are formed as the monostatic radar, which is facing towards the material under test (MUT). During the measurement, the distance between radar and MUT changes with small increments and the signals are recorded at each position. A mathematical model is formulated to depict the relationship among the complex permittivity, distance increment, and measured signals. By fitting the model, the complex permittivity of MUT is estimated. To implement and evaluate the proposed SDI method, a commercial off-the-shelf mmWave radar is utilized and the measurement system is developed. Then, the evaluation was carried out on the acrylic plate. With the proposed method, the estimated complex permittivity of acrylic plate shows good agreement with the literature values, demonstrating the efficacy of SDI method for characterizing the complex permittivity of material.
△ Less
Submitted 19 March, 2024;
originally announced March 2024.
-
DPAFNet:Dual Path Attention Fusion Network for Single Image Deraining
Authors:
Bingcai Wei
Abstract:
Rainy weather will have a significant impact on the regular operation of the imaging system. Based on this premise, image rain removal has always been a popular branch of low-level visual tasks, especially methods using deep neural networks. However, most neural networks are but-branched, such as only using convolutional neural networks or Transformers, which is unfavourable for the multidimension…
▽ More
Rainy weather will have a significant impact on the regular operation of the imaging system. Based on this premise, image rain removal has always been a popular branch of low-level visual tasks, especially methods using deep neural networks. However, most neural networks are but-branched, such as only using convolutional neural networks or Transformers, which is unfavourable for the multidimensional fusion of image features. In order to solve this problem, this paper proposes a dual-branch attention fusion network. Firstly, a two-branch network structure is proposed. Secondly, an attention fusion module is proposed to selectively fuse the features extracted by the two branches rather than simply adding them. Finally, complete ablation experiments and sufficient comparison experiments prove the rationality and effectiveness of the proposed method.
△ Less
Submitted 16 January, 2024;
originally announced January 2024.
-
Dynamical State Feedback Control for Linear Input Delay Systems, Part I: Dissipative Stabilization via Semidefinite Programming
Authors:
Qian Feng,
Cong Zhang,
Bo Wei
Abstract:
It is well known that predictor controllers can completely eliminate the destabilizing effects of input delays. However, their design is typically based on direct constructions that leave little room for incorporating closed-loop performance objectives. To address this issue, we introduce the concept of parameterized linear dynamical state feedbacks (LDSFs) that can achieve both input delay compen…
▽ More
It is well known that predictor controllers can completely eliminate the destabilizing effects of input delays. However, their design is typically based on direct constructions that leave little room for incorporating closed-loop performance objectives. To address this issue, we introduce the concept of parameterized linear dynamical state feedbacks (LDSFs) that can achieve both input delay compensation and stabilization for linear input delay systems with dissipative constraints. This control construct draws inspiration from recent developments in the mathematical treatment of distributed delays, and generalizes conventional predictor controllers, where the degree of parameterization can be increased by adjusting the integral term. A sufficient condition for the existence of the LDSF is formulated as matrix inequalities by constructing a complete type Krasovskii functional. To solve the bilinear matrix inequality in the synthesis condition, we employ an inner convex approximation algorithm that can be initialized using the gains of a predictor controller obtained via explicit construction. Unlike traditional predictor controllers, the parameters of our LTDS can be directly tuned via the proposed optimization framework. Numerical examples and simulation have been experimented to demonstrate the validity and effectiveness of our methodology.
△ Less
Submitted 25 November, 2023;
originally announced November 2023.
-
Overview of Human Activity Recognition Using Sensor Data
Authors:
Rebeen Ali Hamad,
Wai Lok Woo,
Bo Wei,
Longzhi Yang
Abstract:
Human activity recognition (HAR) is an essential research field that has been used in different applications including home and workplace automation, security and surveillance as well as healthcare. Starting from conventional machine learning methods to the recently developing deep learning techniques and the Internet of things, significant contributions have been shown in the HAR area in the last…
▽ More
Human activity recognition (HAR) is an essential research field that has been used in different applications including home and workplace automation, security and surveillance as well as healthcare. Starting from conventional machine learning methods to the recently developing deep learning techniques and the Internet of things, significant contributions have been shown in the HAR area in the last decade. Even though several review and survey studies have been published, there is a lack of sensor-based HAR overview studies focusing on summarising the usage of wearable sensors and smart home sensors data as well as applications of HAR and deep learning techniques. Hence, we overview sensor-based HAR, discuss several important applications that rely on HAR, and highlight the most common machine learning methods that have been used for HAR. Finally, several challenges of HAR are explored that should be addressed to further improve the robustness of HAR.
△ Less
Submitted 12 September, 2023;
originally announced September 2023.
-
DRMC: A Generalist Model with Dynamic Routing for Multi-Center PET Image Synthesis
Authors:
Zhiwen Yang,
Yang Zhou,
Hui Zhang,
Bingzheng Wei,
Yubo Fan,
Yan Xu
Abstract:
Multi-center positron emission tomography (PET) image synthesis aims at recovering low-dose PET images from multiple different centers. The generalizability of existing methods can still be suboptimal for a multi-center study due to domain shifts, which result from non-identical data distribution among centers with different imaging systems/protocols. While some approaches address domain shifts by…
▽ More
Multi-center positron emission tomography (PET) image synthesis aims at recovering low-dose PET images from multiple different centers. The generalizability of existing methods can still be suboptimal for a multi-center study due to domain shifts, which result from non-identical data distribution among centers with different imaging systems/protocols. While some approaches address domain shifts by training specialized models for each center, they are parameter inefficient and do not well exploit the shared knowledge across centers. To address this, we develop a generalist model that shares architecture and parameters across centers to utilize the shared knowledge. However, the generalist model can suffer from the center interference issue, \textit{i.e.} the gradient directions of different centers can be inconsistent or even opposite owing to the non-identical data distribution. To mitigate such interference, we introduce a novel dynamic routing strategy with cross-layer connections that routes data from different centers to different experts. Experiments show that our generalist model with dynamic routing (DRMC) exhibits excellent generalizability across centers. Code and data are available at: https://github.com/Yaziwel/Multi-Center-PET-Image-Synthesis.
△ Less
Submitted 11 July, 2023;
originally announced July 2023.
-
Cyclic Learning: Bridging Image-level Labels and Nuclei Instance Segmentation
Authors:
Yang Zhou,
Yongjian Wu,
Zihua Wang,
Bingzheng Wei,
Maode Lai,
Jianzhong Shou,
Yubo Fan,
Yan Xu
Abstract:
Nuclei instance segmentation on histopathology images is of great clinical value for disease analysis. Generally, fully-supervised algorithms for this task require pixel-wise manual annotations, which is especially time-consuming and laborious for the high nuclei density. To alleviate the annotation burden, we seek to solve the problem through image-level weakly supervised learning, which is under…
▽ More
Nuclei instance segmentation on histopathology images is of great clinical value for disease analysis. Generally, fully-supervised algorithms for this task require pixel-wise manual annotations, which is especially time-consuming and laborious for the high nuclei density. To alleviate the annotation burden, we seek to solve the problem through image-level weakly supervised learning, which is underexplored for nuclei instance segmentation. Compared with most existing methods using other weak annotations (scribble, point, etc.) for nuclei instance segmentation, our method is more labor-saving. The obstacle to using image-level annotations in nuclei instance segmentation is the lack of adequate location information, leading to severe nuclei omission or overlaps. In this paper, we propose a novel image-level weakly supervised method, called cyclic learning, to solve this problem. Cyclic learning comprises a front-end classification task and a back-end semi-supervised instance segmentation task to benefit from multi-task learning (MTL). We utilize a deep learning classifier with interpretability as the front-end to convert image-level labels to sets of high-confidence pseudo masks and establish a semi-supervised architecture as the back-end to conduct nuclei instance segmentation under the supervision of these pseudo masks. Most importantly, cyclic learning is designed to circularly share knowledge between the front-end classifier and the back-end semi-supervised part, which allows the whole system to fully extract the underlying information from image-level labels and converge to a better optimum. Experiments on three datasets demonstrate the good generality of our method, which outperforms other image-level weakly supervised methods for nuclei instance segmentation, and achieves comparable performance to fully-supervised methods.
△ Less
Submitted 5 June, 2023;
originally announced June 2023.
-
IDO-VFI: Identifying Dynamics via Optical Flow Guidance for Video Frame Interpolation with Events
Authors:
Chenyang Shi,
Hanxiao Liu,
Jing Jin,
Wenzhuo Li,
Yuzhen Li,
Boyi Wei,
Yibo Zhang
Abstract:
Video frame interpolation aims to generate high-quality intermediate frames from boundary frames and increase frame rate. While existing linear, symmetric and nonlinear models are used to bridge the gap from the lack of inter-frame motion, they cannot reconstruct real motions. Event cameras, however, are ideal for capturing inter-frame dynamics with their extremely high temporal resolution. In thi…
▽ More
Video frame interpolation aims to generate high-quality intermediate frames from boundary frames and increase frame rate. While existing linear, symmetric and nonlinear models are used to bridge the gap from the lack of inter-frame motion, they cannot reconstruct real motions. Event cameras, however, are ideal for capturing inter-frame dynamics with their extremely high temporal resolution. In this paper, we propose an event-and-frame-based video frame interpolation method named IDO-VFI that assigns varying amounts of computation for different sub-regions via optical flow guidance. The proposed method first estimates the optical flow based on frames and events, and then decides whether to further calculate the residual optical flow in those sub-regions via a Gumbel gating module according to the optical flow amplitude. Intermediate frames are eventually generated through a concise Transformer-based fusion network. Our proposed method maintains high-quality performance while reducing computation time and computational effort by 10% and 17% respectively on Vimeo90K datasets, compared with a unified process on the whole region. Moreover, our method outperforms state-of-the-art frame-only and frames-plus-events methods on multiple video frame interpolation benchmarks. Codes and models are available at https://github.com/shicy17/IDO-VFI.
△ Less
Submitted 18 May, 2023; v1 submitted 17 May, 2023;
originally announced May 2023.
-
Estimating Continuous Muscle Fatigue For Multi-Muscle Coordinated Exercise: A Pilot Study
Authors:
Chunzhi Yi,
Baichun Wei,
Wei Jin,
Jianfei Zhu,
Seungmin Rho,
Zhiyuan Chen,
Feng Jiang
Abstract:
Assessing the progression of muscle fatigue for daily exercises provides vital indicators for precise rehabilitation, personalized training dose, especially under the context of Metaverse. Assessing fatigue of multi-muscle coordination-involved daily exercises requires the neuromuscular features that represent the fatigue-induced characteristics of spatiotemporal adaptions of multiple muscles and…
▽ More
Assessing the progression of muscle fatigue for daily exercises provides vital indicators for precise rehabilitation, personalized training dose, especially under the context of Metaverse. Assessing fatigue of multi-muscle coordination-involved daily exercises requires the neuromuscular features that represent the fatigue-induced characteristics of spatiotemporal adaptions of multiple muscles and the estimator that captures the time-evolving progression of fatigue. In this paper, we propose to depict fatigue by the features of muscle compensation and spinal module activation changes and estimate continuous fatigue by a physiological rationale model. First, we extract muscle synergy fractionation and the variance of spinal module spikings as features inspired by the prior of fatigue-induced neuromuscular adaptations. Second, we treat the features as observations and develop a Bayesian Gaussian process to capture the time-evolving progression. Third, we solve the issue of lacking supervision information by mathematically formulating the time-evolving characteristics of fatigue as the loss function. Finally, we adapt the metrics that follow the physiological principles of fatigue to quantitatively evaluate the performance. Our extensive experiments present a 0.99 similarity between days, a over 0.7 similarity with other views of fatigue and a nearly 1 weak monotonicity, which outperform other methods. This study would aim the objective assessment of muscle fatigue.
△ Less
Submitted 29 March, 2023;
originally announced March 2023.
-
QuDASH: Quantum-inspired rate adaptation approach for DASH video streaming
Authors:
Bo Wei,
Hang Song,
Makoto Nakamura,
Koichi Kimura,
Nozomu Togawa,
Jiro Katto
Abstract:
Internet traffic is dramatically increasing with the development of network technologies and video streaming traffic accounts for large amount within the total traffic, which reveals the importance to guarantee the quality of content delivery service. Based on the network conditions, adaptive bitrate (ABR) control is utilized as a common technique which can choose the proper bitrate to ensure the…
▽ More
Internet traffic is dramatically increasing with the development of network technologies and video streaming traffic accounts for large amount within the total traffic, which reveals the importance to guarantee the quality of content delivery service. Based on the network conditions, adaptive bitrate (ABR) control is utilized as a common technique which can choose the proper bitrate to ensure the video streaming quality. In this paper, new bitrate control method, QuDASH is proposed by taking advantage of the emerging quantum technology. In QuDASH, the adaptive control model is developed using the quadratic unconstrained binary optimization (QUBO), which aims at increasing the average bitrate and decreasing the video rebuffering events to maximize the user quality of experience (QoE). In order to formulate the video control model, first the QUBO terms of different factors are defined regarding video quality, bitrate change, and buffer condition. Then, all the individual QUBO terms are merged to generate an objective function. By minimizing the QUBO objective function, the bitrate choice is determined from the solution. The control model is solved by Digital Annealer, which is a quantum-inspired computing technology. The evaluation of the proposed method is carried out by simulation with the throughput traces obtained in real world under different scenarios and the comparison with other methods is conducted. Experiment results demonstrated that the proposed QuDASH method has better performance in terms of QoE compared with other advanced ABR methods. In 68.2% of the examined cases, QuDASH achieves the highest QoE results, which shows the superiority of the QuDASH over conventional methods.
△ Less
Submitted 21 October, 2023; v1 submitted 19 June, 2022;
originally announced June 2022.
-
Identification of cancer-keeping genes as therapeutic targets by finding network control hubs
Authors:
Xizhe Zhang,
Chunyu Pan,
Xinru Wei,
Meng Yu,
Shuangjie Liu,
Jun An,
Jieping Yang,
Baojun Wei,
Wenjun Hao,
Yang Yao,
Yuyan Zhu,
Weixiong Zhang
Abstract:
Finding cancer driver genes has been a focal theme of cancer research and clinical studies. One of the recent approaches is based on network structural controllability that focuses on finding a control scheme and driver genes that can steer the cell from an arbitrary state to a designated state. While theoretically sound, this approach is impractical for many reasons, e.g., the control scheme is o…
▽ More
Finding cancer driver genes has been a focal theme of cancer research and clinical studies. One of the recent approaches is based on network structural controllability that focuses on finding a control scheme and driver genes that can steer the cell from an arbitrary state to a designated state. While theoretically sound, this approach is impractical for many reasons, e.g., the control scheme is often not unique and half of the nodes may be driver genes for the cell. We developed a novel approach that transcends structural controllability. Instead of considering driver genes for one control scheme, we considered control hub genes that reside in the middle of a control path of every control scheme. Control hubs are the most vulnerable spots for controlling the cell and exogenous stimuli on them may render the cell uncontrollable. We adopted control hubs as cancer-keep genes (CKGs) and applied them to a gene regulatory network of bladder cancer (BLCA). All the genes on the cell cycle and p53 singling pathways in BLCA are CKGs, confirming the importance of these genes and the two pathways in cancer. A smaller set of 35 sensitive CKGs (sCKGs) for BLCA was identified by removing network links. Six sCKGs (RPS6KA3, FGFR3, N-cadherin (CDH2), EP300, caspase-1, and FN1) were subjected to small-interferencing-RNA knockdown in four cell lines to validate their effects on the proliferation or migration of cancer cells. Knocking down RPS6KA3 in a mouse model of BLCA significantly inhibited the growth of tumor xenografts in the mouse model. Combined, our results demonstrated the value of CKGs as therapeutic targets for cancer therapy and the potential of CKGs as an effective means for studying and characterizing cancer etiology.
△ Less
Submitted 13 June, 2022;
originally announced June 2022.
-
Sparse dynamical system identification with simultaneous structural parameters and initial condition estimation
Authors:
Baolei Wei
Abstract:
Sparse Identification of Nonlinear Dynamics (SINDy) has been shown to successfully recover governing equations from data; however, this approach assumes the initial condition to be exactly known in advance and is sensitive to noise. In this work we propose an integral SINDy (ISINDy) method to simultaneously identify model structure and parameters of nonlinear ordinary differential equations (ODEs)…
▽ More
Sparse Identification of Nonlinear Dynamics (SINDy) has been shown to successfully recover governing equations from data; however, this approach assumes the initial condition to be exactly known in advance and is sensitive to noise. In this work we propose an integral SINDy (ISINDy) method to simultaneously identify model structure and parameters of nonlinear ordinary differential equations (ODEs) from noisy time-series observations. First, the states are estimated via penalized spline smoothing and then substituted into the integral-form numerical discretization solver, leading to a pseudo-linear regression. The sequential threshold least squares is performed to extract the fewest active terms from the overdetermined set of candidate features, thereby estimating structural parameters and initial condition simultaneously and meanwhile, making the identified dynamics parsimonious and interpretable. Simulations detail the method's recovery accuracy and robustness to noise. Examples include a logistic equation, Lokta-Volterra system, and Lorenz system.
△ Less
Submitted 31 October, 2022; v1 submitted 21 April, 2022;
originally announced April 2022.
-
Dissipative stabilization of linear input delay systems via dynamical state feedback controllers: an optimization based approach
Authors:
Qian Feng,
Bo Wei
Abstract:
In this note, we present an effective solution to the stabilization of linear input delay systems subject to dissipative constraints while all the effect of input delay is compensated by a controller with novel structure. The method is inspired by the recent development in the mathematical treatment of distributed delays and predictor controllers, which are critical for the derivation of the solut…
▽ More
In this note, we present an effective solution to the stabilization of linear input delay systems subject to dissipative constraints while all the effect of input delay is compensated by a controller with novel structure. The method is inspired by the recent development in the mathematical treatment of distributed delays and predictor controllers, which are critical for the derivation of the solution. An important conceptual innovation is the use of a parameterized dynamical state feedback controller (DSFC), where the dimension of the controller equals the dimension of the control input. A sufficient condition for the existence of a dissipative DSFC is obtained via the Krasovskii functional approach, where the condition includes a bilinear matrix inequality (BMI). To solve the BMI, we apply an inner convex approximation algorithm which can be initialized based on an explicit construction of a predictor controller gain. The proposed DSFC can be considered as an extension of the classical predictor controller, thereby capable of compensating all the effects of the pointwise input delay while satisfying dissipative constraints. A numerical example is given to illustrate the effectiveness of our proposed methodology.
△ Less
Submitted 16 August, 2022; v1 submitted 20 April, 2022;
originally announced April 2022.
-
RSSI-CSI Measurement and Variation Mitigation with Commodity WiFi Device
Authors:
Bo Wei,
Hang Song,
Jiro Katto,
Takamaro Kikkawa
Abstract:
Owing to the plentiful information released by the commodity devices, WiFi signals have been widely studied for various wireless sensing applications. In many works, both received signal strength indicator (RSSI) and the channel state information (CSI) are utilized as the key factors for precise sensing. However, the calculation and relationship between RSSI and CSI is not explained in detail. Fur…
▽ More
Owing to the plentiful information released by the commodity devices, WiFi signals have been widely studied for various wireless sensing applications. In many works, both received signal strength indicator (RSSI) and the channel state information (CSI) are utilized as the key factors for precise sensing. However, the calculation and relationship between RSSI and CSI is not explained in detail. Furthermore, there are few works focusing on the measurement variation of the WiFi signal which impacts the sensing results. In this paper, the relationship between RSSI and CSI is studied in detail and the measurement variation of amplitude and phase information is investigated by extensive experiments. In the experiments, transmitter and receiver are directly connected by power divider and RF cables and the signal transmission is quantitatively controlled by RF attenuators. By changing the intensity of attenuation, the measurement of RSSI and CSI is carried out under different conditions. From the results, it is found that in order to get a reliable measurement of the signal amplitude and phase by commodity WiFi, the attenuation of the channels should not exceed 60 dB. Meanwhile, the difference between two channels should be lower than 10 dB. An active control mechanism is suggested to ensure the measurement stability. The findings and criteria of this work is promising to facilitate more precise sensing technologies with WiFi signal.
△ Less
Submitted 24 March, 2022;
originally announced March 2022.
-
Adaptive video transmission using QUBO method and Digital Annealer based on Ising machine
Authors:
Bo Wei,
Hang Song,
Jiro Katto
Abstract:
With the dramatically increasing video streaming in the total network traffic, it is critical to develop effective algorithms to promote the content delivery service of high quality. Adaptive bitrate (ABR) control is the most essential technique which determines the proper bitrate to be chosen based on network conditions, thus realize high-quality video streaming. In this paper, a novel ABR strate…
▽ More
With the dramatically increasing video streaming in the total network traffic, it is critical to develop effective algorithms to promote the content delivery service of high quality. Adaptive bitrate (ABR) control is the most essential technique which determines the proper bitrate to be chosen based on network conditions, thus realize high-quality video streaming. In this paper, a novel ABR strategy is proposed based on Ising machine by using the quadratic unconstrained binary optimization (QUBO) method and Digital Annealer (DA) for the first time. The proposed method is evaluated by simulation with the real-world measured throughput, and compared with other state-of-the-art methods. Experiment results show that the proposed QUBO-based method can outperform the existing methods, which demonstrating the superior of the proposed QUBO-based method.
△ Less
Submitted 25 September, 2021;
originally announced September 2021.
-
Distributed strategy-updating rules for aggregative games of multi-integrator systems with coupled constraints
Authors:
Xin Cai,
Feng Xiao,
Bo Wei
Abstract:
In this paper, we explore aggregative games over networks of multi-integrator agents with coupled constraints. To reach the general Nash equilibrium of an aggregative game, a distributed strategy-updating rule is proposed by a combination of the coordination of Lagrange multipliers and the estimation of the aggregator. Each player has only access to partial-decision information and communicates wi…
▽ More
In this paper, we explore aggregative games over networks of multi-integrator agents with coupled constraints. To reach the general Nash equilibrium of an aggregative game, a distributed strategy-updating rule is proposed by a combination of the coordination of Lagrange multipliers and the estimation of the aggregator. Each player has only access to partial-decision information and communicates with his neighbors in a weight-balanced digraph which characterizes players' preferences as to the values of information received from neighbors. We first consider networks of double-integrator agents and then focus on multi-integrator agents. The effectiveness of the proposed strategy-updating rules is demonstrated by analyzing the convergence of corresponding dynamical systems via the Lyapunov stability theory, singular perturbation theory and passive theory. Numerical examples are given to illustrate our results.
△ Less
Submitted 20 June, 2021;
originally announced June 2021.
-
NLHD: A Pixel-Level Non-Local Retinex Model for Low-Light Image Enhancement
Authors:
Hao Hou,
Yingkun Hou,
Yuxuan Shi,
Benzheng Wei,
Jun Xu
Abstract:
Retinex model has been applied to low-light image enhancement in many existing methods. More appropriate decomposition of a low-light image can help achieve better image enhancement. In this paper, we propose a new pixel-level non-local Haar transform based illumination and reflectance decomposition method (NLHD). The unique low-frequency coefficient of Haar transform on each similar pixel group i…
▽ More
Retinex model has been applied to low-light image enhancement in many existing methods. More appropriate decomposition of a low-light image can help achieve better image enhancement. In this paper, we propose a new pixel-level non-local Haar transform based illumination and reflectance decomposition method (NLHD). The unique low-frequency coefficient of Haar transform on each similar pixel group is used to reconstruct the illumination component, and the rest of all high-frequency coefficients are employed to reconstruct the reflectance component. The complete similarity of pixels in a matched similar pixel group and the simple separable Haar transform help to obtain more appropriate image decomposition; thus, the image is hardly sharpened in the image brightness enhancement procedure. The exponential transform and logarithmic transform are respectively implemented on the illumination component. Then a minimum fusion strategy on the results of these two transforms is utilized to achieve more natural illumination component enhancement. It can alleviate the mosaic artifacts produced in the darker regions by the exponential transform with a gamma value less than 1 and reduce information loss caused by excessive enhancement of the brighter regions due to the logarithmic transform. Finally, the Retinex model is applied to the enhanced illumination and reflectance to achieve image enhancement. We also develop a local noise level estimation based noise suppression method and a non-local saturation reduction based color deviation correction method. These two methods can respectively attenuate noise or color deviation usually presented in the enhanced results of the extremely dark low-light images. Experiments on benchmark datasets show that the proposed method can achieve better low-light image enhancement results on subjective and objective evaluations than most existing methods.
△ Less
Submitted 15 June, 2021; v1 submitted 13 June, 2021;
originally announced June 2021.
-
Detecting and Correcting IMU Movements During Joint Angle Estimation
Authors:
Chunzhi Yi,
Feng Jiang,
Baichun Wei,
Chifu Yang,
Zhen Ding,
Jubo Jin,
Jie Liu
Abstract:
Inertial measurement units (IMUs) increasingly function as a basic component of wearable sensor network (WSN)systems. IMU-based joint angle estimation (JAE) is a relatively typical usage of IMUs, with extensive applications. However, the issue that IMUs move with respect to their original placement during JAE is still a research gap, and limits the robustness of deploying the technique in real-wor…
▽ More
Inertial measurement units (IMUs) increasingly function as a basic component of wearable sensor network (WSN)systems. IMU-based joint angle estimation (JAE) is a relatively typical usage of IMUs, with extensive applications. However, the issue that IMUs move with respect to their original placement during JAE is still a research gap, and limits the robustness of deploying the technique in real-world application scenarios. In this study, we propose to detect and correct the IMU movement online in a relatively computationally lightweight manner. Particularly, we first experimentally investigate the influence of IMU movements. Second, we design the metrics for detecting IMU movements by mathematically formulating how the IMU movement affects the IMU measurements. Third, we determine the optimal thresholds of metrics by synthetic IMU data from a significantly amended simulation model. Finally, a correction method is proposed to correct the effects of IMU movements. We demonstrate our method on both synthetic data and real-user data. The results demonstrate our method is a promising solution to detecting and correcting IMU movements during JAE.
△ Less
Submitted 9 June, 2021;
originally announced June 2021.
-
Continuous Prediction of Lower-Limb Kinematics From Multi-Modal Biomedical Signals
Authors:
Chunzhi Yi,
Feng Jiang,
Shengping Zhang,
Hao Guo,
Chifu Yang,
Zhen Ding,
Baichun Wei,
Xiangyuan Lan,
Huiyu Zhou
Abstract:
The fast-growing techniques of measuring and fusing multi-modal biomedical signals enable advanced motor intent decoding schemes of lowerlimb exoskeletons, meeting the increasing demand for rehabilitative or assistive applications of take-home healthcare. Challenges of exoskeletons motor intent decoding schemes remain in making a continuous prediction to compensate for the hysteretic response caus…
▽ More
The fast-growing techniques of measuring and fusing multi-modal biomedical signals enable advanced motor intent decoding schemes of lowerlimb exoskeletons, meeting the increasing demand for rehabilitative or assistive applications of take-home healthcare. Challenges of exoskeletons motor intent decoding schemes remain in making a continuous prediction to compensate for the hysteretic response caused by mechanical transmission. In this paper, we solve this problem by proposing an ahead of time continuous prediction of lower limb kinematics, with the prediction of knee angles during level walking as a case study. Firstly, an end-to-end kinematics prediction network(KinPreNet), consisting of a feature extractor and an angle predictor, is proposed and experimentally compared with features and methods traditionally used in ahead-of-time prediction of gait phases. Secondly, inspired by the electromechanical delay(EMD), we further explore our algorithm's capability of compensating response delay of mechanical transmission by validating the performance of the different sections of prediction time. And we experimentally reveal the time boundary of compensating the hysteretic response. Thirdly, a comparison of employing EMG signals or not is performed to reveal the EMG and kinematic signals collaborated contributions to the continuous prediction. During the experiments, EMG signals of nine muscles and knee angles calculated from inertial measurement unit (IMU) signals are recorded from ten healthy subjects. To the best of our knowledge, this is the first study of continuously predicting lower-limb kinematics in an ahead-of-time manner based on the electromechanical delay (EMD).
△ Less
Submitted 22 March, 2021;
originally announced March 2021.
-
FeatherTTS: Robust and Efficient attention based Neural TTS
Authors:
Qiao Tian,
Zewang Zhang,
Chao Liu,
Heng Lu,
Linghui Chen,
Bin Wei,
Pujiang He,
Shan Liu
Abstract:
Attention based neural TTS is elegant speech synthesis pipeline and has shown a powerful ability to generate natural speech. However, it is still not robust enough to meet the stability requirements for industrial products. Besides, it suffers from slow inference speed owning to the autoregressive generation process. In this work, we propose FeatherTTS, a robust and efficient attention-based neura…
▽ More
Attention based neural TTS is elegant speech synthesis pipeline and has shown a powerful ability to generate natural speech. However, it is still not robust enough to meet the stability requirements for industrial products. Besides, it suffers from slow inference speed owning to the autoregressive generation process. In this work, we propose FeatherTTS, a robust and efficient attention-based neural TTS system. Firstly, we propose a novel Gaussian attention which utilizes interpretability of Gaussian attention and the strict monotonic property in TTS. By this method, we replace the commonly used stop token prediction architecture with attentive stop prediction. Secondly, we apply block sparsity on the autoregressive decoder to speed up speech synthesis. The experimental results show that our proposed FeatherTTS not only nearly eliminates the problem of word skipping, repeating in particularly hard texts and keep the naturalness of generated speech, but also speeds up acoustic feature generation by 3.5 times over Tacotron. Overall, the proposed FeatherTTS can be $35$x faster than real-time on a single CPU.
△ Less
Submitted 2 November, 2020;
originally announced November 2020.
-
A Novel Mobility Model to Support the Routing of Mobile Energy Resources
Authors:
Wei Wang,
Xiaofu Xiong,
Chao Xiao,
Bihui Wei
Abstract:
Mobile energy resources (MERs) have received increasing attention due to their effectiveness in boosting the power system resilience in a flexible way. In this paper, a novel mobility model for MERs is proposed, which can support the routing of MERs to provide various services for the power system. Two key points, the state transitions and travel time of MERs, are formulated by linear constraints.…
▽ More
Mobile energy resources (MERs) have received increasing attention due to their effectiveness in boosting the power system resilience in a flexible way. In this paper, a novel mobility model for MERs is proposed, which can support the routing of MERs to provide various services for the power system. Two key points, the state transitions and travel time of MERs, are formulated by linear constraints. The feasibility of the proposed model, especially its advantages in model size and computational efficiency for the routing of MERs among many nodes with a small time span, is demonstrated by a series of tests.
△ Less
Submitted 18 March, 2022; v1 submitted 21 July, 2020;
originally announced July 2020.
-
Unifying Neural Learning and Symbolic Reasoning for Spinal Medical Report Generation
Authors:
Zhongyi Han,
Benzheng Wei,
Yilong Yin,
Shuo Li
Abstract:
Automated medical report generation in spine radiology, i.e., given spinal medical images and directly create radiologist-level diagnosis reports to support clinical decision making, is a novel yet fundamental study in the domain of artificial intelligence in healthcare. However, it is incredibly challenging because it is an extremely complicated task that involves visual perception and high-level…
▽ More
Automated medical report generation in spine radiology, i.e., given spinal medical images and directly create radiologist-level diagnosis reports to support clinical decision making, is a novel yet fundamental study in the domain of artificial intelligence in healthcare. However, it is incredibly challenging because it is an extremely complicated task that involves visual perception and high-level reasoning processes. In this paper, we propose the neural-symbolic learning (NSL) framework that performs human-like learning by unifying deep neural learning and symbolic logical reasoning for the spinal medical report generation. Generally speaking, the NSL framework firstly employs deep neural learning to imitate human visual perception for detecting abnormalities of target spinal structures. Concretely, we design an adversarial graph network that interpolates a symbolic graph reasoning module into a generative adversarial network through embedding prior domain knowledge, achieving semantic segmentation of spinal structures with high complexity and variability. NSL secondly conducts human-like symbolic logical reasoning that realizes unsupervised causal effect analysis of detected entities of abnormalities through meta-interpretive learning. NSL finally fills these discoveries of target diseases into a unified template, successfully achieving a comprehensive medical report generation. When it employed in a real-world clinical dataset, a series of empirical studies demonstrate its capacity on spinal medical report generation as well as show that our algorithm remarkably exceeds existing methods in the detection of spinal structures. These indicate its potential as a clinical tool that contributes to computer-aided diagnosis.
△ Less
Submitted 28 April, 2020;
originally announced April 2020.
-
Robust Screening of COVID-19 from Chest X-ray via Discriminative Cost-Sensitive Learning
Authors:
Tianyang Li,
Zhongyi Han,
Benzheng Wei,
Yuanjie Zheng,
Yanfei Hong,
Jinyu Cong
Abstract:
This paper addresses the new problem of automated screening of coronavirus disease 2019 (COVID-19) based on chest X-rays, which is urgently demanded toward fast stopping the pandemic. However, robust and accurate screening of COVID-19 from chest X-rays is still a globally recognized challenge because of two bottlenecks: 1) imaging features of COVID-19 share some similarities with other pneumonia o…
▽ More
This paper addresses the new problem of automated screening of coronavirus disease 2019 (COVID-19) based on chest X-rays, which is urgently demanded toward fast stopping the pandemic. However, robust and accurate screening of COVID-19 from chest X-rays is still a globally recognized challenge because of two bottlenecks: 1) imaging features of COVID-19 share some similarities with other pneumonia on chest X-rays, and 2) the misdiagnosis rate of COVID-19 is very high, and the misdiagnosis cost is expensive. While a few pioneering works have made much progress, they underestimate both crucial bottlenecks. In this paper, we report our solution, discriminative cost-sensitive learning (DCSL), which should be the choice if the clinical needs the assisted screening of COVID-19 from chest X-rays. DCSL combines both advantages from fine-grained classification and cost-sensitive learning. Firstly, DCSL develops a conditional center loss that learns deep discriminative representation. Secondly, DCSL establishes score-level cost-sensitive learning that can adaptively enlarge the cost of misclassifying COVID-19 examples into other classes. DCSL is so flexible that it can apply in any deep neural network. We collected a large-scale multi-class dataset comprised of 2,239 chest X-ray examples: 239 examples from confirmed COVID-19 cases, 1,000 examples with confirmed bacterial or viral pneumonia cases, and 1,000 examples of healthy people. Extensive experiments on the three-class classification show that our algorithm remarkably outperforms state-of-the-art algorithms. It achieves an accuracy of 97.01%, a precision of 97%, a sensitivity of 97.09%, and an F1-score of 96.98%. These results endow our algorithm as an efficient tool for the fast large-scale screening of COVID-19.
△ Less
Submitted 21 May, 2020; v1 submitted 27 April, 2020;
originally announced April 2020.
-
WiEps: Measurement of Dielectric Property with Commodity WiFi Device -- An application to Ethanol/Water Mixture
Authors:
Hang Song,
Bo Wei,
Qun Yu,
Xia Xiao,
Takamaro Kikkawa
Abstract:
WiFi signal has become accessible everywhere, providing high-speed data transmission experience. Besides the communication service, channel state information (CSI) of the WiFi signals is widely employed for numerous Internet of Things (IoT) applications. Recently, most of these applications are based on analysis of the microwave reflections caused by physical movement of the objective. In this pap…
▽ More
WiFi signal has become accessible everywhere, providing high-speed data transmission experience. Besides the communication service, channel state information (CSI) of the WiFi signals is widely employed for numerous Internet of Things (IoT) applications. Recently, most of these applications are based on analysis of the microwave reflections caused by physical movement of the objective. In this paper, a novel contactless wireless sensing technique named WiEps is developed to measure the dielectric properties of the material, exploiting the transmission characteristics of the WiFi signals. In WiEps, the material under test is placed between the transmitter antenna and receiver antenna. A theoretical model is proposed to quantitatively describe the relationship between CSI data and dielectric properties of the material. During the experiment, the phase and amplitude of the transmitted WiFi signals are extracted from the measured CSI data. The parameters of the theoretical model are calculated using measured data from the known materials. Then, WiEps is utilized to estimate the dielectric properties of unknown materials. The proposed technique is first applied to the ethanol/water mixtures. Then, additional liquids are measured for further verification. The estimated permittivities and conductivities show good agreement with the actual values, with the average error of 4.0% and 8.9%, respectively, indicating the efficacy of WiEps. By measuring the dielectric property, this technique is promising to be applied to new IoT applications using ubiquitous WiFi signals, such as food engineering, material manufacturing process monitoring, and security check.
△ Less
Submitted 5 June, 2020; v1 submitted 3 March, 2020;
originally announced March 2020.
-
A Multi-view CNN-based Acoustic Classification System for Automatic Animal Species Identification
Authors:
Weitao Xu,
Xiang Zhang,
Lina Yao,
Wanli Xue,
Bo Wei
Abstract:
Automatic identification of animal species by their vocalization is an important and challenging task. Although many kinds of audio monitoring system have been proposed in the literature, they suffer from several disadvantages such as non-trivial feature selection, accuracy degradation because of environmental noise or intensive local computation. In this paper, we propose a deep learning based ac…
▽ More
Automatic identification of animal species by their vocalization is an important and challenging task. Although many kinds of audio monitoring system have been proposed in the literature, they suffer from several disadvantages such as non-trivial feature selection, accuracy degradation because of environmental noise or intensive local computation. In this paper, we propose a deep learning based acoustic classification framework for Wireless Acoustic Sensor Network (WASN). The proposed framework is based on cloud architecture which relaxes the computational burden on the wireless sensor node. To improve the recognition accuracy, we design a multi-view Convolution Neural Network (CNN) to extract the short-, middle-, and long-term dependencies in parallel. The evaluation on two real datasets shows that the proposed architecture can achieve high accuracy and outperforms traditional classification systems significantly when the environmental noise dominate the audio signal (low SNR). Moreover, we implement and deploy the proposed system on a testbed and analyse the system performance in real-world environments. Both simulation and real-world evaluation demonstrate the accuracy and robustness of the proposed acoustic classification system in distinguishing species of animals.
△ Less
Submitted 22 February, 2020;
originally announced February 2020.
-
Sensor-Movement-Robust Angle Estimation for 3-DoF Lower Limb Joints Without Calibration
Authors:
Chunzhi Yi,
Feng Jiang,
Zhiyuan Chen,
Baichun Wei,
Hao Guo,
Xunfeng Yin,
Fangzhuo Li,
Chifu Yang
Abstract:
Inertial measurement unit (IMU)-based 3-DoF angle estimation methods for lower limb joints have been studied for decades, however the calibration motions and/or careful sensor placement are still necessary due to challenges of real-time application. This study proposes a novel sensormovement-robust 3-DoF method for lower-limb joint angle estimation without calibration. A realtime optimization proc…
▽ More
Inertial measurement unit (IMU)-based 3-DoF angle estimation methods for lower limb joints have been studied for decades, however the calibration motions and/or careful sensor placement are still necessary due to challenges of real-time application. This study proposes a novel sensormovement-robust 3-DoF method for lower-limb joint angle estimation without calibration. A realtime optimization process, which is based on a feedback iteration progress to identify three joint axes of a 3-DoF joint, has been presented with a reference frame calibration algorithm, and a safe-guarded strategy is proposed to detect and compensate for the errors caused by sensor movements. The experimental results obtained from a 3-DoF gimbal and ten healthy subjects demonstrate a promising performance on 3-DoF angle estimation. Specially, the experiments on ten subjects are performed with three gait modes and a 2-min level walking. The root mean square error is below 2 deg for level walking and 5 deg for other two gait modes. The result of 2-min level walking shows our algorithms stability under long run. The robustness against sensor movement are demonstrated through data from multiple sets of IMUs. In addition, results from the 3-DoF gimbal indicate that the accuracy of 3-DoF angle estimation could be improved by 84.9% with our reference frame calibration algorithm. In conclusion, our study proposes and validates a sensor-movement-robust 3-DoF angle estimation for lowerlimb joints based on IMU. To the best of our knowledge, our approach is the first experimental implementation of IMUbased 3-DoF angle estimation for lower-limb joints without calibration.
△ Less
Submitted 16 October, 2019;
originally announced October 2019.
-
No Need of Data Pre-processing: A General Framework for Radio-Based Device-Free Context Awareness
Authors:
Bo Wei,
Kai Li,
Chengwen Luo,
Weitao Xu,
Jin Zhang
Abstract:
Device-free context awareness is important to many applications. There are two broadly used approaches for device-free context awareness, i.e. video-based and radio-based. Video-based applications can deliver good performance, but privacy is a serious concern. Radio-based context awareness has drawn researchers attention instead because it does not violate privacy and radio signal can penetrate ob…
▽ More
Device-free context awareness is important to many applications. There are two broadly used approaches for device-free context awareness, i.e. video-based and radio-based. Video-based applications can deliver good performance, but privacy is a serious concern. Radio-based context awareness has drawn researchers attention instead because it does not violate privacy and radio signal can penetrate obstacles. Recently, deep learning has been introduced into radio-based device-free context awareness and helps boost the recognition accuracy. The present works design explicit methods for each radio based application. They also use one additional step to extract features before conducting classification and exploit deep learning as a classification tool. The additional initial data processing step introduces unnecessary noise and information loss. Without initial data processing, it is, however, challenging to explore patterns of raw signals. In this paper, we are the first to propose an innovative deep learning based general framework for both signal processing and classification. The key novelty of this paper is that the framework can be generalised for all the radio-based context awareness applications. We also eliminate the additional effort to extract features from raw radio signals. We conduct extensive evaluations to show the superior performance of our proposed method and its generalisation.
△ Less
Submitted 9 August, 2019;
originally announced August 2019.