Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
 
 
Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

Article Types

Countries / Regions

Search Results (123)

Search Parameters:
Keywords = Mamba

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
20 pages, 1439 KiB  
Article
SSM-Net: Enhancing Compressed Sensing Image Reconstruction with Mamba Architecture and Fast Iterative Shrinking Threshold Algorithm Optimization
by Xianwei Gao, Bi Chen, Xiang Yao and Ye Yuan
Sensors 2025, 25(4), 1026; https://doi.org/10.3390/s25041026 - 9 Feb 2025
Viewed by 336
Abstract
Compressed sensing (CS) is a powerful technique that can reduce data size while maintaining high reconstruction quality, which makes it particularly valuable in high-dimensional image applications. However, many existing methods have difficulty balancing reconstruction accuracy, computational efficiency, and fast convergence. To address these [...] Read more.
Compressed sensing (CS) is a powerful technique that can reduce data size while maintaining high reconstruction quality, which makes it particularly valuable in high-dimensional image applications. However, many existing methods have difficulty balancing reconstruction accuracy, computational efficiency, and fast convergence. To address these challenges, this paper proposes SSM-Net, a novel framework that combines the state-space modeling (SSM) of the Mamba architecture with the fast iterative shrinking threshold algorithm (FISTA). The Mamba-based SSM module can effectively capture local and global dependencies with linear computational complexity and significantly reduces the computation time compared to Transformer-based methods. In addition, the momentum update inspired by FISTA improves the convergence speed during deep iterative reconstruction. SSM-Net features a lightweight sampling module for efficient data compression, an initial reconstruction module for fast approximation, and a deep reconstruction module for iterative refinement. Extensive experiments on various benchmark datasets show that SSM-Net achieves state-of-the-art reconstruction performance while reducing both training and inference reconstruction time, making SSM-Net a scalable and practical solution for real-time applications of compressed sensing. Full article
(This article belongs to the Section Sensing and Imaging)
Show Figures

Figure 1

21 pages, 10628 KiB  
Article
Thermal Video Enhancement Mamba: A Novel Approach to Thermal Video Enhancement for Real-World Applications
by Sargis Hovhannisyan, Sos Agaian, Karen Panetta and Artyom Grigoryan
Information 2025, 16(2), 125; https://doi.org/10.3390/info16020125 - 9 Feb 2025
Viewed by 402
Abstract
Object tracking in thermal video is challenging due to noise, blur, and low contrast. We present TVEMamba, a Mamba-based enhancement framework with near-linear complexity that improves tracking in these conditions. Our approach uses a State Space 2D (SS2D) module integrated with Convolutional Neural [...] Read more.
Object tracking in thermal video is challenging due to noise, blur, and low contrast. We present TVEMamba, a Mamba-based enhancement framework with near-linear complexity that improves tracking in these conditions. Our approach uses a State Space 2D (SS2D) module integrated with Convolutional Neural Networks (CNNs) to filter, sharpen, and highlight important details. Key components include (i) a denoising module to reduce background noise and enhance image clarity, (ii) an optical flow attention module to handle complex motion and reduce blur, and (iii) entropy-based labeling to create a fully labeled thermal dataset for training and evaluation. TVEMamba outperforms existing methods (DCRGC, RLBHE, IE-CGAN, BBCNN) across multiple datasets (BIRDSAI, FLIR, CAMEL, Autonomous Vehicles, Solar Panels) and achieves higher scores on standard quality metrics (EME, BDIM, DMTE, MDIMTE, LGTA). Extensive tests, including ablation studies and convergence analysis, confirm its robustness. Real-world examples, such as tracking humans, animals, and moving objects for self-driving vehicles and remote sensing, demonstrate the practical value of TVEMamba. Full article
(This article belongs to the Special Issue Emerging Research in Object Tracking and Image Segmentation)
Show Figures

Figure 1

21 pages, 9794 KiB  
Article
Weamba: Weather-Degraded Remote Sensing Image Restoration with Multi-Router State Space Model
by Shuang Wu, Xin He and Xiang Chen
Remote Sens. 2025, 17(3), 458; https://doi.org/10.3390/rs17030458 - 29 Jan 2025
Viewed by 500
Abstract
Adverse weather conditions, such as haze and raindrop, consistently degrade the quality of remote sensing images and affect subsequent vision-based applications. Recent years have witnessed advancements in convolutional neural networks (CNNs) and Transformers in the field of remote sensing image restoration. However, these [...] Read more.
Adverse weather conditions, such as haze and raindrop, consistently degrade the quality of remote sensing images and affect subsequent vision-based applications. Recent years have witnessed advancements in convolutional neural networks (CNNs) and Transformers in the field of remote sensing image restoration. However, these methods either suffer from limited receptive fields or incur quadratic computational overhead, leading to an imbalance between performance and model efficiency. In this paper, we propose an effective vision state space model (called Weamba) for remote sensing image restoration by modeling long-range pixel dependencies with linear complexity. Specifically, we develop a local-enhanced state space module to better aggregate rich local and global information, both of which are complementary and beneficial for high-quality image reconstruction. Furthermore, we design a multi-router scanning strategy for spatially varying feature extraction, alleviating the issue of redundant information caused by repeated scanning directions in existing methods. Extensive experiments on multiple benchmarks show that the proposed Weamba performs favorably against state-of-the-art approaches. Full article
Show Figures

Figure 1

20 pages, 10948 KiB  
Article
HMCNet: A Hybrid Mamba–CNN UNet for Infrared Small Target Detection
by Bolin Li, Peng Rao, Yueqi Su and Xin Chen
Remote Sens. 2025, 17(3), 452; https://doi.org/10.3390/rs17030452 - 29 Jan 2025
Viewed by 577
Abstract
Using infrared technology to accurately detect small weak targets is crucial in various fields, such as reconnaissance and security. However, the infrared detection of small weak targets is challenged by complex backgrounds, tiny target sizes, and low signal-to-noise ratios, which significantly increase the [...] Read more.
Using infrared technology to accurately detect small weak targets is crucial in various fields, such as reconnaissance and security. However, the infrared detection of small weak targets is challenged by complex backgrounds, tiny target sizes, and low signal-to-noise ratios, which significantly increase the difficulty of detection. Early studies in this domain typically utilized manually designed feature-extraction methods that performed inadequately in the presence of complex backgrounds. While advancements in deep learning have spurred rapid progress in this field, with CNN models effectively enhancing the detection performance, the problem of small weak target features being lost persists. HMCNet, which employs a hybrid architecture combining a state space model and a CNN, is proposed in this paper; its hybrid architecture demonstrates the capacity to extract the local features and model the global context, facilitating superior suppression of complex backgrounds and detection of small weak targets. Our experimental results on the public IRSTD-1k dataset and our own MISTD dataset indicate that, compared to the current mainstream methods, the method proposed achieves better detection accuracy while maintaining high-speed inference capabilities, thus validating the rationality and effectiveness of this research. Full article
Show Figures

Figure 1

20 pages, 7947 KiB  
Article
Towards an Efficient Remote Sensing Image Compression Network with Visual State Space Model
by Yongqiang Wang, Feng Liang, Shang Wang, Hang Chen, Qi Cao, Haisheng Fu and Zhenjiao Chen
Remote Sens. 2025, 17(3), 425; https://doi.org/10.3390/rs17030425 - 26 Jan 2025
Viewed by 548
Abstract
In the past few years, deep learning has achieved remarkable advancements in the area of image compression. Remote sensing image compression networks focus on enhancing the similarity between the input and reconstructed images, effectively reducing the storage and bandwidth requirements for high-resolution remote [...] Read more.
In the past few years, deep learning has achieved remarkable advancements in the area of image compression. Remote sensing image compression networks focus on enhancing the similarity between the input and reconstructed images, effectively reducing the storage and bandwidth requirements for high-resolution remote sensing images. As the network’s effective receptive field (ERF) expands, it can capture more feature information across the remote sensing images, thereby reducing spatial redundancy and improving compression efficiency. However, the majority of these learned image compression (LIC) techniques are primarily CNN-based and transformer-based, often failing to balance the global ERF and computational complexity optimally. To alleviate this issue, we propose a learned remote sensing image compression network with visual state space model named VMIC to achieve a better trade-off between computational complexity and performance. Specifically, instead of stacking small convolution kernels or heavy self-attention mechanisms, we employ a 2D-bidirectional selective scan mechanism. Every element within the feature map aggregates data from multiple spatial positions, establishing a globally effective receptive field with linear computational complexity. We extend it to an omni-selective scan for the global-spatial correlations within our Channel and Global Context Entropy Model (CGCM), enabling the integration of spatial and channel priors to minimize redundancy across slices. Experimental results demonstrate that the proposed method achieves superior trade-off between rate-distortion performance and complexity. Furthermore, in comparison to traditional codecs and learned image compression algorithms, our model achieves BD-rate reductions of −4.48%, −9.80% over the state-of-the-art VTM on the AID and NWPU VHR-10 datasets, respectively, as well as −6.73% and −7.93% on the panchromatic and multispectral images of the WorldView-3 remote sensing dataset. Full article
Show Figures

Figure 1

44 pages, 2672 KiB  
Review
Magnetic Nanoparticles: Advances in Synthesis, Sensing, and Theragnostic Applications
by Adeyemi O. Adeeyo, Mercy A. Alabi, Joshua A. Oyetade, Thabo T. I. Nkambule, Bhekie B. Mamba, Adewale O. Oladipo, Rachel Makungo and Titus A. M. Msagati
Magnetochemistry 2025, 11(2), 9; https://doi.org/10.3390/magnetochemistry11020009 - 26 Jan 2025
Viewed by 719
Abstract
The synthesis of magnetic nanoparticles (MNPs) via the chemical, biological, and physical routes has been reported on along with advantages and attendant limitations. This study focuses on the sensing and emerging theragnostic applications of this category of nanoparticles (NPs) in clinical sciences by [...] Read more.
The synthesis of magnetic nanoparticles (MNPs) via the chemical, biological, and physical routes has been reported on along with advantages and attendant limitations. This study focuses on the sensing and emerging theragnostic applications of this category of nanoparticles (NPs) in clinical sciences by unveiling the unique performance of these NPs in the biological sensing of bacteria and nucleotide sequencing. Also, in terms of medicine and clinical science, this review analyzes the emerging theragnostic applications of NPs in drug delivery, bone tissue engineering, deep brain stimulation, therapeutic hyperthermia, tumor detection, magnetic imaging and cell tracking, lymph node visualization, blood purification, and COVID-19 detection. This review presents succinct surface functionalization and unique surface coating techniques to confer less toxicity and biocompatibility during synthesis, which are often identified as limitations in medical applications. This study also indicates that these surface improvement techniques are useful for refining the selective activity of MNPs during their use as sensors and biomarkers. In addition, this study unveils attendant limitations, especially toxicological impacts on biomolecules, and suggests that future research should pay attention to the mitigation of the biotoxicity of MNPs. Thus, this study presents a proficient approach for the synthesis of high-performance MNPs fit for proficient medicine in the detection of microorganisms, better diagnosis, and treatment in medicine. Full article
Show Figures

Figure 1

18 pages, 2656 KiB  
Article
Multimodal Natural Disaster Scene Recognition with Integrated Large Model and Mamba
by Yuxuan Shao and Liwen Xu
Appl. Sci. 2025, 15(3), 1149; https://doi.org/10.3390/app15031149 - 23 Jan 2025
Viewed by 548
Abstract
The accurate identification of natural disasters is crucial in ensuring effective post-disaster relief efforts. However, the existing models for disaster classification often incur high costs. To address this, we propose leveraging the most advanced pre-trained large language models, which offer superior generative and [...] Read more.
The accurate identification of natural disasters is crucial in ensuring effective post-disaster relief efforts. However, the existing models for disaster classification often incur high costs. To address this, we propose leveraging the most advanced pre-trained large language models, which offer superior generative and multimodal understanding capabilities. Using a question-answering approach, we extract textual descriptions and category prediction probabilities for disaster scenarios, which are then used as input to our proposed Mamba Multimodal Disaster Recognition Network (Mamba-MDRNet). This model integrates a large pre-trained model with the Mamba mechanism, enabling the selection of the most reliable modality information as a robust basis for scene classification. Extensive experiments demonstrate consistent performance improvements across various visual models with heterogeneous architectures. Notably, integrating EfficientNet within Mamba-MDRNet yielded 97.82% accuracy for natural scene classification, surpassing the performance of the CNN (91.75%), ViT (94.50%), and ResNet18 (97.25%). These results highlight the potential of multimodal models combining large models and the Mamba mechanism for disaster type prediction. Full article
(This article belongs to the Special Issue Deep Learning for Image Processing and Computer Vision)
Show Figures

Figure 1

28 pages, 127916 KiB  
Article
A Pine Wilt Disease Detection Model Integrated with Mamba Model and Attention Mechanisms Using UAV Imagery
by Minhui Bai, Xinyu Di, Lechuan Yu, Jian Ding and Haifeng Lin
Remote Sens. 2025, 17(2), 255; https://doi.org/10.3390/rs17020255 - 13 Jan 2025
Viewed by 867
Abstract
Pine wilt disease (PWD) is a highly destructive worldwide forest quarantine disease that has the potential to destroy entire pine forests in a relatively brief period, resulting in significant economic losses and environmental damage. Manual monitoring, biochemical detection and satellite remote sensing are [...] Read more.
Pine wilt disease (PWD) is a highly destructive worldwide forest quarantine disease that has the potential to destroy entire pine forests in a relatively brief period, resulting in significant economic losses and environmental damage. Manual monitoring, biochemical detection and satellite remote sensing are frequently inadequate for the timely detection and control of pine wilt disease. This paper presents a fusion model, which integrates the Mamba model and the attention mechanism, for deployment on unmanned aerial vehicles (UAVs) to detect infected pine trees. The experimental dataset presented in this paper comprises images of pine trees captured by UAVs in mixed forests. The images were gathered primarily during the spring of 2023, spanning the months of February to May. The images were subjected to a preprocessing phase, during which they were transformed into the research dataset. The fusion model comprised three principal components. The initial component is the Mamba backbone network with State Space Model (SSM) at its core, which is capable of extracting pine wilt features with a high degree of efficacy. The second component is the attention network, which enables our fusion model to center on PWD features with greater efficacy. The optimal configuration was determined through an evaluation of various attention mechanism modules, including four attention modules. The third component, Path Aggregation Feature Pyramid Network (PAFPN), facilitates the fusion and refinement of data at varying scales, thereby enhancing the model’s capacity to detect multi-scale objects. Furthermore, the convolutional layers within the model have been replaced with depth separable convolutional layers (DSconv), which has the additional benefit of reducing the number of model parameters and improving the model’s detection speed. The final fusion model was validated on a test set, achieving an accuracy of 90.0%, a recall of 81.8%, a map of 86.5%, a parameter counts of 5.9 Mega, and a detection speed of 40.16 FPS. In comparison to Yolov8, the accuracy is enhanced by 7.1%, the recall by 5.4%, and the map by 3.1%. These outcomes demonstrate that our fusion model is appropriate for implementation on edge devices, such as UAVs, and is capable of effective detection of PWD. Full article
Show Figures

Figure 1

23 pages, 10942 KiB  
Article
MambaShadowDet: A High-Speed and High-Accuracy Moving Target Shadow Detection Network for Video SAR
by Xiaowo Xu, Tianwen Zhang, Xiaoling Zhang, Wensi Zhang, Xiao Ke and Tianjiao Zeng
Remote Sens. 2025, 17(2), 214; https://doi.org/10.3390/rs17020214 - 9 Jan 2025
Viewed by 760
Abstract
Existing convolution neural network (CNN)-based video synthetic aperture radar (SAR) moving target shadow detectors are difficult to model long-range dependencies, while transformer-based ones often suffer from greater complexity. To handle these issues, this paper proposes MambaShadowDet, a novel lightweight deep learning (DL) detector [...] Read more.
Existing convolution neural network (CNN)-based video synthetic aperture radar (SAR) moving target shadow detectors are difficult to model long-range dependencies, while transformer-based ones often suffer from greater complexity. To handle these issues, this paper proposes MambaShadowDet, a novel lightweight deep learning (DL) detector based on a state space model (SSM), dedicated to high-speed and high-accuracy moving target shadow detection in video SAR images. By introducing SSM with the linear complexity into YOLOv8, MambaShadowDet effectively captures the global feature dependencies while relieving computational load. Specifically, it designs Mamba-Backbone, combining SSM and CNN to effectively extract both global contextual and local spatial information, as well as a slim path aggregation feature pyramid network (Slim-PAFPN) to enhance multi-level feature extraction and further reduce complexity. Abundant experiments on the Sandia National Laboratories (SNL) video SAR data show that MambaShadowDet achieves superior moving target shadow detection performance with a detection accuracy of 80.32% F1 score and an inference speed of 44.44 frames per second (FPS), outperforming existing models in both accuracy and speed. Full article
Show Figures

Figure 1

22 pages, 1078 KiB  
Article
An Event Causality Identification Framework Using Ensemble Learning
by Xiaoyang Wang, Wenjie Luo and Xiudan Yang
Information 2025, 16(1), 32; https://doi.org/10.3390/info16010032 - 7 Jan 2025
Viewed by 422
Abstract
Event causality identification is an upstream operation for many tasks, including knowledge graphs and intelligent question-and-answer systems. The latest models introduce external knowledge and then use deep learning for causality prediction. However, event causality recognition still faces problems such as data imbalance and [...] Read more.
Event causality identification is an upstream operation for many tasks, including knowledge graphs and intelligent question-and-answer systems. The latest models introduce external knowledge and then use deep learning for causality prediction. However, event causality recognition still faces problems such as data imbalance and insufficient event content richness. Additionally, previous frameworks have utilized a single model, but these frequently produce unsatisfactory outcomes such as lower precision rates and lower recall rates. We propose the concept of ensemble learning, which combines multiple models to achieve frameworks that perform as well as or better than the latest models. This framework combines the advantages of Mamba, a temporal convolutional network, and graph computation to identify event causality more effectively and accurately. After comparing our framework to standard datasets, our F1-scores (measures of model accuracy) are essentially the same as those of the state-of-the-art (SOTA) methods on one dataset. Full article
(This article belongs to the Section Artificial Intelligence)
Show Figures

Figure 1

22 pages, 3424 KiB  
Article
A Line of Sight/Non Line of Sight Recognition Method Based on the Dynamic Multi-Level Optimization of Comprehensive Features
by Ziyao Ma, Zhongliang Deng, Zidu Tian, Yingjian Zhang, Jizhou Wang and Jilong Guo
Sensors 2025, 25(2), 304; https://doi.org/10.3390/s25020304 - 7 Jan 2025
Viewed by 515
Abstract
With the advent of the 5G era, high-precision localization based on mobile communication networks has become a research hotspot, playing an important role in indoor emergency rescue in shopping malls, smart factory management and tracking, as well as precision marketing. However, in complex [...] Read more.
With the advent of the 5G era, high-precision localization based on mobile communication networks has become a research hotspot, playing an important role in indoor emergency rescue in shopping malls, smart factory management and tracking, as well as precision marketing. However, in complex environments, non-line-of-sight (NLOS) propagation reduces the measurement accuracy of 5G signals, causing large deviations in position solving. In order to obtain high-precision position information, it is necessary to recognize the propagation state of the signal before distance measurement or angle measurement. In this paper, we propose a dynamic multi-level optimization of comprehensive features (DMOCF) network model for line-of-sight (LOS)/NLOS identification. The DMOCF model improves the expression ability of the deep model by adding a res2 module to the time delay neural network (TDNN), so that fine-grained feature information such as weak reflections or noise in the signal can be deeply understood by the model, enabling the network to realize layer-level feature processing by adding Squeeze and Excitation (SE) blocks with adaptive weight adjustment for each layer. A mamba module with position coding is added to each layer to capture the local patterns of wireless signals under complex propagation phenomena by extracting local features, enabling the model to understand the evolution of signals over time in a deeper way. In addition, this paper proposes an improved sand cat search algorithm for network parameter search, which improves search efficiency and search accuracy. Overall, this new network architecture combines the capabilities of local feature extraction, global feature preservation, and time series modeling, resulting in superior performance in the 5G channel impulse response (CIR) signal classification task, improving the accuracy of the model and accurately identifying the key characteristics of multipath signal propagation. Experimental results show that the NLOS/LOS recognition method proposed in this paper has higher accuracy than other deep learning methods. Full article
Show Figures

Figure 1

19 pages, 4546 KiB  
Article
MultiPhys: Heterogeneous Fusion of Mamba and Transformer for Video-Based Multi-Task Physiological Measurement
by Chaoyang Huo, Pengbo Yin and Bo Fu
Sensors 2025, 25(1), 100; https://doi.org/10.3390/s25010100 - 27 Dec 2024
Viewed by 609
Abstract
Due to its non-contact characteristics, remote photoplethysmography (rPPG) has attracted widespread attention in recent years, and has been widely applied for remote physiological measurements. However, most of the existing rPPG models are unable to estimate multiple physiological signals simultaneously, and the performance of [...] Read more.
Due to its non-contact characteristics, remote photoplethysmography (rPPG) has attracted widespread attention in recent years, and has been widely applied for remote physiological measurements. However, most of the existing rPPG models are unable to estimate multiple physiological signals simultaneously, and the performance of the limited available multi-task models is also restricted due to their single-model architectures. To address the above problems, this study proposes MultiPhys, adopting a heterogeneous network fusion approach for its development. Specifically, a Convolutional Neural Network (CNN) is used to quickly extract local features in the early stage, a transformer captures global context and long-distance dependencies, and Mamba is used to compensate for the transformer’s deficiencies, reducing the computational complexity and improving the accuracy of the model. Additionally, a gate is utilized for feature selection, which classifies the features of different physiological indicators. Finally, physiological indicators are estimated after passing features to each task-related head. Experiments on three datasets show that MultiPhys has superior performance in handling multiple tasks. The results of cross-dataset and hyper-parameter sensitivity tests also verify its generalization ability and robustness, respectively. MultiPhys can be considered as an effective solution for remote physiological estimation, thus promoting the development of this field. Full article
(This article belongs to the Section Intelligent Sensors)
Show Figures

Figure 1

17 pages, 13691 KiB  
Article
MambaPose: A Human Pose Estimation Based on Gated Feedforward Network and Mamba
by Jianqiang Zhang, Jing Hou, Qiusheng He, Zhengwei Yuan and Hao Xue
Sensors 2024, 24(24), 8158; https://doi.org/10.3390/s24248158 - 20 Dec 2024
Viewed by 4327
Abstract
Human pose estimation is an important research direction in the field of computer vision, which aims to accurately identify the position and posture of keypoints of the human body through images or videos. However, multi-person pose estimation yields false detection or missed detection [...] Read more.
Human pose estimation is an important research direction in the field of computer vision, which aims to accurately identify the position and posture of keypoints of the human body through images or videos. However, multi-person pose estimation yields false detection or missed detection in dense crowds, and it is still difficult to detect small targets. In this paper, we propose a Mamba-based human pose estimation. First, we design a GMamba structure to be used as a backbone network to extract human keypoints. A gating mechanism is introduced into the linear layer of Mamba, which allows the model to dynamically adjust the weights according to the different input images to locate the human keypoints more precisely. Secondly, GMamba as the backbone network can effectively solve the long-sequence problem. The direct use of convolutional downsampling reduces selectivity for different stages of information flow. We used slice downsampling (SD) to reduce the resolution of the feature map to half the original size, and then fused local features from four different locations. The fusion of multi-channel information helped the model obtain rich pose information. Finally, we introduced an adaptive threshold focus loss (ATFL) to dynamically adjust the weights of different keypoints. We assigned higher weights to error-prone keypoints to strengthen the model’s attention to these points. Thus, we effectively improved the accuracy of keypoint identification in cases of occlusion, complex background, etc., and significantly improved the overall performance of attitude estimation and anti-interference ability. Experimental results showed that the AP and AP50 of the proposed algorithm on the COCO 2017 validation set were 72.2 and 92.6. Compared with the typical algorithm, it was improved by 1.1% on AP50. The proposed method can effectively detect the keypoints of the human body, and provides stronger robustness and accuracy for the estimation of human posture in complex scenes. Full article
(This article belongs to the Section Sensing and Imaging)
Show Figures

Figure 1

15 pages, 3905 KiB  
Article
Conditional Skipping Mamba Network for Pan-Sharpening
by Yunxuan Tang, Huaguang Li, Peng Liu and Tong Li
Symmetry 2024, 16(12), 1681; https://doi.org/10.3390/sym16121681 - 19 Dec 2024
Viewed by 621
Abstract
Pan-sharpening aims to generate high-resolution multispectral (HRMS) images by combining high-resolution panchromatic (PAN) images with low-resolution multispectral (LRMS) data, while maintaining the symmetry of spatial and spectral characteristics. Traditional convolutional neural networks (CNNs) struggle with global dependency modeling due to local receptive fields, [...] Read more.
Pan-sharpening aims to generate high-resolution multispectral (HRMS) images by combining high-resolution panchromatic (PAN) images with low-resolution multispectral (LRMS) data, while maintaining the symmetry of spatial and spectral characteristics. Traditional convolutional neural networks (CNNs) struggle with global dependency modeling due to local receptive fields, and Transformer-based models are computationally expensive. Recent Mamba models offer linear complexity and effective global modeling. However, existing Mamba-based methods lack sensitivity to local feature variations, leading to suboptimal fine-detail preservation. To address this, we propose a Conditional Skipping Mamba Network (CSMN), which enhances global-local feature fusion symmetrically through two modules: (1) the Adaptive Mamba Module (AMM), which improves global perception using adaptive spatial-frequency integration; and (2) the Cross-domain Mamba Module (CDMM), optimizing cross-domain spectral-spatial representation. Experimental results on the IKONOS and WorldView-2 datasets demonstrate that CSMN surpasses existing state-of-the-art methods in achieving superior spectral consistency and preserving spatial details, with performance that is more symmetric in fine-detail preservation. Full article
(This article belongs to the Section Computer)
Show Figures

Figure 1

16 pages, 5744 KiB  
Article
Early Prediction of Remaining Useful Life for Lithium-Ion Batteries with the State Space Model
by Yuqi Liang and Shuai Zhao
Energies 2024, 17(24), 6326; https://doi.org/10.3390/en17246326 - 16 Dec 2024
Viewed by 723
Abstract
In the realm of lithium-ion batteries (LIBs), issues like material aging and capacity decline contribute to performance degradation or potential safety hazards. Predicting remaining useful life (RUL) serves as a crucial method of assessing the health of batteries, thereby enhancing reliability and safety. [...] Read more.
In the realm of lithium-ion batteries (LIBs), issues like material aging and capacity decline contribute to performance degradation or potential safety hazards. Predicting remaining useful life (RUL) serves as a crucial method of assessing the health of batteries, thereby enhancing reliability and safety. To reduce the complexity and improve the accuracy and applicability of early RUL predictions for LIBs, we proposed a Mamba-based state space model for early RUL prediction. Due to the impacts of abnormal data, we first use the interquartile range (IQR) method with a sliding window for data cleansing. Subsequently, the top three highest correlated features are selected, and only the first 300 cycling data are used for training. The model has the ability to make forecasts using these few historical data. Extensive experiments are conducted using CALCE CS2 datasets. The MAE, RMSE, and RE are less than 0.015, 0.019, and 0.0261; meanwhile, R2 is higher than 0.99. Compared to the baseline approaches (CNN, BiLSTM, and CNN-BiLSTM), the average MAE, RMSE, and RE of the proposed approach are reduced by at least 29%, 21%, and 36%, respectively. According to the experimental results, the proposed approach performs better in terms of accuracy, robustness, and efficiency. Full article
(This article belongs to the Section D2: Electrochem: Batteries, Fuel Cells, Capacitors)
Show Figures

Figure 1

Back to TopTop