Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- ArticleDecember 2024
EffiSeaNet: Pioneering Lightweight Network for Underwater Salient Object Detection
AbstractUnderwater salient object detection seeks to pinpoint the most vital elements in underwater environments, offering considerable promise for underwater exploration. Considering the preference for low-complexity algorithms in underwater applications ...
- research-articleOctober 2024
SimCLIP: Refining Image-Text Alignment with Simple Prompts for Zero-/Few-shot Anomaly Detection
MM '24: Proceedings of the 32nd ACM International Conference on MultimediaPages 1761–1770https://doi.org/10.1145/3664647.3681376Recently, large pre-trained vision-language models, such as CLIP, have demonstrated significant potential in zero-/few-shot anomaly detection tasks. However, existing methods not only rely on expert knowledge to manually craft extensive text prompts but ...
- research-articleOctober 2024
Efficient Perceiving Local Details via Adaptive Spatial-Frequency Information Integration for Multi-focus Image Fusion
MM '24: Proceedings of the 32nd ACM International Conference on MultimediaPages 9350–9359https://doi.org/10.1145/3664647.3680738Multi-focus image fusion (MFIF) aims to combine multiple images with different focused regions into a single all-in-focus image. Existing unsupervised deep learning-based methods only fuse structural information of images in the spatial domain, ...
- research-articleOctober 2024
P2SAM: Probabilistically Prompted SAMs Are Efficient Segmentator for Ambiguous Medical Images
- Yuzhi Huang,
- Chenxin Li,
- Zixu Lin,
- Hengyu Liu,
- Haote Xu,
- Yifan Liu,
- Yue Huang,
- Xinghao Ding,
- Xiaotong Tu,
- Yixuan Yuan
MM '24: Proceedings of the 32nd ACM International Conference on MultimediaPages 9779–9788https://doi.org/10.1145/3664647.3680628Generating diverse plausible outputs from a single input is crucial for addressing visual ambiguities, exemplified in medical imaging where experts may provide varying semantic segmentation annotations for the same image.Existing methods handles ...
- research-articleSeptember 2024
Source-free cross-domain fault diagnosis of rotating machinery using the Siamese framework
AbstractDespite deep learning based intelligent diagnosis has become an essential means of perceiving the health status of rotating machinery, existing diagnostic models struggle to efficiently extract domain-invariant representations for cross-domain ...
Highlights- DRSN-SCW network is proposed to handle vibration signals with strong noise, enhancing the feature extraction capability.
- A self-supervised Siamese framework is designed for end-to-end representation learning.
- The pseudo-labeling ...
-
- research-articleApril 2024
AIVR-Net: Attribute-based invariant visual representation learning for vehicle re-identification
AbstractVehicle re-identification (ReID) aims to match and track vehicles in a surveillance system across non-overlapping camera views. Despite great advances have been achieved in intra-domain and cross-domain vehicle ReID, most existing methods still ...
Highlights- A novel framework uses attribute visual information to improve the performance of Vehicle ReID.
- Attribute Representation Disentangle Branch is proposed to promote attribute learning flow.
- Our model achieves superior results in both ...
- research-articleFebruary 2024
Progressive high-frequency reconstruction for pan-sharpening with implicit neural representation
AAAI'24/IAAI'24/EAAI'24: Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence and Thirty-Sixth Conference on Innovative Applications of Artificial Intelligence and Fourteenth Symposium on Educational Advances in Artificial IntelligenceArticle No.: 466, Pages 4189–4197https://doi.org/10.1609/aaai.v38i5.28214Pan-sharpening aims to leverage the high-frequency signal of the panchromatic (PAN) image to enhance the resolution of its corresponding multi-spectral (MS) image. However, deep neural networks (DNNs) tend to prioritize learning the low-frequency ...
- research-articleFebruary 2024
Unsupervised pan-sharpening via mutually guided detail restoration
AAAI'24/IAAI'24/EAAI'24: Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence and Thirty-Sixth Conference on Innovative Applications of Artificial Intelligence and Fourteenth Symposium on Educational Advances in Artificial IntelligenceArticle No.: 377, Pages 3386–3394https://doi.org/10.1609/aaai.v38i4.28125Pan-sharpening is a task that aims to super-resolve the low-resolution multispectral (LRMS) image with the guidance of a corresponding high-resolution panchromatic (PAN) image. The key challenge in pan-sharpening is to accurately modeling the ...
- research-articleFebruary 2024
Learning to sound imaging by a model-based interpretable network
AbstractAcoustic beamforming methods based on microphone arrays have been widely used for sound source localization in various industrial fields. The conventional methods such as Delay and Sum (DAS) beamforming are limited by poor spatial resolution ...
Highlights- The propose DAMAS-FISTA-LASSO algorithm can significantly reduce the runtime requirement of DAMAS.
- We further design DFLNet, which adopts the structure of the DAMAS-FISTA-LASSO and inherits the relevant acoustic domain knowledge.
- ...
- research-articleDecember 2023
Joint Image and Feature Levels Disentanglement for Generalizable Vehicle Re-identification
IEEE Transactions on Intelligent Transportation Systems (ITS-TRANSACTIONS), Volume 24, Issue 12Pages 15259–15273https://doi.org/10.1109/TITS.2023.3314213Domain generalization (DG), which doesn’t require any data from target domains during training, is more challenging but practical than unsupervised domain adaptation (UDA). Since different vehicles of the same type have a similar appearance, neural ...
- ArticleNovember 2023
Domain Generalized Object Detection with Triple Graph Reasoning Network
AbstractRecent advances in Domain Adaptive Object Detection (DAOD) have vastly restrained the performance degradation caused by distribution shift. However, DAOD relies on the strong assumption of accessible target domain during the learning procedure, ...
- research-articleNovember 2023
Adaptive nonlinear group delay mode estimation
Highlights- Combined with a complex Bayesian compressive sensing framework, our method allows the components to adaptively be extracted via an iterative optimization framework.
- This algorithm can directly estimate the instantaneous amplitude and ...
The decomposition of non-stationary signals remains a challenge in a wide variety of fields. Especially, the impulse or cross-mode signals are difficult to be reconstructed by recent methods due to their transient characteristic. Moreover, most ...
- research-articleOctober 2023
Learning High-frequency Feature Enhancement and Alignment for Pan-sharpening
MM '23: Proceedings of the 31st ACM International Conference on MultimediaPages 358–367https://doi.org/10.1145/3581783.3611937Pan-sharpening aims to utilize the high-resolution panchromatic (PAN) image as a guidance to super-resolve the spatial resolution of the low-resolution multispectral (MS) image. The key challenge in pan-sharpening is how to effectively and precisely ...
- research-articleOctober 2023
Domain-irrelevant Feature Learning for Generalizable Pan-sharpening
MM '23: Proceedings of the 31st ACM International Conference on MultimediaPages 3287–3296https://doi.org/10.1145/3581783.3611894Pan-sharpening aims to spatially enhance the low-resolution multispectral image (LRMS) by transferring high-frequency details from a panchromatic image (PAN) while preserving the spectral characteristics of LRMS. Previous arts mainly focus on how to ...
- ArticleDecember 2023
DP-INNet: Dual-Path Implicit Neural Network for Spatial and Spectral Features Fusion in Pan-Sharpening
AbstractPan-sharpening is a technique that fuses a high-resolution panchromatic (PAN) image with its corresponding low-resolution multispectral (MS) image to create a high-resolution multispectral image. Due to the powerful representation ability of ...
- ArticleDecember 2023
High-Resolution Feature Representation Driven Infrared Small-Dim Object Detection
AbstractInfrared small-dim object detection is a challenging task due to the small size, weak features, lack of prominent structural information, and vulnerability to background interference. During the process of deep learning-based feature extraction, ...
- ArticleDecember 2023
A Two-Stage Federated Learning Framework for Class Imbalance in Aerial Scene Classification
AbstractCentralized aerial imagery analysis techniques face two challenges. The first one is the data silos problem, where data is located at different organizations separately. The second challenge is the class imbalance in the overall distribution of ...
- ArticleDecember 2023
Recognizer Embedding Diffusion Generation for Few-Shot SAR Recognization
AbstractSynthetic Aperture Radar (SAR) has become a research hotspot due to its ability to identify targets in all weather conditions and at all times. To achieve satisfactory recognition performance in most existing automatic target recognition (ATR) ...
- ArticleDecember 2023
Adversarial Robustness via Multi-experts Framework for SAR Recognition with Class Imbalanced
AbstractWith the rapid development of deep learning technology, significant progress has been made in the field of synthetic aperture radar (SAR) target recognition algorithms. However, deep neural networks are vulnerable to adversarial attacks in ...
- ArticleDecember 2023
Graph-Based Dependency-Aware Non-Intrusive Load Monitoring
AbstractNon-intrusive load monitoring (NILM) is able to analyze and predict users’ power consumption behaviors for further improving the power consumption efficiency of the grid. Neural network-based techniques have been developed for NILM. However, the ...