Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleJanuary 2025
GAF-Net: A new automated segmentation method based on multiscale feature fusion and feedback module
Pattern Recognition Letters (PTRL), Volume 187, Issue CPages 86–92https://doi.org/10.1016/j.patrec.2024.11.025Highlights- A multi-scale feature fusion feedback module is proposed to refine the local features;.
- The global feature module is developed to obtain a fine global information map;.
- The adaptive feature fusion is proposed to obtain final ...
Surface defect detection (SDD) is the necessary technique to monitor the surface quality of production. However, fine grain defects caused by stress loading, environmental influences, and construction defects is still a challenge to detect. In ...
- research-articleJanuary 2025
Neural network modelling of kinematic and dynamic features for signature verification
- Moises Diaz,
- Miguel A. Ferrer,
- Jose Juan Quintana,
- Adam Wolniakowski,
- Roman Trochimczuk,
- Kanstantsin Miatliuk,
- Giovanna Castellano,
- Gennaro Vessio
Pattern Recognition Letters (PTRL), Volume 187, Issue CPages 130–136https://doi.org/10.1016/j.patrec.2024.11.021AbstractOnline signature parameters, which are based on human characteristics, broaden the applicability of an automatic signature verifier. Although kinematic and dynamic features have previously been suggested, accurately measuring features such as arm ...
Graphical abstractDisplay Omitted
Highlights- We explore kinematic and dynamic features for online signature verification.
- A UR5 robotic arm is used to acquire these features from the MCYT330 dataset.
- A neural network estimates the kinematic and dynamic features of a ...
- research-articleJanuary 2025
Generation of super-resolution for medical image via a self-prior guided Mamba network with edge-aware constraint
Pattern Recognition Letters (PTRL), Volume 187, Issue CPages 93–99https://doi.org/10.1016/j.patrec.2024.11.020AbstractExisting deep learning-based super-resolution generation approaches usually depend on the backbone of convolutional neural networks (CNNs) or Transformers. CNN-based approaches are unable to model long-range dependencies, whereas Transformer-...
Highlights- We propose a Mamba-based U-Net network for medical image super-resolution generation.
- We develop the self-prior learning to mine the self-prior information of the image.
- We design improved 2D-Selective-Scan to adaptively fuse multi-...
- research-articleJanuary 2025
Incremental component tree contour computation
- Dennis J. Silva,
- Jiří Kosinka,
- Ronaldo F. Hashimoto,
- Jos B.T.M. Roerdink,
- Alexandre Morimitsu,
- Wonder A.L. Alves
Pattern Recognition Letters (PTRL), Volume 187, Issue CPages 115–121https://doi.org/10.1016/j.patrec.2024.11.019AbstractA component tree is a graph representation that encodes the connected components of the upper or lower level sets of a grayscale image. Consequently, the nodes of a component tree represent binary images of the encoded connected components. There ...
Graphical abstractDisplay Omitted
Highlights- Novel incremental contour computation algorithm for component trees.
- Contour computation for component tree time complexity analysis.
- Contour computation for component tree experimental runtime analysis.
- Comparison between ...
- research-articleJanuary 2025
IDA-NET: Individual Difference aware Medical Image Segmentation with Meta-Learning
Pattern Recognition Letters (PTRL), Volume 187, Issue CPages 21–27https://doi.org/10.1016/j.patrec.2024.11.012AbstractIndividual differences in organ size and spatial distribution can lead to significant variations in the content of medical images at similar anatomical locations. These case-level differences are distinct from the domain shift between multi-...
Highlights- A hybrid attention network is proposed to better model multiscale relationships.
- A meta-learning strategy is designed to achieve better segmentation generalization.
- An individual discriminator is designed to extract individual-...
-
- research-articleJanuary 2025
Neuromorphic face analysis: A survey
Pattern Recognition Letters (PTRL), Volume 187, Issue CPages 42–48https://doi.org/10.1016/j.patrec.2024.11.009AbstractNeuromorphic sensors, also known as event cameras, are a class of imaging devices mimicking the function of biological visual systems. Unlike traditional frame-based cameras, which capture fixed images at discrete intervals, neuromorphic sensors ...
Highlights- Neuromorphic sensors are becoming more common in human-face related vision tasks.
- As a new niche field it lacks overview papers helping researchers orient themselves.
- We compare the relevant literature to classical methods and ...
- research-articleJanuary 2025
A semi-supervised approach for breast tumor segmentation using sparse transformer attention UNet
Pattern Recognition Letters (PTRL), Volume 187, Issue CPages 63–72https://doi.org/10.1016/j.patrec.2024.11.008Highlights- Develop a semi-supervised learning framework for breast tumor segmentation, integrating a DIGN network, APMN network, and STA-UNet to leverage labeled and unlabeled data effectively.
- Enhance the STA-UNet encoder with a computationally ...
Accurate segmentation of breast tumors, especially in younger women, remains a significant challenge in cancer research. Ultrasound imaging, a non-invasive screening method, relies on tumor characteristics such as size and texture, which are ...
- research-articleJanuary 2025
MACT: Underwater image color correction via Minimally Attenuated Channel Transfer
Pattern Recognition Letters (PTRL), Volume 187, Issue CPages 28–34https://doi.org/10.1016/j.patrec.2024.11.007AbstractUnderwater images usually show reduced quality due to the underwater environment where light propagation is affected by scattering and absorption, severely limiting the effectiveness of underwater images in practical applications. To effectively ...
Highlights- We propose a double mean difference method for color compensation.
- We enhance color distribution by determining the extreme values of color channels.
- Our method can preprocess images with defogging.
- research-articleJanuary 2025
Segmentation of MRI tumors and pelvic anatomy via cGAN-synthesized data and attention-enhanced U-Net
Pattern Recognition Letters (PTRL), Volume 187, Issue CPages 100–106https://doi.org/10.1016/j.patrec.2024.11.003Highlights- Pioneering cGAN-based technique revolutionizes MRI tumor segmentation accuracy.
- Patch discriminator integration crafts ultra-realistic synthetic MRI datasets.
- Attention-augmented U-Net model dramatically boosts feature-focused ...
Accurate tumor segmentation within MRI images is of great importance for both diagnosis and treatment; however, in many cases, sufficient annotated datasets may not be available. This paper develops a novel approach to the medical image ...
- research-articleJanuary 2025
Improving ViT interpretability with patch-level mask prediction
Pattern Recognition Letters (PTRL), Volume 187, Issue CPages 73–79https://doi.org/10.1016/j.patrec.2024.11.018AbstractVision Transformers (ViTs) have demonstrated remarkable performances on various computer vision tasks. Attention scores are often used to explain the decision-making process of ViTs, showing which tokens are more important than others. However, ...
Highlights- We propose a novel visual explanation for ViT using a patch-level localization task.
- Our method enhances explainability and localization performance across benchmarks.
- Our method works with pseudo masks from self-supervised ...
- research-articleJanuary 2025
Multichannel image classification based on adaptive attribute profiles
Pattern Recognition Letters (PTRL), Volume 187, Issue CPages 107–114https://doi.org/10.1016/j.patrec.2024.11.015AbstractMorphological Attribute Profiles serve as powerful tools for extracting meaningful features from remote sensing data. The construction of Morphological Attribute Profiles relies on two primary parameters: the choice of attribute type and the ...
Highlights- Adaptive threshold adjustments enhance attribute profile features in remote sensing.
- A semi-automatic method for optimizing attribute profiles in remote sensing data analysis.
- Maximally Stable Extremal Regions theory applied to ...
- research-articleJanuary 2025
Sparse-attention augmented domain adaptation for unsupervised person re-identification
Pattern Recognition Letters (PTRL), Volume 187, Issue CPages 8–13https://doi.org/10.1016/j.patrec.2024.11.013AbstractThe domain gap persists as a demanding problem for unsupervised domain adaptive person re-identification (UDA re-ID). In response to this question, we present a novel Sparse self-Attention Augmented Domain Adaptation approach (SAADA Model) to ...
Highlights- This method addresses some shortcomings of existing domain adaptation methods.
- This method is a fine-grained solution with better flexibility.
- This approach shows that the sparse self-attention can improve the performance of domain ...
- research-articleJanuary 2025
Dehazing with all we have
Pattern Recognition Letters (PTRL), Volume 187, Issue CPages 122–129https://doi.org/10.1016/j.patrec.2024.11.011AbstractIn the near past, a large number of classical intuitively originated dehazing and image enhancing approaches have been worked out, and once played key roles in tremendous practical application scenes. Nevertheless, nowadays, the booming of deep ...
Highlights- Universally merging intuitive ideas into deep driven dehazing structure is plausible.
- An adaptive multi-source visual content enhancing framework is designed.
- Multiple structure complexity paths are worked out to fuse various ...
- research-articleJanuary 2025
GANzzle+ +: Generative approaches for jigsaw puzzle solving as local to global assignment in latent spatial representations
Pattern Recognition Letters (PTRL), Volume 187, Issue CPages 35–41https://doi.org/10.1016/j.patrec.2024.11.010AbstractJigsaw puzzles are a popular and enjoyable pastime that humans can easily solve, even with many pieces. However, solving a jigsaw is a combinatorial problem, and the space of possible solutions is exponential in the number of pieces, intractable ...
Highlights- We present new generative modules for estimating the jigsaw solution image.
- We show the effect of estimating the target image for placement of pieces.
- We evaluate on open datasets showing a large margin improvement.
- research-articleJanuary 2025
LED-Net: A lightweight edge detection network
Pattern Recognition Letters (PTRL), Volume 187, Issue CPages 56–62https://doi.org/10.1016/j.patrec.2024.11.006AbstractAs a fundamental task in computer vision, edge detection is becoming increasingly vital in many fields. Recently, large-parameter pre-training models have been used in edge detection tasks. However, significant computational resources are ...
Graphical abstractDisplay Omitted
Highlights- LED-Net improves the edge detection performance while reducing the computational complexity.
- Positional information has been introduced into the edge feature extraction process.
- Feature sample process becomes efficient and ...
- research-articleJanuary 2025
FM-detector: End-to-end flight maneuver recognition method based on flight data
Pattern Recognition Letters (PTRL), Volume 187, Issue CPages 1–7https://doi.org/10.1016/j.patrec.2024.11.005AbstractFlight maneuver recognition (FMR) refers to automatic recognizing a series of aircraft flight patterns and is a key technology for flight training evaluation. However, the traditional FMR methods generally have higher computational complexity and ...
Graphical abstractDisplay Omitted
Highlights- Designed an end-to-end detection framework for flight maneuver recognition (FMR).
- Designed a backbone network tailored for flight data.
- Improved the prediction accuracy of FMR significantly with much lower inference time.
- research-articleOctober 2024
Measuring student behavioral engagement using histogram of actions
Pattern Recognition Letters (PTRL), Volume 186, Issue CPages 337–344https://doi.org/10.1016/j.patrec.2024.11.002AbstractIn this work, we propose a novel method for assessing students’ behavioral engagement by representing student’s actions and their frequencies over an arbitrary time interval as a histogram of actions. This histogram and the student’s gaze are ...
Highlights- A student dataset is formed with annotated videos for students actions and engagement.
- PoseConv3D model is finetuned to detect student’s actions using upper body joints.
- A novel histogram-based approach is proposed to measure ...
- research-articleOctober 2024
MMIFR: Multi-modal industry focused data repository
Pattern Recognition Letters (PTRL), Volume 186, Issue CPages 306–313https://doi.org/10.1016/j.patrec.2024.11.001AbstractIn the rapidly advancing field of industrial automation, the availability of robust and diverse datasets is crucial for the development and evaluation of machine learning models. The data repository consists of four distinct versions of datasets: ...
Highlights- MMIFR is a multimodal dataset for industrial equipment related work.
- It includes four components: MMIFR-D, MMIFR-FS, MMIFR-OD, and MMIFR-P.
- Comparative analysis shows MMIFR’s advantages over other industrial datasets.
- Benchmark ...
- research-articleOctober 2024
On the effects of obfuscating speaker attributes in privacy-aware depression detection
Pattern Recognition Letters (PTRL), Volume 186, Issue CPages 300–305https://doi.org/10.1016/j.patrec.2024.10.016AbstractDetection of depressive symptoms from spoken content has emerged as an efficient Artificial Intelligence (AI) tool for diagnosing this serious mental health condition. Since speech is a highly sensitive form of data, privacy-enhancing measures ...
Highlights- First study on how obfuscating speaker attributes impacts depression detection.
- Investigates multimodal depression detection,prior work largely unimodal.
- Results on a clinically validated dataset,unlike prior works using self-...
- research-articleOctober 2024
LuminanceGAN: Controlling the brightness of generated images for various night conditions
Pattern Recognition Letters (PTRL), Volume 186, Issue CPages 292–299https://doi.org/10.1016/j.patrec.2024.10.014AbstractThere are diverse datasets available for training deep learning models utilized in autonomous driving. However, most of these datasets are composed of images obtained in day conditions, leading to a data imbalance issue when dealing with night ...
Graphical abstractDisplay Omitted
Highlights- Our novel Y-control loss is proposed to control the brightness degree of the generated night condition images.
- We reduced the artifacts in the generated night condition images by incorporating a self-attention module.
- Addressing ...