VISC: Vol 39, No 7

Volume 39, Issue 7Jul 2023

Volume 39, Issue 7

Jul 2023

Publisher:

Springer-Verlag
Berlin, Heidelberg

ISSN:0178-2789

Tags:

Bibliometrics

Select All

Export Citations Save to Binder

research-article

IPCS: An improved corner detector with intensity, pattern, curvature, and scale

Pages 2499–2513https://doi.org/10.1007/s00371-022-02474-6

Abstract

The corner detection plays an important role in the area of image processing and computer vision. The current corner detection methods often utilize few cues or single model to improve the detection correctness and repeatability. A composite model ...

research-article

A coarse-to-fine ghost removal scheme for HDR imaging

Pages 2515–2528https://doi.org/10.1007/s00371-022-02475-5

Abstract

Ghost removal in high dynamic range imaging is a challenging problem especially when relative camera or object motion exists. To solve the problem, an effective coarse-to-fine deghosting method combining registration and matching based on ...

research-article

Parameter-adaptive multi-frame joint pose optimization method

Pages 2529–2541https://doi.org/10.1007/s00371-022-02476-4

Abstract

Camera pose optimization is the basis of geometric vision works, such as 3D reconstruction, structure from motion, and visual odometry. We designed a multi-frame pose optimization method based on the inverse compositional algorithm. The neural ...

research-article

HSNet: hierarchical semantics network for scene parsing

Pages 2543–2554https://doi.org/10.1007/s00371-022-02477-3

Abstract

Scene parsing is one of the fundamental tasks in computer vision. Humans tend to perceive a scene in a hierarchical manner, i.e., first identifying the coarse category (e.g., vehicle) of a group of objects and then the fine category (e.g., bicycle,...

research-article

GlcMatch: global and local constraints for reliable feature matching

Pages 2555–2570https://doi.org/10.1007/s00371-022-02478-2

Abstract

A match is considered as an incorrect match when the matched features in two views do not correspond to the same physical location. It is inevitable that generates mismatches at a local descriptor level. Differentiating true and false matches ...

research-article

Real-time and on-line removal of moving human figures in hand-held mobile augmented reality

Pages 2571–2582https://doi.org/10.1007/s00371-022-02479-1

Abstract

In this paper, we present a real time on-line augmented/diminished reality system that runs entirely on the hand-held moving mobile device. Specifically, we introduce an improved inpainting algorithm that is designed for the on-line usage (i.e., ...

research-article

VTNCT: an image-based virtual try-on network by combining feature with pixel transformation

Pages 2583–2596https://doi.org/10.1007/s00371-022-02480-8

Abstract

Image-based virtual try-on tasks with the goal of transferring a target clothing item onto the corresponding region of a person have attracted increasing research attention recently. However, most of the existing image-based virtual try-on methods ...

research-article

Inverse transformation sampling-based attentive cutout for fine-grained visual recognition

Pages 2597–2608https://doi.org/10.1007/s00371-022-02481-7

Abstract

Recent works on fine-grained visual categorization rely on detecting discriminative regions that correspond to specific visual patterns. Promising progress has been obtained by constructing complicated network architecture, which either involves ...

review-article

Suspect face retrieval using visual and linguistic information

Pages 2609–2635https://doi.org/10.1007/s00371-022-02482-6

Abstract

Faces are the most common biometric used for the identification of a person. Law enforcement agencies use face as a key point to identify the suspect involved in unlawful activities. Forensic sketches are normally developed by the sketch artist ...

research-article

Soft thresholding squeeze-and-excitation network for pose-invariant facial expression recognition

Pages 2637–2652https://doi.org/10.1007/s00371-022-02483-5

Abstract

Pose-invariant facial expression recognition is one of the popular research directions within the field of computer vision, but pose variant usually change the facial appearance significantly, making the recognition results unstable from different ...

research-article

Point spread function estimation for blind image deblurring problems based on framelet transform

Reza Parvaz

Pages 2653–2669https://doi.org/10.1007/s00371-022-02484-4

Abstract

One of the most important issues in image processing is the approximation of the image that has been lost due to the blurring process. These types of matters are divided into non-blind and blind problems. The second type of problem is more complex ... $_{}_{}$

research-article

CCST: crowd counting with swin transformer

Pages 2671–2682https://doi.org/10.1007/s00371-022-02485-3

Abstract

Accurately estimating the number of individuals contained in an image is the purpose of the crowd counting. It has always faced two major difficulties: uneven distribution of crowd density and large span of head size. Focusing on the former, most ...

research-article

Analysis of seam carving technique: limitations, improvements and possible solutions

Pages 2683–2709https://doi.org/10.1007/s00371-022-02486-2

Abstract

Nowadays, many efficient content-aware image resizing techniques are being used to safeguard the prominent regions of the image so that aesthetically pleasing retargeting results can be generated. In this paper, firstly various energy map ...

research-article

Real-time tunnel projection from a moving subway train

Pages 2711–2724https://doi.org/10.1007/s00371-022-02487-1

Abstract

In this study, we present the first actual working system that can project content onto a tunnel wall from a moving subway train so that passengers can enjoy the display of digital content through a train window. Our stand-alone system can be ...

research-article

NIR/RGB image fusion for scene classification using deep neural networks

Pages 2725–2739https://doi.org/10.1007/s00371-022-02488-0

Abstract

Near-infrared (NIR) imaging can add very useful data to many visible range image processing applications. In this paper, new fusion techniques are proposed to benefit from the data of both NIR/RGB sensors for the application of scene recognition ...

research-article

Retinopathy grading with deep learning and wavelet hyper-analytic activations

Pages 2741–2756https://doi.org/10.1007/s00371-022-02489-z

Abstract

Recent developments reveal the prominence of Diabetic Retinopathy (DR) grading. In the past few decades, Wavelet-based DR classification has shown successful impacts and the Deep Learning models, like Convolutional Neural Networks (CNN’s), have ...

research-article

Particle filter-based video object tracking using feature fusion in template partitions

Pages 2757–2779https://doi.org/10.1007/s00371-022-02490-6

Abstract

Moving object tracking is one of the key issues in the domain of computer vision. A variety of challenges are posed while tracking the object in the real-world scenario. In this paper, we have proposed a particle filtering-based algorithm to track ...

research-article

A multimodal transformer to fuse images and metadata for skin disease classification

Pages 2781–2793https://doi.org/10.1007/s00371-022-02492-4

Abstract

Skin disease cases are rising in prevalence, and the diagnosis of skin diseases is always a challenging task in the clinic. Utilizing deep learning to diagnose skin diseases could help to meet these challenges. In this study, a novel neural ...

research-article

Visibility restoration of haze and dust image using color correction and composite channel prior

Pages 2795–2809https://doi.org/10.1007/s00371-022-02493-3

Abstract

Visibility restoration of images under haze and dust weather is essential in computer vision tasks. In this work, an algorithm for image visibility restoration based on color correction and composite channel prior (CCP) is proposed. First, the ...

research-article

OneSketch: learning high-level shape features from simple sketches

Pages 2811–2822https://doi.org/10.1007/s00371-022-02494-2

Abstract

Humans use simple sketches to convey complex concepts and abstract ideas in a concise way. Just a few abstract pencil strokes can carry a large amount of semantic information that can be used as meaningful representation for many applications. In ...

research-article

Temporal action localization using gated recurrent units

Pages 2823–2834https://doi.org/10.1007/s00371-022-02495-1

Abstract

Temporal action localization (TAL) task which is to predict the start and end of each action in a video along with the class label of the action has numerous applications in the real world. But due to the complexity of this task, acceptable ...

research-article

Improving virtual pipes model of hydraulic and thermal erosion with vegetation considerations

Pages 2835–2846https://doi.org/10.1007/s00371-022-02496-0

Abstract

Current research in real-time water simulation can also calculate hydraulic erosion to a landscape; however, vegetation, which carries one of the biggest impacts on hydraulic erosion, is often not considered. We proposed an improvement upon the ...

research-article

Entanglement inspired approach for determining the preeminent arrangement of static cameras in a multi-view computer vision system

Pages 2847–2863https://doi.org/10.1007/s00371-022-02497-z

Abstract

This paper is on the concept of quantum steering and quantum entanglement of two observers. The concept is applied to a multi-view computer vision system that incorporates two cameras. Three separate multi-view static camera setups are used to ...

research-article

Material-aware Cross-channel Interaction Attention (MCIA) for occluded prohibited item detection

Pages 2865–2877https://doi.org/10.1007/s00371-022-02498-y

Abstract

For security inspection, detecting prohibited items in X-ray images is challenging since they are usually occluded by non-prohibited items. In X-ray images, different materials present different colors and textures. On this basis, we exploit the ...

correction

Correction: Material-aware Cross-channel Interaction Attention (MCIA) for occluded prohibited item detection

Page 2879https://doi.org/10.1007/s00371-022-02529-8

research-article

Hypergraph attentional convolutional neural network for salient object detection

Pages 2881–2907https://doi.org/10.1007/s00371-022-02499-x

Abstract

Learning discriminative features and mining salient visual patterns play an important role in salient object detection (SOD) task. Existing SOD methods suffer from limited receptive field and insufficient cross-level feature mining. To this end, ...

research-article

Handwritten Arabic and Roman word recognition using holistic approach

Pages 2909–2932https://doi.org/10.1007/s00371-022-02500-7

Abstract

The research community considers handwritten word recognition (HWR) as an open research problem to date. The reasons behind this are variations in intra-/interpersonal writing style, overlapping and/or touching characters in a word, degraded ...

research-article

A two-stage image process for water level recognition via dual-attention CornerNet and CTransformer

Pages 2933–2952https://doi.org/10.1007/s00371-022-02501-6

Abstract

Image processing-based water level detectors have promising practical application value in intelligent agriculture and early water logging alerts. However, water level recognition based on image processing faces illumination, shooting angle, and ...

research-article

PTCERE: personality-trait mapping using cognitive-based emotion recognition from electroencephalogram signals

Pages 2953–2967https://doi.org/10.1007/s00371-022-02502-5

Abstract

Human emotion recognition is a technique for identifying human emotions with respect to various aspects of human life, such as in decision-making, detecting lies, assessing social behaviour, measuring brain-related activity and identifying the ...

research-article

MFANet: Multi-scale feature fusion network with attention mechanism

Pages 2969–2980https://doi.org/10.1007/s00371-022-02503-4

Abstract

In order to improve the detection accuracy of the network, it proposes multi-scale feature fusion and attention mechanism net (MFANet) based on deep learning, which integrates pyramid module and channel attention mechanism effectively. Pyramid ...

The Visual Computer: International Journal of Computer Graphics

Sections

IPCS: An improved corner detector with intensity, pattern, curvature, and scale

A coarse-to-fine ghost removal scheme for HDR imaging

Parameter-adaptive multi-frame joint pose optimization method

HSNet: hierarchical semantics network for scene parsing

GlcMatch: global and local constraints for reliable feature matching

Real-time and on-line removal of moving human figures in hand-held mobile augmented reality

VTNCT: an image-based virtual try-on network by combining feature with pixel transformation

Inverse transformation sampling-based attentive cutout for fine-grained visual recognition

Suspect face retrieval using visual and linguistic information

Soft thresholding squeeze-and-excitation network for pose-invariant facial expression recognition

Point spread function estimation for blind image deblurring problems based on framelet transform

CCST: crowd counting with swin transformer

Analysis of seam carving technique: limitations, improvements and possible solutions

Real-time tunnel projection from a moving subway train

NIR/RGB image fusion for scene classification using deep neural networks

Retinopathy grading with deep learning and wavelet hyper-analytic activations

Particle filter-based video object tracking using feature fusion in template partitions

A multimodal transformer to fuse images and metadata for skin disease classification

Visibility restoration of haze and dust image using color correction and composite channel prior

OneSketch: learning high-level shape features from simple sketches

Temporal action localization using gated recurrent units

Improving virtual pipes model of hydraulic and thermal erosion with vegetation considerations

Entanglement inspired approach for determining the preeminent arrangement of static cameras in a multi-view computer vision system

Material-aware Cross-channel Interaction Attention (MCIA) for occluded prohibited item detection

Correction: Material-aware Cross-channel Interaction Attention (MCIA) for occluded prohibited item detection

Hypergraph attentional convolutional neural network for salient object detection

Handwritten Arabic and Roman word recognition using holistic approach

A two-stage image process for water level recognition via dual-attention CornerNet and CTransformer

PTCERE: personality-trait mapping using cognitive-based emotion recognition from electroencephalogram signals

MFANet: Multi-scale feature fusion network with attention mechanism

Sections

Save to Binder

Comments