IPCS: An improved corner detector with intensity, pattern, curvature, and scale
The corner detection plays an important role in the area of image processing and computer vision. The current corner detection methods often utilize few cues or single model to improve the detection correctness and repeatability. A composite model ...
A coarse-to-fine ghost removal scheme for HDR imaging
Ghost removal in high dynamic range imaging is a challenging problem especially when relative camera or object motion exists. To solve the problem, an effective coarse-to-fine deghosting method combining registration and matching based on ...
Parameter-adaptive multi-frame joint pose optimization method
Camera pose optimization is the basis of geometric vision works, such as 3D reconstruction, structure from motion, and visual odometry. We designed a multi-frame pose optimization method based on the inverse compositional algorithm. The neural ...
HSNet: hierarchical semantics network for scene parsing
Scene parsing is one of the fundamental tasks in computer vision. Humans tend to perceive a scene in a hierarchical manner, i.e., first identifying the coarse category (e.g., vehicle) of a group of objects and then the fine category (e.g., bicycle,...
GlcMatch: global and local constraints for reliable feature matching
A match is considered as an incorrect match when the matched features in two views do not correspond to the same physical location. It is inevitable that generates mismatches at a local descriptor level. Differentiating true and false matches ...
Real-time and on-line removal of moving human figures in hand-held mobile augmented reality
In this paper, we present a real time on-line augmented/diminished reality system that runs entirely on the hand-held moving mobile device. Specifically, we introduce an improved inpainting algorithm that is designed for the on-line usage (i.e., ...
VTNCT: an image-based virtual try-on network by combining feature with pixel transformation
Image-based virtual try-on tasks with the goal of transferring a target clothing item onto the corresponding region of a person have attracted increasing research attention recently. However, most of the existing image-based virtual try-on methods ...
Inverse transformation sampling-based attentive cutout for fine-grained visual recognition
Recent works on fine-grained visual categorization rely on detecting discriminative regions that correspond to specific visual patterns. Promising progress has been obtained by constructing complicated network architecture, which either involves ...
Suspect face retrieval using visual and linguistic information
Faces are the most common biometric used for the identification of a person. Law enforcement agencies use face as a key point to identify the suspect involved in unlawful activities. Forensic sketches are normally developed by the sketch artist ...
Soft thresholding squeeze-and-excitation network for pose-invariant facial expression recognition
Pose-invariant facial expression recognition is one of the popular research directions within the field of computer vision, but pose variant usually change the facial appearance significantly, making the recognition results unstable from different ...
CCST: crowd counting with swin transformer
Accurately estimating the number of individuals contained in an image is the purpose of the crowd counting. It has always faced two major difficulties: uneven distribution of crowd density and large span of head size. Focusing on the former, most ...
Analysis of seam carving technique: limitations, improvements and possible solutions
Nowadays, many efficient content-aware image resizing techniques are being used to safeguard the prominent regions of the image so that aesthetically pleasing retargeting results can be generated. In this paper, firstly various energy map ...
Real-time tunnel projection from a moving subway train
In this study, we present the first actual working system that can project content onto a tunnel wall from a moving subway train so that passengers can enjoy the display of digital content through a train window. Our stand-alone system can be ...
NIR/RGB image fusion for scene classification using deep neural networks
Near-infrared (NIR) imaging can add very useful data to many visible range image processing applications. In this paper, new fusion techniques are proposed to benefit from the data of both NIR/RGB sensors for the application of scene recognition ...
Retinopathy grading with deep learning and wavelet hyper-analytic activations
Recent developments reveal the prominence of Diabetic Retinopathy (DR) grading. In the past few decades, Wavelet-based DR classification has shown successful impacts and the Deep Learning models, like Convolutional Neural Networks (CNN’s), have ...
Particle filter-based video object tracking using feature fusion in template partitions
Moving object tracking is one of the key issues in the domain of computer vision. A variety of challenges are posed while tracking the object in the real-world scenario. In this paper, we have proposed a particle filtering-based algorithm to track ...
A multimodal transformer to fuse images and metadata for skin disease classification
Skin disease cases are rising in prevalence, and the diagnosis of skin diseases is always a challenging task in the clinic. Utilizing deep learning to diagnose skin diseases could help to meet these challenges. In this study, a novel neural ...
Visibility restoration of haze and dust image using color correction and composite channel prior
Visibility restoration of images under haze and dust weather is essential in computer vision tasks. In this work, an algorithm for image visibility restoration based on color correction and composite channel prior (CCP) is proposed. First, the ...
OneSketch: learning high-level shape features from simple sketches
Humans use simple sketches to convey complex concepts and abstract ideas in a concise way. Just a few abstract pencil strokes can carry a large amount of semantic information that can be used as meaningful representation for many applications. In ...
Temporal action localization using gated recurrent units
Temporal action localization (TAL) task which is to predict the start and end of each action in a video along with the class label of the action has numerous applications in the real world. But due to the complexity of this task, acceptable ...
Improving virtual pipes model of hydraulic and thermal erosion with vegetation considerations
Current research in real-time water simulation can also calculate hydraulic erosion to a landscape; however, vegetation, which carries one of the biggest impacts on hydraulic erosion, is often not considered. We proposed an improvement upon the ...
Entanglement inspired approach for determining the preeminent arrangement of static cameras in a multi-view computer vision system
This paper is on the concept of quantum steering and quantum entanglement of two observers. The concept is applied to a multi-view computer vision system that incorporates two cameras. Three separate multi-view static camera setups are used to ...
Material-aware Cross-channel Interaction Attention (MCIA) for occluded prohibited item detection
For security inspection, detecting prohibited items in X-ray images is challenging since they are usually occluded by non-prohibited items. In X-ray images, different materials present different colors and textures. On this basis, we exploit the ...
Hypergraph attentional convolutional neural network for salient object detection
Learning discriminative features and mining salient visual patterns play an important role in salient object detection (SOD) task. Existing SOD methods suffer from limited receptive field and insufficient cross-level feature mining. To this end, ...
Handwritten Arabic and Roman word recognition using holistic approach
The research community considers handwritten word recognition (HWR) as an open research problem to date. The reasons behind this are variations in intra-/interpersonal writing style, overlapping and/or touching characters in a word, degraded ...
A two-stage image process for water level recognition via dual-attention CornerNet and CTransformer
Image processing-based water level detectors have promising practical application value in intelligent agriculture and early water logging alerts. However, water level recognition based on image processing faces illumination, shooting angle, and ...
PTCERE: personality-trait mapping using cognitive-based emotion recognition from electroencephalogram signals
Human emotion recognition is a technique for identifying human emotions with respect to various aspects of human life, such as in decision-making, detecting lies, assessing social behaviour, measuring brain-related activity and identifying the ...
MFANet: Multi-scale feature fusion network with attention mechanism
In order to improve the detection accuracy of the network, it proposes multi-scale feature fusion and attention mechanism net (MFANet) based on deep learning, which integrates pyramid module and channel attention mechanism effectively. Pyramid ...