Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
Reflects downloads up to 16 Oct 2024Bibliometrics
research-article
Model-Free Tracker for Multiple Objects Using Joint Appearance and Motion Inference

Model-free tracking is a widely accepted approach to track an arbitrary object in a video using a single frame annotation with no further prior knowledge about the object of interest. Extending this problem to track multiple objects is really challenging ...

research-article
Advanced 3D Motion Prediction for Video-Based Dynamic Point Cloud Compression

Point cloud-based immersive media representation format has provided many opportunities for extended reality applications and has become widely used in volumetric content capturing scenarios. The high data rate of the point cloud is one of the key ...

research-article
Deep Neural Network Regression for Automated Retinal Layer Segmentation in Optical Coherence Tomography Images

Segmenting the retinal layers in optical coherence tomography (OCT) images helps to quantify the layer information in early diagnosis of retinal diseases, which are the main cause of permanent blindness. Thus, the segmentation process plays a critical ...

research-article
Geometry Coding for Dynamic Voxelized Point Clouds Using Octrees and Multiple Contexts

We present a method to compress geometry information of point clouds that explores redundancies across consecutive frames of a sequence. It uses octrees and works by progressively increasing the resolution of the octree. At each branch of the tree, we ...

research-article
HA-CCN: Hierarchical Attention-Based Crowd Counting Network

Single image-based crowd counting has recently witnessed increased focus, but many leading methods are far from optimal, especially in highly congested scenes. In this paper, we present the Hierarchical Attention-based Crowd Counting Network (HA-CCN) that ...

research-article
Morphology-Based Noise Reduction: Structural Variation and Thresholding in the Bitonic Filter

The bitonic filter was recently developed to embody the novel concept of signal bitonicity (one local extremum within a set range) to differentiate from noise, by use of data ranking and linear operators. For processing images, the spatial extent was ...

research-article
Inpainting Versus Denoising for Dose Reduction in Scanning-Beam Microscopies

We consider sampling strategies for reducing the radiation dose during image acquisition in scanning-beam microscopies, such as SEM, STEM, and STXM. Our basic assumption is that we may acquire subsampled image data (with some pixels missing) and then ...

research-article
Deep Salient Object Detection With Contextual Information Guidance

Integration of multi-level contextual information, such as feature maps and side outputs, is crucial for Convolutional Neural Networks (CNNs)-based salient object detection. However, most existing methods either simply concatenate multi-level feature maps ...

research-article
Image Compressed Sensing Using Convolutional Neural Network

In the study of compressed sensing (CS), the two main challenges are the design of sampling matrix and the development of reconstruction method. On the one hand, the usually used random sampling matrices (e.g., GRM) are signal independent, which ignore ...

research-article
Exemplar-Based Recursive Instance Segmentation With Application to Plant Image Analysis

Instance segmentation is a challenging computer vision problem which lies at the intersection of object detection and semantic segmentation. Motivated by plant image analysis in the context of plant phenotyping, a recently emerging application field of ...

research-article
Supervised Deep Sparse Coding Networks for Image Classification

In this paper, we propose a novel deep sparse coding network (SCN) capable of efficiently adapting its own regularization parameters for a given application. The network is trained end-to-end with a supervised task-driven learning algorithm via error ...

research-article
Graph Transform Optimization With Application to Image Compression

In this paper, we propose a new graph-based transform and illustrate its potential application to signal compression. Our approach relies on the careful design of a graph that optimizes the overall rate-distortion performance through an effective graph-...

research-article
Wavelet-Based Spectral–Spatial Transforms for CFA-Sampled Raw Camera Image Compression

Spectral–spatial transforms (SSTs) change a raw camera image captured using a color filter array (CFA-sampled image) from an RGB color space composed of red, green, and blue components into a decorrelated color space, such as YDgCbCr or YDgCoCg ...

research-article
Enhanced Fuzzy-Based Local Information Algorithm for Sonar Image Segmentation

The recent boost in undersea operations has led to the development of high-resolution sonar systems mounted on autonomous vehicles. These vehicles are used to scan the seafloor in search of different objects such as sunken ships, archaeological sites, and ...

research-article
High-Resolution Encoder–Decoder Networks for Low-Contrast Medical Image Segmentation

Automatic image segmentation is an essential step for many medical image analysis applications, include computer-aided radiation therapy, disease diagnosis, and treatment effect evaluation. One of the major challenges for this task is the blurry nature of ...

research-article
Learning Rich Part Hierarchies With Progressive Attention Networks for Fine-Grained Image Recognition

We investigate the localization of subtle yet discriminative parts for fine-grained image recognition. Based on the observation that such parts typically exist within a hierarchical structure (e.g., from a coarse-scale “head” to a fine-scale ...

research-article
Holistic Multi-Modal Memory Network for Movie Question Answering

Answering questions using multi-modal context is a challenging problem, as it requires a deep integration of diverse data sources. Existing approaches only consider a subset of all possible interactions among data sources during one attention hop. In this ...

research-article
Weighted Guided Image Filtering With Steering Kernel

Due to its local property, guided image filter (GIF) generally suffers from halo artifacts near edges. To make up for the deficiency, a weighted guided image filter (WGIF) was proposed recently by incorporating an edge-aware weighting into the filtering ...

research-article
Sparse Representation-Based Video Quality Assessment for Synthesized 3D Videos

The temporal flicker distortion is one of the most annoying noises in synthesized virtual view videos when they are rendered by compressed multi-view video plus depth in Three Dimensional (3D) video system. To assess the synthesized view video quality and ...

research-article
Image Representations With Spatial Object-to-Object Relations for RGB-D Scene Recognition

Scene recognition is challenging due to the intra-class diversity and inter-class similarity. Previous works recognize scenes either with global representations or with the intermediate representations of objects. In contrast, we investigate more ...

research-article
Semi-Supervised Deep Coupled Ensemble Learning With Classification Landmark Exploration

Using an ensemble of neural networks with consistency regularization is effective for improving performance and stability of deep learning, compared to the case of a single network. In this paper, we present a semi-supervised Deep Coupled Ensemble (DCE) ...

research-article
Summit Navigator: A Novel Approach for Local Maxima Extraction

This paper presents a novel method, called the Summit Navigator, to effectively extract local maxima of an image histogram for multi-object segmentation of images. After smoothing with a moving average filter, the obtained histogram is analyzed, based on ...

research-article
Hyperspectral Image Denoising via Matrix Factorization and Deep Prior Regularization

Deep learning has been successfully introduced for 2D-image denoising, but it is still unsatisfactory for hyperspectral image (HSI) denoising due to the unacceptable computational complexity of the end-to-end training process and the difficulty of ...

research-article
Learning Modality-Specific Representations for Visible-Infrared Person Re-Identification

Traditional person re-identification (re-id) methods perform poorly under changing illuminations. This situation can be addressed by using dual-cameras that capture visible images in a bright environment and infrared images in a dark environment. Yet, ...

research-article
Unambiguous Scene Text Segmentation With Referring Expression Comprehension

Text instance provides valuable information for the understanding and interpretation of natural scenes. The rich precise high-level semantics embodied in the text could be beneficial for understanding the world around us, and empower a wide range of real-...

research-article
Geometry-Aware Graph Transforms for Light Field Compact Representation

This paper addresses the problem of energy compaction of dense 4D light fields by designing geometry-aware local graph-based transforms. Local graphs are constructed on super-rays that can be seen as a grouping of spatially and geometry-dependent ...

research-article
Multi-View Image Classification With Visual, Semantic and View Consistency

Multi-view visual classification methods have been widely applied to use discriminative information of different views. This strategy has been proven very effective by many researchers. On the one hand, images are often treated independently without fully ...

research-article
RYF-Net: Deep Fusion Network for Single Image Haze Removal

Haze removal from a single image is a challenging task. Estimation of accurate scene transmission map (TrMap) is the key to reconstruct the haze-free scene. In this paper, we propose a convolutional neural network based architecture to estimate the TrMap ...

research-article
Deep Learning-Based Picture-Wise Just Noticeable Distortion Prediction Model for Image Compression

Picture Wise Just Noticeable Difference (PW-JND), which accounts for the minimum difference of a picture that human visual system can perceive, can be widely used in perception-oriented image and video processing. However, the conventional Just Noticeable ...

research-article
Multi-Task Deep Relative Attribute Learning for Visual Urban Perception

Visual urban perception aims to quantify perceptual attributes (e.g., safe and depressing attributes) of physical urban environment from crowd-sourced street-view images and their pairwise comparisons. It has been receiving more and more attention in ...

Comments