TIP: Vol 29, No

Volume 292020

Volume 29

2020

Publisher:

IEEE Press

ISSN:1057-7149

Tags:

Bibliometrics

Select All

Export Citations Save to Binder

research-article

Model-Free Tracker for Multiple Objects Using Joint Appearance and Motion Inference

Pages 277–288https://doi.org/10.1109/TIP.2019.2928123

Model-free tracking is a widely accepted approach to track an arbitrary object in a video using a single frame annotation with no further prior knowledge about the object of interest. Extending this problem to track multiple objects is really challenging ...

research-article

Advanced 3D Motion Prediction for Video-Based Dynamic Point Cloud Compression

Pages 289–302https://doi.org/10.1109/TIP.2019.2931621

Point cloud-based immersive media representation format has provided many opportunities for extended reality applications and has become widely used in volumetric content capturing scenarios. The high data rate of the point cloud is one of the key ...

research-article

Deep Neural Network Regression for Automated Retinal Layer Segmentation in Optical Coherence Tomography Images

Pages 303–312https://doi.org/10.1109/TIP.2019.2931461

Segmenting the retinal layers in optical coherence tomography (OCT) images helps to quantify the layer information in early diagnosis of retinal diseases, which are the main cause of permanent blindness. Thus, the segmentation process plays a critical ...

research-article

Geometry Coding for Dynamic Voxelized Point Clouds Using Octrees and Multiple Contexts

Pages 313–322https://doi.org/10.1109/TIP.2019.2931466

We present a method to compress geometry information of point clouds that explores redundancies across consecutive frames of a sequence. It uses octrees and works by progressively increasing the resolution of the octree. At each branch of the tree, we ...

research-article

HA-CCN: Hierarchical Attention-Based Crowd Counting Network

Pages 323–335https://doi.org/10.1109/TIP.2019.2928634

Single image-based crowd counting has recently witnessed increased focus, but many leading methods are far from optimal, especially in highly congested scenes. In this paper, we present the Hierarchical Attention-based Crowd Counting Network (HA-CCN) that ...

research-article

Morphology-Based Noise Reduction: Structural Variation and Thresholding in the Bitonic Filter

Graham Treece

Pages 336–350https://doi.org/10.1109/TIP.2019.2932572

The bitonic filter was recently developed to embody the novel concept of signal bitonicity (one local extremum within a set range) to differentiate from noise, by use of data ranking and linear operators. For processing images, the spatial extent was ...

research-article

Inpainting Versus Denoising for Dose Reduction in Scanning-Beam Microscopies

Pages 351–359https://doi.org/10.1109/TIP.2019.2928133

We consider sampling strategies for reducing the radiation dose during image acquisition in scanning-beam microscopies, such as SEM, STEM, and STXM. Our basic assumption is that we may acquire subsampled image data (with some pixels missing) and then ...

research-article

Deep Salient Object Detection With Contextual Information Guidance

Pages 360–374https://doi.org/10.1109/TIP.2019.2930906

Integration of multi-level contextual information, such as feature maps and side outputs, is crucial for Convolutional Neural Networks (CNNs)-based salient object detection. However, most existing methods either simply concatenate multi-level feature maps ...

research-article

Image Compressed Sensing Using Convolutional Neural Network

Pages 375–388https://doi.org/10.1109/TIP.2019.2928136

In the study of compressed sensing (CS), the two main challenges are the design of sampling matrix and the development of reconstruction method. On the one hand, the usually used random sampling matrices (e.g., GRM) are signal independent, which ignore ...

research-article

Exemplar-Based Recursive Instance Segmentation With Application to Plant Image Analysis

Pages 389–404https://doi.org/10.1109/TIP.2019.2923571

Instance segmentation is a challenging computer vision problem which lies at the intersection of object detection and semantic segmentation. Motivated by plant image analysis in the context of plant phenotyping, a recently emerging application field of ...

research-article

Supervised Deep Sparse Coding Networks for Image Classification

Pages 405–418https://doi.org/10.1109/TIP.2019.2928121

In this paper, we propose a novel deep sparse coding network (SCN) capable of efficiently adapting its own regularization parameters for a given application. The network is trained end-to-end with a supervised task-driven learning algorithm via error ...

research-article

Graph Transform Optimization With Application to Image Compression

Pages 419–432https://doi.org/10.1109/TIP.2019.2932853

In this paper, we propose a new graph-based transform and illustrate its potential application to signal compression. Our approach relies on the careful design of a graph that optimizes the overall rate-distortion performance through an effective graph-...

research-article

Wavelet-Based Spectral–Spatial Transforms for CFA-Sampled Raw Camera Image Compression

Taizo Suzuki

Pages 433–444https://doi.org/10.1109/TIP.2019.2928124

Spectral–spatial transforms (SSTs) change a raw camera image captured using a color filter array (CFA-sampled image) from an RGB color space composed of red, green, and blue components into a decorrelated color space, such as YDgCbCr or YDgCoCg ...

research-article

Enhanced Fuzzy-Based Local Information Algorithm for Sonar Image Segmentation

Pages 445–460https://doi.org/10.1109/TIP.2019.2930148

The recent boost in undersea operations has led to the development of high-resolution sonar systems mounted on autonomous vehicles. These vehicles are used to scan the seafloor in search of different objects such as sunken ships, archaeological sites, and ...

research-article

High-Resolution Encoder–Decoder Networks for Low-Contrast Medical Image Segmentation

Pages 461–475https://doi.org/10.1109/TIP.2019.2919937

Automatic image segmentation is an essential step for many medical image analysis applications, include computer-aided radiation therapy, disease diagnosis, and treatment effect evaluation. One of the major challenges for this task is the blurry nature of ...

research-article

Learning Rich Part Hierarchies With Progressive Attention Networks for Fine-Grained Image Recognition

Pages 476–488https://doi.org/10.1109/TIP.2019.2921876

We investigate the localization of subtle yet discriminative parts for fine-grained image recognition. Based on the observation that such parts typically exist within a hierarchical structure (e.g., from a coarse-scale “head” to a fine-scale ...

research-article

Holistic Multi-Modal Memory Network for Movie Question Answering

Pages 489–499https://doi.org/10.1109/TIP.2019.2931534

Answering questions using multi-modal context is a challenging problem, as it requires a deep integration of diverse data sources. Existing approaches only consider a subset of all possible interactions among data sources during one attention hop. In this ...

research-article

Weighted Guided Image Filtering With Steering Kernel

Pages 500–508https://doi.org/10.1109/TIP.2019.2928631

Due to its local property, guided image filter (GIF) generally suffers from halo artifacts near edges. To make up for the deficiency, a weighted guided image filter (WGIF) was proposed recently by incorporating an edge-aware weighting into the filtering ...

research-article

Sparse Representation-Based Video Quality Assessment for Synthesized 3D Videos

Pages 509–524https://doi.org/10.1109/TIP.2019.2929433

The temporal flicker distortion is one of the most annoying noises in synthesized virtual view videos when they are rendered by compressed multi-view video plus depth in Three Dimensional (3D) video system. To assess the synthesized view video quality and ...

research-article

Image Representations With Spatial Object-to-Object Relations for RGB-D Scene Recognition

Pages 525–537https://doi.org/10.1109/TIP.2019.2933728

Scene recognition is challenging due to the intra-class diversity and inter-class similarity. Previous works recognize scenes either with global representations or with the intermediate representations of objects. In contrast, we investigate more ...

research-article

Semi-Supervised Deep Coupled Ensemble Learning With Classification Landmark Exploration

Pages 538–550https://doi.org/10.1109/TIP.2019.2933724

Using an ensemble of neural networks with consistency regularization is effective for improving performance and stability of deep learning, compared to the case of a single network. In this paper, we present a semi-supervised Deep Coupled Ensemble (DCE) ...

research-article

Summit Navigator: A Novel Approach for Local Maxima Extraction

Pages 551–564https://doi.org/10.1109/TIP.2019.2932501

This paper presents a novel method, called the Summit Navigator, to effectively extract local maxima of an image histogram for multi-object segmentation of images. After smoothing with a moving average filter, the obtained histogram is analyzed, based on ...

research-article

Hyperspectral Image Denoising via Matrix Factorization and Deep Prior Regularization

Pages 565–578https://doi.org/10.1109/TIP.2019.2928627

Deep learning has been successfully introduced for 2D-image denoising, but it is still unsatisfactory for hyperspectral image (HSI) denoising due to the unacceptable computational complexity of the end-to-end training process and the difficulty of ...

research-article

Learning Modality-Specific Representations for Visible-Infrared Person Re-Identification

Pages 579–590https://doi.org/10.1109/TIP.2019.2928126

Traditional person re-identification (re-id) methods perform poorly under changing illuminations. This situation can be addressed by using dual-cameras that capture visible images in a bright environment and infrared images in a dark environment. Yet, ...

research-article

Unambiguous Scene Text Segmentation With Referring Expression Comprehension

Pages 591–601https://doi.org/10.1109/TIP.2019.2930176

Text instance provides valuable information for the understanding and interpretation of natural scenes. The rich precise high-level semantics embodied in the text could be beneficial for understanding the world around us, and empower a wide range of real-...

research-article

Geometry-Aware Graph Transforms for Light Field Compact Representation

Pages 602–616https://doi.org/10.1109/TIP.2019.2928873

This paper addresses the problem of energy compaction of dense 4D light fields by designing geometry-aware local graph-based transforms. Local graphs are constructed on super-rays that can be seen as a grouping of spatially and geometry-dependent ...

research-article

Multi-View Image Classification With Visual, Semantic and View Consistency

Pages 617–627https://doi.org/10.1109/TIP.2019.2934576

Multi-view visual classification methods have been widely applied to use discriminative information of different views. This strategy has been proven very effective by many researchers. On the one hand, images are often treated independently without fully ...

research-article

RYF-Net: Deep Fusion Network for Single Image Haze Removal

Pages 628–640https://doi.org/10.1109/TIP.2019.2934360

Haze removal from a single image is a challenging task. Estimation of accurate scene transmission map (TrMap) is the key to reconstruct the haze-free scene. In this paper, we propose a convolutional neural network based architecture to estimate the TrMap ...

research-article

Deep Learning-Based Picture-Wise Just Noticeable Distortion Prediction Model for Image Compression

Pages 641–656https://doi.org/10.1109/TIP.2019.2933743

Picture Wise Just Noticeable Difference (PW-JND), which accounts for the minimum difference of a picture that human visual system can perceive, can be widely used in perception-oriented image and video processing. However, the conventional Just Noticeable ...

research-article

Multi-Task Deep Relative Attribute Learning for Visual Urban Perception

Pages 657–669https://doi.org/10.1109/TIP.2019.2932502

Visual urban perception aims to quantify perceptual attributes (e.g., safe and depressing attributes) of physical urban environment from crowd-sourced street-view images and their pairwise comparisons. It has been receiving more and more attention in ...

IEEE Transactions on Image Processing

Sections

Model-Free Tracker for Multiple Objects Using Joint Appearance and Motion Inference

Advanced 3D Motion Prediction for Video-Based Dynamic Point Cloud Compression

Deep Neural Network Regression for Automated Retinal Layer Segmentation in Optical Coherence Tomography Images

Geometry Coding for Dynamic Voxelized Point Clouds Using Octrees and Multiple Contexts

HA-CCN: Hierarchical Attention-Based Crowd Counting Network

Morphology-Based Noise Reduction: Structural Variation and Thresholding in the Bitonic Filter

Inpainting Versus Denoising for Dose Reduction in Scanning-Beam Microscopies

Deep Salient Object Detection With Contextual Information Guidance

Image Compressed Sensing Using Convolutional Neural Network

Exemplar-Based Recursive Instance Segmentation With Application to Plant Image Analysis

Supervised Deep Sparse Coding Networks for Image Classification

Graph Transform Optimization With Application to Image Compression

Wavelet-Based Spectral–Spatial Transforms for CFA-Sampled Raw Camera Image Compression

Enhanced Fuzzy-Based Local Information Algorithm for Sonar Image Segmentation

High-Resolution Encoder–Decoder Networks for Low-Contrast Medical Image Segmentation

Learning Rich Part Hierarchies With Progressive Attention Networks for Fine-Grained Image Recognition

Holistic Multi-Modal Memory Network for Movie Question Answering

Weighted Guided Image Filtering With Steering Kernel

Sparse Representation-Based Video Quality Assessment for Synthesized 3D Videos

Image Representations With Spatial Object-to-Object Relations for RGB-D Scene Recognition

Semi-Supervised Deep Coupled Ensemble Learning With Classification Landmark Exploration

Summit Navigator: A Novel Approach for Local Maxima Extraction

Hyperspectral Image Denoising via Matrix Factorization and Deep Prior Regularization

Learning Modality-Specific Representations for Visible-Infrared Person Re-Identification

Unambiguous Scene Text Segmentation With Referring Expression Comprehension

Geometry-Aware Graph Transforms for Light Field Compact Representation

Multi-View Image Classification With Visual, Semantic and View Consistency

RYF-Net: Deep Fusion Network for Single Image Haze Removal

Deep Learning-Based Picture-Wise Just Noticeable Distortion Prediction Model for Image Compression

Multi-Task Deep Relative Attribute Learning for Visual Urban Perception

Sections

Save to Binder

Comments