Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1007/978-3-030-01252-6guideproceedingsBook PagePublication PagesConference Proceedingsacm-pubtype
Computer Vision – ECCV 2018: 15th European Conference, Munich, Germany, September 8-14, 2018, Proceedings, Part XI
2018 Proceeding
Publisher:
  • Springer-Verlag
  • Berlin, Heidelberg
Conference:
European Conference on Computer VisionMunich, Germany8 September 2018
ISBN:
978-3-030-01251-9
Published:
08 September 2018

Reflects downloads up to 25 Oct 2024Bibliometrics
Abstract

No abstract available.

Article
Deep Boosting for Image Denoising
Abstract

Boosting is a classic algorithm which has been successfully applied to diverse computer vision tasks. In the scenario of image denoising, however, the existing boosting algorithms are surpassed by the emerging learning-based models. In this paper, ...

Article
Self-Supervised Relative Depth Learning for Urban Scene Understanding
Abstract

As an agent moves through the world, the apparent motion of scene elements is (usually) inversely proportional to their depth (Strictly speaking, this statement is true only after one has compensated for camera rotation, individual object motion, ...

Article
K-convexity Shape Priors for Segmentation
Abstract

This work extends popular star-convexity and other more general forms of convexity priors. We represent an object as a union of “convex” overlappable subsets. Since an arbitrary shape can always be divided into convex parts, our regularization ...

Article
Pixel2Mesh: Generating 3D Mesh Models from Single RGB Images
Abstract

We propose an end-to-end deep learning architecture that produces a 3D shape in triangular mesh from a single color image. Limited by the nature of deep neural network, previous methods usually represent a 3D shape in volume or point cloud, and it ...

Article
Boosted Attention: Leveraging Human Attention for Image Captioning
Abstract

Visual attention has shown usefulness in image captioning, with the goal of enabling a caption model to selectively focus on regions of interest. Existing models typically rely on top-down language information and learn attention implicitly by ...

Article
Image Inpainting for Irregular Holes Using Partial Convolutions
Abstract

Existing deep learning based image inpainting methods use a standard convolutional network over the corrupted image, using convolutional filter responses conditioned on both valid pixels as well as the substitute values in the masked holes (...

Article
Fighting Fake News: Image Splice Detection via Learned Self-Consistency
Abstract

Advances in photo editing and manipulation tools have made it significantly easier to create fake imagery. Learning to detect such manipulations, however, remains a challenging problem due to the lack of sufficient amounts of manipulated training ...

Article
Hand Pose Estimation via Latent 2.5D Heatmap Regression
Abstract

Estimating the 3D pose of a hand is an essential part of human-computer interaction. Estimating 3D pose using depth or multi-view sensors has become easier with recent advances in computer vision, however, regressing pose from a single RGB image ...

Article
Depth-Aware CNN for RGB-D Segmentation
Abstract

Convolutional neural networks (CNN) are limited by the lack of capability to handle geometric information due to the fixed grid kernel structure. The availability of depth data enables progress in RGB-D semantic segmentation with CNNs. State-of-...

Article
CAR-Net: Clairvoyant Attentive Recurrent Network
Abstract

We present an interpretable framework for path prediction that leverages dependencies between agents’ behaviors and their spatial navigation environment. We exploit two sources of information: the past motion trajectory of the agent of interest ...

Article
Evaluating Capability of Deep Neural Networks for Image Classification via Information Plane
Abstract

Inspired by the pioneering work of information bottleneck principle for Deep Neural Networks (DNNs) analysis, we design an information plane based framework to evaluate the capability of DNNs for image classification tasks, which not only helps ...

Article
Super-Identity Convolutional Neural Network for Face Hallucination
Abstract

Face hallucination is a generative task to super-resolve the facial image with low resolution while human perception of face heavily relies on identity information. However, previous face hallucination approaches largely ignore facial identity ...

Article
What Do I Annotate Next? An Empirical Study of Active Learning for Action Localization
Abstract

Despite tremendous progress achieved in temporal action localization, state-of-the-art methods still struggle to train accurate models when annotated data is scarce. In this paper, we introduce a novel active learning framework for temporal ...

Article
Semi-supervised Adversarial Learning to Generate Photorealistic Face Images of New Identities from 3D Morphable Model
Abstract

We propose a novel end-to-end semi-supervised adversarial framework to generate photorealistic face images of new identities with a wide range of expressions, poses, and illuminations conditioned by synthetic images sampled from a 3D morphable ...

Article
HairNet: Single-View Hair Reconstruction Using Convolutional Neural Networks
Abstract

We introduce a deep learning-based method to generate full 3D hair geometry from an unconstrained image. Our method can recover local strand details and has real-time performance. State-of-the-art hair modeling techniques rely on large hairstyle ...

Article
Neural Network Encapsulation
Abstract

A capsule is a collection of neurons which represents different variants of a pattern in the network. The routing scheme ensures only certain capsules which resemble lower counterparts in the higher layer should be activated. However, the ...

Article
Learning Deep Representations with Probabilistic Knowledge Transfer
Abstract

Knowledge Transfer (KT) techniques tackle the problem of transferring the knowledge from a large and complex neural network into a smaller and faster one. However, existing KT methods are tailored towards classification tasks and they cannot be ...

Article
Integrating Egocentric Videos in Top-View Surveillance Videos: Joint Identification and Temporal Alignment
Abstract

Videos recorded from first person (egocentric) perspective have little visual appearance in common with those from third person perspective, especially with videos captured by top-view surveillance cameras. In this paper, we aim to relate these ...

Article
Visual-Inertial Object Detection and Mapping
Abstract

We present a method to populate an unknown environment with models of previously seen objects, placed in a Euclidean reference frame that is inferred causally and on-line using monocular video along with inertial sensors. The system we implement ...

Article
Actor-Centric Relation Network
Abstract

Current state-of-the-art approaches for spatio-temporal action localization rely on detections at the frame level and model temporal context with 3D ConvNets. Here, we go one step further and model spatio-temporal relations to capture the ...

Article
Liquid Pouring Monitoring via Rich Sensory Inputs
Abstract

Humans have the amazing ability to perform very subtle manipulation task using a closed-loop control system with imprecise mechanics (i.e., our body parts) but rich sensory information (e.g., vision, tactile, etc.). In the closed-loop system, the ...

Article
Weakly Supervised Region Proposal Network and Object Detection
Abstract

The Convolutional Neural Network (CNN) based region proposal generation method (i.e. region proposal network), trained using bounding box annotations, is an essential component in modern fully supervised object detectors. However, Weakly ...

Article
Zero-Annotation Object Detection with Web Knowledge Transfer
Abstract

Object detection is one of the major problems in computer vision, and has been extensively studied. Most of the existing detection works rely on labor-intensive supervision, such as ground truth bounding boxes of objects or at least image-level ...

Article
Receptive Field Block Net for Accurate and Fast Object Detection
Abstract

Current top-performing object detectors depend on deep CNN backbones, such as ResNet-101 and Inception, benefiting from their powerful feature representations but suffering from high computational costs. Conversely, some lightweight model based ...

Article
Deep Adversarial Attention Alignment for Unsupervised Domain Adaptation: The Benefit of Target Expectation Maximization
Abstract

In this paper, we make two contributions to unsupervised domain adaptation (UDA) using the convolutional neural network (CNN). First, our approach transfers knowledge in all the convolutional layers through attention alignment. Most previous ...

Article
MultiPoseNet: Fast Multi-Person Pose Estimation Using Pose Residual Network
Abstract

In this paper, we present MultiPoseNet, a novel bottom-up multi-person pose estimation architecture that combines a multi-task model with a novel assignment method. MultiPoseNet can jointly handle person detection, person segmentation and pose ...

Article
TS2C: Tight Box Mining with Surrounding Segmentation Context for Weakly Supervised Object Detection
Abstract

This work provides a simple approach to discover tight object bounding boxes with only image-level supervision, called Tight box mining with Surrounding Segmentation Context (TS2C). We observe that object candidates mined through current multiple ...

Article
Hierarchy of Alternating Specialists for Scene Recognition
Abstract

We introduce a method for improving convolutional neural networks (CNNs) for scene classification. We present a hierarchy of specialist networks, which disentangles the intra-class variation and inter-class similarity in a coarse to fine manner. ...

Article
Move Forward and Tell: A Progressive Generator of Video Descriptions
Abstract

We present an efficient framework that can generate a coherent paragraph to describe a given video. Previous works on video captioning usually focus on video clips. They typically treat an entire video as a whole and generate the caption ...

Article
Learning Monocular Depth by Distilling Cross-Domain Stereo Networks
Abstract

Monocular depth estimation aims at estimating a pixelwise depth map for a single image, which has wide applications in scene understanding and autonomous driving. Existing supervised and unsupervised methods face great challenges. Supervised ...

Contributors
  • Google LLC
  • Carnegie Mellon University
  • Hebrew University of Jerusalem

Index Terms

  1. Computer Vision – ECCV 2018: 15th European Conference, Munich, Germany, September 8-14, 2018, Proceedings, Part XI
        Index terms have been assigned to the content through auto-classification.

        Recommendations