Scene understanding

Applied Filters

People

Publications

Conferences

Publication Date

26 Results for: Book/Issue: MM '15: Proceedings of the 23rd ACM international conference on MultimediaEdit SearchSave SearchRSS

Searched The ACM Guide to Computing Literature (3,846,323 records)|Limit your search to The ACM Full-Text Collection (775,757 records)

Showing 1 - 20of26 Results

Filters

Select All

Export Citations Save to Binder

per page:

Recency

invited-talk
October 2015
Vision-enhanced Immersive Interaction and Remote Collaboration with Large Touch Displays
- Zhengyou Zhang
MM '15: Proceedings of the 23rd ACM international conference on MultimediaPages 3–4https://doi.org/10.1145/2733373.2817845

Large displays are becoming commodity, and more and more, they are touch-enabled. In this keynote, we describe a system called ViiBoard (Vision-enhanced Immersive Interaction with touch Board) that enables natural interaction and immersive remote ...
0
282
Metrics
Total Citations0
Total Downloads282
Last 12 Months8
Last 6 weeks1
Get Access
research-article
October 2015
Who are the Devils Wearing Prada in New York City?
MM '15: Proceedings of the 23rd ACM international conference on MultimediaPages 177–180https://doi.org/10.1145/2733373.2809930

Fashion is a perpetual topic in human social life, and the mass has the penchant to emulate what large city residents and celebrities wear. Undeniably, New York City is such a bellwether large city with all kinds of fashion leadership. Consequently, to ...
23
404
Metrics
Total Citations23
Total Downloads404
Last 12 Months17
Last 6 weeks4
Get Access
abstract
October 2015
Captioning Images Using Different Styles
- Alexander Patrick Mathews
MM '15: Proceedings of the 23rd ACM international conference on MultimediaPages 665–668https://doi.org/10.1145/2733373.2807998

I develop techniques that can be used to incorporate stylistic objectives into existing image captioning systems. Style is generally a very tricky concept to define, thus I concentrate on two specific components of style. First I develop a technique for ...
7
294
Metrics
Total Citations7
Total Downloads294
Last 12 Months6
Last 6 weeks1
Get Access
demonstration
October 2015
AR in Hand: Egocentric Palm Pose Tracking and Gesture Recognition for Augmented Reality Applications
MM '15: Proceedings of the 23rd ACM international conference on MultimediaPages 743–744https://doi.org/10.1145/2733373.2807972

Wearable devices such as Microsoft Hololens and Google glass are highly popular in recent years. As traditional input hardware is difficult to use on such platforms, vision-based hand pose tracking and gesture control techniques are more suitable ...
36
1,045
Metrics
Total Citations36
Total Downloads1,045
Last 12 Months37
Last 6 weeks3
Get Access
abstract
October 2015
ImmersiveMe'15: 3rd ACM International Workshop on Immersive Media Experiences
MM '15: Proceedings of the 23rd ACM international conference on MultimediaPages 1339–1340https://doi.org/10.1145/2733373.2806410

This ACM International Workshop on Immersive Media Experiences is in its 3rd edition. Since 2013 in Barcelona, it has been a meeting point of researchers, students, media producers, service providers and industry players in the area of immersive media ...
0
98
Metrics
Total Citations0
Total Downloads98
Last 12 Months3
Last 6 weeks1
Get Access
short-paper
October 2015
Vision-Inertial Hybrid Tracking for Robust and Efficient Augmented Reality on Smartphones
MM '15: Proceedings of the 23rd ACM international conference on MultimediaPages 1039–1042https://doi.org/10.1145/2733373.2806396

This paper aims at robust and efficient pose tracking for augmented reality on modern smartphones. Existing methods, relying on either vision analysis or motion sensing, are either too computationally expensive to achieve real-time performance on a ...
6
193
Metrics
Total Citations6
Total Downloads193
Last 12 Months4
Last 6 weeks0
Get Access
short-paper
October 2015
Deep People Counting in Extremely Dense Crowds
MM '15: Proceedings of the 23rd ACM international conference on MultimediaPages 1299–1302https://doi.org/10.1145/2733373.2806337

People counting in extremely dense crowds is an important step for video surveillance and anomaly warning. The problem becomes especially more challenging due to the lack of training samples, severe occlusions, cluttered scenes and variation of ...
288
1,788
Metrics
Total Citations288
Total Downloads1,788
Last 12 Months91
Last 6 weeks8
Get Access
short-paper
October 2015
Exclusive Constrained Discriminative Learning for Weakly-Supervised Semantic Segmentation
MM '15: Proceedings of the 23rd ACM international conference on MultimediaPages 1251–1254https://doi.org/10.1145/2733373.2806329

How to import image-level labels as weak supervision to direct the region-level labeling task is the core task of weakly-supervised semantic segmentation. In this paper, we focus on designing an effective but simple weakly-supervised constraint, and ...
1
157
Metrics
Total Citations1
Total Downloads157
Last 12 Months3
Last 6 weeks0
Get Access
short-paper
October 2015
Semi- and Weakly- Supervised Semantic Segmentation with Deep Convolutional Neural Networks
MM '15: Proceedings of the 23rd ACM international conference on MultimediaPages 1223–1226https://doi.org/10.1145/2733373.2806322

Successful semantic segmentation methods typically rely on the training datasets containing a large number of pixel-wise labeled images. To alleviate the dependence on such a fully annotated training dataset, in this paper, we propose a semi- and weakly-...
8
408
Metrics
Total Citations8
Total Downloads408
Last 12 Months9
Last 6 weeks0
Get Access
short-paper
October 2015
GPU Accelerated Generalised Subclass Discriminant Analysis for Event and Concept Detection in Video
MM '15: Proceedings of the 23rd ACM international conference on MultimediaPages 1219–1222https://doi.org/10.1145/2733373.2806321

In this paper a discriminant analysis (DA) technique called accelerated generalised subclass discriminant analysis (AGSDA) and its GPU implementation are presented. This method identifies a discriminant subspace of the input space in three steps: a) ...
4
124
Metrics
Total Citations4
Total Downloads124
Last 12 Months2
Last 6 weeks0
Get Access
short-paper
October 2015
Human Action Recognition With Trajectory Based Covariance Descriptor In Unconstrained Videos
- Hanli Wang,
- Yun Yi,
- Jun Wu
MM '15: Proceedings of the 23rd ACM international conference on MultimediaPages 1175–1178https://doi.org/10.1145/2733373.2806310

Human action recognition from realistic videos plays a key role in multimedia event detection and understanding. In this paper, a novel Trajectory Based Covariance (TBC) descriptor is proposed, which is formulated along the dense trajectories. To map ...
9
232
Metrics
Total Citations9
Total Downloads232
Last 12 Months6
Last 6 weeks0
Get Access
short-paper
October 2015
Online Object Tracking Based on CNN with Metropolis-Hasting Re-Sampling
MM '15: Proceedings of the 23rd ACM international conference on MultimediaPages 1163–1166https://doi.org/10.1145/2733373.2806307

Tracking-by-learning strategies have been effective in solving many challenging problems in visual tracking, in which the learning sample generation and labeling play important roles for final performance. Since the concern of deep learning based ...
7
301
Metrics
Total Citations7
Total Downloads301
Last 12 Months7
Last 6 weeks0
Get Access
short-paper
October 2015
Hyperspectral Image Classification with Convolutional Neural Networks
MM '15: Proceedings of the 23rd ACM international conference on MultimediaPages 1159–1162https://doi.org/10.1145/2733373.2806306

Hyperspectral image (HSI) classification is one of the most widely used methods for scene analysis from hyperspectral imagery. In the past, many different engineered features have been proposed for the HSI classification problem. In this paper, however, ...
87
1,379
Metrics
Total Citations87
Total Downloads1,379
Last 12 Months75
Last 6 weeks5
Get Access
short-paper
October 2015
Spatio-Temporal Triangular-Chain CRF for Activity Recognition
MM '15: Proceedings of the 23rd ACM international conference on MultimediaPages 1151–1154https://doi.org/10.1145/2733373.2806304

Understanding human activities in video is a fundamental problem in computer vision. In real life, human activities are composed of temporal and spatial arrangement of actions. Understanding such complex activities requires recognizing not only each ...
4
212
Metrics
Total Citations4
Total Downloads212
Last 12 Months4
Last 6 weeks1
Get Access
short-paper
October 2015
Predicting Image Memorability by Multi-view Adaptive Regression
MM '15: Proceedings of the 23rd ACM international conference on MultimediaPages 1147–1150https://doi.org/10.1145/2733373.2806303

The images we encounter throughout our lives make different impressions on us: Some are remembered at first glance, while others are forgotten. This phenomenon is caused by the intrinsic memorability of images revealed by recent studies [5,6]. In this ...
15
228
Metrics
Total Citations15
Total Downloads228
Last 12 Months7
Last 6 weeks2
Get Access
short-paper
October 2015
3D Person Tracking In World Coordinates and Attribute Estimation with PDR
MM '15: Proceedings of the 23rd ACM international conference on MultimediaPages 1139–1142https://doi.org/10.1145/2733373.2806301

In this paper, we propose an online 3D person tracking method and an attribute estimation method with pedestrian dead reckoning (PDR). For person tracking, we employ a structured prediction approach, which extends the Struck algorithm. Although the main ...
2
210
Metrics
Total Citations2
Total Downloads210
Last 12 Months6
Last 6 weeks0
Get Access
short-paper
October 2015
Weak Labeled Multi-Label Active Learning for Image Classification
MM '15: Proceedings of the 23rd ACM international conference on MultimediaPages 1127–1130https://doi.org/10.1145/2733373.2806298

In order to achieve better classification performance with even fewer labeled images, active learning is suitable for these situations. Several active learning methods have been proposed for multi-label image classification, but all of them assume that ...
10
388
Metrics
Total Citations10
Total Downloads388
Last 12 Months7
Last 6 weeks1
Get Access
short-paper
October 2015
Local Depth Patterns for Tracking in Depth Videos
MM '15: Proceedings of the 23rd ACM international conference on MultimediaPages 1115–1118https://doi.org/10.1145/2733373.2806295

Conventional video tracking operates over RGB or grey-level data which contain significant clues for the identification of the targets. While this is often desirable in a video surveillance context, use of video tracking in privacy-sensitive ...
14
290
Metrics
Total Citations14
Total Downloads290
Last 12 Months2
Last 6 weeks0
Get Access
short-paper
October 2015
A Probabilistic Approach for Image Retrieval Using Descriptive Textual Queries
- Yashaswi Verma,
- C.V. Jawahar
MM '15: Proceedings of the 23rd ACM international conference on MultimediaPages 1091–1094https://doi.org/10.1145/2733373.2806289

We address the problem of image retrieval using textual queries. In particular, we focus on descriptive queries that can be either in the form of simple captions (e.g., ``a brown cat sleeping on a sofa''), or even long descriptions with multiple ...
0
146
Metrics
Total Citations0
Total Downloads146
Last 12 Months2
Last 6 weeks0
Get Access
short-paper
October 2015
Detecting Salient Objects via Spatial and Appearance Compactness Hypotheses
MM '15: Proceedings of the 23rd ACM international conference on MultimediaPages 1087–1090https://doi.org/10.1145/2733373.2806288

Object-level saliency detection has been attracting a lot of attention, due to its potential enhancement in many high-level vision tasks. Many previous methods are based on the contrast hypothesis which regards the regions with high contrast in a ...
1
176
Metrics
Total Citations1
Total Downloads176
Last 12 Months3
Last 6 weeks0
Get Access

Applied Filters

People

Names

Institutions

Authors

Publications

Proceedings/Book Names

All Publications

Content Type

Media Formats

Publisher

Conferences

Sponsors

Conference Event

Proceedings Series

Publication Date

Results

Vision-enhanced Immersive Interaction and Remote Collaboration with Large Touch Displays

Who are the Devils Wearing Prada in New York City?

Captioning Images Using Different Styles

AR in Hand: Egocentric Palm Pose Tracking and Gesture Recognition for Augmented Reality Applications

ImmersiveMe'15: 3rd ACM International Workshop on Immersive Media Experiences

Vision-Inertial Hybrid Tracking for Robust and Efficient Augmented Reality on Smartphones

Deep People Counting in Extremely Dense Crowds

Exclusive Constrained Discriminative Learning for Weakly-Supervised Semantic Segmentation

Semi- and Weakly- Supervised Semantic Segmentation with Deep Convolutional Neural Networks

GPU Accelerated Generalised Subclass Discriminant Analysis for Event and Concept Detection in Video

Human Action Recognition With Trajectory Based Covariance Descriptor In Unconstrained Videos

Online Object Tracking Based on CNN with Metropolis-Hasting Re-Sampling

Hyperspectral Image Classification with Convolutional Neural Networks

Spatio-Temporal Triangular-Chain CRF for Activity Recognition

Predicting Image Memorability by Multi-view Adaptive Regression

3D Person Tracking In World Coordinates and Attribute Estimation with PDR

Weak Labeled Multi-Label Active Learning for Image Classification

Local Depth Patterns for Tracking in Depth Videos

A Probabilistic Approach for Image Retrieval Using Descriptive Textual Queries

Detecting Salient Objects via Spatial and Appearance Compactness Hypotheses