Volume Table of Contents

PROCEEDINGS VOLUME 8301

IS&T/SPIE ELECTRONIC IMAGING | 22-26 JANUARY 2012

Intelligent Robots and Computer Vision XXIX: Algorithms and Techniques

Editor(s): Juha Röning, David P. Casasent

Editor Affiliations +

IN THIS VOLUME

10 Sessions, 40 Papers, 0 Presentations, 0 Posters

Front Matter: Volume 8301 (1)

Invited Papers on Intelligent Robotics (3)

Stereo Vision and Applications (6)

Novel People and Vehicle Tracking Approaches (5)

UAVs and Aerial Applications (3)

Robot Manipulation and Application (3)

Vision Navigation and Activity Recognition (5)

Visual Algorithms (4)

Intelligent Ground Vehicle Competition (5)

Interactive Paper Session (5)

IS&T/SPIE ELECTRONIC IMAGING

22-26 January 2012

Burlingame, California, United States

Present at an SPIE Conference

Subscribe to Digital Library

VIEW ALL ABSTRACTS +

Front Matter: Volume 8301

Front Matter: Volume 8301 Open Access

Proceedings Volume Intelligent Robots and Computer Vision XXIX: Algorithms and Techniques, 830101 (2012) https://doi.org/10.1117/12.918432

Read Abstract +

Invited Papers on Intelligent Robotics

Software-based neural network assisted movement compensation for nanoresolution piezo actuators Paper

Marko Kauppinen, Juha Röning

Proceedings Volume Intelligent Robots and Computer Vision XXIX: Algorithms and Techniques, 830102 (2012) https://doi.org/10.1117/12.905749

Read Abstract +

Traffic monitoring with distributed smart cameras Paper

Oliver Sidla, Marcin Rosner, Michael Ulm, Gert Schwingshackl

Proceedings Volume Intelligent Robots and Computer Vision XXIX: Algorithms and Techniques, 830103 (2012) https://doi.org/10.1117/12.908358

Read Abstract +

The 19th Annual Intelligent Ground Vehicle Competition: student built autonomous ground vehicles Paper

Bernard L. Theisen

Proceedings Volume Intelligent Robots and Computer Vision XXIX: Algorithms and Techniques, 830104 (2012) https://doi.org/10.1117/12.909520

Read Abstract +

Stereo Vision and Applications

Accurate dense 3D reconstruction of moving and still objects from dynamic stereo sequences based on temporal modified-RANSAC and feature-cut Paper

Naotomo Tatematsu, Jun Ohya

Proceedings Volume Intelligent Robots and Computer Vision XXIX: Algorithms and Techniques, 830105 (2012) https://doi.org/10.1117/12.908037

Read Abstract +

This paper improves the authors' conventional method for reconstructing the 3D structure of moving and still objects that are tracked in the video and/or depth image sequences acquired by moving cameras and/or range finder. The authors proposed a Temporal Modified-RANSAC based method [1] that (1) can discriminate each moving object from the still background in color image and depth image sequences acquired by moving stereo cameras or moving range finder, (2) can compute the stereo cameras' egomotion, (3) can compute the motion of each moving object, and (4) can reconstruct the 3D structure of each moving object and the background. However, the TMR based method has the following two problems concerning the 3D reconstruction: lack of accuracy of segmenting into each object's region and sparse 3D reconstructed points in each object's region. To solve these problems of our conventional method, this paper proposes a new 3D segmentation method that utilizes Graph-cut, which is frequently used for segmentation tasks. First, the proposed method tracks feature points in the color and depth image sequences so that 3D optical flows of the feature points in every N frames are obtained. Then, TMR classifies all the obtained 3D optical flows into regions (3D flow set) for the background and each moving object; simultaneously, the rotation matrix and the translation vector for each 3D flow set are computed. Next, Graph-Cut using the energy function that consists of color probability, structure probability and a-priori probability is performed so that pixels in each frame are segmented into object regions and the background region. Finally, 3D point clouds are obtained from the segmentation result image and depth image, and then the point clouds are merged using the rotation and translation from the N-th frame prior to the current frame so that 3D models for the background and each moving object are constructed with dense 3D point data.

Efficient hybrid monocular-stereo approach to on-board video-based traffic sign detection and tracking Paper

Javier Marinas, Luis Salgado, Jon Arróspide, Massimo Camplani

Proceedings Volume Intelligent Robots and Computer Vision XXIX: Algorithms and Techniques, 830106 (2012) https://doi.org/10.1117/12.908585

Read Abstract +

A general model and calibration method for spherical stereoscopic vision Paper

Weijia Feng, Juha Röning, Juho Kannala, Xiaoning Zong, Baofeng Zhang

Proceedings Volume Intelligent Robots and Computer Vision XXIX: Algorithms and Techniques, 830107 (2012) https://doi.org/10.1117/12.907071

Read Abstract +

An approach to stereo-point cloud registration using image homographies Paper

Stephen D. Fox, Damian M. Lyons

Proceedings Volume Intelligent Robots and Computer Vision XXIX: Algorithms and Techniques, 830108 (2012) https://doi.org/10.1117/12.908968

Read Abstract +

Hazardous sign detection for safety applications in traffic monitoring Paper

Wanda Benesova, Michal Kottman, Oliver Sidla

Proceedings Volume Intelligent Robots and Computer Vision XXIX: Algorithms and Techniques, 830109 (2012) https://doi.org/10.1117/12.905813

Read Abstract +

PRoViScout: a planetary scouting rover demonstrator Paper

Gerhard Paar, Mark Woods, Christiane Gimkiewicz, Frédéric Labrosse, Alberto Medina, Laurence Tyler, David P. Barnes, Gerald Fritz, Konstantinos Kapellos

Proceedings Volume Intelligent Robots and Computer Vision XXIX: Algorithms and Techniques, 83010A (2012) https://doi.org/10.1117/12.906122

Read Abstract +

Novel People and Vehicle Tracking Approaches

Red-light traffic enforcement at railway crossings Paper

Oliver Sidla, Gernot Loibner

Proceedings Volume Intelligent Robots and Computer Vision XXIX: Algorithms and Techniques, 83010B (2012) https://doi.org/10.1117/12.908369

Read Abstract +

Image projection clues for improved real-time vehicle tracking in tunnels Paper

Vedran Jelaca, Jorge Oswaldo Niño Castaneda, Aleksandra Pizurica, Wilfried Philips

Proceedings Volume Intelligent Robots and Computer Vision XXIX: Algorithms and Techniques, 83010C (2012) https://doi.org/10.1117/12.908873

Read Abstract +

Decentralized tracking of humans using a camera network Paper

Sebastian Gruenwedel, Vedran Jelaca, Jorge Oswaldo Niño-Castañeda, Peter Van Hese, Dimitri Van Cauwelaert, Peter Veelaert, Wilfried Philips

Proceedings Volume Intelligent Robots and Computer Vision XXIX: Algorithms and Techniques, 83010D (2012) https://doi.org/10.1117/12.907082

Read Abstract +

Real-time detection of traffic events using smart cameras Paper

M. Macesic, V. Jelaca, J. O. Niño-Castaneda, N. Prodanovic, M. Panic, A. Pizurica, V. Crnojevic, W. Philips

Proceedings Volume Intelligent Robots and Computer Vision XXIX: Algorithms and Techniques, 83010E (2012) https://doi.org/10.1117/12.909461

Read Abstract +

Vehicle tracking data for calibrating microscopic traffic simulation models Paper

R. Schönauer, Y. Lipetski, H. Schrom-Feiertag

Proceedings Volume Intelligent Robots and Computer Vision XXIX: Algorithms and Techniques, 83010F (2012) https://doi.org/10.1117/12.912090

Read Abstract +

UAVs and Aerial Applications

AR.Drone: security threat analysis and exemplary attack to track persons Paper

Fred Samland, Jana Fruth, Mario Hildebrandt, Tobias Hoppe, Jana Dittmann

Proceedings Volume Intelligent Robots and Computer Vision XXIX: Algorithms and Techniques, 83010G (2012) https://doi.org/10.1117/12.902990

Read Abstract +

Detection of unknown targets from aerial camera and extraction of simple object fingerprints for the purpose of target reacquisition Paper

T. Nathan Mundhenk, Kang-Yu Ni, Yang Chen, Kyungnam Kim, Yuri Owechko

Proceedings Volume Intelligent Robots and Computer Vision XXIX: Algorithms and Techniques, 83010H (2012) https://doi.org/10.1117/12.906491

Read Abstract +

An aerial multiple camera tracking paradigm needs to not only spot unknown targets and track them, but also needs to know how to handle target reacquisition as well as target handoff to other cameras in the operating theater. Here we discuss such a system which is designed to spot unknown targets, track them, segment the useful features and then create a signature fingerprint for the object so that it can be reacquired or handed off to another camera. The tracking system spots unknown objects by subtracting background motion from observed motion allowing it to find targets in motion, even if the camera platform itself is moving. The area of motion is then matched to segmented regions returned by the EDISON mean shift segmentation tool. Whole segments which have common motion and which are contiguous to each other are grouped into a master object. Once master objects are formed, we have a tight bound on which to extract features for the purpose of forming a fingerprint. This is done using color and simple entropy features. These can be placed into a myriad of different fingerprints. To keep data transmission and storage size low for camera handoff of targets, we try several different simple techniques. These include Histogram, Spatiogram and Single Gaussian Model. These are tested by simulating a very large number of target losses in six videos over an interval of 1000 frames each from the DARPA VIVID video set. Since the fingerprints are very simple, they are not expected to be valid for long periods of time. As such, we test the shelf life of fingerprints. This is how long a fingerprint is good for when stored away between target appearances. Shelf life gives us a second metric of goodness and tells us if a fingerprint method has better accuracy over longer periods. In videos which contain multiple vehicle occlusions and vehicles of highly similar appearance we obtain a reacquisition rate for automobiles of over 80% using the simple single Gaussian model compared with the null hypothesis of <20%. Additionally, the performance for fingerprints stays well above the null hypothesis for as much as 800 frames. Thus, a simple and highly compact single Gaussian model is useful for target reacquisition. Since the model is agnostic to view point and object size, it is expected to perform as well on a test of target handoff. Since some of the performance degradation is due to problems with the initial target acquisition and tracking, the simple Gaussian model may perform even better with an improved initial acquisition technique. Also, since the model makes no assumption about the object to be tracked, it should be possible to use it to fingerprint a multitude of objects, not just cars. Further accuracy may be obtained by creating manifolds of objects from multiple samples.

Super-resolution terrain map enhancement for navigation based on satellite imagery Paper

Jeremy Straub

Proceedings Volume Intelligent Robots and Computer Vision XXIX: Algorithms and Techniques, 83010J (2012) https://doi.org/10.1117/12.911942

Read Abstract +

Robot Manipulation and Application

3D positional control of magnetic levitation system using adaptive control: improvement of positioning control in horizontal plane Paper

Toshimasa Nishino, Yasuhiro Fujitani, Norihiko Kato, Naoaki Tsuda, Yoshihiko Nomura, Hirokazu Matsui

Proceedings Volume Intelligent Robots and Computer Vision XXIX: Algorithms and Techniques, 83010K (2012) https://doi.org/10.1117/12.907722

Read Abstract +

The magic glove: a gesture-based remote controller for intelligent mobile robots Paper

Chaomin Luo, Yue Chen, Mohan Krishnan, Mark Paulik

Proceedings Volume Intelligent Robots and Computer Vision XXIX: Algorithms and Techniques, 83010M (2012) https://doi.org/10.1117/12.912186

Read Abstract +

Way-point navigation for a skid-steer vehicle in unknown environments Paper

Peiyi Chen, Arun Das, Prasenjit Mukherjee, Steven Waslander

Proceedings Volume Intelligent Robots and Computer Vision XXIX: Algorithms and Techniques, 83010N (2012) https://doi.org/10.1117/12.910074

Read Abstract +

Vision Navigation and Activity Recognition

Integrated field testing of planetary robotics vision processing: the PRoVisG campaign in Tenerife 2011 Paper

G. Paar, L. Waugh, D. P. Barnes, T. Pajdla, M. Woods, H.-R. Graf, Y. Gao, K. Willner, J.-P. Muller, et al.

Proceedings Volume Intelligent Robots and Computer Vision XXIX: Algorithms and Techniques, 83010O (2012) https://doi.org/10.1117/12.906410

Read Abstract +

Hierarchical loop detection for mobile outdoor robots Paper

Dagmar Lang, Christian Winkens, Marcel Häselich, Dietrich Paulus

Proceedings Volume Intelligent Robots and Computer Vision XXIX: Algorithms and Techniques, 83010P (2012) https://doi.org/10.1117/12.908277

Read Abstract +

A novel margin-based linear embedding technique for visual object recognition Paper

F. Dornaika, A. Assoum

Proceedings Volume Intelligent Robots and Computer Vision XXIX: Algorithms and Techniques, 83010Q (2012) https://doi.org/10.1117/12.906698

Read Abstract +

Linear Dimensionality Reduction (LDR) techniques have been increasingly important in computer vision and pattern recognition since they permit a relatively simple mapping of data onto a lower dimensional subspace, leading to simple and computationally efficient classification strategies. Recently, many linear discriminant methods have been developed in order to reduce the dimensionality of visual data and to enhance the discrimination between different groups or classes. Although many linear discriminant analysis methods have been proposed in the literature, they suffer from at least one of the following shortcomings: i) they require the setting of many parameters (e.g., the neighborhood sizes for homogeneous and heterogeneous samples), ii) they suffer from the Small Sample Size problem that often occurs when dealing with visual data sets for which the number of samples is less than the dimension of the sample, and iii) most of the traditional subspace learning methods have to determine the dimension of the projected space by either cross-validation or exhaustive search. In this paper, we propose a novel margin-based linear embedding method that exploits the nearest hit and the nearest miss samples only. Our proposed method tackles all the above shortcomings. It finds the projection directions such that the sum of local margins is maximized. Our proposed approach has been applied to the problem of appearancebased face recognition. Experimental results performed on four public face databases show that the proposed approach can give better generalization performance than the competing methods. These competing methods used for performance comparison were: Principal Component Analysis (PCA), Locality Preserving Projections (LPP), Average Neighborhood Margin Maximization (ANMM), and Maximally Collapsing Metric Learning algorithm (MCML). The proposed approach could also be applied to other category of objects characterized by large variations in their appearance.

Real-time two-level foreground detection and person-silhouette extraction enhanced by body-parts tracking Paper

Rada Deeb, Elodie Desserée, Saida Bouakaz

Proceedings Volume Intelligent Robots and Computer Vision XXIX: Algorithms and Techniques, 83010R (2012) https://doi.org/10.1117/12.908435

Read Abstract +

In this paper we discuss foreground detection and human body silhouette extraction and tracking in monocular video systems designed for human motion analysis applications. Vision algorithms face many challenges when it comes to analyze human activities in non-controlled environments. For instance, issues like illumination changes, shadows, camouflage and occlusions make the detection and the tracking of a moving person a hard task to accomplish. Hence, advanced solutions are required to analyze the content of video sequences. We propose a real-time, two-level foreground detection, enhanced by body parts tracking, designed to efficiently extract person silhouette and body parts for monocular video-based human motion analysis systems. We aim to find solutions for different non-controlled environment challenges, which make the detection and the tracking of a moving person a hard task to accomplish. On the first level, we propose an enhanced Mixture of Gaussians, built on both chrominanceluminance and chrominance-only spaces, which handles global illumination changes. On the second level, we improve segmentation results, in interesting areas, by using statistical foreground models updated by a high-level tracking of body parts. Each body part is represented with a set of template characterized by a feature vector built in an initialization phase. Then, high level tracking is done by finding blob-template correspondences via distance minimization in feature space. Correspondences are then used to update foreground models, and a graph cut algorithm, which minimizes a Markov random field energy function containing these models, is used to refine segmentation. We were able to extract a refined silhouette in the presence of light changes, noise and camouflage. Moreover, the tracking approach allowed us to infer information about the presence and the location of body parts even in the case of partial occlusion.

Activity recognition from video using layered approach Paper

Charles A. McPherson, John M. Irvine, Mon Young, Anthony Stefanidis

Proceedings Volume Intelligent Robots and Computer Vision XXIX: Algorithms and Techniques, 83010S (2012) https://doi.org/10.1117/12.909585

Read Abstract +

Visual Algorithms

Method for fast detecting the intersection of a plane and a cube in an octree structure to find point sets within a convex region Paper

K, Fujimoto, N. Kimura, T. Moriya

Proceedings Volume Intelligent Robots and Computer Vision XXIX: Algorithms and Techniques, 83010T (2012) https://doi.org/10.1117/12.906497

Read Abstract +

Lucas-Kanade image registration using camera parameters Paper

Sunghyun Cho, Hojin Cho, Yu-Wing Tai, Young Su Moon, Junguk Cho, Shihwa Lee, Seungyong Lee

Proceedings Volume Intelligent Robots and Computer Vision XXIX: Algorithms and Techniques, 83010V (2012) https://doi.org/10.1117/12.907776

Read Abstract +

Object tracking with adaptive HOG detector and adaptive Rao-Blackwellised particle filter Paper

Stefano Rosa, Marco Paleari, Paolo Ariano, Basilio Bona

Proceedings Volume Intelligent Robots and Computer Vision XXIX: Algorithms and Techniques, 83010W (2012) https://doi.org/10.1117/12.911991

Read Abstract +

A modular real-time vision system for humanoid robots Paper

Alina L. Trifan, António J. R. Neves, Nuno Lau, Bernardo Cunha

Proceedings Volume Intelligent Robots and Computer Vision XXIX: Algorithms and Techniques, 83010X (2012) https://doi.org/10.1117/12.911206

Read Abstract +

Robotic vision is nowadays one of the most challenging branches of robotics. In the case of a humanoid robot, a robust vision system has to provide an accurate representation of the surrounding world and to cope with all the constraints imposed by the hardware architecture and the locomotion of the robot. Usually humanoid robots have low computational capabilities that limit the complexity of the developed algorithms. Moreover, their vision system should perform in real time, therefore a compromise between complexity and processing times has to be found. This paper presents a reliable implementation of a modular vision system for a humanoid robot to be used in color-coded environments. From image acquisition, to camera calibration and object detection, the system that we propose integrates all the functionalities needed for a humanoid robot to accurately perform given tasks in color-coded environments. The main contributions of this paper are the implementation details that allow the use of the vision system in real-time, even with low processing capabilities, the innovative self-calibration algorithm for the most important parameters of the camera and its modularity that allows its use with different robotic platforms. Experimental results have been obtained with a NAO robot produced by Aldebaran, which is currently the robotic platform used in the RoboCup Standard Platform League, as well as with a humanoid build using the Bioloid Expert Kit from Robotis. As practical examples, our vision system can be efficiently used in real time for the detection of the objects of interest for a soccer playing robot (ball, field lines and goals) as well as for navigating through a maze with the help of color-coded clues. In the worst case scenario, all the objects of interest in a soccer game, using a NAO robot, with a single core 500Mhz processor, are detected in less than 30ms. Our vision system also includes an algorithm for self-calibration of the camera parameters as well as two support applications that can run on an external computer for color calibration and debugging purposes. These applications are built based on a typical client-server model, in which the main vision pipe runs as a server, allowing clients to connect and distantly monitor its performance, without interfering with its efficiency. The experimental results that we acquire prove the efficiency of our approach both in terms of accuracy and processing time. Despite having been developed for the NAO robot, the modular design of the proposed vision system allows it to be easily integrated into other humanoid robots with a minimum number of changes, mostly in the acquisition module.

Intelligent Ground Vehicle Competition

Radial polar histogram: obstacle avoidance and path planning for robotic cognition and motion control Paper

Po-Jen Wang, Nicholas R. Keyawa, Craig Euler

Proceedings Volume Intelligent Robots and Computer Vision XXIX: Algorithms and Techniques, 83010Y (2012) https://doi.org/10.1117/12.909069

Read Abstract +

Optimizing a mobile robot control system using GPU acceleration Paper

Nat Tuck, Michael McGuinness, Fred Martin

Proceedings Volume Intelligent Robots and Computer Vision XXIX: Algorithms and Techniques, 83010Z (2012) https://doi.org/10.1117/12.909231

Read Abstract +

Design and realization of an intelligent ground vehicle with modular payloads Paper

Mehmet A. Akmanalp, Ryan M. Doherty, Jeffrey Gorges, Peter Kalauskas, Ellen Peterson, Felipe Polido, Stephen S. Nestinger, Taskin Padir

Proceedings Volume Intelligent Robots and Computer Vision XXIX: Algorithms and Techniques, 830110 (2012) https://doi.org/10.1117/12.909946

Read Abstract +

Navigating a path delineated by colored flags: an approach for a 2011 IGVC requirement Paper

Alex Szmatula, Matt Parrish, Mohan Krishnan, Mark Paulik, Utayba Mohammad, Chaomin Luo

Proceedings Volume Intelligent Robots and Computer Vision XXIX: Algorithms and Techniques, 830111 (2012) https://doi.org/10.1117/12.912124

Read Abstract +

Navigating with VFH: a strategy to avoid traps Paper

Chaomin Luo, Mohan Krishnan, Mark Paulik, Utayba Mohammad, Qing Wang

Proceedings Volume Intelligent Robots and Computer Vision XXIX: Algorithms and Techniques, 830112 (2012) https://doi.org/10.1117/12.912170

Read Abstract +

Interactive Paper Session

Measurement of noises and modulation transfer function of cameras used in optical-digital correlators Paper

Nikolay N. Evtikhiev, Sergey N. Starikov, Pavel A. Cheryomkhin, Vitaly V. Krasnov

Proceedings Volume Intelligent Robots and Computer Vision XXIX: Algorithms and Techniques, 830113 (2012) https://doi.org/10.1117/12.908198

Read Abstract +

A phase space approach for detection and removal of rain in video Paper

Varun Santhaseelan, Vijayan K. Asari

Proceedings Volume Intelligent Robots and Computer Vision XXIX: Algorithms and Techniques, 830114 (2012) https://doi.org/10.1117/12.909491

Read Abstract +

Intelligence algorithms for autonomous navigation in a ground vehicle Paper

Steve Petkovsek, Rahul Shakya, Young Ho Shin, Prasanna Gautam, Adam Norton, David J. Ahlgren

Proceedings Volume Intelligent Robots and Computer Vision XXIX: Algorithms and Techniques, 830115 (2012) https://doi.org/10.1117/12.909514

Read Abstract +

This paper will discuss the approach to autonomous navigation used by "Q," an unmanned ground vehicle designed by the Trinity College Robot Study Team to participate in the Intelligent Ground Vehicle Competition (IGVC). For the 2011 competition, Q's intelligence was upgraded in several different areas, resulting in a more robust decision-making process and a more reliable system. In 2010-2011, the software of Q was modified to operate in a modular parallel manner, with all subtasks (including motor control, data acquisition from sensors, image processing, and intelligence) running simultaneously in separate software processes using the National Instruments (NI) LabVIEW programming language. This eliminated processor bottlenecks and increased flexibility in the software architecture. Though overall throughput was increased, the long runtime of the image processing process (150 ms) reduced the precision of Q's realtime decisions. Q had slow reaction times to obstacles detected only by its cameras, such as white lines, and was limited to slow speeds on the course. To address this issue, the image processing software was simplified and also pipelined to increase the image processing throughput and minimize the robot's reaction times. The vision software was also modified to detect differences in the texture of the ground, so that specific surfaces (such as ramps and sand pits) could be identified. While previous iterations of Q failed to detect white lines that were not on a grassy surface, this new software allowed Q to dynamically alter its image processing state so that appropriate thresholds could be applied to detect white lines in changing conditions. In order to maintain an acceptable target heading, a path history algorithm was used to deal with local obstacle fields and GPS waypoints were added to provide a global target heading. These modifications resulted in Q placing 5th in the autonomous challenge and 4th in the navigation challenge at IGVC.

Hierarchical multi-level image mosaicing for autonomous navigation of UAV Paper

Sangho Park, Debabrata Ghosh, Naima Kaabouch, Ronald A. Fevig, William Semke

Proceedings Volume Intelligent Robots and Computer Vision XXIX: Algorithms and Techniques, 830116 (2012) https://doi.org/10.1117/12.910334

Read Abstract +

A fluidic lens with reduced optical aberration Paper

Jei-Yin Yiu, Robert Batchko, Sam Robinson, Andrei Szilagyi

Proceedings Volume Intelligent Robots and Computer Vision XXIX: Algorithms and Techniques, 830117 (2012) https://doi.org/10.1117/12.918337

Read Abstract +

Keywords/Phrases

Search In:

Publication Years