Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2733373.2806324acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
short-paper

Using the Eyes to "See" the Objects

Published: 13 October 2015 Publication History

Abstract

This paper investigates how to exploit eye gaze data for understanding visual content. In particular, we propose a human-in-the-loop approach for object segmentation in videos, where humans provide significant cues on spatiotemporal relations between object parts (i.e. superpixels in our approach) by simply looking at video sequences. Such constraints, together with object appearance properties, are encoded into an energy function so as to tackle the segmentation problem as a labeling one. The proposed method uses gaze data from only two people and was tested on two challenging visual benchmarks: 1) SegTrack v2 and 2) FBMS-59. The achieved performance showed how our method outperformed more complex video object segmentation approaches, while reducing the effort needed for collecting human feedback

References

[1]
Image segmentation by iterated region merging with localized graph cuts. Pattern Recognition 44, 2527 -- 2538.
[2]
Achanta, R., Shaji, A., and Smith, K. SLIC superpixels compared to state-of-the-art superpixel methods. IEEE PAMI 6, 1 (2012), 1--8.
[3]
Brutzer, S., Hoferlin, B., and Heidemann, G. Evaluation of background subtraction techniques for video surveillance. In CVPR 2011, pp. 1937--1944.
[4]
Deng, J., Krause, J., and Fei-Fei, L. Fine-grained crowdsourcing for fine-grained recognition. In CVPR 2013, pp. 580--587.
[5]
Kavasidis, I., Spampinato, C., and Giordano, D. Generation of ground truth for object detection while playing an online game: Productive gaming or recreational working? In CVPRW 2013, pp. 694--699.
[6]
Li, F., Kim, T., Humayun, A., Tsai, D., and Rehg, J. M. Video segmentation by tracking many figure-ground segments. In ICCV (2013).
[7]
Lim, J., and Han, B. Generalized background subtraction using superpixels with label integrated motion estimation. In ECCV 2014. 2014, pp. 173--187.
[8]
Maji, S. Discovering a lexicon of parts and attributes. In ECCV 2012 Workshops. 2012, pp. 21--30.
[9]
Mishra, A. K., and Aloimonos, Y. Active segmentation with fixation. In ICCV (2009).
[10]
Mohedano, E., Healy, G., McGuinness, K., Giró-i Nieto, X., O'Connor, N. E., and Smeaton, A. F. Object segmentation in images using eeg signals. In ACM MM '14 (New York, NY, USA), ACM, pp. 417--426.
[11]
Ochs, P., Malik, J., and Brox, T. Segmentation of moving objects by long term video analysis. IEEE PAMI 36, 6, 1187--1200.
[12]
Papazoglou, A., and Ferrari, V. Fast Object Segmentation in Unconstrained Video. ICCV'13, 1777--1784.
[13]
Shanmuga Vadivel, K., Ngo, T., Eckstein, M., and Manjunath, B. Eye tracking assisted extraction of attentionally important objects from videos.
[14]
Walber, T., Scherp, A., and Staab, S. Can you see it? two novel eye-tracking-based measures for assigning tags to image regions. In Advances in Multimedia Modeling, vol. 7732. 2013, pp. 36--46.
[15]
Wu, J., Zhao, Y., Zhu, J.-Y., Luo, S., and Tu, Z. Milcut: A sweeping line multiple instance learning paradigm for interactive image segmentation. In CVPR 2014, pp. 256--263.

Cited By

View all
  • (2019)Data-Intensive Workflow Management: For Clouds and Data-Intensive and Scalable Computing EnvironmentsSynthesis Lectures on Data Management10.2200/S00915ED1V01Y201904DTM06014:4(1-179)Online publication date: 13-May-2019
  • (2017)Implicit Vs. Explicit Human Feedback for Interactive Video Object SegmentationNew Trends in Image Analysis and Processing – ICIAP 201710.1007/978-3-319-70742-6_12(131-142)Online publication date: 31-Dec-2017

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
MM '15: Proceedings of the 23rd ACM international conference on Multimedia
October 2015
1402 pages
ISBN:9781450334594
DOI:10.1145/2733373
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 13 October 2015

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. eye-tracking
  2. human in the loop
  3. video object segmentation

Qualifiers

  • Short-paper

Conference

MM '15
Sponsor:
MM '15: ACM Multimedia Conference
October 26 - 30, 2015
Brisbane, Australia

Acceptance Rates

MM '15 Paper Acceptance Rate 56 of 252 submissions, 22%;
Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)9
  • Downloads (Last 6 weeks)0
Reflects downloads up to 27 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2019)Data-Intensive Workflow Management: For Clouds and Data-Intensive and Scalable Computing EnvironmentsSynthesis Lectures on Data Management10.2200/S00915ED1V01Y201904DTM06014:4(1-179)Online publication date: 13-May-2019
  • (2017)Implicit Vs. Explicit Human Feedback for Interactive Video Object SegmentationNew Trends in Image Analysis and Processing – ICIAP 201710.1007/978-3-319-70742-6_12(131-142)Online publication date: 31-Dec-2017

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media