Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Video Object Segmentation

Tasks, Datasets, and Methods

  • Book
  • © 2024

Overview

  • Provides a thorough introduction to the most common problem settings, including semi-supervised VOS and unsupervised VOS
  • Discusses recent progress in video object segmentation, including new datasets, methods, and experimental findings
  • Aids readers to gain a better understanding of the most important problems and advances via real-world examples

Part of the book series: Synthesis Lectures on Computer Vision (SLCV)

  • 154 Accesses

This is a preview of subscription content, log in via an institution to check access.

Access this book

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

eBook USD 34.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book USD 44.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Other ways to access

Licence this eBook for your library

Institutional subscriptions

About this book

This book provides a thorough overview of recent progress in video object segmentation, providing researchers and industrial practitioners with thorough information on the most important problems and developed technologies in the area. Video segmentation is a fundamental topic for video understanding in computer vision. Segmenting unique objects in a given video is useful for a variety of applications, including video conference, video editing, surveillance, and autonomous driving. Given the revolution of deep learning in computer vision problems, numerous new tasks, datasets, and methods have been recently proposed in the domain of segmentation. The book includes these recent results and findings in large-scale video object segmentation as well as benchmarks in large-scale human-centric video analysis in complex events. The authors provide readers with a comprehensive understanding of the challenges involved in video object segmentation, as well as the most effective methods for resolving them. 

Similar content being viewed by others

Keywords

Table of contents (3 chapters)

Authors and Affiliations

  • Adobe Research, San Jose, USA

    Ning Xu

  • Department of Electronic Engineering, Shanghai Jiao Tong University, Shanghai, China

    Weiyao Lin

  • School of Software, Shandong University, Jinan City, China

    Xiankai Lu

  • School of Computer and Information Technology, Beijing Jiaotong University, Beijing, China

    Yunchao Wei

About the authors

Ning Xu, Ph.D., is a Research Scientist at Adobe Research. He received his Ph.D. in Electrical and Computer Engineering from the University of Illinois at Urbana-Champaign. He was an organizer of the first and second LSVOS Challenge in ECCV 2018 and ICCV 2019. He is also an organizer for the ACM MM 2020 grand challenge “Large-scale Human-centric Video Analysis in Complex Events” and ACCV 2020 tutorial “Spatial –Temporal Parsing of Objects: From Segmentation to Actions”. His research interests include image and video segmentation.


Weiyao Lin, Ph.D. is a Professor at Shanghai Jiao Tong University. He received his B.S. and M.E. from Shanghai Jiao Tong University and Ph.D. degree from the University of Washington, all in electrical engineering.


Xiankai Lu, Ph.D., is a Research Professor at Shandong University. Prior to this role, he was a research associate with Inception Institute of Artificial Intelligence at Abu Dhabi, UAE. Dr. Lu received a B. E. from the Department of Automation at Shan Dong University.


Yunchao Wei, Ph.D, is a Professor in the Center of Digital Media Information Processing at Beijing Jiaotong University. He received his Ph.D. from Beijing Jiaotong University. His current research interests include visual recognition with imperfect data, image/video segmentation and object detection, and multi-modal perception.

Bibliographic Information

Publish with us