
EQUI-VOCAL Demonstration: Synthesizing Video Queries from User Interactions

Published: 01 August 2023

Abstract

We demonstrate EQUI-VOCAL, a system that synthesizes compositional queries over videos from user feedback. EQUI-VOCAL enables users to query a video database for complex events by providing a few positive and negative examples of what they are looking for and labeling a small number of additional system-selected examples. Using those user inputs, EQUI-VOCAL synthesizes declarative queries that can then retrieve additional instances of the desired events. The demonstration makes two contributions: it introduces EQUI-VOCAL's graphical user interface and enables conference attendees to experiment with EQUI-VOCAL on a variety of queries. Both enable users to gain a better understanding of EQUI-VOCAL's query synthesis approach and to explore the impact of hyperparameters and label noise on system performance.
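The interaction loop described in the abstract (a few initial positive and negative examples, a small number of system-selected examples for the user to label, then a synthesized query that retrieves further matches) can be illustrated with a toy sketch. This is not EQUI-VOCAL's actual algorithm or query language. The video encoding, the predicate names, the candidate query space (ordered sequences of at most two predicates), and the disagreement-based selection rule are all assumptions made for illustration.

```python
from itertools import product

# Hypothetical toy data: each video is a list of frames, and each frame is the
# set of predicate names that hold in it. Names and videos are illustrative.
VIDEOS = {
    "v1": [{"near"}, {"near", "left_of"}, {"far"}],
    "v2": [{"far"}, {"near"}, {"left_of"}],
    "v3": [{"left_of"}, {"far"}, {"near"}],
    "v4": [{"far"}, {"far"}, {"left_of"}],
    "v5": [{"near"}, {"far"}, {"far"}],
}
PREDICATES = ["near", "far", "left_of"]


def matches(query, frames):
    """True if the predicates in `query` hold in order, each in a strictly later frame."""
    pos = 0
    for pred in query:
        while pos < len(frames) and pred not in frames[pos]:
            pos += 1
        if pos == len(frames):
            return False
        pos += 1  # the next predicate must hold in a later frame
    return True


def candidates(max_len=2):
    """Enumerate the (assumed) query space: predicate sequences up to max_len."""
    for n in range(1, max_len + 1):
        yield from product(PREDICATES, repeat=n)


def consistent(labels, max_len=2):
    """Keep only the candidate queries that agree with every label given so far."""
    return [
        q for q in candidates(max_len)
        if all(matches(q, VIDEOS[v]) == y for v, y in labels.items())
    ]


def pick_next(cands, labels):
    """Choose the unlabeled video on which the surviving candidates disagree most."""
    def disagreement(v):
        yes = sum(matches(q, VIDEOS[v]) for q in cands)
        return min(yes, len(cands) - yes)

    unlabeled = [v for v in VIDEOS if v not in labels]
    best = max(unlabeled, key=disagreement, default=None)
    if best is None or disagreement(best) == 0:
        return None  # no remaining video can separate the candidates
    return best


def synthesize(initial_labels, oracle, budget=3):
    """Alternate between asking for one label and pruning inconsistent queries."""
    labels = dict(initial_labels)
    cands = consistent(labels)
    for _ in range(budget):
        video = pick_next(cands, labels)
        if video is None:
            break
        labels[video] = oracle(video)  # stands in for the user's feedback
        cands = consistent(labels)
    return cands, labels


if __name__ == "__main__":
    # Simulated user whose intended event is "near, then later left_of".
    target = ("near", "left_of")
    user = lambda v: matches(target, VIDEOS[v])
    queries, labels = synthesize({"v1": True, "v4": False}, user)
    print("surviving queries:", queries)
    print("retrieved:", [v for v in VIDEOS if matches(queries[0], VIDEOS[v])])
```

In this sketch the simulated user stands in for the person labeling examples. A real system would also have to tolerate label noise, which this consistency-based pruning does not.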


Published In

Proceedings of the VLDB Endowment  Volume 16, Issue 12
August 2023
685 pages
ISSN:2150-8097

Publisher

VLDB Endowment

