Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3302424.3303971acmconferencesArticle/Chapter ViewAbstractPublication PageseurosysConference Proceedingsconference-collections
research-article
Public Access

VStore: A Data Store for Analytics on Large Videos

Published: 25 March 2019 Publication History

Abstract

We present VStore, a data store for supporting fast, resource-efficient analytics over large archival videos. VStore manages video ingestion, storage, retrieval, and consumption. It controls video formats along the video data path. It is challenged by i) the huge combinatorial space of video format knobs; ii) the complex impacts of these knobs and their high profiling cost; iii) optimizing for multiple resource types. It explores an idea called backward derivation of configuration: in the opposite direction along the video data path, VStore passes the video quantity and quality expected by analytics backward to retrieval, to storage, and to ingestion. In this process, VStore derives an optimal set of video formats, optimizing for different resources in a progressive manner.
VStore automatically derives large, complex configurations consisting of more than one hundred knobs over tens of video formats. In response to queries, VStore selects video formats catering to the executed operators and the target accuracy. It streams video data from disks through decoder to operators. It runs queries as fast as 362x of video realtime.

References

[1]
2018. Amazon EC2 P3 Instances. https://aws.amazon.com/ec2/instance-types/p3/.
[2]
2018. NVIDIA. https://developer.nvidia.com/deepstream-sdk.
[3]
2018. RollingDB Storage Library. https://github.com/openalpr/rollingdb.
[4]
Daniel J. Abadi, Don Carney, Uğur Çetintemel, Mitch Cherniack, Christian Convey, Sangdon Lee, Michael Stonebraker, Nesime Tatbul, and Stan Zdonik. 2003. Aurora: A New Model and Architecture for Data Stream Management. The VLDB Journal 12, 2 (Aug. 2003), 120--139.
[5]
Martín Abadi, Paul Barham, Jianmin Chen, Zhifeng Chen, Andy Davis, Jeffrey Dean, Matthieu Devin, Sanjay Ghemawat, Geoffrey Irving, Michael Isard, Manjunath Kudlur, Josh Levenberg, Rajat Monga, Sherry Moore, Derek G. Murray, Benoit Steiner, Paul Tucker, Vijay Vasudevan, Pete Warden, Martin Wicke, Yuan Yu, and Xiaoqiang Zheng. 2016. TensorFlow: A System for Large-Scale Machine Learning. In 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI 16). USENIX Association, Savannah, GA, 265--283. https://www.usenix.org/conference/osdi16/technical-sessions/presentation/abadi
[6]
Nitin Agrawal and Ashish Vulimiri. 2017. Low-Latency Analytics on Colossal Data Streams with SummaryStore. In Proceedings of the 26th Symposium on Operating Systems Principles (SOSP '17). ACM, New York, NY, USA, 647--664.
[7]
Michael P Andersen and David E. Culler. 2016. BTrDB: Optimizing Storage System Design for Timeseries Processing. In 14th USENIX Conference on File and Storage Technologies (FAST 16). USENIX Association, Santa Clara, CA, 39--52. https://www.usenix.org/conference/fast16/technical-sessions/presentation/andersen
[8]
Brian Babcock, Shivnath Babu, Mayur Datar, Rajeev Motwani, and Jennifer Widom. 2002. Models and Issues in Data Stream Systems. In Proceedings of the Twenty-first ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems (PODS '02). ACM, New York, NY, USA, 1--16.
[9]
Doug Beaver, Sanjeev Kumar, Harry C. Li, Jason Sobel, and Peter Vajgel. 2010. Finding a Needle in Haystack: Facebook's Photo Storage. In Proceedings of the 9th USENIX Conference on Operating Systems Design and Implementation (OSDI'10). USENIX Association, Berkeley, CA, USA, 47--60. http://dl.acm.org/citation.cfm?id=1924943.1924947
[10]
E. T. Bell. 1934. Exponential Numbers. The American Mathematical Monthly 41, 7 (1934), 411--419. http://www.jstor.org/stable/2300300
[11]
E. T. Bell. 1934. Exponential Polynomials. Annals of Mathematics 35, 2 (1934), 258--277. http://www.jstor.org/stable/1968431
[12]
Dimitri Bertsekas and Robert Gallager. 1992. Data Networks (2Nd Ed.). Prentice-Hall, Inc., Upper Saddle River, NJ, USA.
[13]
Don Carney, Ugur Çetintemel, Mitch Cherniack, Christian Convey, Sangdon Lee, Greg Seidman, Michael Stonebraker, Nesime Tatbul, and Stan Zdonik. 2002. Monitoring Streams: A New Class of Data Management Applications. In Proceedings of the 28th International Conference on Very Large Data Bases (VLDB '02). VLDB Endowment, 215--226. http://dl.acm.org/citation.cfm?id=1287369.1287389
[14]
Jared Casper, Jon Barker, and Bryan Catanzaro. 2018. NVVL: NVIDIA Video Loader. https://github.com/NVIDIA/nvvl.
[15]
Y. Chen, X. Zhu, W. Zheng, and J. Lai. 2018. Person Re-Identification by Camera Correlation Aware Feature Augmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence 40, 2 (Feb 2018), 392--408.
[16]
Yongxi Cheng, Xiaoming Sun, and Yiqun Lisa Yin. 2008. Searching monotone multi-dimensional arrays. Discrete Mathematics 308, 11 (2008), 2213--2221.
[17]
Tzi-cker Chiueh and Randy H. Katz. 1993. Multi-resolution Video Representation for Parallel Disk Arrays. In Proceedings of the First ACM International Conference on Multimedia (MULTIMEDIA '93). ACM, New York, NY, USA, 401--409.
[18]
Ziqiang Feng, Shilpa George, Jan Harkes, Padmanabhan Pillai, Roberta Klatzky, and Mahadev Satyanarayanan. 2019. Eureka: Edge-based Discovery of Training Data for Machine Learning. IEEE Internet Computing PP (01 2019), 1--1.
[19]
Ziqiang Feng, Junjue Wang, Jan Harkes, Padmanabhan Pillai, and Mahadev Satyanarayanan. 2018. EVA: An Efficient System for Exploratory Video Analysis. SysML (2018).
[20]
Sadjad Fouladi, Riad S. Wahby, Brennan Shacklett, Karthikeyan Vasuki Balasubramaniam, William Zeng, Rahul Bhalerao, Anirudh Sivaraman, George Porter, and Keith Winstein. 2017. Encoding, Fast and Slow: Low-Latency Video Processing Using Thousands of Tiny Threads. In 14th USENIX Symposium on Networked Systems Design and Implementation (NSDI 17). USENIX Association, Boston, MA, 363--376. https://www.usenix.org/conference/nsdi17/technical-sessions/presentation/fouladi
[21]
Ross Girshick. 2015. Fast R-CNN. In Proceedings of the 2015 IEEE International Conference on Computer Vision (ICCV) (ICCV '15). IEEE Computer Society, Washington, DC, USA, 1440--1448.
[22]
Vishakha Gupta-Cledat, Luis Remis, and Christina R Strong. 2017. Addressing the Dark Side of Vision Research: Storage. In 9th USENIX Workshop on Hot Topics in Storage and File Systems (HotStorage 17). USENIX Association, Santa Clara, CA. https://www.usenix.org/conference/hotstorage17/program/presentation/gupta-cledat
[23]
Seungyeop Han, Haichen Shen, Matthai Philipose, Sharad Agarwal, Alec Wolman, and Arvind Krishnamurthy. 2016. MCDNN: An Approximation-Based Execution Framework for Deep Stream Processing Under Resource Constraints. In Proceedings of the 14th Annual International Conference on Mobile Systems, Applications, and Services (MobiSys '16). ACM, New York, NY, USA, 123--136.
[24]
Kevin Hsieh, Ganesh Ananthanarayanan, Peter Bodik, Shivaram Venkataraman, Paramvir Bahl, Matthai Philipose, Phillip B. Gibbons, and Onur Mutlu. 2018. Focus: Querying Large Video Datasets with Low Latency and Low Cost. In 13th USENIX Symposium on Operating Systems Design and Implementation (OSDI 18). USENIX Association, Carlsbad, CA. https://www.usenix.org/conference/osdi18/presentation/hsieh
[25]
Qi Huang, Petchean Ang, Peter Knowles, Tomasz Nykiel, Iaroslav Tverdokhlib, Amit Yajurvedi, Paul Dapolito, IV, Xifan Yan, Maxim Bykov, Chuen Liang, Mohit Talwar, Abhishek Mathur, Sachin Kulkarni, Matthew Burke, and Wyatt Lloyd. 2017. SVE: Distributed Video Processing at Facebook Scale. In Proceedings of the 26th Symposium on Operating Systems Principles (SOSP '17). ACM, New York, NY, USA, 87--103.
[26]
Chien-Chun Hung, Ganesh Ananthanarayanan, Peter BodÃŋk, Leana Golubchik, Minlan Yu, Victor Bahl, and Matthai Philipose. 2018. VideoEdge: Processing Camera Streams using Hierarchical Clusters. https://www.microsoft.com/en-us/research/publication/videoedge-processing-camera-streams-using-hierarchical-clusters/
[27]
IHS. 2016. Top Video Surveillance Trends for 2016.
[28]
IHS. 2018. Top Video Surveillance Trends for 2018.
[29]
iMatix Corporation. 2018. Lightning Memory-mapped Database. https://symas.com/lmdb/.
[30]
InfluxData. 2018. InfluxDB. https://www.influxdata.com/.
[31]
Samvit Jain, Ganesh Ananthanarayanan, Junchen Jiang, Yuanchao Shu, and Joseph E Gonzalez. 2018. Scaling Video Analytics Systems to Large Camera Deployments. arXiv preprint arXiv:1809.02318 (2018).
[32]
Junchen Jiang, Ganesh Ananthanarayanan, Peter Bodik, Siddhartha Sen, and Ion Stoica. 2018. Chameleon: Scalable Adaptation of Video Analytics. In Proceedings of the 2018 Conference of the ACM Special Interest Group on Data Communication (SIGCOMM '18). ACM, New York, NY, USA, 253--266.
[33]
Daniel Kang, Peter Bailis, and Matei Zaharia. 2018. BlazeIt: Fast Exploratory Video Queries using Neural Networks. arXiv preprint arXiv:1805.01046 (2018).
[34]
Daniel Kang, John Emmons, Firas Abuzaid, Peter Bailis, and Matei Zaharia. 2017. NoScope: Optimizing Neural Network Queries over Video at Scale. Proc. VLDB Endow. 10, 11 (Aug. 2017), 1586--1597.
[35]
Sooyong Kang, Sungwoo Hong, and Youjip Won. 2009. Storage technique for realtime streaming of layered video. Multimedia Systems 15, 2 (01 Apr 2009), 63--81.
[36]
Kimberly Keeton and Randy H. Katz. 1995. Evaluating video layout strategies for a high-performance storage server. Multimedia Systems 3, 2 (01 May 1995), 43--52.
[37]
Christian Kreuzberger, Daniel Posch, and Hermann Hellwagner. 2015. A Scalable Video Coding Dataset and Toolchain for Dynamic Adaptive Streaming over HTTP. In Proceedings of the 6th ACM Multimedia Systems Conference (MMSys '15). ACM, New York, NY, USA, 213--218.
[38]
Alex Krizhevsky, Ilya Sutskever, and Geoffrey E Hinton. 2012. ImageNet Classification with Deep Convolutional Neural Networks. In Advances in Neural Information Processing Systems 25, F. Pereira, C.J. C. Burges, L. Bottou, and K. Q. Weinberger (Eds.). Curran Associates, Inc., 1097--1105. http://papers.nips.cc/paper/4824-imagenet-classification-with-deep-convolutional-neural-networks.pdf
[39]
C. A. Lang, B. Bhattacharjee, T. Malkemus, S. Padmanabhan, and K. Wong. 2007. Increasing Buffer-Locality for Multiple Relational Table Scans through Grouping and Throttling. In 2007 IEEE 23rd International Conference on Data Engineering. 1136--1145.
[40]
Nathan Linial and Michael Saks. 1985. Searching ordered structures. Journal of algorithms 6, 1 (1985), 86--103.
[41]
Peng Liu, Jongwon Yoon, Lance Johnson, and Suman Banerjee. 2016. Greening the Video Transcoding Service with Low-Cost Hardware Transcoders. In 2016 USENIX Annual Technical Conference (USENIX ATC 16). USENIX Association, Denver, CO, 407--419. https://www.usenix.org/conference/atc16/technical-sessions/presentation/liu
[42]
Shayan Modiri Assari, Haroon Idrees, and Mubarak Shah. 2016. Human Re-identification in Crowd Videos Using Personal, Social and Environmental Constraints. Springer International Publishing, Cham, 119--136.
[43]
Ingo Molnár. 2007. {patch} Modular Scheduler Core and Completely Fair Scheduler. http://lwn.net/Articles/230501/.
[44]
Rajeev Motwani, Jennifer Widom, Arvind Arasu, Brian Babcock, Shivnath Babu, Mayur Datar, Gurmeet Manku, Chris Olston, Justin Rosenstein, and Rohit Varma. 2003. Query Processing, Resource Management, and Approximation in a Data Stream Management System. In IN CIDR. 245--256.
[45]
JungHwan Oh and Kien A. Hua. 2000. Efficient and Cost-effective Techniques for Browsing and Indexing Large Video Databases. In Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data (SIGMOD '00). ACM, New York, NY, USA, 415--426.
[46]
OpenALPR Technology, Inc. 2018. OpenALPR. https://github.com/openalpr/openalpr.
[47]
OpenCV. 2018. Contours.
[48]
OpenCV. 2018. Optical Flow.
[49]
Chrisma Pakha, Aakanksha Chowdhery, and Junchen Jiang. 2018. Reinventing Video Streaming for Distributed Vision Analytics. In 10th USENIX Workshop on Hot Topics in Cloud Computing (HotCloud 18). USENIX Association, Boston, MA. https://www.usenix.org/conference/hotcloud18/presentation/pakha
[50]
Stavros Papadopoulos, Kushal Datta, Samuel Madden, and Timothy Mattson. 2016. The TileDB Array Data Storage Manager. Proc. VLDB Endow. 10, 4 (Nov. 2016), 349--360.
[51]
Alex Poms, Will Crichton, Pat Hanrahan, and Kayvon Fatahalian. 2018. Scanner: Efficient Video Analysis at Scale. ACM Trans. Graph. 37, 4, Article 138 (July 2018), 13 pages.
[52]
Lin Qiao, Vijayshankar Raman, Frederick Reiss, Peter J. Haas, and Guy M. Lohman. 2008. Main-memory Scan Sharing for Multicore CPUs. Proc. VLDB Endow. 1, 1 (Aug. 2008), 610--621.
[53]
J. Redmon, S. Divvala, R. Girshick, and A. Farhadi. 2016. You Only Look Once: Unified, Real-Time Object Detection. In 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 779--788.
[54]
Oracle. 2015. Dramatically Reduce the Cost and Complexity of Video Surveillance Storage. https://www.oracle.com/assets/wp-video-surveillance-storage-2288409.pdf.
[55]
Luis Remis, Vishakha Gupta-Cledat, Christina R. Strong, and Margriet IJzerman-Korevaar. 2018. VDMS: An Efficient Big-Visual-Data Access for Machine Learning Workloads. CoRR abs/1810.11832 (2018).
[56]
Shaoqing Ren, Kaiming He, Ross Girshick, and Jian Sun. 2015. Faster R-CNN: Towards Real-time Object Detection with Region Proposal Networks. In Proceedings of the 28th International Conference on Neural Information Processing Systems - Volume 1 (NIPS'15). MIT Press, Cambridge, MA, USA, 91--99. http://dl.acm.org/citation.cfm?id=2969239.2969250
[57]
Seagate. 2017. Video Surveillance Trends Report. https://www.seagate.com/files/www-content/solutions-content/surveillance-security-video-analytics/en-us/docs/video-surveillance-trends-report.pdf.
[58]
Timos K. Sellis. 1988. Multiple-query Optimization. ACM Trans. Database Syst. 13, 1 (March 1988), 23--52.
[59]
Michael Seufert, Sebastian Egger, Martin Slanina, Thomas Zinner, Tobias Hossfeld, and Phuoc Tran-Gia. 2015. A survey on quality of experience of HTTP adaptive streaming. IEEE Communications Surveys & Tutorials 17, 1 (2015), 469--492.
[60]
Haichen Shen, Seungyeop Han, Matthai Philipose, and Arvind Krishnamurthy. 2017. Fast Video Classification via Adaptive Cascading of Deep Models. In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[61]
V. Srinivasan, Brian Bulkowski, Wei-Ling Chu, Sunil Sayyaparaju, Andrew Gooding, Rajkumar Iyer, Ashish Shinde, and Thomas Lopatic. 2016. Aerospike: Architecture of a Realtime Operational DBMS. Proc. VLDB Endow. 9, 13 (Sept. 2016), 1389--1400.
[62]
Nesime Tatbul, Uğur Çetintemel, Stan Zdonik, Mitch Cherniack, and Michael Stonebraker. 2003. Load Shedding in a Data Stream Manager. In Proceedings of the 29th International Conference on Very Large Data Bases - Volume 29 (VLDB '03). VLDB Endowment, 309--320. http://dl.acm.org/citation.cfm?id=1315451.1315479
[63]
Chao-Yuan Wu, Manzil Zaheer, Hexiang Hu, R. Manmatha, Alexander J. Smola, and Philipp Krähenbühl. 2018. Compressed Video Action Recognition. In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[64]
Yi Wu, Jongwoo Lim, and Ming-Hsuan Yang. 2013. Online Object Tracking: A Benchmark. In Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR '13). IEEE Computer Society, Washington, DC, USA, 2411--2418.
[65]
S. Yi, Z. Hao, Q. Zhang, Q. Zhang, W. Shi, and Q. Li. 2017. LAVEA: Latency-Aware Video Analytics on Edge Computing Platform. In 2017 IEEE 37th International Conference on Distributed Computing Systems (ICDCS). 2573--2574.
[66]
J. Yoon, P. Liu, and S. Banerjee. 2016. Low-Cost Video Transcoding at the Wireless Edge. In 2016 IEEE/ACM Symposium on Edge Computing (SEC). 129--141.
[67]
Haoyu Zhang, Ganesh Ananthanarayanan, Peter Bodik, Matthai Philipose, Paramvir Bahl, and Michael J. Freedman. 2017. Live Video Analytics at Scale with Approximation and Delay-Tolerance. In 14th USENIX Symposium on Networked Systems Design and Implementation (NSDI 17). USENIX Association, Boston, MA, 377--392. https://www.usenix.org/conference/nsdi17/technical-sessions/presentation/zhang
[68]
Tan Zhang, Aakanksha Chowdhery, Paramvir (Victor) Bahl, Kyle Jamieson, and Suman Banerjee. 2015. The Design and Implementation of a Wireless Video Surveillance System. In Proceedings of the 21st Annual International Conference on Mobile Computing and Networking (MobiCom '15). ACM, New York, NY, USA, 426--438.
[69]
Marcin Zukowski, Sándor Héman, Niels Nes, and Peter Boncz. 2007. Cooperative Scans: Dynamic Bandwidth Sharing in a DBMS. In Proceedings of the 33rd International Conference on Very Large Data Bases (VLDB '07). VLDB Endowment, 723--734. http://dl.acm.org/citation.cfm?id=1325851.1325934

Cited By

View all
  • (2024)Modeling and Performance Analysis of a Notification-Based Method for Processing Video Queries on the FlyApplied Sciences10.3390/app1409356614:9(3566)Online publication date: 24-Apr-2024
  • (2024)V2V: Efficiently Synthesizing Video Results for Video Queries2024 IEEE 40th International Conference on Data Engineering (ICDE)10.1109/ICDE60146.2024.00449(5614-5621)Online publication date: 13-May-2024
  • (2023)TVM: A Tile-based Video Management FrameworkProceedings of the VLDB Endowment10.14778/3636218.363622417:4(671-684)Online publication date: 1-Dec-2023
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
EuroSys '19: Proceedings of the Fourteenth EuroSys Conference 2019
March 2019
714 pages
ISBN:9781450362818
DOI:10.1145/3302424
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 25 March 2019

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Data Store
  2. Deep Neural Networks
  3. Video Analytics

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Funding Sources

Conference

EuroSys '19
Sponsor:
EuroSys '19: Fourteenth EuroSys Conference 2019
March 25 - 28, 2019
Dresden, Germany

Acceptance Rates

Overall Acceptance Rate 241 of 1,308 submissions, 18%

Upcoming Conference

EuroSys '25
Twentieth European Conference on Computer Systems
March 30 - April 3, 2025
Rotterdam , Netherlands

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)625
  • Downloads (Last 6 weeks)105
Reflects downloads up to 04 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Modeling and Performance Analysis of a Notification-Based Method for Processing Video Queries on the FlyApplied Sciences10.3390/app1409356614:9(3566)Online publication date: 24-Apr-2024
  • (2024)V2V: Efficiently Synthesizing Video Results for Video Queries2024 IEEE 40th International Conference on Data Engineering (ICDE)10.1109/ICDE60146.2024.00449(5614-5621)Online publication date: 13-May-2024
  • (2023)TVM: A Tile-based Video Management FrameworkProceedings of the VLDB Endowment10.14778/3636218.363622417:4(671-684)Online publication date: 1-Dec-2023
  • (2023)Extract-Transform-Load for Video StreamsProceedings of the VLDB Endowment10.14778/3598581.359860016:9(2302-2315)Online publication date: 1-May-2023
  • (2023)OneAdaptProceedings of the 2023 ACM Symposium on Cloud Computing10.1145/3620678.3624653(158-176)Online publication date: 30-Oct-2023
  • (2023)LEED: A Low-Power, Fast Persistent Key-Value Store on SmartNIC JBOFsProceedings of the ACM SIGCOMM 2023 Conference10.1145/3603269.3604880(1012-1027)Online publication date: 10-Sep-2023
  • (2023)A Contributory Public-Event Recording and Querying SystemProceedings of the Eighth ACM/IEEE Symposium on Edge Computing10.1145/3583740.3628445(185-198)Online publication date: 6-Dec-2023
  • (2023)An Edge-Side Real-Time Video Analytics System With Dual Computing Resource ControlIEEE Transactions on Computers10.1109/TC.2023.330113672:12(3399-3415)Online publication date: Dec-2023
  • (2022)Towards causal physical error discovery in video analytics systemsProceedings of the Workshop on Human-In-the-Loop Data Analytics10.1145/3546930.3547495(1-6)Online publication date: 12-Jun-2022
  • (2022)EVA: A Symbolic Approach to Accelerating Exploratory Video Analytics with Materialized ViewsProceedings of the 2022 International Conference on Management of Data10.1145/3514221.3526142(602-616)Online publication date: 10-Jun-2022
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media