research-article

DyCo: Dynamic, Contextualized AI Models

Authors:

Murugan Sankaradas,

Srimat ChakradharAuthors Info & Claims

ACM Transactions on Embedded Computing Systems, Volume 21, Issue 6

Article No.: 76, Pages 1 - 21

https://doi.org/10.1145/3520131

Published: 12 December 2022 Publication History

Abstract

Devices with limited computing resources use smaller AI models to achieve low-latency inferencing. However, model accuracy is typically much lower than the accuracy of a bigger model that is trained and deployed in places where the computing resources are relatively abundant. We describe DyCo, a novel system that ensures privacy of stream data and dynamically improves the accuracy of small models used in devices. Unlike knowledge distillation or federated learning, DyCo treats AI models as black boxes. DyCo uses a semi-supervised approach to leverage existing training frameworks and network model architectures to periodically train contextualized, smaller models for resource-constrained devices. DyCo uses a bigger, highly accurate model in the edge-cloud to auto-label data received from each sensor stream. Training in the edge-cloud (as opposed to the public cloud) ensures data privacy, and bespoke models for thousands of live data streams can be designed in parallel by using multiple edge-clouds. DyCo uses the auto-labeled data to periodically re-train, stream-specific, bespoke small models. To reduce the periodic training costs, DyCo uses different policies that are based on stride, accuracy, and confidence information.

We evaluate our system, and the contextualized models, by using two object detection models for vehicles and people, and two datasets (a public benchmark and another real-world proprietary dataset). Our results show that DyCo increases the mAP accuracy measure of small models by an average of 16.3% (and up to 20%) for the public benchmark and an average of 19.0% (and up to 64.9%) for the real-world dataset. DyCo also decreases the training costs for contextualized models by more than an order of magnitude.

References

[1]

Hamidreza Arasteh, Vahid Hosseinnezhad, Vincenzo Loia, Aurelio Tommasetti, Orlando Troisi, Miadreza Shafie-khah, and Pierluigi Siano. 2016. IoT-based smart cities: A survey. In IEEE 16th International Conference on Environment and Electrical Engineering (EEEIC). IEEE, 1–6.

[2]

Alcardo Alex Barakabitze, Arslan Ahmad, Rashid Mijumbi, and Andrew Hines. 2020. 5G network slicing using SDN and NFV: A survey of taxonomy, architectures and future challenges. Comput. Netw. 167 (Feb.2020). DOI:

Digital Library

[3]

Keith Bonawitz, Hubert Eichner, Wolfgang Grieskamp, Dzmitry Huba, Alex Ingerman, Vladimir Ivanov, Chloe Kiddon, Jakub Konečnỳ, Stefano Mazzocchi, H. Brendan McMahan, et al. 2019. Towards federated learning at scale: System design. arXiv preprint arXiv:1902.01046 (2019).

[4]

Samia Bouyakoub, Abdelkader Belkhir, Fayçal M’hamed Bouyakoub, and Wassila Guebli. 2017. Smart airport: An IoT-based airport management system. In International Conference on Future Networks and Distributed Systems. 1–7.

Digital Library

[5]

Olivier Chapelle, Bernhard Scholkopf, and Alexander Zien. 2009. Semi-supervised learning. IEEE Trans. Neural Netw. 20, 3 (2009), 542–542.

Digital Library

[6]

Guobin Chen, Wongun Choi, Xiang Yu, Tony Han, and Manmohan Chandraker. 2017. Learning efficient object detection models with knowledge distillation. In Conference on Advances in Neural Information Processing Systems. 742–751.

[7]

Chuanqi. 2017. A caffe implementation of MobileNet-SSD detection network. Retrieved from https://github.com/chuanqi305/MobileNet-SSD.

[8]

Harshit Daga, Patrick K. Nicholson, Ada Gavrilovska, and Diego Lugones. 2019. Cartel: A system for collaborative transfer learning at the edge. In ACM Symposium on Cloud Computing. 25–37.

[9]

Nhu-Ngoc Dao, Woongsoo Na, and Sungrae Cho. 2020. Mobile cloudization storytelling: Current issues from optimization perspective. IEEE Internet Comput. PP (12020), 1–1. DOI:

[10]

Patrick Dendorfer, Hamid Rezatofighi, Anton Milan, Javen Shi, Daniel Cremers, Ian Reid, Stefan Roth, Konrad Schindler, and Laura Leal-Taixé. 2020. Mot20: A benchmark for multi object tracking in crowded scenes. arXiv preprint arXiv:2003.09003 (2020).

[11]

Biyi Fang, Xiao Zeng, Faen Zhang, Hui Xu, and Mi Zhang. 2020. FlexDNN: Input-adaptive on-device deep learning for efficient mobile vision. In 5th ACM/IEEE Symposium on Edge Computing (SEC).

[12]

Ross Girshick, Ilija Radosavovic, Georgia Gkioxari, Piotr Dollár, and Kaiming He. 2018. Detectron. Retrieved from https://github.com/facebookresearch/detectron.

[13]

Bing Han and Kaushik Roy. 2018. DeltaFrame-BP: An algorithm using frame difference for deep convolutional neural networks training and inference on video data. IEEE Trans. Multi-scale Comput. Syst. 4, 4 (2018), 624–634.

[14]

Haowei Chen, Liekang Zeng, Shuai Yu, and Xu Chen. 2020. Knowledge distillation for mobile edge computation offloading. ZTE Commun. 18, 2 (2020), 40–48.

[15]

Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep residual learning for image recognition. In IEEE Conference on Computer Vision and Pattern Recognition. 770–778.

[16]

Geoffrey Hinton, Oriol Vinyals, and Jeff Dean. 2015. Distilling the knowledge in a neural network. arXiv preprint arXiv:1503.02531 (2015).

[17]

Andrew G. Howard, Menglong Zhu, Bo Chen, Dmitry Kalenichenko, Weijun Wang, Tobias Weyand, Marco Andreetto, and Hartwig Adam. 2017. MobileNets: Efficient convolutional neural networks for mobile vision applications. arXiv preprint arXiv:1704.04861 (2017).

[18]

Kevin Hsieh, Aaron Harlap, Nandita Vijaykumar, Dimitris Konomis, Gregory R. Ganger, Phillip B. Gibbons, and Onur Mutlu. 2017. Gaia: Geo-distributed machine learning approaching \(\lbrace\)LAN\(\rbrace\) speeds. In 14th USENIX Symposium on Networked Systems Design and Implementation (NSDI’17). 629–647.

[19]

Yiping Kang, Johann Hauswald, Cao Gao, Austin Rovinski, Trevor Mudge, Jason Mars, and Lingjia Tang. 2017. Neurosurgeon: Collaborative intelligence between the cloud and mobile edge. ACM SIGARCH Comput. Archit. News 45, 1 (2017), 615–629.

Digital Library

[20]

En Li, Liekang Zeng, Zhi Zhou, and Xu Chen. 2019. Edge AI: On-demand accelerating deep neural network inference via edge computing. IEEE Trans. Wirel. Commun. 19, 1 (2019), 447–457.

[21]

Tsung-Yi Lin, Michael Maire, Serge Belongie, Lubomir Bourdev, Ross Girshick, James Hays, Pietro Perona, Deva Ramanan, C. Lawrence Zitnick, and Piotr Dollár. 2015. Microsoft COCO: Common Objects in Context. arXiv:cs.CV/1405.0312.

[22]

Wei Liu, Dragomir Anguelov, Dumitru Erhan, Christian Szegedy, Scott Reed, Cheng-Yang Fu, and Alexander C. Berg. 2016. SSD: Single shot multibox detector. In European Conference on Computer Vision. Springer, 21–37.

[23]

Ichiro Masaki. 1998. Machine-vision systems for intelligent transportation systems. IEEE Intell. Syst. Applic. 13, 6 (1998), 24–31.

Digital Library

[24]

Francisco Massa and Ross Girshick. 2018. maskrcnn-benchmark: Fast, modular reference implementation of Instance Segmentation and Object Detection algorithms in PyTorch. Retrieved from https://github.com/facebookresearch/maskrcnn-benchmark. (2018).

[25]

Yoshitomo Matsubara, Sabur Baidya, Davide Callegaro, Marco Levorato, and Sameer Singh. 2019. Distilled split deep neural networks for edge-assisted real-time systems. In Workshop on Hot Topics in Video Analytics and Intelligent Edges. 21–26.

[26]

Ravi Teja Mullapudi, Steven Chen, Keyi Zhang, Deva Ramanan, and Kayvon Fatahalian. 2019. Online model distillation for efficient video inference. In IEEE International Conference on Computer Vision. 3573–3582.

[27]

Shadi A. Noghabi, John Kolb, Peter Bodik, and Eduardo Cuervo. 2018. Steel: Simplified development and deployment of edge-cloud applications. In 10th USENIX Workshop on Hot Topics in Cloud Computing (HotCloud’18).

[28]

Yanwei Pang, Yuan Yuan, Xuelong Li, and Jing Pan. 2011. Efficient HOG human detection. Sig. Process. 91, 4 (2011), 773–781.

Digital Library

[29]

Adam Paszke, Sam Gross, Francisco Massa, Adam Lerer, James Bradbury, Gregory Chanan, Trevor Killeen, Zeming Lin, Natalia Gimelshein, Luca Antiga, et al. 2019. PyTorch: An imperative style, high-performance deep learning library. In Conference on Advances in Neural Information Processing Systems. 8024–8035.

[30]

Manouchehr Rafie. 2020. AI-Powered Camera Sensors. Retrieved from https://www.gyrfalcontech.ai/ai-powered-camera-sensors-whitepaper/. (2020).

[31]

Shaoqing Ren, Kaiming He, Ross Girshick, and Jian Sun. 2015. Faster R-CNN: Towards real-time object detection with region proposal networks. In Conference on Advances in Neural Information Processing Systems. 91–99.

[32]

Shreyak Sawhney, Karan Kacker, Samyak Jain, Shailendra Narayan Singh, and Rakesh Garg. 2019. Real-time smart attendance system using face recognition techniques. In 9th International Conference on Cloud Computing, Data Science & Engineering (Confluence). IEEE, 522–525.

[33]

Florian Schroff, Dmitry Kalenichenko, and James Philbin. 2015. FaceNet: A unified embedding for face recognition and clustering. In IEEE Conference on Computer Vision and Pattern Recognition. 815–823.

[34]

Amr Suleiman, Yu-Hsin Chen, Joel Emer, and Vivienne Sze. 2017. Towards closing the energy gap between HOG and CNN features for embedded vision. In IEEE International Symposium on Circuits and Systems (ISCAS). 1–4. DOI:

[35]

Yunchuan Sun, Junsheng Zhang, Yongping Xiong, and Guangyu Zhu. 2014. Data security and privacy in cloud computing. Int. J. Distrib. Sensor Netw. 10, 7 (2014), 190903.

[36]

Hui Suo, Zhuohua Liu, Jiafu Wan, and Keliang Zhou. 2013. Security and privacy in mobile cloud computing. In 9th International Wireless Communications and Mobile Computing Conference (IWCMC). IEEE, 655–659.

[37]

Feng Tang, Shane Brennan, Qi Zhao, and Hai Tao. 2007. Co-tracking using semi-supervised support vector machines. In IEEE 11th International Conference on Computer Vision. IEEE, 1–8.

[38]

Surat Teerapittayanon, Bradley McDanel, and Hsiang-Tsung Kung. 2017. Distributed deep neural networks over the cloud, the edge and end devices. In IEEE 37th International Conference on Distributed Computing Systems (ICDCS). IEEE, 328–339.

[39]

Zhi-Hua Zhou. 2018. A brief introduction to weakly supervised learning. Nat. Sci. Rev. 5, 1 (2018), 44–53.

[40]

Zhi-Hua Zhou and Ming Li. 2005. Semi-supervised regression with co-training. In International Joint Conferences on Artificial Intelligence, Vol. 5. 908–913.

[41]

Xiaojin Jerry Zhu. 2005. Semi-supervised Learning Literature Survey. Technical Report. University of Wisconsin-Madison Department of Computer Sciences.

Cited By

Index Terms

DyCo: Dynamic, Contextualized AI Models
1. Computer systems organization
  1. Architectures
    1. Other architectures
      1. Neural networks

Recommendations

Transductive Multilabel Learning via Label Set Propagation

The problem of multilabel classification has attracted great interest in the last decade, where each instance can be assigned with a set of multiple class labels simultaneously. It has a wide variety of real-world applications, e.g., automatic image ...
Inductive Semi-supervised Multi-Label Learning with Co-Training
KDD '17: Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

In multi-label learning, each training example is associated with multiple class labels and the task is to learn a mapping from the feature space to the power set of label space. It is generally demanding and time-consuming to obtain labels for training ...
Federated semi-supervised learning with tolerant guidance and powerful classifier in edge scenarios
Abstract
Federated Learning is a distributed machine learning method that offers inherent advantages in efficient learning and privacy protection within edge computing scenarios. However, terminal nodes often encounter challenges such as insufficient ...

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Embedded Computing Systems

ACM Transactions on Embedded Computing Systems Volume 21, Issue 6

November 2022

498 pages

ISSN:1539-9087

EISSN:1558-3465

DOI:10.1145/3561948

Editor:
Tulika Mitra
National University of Singapore, Singapore

Issue’s Table of Contents

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Journal Family

ACM Journals for the Design of Smart and Connected Systems

Publication History

Published: 12 December 2022

Online AM: 26 March 2022

Accepted: 19 February 2022

Revised: 23 January 2022

Received: 15 July 2021

Published in TECS Volume 21, Issue 6

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article
Refereed

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

0
Total Citations
246
Total Downloads

Downloads (Last 12 months)39
Downloads (Last 6 weeks)1

Reflects downloads up to 27 Jan 2025

Other Metrics

View Author Metrics

Citations

Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Full Text

View this article in Full Text.

HTML Format

View this article in HTML Format.

Figures

Tables

Media

View full text|Download PDF

View Issue’s Table of Contents