Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

DyCo: Dynamic, Contextualized AI Models

Published: 12 December 2022 Publication History

Abstract

Devices with limited computing resources use smaller AI models to achieve low-latency inferencing. However, model accuracy is typically much lower than the accuracy of a bigger model that is trained and deployed in places where the computing resources are relatively abundant. We describe DyCo, a novel system that ensures privacy of stream data and dynamically improves the accuracy of small models used in devices. Unlike knowledge distillation or federated learning, DyCo treats AI models as black boxes. DyCo uses a semi-supervised approach to leverage existing training frameworks and network model architectures to periodically train contextualized, smaller models for resource-constrained devices. DyCo uses a bigger, highly accurate model in the edge-cloud to auto-label data received from each sensor stream. Training in the edge-cloud (as opposed to the public cloud) ensures data privacy, and bespoke models for thousands of live data streams can be designed in parallel by using multiple edge-clouds. DyCo uses the auto-labeled data to periodically re-train, stream-specific, bespoke small models. To reduce the periodic training costs, DyCo uses different policies that are based on stride, accuracy, and confidence information.
We evaluate our system, and the contextualized models, by using two object detection models for vehicles and people, and two datasets (a public benchmark and another real-world proprietary dataset). Our results show that DyCo increases the mAP accuracy measure of small models by an average of 16.3% (and up to 20%) for the public benchmark and an average of 19.0% (and up to 64.9%) for the real-world dataset. DyCo also decreases the training costs for contextualized models by more than an order of magnitude.

References

[1]
Hamidreza Arasteh, Vahid Hosseinnezhad, Vincenzo Loia, Aurelio Tommasetti, Orlando Troisi, Miadreza Shafie-khah, and Pierluigi Siano. 2016. IoT-based smart cities: A survey. In IEEE 16th International Conference on Environment and Electrical Engineering (EEEIC). IEEE, 1–6.
[2]
Alcardo Alex Barakabitze, Arslan Ahmad, Rashid Mijumbi, and Andrew Hines. 2020. 5G network slicing using SDN and NFV: A survey of taxonomy, architectures and future challenges. Comput. Netw. 167 (Feb.2020). DOI:
[3]
Keith Bonawitz, Hubert Eichner, Wolfgang Grieskamp, Dzmitry Huba, Alex Ingerman, Vladimir Ivanov, Chloe Kiddon, Jakub Konečnỳ, Stefano Mazzocchi, H. Brendan McMahan, et al. 2019. Towards federated learning at scale: System design. arXiv preprint arXiv:1902.01046 (2019).
[4]
Samia Bouyakoub, Abdelkader Belkhir, Fayçal M’hamed Bouyakoub, and Wassila Guebli. 2017. Smart airport: An IoT-based airport management system. In International Conference on Future Networks and Distributed Systems. 1–7.
[5]
Olivier Chapelle, Bernhard Scholkopf, and Alexander Zien. 2009. Semi-supervised learning. IEEE Trans. Neural Netw. 20, 3 (2009), 542–542.
[6]
Guobin Chen, Wongun Choi, Xiang Yu, Tony Han, and Manmohan Chandraker. 2017. Learning efficient object detection models with knowledge distillation. In Conference on Advances in Neural Information Processing Systems. 742–751.
[7]
Chuanqi. 2017. A caffe implementation of MobileNet-SSD detection network. Retrieved from https://github.com/chuanqi305/MobileNet-SSD.
[8]
Harshit Daga, Patrick K. Nicholson, Ada Gavrilovska, and Diego Lugones. 2019. Cartel: A system for collaborative transfer learning at the edge. In ACM Symposium on Cloud Computing. 25–37.
[9]
Nhu-Ngoc Dao, Woongsoo Na, and Sungrae Cho. 2020. Mobile cloudization storytelling: Current issues from optimization perspective. IEEE Internet Comput. PP (12020), 1–1. DOI:
[10]
Patrick Dendorfer, Hamid Rezatofighi, Anton Milan, Javen Shi, Daniel Cremers, Ian Reid, Stefan Roth, Konrad Schindler, and Laura Leal-Taixé. 2020. Mot20: A benchmark for multi object tracking in crowded scenes. arXiv preprint arXiv:2003.09003 (2020).
[11]
Biyi Fang, Xiao Zeng, Faen Zhang, Hui Xu, and Mi Zhang. 2020. FlexDNN: Input-adaptive on-device deep learning for efficient mobile vision. In 5th ACM/IEEE Symposium on Edge Computing (SEC).
[12]
Ross Girshick, Ilija Radosavovic, Georgia Gkioxari, Piotr Dollár, and Kaiming He. 2018. Detectron. Retrieved from https://github.com/facebookresearch/detectron.
[13]
Bing Han and Kaushik Roy. 2018. DeltaFrame-BP: An algorithm using frame difference for deep convolutional neural networks training and inference on video data. IEEE Trans. Multi-scale Comput. Syst. 4, 4 (2018), 624–634.
[14]
Haowei Chen, Liekang Zeng, Shuai Yu, and Xu Chen. 2020. Knowledge distillation for mobile edge computation offloading. ZTE Commun. 18, 2 (2020), 40–48.
[15]
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep residual learning for image recognition. In IEEE Conference on Computer Vision and Pattern Recognition. 770–778.
[16]
Geoffrey Hinton, Oriol Vinyals, and Jeff Dean. 2015. Distilling the knowledge in a neural network. arXiv preprint arXiv:1503.02531 (2015).
[17]
Andrew G. Howard, Menglong Zhu, Bo Chen, Dmitry Kalenichenko, Weijun Wang, Tobias Weyand, Marco Andreetto, and Hartwig Adam. 2017. MobileNets: Efficient convolutional neural networks for mobile vision applications. arXiv preprint arXiv:1704.04861 (2017).
[18]
Kevin Hsieh, Aaron Harlap, Nandita Vijaykumar, Dimitris Konomis, Gregory R. Ganger, Phillip B. Gibbons, and Onur Mutlu. 2017. Gaia: Geo-distributed machine learning approaching \(\lbrace\)LAN\(\rbrace\) speeds. In 14th USENIX Symposium on Networked Systems Design and Implementation (NSDI’17). 629–647.
[19]
Yiping Kang, Johann Hauswald, Cao Gao, Austin Rovinski, Trevor Mudge, Jason Mars, and Lingjia Tang. 2017. Neurosurgeon: Collaborative intelligence between the cloud and mobile edge. ACM SIGARCH Comput. Archit. News 45, 1 (2017), 615–629.
[20]
En Li, Liekang Zeng, Zhi Zhou, and Xu Chen. 2019. Edge AI: On-demand accelerating deep neural network inference via edge computing. IEEE Trans. Wirel. Commun. 19, 1 (2019), 447–457.
[21]
Tsung-Yi Lin, Michael Maire, Serge Belongie, Lubomir Bourdev, Ross Girshick, James Hays, Pietro Perona, Deva Ramanan, C. Lawrence Zitnick, and Piotr Dollár. 2015. Microsoft COCO: Common Objects in Context. arXiv:cs.CV/1405.0312.
[22]
Wei Liu, Dragomir Anguelov, Dumitru Erhan, Christian Szegedy, Scott Reed, Cheng-Yang Fu, and Alexander C. Berg. 2016. SSD: Single shot multibox detector. In European Conference on Computer Vision. Springer, 21–37.
[23]
Ichiro Masaki. 1998. Machine-vision systems for intelligent transportation systems. IEEE Intell. Syst. Applic. 13, 6 (1998), 24–31.
[24]
Francisco Massa and Ross Girshick. 2018. maskrcnn-benchmark: Fast, modular reference implementation of Instance Segmentation and Object Detection algorithms in PyTorch. Retrieved from https://github.com/facebookresearch/maskrcnn-benchmark. (2018).
[25]
Yoshitomo Matsubara, Sabur Baidya, Davide Callegaro, Marco Levorato, and Sameer Singh. 2019. Distilled split deep neural networks for edge-assisted real-time systems. In Workshop on Hot Topics in Video Analytics and Intelligent Edges. 21–26.
[26]
Ravi Teja Mullapudi, Steven Chen, Keyi Zhang, Deva Ramanan, and Kayvon Fatahalian. 2019. Online model distillation for efficient video inference. In IEEE International Conference on Computer Vision. 3573–3582.
[27]
Shadi A. Noghabi, John Kolb, Peter Bodik, and Eduardo Cuervo. 2018. Steel: Simplified development and deployment of edge-cloud applications. In 10th USENIX Workshop on Hot Topics in Cloud Computing (HotCloud’18).
[28]
Yanwei Pang, Yuan Yuan, Xuelong Li, and Jing Pan. 2011. Efficient HOG human detection. Sig. Process. 91, 4 (2011), 773–781.
[29]
Adam Paszke, Sam Gross, Francisco Massa, Adam Lerer, James Bradbury, Gregory Chanan, Trevor Killeen, Zeming Lin, Natalia Gimelshein, Luca Antiga, et al. 2019. PyTorch: An imperative style, high-performance deep learning library. In Conference on Advances in Neural Information Processing Systems. 8024–8035.
[30]
Manouchehr Rafie. 2020. AI-Powered Camera Sensors. Retrieved from https://www.gyrfalcontech.ai/ai-powered-camera-sensors-whitepaper/. (2020).
[31]
Shaoqing Ren, Kaiming He, Ross Girshick, and Jian Sun. 2015. Faster R-CNN: Towards real-time object detection with region proposal networks. In Conference on Advances in Neural Information Processing Systems. 91–99.
[32]
Shreyak Sawhney, Karan Kacker, Samyak Jain, Shailendra Narayan Singh, and Rakesh Garg. 2019. Real-time smart attendance system using face recognition techniques. In 9th International Conference on Cloud Computing, Data Science & Engineering (Confluence). IEEE, 522–525.
[33]
Florian Schroff, Dmitry Kalenichenko, and James Philbin. 2015. FaceNet: A unified embedding for face recognition and clustering. In IEEE Conference on Computer Vision and Pattern Recognition. 815–823.
[34]
Amr Suleiman, Yu-Hsin Chen, Joel Emer, and Vivienne Sze. 2017. Towards closing the energy gap between HOG and CNN features for embedded vision. In IEEE International Symposium on Circuits and Systems (ISCAS). 1–4. DOI:
[35]
Yunchuan Sun, Junsheng Zhang, Yongping Xiong, and Guangyu Zhu. 2014. Data security and privacy in cloud computing. Int. J. Distrib. Sensor Netw. 10, 7 (2014), 190903.
[36]
Hui Suo, Zhuohua Liu, Jiafu Wan, and Keliang Zhou. 2013. Security and privacy in mobile cloud computing. In 9th International Wireless Communications and Mobile Computing Conference (IWCMC). IEEE, 655–659.
[37]
Feng Tang, Shane Brennan, Qi Zhao, and Hai Tao. 2007. Co-tracking using semi-supervised support vector machines. In IEEE 11th International Conference on Computer Vision. IEEE, 1–8.
[38]
Surat Teerapittayanon, Bradley McDanel, and Hsiang-Tsung Kung. 2017. Distributed deep neural networks over the cloud, the edge and end devices. In IEEE 37th International Conference on Distributed Computing Systems (ICDCS). IEEE, 328–339.
[39]
Zhi-Hua Zhou. 2018. A brief introduction to weakly supervised learning. Nat. Sci. Rev. 5, 1 (2018), 44–53.
[40]
Zhi-Hua Zhou and Ming Li. 2005. Semi-supervised regression with co-training. In International Joint Conferences on Artificial Intelligence, Vol. 5. 908–913.
[41]
Xiaojin Jerry Zhu. 2005. Semi-supervised Learning Literature Survey. Technical Report. University of Wisconsin-Madison Department of Computer Sciences.

Cited By

View all

Index Terms

  1. DyCo: Dynamic, Contextualized AI Models

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Transactions on Embedded Computing Systems
    ACM Transactions on Embedded Computing Systems  Volume 21, Issue 6
    November 2022
    498 pages
    ISSN:1539-9087
    EISSN:1558-3465
    DOI:10.1145/3561948
    • Editor:
    • Tulika Mitra
    Issue’s Table of Contents

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Journal Family

    Publication History

    Published: 12 December 2022
    Online AM: 26 March 2022
    Accepted: 19 February 2022
    Revised: 23 January 2022
    Received: 15 July 2021
    Published in TECS Volume 21, Issue 6

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Object detector
    2. semi-supervised learning
    3. contextualized
    4. edge computing
    5. edge cloud
    6. deep learning

    Qualifiers

    • Research-article
    • Refereed

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 246
      Total Downloads
    • Downloads (Last 12 months)39
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 27 Jan 2025

    Other Metrics

    Citations

    Cited By

    View all

    View Options

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Full Text

    View this article in Full Text.

    Full Text

    HTML Format

    View this article in HTML Format.

    HTML Format

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media