Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3400302.3415756acmconferencesArticle/Chapter ViewAbstractPublication PagesiccadConference Proceedingsconference-collections
invited-talk

Challenges for building a cloud native scalable and trustable multi-tenant AIoT platform

Published: 17 December 2020 Publication History

Abstract

The arrival of 5G together with advances in artificial intelligence, machine learning, cloud computing, virtualization, and service orchestration have created a ubiquitous computing model at the network edge, enabling a host of new, AI driven edge computing applications. Although edge computing shares many characteristics of cloud computing, there are unique challenges for edge computing to meet the ever growing demands for scalability, security and multi-tenancy, especially in the upcoming 5G era. These challenges are discussed through two typical edge computing use cases: streaming video analytics and industrial IoT. A number of open research problems are discussed to call for help from the design automation community with the focus on new automation methodologies in building a cloud-native end-to-end edge computing platform.

References

[1]
3GPP release 16. https://www.3gpp.org/release-16/.
[2]
Y. Mao, C. You, J. Zhang, K. Huang, and K. B. Letaief. A survey on mobile edge computing: The communication perspective. IEEE Communications Surveys Tutorials, 19(4):2322--2358, 2017.
[3]
L. Dai, B. Wang, Y. Yuan, S. Han, C. I, and Z. Wang. Non-orthogonal multiple access for 5g: solutions, challenges, opportunities, and future research trends. IEEE Communications Magazine, 53(9):74--81, 2015.
[4]
Quoc-Viet Pham and Won-Joo Hwang. Energy-efficient power control for uplink spectrum-sharing heterogeneous networks. International Journal of Communication Systems, 31(14):e3717, 2018.
[5]
P. Rost, C. J. Bernardos, A. D. Domenico, M. D. Girolamo, M. Lalam, A. Maeder, D. Sabella, and D. Wübben. Cloud technologies for flexible 5g radio access networks. IEEE Communications Magazine, 52(5):68--76, 2014.
[6]
Artificial intelligence of things (aiot). https://internetofthingsagenda.techtarget.com/definition/Artificial-Intelligence-of-Things-AIoT, 2019.
[7]
kata container: Open source lightweight container runtime for virtual machines. https://katacontainers.io/.
[8]
C. Li, A. Dakkak, J. Xiong, W. Wei, L. Xu, and W. Hwu. Xsp: Across-stack profiling and analysis of machine learning models on gpus. In 2020 IEEE International Parallel and Distributed Processing Symposium (IPDPS), pages 326--327, 2020.
[9]
CNCF cloud native definition v1.0. https://github.com/cncf/toc/blob/master/DEFINITION.md, 2020.
[10]
Abhishek Verma, Luis Pedrosa, Madhukar Korupolu, David Oppenheimer, Eric Tune, and John Wilkes. Large-scale cluster management at google with borg. In Proceedings of the Tenth European Conference on Computer Systems, EuroSys '15, New York, NY, USA, 2015. Association for Computing Machinery.
[11]
Istio: Kubernetes based service mesh. https://istio.io/.
[12]
Kubeflow: Open source machine learning platform. https://www.kubeflow.org/.
[13]
ETSI MEC white papers. https://portal.etsi.org/TB-SiteMap/MEC/MEC-White-Papers.
[14]
Apache kafka. https://kafka.apache.org/.
[15]
Ceph: Open source distributed object storage. https://ceph.com.
[16]
Knative: Kubernetes-based serverless framework. https://knative.dev/.
[17]
Docker container. https://www.docker.com/.
[18]
Kubernetes: Open source container orchestration platform. https://kubernetes.io/.
[19]
Node-RED: Flow-based iot application development platform. https://nodered.org.
[20]
Real time traffic monitoring and global intelligent traffic control. https://research.redhat.com/blog/engineering_project/a-real-time-streaming-video-analytics-stack-for-the-edge/, 2020.
[21]
MEC and 5G for smart stadiums - changing the sports entertainment game. https://www.nokia.com/blog/mec-5g-smart-stadiums-changing-sports-entertainment-game/, 2018.
[22]
K. Schwab. The fourth industrial revolution. In Foreign Affairs, 2015.
[23]
Standardization of (nb-iot) completed. https://www.3gpp.org/news-events/3gpp-news/1785-nb_iot_complete, 2016.
[24]
MANUela - MANUfacturing Edge Lightweight Accelerator. https://github.com/sa-mw-dach/manuela.
[25]
MQTT: The standard for iot messaging. https://mqtt.org.
[26]
Apache hadoop YARN. https://hadoop.apache.org/docs/current/hadoop-yarn/hadoop-yarn-site/YARN.html.
[27]
Apache mesos. http://mesos.apache.org/.
[28]
Dana Van Aken, Andrew Pavlo, Geoffrey J. Gordon, and Bohan Zhang. Automatic database management system tuning through large-scale machine learning. In Proceedings of the 2017 ACM International Conference on Management of Data, SIGMOD '17, pages 1009--1024, 2017.
[29]
Ji Zhang, Yu Liu, Ke Zhou, Guoliang Li, Zhili Xiao, Bin Cheng, Jiashu Xing, Yangtao Wang, Tianheng Cheng, Li Liu, Minwei Ran, and Zekang Li. An end-to-end automatic cloud database tuning system using deep reinforcement learning. In Proceedings of the 2019 International Conference on Management of Data, SIGMOD '19, page 415--432, New York, NY, USA, 2019. Association for Computing Machinery.
[30]
Yan Li, Kenneth Chang, Oceane Bel, Ethan L. Miller, and Darrell D. E. Long. CAPES: Unsupervised storage performance tuning using neural network-based deep reinforcement learning. In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, SC '17, New York, NY, USA, 2017. Association for Computing Machinery.
[31]
R. R. Noel, R. Mehra, and P. Lama. Towards self-managing cloud storage with reinforcement learning. In 2019 IEEE International Conference on Cloud Engineering (IC2E), pages 34--44, 2019.
[32]
J. Xiong, D. T. Phan, and D. Kung. A resource supply-demand based approach for automatic mapreduce job optimization. In 2015 IEEE 17th International Conference on High Performance Computing and Communications, pages 740--745, 2015.
[33]
LINPACK. https://www.top500.org/project/linpack/.
[34]
TPC-H. http://www.tpc.org/tpch/.
[35]
Brian F. Cooper, Adam Silberstein, Erwin Tam, Raghu Ramakrishnan, and Russell Sears. Benchmarking cloud serving systems with YCSB. In Proceedings of the 1st ACM Symposium on Cloud Computing, SoCC '10, page 143--154, New York, NY, USA, 2010. Association for Computing Machinery.
[36]
Cody A. Coleman, Deepak Narayanan, Daniel Kang, Tian Jiao Zhao, Jian Zhang, Luigi Nardi, Peter Bailis, Kunle Olukotun, Christopher Ré, and Matei Zaharia. Dawnbench: An end-to-end deep learning benchmark and competition. In NIPS ML Systems Workshop, 2017.
[37]
DeepBench: A new benchmark for deep learning models. http://research.baidu.com/Blog/index-view?id=100.
[38]
M. Poess, R. Nambiar, K. Kulkarni, C. Narasimhadevara, T. Rabl, and H. Jacobsen. Analysis of tpcx-iot: The first industry standard benchmark for iot gateway systems. In 2018 IEEE 34th International Conference on Data Engineering (ICDE), pages 1519--1530, 2018.
[39]
Paul Castro, Vatche Ishakian, Vinod Muthusamy, and Aleksander Slominski. The rise of serverless computing. Commun. ACM, 62(12):44--54, November 2019.
[40]
Amazon aurora serverless. https://aws.amazon.com/rds/aurora/serverless/.
[41]
Ana Klimovic, Yawen Wang, Patrick Stuedi, Animesh Trivedi, Jonas Pfefferle, and Christos Kozyrakis. Pocket: Elastic ephemeral storage for serverless analytics. In Proceedings of the 13th USENIX Conference on Operating Systems Design and Implementation, OSDI'18, page 427--444, USA, 2018. USENIX Association.
[42]
Qifan Pu, Shivaram Venkataraman, and Ion Stoica. Shuffling, fast and slow: Scalable analytics on serverless infrastructure. In Proceedings of the 16th USENIX Conference on Networked Systems Design and Implementation, NSDI'19, page 193--206, USA, 2019. USENIX Association.
[43]
Apache TVM. https://tvm.apache.org.
[44]
AWS NNVM. https://aws.amazon.com/blogs/machine-learning/introducing-nnvm-compiler-a-new-open-end-to-end-compiler-for-ai-framewoks/.
[45]
Nadav Rotem, Jordan Fix, Saleem Abdulrasool, Garret Catron, Summer Deng, Roman Dzhabarov, Nick Gibson, James Hegeman, Meghan Lele, Roman Levenstein, et al. Glow: Graph lowering compiler techniques for neural networks. arXiv, pages arXiv--1805, 2018.
[46]
Yanjie Gao, Yu Liu, Hongyu Zhang, Zhengxian Li, Yonghao Zhu, Haoxiang Lin, and Mao Yang. Estimating gpu memory consumption of deep learning models. Technical Report MSR-TR-2020-20, Microsoft, May 2020.
[47]
S. S B, A. Garg, and P. Kulkarni. Dynamic memory management for gpu-based training of deep neural networks. In 2019 IEEE International Parallel and Distributed Processing Symposium (IPDPS), pages 200--209, 2019.
[48]
Donglin Yang and Dazhao Cheng. Efficient gpu memory management for nonlinear dnns. In Proceedings of the 29th International Symposium on High-Performance Parallel and Distributed Computing, HPDC '20, page 185--196, New York, NY, USA, 2020. Association for Computing Machinery.
[49]
Junzhe Zhang, Sai-Ho Yeung, Yao Shu, Bingsheng He, and Wei Wang. Efficient memory management for gpu-based deep learning systems. CoRR, abs/1903.06631, 2019.
[50]
F. Y. Rashid. The rise of confidential computing: Big tech companies are adopting a new security model to protect data while it's in use - [news]. IEEE Spectrum, 57(6):8--9, 2020.
[51]
Confidential computing consortium. https://confidentialcomputing.io/wp-content/uploads/sites/85/2019/12/CCC_Overview.pdf.
[52]
Project keylime: Bootstrap and maintain trust on the edge and cloud and iot. https://keylime.dev/.
[53]
Amin Mosayyebzadeh, Gerardo Ravago, Apoorve Mohan, Ali Raza, Sahil Tikale, Nabil Schear, Trammell Hudson, Jason Hennessey, Naved Ansari, Kyle Hogan, Charles Munson, Larry Rudolph, Gene Cooperman, Peter Desnoyers, and Orran Krieger. A secure cloud with minimal provider trust. In Proceedings of the 10th USENIX Conference on Hot Topics in Cloud Computing, HotCloud'18, page 16, USA, 2018. USENIX Association.
[54]
Rishabh Poddar, Ganesh Ananthanarayanan, Srinath Setty, Stavros Volos, and Raluca Ada Popa. Visor: Privacy-preserving video analytics as a cloud service. In USENIX Security Symposium, August 2020.

Cited By

View all
  • (2024)Development of Artificial Intelligence of Things and Cloud Computing Environments Through Semantic Web Control ModelsEmerging Technologies for Securing the Cloud and IoT10.4018/979-8-3693-0766-3.ch005(112-143)Online publication date: 23-Feb-2024
  • (2024)Developing Cloud-Native Autonomous Systems for Real-Time Edge Analytics2024 IEEE International Conference on Blockchain and Distributed Systems Security (ICBDS)10.1109/ICBDS61829.2024.10837008(1-6)Online publication date: 17-Oct-2024
  • (2024)Enhancing quality of service through federated learning in edge-cloud architectureAd Hoc Networks10.1016/j.adhoc.2024.103430156(103430)Online publication date: Apr-2024
  • Show More Cited By
  1. Challenges for building a cloud native scalable and trustable multi-tenant AIoT platform

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    ICCAD '20: Proceedings of the 39th International Conference on Computer-Aided Design
    November 2020
    1396 pages
    ISBN:9781450380263
    DOI:10.1145/3400302
    • General Chair:
    • Yuan Xie
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    In-Cooperation

    • IEEE CAS
    • IEEE CEDA
    • IEEE CS

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 17 December 2020

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. AI
    2. IoT
    3. cloud native
    4. edge cloud
    5. edge computing
    6. industry 4.0
    7. resource management
    8. streaming video analytics

    Qualifiers

    • Invited-talk

    Conference

    ICCAD '20
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 457 of 1,762 submissions, 26%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)65
    • Downloads (Last 6 weeks)5
    Reflects downloads up to 16 Feb 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Development of Artificial Intelligence of Things and Cloud Computing Environments Through Semantic Web Control ModelsEmerging Technologies for Securing the Cloud and IoT10.4018/979-8-3693-0766-3.ch005(112-143)Online publication date: 23-Feb-2024
    • (2024)Developing Cloud-Native Autonomous Systems for Real-Time Edge Analytics2024 IEEE International Conference on Blockchain and Distributed Systems Security (ICBDS)10.1109/ICBDS61829.2024.10837008(1-6)Online publication date: 17-Oct-2024
    • (2024)Enhancing quality of service through federated learning in edge-cloud architectureAd Hoc Networks10.1016/j.adhoc.2024.103430156(103430)Online publication date: Apr-2024
    • (2024)Assessment of Performance for Cloud-Native Machine Learning on Edge DevicesDigital Interaction and Machine Intelligence10.1007/978-3-031-66594-3_10(95-105)Online publication date: 15-Aug-2024
    • (2024)AIoT and Its Trust Models to Enhance Societal Applications Using Intelligent TechnologiesArtificial Intelligence of Things for Achieving Sustainable Development Goals10.1007/978-3-031-53433-1_16(311-334)Online publication date: 9-Mar-2024
    • (2023)Multi-tenancy in Cloud-native Architecture: A Systematic Mapping StudyWSEAS TRANSACTIONS ON COMPUTERS10.37394/23205.2023.22.422(25-43)Online publication date: 7-Mar-2023
    • (2023)Invited Paper: Intelligent Agent Support for Achieving Low Latency in Cloud-Native NextG Mobile Core NetworksProceedings of the 24th International Conference on Distributed Computing and Networking10.1145/3571306.3571386(12-19)Online publication date: 4-Jan-2023
    • (2023)TrustSys: Trusted Decision Making Scheme for Collaborative Artificial Intelligence of ThingsIEEE Transactions on Industrial Informatics10.1109/TII.2022.317300619:1(1059-1068)Online publication date: Jan-2023
    • (2023)Security-Aware Resource Allocation in the Edge-Cloud Continuum2023 IEEE 12th International Conference on Cloud Networking (CloudNet)10.1109/CloudNet59005.2023.10490073(161-169)Online publication date: 1-Nov-2023
    • (2023)Cloud Manufacturing Services Adoption in Higher Education Institutions: Challenges and Framework for Developing CountriesFlexible Automation and Intelligent Manufacturing: The Human-Data-Technology Nexus10.1007/978-3-031-17629-6_69(655-662)Online publication date: 4-Feb-2023
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media