TinyML: Enabling of Inference Deep Learning Models on Ultra-Low-Power IoT Edge Devices for AI Applications
Abstract
:1. Introduction
- (i)
- TinyML studies are reviewed in two aspects; first, studies that have developed the DL model and applied it in IoT applications. Second, studies that design frameworks and libraries for TinyML.
- (ii)
- Analysis and findings from previous studies are provided for the main three items used in TinyML (Model, Dataset and Devices).
- (iii)
- Discussion of main findings in TinyML (Model, Dataset and Devices) is presented.
- (iv)
- Light is shed on the most relevant limitation concerning TinyML, which will provide the directions for future research.
2. Overview of TinyML
Benefits of TinyML
- Energy efficiency: this is a great benefit when adopting TinyML that works on MCUs. Since IoT devices that work on MCUs rely on batteries or even energy harvesting, they consume less energy in comparison with powerful processors and Graphics Processing Units (GPUs) that demand a great amount of power. Therefore, IoT devices can be placed everywhere without the need to be plugged into the power grid. Thus, it opens the door for novel cognitive itinerant applications. In addition, scarce power consumption allows the IoT devices to be coupled with larger battery-powered devices, hence converting them into connected smart entities, e.g., personal mobility devices such as scooters or seaways [23].
- Low cost: IoT devices performed a variety of tasks in many IoT applications. Therefore, they require excellent specifications such as high computation processing power, large storage and memory. Further required is large amounts of storage in the cloud to conduct processing and store the data; thus, it has a high cost due to use of large resources. TinyML processes data locally in microcontroller devices that have low cost and can apply AI locally on the device itself to process the data with high performance [23]. Microcontrollers have low cost compared to other solutions due to their limited use of resources, as processor compute in the range of 1 MHz to 400 MHz. The memory for microcontroller can be of 2 KB to 512 K, whilst the storage capacity can be 32 KB to 2 MB. The cost of a Microcontroller is a few dollars in comparison to other smart IoT devices used to process data locally using DL models.
- Latency: In TinyML, data processing is executed locally in the device since the computations are performed in the device. Thus, IoT devices do not suffer from latency. The real-time local processing of data in the devices leads to faster response and rapid analysis in emergency scenarios. Besides this, the burden on the cloud is reduced [24,25].
- System reliability and data security: IoT devices require communication channels to transfer stream raw data from the IoT device to the cloud. When the data is transmitted to the cloud, it is prone to transmission errors, cyber-attacks e.g., eavesdropping or man-in-the-middle issues. Thereby, the transmitted data may be compromised or lost. According to the IBM Cost of Data Breach report 2020, the average cost of a data breach is pegged at USD 3.86 million [26]. Therefore, data need to be processed locally to limit cloud traffic. TinyML can prevent these issues through performing processing of data locally in the same device. which permits it to perform fewer transmissions with aggregated or meaningless data for an attack [23,25].
3. Research Methodology
3.1. Core Questions
- What are the datasets, models and devices used in TinyML?
- What are the types of application domains that are used with TinyML?
- What are the frameworks and libraries used to develop TinyML?
- What are the existing limitations for developing TinyML and objects for future research?
3.2. Search Strategy
3.3. Eligibility Criteria
3.4. Inclusion and Exclusion Criteria
3.5. Data Extraction and Synthesis
3.6. Data Selection
4. Related Work to TinyML
4.1. TinyML Use Cases Studies
4.1.1. Environment
4.1.2. Sign Language Detection
4.1.3. Handwriting Recognition
4.1.4. Medical Face Mask Detection
4.1.5. Gesture Recognition
4.1.6. Speech Recognition
4.1.7. Autonomous Mini Vehicles
4.2. Studies Related to Design TinyML Frameworks and Libraries
4.2.1. TinyML Framework Studies
4.2.2. TinyML Libraries Studies
5. Discussion and Findings
5.1. Findings Based on the Datasets
5.2. Findings Based on the Machine and Deep Learning Models
5.3. Findings Based on the Devices
6. TinyML Limitation
6.1. Device Heterogeneity
6.2. Process Power
6.3. Limited Memory
6.4. Limited DL Models
7. Conclusions
Author Contributions
Funding
Conflicts of Interest
References
- Hamdan, S.; Ayyash, M.; Almajali, S. Edge-Computing Architectures for Internet of Things Applications: A Survey. Sensors 2020, 20, 6441. [Google Scholar] [CrossRef] [PubMed]
- Wu, Z.; Qiu, K.; Zhang, J. A Smart Microcontroller Architecture for the Internet of Things. Sensors 2020, 20, 1821. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Signoretti, G.; Silva, M.; Andrade, P.; Silva, I.; Sisinni, E.; Ferrari, P. An Evolving Tinyml Compression Algorithm for IoT Environments Based on Data Eccentricity. Sensors 2021, 21, 4153. [Google Scholar] [CrossRef] [PubMed]
- Chen, Y.; Zheng, B.; Zhang, Z.; Wang, Q.; Shen, C.; Zhang, Q. Deep Learning on Mobile and Embedded Devices: State-of-the-Art, Challenges, and Future Directions. ACM Comput. Surv. 2020, 53, 1–37. [Google Scholar] [CrossRef]
- Sakr, F.; Bellotti, F.; Berta, R.; De Gloria, A. Machine Learning on Mainstream Microcontrollers. Sensors 2020, 20, 2638. [Google Scholar] [CrossRef]
- Gorospe, J.; Mulero, R.; Arbelaitz, O.; Muguerza, J.; Antón, M.Á. A Generalization Performance Study Using Deep Learning Networks in Embedded Systems. Sensors 2021, 21, 1031. [Google Scholar] [CrossRef]
- Atitallah, S.B.; Driss, M.; Boulila, W.; Ghezala, H. Ben Leveraging Deep Learning and IoT big data analytics to support the smart cities development: Review and future directions. Comput. Sci. Rev. 2020, 38, 100303. [Google Scholar] [CrossRef]
- Bhattacharya, S.; Somayaji, S.R.K.; Gadekallu, T.R.; Alazab, M.; Maddikunta, P.K.R. A review on deep learning for future smart cities. Internet Technol. Lett. 2020, 5, e187. [Google Scholar] [CrossRef]
- Wang, F.; Zhang, M.; Wang, X.; Ma, X.; Liu, J. Deep learning for edge computing applications: A State-of-the-Art survey. IEEE Access 2020, 8, 58322–58336. [Google Scholar] [CrossRef]
- Khalil, R.A.; Saeed, N.; Masood, M.; Fard, Y.M.; Alouini, M.S.; Al-Naffouri, T.Y. Deep Learning in the Industrial Internet of Things: Potentials, Challenges, and Emerging Applications. IEEE Internet Things J. 2021, 8, 11016–11040. [Google Scholar] [CrossRef]
- Thai-Nghe, N.; Thanh-Hai, N.; Ngon, N.C. Deep learning approach for forecasting water quality in IoT systems. Int. J. Adv. Comput. Sci. Appl. 2020, 11, 686–693. [Google Scholar] [CrossRef]
- Chen, Q.; Wang, W.; Wu, F.; De, S.; Wang, R.; Zhang, B.; Huang, X. A Survey on an Emerging Area: Deep Learning for Smart City Data. IEEE Trans. Emerg. Top. Comput. Intell. 2019, 3, 392–410. [Google Scholar] [CrossRef] [Green Version]
- Syed, A.S.; Sierra-Sosa, D.; Kumar, A.; Elmaghraby, A. IoT in smart cities: A survey of technologies, practices and challenges. Smart Cities 2021, 4, 429–475. [Google Scholar] [CrossRef]
- Warden, P.; Situnayake, D. Tinyml: Machine Learning with Tensorflow Lite on Arduino and Ultra-Low-Power Microcontrollers, 2nd ed.; Loukides, M., Taché, N., Eds.; O’Reilly Media: Sebastopol, CA, USA, 2019. [Google Scholar]
- Wang, Y.; Zhang, X.; Xie, L.; Zhou, J.; Su, H.; Zhang, B.; Hu, X. Pruning from Scratch. In Proceedings of the AAAI Conference on Artificial Intelligence, New York, NY, USA, 7–12 February 2020; AAAI Press: Palo Alto, CA, USA, 2020; Volume 34, pp. 12273–12280. [Google Scholar] [CrossRef]
- Wardana, I.N.K.; Gardner, J.W.; Fahmy, S.A. Optimising Deep Learning at the Edge for Accurate Hourly Air Quality Prediction. Sensors 2021, 21, 1064. [Google Scholar] [CrossRef]
- Lin, J.; Chen, W.M.; Lin, Y.; Cohn, J.; Gan, C.; Han, S. MCUNet: Tiny Deep Learning on IoT Devices. arXiv 2020, arXiv:2007.10319v2. [Google Scholar]
- Banbury, C.R.; Reddi, V.J.; Lam, M.; Fu, W.; Fazel, A.; Holleman, J.; Huang, X.; Hurtado, R.; Kanter, D.; Lokhmotov, A.; et al. Benchmarking TinyML Systems: Challenges and Direction. arXiv 2020, arXiv:2003.04821. [Google Scholar]
- TensorFlow Lite. Available online: http://www.tensorflow.org/lite (accessed on 23 September 2021).
- Dennis, D.K.; Gopinath, S.; Gupta, C.; Kumar, A.; Kusupati, A.; Patil, S.G.; Simhadri, H.V. EdgeML Machine LEARNING for Resource-Constrained Edge Devices. Available online: https://github.com/Microsoft/EdgeML (accessed on 23 September 2021).
- Suda, N.; Loh, D. Machine Learning on ARM Cortex-M Microcontrollers; Arm Ltd.: Cambridge, UK, 2019. [Google Scholar]
- X-CUBE-AI—AI Expansion Pack for STM32CubeMX—STMicroelectronics. Available online: http://www.st.com/en/embedded-software/x-cube-ai.html (accessed on 29 September 2021).
- Sanchez-Iborra, R.; Skarmeta, A.F. TinyML-Enabled Frugal Smart Objects: Challenges and Opportunities. IEEE Circuits Syst. Mag. 2020, 20, 4–18. [Google Scholar] [CrossRef]
- Puthal, D.; Mohanty, S.; Wilson, S.; Choppali, U. Collaborative Edge Computing for Smart Villages. IEEE Consum. Electron. Mag. 2021, 10, 68–71. [Google Scholar] [CrossRef]
- Merenda, M.; Porcaro, C.; Iero, D. Edge Machine Learning for Ai-Enabled IoT Devices: A Review. Sensors 2020, 20, 2533. [Google Scholar] [CrossRef]
- IBM Security. Cost of a Data Breach Report 2021. 2021, pp. 1–73. Available online: https://www.ibm.com/downloads/cas/OJDVQGRY (accessed on 26 May 2022).
- Page, M.J.; McKenzie, J.E.; Bossuyt, P.M.; Boutron, I.; Hoffmann, T.C.; Mulrow, C.D.; Shamseer, L.; Tetzlaff, J.M.; Akl, E.A.; Brennan, S.E.; et al. The PRISMA 2020 statement: An updated guideline for reporting systematic reviews. Int. J. Surg. 2021, 88, 105906. [Google Scholar] [CrossRef]
- Paul, A.J.; Mohan, P.; Sehgal, S. Rethinking Generalization in American Sign Language Prediction for Edge Devices with Extremely Low Memory Footprint. In Proceedings of the 2020 IEEE Recent Advances in Intelligent Computational Systems, RAICS, Thiruvananthapuram, India, 3–5 December 2020; pp. 147–152. [Google Scholar] [CrossRef]
- Mohan, P.; Paul, A.J.; Chirania, A. A Tiny Cnn Architecture for Medical Face Mask Detection for Resource-Constrained Endpoints. In Innovations in Electrical and Electronic Engineering; Springer: Singapore, 2020; pp. 657–670. [Google Scholar] [CrossRef]
- Coffen, B.; Mahmud, M.S. TinyDL: Edge Computing and Deep Learning Based Real-Time Hand Gesture Recognition Using Wearable Sensor. In Proceedings of the 2020 IEEE International Conference on E-Health Networking, Application & Services (HEALTHCOM), Shenzhen, China, 1–2 March 2021; pp. 1–6. [Google Scholar] [CrossRef]
- Venzke, M.; Klisch, D.; Kubik, P.; Ali, A.; Missier, J.D.; Turau, V. Artificial Neural Networks for Sensor Data Classification on Small Embedded Systems. arXiv 2020, arXiv:2012.08403v1. [Google Scholar]
- Orfanidis, C.; Hassen, R.B.H.; Kwiek, A.; Fafoutis, X.; Jacobsson, M. A Discreet Wearable Long-Range Emergency System Based on Embedded Machine Learning. In Proceedings of the 2021 IEEE International Conference on Pervasive Computing and Communications Workshops and Other Affiliated Events (PerCom Workshops), Kassel, Germany, 22–26 March 2021; pp. 182–187. [Google Scholar] [CrossRef]
- Wong, A.; Famouri, M.; Pavlova, M.; Surana, S. TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices. arXiv 2020, arXiv:2008.04245v6. [Google Scholar]
- De Prado, M.; Rusci, M.; Donze, R.; Capotondi, A.; Monnerat, S.; Benini, L.; Pazos, N. Robustifying the Deployment of TinyML Models for Autonomous Mini-Vehicles. Sensors 2021, 21, 1339. [Google Scholar] [CrossRef]
- David, R.; Duke, J.; Jain, A.; Reddi, V.J.; Jeffries, N.; Li, J.; Kreeger, N.; Nappier, I.; Natraj, M.; Regev, S.; et al. TensorFlow Lite Micro: Embedded Machine Learning on TinyML Systems. arXiv 2020, arXiv:2010.08678. [Google Scholar]
- Capotondi, A.; Rusci, M.; Fariselli, M.; Benini, L. CMix-NN: Mixed Low-Precision CNN Library for Memory-Constrained Edge Devices. IEEE Trans. Circuits Syst. II Express Briefs 2020, 67, 871–875. [Google Scholar] [CrossRef]
- Cai, H.; Gan, C.; Zhu, L.; Han, S. TinyTL: Reduce Memory, Not Parameters for Efficient On-Device Learning. arXiv 2020, arXiv:2007.11622. [Google Scholar]
- LeCun, Y.; Cortes, C.; Burges, C.J.C. MNIST Handwritten Digit Database. 1998. Available online: http://yann.lecun.com/exdb/mnist/ (accessed on 26 May 2022).
- Tecperson. Sign Language MNIST from Kaggle. Kaggle. 2017. Available online: https://www.kaggle.com/datamunge/sign-language-mnist/metadata (accessed on 26 May 2022).
- Akash. ASL Alphabet from Kaggle. Kaggle. 2020. Available online: https://www.kaggle.com/grassknoted/asl-alphabet (accessed on 26 May 2022).
- Rasband, D. ASL Alphabet Test from Kaggle. Kaggle. 2018. Available online: https://www.kaggle.com/datasets/danrasband/asl-alphabet-test (accessed on 26 May 2022).
- Ashish Jangra. Face Mask Detection ~12K Images Dataset from Kaggle. Kaggle. 2020. Available online: https://www.kaggle.com/ashishjangra27/face-mask-12k-images-dataset (accessed on 26 May 2022).
- Makwana, D. Face Mask Classification. Kaggle. 2020. Available online: https://www.kaggle.com/dhruvmak/face-mask-detection (accessed on 26 May 2022).
- Deng, J.; Dong, W.; Socher, R.; Li, L.J.; Li, K.; Fei-Fei, L. ImageNet: A Large-Scale Hierarchical Image Database. In Proceedings of the 2009 IEEE Conference on Computer Vision and Pattern Recognition, Miami, FL, USA, 20–25 June 2009; pp. 248–255. [Google Scholar] [CrossRef] [Green Version]
- Bulat, A.; Tzimiropoulos, G. XNOR-Net++: Improved Binary Neural Networks. In Proceedings of the 30th British Machine Vision Conference 2019, BMVC 2019, Cardiff, UK, 9–12 September 2019; pp. 1–12. [Google Scholar]
- Imagenet. ImageNet Object Localization Challenge from Kaggle. 2018. Available online: https://www.kaggle.com/c/imagenet-object-localization-challenge/data%0Ahttps://www.kaggle.com/c/imagenet-object-localization-challenge (accessed on 26 May 2022).
- Nilsback, M.E.; Zisserman, A. Automated Flower Classification over a Large Number of Classes. In Proceeding of the 6th Indian Conference on Computer Vision, Graphics and Image Processing, ICVGIP 2008, Bhubaneswar, India, 16–19 December 2008; pp. 722–729. [Google Scholar] [CrossRef]
- Englert, B.; Lam, S. The Caltech UCSD Birds 200-2011 Dataset; California Institute of Technology: Pasadena, CA, USA, 2011. [Google Scholar]
- Parkhi, O.M.; Vedaldi, A.; Zisserman, A.; Jawahar, C.V. Cats and Dogs. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Washington, DC, USA, 16–21 June 2012; pp. 3498–3505. [Google Scholar] [CrossRef]
- Vu, T.H.; Olsson, C.; Laptev, I.; Oliva, A.; Sivic, J. Food-101–Mining Discriminative Components with Random Forests; LNCS 8693; Springer International Publishing: Zurich, Switzerland, 2014. [Google Scholar] [CrossRef] [Green Version]
- Krizhevsky, A. Learning Multiple Layers of Features from Tiny Images; University of Toronto: Toronto, ON, Canada, 2009; Volume 34, Available online: https://www.cs.toronto.edu/~kriz/learning-features-2009-TR.pdf (accessed on 26 May 2022).
- Chowdhery, A.; Warden, P.; Shlens, J.; Howard, A.; Rhodes, R. Visual Wake Words Dataset. arXiv 2019, arXiv:1906.05721v1. [Google Scholar]
- Avigan, A. Cleveland Clinic Heart Disease Dataset. Kaggle. 2020. Available online: https://www.kaggle.com/aavigan/cleveland-clinic-heart-disease-dataset (accessed on 26 May 2022).
- Boero, L.; Cello, M.; Marchese, M.; Mariconti, E.; Naqash, T.; Zappatore, S. Statistical Fingerprint-Based Intrusion Detection System (SF-IDS). Int. J. Commun. Syst. 2016, 30, e3225. [Google Scholar] [CrossRef]
- Gorman, R.P.; Sejnowski, T.J. Analysis of Hidden Units in a Layered Network Trained to Classify Sonar Targets. Neural Netw. 1988, 1, 75–89. [Google Scholar] [CrossRef]
- Gorman, P.; Sejnowski, T. Connectionist Bench (Sonar, Mines vs. Rocks) Dataset. UCI Machine Learning Repository. 1989. Available online: https://archive.ics.uci.edu/ml/datasets/Connectionist+Bench+(Sonar,+Mines+vs.+Rocks) (accessed on 26 May 2022).
- Loseto, G. Traffic, Driving Style and Road Surface Condition. Kaggle. 2019. Available online: https://www.kaggle.com/gloseto/traffic-driving-style-road-surface-condition (accessed on 26 May 2022).
- EnviroCar—Datasets—The Datahub. Available online: http://www.old.datahub.io/dataset/envirocar (accessed on 23 May 2022).
- Search for and Download Air Quality Data|NSW Dept of Planning, Industry and Environment. Available online: http://www.dpie.nsw.gov.au/air-quality/search-for-and-download-air-quality-data (accessed on 23 May 2022).
- Zhang, Y.; Suda, N.; Lai, L.; Chandra, V. Hello Edge: Keyword Spotting on Microcontrollers. arXiv 2018, arXiv:1711.07128v3. [Google Scholar]
- STMicroelectronics. Available online: https://www.st.com/content/st_com/en.html (accessed on 23 May 2022).
- Mahdavinejad, M.S.; Rezvan, M.; Barekatain, M.; Adibi, P.; Barnaghi, P.; Sheth, A.P. Machine learning for internet of things data analysis: A survey. Digit. Commun. Netw. 2018, 4, 161–175. [Google Scholar] [CrossRef]
- Ian, G.; Bengio, Y.; Courville, A. Deep Learning; Massachusett of Institute of Technology: London, UK, 2016; p. 706. ISBN 9780262035613. [Google Scholar]
- Wu, Z.; Jiang, M.; Li, H.; Zhang, X. Mapping the knowledge domain of smart city development to Urban Sustainability: A Scientometric Study. J. Urban Technol. 2020, 28, 29–53. [Google Scholar] [CrossRef]
Source | Criteria |
---|---|
Database | web-based resources and Web of Science |
Date of publication | 2019–2021 |
Keywords | TinyML Tiny Machine Learning Tiny Deep Learning Deep learning AND Ultra-power-device Deep Learning AND Microcontroller |
Language | English |
Type of publication | Conference Proceedings Research Article ArXiv preprint |
Inclusion criteria | TinyML Use Cases paper Develop Framework Develop library for TinyML |
Exclusion criteria | Challenges and directions paper Compression models techniques on Ultra power devices Not related to research questions |
Study | Model | Model Result in Desktop | Inference in Devices | Result after Deployment | ||||||
---|---|---|---|---|---|---|---|---|---|---|
ACC | Model Size | Platform | Name | Platform | Metrics | Latency | Ram | Flash Memory | ||
[5] | SVM | 84% | - | - | All devices | STM X-Cube-AI expansion package, and C language platform | All 84% | <1 ms | - | - |
ANN1 | 99%—<1 m | - | - | F746ZG | Both 99% | 1 ms | - | - | ||
H743ZI2 | ||||||||||
KNN | 99%—<1 ms | - | - | F746ZG | Both 92% | Both 10 ms | - | - | ||
H743ZI2 | ||||||||||
ANN2 | 99%—<1 ms | - | - | F746ZG | Both 99% | Both <1 ms | - | - | ||
H743ZI2 | ||||||||||
DT | 99%—<1 ms | - | - | F746ZG | Both 99% | Both <1 ms | - | - | ||
H743ZI2 | ||||||||||
ANN3 | 0.86 | - | - | F401RE | 0.86 R2 | <1 ms | - | - | ||
F746ZG | ||||||||||
H743ZI2 | ||||||||||
L452RE | ||||||||||
[25] | NN CNN | 97.25% 99% | 15 MB 7172 KB | TFLite and TFliteConver | F746ZG | X-CUBE-AI tool | 100% | 330 ms | 135.68 | 668.97 |
[28] | CNN1 CNN2 | 98.53% 99.02% | 185 KB | TF Lite | OpenMV H7 board STM32H743VI. | TF-Convert | 95.28% 98.84% | 20 FPS | - | - |
[29] | CNN | 99.83% | 1.5 MB | - | OpenMV H7 STM32H743VI. | - | 99.83% | 30 FPS | - | - |
SqueezeNet | 98.50% | 8.0 MB | 98.53% | |||||||
SqueezeNet2 | 98.93% | 3.8 MB | 98.99% | |||||||
[30] | Keras | 19% | - | TensorFlow | Taiyo Yuden EYSHSNZWZ NRF52 | - | - | - | - | - |
LSTM | 93% | 2.8 MB | TensorFlow | - | Tensor Flow Lite Micro- Not Support it | - | - | - | - | |
[31] | RNN FFNN | 61% | - | - | ATMega4809 | TensorFlow. | 84% | Both 40 Hz | Both 2 KB | Both 32 KB |
93% | ||||||||||
[32] | NN | - | - | - | ESP32 | Arduino-LMIC software | 99.33% indoor 97.5%. outdoor | 2 min per activity. 0.5 per gesture. | - | - |
[33] | TinySpeech-X. | 96.4% | - | TensorFlow Lite for Microcontroller | - | - | - | - | - | - |
TinySpeech-Y | 93.6% | 48.8 KB | ||||||||
TinySpeech-Z | 92.4% | 21.6 KB | ||||||||
TinySpeech-M | 91.9% | - | ||||||||
[34] | LetNet5 model Vehicle Neural Networks (VNN1,2) | 99.53% 79.62% 81.27% | - | PyTorch | STM32 L476 board | X-Cube-AI (float32 operations) | - | 14.15 ms | 80 MHz | - |
NXP k64f | ARM CMSIS-NN. | - | 0.97 | 120 MHz | - | |||||
GAP8 | PULP-NN | - | 1000 fps with 1 ms | - | - |
Study | Model | Model Result in Desktop | Inference in Devices | Result after Deployment | ||||||
---|---|---|---|---|---|---|---|---|---|---|
ACC | Model Size | Platform | Name | Platform | Metrics | Latency | Ram | Flash Memory | ||
[37] | VWW model Google Hotword Optimized | - | - | - | Sparkfun Edge Xtensa Hifi Mini digital signal processors (DSP) | TensorFlow Lite Micro | - | - | 4.857 KB 49.95 KB 36.4 KB 88.4 KB | 81.79 KB 12.80 KB |
[17] | TinyNAS model and TinyEngin library | - | - | - | STM32F746 | MCUNET | 61.8% | 49.5% at 5 FPS and 40.5% at 10 FPS | 0.49 MB | 1.9 MB |
87% | 89% at 5 FPS and 87% at 10 FPS | 91 KB | <140 MB | |||||||
- | 94% at 5 FPS and 91% at 10 FPS | - | <124 MB | |||||||
36.4 KB 88.4 KB | ||||||||||
[6] | MobileNet-V2 | - | - | - | STM32H747I-Disco | TensorFlow Lite and MbedOS | 88% | 220 ms | 138,240 KB | Matrix Size: 611,912 |
OpenMV Cam H7 | ||||||||||
[36] | MobileNet-V1 | - | 1.97 MB | STM32H743 SoC | CMix-NN | 68.2% | 1.86 s | - | - | |
[35] | ProxyNAS with FT-Full | - | - | Computation theory | - | - | - | - | 391 MB | - |
ProxyNAS with tinyTL | 65 MB | |||||||||
Inception-V3 with FT-Full | 850 MB | |||||||||
TinyTL with FA | 66 MB |
Input Type | Dataset | Reference |
---|---|---|
Images | Handwritten digits | [38] |
Sign MNIST dataset from Kaggle | [39] | |
Kaggle ASL dataset (26 classes) | [40] | |
ASL dataset created by authors | [28] | |
Kaggle ASL Alphabet test set. | [41] | |
Face Mask 12 K Images dataset from Kaggle | [42] | |
Face Mask Classification dataset from Kaggle | [43] | |
Face Mask Dataset created by authors | [29] | |
Face Mask testing dataset created by authors | [29] | |
ImageNET (Flower, CUP, Pets, Food, CIFAR10 and CIFAR100) | [44,45,46,47,48,49,50,51,52] | |
Visual Wake Word (VWW) | ||
Physiological/ Behavioral Metrics | Heart dataset | [53] |
Hand gesture recorded ((Forward, Backward, Select and Abort) using fingers created by authors | [30] | |
Hand Gesture data (0–9) Created by the authors | [31] | |
Foot gesture and activity data Created by authors | [32] | |
Data | Virus dataset | [54] |
Sonar dataset | [55,56] | |
Peugeot 14 | [57] | |
Peugeot 15 | [57] | |
EnviroCar | [58] | |
Air Quality Index (AQI) | [59] | |
Dset-2.0 created by authors | [34] | |
Dset-1.5 created by authors | [34] | |
Dset-1.0 created by authors | [34] | |
Audio | Google Speech Commands (GSC) | [60] |
Study | Dataset | Description | Total | Training Dataset | Testing Dataset |
---|---|---|---|---|---|
[5] | Heart dataset | Heart dataset produced by the University of California Irvine (UCI), contains 13 features. In these, 0 represents an absence of coronary heart disease (CHD) in the patient and labels 1–4 represent the presence of CHD | 300 data | - | - |
Virus dataset | Developed to be used in data traffic analysis | - | - | - | |
Sonar dataset | Contains reading sonar system for two classes (Miners and Rocks) to materials analysis | 208 data | - | - | |
Peugeot 14 | Contains different parameters from cars to predict road surface | 8615 data | - | - | |
Peugeot 15 | Contains different parameters from cars to predict the traffic | 8615 data | - | - | |
EnviroCar | Contains anonymized tracks of car measurements collected by citizen bus The used dataset contains approximately 1.7 million data points. Each data record contains 24 attributes reflecting sensor values of the vehicle (e.g., speed, rpm...) | Around 1.7 million data point | - | - | |
AQI | Air Quality Index (AQI) dataset includes measurement of air quality for one year in Australia. | Real-time data from website | - | - | |
[25] | Handwritten Digits | Handwritten digit images from (0 to 9) | 70,000 images | 60,000 | 10,000 |
[28] | Sign MNIST dataset | Used 24 of the 26 letters of the alphabet in English, leaving out the letters J and Z | 34,627 images | 27,455 images | 7172 images |
Kaggle ASL dataset | Used 3000 images per class, 24 classes for 26 letters of the alphabet in English, except for J and Z | 72,024 images | 72,000 images | 24 images | |
Sign Language dataset | Used 400 images for each of the 24 classes, used inter_area interpolation OpenCV to downscale the images into 28 × 28 | 400 images | 40 images | 360 images | |
Kaggle ASL Alphabet test set. | 30 images per 24 classes, used as generalization dataset or as final testset | 720 images | - | - | |
[29] | Face Mask 12 K Images Dataset | Faces images with/without a mask with a variety of backgrounds and cropped to face region | 58,960 images. | 58,960 images | - |
Face Mask Classification dataset | Face images with/Without Mask | 22,200 images | 22,200 images | - | |
Medical face OpenMV | Used the OpenMV Cam H7 camera to create a dataset. The size of images was 200 × 200 then saved on the SD Card of the development board | 49,895 images | 49,895 images | - | |
Medical face testing dataset OpenMV Dataset. | Used OpenMV camera to create a dataset | 4794 images | 4794 images | - | |
[30] | Hand Gesture data from numbers (0–9) | Data for 10 numbers of gestures (from 0 to 9) | 1000 gestures | - | - |
[32] | Foot gestures and activity data | Set of data activities (walking, jogging, standing) and two gestures (double tap at the toe tip and double tap at heel). | 30,000 data | 24,000 data | 6000 data |
[31] | Hand gestures | A set of gestures were recorded such (Forward, Backward, Select and Abort) using fingers, arms and entire hands. | - | -RNN 15 data recorded then augmented to 540 -FFNN 350 data recorded then augmented to 1400 | 24 signals |
[33] | Google Speech Commands (GSC) dataset. | 65,000 of 1 s verbal command for short words with background noise. | 65,000 | - | - |
[34] | Dset-2.0 | Dset-2.0 contains samples (clear images) with (2.0 ms) | - | 1000 sample | 300 sample |
Dset-1.5 | Dset-1.5 contains samples with (low-contrast images) with (1.5 ms) | - | 1000 sample | 300 sample | |
Dset-1.0 | Dset-1.0 contains samples (low-contrast images) with (1.0 ms) | - | 1000 sample | 300 sample | |
[17] | ImageNET: | the standard large-scale benchmark for image classification consists of Set of 1000 object categories containing internal and leaf nodes, but do not interfere with each other. | 10,000 images | - | - |
Wake word: Visual Wake Word (VWW): | VWW: is a set of natural images of a complex day. Each image classifies to label 1 images present (Person) or 0 (Not Person) | 5000 images | - | - | |
Wake word: Google Speech Commands (GSC) dataset | GSC: Speech Commands is an audio dataset for keyword spotting (e.g., “Hey Siri”), requiring classifying a spoken word from a vocabulary of size 35. | - | - | - | |
[36] | ImageNET Dataset | Set of 1000 object categories contains internal and leaf nodes. | 200,000 images | 50,000 images | 150,000 images |
[6] | VWW | VWW: is a set of natural images of a complex day. Each image classifies to label 1 images present (Person) or 0 (Not Person) | 115,387 | 115,287 images | 100 images |
[35] | 9 datasets (Flower, Cars, CUB, food, Pets, Aircraft, CIFAR10, CIFAR-100 and CelebA) | Used ImageNet at pre-train on eight object classification datasets (Flower, Cars, CUB, Food, Pets, Aircraft, CIFAR10 and CIFAR-100) used VGG Face2 as a pre-train to one human facial attribute classification dataset (CelebA) | - | - | - |
[37] | Visual Wake Word (VWW) | Each image classifies to label images (1 present Person or 0 Not Person) | 115 k | 115 k images | 8 k |
Processor | Flash Memory | RAM | Processor Speed (MHz) |
---|---|---|---|
STM32-L476RG | 1 MB | 128 KB | 80 MHz |
STM32-H743VI | 2 MB | 1 MbB | 480 MHz |
STM32 Nucleo-64 F091RC | 256 KB | 32 KB | 48 (max: 48) |
STM32 Nucleo-64 F303RE | 512 KB | 80 KB | 72 (max: 72) |
STM32 Nucleo-64 F401RE | 512 KB | 96 KB | 84 (max: 84) |
STM32 Nucleo-144 F746ZG | 1 MB | 340 KB | 96 (max: 216) |
STM32 Nucleo-144 H743ZI2 | 2 MB | 1 MB | 96 (max: 480) |
STM32 Nucleo-64 L452RE | 512 KB | 160 KB | 80 (max: 80) |
STM32H747I-Disco_CPU (ARM Cortex M4+ ARM Cortex M7) | 1 MB | 2 MB | 240 MHz (M4) + 480 MHz (M7) |
STM32H743VI | 2 MB | 1 MB | 400 MHz |
GAP 8 based PULP architecture | 512 kB | 80 KB | 22.65 Giga Operations Per Secon (GOPS) |
NXP Semiconductors FRDM-K64F | 1 MB | 256 KB | 120 MHz |
ATMEGA4809 | 48 KB | 6 KB | 20 MHz |
Arm CPU Cortex-M4 | 0.38 MB | 1 MB | 96 MHz |
Xtensa DSP HiFi Mini | 1 MB | 1 MB | 10 MHz |
STM32H743 SoC- ARM Cortex- M7 | 2 MB | 512 KB | 480 MHz |
Sparkfun Edge (Ambiq Apollo3), Arm CPU Cortex-M4 | 1 MB | 0.38 MB | 96 MHz |
Tensilica HiFi, Xtensa DSP HiFi Mini processor | 1 MB | 1 MB | 10 MHz |
ESP32 | 448 KB | 520 KiB SRAM | 160 MHz–240 MHz |
Taiyo Yuden EYSHSNZWZ NRF52 | 512 KB | 64 KB | 2402 MHz–2480 MHz |
OpenMV Cam H7—Processor (ARM Cortex M7 480 MHz) | 2 MB | 1 MB | 480 MHz |
STM32H747I-Disco_CPU (ARM Cortex M4 + ARM Cortex M7) | 1 MB | 2 MB | 240 MHz (M4) + 480 MHz (M7) |
STM32H743VI | 2 MB | 1 MB | 400 MHz |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Alajlan, N.N.; Ibrahim, D.M. TinyML: Enabling of Inference Deep Learning Models on Ultra-Low-Power IoT Edge Devices for AI Applications. Micromachines 2022, 13, 851. https://doi.org/10.3390/mi13060851
Alajlan NN, Ibrahim DM. TinyML: Enabling of Inference Deep Learning Models on Ultra-Low-Power IoT Edge Devices for AI Applications. Micromachines. 2022; 13(6):851. https://doi.org/10.3390/mi13060851
Chicago/Turabian StyleAlajlan, Norah N., and Dina M. Ibrahim. 2022. "TinyML: Enabling of Inference Deep Learning Models on Ultra-Low-Power IoT Edge Devices for AI Applications" Micromachines 13, no. 6: 851. https://doi.org/10.3390/mi13060851
APA StyleAlajlan, N. N., & Ibrahim, D. M. (2022). TinyML: Enabling of Inference Deep Learning Models on Ultra-Low-Power IoT Edge Devices for AI Applications. Micromachines, 13(6), 851. https://doi.org/10.3390/mi13060851