Abstract
TinyML is a rapidly evolving field at the intersection of machine learning and embedded systems. This paper describes and evaluates a TinyML-optimized convolutional neural network (CNN) for real-time digit spotting in the Arabic language when executed on three different computational platforms. The proposed system is designed to recognize a set of Arabic digits from a continuous audio stream in real-time, enabling the development of intelligent voice-activated applications on edge devices.
Our results show that our TinyML-optimized CNN model can achieve 90%–93% inference accuracy, within 0.06–38 ms, while occupying only 19–139 KB of memory. These results demonstrate the feasibility of deploying a CNN-based Arabic digit spotting system on resource-constrained edge devices. They also provide insights into the trade-offs between performance and resource utilization on different hardware platforms. This work has important implications for the development of intelligent voice-activated applications in the Arabic language on edge devices, which enables new opportunities for real-time speech processing at the edge.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
The dialect spoken in Arab states around the Persian Gulf.
References
TensorFlow Lite for Microcontrollers. https://www.tensorflow.org/lite/microcontrollers
Alalshekmubarak, A., Smith, L.S.: On Improving the Classification Capability of Reservoir Computing for Arabic Speech Recognition. In: Wermter, S., et al. (eds.) ICANN 2014. LNCS, vol. 8681, pp. 225–232. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-11179-7_29
Ammari, T., Kaye, J., Tsai, J.Y., Bentley, F.: Music, search, and IoT: How people (really) use voice assistants. ACM Trans. Comput. Hum. Interact. 26(3) (2019). https://doi.org/10.1145/3311956
ARM: ARM NN SDK. https://www.arm.com/products/silicon-ip-cpu/ethos/arm-nn
Cao, K., Liu, Y., Meng, G., Sun, Q.: An overview on edge computing research. IEEE Access 8, 85714–85728 (2020). https://doi.org/10.1109/ACCESS.2020.2991734
Amir, G., et al.: A survey of quantization methods for efficient neural network inference. arXiv preprint arXiv:2103.13630 (2021)
Benoit, J., et al.: Quantization and training of neural networks for efficient integer-arithmetic-only inference. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2704–2713 (2018)
Reddi, J., et al.: Widening Access to Applied Machine Learning With TinyML. Harvard Data Sci. Rev. 4 (2022)
Keras: BayesianOptimization Tuner
Microsoft: Embedded Learning Library (2020). https://microsoft.github.io/ELL/
Novac, P.-E., et al.: Quantization and deployment of deep neural networks on microcontrollers. Sensors 21(9), 2984 (2021). https://doi.org/10.3390/s21092984. https://arxiv.org/abs/2105.13331
Saha, S.S., Sandha, S.S., Srivastava, M.: Machine learning for microcontroller-class hardware: A review. IEEE Sens. J. 22(22), 21362–21390 (2022). https://doi.org/10.1109/jsen.2022.3210773
Sithara, A., Thomas, A., Mathew, D.: Study of MFCC and IHC feature extraction methods with probabilistic acoustic models for speaker biometric applications. Proc. Comput. Sci. 143, 267–276 (2018)
Soro, S.: TinyML for Ubiquitous Edge AI (2021). arXiv preprint arxiv.org/2102.01255https://doi.org/10.48550/ARXIV.2102.01255
STMicroelectronics: Artificial Intelligence Ecosystem for STM32. https://www.st.com/content/stcom/en/ecosystems/artificial-intelligence-ecosystem-stm32.html
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 IFIP International Federation for Information Processing
About this paper
Cite this paper
Abu Adla, Y., Saghir, M.A.R., Awad, M. (2023). Real-Time Arabic Digit Spotting with TinyML-Optimized CNNs on Edge Devices. In: Maglogiannis, I., Iliadis, L., MacIntyre, J., Dominguez, M. (eds) Artificial Intelligence Applications and Innovations. AIAI 2023. IFIP Advances in Information and Communication Technology, vol 675. Springer, Cham. https://doi.org/10.1007/978-3-031-34111-3_44
Download citation
DOI: https://doi.org/10.1007/978-3-031-34111-3_44
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-34110-6
Online ISBN: 978-3-031-34111-3
eBook Packages: Computer ScienceComputer Science (R0)