Abstract
This paper describes the development of an A.I. application for animal sound classification using pre-trained and custom-trained machine-learning models deployed on mobile devices. The research aims to address the challenges of traditional animal acoustic sound signal analysis, which is computationally intensive, requires a strong network connection, and is challenging to implement on low-cost microcontroller-based systems. By using Yet Another Mobile Network (YAMNet), a pre-trained model, and a custom-trained model, animal sounds and noises can be identified in real time, and the animal making the sound can be determined. The accuracy of the predictions is evaluated using a mobile device’s trained model against test datasets in three different modes. Although the animal scope is currently limited to birds found in Singapore due to dataset constraints, the system can be expanded to other animals and species as long as sufficient datasets are available, making it a promising solution for continuous real-time biodiversity monitoring.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Riede, T., Zuberbühler, K.: The relationship between acoustic structure and semantic information in Diana monkey alarm vocalizations. J. Acoust. Soc. Am. 114(2), 1132–1142 (2003)
Chesmore, E.D.: Automated bioacoustic identification of species. An. Acad. Bras. Ciênc. 76(2), 435–440 (2004)
Plakal, M., Ellis, D.: Yamnet, January 2020. https://github.com/tensorflow/models/tree/master/research/audioset/yamnet. Accessed 18 Sept 2022
Moolayil, J.: Learn Keras for Deep Neural Networks: A Fast-Track Approach to Modern Deep Learning with Python. Apress, Berkeley (2019)
Abadi, M., et al.: TensorFlow: a system for large-scale machine learning. In: TensorFlow, Savannah, GA, USA (2016)
“Tensorflow Lite,” TensorFlow. https://www.tensorflow.org/lite/guide#:~:text=Optimized%20for%20on%2Ddevice%20machine,inference%20and%20a%20lack%20of. Accessed 21 Jan 2023
Shonfield, J., Bayne, E.: Autonomous recording units in avian ecological research: current use and future applications. Avian Conserv. Ecol. 12(1) (2017)
Barlow, J., et al.: The future of hyperdiverse tropical ecosystems. Nature 559(7715), 517–526 (2018)
Wilson, K.A., et al.: Conservation research is not happening where it is most needed. PLoS Biol. 14(3), e1002413 (2016)
Clarke, D.A., York, P.H., Rasheed, M.A., Northfield, T.D.: Does biodiversity– ecosystem function literature neglect tropical ecosystems? Trends Ecol. Evolut. 32(5), 320–323 (2017)
Socolar, J.B., Valderrama Sandoval, E.H., Wilcove, D.S.: Overlooked biodiversity loss in tropical smallholder agriculture. Conserv. Biol. 33(6), 1338–1349 (2019)
Boakes, E.H., et al.: Distorted views of biodiversity: spatial and temporal bias in species occurrence data. PLoS Biol. 8(6), e1000385 (2010)
Darras, K., et al.: Comparing the sampling performance of sound recorders versus point counts in bird surveys: a meta-analysis. J. Appl. Ecol. 55(6), 2575–2586 (2018)
Kahl, S., Wood, C.M., Eibl, M., Klinck, H.: BirdNET: a deep learning solution for avian diversity monitoring. Ecol. Inform. 61, 101236 (2021)
Shiu, Y., et al.: Deep neural networks for automated detection of marine mammal species. Sci. Rep. 10(1), 1–12 (2020)
Grill, T., Schlüter, J.: Two convolutional neural networks for bird detection in audio signals. In: 2017 25th European Signal Processing Conference (EUSIPCO)
Lasseck, M.: Audio-based bird species identification with deep convolutional neural networks. In: CLEF working notes 2018, CLEF: Conference and Labs of the Evaluation Forum, Avignon, France, September 2018 (2018)
Mühling, M., Franz, J., Korfhage, N., Freisleben, B.: Bird species recognition via neural architecture search. In: CLEF working notes 2020, CLEF: Conference and Labs of the Evaluation Forum, Thessaloniki, Greece, September 2020 (2020)
Wood, C.M., Kahl, S., Chaon, P., Peery, M.Z., Klinck, H.: Survey coverage, recording duration and community composition affect observed species richness in passive acoustic surveys. Methods Ecol. Evol. 12(5), 885–896 (2021)
Joly, A., et al.: Overview of LifeCLEF 2021: an evaluation of machine-learning based species identification and species distribution prediction. In: Selçuk Candan, K., et al. (eds.) CLEF 2021. LNCS, vol. 12880. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-85251-1_24
S’pore to become beautiful, clean city within three years (1967, May 12). The Straits Times, p. 4. Retrieved from NewspaperSG; Lee, K. Y. (2000). From third world to first: The Singapore story: 1965–2000: Memoirs of Lee Kuan Yew (p. 188). Singapore: Times Editions: Singapore Press Holdings. Call no.: RSING 959.57092 LEE-[HIS]
Lee, J. (1998, December 11) ‘City in a garden’ plan set out for Singapore. The Straits Times, p. 3. Retrieved from NewspaperSG; Prime Minister’s Office. (2014, November 6) Speech by Prime Minister Lee Hsien Loong at the opening of Bishan Park – ABC Waters, 17 Mar 2012. Prime Minister’s Office website. http://www.pmo.gov.sg/mediacentre/speech-prime-minister-lee-hsien-loong-opening-bishan-park-abc-waters-17-mar-2012. Accessed 23 Jan 2023
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Lin, Y.H., Fernando, O.N.N. (2023). Animal Hunt: AI-Based Animal Sound Recognition Application. In: Stephanidis, C., Antona, M., Ntoa, S., Salvendy, G. (eds) HCI International 2023 Posters. HCII 2023. Communications in Computer and Information Science, vol 1836. Springer, Cham. https://doi.org/10.1007/978-3-031-36004-6_64
Download citation
DOI: https://doi.org/10.1007/978-3-031-36004-6_64
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-36003-9
Online ISBN: 978-3-031-36004-6
eBook Packages: Computer ScienceComputer Science (R0)