Abstract
This paper has emphasized several sounds for Bird Species Recognition based on their vocalization. Although various techniques have been designed with good equipment for identifying different birds’ sounds, still it has limitations to classify or recognize different birds’ sounds at dissimilar moments of traveling. Thus, this paper presents the model for recognizing the bird’s sound in several situations using Spiking Neural Network (SNN). In this model, Permutation Pair Frequency Matrix Coefficients are considered for the features representing the bird sound frames, after the application of pre-processing method on the moment of recordings. These features are applied as input to the three-layered SNN and subsequently to the hidden layer, which performs the work of spiking if there is any change in the feature. The spikes activate any one of the neuron’s outputs, that produces a higher probability class. Further, the SNN along with the Distance Fuzzy Co-clustering algorithm technique used on these bird sounds to achieve optimum results. The performance of this model achieved more than 94% accuracy and a computation time of 4.16 ms better than other existing approaches.








Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Anwani, N., & Rajendran, B. (2015) NormAD-normalized approximate descent based supervised learning rule for spiking neurons. In International joint conference on neural networks (pp. 1–8).
Bardeli, R., Wolff, D., Kurth, F., Koch, M., Tauchert, K. H., & Frommolt, K. H. (2010). Detecting bird sounds in a complex acoustic environment and application to bioacoustic monitoring. Pattern Recognition Letters, 31(12), 1524–1534. https://doi.org/10.1016/j.patrec.2009.09.014
Bhuyan H. K., Chakraborty, C., Pani, S. K., & Ravi, V. K. (2023c). Feature and sub-feature selection for classification using correlation coefficient and fuzzy model. IEEE Transaction on Engineering Management, 70(5).
Bhuyan, H. K., Kamila, N. K., & Jena, L. D. (2016). Pareto-based multi-objective optimization for classification in data mining. Cluster Computing, 19(4), 1723–1745.
Bhuyan, H. K., & Ravi V. K. (2021). Analysis of sub-feature for classification in data mining. IEEE Transaction on Engineering Management.
Bhuyan, H. K., & Ravi, V. K. (2023b). An integrated framework with deep learning for segmentation and classification of cancer disease. International Journal on Artificial Intelligence Tools (IJAIT), 32(02), 2340002.
Bhuyan, H. K., Ravi, V., Brahma, B., & Kamila, N. K. (2022b). Disease analysis using machine learning approaches in healthcare system. Health and Technology, 12(5), 987–1005.
Bhuyan, H. K., Ravi, V., & Yadav, M. S. (2022a). Multi-objective optimization-based privacy in data mining. Cluster Computing, 25(6), 4275–4287.
Bhuyan, H. K., & Reddy C. V. (2018) Madhusudan: Sub-feature selection for novel classification. In International conference on inventive communication and computational technologies (ICICCT). IEEE Explore, 20–21 April. https://doi.org/10.1109/ICICCT.2018.8473206
Bhuyan, H. K., Saikiran, M., Tripathy, M., & Ravi, V. (2023a). Wide-ranging approach-based feature selection for classification. Multimedia Tools and Applications, 82(15), 23277–23304.
Bhuyan H. K., Vijayaraj, A., & Ravi, V. (2023a). Diagnosis system for cancer disease using a single setting approach. Multimedia Tools and Applications 1–27.
Bhuyan, H. K., Vijayaraj, A., & Ravi, V. K. (2023b). Development of secrete images in image transferring system. Multimedia Tools and Applications, 82(5), 7529–7552.
Cao, Y., Chen, Y., & Khosla, D. (2015). Spiking deep convolutional neural networks for energy-efficient object recognition. International Journal of Computer Vision, 113(1), 54–66.
Couchoux, C., Aubert, M., Garant, D., & Reale, D. (2015). Spying on small wildlife sounds using affordable collar-mounted miniature microphones: An innovative method to record individual daylong vocalisations in chip munks. Science and Reports, 5, 10118.
Fagerlund, S., & Laine, U. K. (2014). New parametric representation of bird sounds for automatic classification. In 2014 IEEE international conference on acoustics, speech and signal processing (ICASSP), May 4-9.
Fodor, G. (2013). The ninth annual MLSP competition: First place. In Proceedings of the IEEE international workshop on machine learning for signal processing (MLSP) (pp. 1–2).
Gütig, R. (2014). To spike, or when to spike? Current opinion. Neurobiology, 25, 134–139.
Hunsberger, E. (2017). Spiking deep neural networks: Engineered and biological approaches to object recognition. (Ph.D. thesis), UWSpace.
Ilany, A., Barocas, A., Kam, M., Ilany, T., & Geffen, E. (2013). The energy cost of singing in wild rock hyrax males: Evidence for an index signal. Animal Behaviour, 85(5), 995–1001.
Jancovic, P., & Köküer, M. (2019). Bird species recognition using unsupervised modeling of individual vocalization elements. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 27(5), 932–947.
Lasseck, M. (2015). Improved automatic Bird identification through decision tree based feature selection and bagging. CLEF (working notes), 1391. https://ceur-ws.org/Vol-1391/160-CR.pdf
Lee, J. H., Delbruck, T., & Pfeiffer, M. (2016). Training deep spiking neural networks using backpropagation. Frontiers in Neuroscience, 10, 508.
Lee, W. W., Kukreja, S. L., & Thakor, N. V. (2017). CONE: Convex-optimized-synaptic efficacies for temporally precise spike mapping. IEEE Transactions on Neural Networks and Learning Systems, 28(4), 849–861.
Liu, Y., Chen, J., & Chao, H. (2017a). A fuzzy co-clustering algorithm via modularity maximization. Hindawi Mathematical Problems in Engineering. https://doi.org/10.1155/2018/3757580
Liu, Y., Wu, S., Liu, Z., Chao, H., & Deng, Z. (2017b). A fuzzy co-clustering algorithm for biomedical data. PLoS ONE, 12(4), e0176536.
Maass, W. (1997). Network of spiking neuron: The third generation of neural network models. Neural Network, 10(9), 1659–1671.
Mohanty, R., Mallik, B. K., & Solanki, S. S. (2020a). Automatic bird species recognition system using neural network based on spike. Applied Acoustics, 161(2020), 1–8. https://doi.org/10.1016/j.apacoust.2019.107177
Mohanty, R., Mallik, B. K., & Solanki, S. S. (2020b). Recognition of bird species based on spike model using bird dataset. Data in Brief, 29(2020), 1–5. https://doi.org/10.1016/j.dib.2020.105301
Mohemmed, A., et al. (2012). SPAN: Spike pattern association neuron for learning spatio-temporal spike patterns. International Journal of Neural Systems, 22(4), 1250012.
Potamitis, I. (2014). Automatic classification of a taxon-rich community recorded in the wild. PLoS ONE, 9(5), e96936. https://doi.org/10.1371/journal.pone.0096936
Potamitis, I., Ntalampiras, S., Jahn, O., & Riede, K. (2014a). Automatic bird sound detection in long real-field recordings: Applications and tools. Applied Acoustics, 80, 1–9. https://doi.org/10.1016/j.apacoust.2014.01.001
Qiao, B., Zhou, Z., Yang, H., & Cao, J. (2017). Bird species recognition based on SVM classifier and decision tree. In 2017 First international conference on electronics instrumentation & information systems (EIIS) (pp. 1–4).
Rueckauer, B., Lungu, I. A., & Hu, Y. (2017). Conversion of continuous-valued deep networks to efficient event-driven networks for image classification. Frontiers in Neuroscience, 11(682), 1282–1287.
Song, S., Miller, K. D., & Abbott, L. F. (2003). Competitive Hebbian learning through spike-timing-dependent synaptic plasticity. Nature Neuroscience, 3(9), 919–926.
Stimpert, A. K., et al. (2015). Sound production and associated behavior of tagged fin whales (Balaenoptera physalus) in the Southern California Bight. Animal Biotelemetry, 3(1), 3–23.
Van Pham, N., Pham, L. T., Nguyen, T. D., & Ngo, L. T. (2018). A new cluster tendency assessment method for fuzzy co-clustering in hyperspectral image analysis. Neurocomputing. https://doi.org/10.1016/j.neucom.2018.04.022
Ventura, T. M. (2015). Audio parameterization with robust frame selection for improved bird identification. Expert Systems with Applications, 42, 8463–8471.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Mohanty, R., Bhuyan, H.K., Pani, S.K. et al. Bird species recognition using spiking neural network along with distance based fuzzy co-clustering. Int J Speech Technol 26, 681–694 (2023). https://doi.org/10.1007/s10772-023-10040-1
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10772-023-10040-1