To read this content please select one of the options below:

Intelligent recognition system for viewpoint variations on gait and speech using CNN-CapsNet

G. Merlin Linda (Department of Computer Science and Engineering, SRM Institute of Science and Technology–Vadapalani Campus, Chennai, India)
N.V.S. Sree Rathna Lakshmi (Department of Electronics and Communication Engineering, Agni College of Technology, Chennai, India)
N. Senthil Murugan (School of Computer Science and Engineering, VIT-AP University, Amaravati, India)
Rajendra Prasad Mahapatra (Department of Computer Science and Engineering, SRM Institute of Science and Technology, Ghaziabad, India)
V. Muthukumaran (Department of Mathematics, School of Applied Sciences, REVA University, Bangalore, India)
M. Sivaram (Faculty of Information Technology, Ton Duc Thang University, Ho Chi Minh City, Vietnam)

International Journal of Intelligent Computing and Cybernetics

ISSN: 1756-378X

Article publication date: 12 November 2021

Issue publication date: 6 July 2022

138

Abstract

Purpose

The paper aims to introduce an intelligent recognition system for viewpoint variations of gait and speech. It proposes a convolutional neural network-based capsule network (CNN-CapsNet) model and outlining the performance of the system in recognition of gait and speech variations. The proposed intelligent system mainly focuses on relative spatial hierarchies between gait features in the entities of the image due to translational invariances in sub-sampling and speech variations.

Design/methodology/approach

This proposed work CNN-CapsNet is mainly used for automatic learning of feature representations based on CNN and used capsule vectors as neurons to encode all the spatial information of an image by adapting equal variances to change in viewpoint. The proposed study will resolve the discrepancies caused by cofactors and gait recognition between opinions based on a model of CNN-CapsNet.

Findings

This research work provides recognition of signal, biometric-based gait recognition and sound/speech analysis. Empirical evaluations are conducted on three aspects of scenarios, namely fixed-view, cross-view and multi-view conditions. The main parameters for recognition of gait are speed, change in clothes, subjects walking with carrying object and intensity of light.

Research limitations/implications

The proposed CNN-CapsNet has some limitations when considering for detecting the walking targets from surveillance videos considering multimodal fusion approaches using hardware sensor devices. It can also act as a pre-requisite tool to analyze, identify, detect and verify the malware practices.

Practical implications

This research work includes for detecting the walking targets from surveillance videos considering multimodal fusion approaches using hardware sensor devices. It can also act as a pre-requisite tool to analyze, identify, detect and verify the malware practices.

Originality/value

This proposed research work proves to be performing better for the recognition of gait and speech when compared with other techniques.

Keywords

Citation

Merlin Linda, G., Sree Rathna Lakshmi, N.V.S., Murugan, N.S., Mahapatra, R.P., Muthukumaran, V. and Sivaram, M. (2022), "Intelligent recognition system for viewpoint variations on gait and speech using CNN-CapsNet", International Journal of Intelligent Computing and Cybernetics, Vol. 15 No. 3, pp. 363-382. https://doi.org/10.1108/IJICC-08-2021-0178

Publisher

:

Emerald Publishing Limited

Copyright © 2021, Emerald Publishing Limited

Related articles