Time Sequence Features Extraction Algorithm of Lying Speech Based on Sparse CNN and LSTM

Zhou, Yan; Shang, Li

doi:10.1007/978-3-030-60799-9_8

Yan Zhou & Li Shang

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 12463)

Included in the following conference series:

International Conference on Intelligent Computing

Abstract

This paper proposes a time sequence feature extraction algorithm for lying speech based on a CNN-LSTM deep network. A sparse representation of the CNN is obtained by introducing the \( l_{1} \) norm into the CNN objective function; this sparse optimization overcomes the CNN's tendency to fall into local minima. First, the speech is preprocessed, and the spectrograms of the lying speech are fed into the sparse CNN model to extract local lying features. Second, a time sequence feature extraction model is established: the local lying features are fed into an LSTM network to extract lying features from a temporal perspective. Finally, a \( {\text{Softmax}} \) unit outputs the lie detection result. Experimental results show that, compared with traditional methods, the proposed model, which fuses local features with time sequence features, achieves a higher detection rate and good scalability. In short, the Sparse-CNN-LSTM feature extraction model offers a new approach for research on lying speech detection.
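The abstract does not give the exact objective function, so the following is only a minimal sketch of the general idea of adding an \( l_{1} \) penalty to a classification loss and reading the result from a Softmax unit. The cross-entropy data term, the weight `lam`, the two-class labelling, and all numeric values are assumptions made for illustration; none of them come from the paper.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of class scores."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(logits, label):
    """Negative log-probability of the true class (assumed data term)."""
    return -math.log(softmax(logits)[label])


def l1_penalty(weights):
    """l1 norm of the weights: the sum of their absolute values."""
    return sum(abs(w) for w in weights)


def sparse_objective(logits, label, weights, lam=0.01):
    """Data term plus lam * ||w||_1; lam is a hypothetical hyperparameter."""
    return cross_entropy(logits, label) + lam * l1_penalty(weights)


# Toy example with made-up values: class 0 = truthful, class 1 = lying.
logits = [0.2, 1.3]               # hypothetical network outputs
weights = [0.5, -0.25, 0.0, 1.0]  # hypothetical flattened CNN weights
probs = softmax(logits)
loss = sparse_objective(logits, 1, weights, lam=0.1)
```

Because the penalty grows with the magnitude of every weight, minimizing such an objective tends to push many weights toward zero, which is the usual sense in which an \( l_{1} \) term yields a sparse representation.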

Acknowledgement

The authors acknowledge the QingLan project of colleges and universities in Jiangsu province and the intelligent computing and knowledge learning research platform construction project of Suzhou Vocational University.

Author information

Authors and Affiliations

College of Electronic and Information Engineering, Suzhou Vocational University, Suzhou, Jiangsu, China
Yan Zhou & Li Shang

Corresponding author

Correspondence to Yan Zhou.

Editor information

Editors and Affiliations

Institute of Machine Learning and Systems Biology, Tongji University, Shanghai, China
De-Shuang Huang
Electrical and Electronics Department, Polytechnic University of Bari, Bari, Italy
Vitoantonio Bevilacqua
School of Computer Science and Mathematics, Liverpool John Moores University, Liverpool, UK
Abir Hussain

About this paper

Cite this paper

Zhou, Y., Shang, L. (2020). Time Sequence Features Extraction Algorithm of Lying Speech Based on Sparse CNN and LSTM. In: Huang, DS., Bevilacqua, V., Hussain, A. (eds) Intelligent Computing Theories and Application. ICIC 2020. Lecture Notes in Computer Science, vol 12463. Springer, Cham. https://doi.org/10.1007/978-3-030-60799-9_8

DOI: https://doi.org/10.1007/978-3-030-60799-9_8
Published: 05 October 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-60798-2
Online ISBN: 978-3-030-60799-9
eBook Packages: Computer Science, Computer Science (R0)
