Abstract
To assist an investigation process, forensic experts compare and analyze audio recordings. Speech utterances are compared by humans and/or machines for use in court for investigation. Scientific research community insists for specific automatic or human-based approach to identify uniquxy2e audio features from identical twins group. Filters can be employed to enhance an audio recording for improving clarity. This may entail removal of unnecessary noise to enrich the intelligibility of speech. Forensic audio experts can examine a variety of characteristics of the audio recording to decide the possibility of alterations in the collected evidences. This includes confirming the integrity and authenticating that the content is what it purports to be. Thiswork named as FIST(Forensic Investigation for Twin Identification from Speech: Perceptual and Gamma-tone Features and Models) proposes an automated system to identify a twin from identical twin pairs by the use of gamma-tone features and perceptual features.The proposed features are excerpted from the set of training speeches and templates are created for each twin based on vector quantisation (VQ), Fuzzy C means clustering (FCM) and multivariate hidden Markov modelling (MHMM) techniques. For testing, features are extracted from the set of test utterances and worked out to the templates for classification. Based on the type of classifier used, classification of twin is carried out with minimum distance and maximum loglikelihood value. The proposed features are examinedfor sub-optimal and true success rates as key performance metrics to assess the system and also a comparative analysis is made across the proposed features. Among the inspected features, Gammatone energy features expose better performance in comparison to perceptual features by attaining the overall sub-optimal success rate and true success rate as97.8375% and 92.75% for Gammatone energy features with VQ based modelling technique. This work FIST has also been analysed by inducing disturbance in the form of speech interference from their own twin pairs and Gamma-tone energy feature with VQ based modelling technique performs better for twin identification. A high claim of 99.625% and 95.0625% accuracy has been achieved by employing decision level fusion classifier.







Similar content being viewed by others
References
Ariyaeeinia A, Morrison C, Malegaonkar A, Black S (2008) A test of the effectiveness of speaker verification for differentiating between identical twins. Science and Justice 48:182–186
Bolt RH, Cooper FS, Green DM, Hamlet SL, McKnight JG, Pickett JM, Tosi O, Underwood BD, Hogan DL (1979) On the theory and practice of voice identification. Washington, D.C, National Research Council, National Academy of Sciences
Debruyne F, Decoster W, Van Gijsel A, Vercammen J (2002) Speaking fundamental frequency in monozygotic and dizygotic twins. Journal of Voice 4:466–471
Fawcett T (2006) An introduction to ROC analysis. Pattern Recogn Lett 27(8):861–874
Feiser HS (2009) Acoustic similarities and differences in the voices of same-sex siblings. 18th Annual Conference of the International Association for Forensic Phonetics and Acoustics (IAFPA). UK, Cambridge
Künzel HJ (2010) Automatic speaker recognition of identical twins. International Journal of Speech, Language and the Law 17(2):251–277
Leng, L., & Teoh, ABJ, “Alignment-free row-co-occurrence cancelable palmprint fuzzy vault”, Pattern Recognition, 48(7), 2290-2303. 2015. Comment 2
Leng L, Teoh ABJ (2015) Dual-key-binding cancelable palmprint cryptosystem for palmprint protection and information security. Journal of Network and Computer Applications 34(6):1979–1989 Comment 2
Leng L, Zhang J, Khan MK, Chen X, Alghathbar K (2010) Dynamic weighted discrimination power analysis: a novel approach for face and palmprint recognition in DCT domain. International Journal of the Physical Sciences 5(17):2543–2554 Comment 1 and comment 3
Leng, Lu, et al. (2011) Two-directional two-dimensional random projection and its variations for face and palmprint recognition. International conference on computational science and its applications. Springer, Berlin, Heidelberg, page no -458-470. Comment 1
Leng L, Teoh ABJ, Li M, Khan MK (2014) A remote cancelable palmprint authentication protocol based on multi-directional two-dimensional PalmPhasor-fusion. Security and Communication Networks 7(11):1860–1871 Comment 2
Leng, L., Li, M., Kim, C., & Bi, X, “Dual-source discrimination power analysis for multi-instance contactless palmprint recognition”, Multimedia Tools and Applications, 76(1), 333-354. 2017. Comment 2
Nolan F, Oh T (1996) Identical twins, different voices. International Journal of Speech Language and the Law 3(1):39–49
Patil HA, Basu TK (2004) Teager energy melcepstrum for identification of twins in Marathi. Proceedings of the IEEE INDICON 2004. First India Annual Conference, 58-61
Przybyla BD, Horii Y, Crawford MH (1992) Vocal fundamental frequency in a twin sample: Looking for a genetic effect. Journal of Voice 6(3):261–266
Rabiner L, Juang BH (1993) Fundamentals of speech recognition, Prentice Hall
Revathi A, Venkataramani Y (2008) Use of perceptual features in iterative clustering based twins identification system. 2008 International Conference on Computing, Communication and Networking, page no-1-6
Revathi A, Chinnadurai R, Venkataramani Y (2007) Effectiveness of LP derived features and DCTC in twins identification- iterative speaker clustering approach. International Conference on Computational Intelligence and Multimedia Applications, page no-535-539
Revathi A, Sasikaladevi N, Nagakrishnan R, Jeyalakshmi C (September 2018) Robust emotion recognition from speech: Gamma tone features and models. International Journal of speech technology 21(3):723–739
Rose P (2002) Forensic speaker identification. Taylor & Francis, London
Ryalls J, Shaw H, Simon M (2004) Voice onset time production in older and younger female monozygotic twins. Folia PhoniatLogopaed 56:165–169
Sabatier SB, Trester MR, Dawson JM (2019) Measurement of the impact of identical twin voices on automatic speaker recognition. International journal on Measurements 134:385–389
San Segundo, E (2012) Glottal source parameters for forensic voice comparison: An approach to voice quality in twins' voices. 21st Annual Conference of the International Association for Forensic Phonetics and Acoustics (IAFPA)
San Segundo E (2013) Guess who is laughing: a perceptual experiment on twin and non-twin siblings’ identification. 31st International Conference AESLA
SanSegundo E, Gómez-Vilda P (2013) Voice biometrical match of twin and non-twin siblings. 8th international Workshow on models and analysis of vocal emissions for biomedical Applications (MAVEBA 2013), page no – 253-256
San Segundo E (2013) A phonetic corpus of Spanish male twins and siblings: Corpus design and forensic application. Proceedia - social and Behavioral sciences 95:59–67
Segundo ES, Kunzel H (2005) Automatic speaker recognition of spanish siblings: (monozygotic and dizygotic) twins and non-twin brothers. Proceedings of the IEEE INDICON 2004, First India Annual Conference 2:1–10
Sun Z, Paulino AA, Feng J, Chai Z, Tieniu, Anil K. Jain (2010) A study of multi biometric traits of identical twins. Proc. SPIE 7667, Biometric Technology for Human Identification VII
Van Gysel WD, Vercammen J, Debruyne (2001) Voice similarity in identical twins. Acta Oto-Rhino-Laryngologica Belg 55:49–55
Van Lierde KM, Vinck B, De Ley S, Clement G, Van Cauwenberge P (2004) Genetics of vocal quality characteristics in monozygotic twins: a multiparameter approach. Journal of voice 19(4):511–518
Zuo D, Mok PPK (2015) Formant dynamics of bilingual identical twins. Journal of Phonetics 52:1–12
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Ethical approval
This article does not contain any studies with human participants or animals performed by any of the authors.
Informed consent
This article does not contain any studies with human participants.
Conflict of interest
First author declares that she has no conflict of interest. Second and third authors declare that they have no conflict of interest.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
A., R., N., S. & K., G. Forensic investigation for twin identification from speech: perceptual and gamma-tone features and models. Multimed Tools Appl 80, 18301–18315 (2021). https://doi.org/10.1007/s11042-021-10639-z
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-021-10639-z