The DTW is a method used in text-dependent Speaker Recognition. In this context, the training or testing data are composed by a sequence of acoustic vectors and the temporal order of the vectors is important. In order to compute likelihood or a distance between two of such sequences, two functions are needed, a frame to frame distance function and a frame mapping function, able to align the individual acoustic frames of both sequences. This time alignment function is mandatory as two occurrences of the same linguistic messages, pronounced or not by the same speaker, present different time characteristics, like the global pronunciation speed. If there is a training template T r with \(N_{T_R }\) frames and a test utterance T E consisting in a sequence of \(N_{T_E } \) frames, the DTW is able to find the time mapping function w(n) between T R and T E . In the figure 1, the tying function w(n) is illustrated by the tying of the T R frame at time x with the T E frame at time y. Thus, the...
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer Science+Business Media, LLC
About this entry
Cite this entry
(2009). Dynamic Time Warping (DTW). In: Li, S.Z., Jain, A. (eds) Encyclopedia of Biometrics. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-73003-5_768
Download citation
DOI: https://doi.org/10.1007/978-0-387-73003-5_768
Publisher Name: Springer, Boston, MA
Print ISBN: 978-0-387-73002-8
Online ISBN: 978-0-387-73003-5
eBook Packages: Computer ScienceReference Module Computer Science and Engineering