An Approach to Intelligent Signal Processing

Wolff, Matthias; Hoffmann, Rüdiger

doi:10.1007/978-3-642-34584-5_1

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 7403)

Abstract

This paper describes an approach to intelligent signal processing. First we propose a general signal model which applies to speech, music, biological, and technical signals. We formulate this model mathematically using a unification of hidden Markov models and finite state machines. Then we name tasks for intelligent signal processing systems and derive a hierarchical architecture which is capable of solving them. We show the close relationship of our approach to cognitive dynamic systems. Finally we give a number of application examples.
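The abstract does not spell out how hidden Markov models and finite state machines are unified. As a rough illustration of one common way to connect the two, the sketch below rewrites a small discrete HMM as a set of weighted arcs and finds the cheapest accepting path, which coincides with Viterbi decoding. This is not the authors' formulation: the two-state model, its probabilities, and the names `hmm_to_arcs` and `best_path` are hypothetical and chosen only for this example.

```python
import math

# Hypothetical two-state HMM over a discrete alphabet; all numbers are illustrative.
initial = {"A": 0.6, "B": 0.4}
transition = {"A": {"A": 0.7, "B": 0.3}, "B": {"A": 0.4, "B": 0.6}}
emission = {"A": {"x": 0.9, "y": 0.1}, "B": {"x": 0.2, "y": 0.8}}


def hmm_to_arcs(initial, transition, emission):
    """Rewrite the HMM as weighted arcs (source, symbol, target, weight).

    Weights are negative log probabilities, so weights add along a path and
    the most probable path is the one with the smallest total weight.
    """
    arcs = []
    # Arcs leaving an artificial start state carry the initial probabilities.
    for target, p_init in initial.items():
        for symbol, p_emit in emission[target].items():
            arcs.append(("<start>", symbol, target, -math.log(p_init * p_emit)))
    # Every other arc combines a transition with the emission of its target state.
    for source, row in transition.items():
        for target, p_trans in row.items():
            for symbol, p_emit in emission[target].items():
                arcs.append((source, symbol, target, -math.log(p_trans * p_emit)))
    return arcs


def best_path(arcs, observations):
    """Return (weight, state sequence) of the cheapest path accepting the observations."""
    frontier = {"<start>": (0.0, [])}
    for symbol in observations:
        next_frontier = {}
        for source, arc_symbol, target, weight in arcs:
            if arc_symbol != symbol or source not in frontier:
                continue
            cost, path = frontier[source]
            candidate = (cost + weight, path + [target])
            # Keep only the cheapest way of reaching each target state.
            if target not in next_frontier or candidate[0] < next_frontier[target][0]:
                next_frontier[target] = candidate
        frontier = next_frontier
    return min(frontier.values(), key=lambda item: item[0])


arcs = hmm_to_arcs(initial, transition, emission)
weight, path = best_path(arcs, ["x", "x", "y"])
print(path, math.exp(-weight))
```

Negative log weights are used here so that multiplying probabilities becomes adding path weights, which is the usual convention when an HMM is handled with weighted automata tools.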

Author information

Authors and Affiliations

Lehrstuhl Kommunikationstechnik, Brandenburgische Technische Universität Cottbus, 03046, Cottbus, Germany
Matthias Wolff
Systemtheorie und Sprachtechnologie, Technische Universität Dresden, 01062, Dresden, Germany
Rüdiger Hoffmann

Editor information

Editors and Affiliations

Department of Psychology, and IIASS, Seconda Università degli Studi di Napoli, Italy
Anna Esposito
Istituto Nazionale di Geofisica e Vulcanologia, sezione di Napoli Osservatorio Vesuviano, Napoli, Italy
Antonietta M. Esposito
School of Computing Science, University of Glasgow, Glasgow, UK
Alessandro Vinciarelli
Laboratory of Acoustics and Speech Communication, Technische Universität Dresden, 01062, Dresden, Germany
Rüdiger Hoffmann
Dept. of Humanities and Social Sciences, Anatolia College/ACT, P.O. Box 21021, 55510, Pylaia, Greece
Vincent C. Müller

About this paper

Cite this paper

Wolff, M., Hoffmann, R. (2012). An Approach to Intelligent Signal Processing. In: Esposito, A., Esposito, A.M., Vinciarelli, A., Hoffmann, R., Müller, V.C. (eds) Cognitive Behavioural Systems. Lecture Notes in Computer Science, vol 7403. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-34584-5_1

DOI: https://doi.org/10.1007/978-3-642-34584-5_1
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-34583-8
Online ISBN: 978-3-642-34584-5
eBook Packages: Computer Science, Computer Science (R0)
