Overview

Authors:

K. Sreenivasa Rao, N. P. Narendra

K. Sreenivasa Rao: Department of Computer Science and Engineering, Indian Institute of Technology Kharagpur, Kharagpur, India
N. P. Narendra: Aalto University, Espoo, Finland

Presents efficient excitation source modeling techniques for generating high-quality speech
Combines waveform and parametric methods to enhance the quality of synthesis
Features methods that need less memory and computation than others, allowing them to be integrated into smartphones and smaller devices

Part of the book series: SpringerBriefs in Speech Technology (BRIEFSSPEECHTECH)

About this book

This book presents a statistical parametric speech synthesis (SPSS) framework in which the desired speech is generated from the parameters of the vocal tract and the excitation source. Throughout the book, the authors discuss novel source modeling techniques that enhance the naturalness and overall intelligibility of the SPSS system, and they provide several methods and models for generating the excitation source parameters to improve the overall quality of synthesized speech. The contents are useful for both researchers and system developers. Researchers can use the book to learn the current state-of-the-art excitation source models for SPSS and to refine those source models further to incorporate the realistic semantics present in the text. System developers can use it to integrate the sophisticated excitation source models described into the latest mobile phones and smartphones.

Table of contents (7 chapters)

Front Matter
Pages i-xii
Introduction
- K. Sreenivasa Rao, N. P. Narendra
Pages 1-10
Background and Literature Review
- K. Sreenivasa Rao, N. P. Narendra
Pages 11-27
Robust Voicing Detection and F₀ Estimation Method
- K. Sreenivasa Rao, N. P. Narendra
Pages 29-52
Parametric Approach of Modeling the Source Signal
- K. Sreenivasa Rao, N. P. Narendra
Pages 53-74
Hybrid Approach of Modeling the Source Signal
- K. Sreenivasa Rao, N. P. Narendra
Pages 75-103
Generation of Creaky Voice
- K. Sreenivasa Rao, N. P. Narendra
Pages 105-124
Summary and Conclusions
- K. Sreenivasa Rao, N. P. Narendra
Pages 125-129
Back Matter
Pages 131-136

Authors and Affiliations

K. Sreenivasa Rao: Department of Computer Science and Engineering, Indian Institute of Technology Kharagpur, Kharagpur, India
N. P. Narendra: Aalto University, Espoo, Finland

About the authors

K. Sreenivasa Rao is currently a Professor at IIT Kharagpur, where he has taught since 2007. He has also worked at IIT Guwahati and IIT Madras. He received his PhD from IIT Madras in 2005. He is the author of 8 books, 68 journal articles, 2 patents, 25 book chapters, and 140 conference proceedings.

N. P. Narendra is a Postdoctoral Researcher at Aalto University. He received his PhD from IIT Kharagpur in 2016. He has published 7 journal articles, 3 book chapters, and 15 conference proceedings.

Bibliographic Information

Book Title: Source Modeling Techniques for Quality Enhancement in Statistical Parametric Speech Synthesis
Authors: K. Sreenivasa Rao, N. P. Narendra
Series Title: SpringerBriefs in Speech Technology
DOI: https://doi.org/10.1007/978-3-030-02759-9
Publisher: Springer Cham
eBook Packages: Engineering, Engineering (R0)
Copyright Information: The Author(s), under exclusive licence to Springer Nature Switzerland AG 2019
Softcover ISBN: 978-3-030-02758-2, Published: 28 January 2019
eBook ISBN: 978-3-030-02759-9, Published: 13 December 2018
Series ISSN: 2191-737X
Series E-ISSN: 2191-7388
Edition Number: 1
Number of Pages: XII, 136
Number of Illustrations: 63 b/w illustrations, 11 illustrations in colour
Topics: Signal, Image and Speech Processing, Natural Language Processing (NLP), Computational Linguistics
