An Improved TTS Model and Algorithm for Web Voice Browser

  • Conference paper
Agent Computing and Multi-Agent Systems (PRIMA 2006)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4088))

The paper describes a Web voice browser based on an improved text-to-speech (TTS) algorithm and architecture, which makes Internet content available by voice. A visual and audible Web browser is discussed in terms of the HTML files to be tuned with the TTS and speech recognition processes. The voice evaluation results show that the system has better voice quality and data identifiability than other voice browsers.

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Liao, R., Ji, Y., Li, H. (2006). An Improved TTS Model and Algorithm for Web Voice Browser. In: Shi, ZZ., Sadananda, R. (eds) Agent Computing and Multi-Agent Systems. PRIMA 2006. Lecture Notes in Computer Science, vol 4088. Springer, Berlin, Heidelberg.

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-36707-9

  • Online ISBN: 978-3-540-36860-1

  • eBook Packages: Computer Science, Computer Science (R0)
