ABSTRACT
An application which provides a service of producing speech for any text entered by the user is highly beneficial for the society. A person with speaking disabilities would benefit the most from it. In this paper we have proposed and developed a software system that performs noise reduction and text-to-speech conversion. For removal of noise, text entered by the user is checked for any misspelling or non-standard abbreviation. Each word in the text is processed as a separate token. Each token is checked using three available algorithms - Longest Common Subsequence, Levenshtein distance and Soundex algorithms. Using the above mentioned algorithms we select the best possible variant of the word from a pre-defined wordlist and we replace the word with the best possible replacement. After noise reduction, text is sent as input to a text-to-speech convertor which produces the corresponding sound file which can be played and is audible to the user. This sound file also could be saved for future use.
- Sharma, Kaushal, "Assisting persons with disability in the Age of Cybernetics and Technology" in the book - 'Education in Cybernetic Age', pp. 119--128, Sarup & Sons, 2006.Google Scholar
- Contractor, Danish, Kothari, Govind, Tanveer A. Faruquie, L. Venkata Subramaniam, Sumit Negi, "Handling noisy queries in cross language FAQ retrieval" EMNLP'10, In Proc. of the 2010 Conference on Empirical Methods in Natural Language Processing. Google ScholarDigital Library
- Schroeder M., Charfuelan M., Pammi S., and Türk O., "The MARY TTS entry in the Blizzard Challenge", 2008, in Proc. of the Blizzard Challenge, 2008.Google Scholar
- Zizhong Fan, Westat, Rockville, MD. "Matching Character Variables by Sound: A closer look at SOUNDEX function and Sounds-Like Operator (=*)" SUGI 2009, in Proc., pp. 072--29.Google Scholar
- "The Soundex Indexing System" National Archives and Records Administration. 2007-05-30. Retrieved 2010-12-24.Google Scholar
- Cormen, Thomas H., Leiserson, Charles E., Rivest, Ronald L., Stein, Clifford (2009) {1990}. Introduction to Algorithms (3rd ed.). MIT Press and McGraw-Hill. ISBN 0-262-03384-4.Google Scholar
- Gilleland Michael, "Levenshtein Distance, in Three Flavors", available at http://www.merriampark.com/ld.htmGoogle Scholar
- Mary Text-to-speech System Documentation, available at http://www.merriampark.com/ld.htmGoogle Scholar
- Schröder Marc, DFKI "Text-to-Speech synthesis using OpenMARY - An introduction and practical tutorial", eNTERFACE Amsterdam, 14 July 2010.Google Scholar
- Langer Akhil, Banga Rohit, Mittal Ankush, and Subramaniam L. V., "Variant Search and Syntactic Tree Similarity Based Approach to Retrieve Matching Questions for SMS queries" in Proc. of the fourth workshop on Analytics for noisy unstructured text data. Google ScholarDigital Library
- Govind Kothari, Sumit Negi, Tanveer A. Faruquie, Venkatesan T. Chakaravarthy and L. Venkata Subramaniam., "SMS based interface for FAQ retrieval", ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 2 - Volume 2. Google ScholarDigital Library
- "Soundex Algorithm" available at http://www.blackbeltcoder.com/Articles/algorithms/phonetic-string-comparison-with-soundexGoogle Scholar
Index Terms
- A novel software system to facilitate better and easier communication for people with speaking disabilities
Recommendations
Speaking in noise: How does the Lombard effect improve acoustic contrasts between speech and ambient noise?
What makes speech produced in the presence of noise (Lombard speech) more intelligible than conversational speech produced in quiet conditions? This study investigates the hypothesis that speakers modify their speech in the presence of noise in such a ...
Non-native disadvantage in spoken word recognition is due to lexical knowledge and not type/level of noise
Highlights- Two-talker babble compromised Mandarin word recognition in noise more than speech-shaped noise.
AbstractAdverse listening conditions typically trigger the use of linguistic knowledge, which helps the listener compensate for the impoverished acoustic signal. It is not clear, however, whether linguistic knowledge facilitates non-native ...
Rules and Algorithms for Phonetic Transcription of Standard Malay
Phonetic transcription of text is an indispensable component of text-to-speech (TTS) systems and is used in acoustic modeling for speech recognition and other natural language processing applications. One approach to the transcription of written text ...
Comments