IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences
Online ISSN : 1745-1337
Print ISSN : 0916-8508
Regular Section
An Improved Speech / Nonspeech Classification Based on Feature Combination for Audio Indexing
Ji-Soo KEUM, Hyon-Soo LEE, Masafumi HAGIWARA
JOURNAL RESTRICTED ACCESS

2010 Volume E93.A Issue 4 Pages 830-832

Abstract

In this letter, we propose an improved speech/nonspeech classification method for classifying multimedia sources effectively. To improve performance, we introduce a feature based on spectral duration analysis and combine it with recently proposed features: high zero-crossing-rate ratio (HZCRR), low short-time energy ratio (LSTER), and pitch ratio (PR). In our experiments on speech, music, and environmental sounds, the proposed method achieved better classification results than conventional approaches.
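
The abstract names HZCRR and LSTER without defining them. As a rough illustration only, the sketch below uses the definitions commonly cited in the speech/music discrimination literature: HZCRR is the fraction of frames in an analysis window whose zero-crossing rate exceeds 1.5 times the window average, and LSTER is the fraction of frames whose short-time energy falls below 0.5 times the window average. The thresholds, the non-overlapping framing, and all function names are assumptions made for this example; the letter itself may compute these features differently, and the spectral duration feature and pitch ratio are not covered here.

```python
def frames(signal, frame_len):
    """Split samples into non-overlapping frames; an incomplete tail is dropped."""
    return [signal[i:i + frame_len]
            for i in range(0, len(signal) - frame_len + 1, frame_len)]


def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
    return crossings / (len(frame) - 1)


def short_time_energy(frame):
    """Mean squared amplitude of one frame."""
    return sum(x * x for x in frame) / len(frame)


def hzcrr(signal, frame_len, factor=1.5):
    """Fraction of frames whose ZCR exceeds factor * (average ZCR).

    factor=1.5 is the commonly cited value, assumed here.
    """
    zcrs = [zero_crossing_rate(f) for f in frames(signal, frame_len)]
    if not zcrs:
        raise ValueError("signal is shorter than one frame")
    avg = sum(zcrs) / len(zcrs)
    return sum(1 for z in zcrs if z > factor * avg) / len(zcrs)


def lster(signal, frame_len, factor=0.5):
    """Fraction of frames whose energy is below factor * (average energy).

    factor=0.5 is the commonly cited value, assumed here.
    """
    energies = [short_time_energy(f) for f in frames(signal, frame_len)]
    if not energies:
        raise ValueError("signal is shorter than one frame")
    avg = sum(energies) / len(energies)
    return sum(1 for e in energies if e < factor * avg) / len(energies)
```

Under these definitions, speech tends to score higher on both ratios than music because it alternates between voiced, unvoiced, and silent segments, which is why such features are useful inputs to a speech/nonspeech classifier.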

© 2010 The Institute of Electronics, Information and Communication Engineers