Loading [a11y]/accessibility-menu.js
Blind speaker counting in highly reverberant environments by clustering coherence features | IEEE Conference Publication | IEEE Xplore

Blind speaker counting in highly reverberant environments by clustering coherence features


Abstract:

This paper proposes the use of the frequency- domain Magnitude Squared Coherence (MSC) between two ad- hoc recordings of speech as a reliable speaker discrimination featu...Show More

Abstract:

This paper proposes the use of the frequency- domain Magnitude Squared Coherence (MSC) between two ad- hoc recordings of speech as a reliable speaker discrimination feature for source counting applications in highly reverberant environments. The proposed source counting method does not require knowledge of the microphone spacing and does not assume any relative distance between the sources and the microphones. Source counting is based on clustering the frequency domain MSC of the speech signals derived from short time segments. Experiments show that the frequency domain MSC is speaker-dependent and the method was successfully used to obtain highly accurate source counting results for up to six active speakers for varying levels of reverberation and microphone spacing.
Date of Conference: 12-15 December 2017
Date Added to IEEE Xplore: 08 February 2018
ISBN Information:
Conference Location: Kuala Lumpur, Malaysia

References

References is not available for this document.