ISCA Archive Interspeech 2018

Information Bottleneck Based Percussion Instrument Diarization System for Taniavartanam Segments of Carnatic Music Concerts

Nauman Dawalatabad, Jom Kuriakose, Chandra Sekhar Chellu, Hema Murthy

This paper proposes an approach to diarize the taniavartanam segments of a Carnatic music concert. The information bottleneck (IB) based approach used for speaker diarization is applied to this task. The conventional IB system initializes the segments to be clustered uniformly, with a fixed duration. The difficulty in diarizing percussion instruments in a taniavartanam is that the stroke rate varies widely across segments: it can double or even quadruple within a short duration, so the information rate differs from segment to segment. To address this issue, the IB system is modified to use stroke rate information to divide the audio into segments of varying duration. These variable-duration segments are then clustered using the IB approach, followed by Kullback-Leibler hidden Markov model (KL-HMM) based realignment of the instrument boundaries. The performance of the conventional IB system and the proposed system is evaluated on a standard Carnatic music dataset. The proposed technique shows a best-case absolute improvement of 8.2% in diarization error rate over the conventional IB based system.
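To illustrate the difference between uniform initialization and stroke-rate-driven initialization, the following is a minimal Python sketch. It is not the authors' implementation. It assumes stroke onset times are already available, and it places a boundary after every fixed number of strokes, which is one simple way to make segment duration follow stroke rate. The abstract does not specify the actual rule used. The names `uniform_segments`, `stroke_rate_segments`, and `strokes_per_segment` are hypothetical.

```python
from typing import List, Sequence, Tuple

Segment = Tuple[float, float]  # (start_time, end_time) in seconds


def uniform_segments(total_duration: float, seg_len: float) -> List[Segment]:
    """Fixed-duration initialization, as in a conventional IB system."""
    segments = []
    start = 0.0
    while start < total_duration:
        end = min(start + seg_len, total_duration)
        segments.append((start, end))
        start = end
    return segments


def stroke_rate_segments(
    onsets: Sequence[float],
    total_duration: float,
    strokes_per_segment: int,
) -> List[Segment]:
    """Variable-duration initialization (illustrative assumption):
    start a new segment at every `strokes_per_segment`-th stroke onset,
    so faster passages yield shorter segments."""
    if strokes_per_segment < 1:
        raise ValueError("strokes_per_segment must be at least 1")
    # Keep only onsets that fall inside the recording, in time order.
    onsets = sorted(t for t in onsets if 0.0 <= t < total_duration)
    boundaries = [0.0]
    for i in range(strokes_per_segment, len(onsets), strokes_per_segment):
        boundaries.append(onsets[i])
    boundaries.append(total_duration)
    # Drop zero-length segments that can arise from coincident boundaries.
    return [(a, b) for a, b in zip(boundaries, boundaries[1:]) if b > a]


if __name__ == "__main__":
    # Toy onsets: the stroke rate doubles, then doubles again.
    onsets = [0.0, 1.0, 2.0, 3.0, 3.5, 4.0, 4.5, 5.0, 5.25, 5.5, 5.75]
    print(uniform_segments(6.0, 2.5))
    print(stroke_rate_segments(onsets, 6.0, 4))
```

With the toy onsets above, the stroke-rate-driven segments shrink as the playing speeds up, whereas the uniform segments keep the same length regardless of how many strokes they contain. The resulting segments would then be passed to the clustering stage, which is outside the scope of this sketch.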


doi: 10.21437/Interspeech.2018-1203

Cite as: Dawalatabad, N., Kuriakose, J., Chellu, C.S., Murthy, H. (2018) Information Bottleneck Based Percussion Instrument Diarization System for Taniavartanam Segments of Carnatic Music Concerts. Proc. Interspeech 2018, 1215-1219, doi: 10.21437/Interspeech.2018-1203

@inproceedings{dawalatabad18_interspeech,
  author={Nauman Dawalatabad and Jom Kuriakose and Chandra Sekhar Chellu and Hema Murthy},
  title={{Information Bottleneck Based Percussion Instrument Diarization System for Taniavartanam Segments of Carnatic Music Concerts}},
  year=2018,
  booktitle={Proc. Interspeech 2018},
  pages={1215--1219},
  doi={10.21437/Interspeech.2018-1203}
}