ISCA Archive Interspeech 2020
ISCA Archive Interspeech 2020

Mentoring-Reverse Mentoring for Unsupervised Multi-Channel Speech Source Separation

Yu Nakagome, Masahito Togami, Tetsuji Ogawa, Tetsunori Kobayashi

Mentoring-reverse mentoring, which is a novel knowledge transfer framework for unsupervised learning, is introduced in multi-channel speech source separation. This framework aims to improve two different systems, which are referred to as a senior and a junior system, by mentoring each other. The senior system, which is composed of a neural separator and a statistical blind source separation (BSS) model, generates a pseudo-target signal. The junior system, which is composed of a neural separator and a post-filter, was constructed using teacher-student learning with the pseudo-target signal generated from the senior system i.e, imitating the output from the senior system (mentoring step). Then, the senior system can be improved by propagating the shared neural separator of the grown-up junior system to the senior system (reverse mentoring step). Since the improved neural separator can give better initial parameters for the statistical BSS model, the senior system can yield more accurate pseudo-target signals, leading to iterative improvement of the pseudo-target signal generator and the neural separator. Experimental comparisons conducted under the condition where mixture-clean parallel data are not available demonstrated that the proposed mentoring-reverse mentoring framework yielded improvements in speech source separation over the existing unsupervised source separation methods.


doi: 10.21437/Interspeech.2020-2082

Cite as: Nakagome, Y., Togami, M., Ogawa, T., Kobayashi, T. (2020) Mentoring-Reverse Mentoring for Unsupervised Multi-Channel Speech Source Separation. Proc. Interspeech 2020, 86-90, doi: 10.21437/Interspeech.2020-2082

@inproceedings{nakagome20_interspeech,
  author={Yu Nakagome and Masahito Togami and Tetsuji Ogawa and Tetsunori Kobayashi},
  title={{Mentoring-Reverse Mentoring for Unsupervised Multi-Channel Speech Source Separation}},
  year=2020,
  booktitle={Proc. Interspeech 2020},
  pages={86--90},
  doi={10.21437/Interspeech.2020-2082}
}