TR2021-020

Deep clustering-based single-channel speech separation and recent advances


Abstract:

The recently proposed deep clustering algorithm brought significant advances in single-channel, speaker-independent, multi-speaker speech separation. In this paper, we review deep clustering and an improved variant of it, the chimera network. We also describe our architectures for reducing the latency of deep clustering by combining block processing with teacher-student learning. Finally, we describe the unfolding of a phase reconstruction algorithm and a complex mask estimation method for speech separation.

 

  • Related Publication

  •  Aihara, R., Wichern, G., Le Roux, J., "Deep clusteringによるシングルチャネル音声分離とその発展" [Deep clustering-based single-channel speech separation and recent advances], The Journal of the Acoustical Society of Japan, Vol. 76, No. 2, pp. 101-108, April 2020. DOI: 10.20697/jasj.76.2_101
    @article{Aihara2020apr,
      author = {Aihara, Ryo and Wichern, Gordon and Le Roux, Jonathan},
      title = {Deep clusteringによるシングルチャネル音声分離とその発展},
      journal = {The Journal of the Acoustical Society of Japan},
      year = 2020,
      volume = 76,
      number = 2,
      pages = {101--108},
      month = apr,
      doi = {10.20697/jasj.76.2_101},
      url = {https://www.jstage.jst.go.jp/article/jasj/76/2/76_101/_article/-char/en}
    }