Abstract

The sources separated by most single channel audio source separation techniques are usually distorted and each separated source contains residual signals from the other sources. To tackle this problem, we propose to enhance the separated sources to decrease the distortion and interference between the separated sources using deep neural networks (DNNs). Two different DNNs are used in this work. The first DNN is used to separate the sources from the mixed signal. The second DNN is used to enhance the separated signals. To consider the interactions between the separated sources, we propose to use a single DNN to enhance all the separated sources together. To reduce the residual signals of one source from the other separated sources (interference), we train the DNN for enhancement discriminatively to maximize the dissimilarity between the predicted sources. The experimental results show that using discriminative enhancement decreases the distortion and interference between the separated sources.

Bibtex


@inbook{Grais_2017,
  author = {Grais, Emad M. and Roma, G. and Simpson, A. J. R. and Plumbley, M. D.},
  editor = {Tichavsk{\'y}, Petr and Babaie-Zadeh, Massoud and Michel, Olivier J.J. and Thirion-Moreau, Nad{\`e}ge},
  title = {Discriminative Enhancement for Single Channel Audio Source Separation Using Deep Neural Networks},
  booktitle = {Latent Variable Analysis and Signal Separation: 13th International Conference, LVA/ICA 2017, Grenoble, France, February 21-23, 2017, Proceedings},
  year = {2017},
  publisher = {Springer International Publishing},
  address = {Cham},
  pages = {236--246},
  isbn = {978-3-319-53547-0},
  doi = {10.1007/978-3-319-53547-0_23},
  openaccess = {http://epubs.surrey.ac.uk/813112/},
  keywords = {"maruss"}
}