Music remixing is difficult when the original multitrack recording is not available. One solution is to estimate the elements of a mixture using source separation. However, existing techniques suffer from imperfect separation and perceptible artifacts on single separated sources. To investigate their influence on a remix, five state-of-the-art source separation algorithms were used to remix six songs by increasing the level of the vocals. A listening test was conducted to assess the remixes in terms of loudness balance and sound quality. The results show that some source separation algorithms are able to increase the level of the vocals by up to 6 dB at the cost of introducing a small but perceptible degradation in sound quality.
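The remixing operation described above, raising the vocal level by a fixed number of decibels, can be illustrated with a minimal sketch. It assumes the separation step has already produced two time-aligned estimates, a vocal stem and an accompaniment stem. The names `db_to_gain`, `remix`, `vocals_est`, and `accomp_est` are hypothetical and are not taken from the paper's code, and the sample values are toy numbers rather than real audio. The sketch only shows the level arithmetic: a change of L dB corresponds to an amplitude factor of 10^(L/20), so +6 dB roughly doubles the amplitude.

```python
def db_to_gain(level_db: float) -> float:
    """Convert a level change in dB to a linear amplitude factor."""
    return 10.0 ** (level_db / 20.0)


def remix(vocals, accompaniment, vocal_boost_db):
    """Scale the estimated vocal stem and add it back to the accompaniment.

    Both stems are plain sequences of samples and must have equal length.
    This is an illustrative assumption about how a remix could be formed,
    not the procedure used in the paper.
    """
    if len(vocals) != len(accompaniment):
        raise ValueError("stems must have the same number of samples")
    gain = db_to_gain(vocal_boost_db)
    return [gain * v + a for v, a in zip(vocals, accompaniment)]


# Toy sample values standing in for separated stems (not real audio).
vocals_est = [0.10, -0.20, 0.05]
accomp_est = [0.30, 0.10, -0.40]

# Raise the vocals by 6 dB, the largest boost mentioned in the abstract.
remixed = remix(vocals_est, accomp_est, vocal_boost_db=6.0)
```

With ideal stems this would change only the loudness balance. With estimated stems, any interference or artifacts in the vocal estimate are amplified by the same factor, which is why the perceived sound quality of the remix has to be assessed separately.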


  title = {Perceptual Evaluation of Source Separation for Remixing Music},
  author = {Wierstorf, H. and Ward, D. and Grais, E. M. and Plumbley, M. D. and Mason, R. D. and Hummersone, C.},
  booktitle = {143rd Convention of the Audio Engineering Society},
  address = {New York, NY},
  pages = {Paper 9880},
  month = oct,
  year = {2017},
  openaccess = {http://epubs.surrey.ac.uk/844663/},
  keywords = {"maruss"}

Supplementary material

The listening test results presented in this paper, together with the test procedure, are available at 10.5281/zenodo.835191. The statistical analysis and the code to reproduce the figures are available at 10.5281/zenodo.835196. The stimuli used in the test are available at 10.5281/zenodo.835182. The presentation, including the sound files, can be downloaded from 10.5281/zenodo.1034175.