Type: Journal article
Title: Perceptually controlled doping for audio source separation
Author: Mahe, G
Nadalin, EZ
Suyama, R
Romano, JMT
Abstract: The separation of an underdetermined audio mixture can be performed through sparse component analysis (SCA), which, however, relies on the strong hypothesis that the source signals are sparse in some domain. To overcome this difficulty when the original sources are available before the mixing process, informed source separation (ISS) embeds a watermark in the mixture, whose information can help the subsequent separation. Though powerful, this technique is generally specific to a particular mixing setup and may be compromised by an additional bitrate compression stage. Thus, instead of watermarking, we propose a 'doping' method that makes the time-frequency representation of each source sparser while preserving its audio quality. This method is based on an iterative decrease of the distance between the distribution of the signal and a target sparse distribution, under a perceptual constraint. We aim to show that the proposed approach is robust to audio coding and that using the sparsified signals improves source separation compared with using the original sources. In this work, the analysis is restricted to instantaneous mixtures and focuses on voice sources.
Subject: Informed source separation (ISS)
Sparse component analysis (SCA)
Doping watermarking
Country: Switzerland
Publisher: Springer International Publishing AG
Rights: open
Identifier DOI: 10.1186/1687-6180-2014-27
Date Issue: 2014
Appears in Collections: Unicamp - Artigos e Outros Documentos
