Next: Introduction
SOUND SOURCE SEPARATION OF N SOURCES FROM STEREO SIGNALS VIA FITTING TO N MODELS EACH LACKING ONE SOURCE
Stanford University,
Center for Computer Research in Music and Acoustics,
Stanford,
CA 94305-8180 USA
Abstract:
We present a system to perform sound source separation of an
arbitrary number of speech or music sources from a stereo signal.
We build on the work of other authors' DUET system, which uses a
histogram technique to estimate the mixing parameters of the
time-frequency sparse sources, before using a nearest-neighbor
approach to demix the sources. Herein, we describe a new demixing
method called Delay and Scale Subtraction Scoring (DASSS) that is
less erratic than the nearest-neighbor method, and highlights when
the sparsity assumptions of the DUET system are not valid. We
also utilize a demixing technique to be used in cases where
multiple sources are present, and propose an additional
source-aware demixing technique for such cases. We demonstrate
psychoacoustically convincing results on an example signal.
Aaron S. Master
2003-03-27