next up previous
Next: Introduction

SOUND SOURCE SEPARATION OF N SOURCES FROM STEREO SIGNALS VIA FITTING TO N MODELS EACH LACKING ONE SOURCE


Stanford University,
Center for Computer Research in Music and Acoustics,
Stanford, CA 94305-8180 USA

Abstract:

We present a system to perform sound source separation of an arbitrary number of speech or music sources from a stereo signal. We build on the work of other authors' DUET system, which uses a histogram technique to estimate the mixing parameters of the time-frequency sparse sources, before using a nearest-neighbor approach to demix the sources. Herein, we describe a new demixing method called Delay and Scale Subtraction Scoring (DASSS) that is less erratic than the nearest-neighbor method, and highlights when the sparsity assumptions of the DUET system are not valid. We also utilize a demixing technique to be used in cases where multiple sources are present, and propose an additional source-aware demixing technique for such cases. We demonstrate psychoacoustically convincing results on an example signal.





Aaron S. Master 2003-03-27