AES E-Library

Separating the individual sources in a single channel is particularly difficult because of overlapping harmonics. Western music is often arranged so that sounds are not only occurring simultaneously, but are also harmonically related. While traditional approaches use either spectral or temporal models, the proposed model exploits the combination of the spectral and temporal correlations of harmonic magnitudes to estimate the regions of overlap. A diverse selection of harmonic musical instruments was analyzed, and a generalized-instrument magnitude track prediction model was derived to generate track estimates. This approach, which exploits dependencies among tracks, was shown to be consistently more accurate than existing methods.