AES San Francisco 2010
Poster Session P19

Saturday, November 6, 2:30 pm — 4:00 pm (Room 226)

Poster: P19 - Spatial Sound Processing—1

P19-1 Estimation of the Probability Density Function of the Interaural Level Differences for Binaural Speech Separation—David Ayllon, Roberto Gil-Pita, Manuel Rosa-Zurera, University of Alcalá - Alcalá de Henares (Madrid), Spain
Source separation techniques are applied to audio signals to separate several sources from one mixture. One important challenge in speech processing is noise suppression, for which several methods have been proposed. However, in some applications, such as hearing aids, the goal is not only to remove noise from speech but to amplify the speech and attenuate the noise. A novel method based on estimating the probability density function of the interaural level differences, in conjunction with time-frequency decomposition and binary masking, is applied to speech-noise mixtures in order to obtain both signals separately. Results show that the two signals are clearly separated, and the method has a low computational cost, so it could be implemented in a real-time environment such as a hearing aid.
Convention Paper 8273
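
The general idea named in the abstract above (interaural level differences, time-frequency decomposition, binary masking) can be illustrated with a minimal Python sketch. This is not the authors' algorithm: the function names, the naive STFT, the histogram used as a density estimate, and the fixed 0 dB threshold are all assumptions made for illustration, and the two "sources" are synthetic tones rather than speech and noise.

```python
import numpy as np

def stft(x, frame=256, hop=128):
    """Naive STFT (assumed parameters): Hann-windowed frames, one row per frame."""
    window = np.hanning(frame)
    n_frames = 1 + (len(x) - frame) // hop
    frames = np.stack([x[i * hop:i * hop + frame] * window for i in range(n_frames)])
    return np.fft.rfft(frames, axis=1)

def ild_db(left_spec, right_spec, eps=1e-12):
    """Interaural level difference in dB for every time-frequency bin."""
    return 20.0 * np.log10((np.abs(left_spec) + eps) / (np.abs(right_spec) + eps))

def ild_pdf(ild, bins=50):
    """Histogram-based estimate of the ILD probability density (an assumption)."""
    density, edges = np.histogram(ild.ravel(), bins=bins, density=True)
    return density, edges

def binary_masks(ild, threshold_db=0.0):
    """Assign each bin to one source; the fixed threshold is hypothetical."""
    mask_a = ild > threshold_db   # bins that are louder at the left ear
    return mask_a, ~mask_a        # remaining bins go to the other source

# Synthetic two-channel mixture: source A dominates the left ear,
# source B dominates the right ear.
fs = 8000
t = np.arange(fs) / fs
source_a = np.sin(2 * np.pi * 500 * t)    # falls in STFT bin 16
source_b = np.sin(2 * np.pi * 2000 * t)   # falls in STFT bin 64
left = 1.0 * source_a + 0.2 * source_b
right = 0.2 * source_a + 1.0 * source_b

left_spec, right_spec = stft(left), stft(right)
ild = ild_db(left_spec, right_spec)
density, edges = ild_pdf(ild)
mask_a, mask_b = binary_masks(ild)

# Masked spectra; an inverse STFT (omitted here) would give time signals.
est_a_spec = left_spec * mask_a
est_b_spec = right_spec * mask_b
```

In a more realistic setting the threshold would presumably be derived from the estimated density rather than fixed in advance; the abstract does not say how, so that step is left out here.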

P19-2 The Learning Effect of HRTF-Based 3-D Sound Perception with a Horizontally Arranged 8-Loudspeaker System—Akira Saji, Keita Tanno, Li Huakang, Tetsuya Watanabe, Jie Huang, The University of Aizu - Aizuwakamatsu City, Fukushima, Japan
This paper discusses learning effects in the localization of HRTF-based 3-D sound using an 8-channel loudspeaker system that creates virtual sound images. The system reproduces elevated sound images by convolving HRTFs and playing the result through eight loudspeakers arranged on the horizontal plane, without loudspeakers mounted above or below that plane. The positions of the sound images the system creates are difficult to perceive because such HRTF-based sounds are unfamiliar. However, after repeating the learning process, almost all listeners perceive the positions of the sound images more accurately. This paper demonstrates this learning effect for an HRTF-based 3-D sound system.
Convention Paper 8274