Research Programs

OneVoice - Speech Enhancement for Dictation Systems

Speech recording is common practice in the daily work of professionals such as lawyers, physicians, journalists and architects. Combining dictation systems with automatic speech recognition (ASR) is increasingly in demand as the natural way to take over their routine transcription work. However, in their working environments (e.g. hospitals, courts of law, the street), it is not always possible to record in quiet, noise-free conditions, which makes ASR unreliable. The researchers in oneVoice have developed several novel signal processing-based techniques for analyzing speech with natural intonation. These methods form the scientific basis of the project's outcome: a new single-channel speech enhancement/coding system that removes background interference from the recording.