Towards degradation decomposition for voice communication system assessment
Köster, Friedemann; Schiffner, Falk; Möller, Sebastian; Malfait, Ludovic
2017-03-30 00:00:00
This article presents the current development of degradation decomposition tools for the assessment of voice communications. Overall quality scores, represented as Mean Opinion Scores (MOS) produced by subjective test methodologies such as ITU-T P.800 Absolute Category Rating (ACR), remains the most popular quality metric in the industry. While MOS is a great indicator to evaluate quality issues, it does not provide information on the cause of issues. To address this gap, work items are currently active within ITU-T to provide the industry with means to understand the cause of lower scores by perceptual or technical degradation decompositions. The goal is to produce objective models that enable automated degradation decomposition. The first step in such a development is the construction of databases for model training and validation. For this, in sum four experiments using a potential diagnostic test method discussed within ITU-T are conducted. In addition, two optional improvements for the test method are presented and discussed. The results of the experiments show that for standardization the analyzed test method still leaves room for validation and further improvements.
http://www.deepdyve.com/assets/images/DeepDyve-Logo-lg.pngQuality and User ExperienceSpringer Journalshttp://www.deepdyve.com/lp/springer-journals/towards-degradation-decomposition-for-voice-communication-system-bu2nJgMvwZ

Abstract

This article presents the current development of degradation decomposition tools for the assessment of voice communications. Overall quality scores, represented as Mean Opinion Scores (MOS) produced by subjective test methodologies such as ITU-T P.800 Absolute Category Rating (ACR), remains the most popular quality metric in the industry. While MOS is a great indicator to evaluate quality issues, it does not provide information on the cause of issues. To address this gap, work items are currently active within ITU-T to provide the industry with means to understand the cause of lower scores by perceptual or technical degradation decompositions. The goal is to produce objective models that enable automated degradation decomposition. The first step in such a development is the construction of databases for model training and validation. For this, in sum four experiments using a potential diagnostic test method discussed within ITU-T are conducted. In addition, two optional improvements for the test method are presented and discussed. The results of the experiments show that for standardization the analyzed test method still leaves room for validation and further improvements.

Journal

Quality and User Experience
– Springer Journals

Published: Mar 30, 2017

Recommended Articles

Loading...

References

Statistik

Bortz, J

Integral and diagnostic intrusive prediction of speech quality

Côté, N

Methods for assessing the quality of transmitted speech and of speech communication services

Köster, F; Möller, S; Antons, J-N; Arndt, S; Guse, D; Weiss, B

The measurement of meaning

Osgood, C

Evaluation: from precision, recall and F-measure to ROC, informedness, markedness and correlation