Towards speech quality assessment using a crowdsourcing approach: evaluation of standardized methods

Naderi, Babak; Zequeira Jiménez, Rafael; Hirth, Matthias; Möller, Sebastian; Metzger, Florian; Hoßfeld, Tobias

doi:10.1007/s41233-020-00042-1

Artikel / Aufsatz So., 22. Nov.. 2020 CC BY 4.0

Veröffentlicht

Towards speech quality assessment using a crowdsourcing approach : evaluation of standardized methods

Naderi, Babak ; Zequeira Jiménez, Rafael; Hirth, Matthias ; Möller, Sebastian; Metzger, Florian; Hoßfeld, Tobias

Subjective speech quality assessment has traditionally been carried out in laboratory environments under controlled conditions. With the advent of crowdsourcing platforms tasks, which need human intelligence, can be resolved by crowd workers over the Internet. Crowdsourcing also offers a new paradigm for speech quality assessment, promising higher ecological validity of the quality judgments at the expense of potentially lower reliability. This paper compares laboratory-based and crowdsourcing-based speech quality assessments in terms of comparability of results and efficiency. For this purpose, three pairs of listening-only tests have been carried out using three different crowdsourcing platforms and following the ITU-T Recommendation P.808. In each test, listeners judge the overall quality of the speech sample following the Absolute Category Rating procedure. We compare the results of the crowdsourcing approach with the results of standard laboratory tests performed according to the ITU-T Recommendation P.800. Results show that in most cases, both paradigms lead to comparable results. Notable differences are discussed with respect to their sources, and conclusions are drawn that establish practical guidelines for crowdsourcing-based speech quality assessment.

Vorschau

Einordnung

Erschienen in:: Quality and user experience
Bd. 6, H. 1 (22.11.2020)Art.-Nr.:2
Band:: 6
Heft:: 1
Datum der Erstellung:: 20.05.2022
Datum der Veröffentlichung:: 22.11.2020
DOI:: 10.1007/s41233-020-00042-1
PPN:: 1743532474
Sprache:: Englisch
Ressourcentyp:: Text
Umfang:: 21 Seiten
Schlagwörter:: Speech quality assessment; Crowdsourcing; Validity; Reliability; P.808
DDC-Sachgruppe der DNB:: 150 Psychologie
Einrichtung:: Technische Universität Ilmenau, Fakultät für Elektrotechnik und Informationstechnik

auf die Merkliste

Zitieren

Zitierform:

10.1007/s41233-020-00042-1
Zitier-Link kopieren

Rechte

Nutzung und Vervielfältigung:

Export

BibTeX, Endnote, MODS, MARCXML, RIS, ISI, PICA, DC, CSV