TY - JOUR
T1 - How to Design a Relevant Corpus for Sleepiness Detection Through Voice?
AU - Martin, Vincent P.
AU - Rouas, Jean Luc
AU - Micoulaud-Franchi, Jean Arthur
AU - Philip, Pierre
AU - Krajewski, Jarek
N1 - Publisher Copyright:
© Copyright © 2021 Martin, Rouas, Micoulaud-Franchi, Philip and Krajewski.
PY - 2021/9/22
Y1 - 2021/9/22
N2 - This article presents research on the detection of pathologies affecting speech through automatic analysis. Voice processing has indeed been used for evaluating several diseases such as Parkinson, Alzheimer, or depression. If some studies present results that seem sufficient for clinical applications, this is not the case for the detection of sleepiness. Even two international challenges and the recent advent of deep learning techniques have still not managed to change this situation. This article explores the hypothesis that the observed average performances of automatic processing find their cause in the design of the corpora. To this aim, we first discuss and refine the concept of sleepiness related to the ground-truth labels. Second, we present an in-depth study of four corpora, bringing to light the methodological choices that have been made and the underlying biases they may have induced. Finally, in light of this information, we propose guidelines for the design of new corpora.
AB - This article presents research on the detection of pathologies affecting speech through automatic analysis. Voice processing has indeed been used for evaluating several diseases such as Parkinson, Alzheimer, or depression. If some studies present results that seem sufficient for clinical applications, this is not the case for the detection of sleepiness. Even two international challenges and the recent advent of deep learning techniques have still not managed to change this situation. This article explores the hypothesis that the observed average performances of automatic processing find their cause in the design of the corpora. To this aim, we first discuss and refine the concept of sleepiness related to the ground-truth labels. Second, we present an in-depth study of four corpora, bringing to light the methodological choices that have been made and the underlying biases they may have induced. Finally, in light of this information, we propose guidelines for the design of new corpora.
KW - corpus design
KW - guidelines
KW - methodological issue
KW - sleepiness
KW - speech processing
UR - http://www.scopus.com/inward/record.url?scp=85131267022&partnerID=8YFLogxK
U2 - 10.3389/fdgth.2021.686068
DO - 10.3389/fdgth.2021.686068
M3 - Article
AN - SCOPUS:85131267022
SN - 2673-253X
VL - 3
JO - Frontiers in Digital Health
JF - Frontiers in Digital Health
M1 - 686068
ER -