TY - GEN
T1 - Automatic Speech Recognition systems errors for accident-prone sleepiness detection through voice
AU - Martin, Vincent P.
AU - Rouas, Jean Luc
AU - Boyer, Florian
AU - Philip, Pierre
N1 - Publisher Copyright:
© 2021 European Signal Processing Conference. All rights reserved.
PY - 2021
Y1 - 2021
N2 - Excessive Daytime Sleepiness (EDS), a symptom linked to chronic sleepiness, impacts everyday life and increases risks of work or road accidents of subjects affected by it. The detection of accident-prone EDS through voice benefits from its ease to be implemented in ecological conditions and to be sober in terms of data processing and costs. Contrary to previous works, this study focuses on long-term sleepiness detection through voice. Using the Multiple Sleep Latency Test corpus, we propose a feature selection pipeline inspired by clinical validation practices to classify accident-prone EDS - as measured by a threshold of 15 on the Epworth Sleepiness Scale - based on vocal clues. We propose three different approaches based on the acoustic quality of voice, reading mistakes, and a whole new approach, relying on Automatic Speech Recognition systems errors. The classification system achieves performances on the same scale as the state-of-the-art systems on short-term sleepiness detection through voice (74.2% of Unweighted Average Recall). Moreover, we give insights into the decision process implied during classification and the system's specificity regarding the threshold delimiting the two classes Higher-risk driver and Lower-risk driver.
AB - Excessive Daytime Sleepiness (EDS), a symptom linked to chronic sleepiness, impacts everyday life and increases risks of work or road accidents of subjects affected by it. The detection of accident-prone EDS through voice benefits from its ease to be implemented in ecological conditions and to be sober in terms of data processing and costs. Contrary to previous works, this study focuses on long-term sleepiness detection through voice. Using the Multiple Sleep Latency Test corpus, we propose a feature selection pipeline inspired by clinical validation practices to classify accident-prone EDS - as measured by a threshold of 15 on the Epworth Sleepiness Scale - based on vocal clues. We propose three different approaches based on the acoustic quality of voice, reading mistakes, and a whole new approach, relying on Automatic Speech Recognition systems errors. The classification system achieves performances on the same scale as the state-of-the-art systems on short-term sleepiness detection through voice (74.2% of Unweighted Average Recall). Moreover, we give insights into the decision process implied during classification and the system's specificity regarding the threshold delimiting the two classes Higher-risk driver and Lower-risk driver.
KW - Accidental risk
KW - Automatic speech recognition
KW - Excessive daytime sleepiness
KW - Sleepiness
KW - Voice
UR - http://www.scopus.com/inward/record.url?scp=85123223579&partnerID=8YFLogxK
U2 - 10.23919/EUSIPCO54536.2021.9616299
DO - 10.23919/EUSIPCO54536.2021.9616299
M3 - Conference contribution
AN - SCOPUS:85123223579
T3 - European Signal Processing Conference
SP - 541
EP - 545
BT - 29th European Signal Processing Conference, EUSIPCO 2021 - Proceedings
PB - European Signal Processing Conference, EUSIPCO
T2 - 29th European Signal Processing Conference, EUSIPCO 2021
Y2 - 23 August 2021 through 27 August 2021
ER -