TY - JOUR
T1 - CardioPHON
T2 - Quality assessment and self-supervised pretraining for screening of cardiac function based on phonocardiogram recordings
AU - Despotovic, Vladimir
AU - Pocta, Peter
AU - Zgank, Andrej
N1 - Funding:
This publication is based upon work from COST Action CA19121
– GoodBrother, Network on Privacy-Aware Audio- and Video-Based
Applications for Active and Assisted Living (https://goodbrother.eu/),
supported by COST (European Cooperation in Science and Technology)
(https://www.cost.eu/). Andrej Zgank’s research work was partially
supported by the Slovenian Research and Innovation Agency (research
core funding No. P2-0069 Advanced Methods of Interaction in Telecom-
munications).
Publisher Copyright:
© 2025
PY - 2025/11/4
Y1 - 2025/11/4
N2 - Remote monitoring of cardiovascular diseases plays an essential role in early detection of abnormal cardiac function, enabling timely intervention, improved preventive care, and personalized patient treatment. Abnormalities in the heart sounds can be detected automatically via computer-assisted decision support systems, and used as the first-line screening tool for detection of cardiovascular problems, or for monitoring the effects of treatments and interventions. We propose in this paper CardioPHON, an integrated heart sound quality assessment and classification tool that can be used for screening of abnormal cardiac function from phonocardiogram recordings. The model is pretrained in a self-supervised fashion on a collection of six small- and mid-sized heart sound datasets, enables automatic removal of low quality recordings to ensure that subtle sounds of heart abnormalities are not misdiagnosed, and provides a state-of-the-art performance for the heart sound classification task. The multimodal model that combines audio and socio-demographic features demonstrated superior performance, achieving the best ranking on the official leaderboard of the 2022 George B. Moody PhysioNet heart sound challenge, whereas the unimodal model, that is based only on phonocardiogram recordings, holds the first position among the unimodal approaches (a total rank 4), surpassing the models utilizing multiple modalities. CardioPHON is the first publicly released pretrained model in the domain of heart sound recordings, facilitating the development of data-efficient artificial intelligence models that can generalize to various downstream tasks in cardiovascular diagnostics.
AB - Remote monitoring of cardiovascular diseases plays an essential role in early detection of abnormal cardiac function, enabling timely intervention, improved preventive care, and personalized patient treatment. Abnormalities in the heart sounds can be detected automatically via computer-assisted decision support systems, and used as the first-line screening tool for detection of cardiovascular problems, or for monitoring the effects of treatments and interventions. We propose in this paper CardioPHON, an integrated heart sound quality assessment and classification tool that can be used for screening of abnormal cardiac function from phonocardiogram recordings. The model is pretrained in a self-supervised fashion on a collection of six small- and mid-sized heart sound datasets, enables automatic removal of low quality recordings to ensure that subtle sounds of heart abnormalities are not misdiagnosed, and provides a state-of-the-art performance for the heart sound classification task. The multimodal model that combines audio and socio-demographic features demonstrated superior performance, achieving the best ranking on the official leaderboard of the 2022 George B. Moody PhysioNet heart sound challenge, whereas the unimodal model, that is based only on phonocardiogram recordings, holds the first position among the unimodal approaches (a total rank 4), surpassing the models utilizing multiple modalities. CardioPHON is the first publicly released pretrained model in the domain of heart sound recordings, facilitating the development of data-efficient artificial intelligence models that can generalize to various downstream tasks in cardiovascular diagnostics.
KW - Cardiac function
KW - Heart sound
KW - Phonocardiogram
KW - Quality assessment
KW - Self-supervised learning
UR - https://www.scopus.com/pages/publications/105020827279
U2 - 10.1016/j.bspc.2025.109047
DO - 10.1016/j.bspc.2025.109047
M3 - Article
AN - SCOPUS:105020827279
SN - 1746-8094
VL - 113
JO - Biomedical Signal Processing and Control
JF - Biomedical Signal Processing and Control
M1 - 109047
ER -