TY - JOUR
T1 - Analysis
T2 - Flawed Datasets of Monkeypox Skin Images
AU - Vega, Carlos
AU - Schneider, Reinhard
AU - Satagopam, Venkata
N1 - Publisher Copyright:
© 2023, The Author(s).
PY - 2023/3/18
Y1 - 2023/3/18
N2 - The self-proclaimed first publicly available dataset of Monkeypox skin images consists of medically irrelevant images extracted from Google and photography repositories through a process denominated web-scrapping. Yet, this did not stop other researchers from employing it to build Machine Learning (ML) solutions aimed at computer-aided diagnosis of Monkeypox and other viral infections presenting skin lesions. Neither did it stop the reviewers or editors from publishing these subsequent works in peer-reviewed journals. Several of these works claimed extraordinary performance in the classification of Monkeypox, Chickenpox and Measles, employing ML and the aforementioned dataset. In this work, we analyse the initiator work that has catalysed the development of several ML solutions, and whose popularity is continuing to grow. Further, we provide a rebuttal experiment that showcases the risks of such methodologies, proving that the ML solutions do not necessarily obtain their performance from the features relevant to the diseases at issue.
AB - The self-proclaimed first publicly available dataset of Monkeypox skin images consists of medically irrelevant images extracted from Google and photography repositories through a process denominated web-scrapping. Yet, this did not stop other researchers from employing it to build Machine Learning (ML) solutions aimed at computer-aided diagnosis of Monkeypox and other viral infections presenting skin lesions. Neither did it stop the reviewers or editors from publishing these subsequent works in peer-reviewed journals. Several of these works claimed extraordinary performance in the classification of Monkeypox, Chickenpox and Measles, employing ML and the aforementioned dataset. In this work, we analyse the initiator work that has catalysed the development of several ML solutions, and whose popularity is continuing to grow. Further, we provide a rebuttal experiment that showcases the risks of such methodologies, proving that the ML solutions do not necessarily obtain their performance from the features relevant to the diseases at issue.
KW - Machine learning
KW - Monkeypox
KW - Translational medicine
UR - http://www.scopus.com/inward/record.url?scp=85150666705&partnerID=8YFLogxK
UR - https://pubmed.ncbi.nlm.nih.gov/36933065
U2 - 10.1007/s10916-023-01928-1
DO - 10.1007/s10916-023-01928-1
M3 - Article
C2 - 36933065
AN - SCOPUS:85150666705
SN - 0148-5598
VL - 47
SP - 37
JO - Journal of Medical Systems
JF - Journal of Medical Systems
IS - 1
M1 - 37
ER -