TY - JOUR
T1 - Bottlenecks in advancing and applying multiomic data integration-common data resources as rate-limiting drivers-the high-impact use case of atherosclerotic cardiovascular disease
AU - Bezzina Wettinger, Stephanie
AU - Karaduzovic-Hadziabdic, Kanita
AU - Attard, Ritienne
AU - Farrugia, Rosienne
AU - Wolford, Brooke N
AU - Chierici, Marco
AU - Jurman, Giuseppe
AU - Alexiou, Panagiotis
AU - Peñalvo, José L
AU - Costa, Rafael S
AU - Basílio, José
AU - Sabovčik, František
AU - Vitorino, Rui
AU - Schmid, Johannes A
AU - Shigdel, Rajesh
AU - Vilne, Baiba
AU - Hatzigeorgiou, Artemis G
AU - Sopic, Miron
AU - Devaux, Yvan
AU - Magni, Paolo
AU - Tellez-Plaza, Maria
AU - Kreil, David P
AU - Gruca, Aleksandra
N1 - Funding:
This article is based upon work from COST Action AtheroNET,
CA21153, supported by COST (European Cooperation in Science
and Technology). S.B.W. and R.F. were supported by HORIZON-
EIC-2022-Pathfinderchallenges-01-03 TargetMI (ID:101114924)
and HORIZON-WIDERA-2022 BioGeMT (ID:101086768). R.A.
was supported by HORIZON-EIC-2022-Pathfinderchallenges-
01-03 TargetMI (ID:101114924). B.N.W. has received funding
from the European Union’s Horizon Europe Research and
Innovation Programme under the Marie Skłodowska-Curie
grant agreement No. 101110878. M.Ch. and G.J. were par-
tially supported by HORIZON-MSCA-2021-SE-01-01-MSCA Staff
Exchanges 2021 CardioSCOPE 101086397. P.A.’s contribution was
supported by HORIZON-WIDERA-2022 BioGeMT (ID: 101086768).
J.B. was supported by FCT PhD Scholarship 2020.09166.BD, European Union’s Horizon 2020 Research and Innovation
Programme under grant agreement No. 951970 (OLISSIPO
project), and INESC-ID Plurianual project UIDB/50021/2020 until
31.07.2023. From 01.08.2023 JB is a full-time employee of the
Medical University of Vienna. F.S. has received grant from
Research Foundation Flanders (1S07421N). R.V. was supported
by FCT—Portuguese Foundation for Science and Technology,
under iBiMED (UID 4501- Instituto de Biomedicina - Aveiro)
and the Cardiovascular R&D Center—UnIC (UIDB/00051/2020
and UIDP/00051/2020), CardioNIR project – CARDIOvascu-
lar Near-InfraRed spectroscopy probing – 2021 (PTDC/EMD-
EMD/3822/2021), https://doi.org/10.54499/PTDC/EMD-EMD/3822/
2021. M.S. is funded by the European Union (HORIZON-MSCA-
2021-PF- MAACS 101064175; HORIZON-MSCA-2021-SE-01-01 -
MSCA Staff Exchanges 2021 CardioSCOPE 101086397) and the
Ministry of Science, Technological Development, and Inno-
vation, Republic of Serbia through Grant Agreement with
University of Belgrade-Faculty of Pharmacy No: 451-03-47/2024-
01/200161P.M. was supported in part by European Union
(HORIZON-MSCA-2021-SE-01-01-MSCA Staff Exchanges 2021,
CardioSCOPE 101086397) and Italian Space Agency (N. 2023-7-
HH.0 CUP F13C23000050005 MicroFunExpo). M.T.P. was supported
by the Spanish Funds for Research in Health Sciences, Instituto de
Salud Carlos III, cofounded by European Regional Development
Funds (PI22CIII/00029) and the Spanish Agency for Research
(PID2019-108973RB-C21 and PID2023-147163OB-C22).
© The Author(s) 2025. Published by Oxford University Press.
PY - 2025/10/10
Y1 - 2025/10/10
N2 - Despite striking successes in identifying novel biomarkers for improved patient stratification and predicting disease progression, numerous challenges remain in the effective integration and exploitation of multiomic data in biomedical applications beyond cancer, for which most bioinformatics strategies are developed and validated. That focus on cancer severely limits the effective development and advancement of algorithms in machine learning and artificial intelligence that do not suffer degraded out-of-domain performance. Generalizability and interpretability of models, however, are also required for robust insights that may translate into clinical practice. Work across different independent datasets is critical for establishing models robust towards unwanted variation in assays, protocols, and cohort populations. Disease-specific context like ethnicity, socioeconomic background, sex, lifestyle, disease phase, and tissue type also strongly affect molecular profiles. We here discuss atherosclerotic cardiovascular disease (ASCVD) as a high-impact non-cancer use case for the challenges remaining in the development and application of the latest bioinformatics approaches to multiomics data integration. ASCVD remains the leading cause of death globally. Disease aetiology, progression, and therapy outcome depend on a complex interplay of genetic, environmental, and lifestyle factors. Integrating these diverse data types effectively remains a challenge but holds transformative potential for personalized medicine. Discovery and access to data of sufficient diversity and extent form key bottlenecks. We here compile a first comprehensive overview of key data sets in ASCVD to complement the established cancer-focused resources as a foundation for future effective development and application of state-of-the-art bioinformatics tools for multiomic data integration.
AB - Despite striking successes in identifying novel biomarkers for improved patient stratification and predicting disease progression, numerous challenges remain in the effective integration and exploitation of multiomic data in biomedical applications beyond cancer, for which most bioinformatics strategies are developed and validated. That focus on cancer severely limits the effective development and advancement of algorithms in machine learning and artificial intelligence that do not suffer degraded out-of-domain performance. Generalizability and interpretability of models, however, are also required for robust insights that may translate into clinical practice. Work across different independent datasets is critical for establishing models robust towards unwanted variation in assays, protocols, and cohort populations. Disease-specific context like ethnicity, socioeconomic background, sex, lifestyle, disease phase, and tissue type also strongly affect molecular profiles. We here discuss atherosclerotic cardiovascular disease (ASCVD) as a high-impact non-cancer use case for the challenges remaining in the development and application of the latest bioinformatics approaches to multiomics data integration. ASCVD remains the leading cause of death globally. Disease aetiology, progression, and therapy outcome depend on a complex interplay of genetic, environmental, and lifestyle factors. Integrating these diverse data types effectively remains a challenge but holds transformative potential for personalized medicine. Discovery and access to data of sufficient diversity and extent form key bottlenecks. We here compile a first comprehensive overview of key data sets in ASCVD to complement the established cancer-focused resources as a foundation for future effective development and application of state-of-the-art bioinformatics tools for multiomic data integration.
KW - Humans
KW - Atherosclerosis/genetics
KW - Computational Biology/methods
KW - Cardiovascular Diseases/genetics
KW - Machine Learning
KW - Biomarkers
KW - Artificial Intelligence
UR - https://www.scopus.com/pages/publications/105018397938
UR - https://pubmed.ncbi.nlm.nih.gov/41071609/
U2 - 10.1093/bib/bbaf526
DO - 10.1093/bib/bbaf526
M3 - Review article
C2 - 41071609
SN - 1467-5463
VL - 26
JO - Briefings in Bioinformatics
JF - Briefings in Bioinformatics
IS - 5
M1 - bbaf526
ER -