Graph databases in systems biology: A systematic review

Ilya Mazein*, Adrien Rougny, Alexander Mazein, Ron Henkel, Lea Gütebier, Lea Michaelis, Marek Ostaszewski, Reinhard Schneider, Venkata Satagopam, Lars Juhl Jensen, Dagmar Waltemath, Judith A.H. Wodke, Irina Balaur*

*Corresponding author for this work

Research output: Contribution to journalReview articlepeer-review

2 Citations (Scopus)

Abstract

Graph databases are becoming increasingly popular across scientific disciplines, being highly suitable for storing and connecting complex heterogeneous data. In systems biology, they are used as a backend solution for biological data repositories, ontologies, networks, pathways, and knowledge graph databases. In this review, we analyse all publications using or mentioning graph databases retrieved from PubMed and PubMed Central full-text search, focusing on the top 16 available graph databases, Publications are categorized according to their domain and application, focusing on pathway and network biology and relevant ontologies and tools. We detail different approaches and highlight the advantages of outstanding resources, such as UniProtKB, Disease Ontology, and Reactome, which provide graph-based solutions. We discuss ongoing efforts of the systems biology community to standardize and harmonize knowledge graph creation and the maintenance of integrated resources. Outlining prospects, including the use of graph databases as a way of communication between biological data repositories, we conclude that efficient design, querying, and maintenance of graph databases will be key for knowledge generation in systems biology and other research fields with heterogeneous data.

Original languageEnglish
Article numberbbae561
JournalBriefings in Bioinformatics
Volume25
Issue number6
DOIs
Publication statusPublished - 1 Nov 2024
Externally publishedYes

Keywords

  • graph databases
  • network biology
  • NoSQL databases
  • ontology
  • RDF
  • systems biology

Fingerprint

Dive into the research topics of 'Graph databases in systems biology: A systematic review'. Together they form a unique fingerprint.

Cite this