Representing and querying disease networks using graph databases

Artem Lysenko, Irina A. RoznovǍţ*, Mansoor Saqi, Alexander Mazein, Christopher J. Rawlings, Charles Auffray

*Corresponding author for this work

Research output: Contribution to journalReview articlepeer-review

84 Citations (Scopus)

Abstract

Background: Systems biology experiments generate large volumes of data of multiple modalities and this information presents a challenge for integration due to a mix of complexity together with rich semantics. Here, we describe how graph databases provide a powerful framework for storage, querying and envisioning of biological data. Results: We show how graph databases are well suited for the representation of biological information, which is typically highly connected, semi-structured and unpredictable. We outline an application case that uses the Neo4j graph database for building and querying a prototype network to provide biological context to asthma related genes. Conclusions: Our study suggests that graph databases provide a flexible solution for the integration of multiple types of biological data and facilitate exploratory data mining to support hypothesis generation.

Original languageEnglish
Article number23
JournalBioData Mining
Volume9
Issue number1
DOIs
Publication statusPublished - 25 Jul 2016
Externally publishedYes

Keywords

  • Computational approach
  • Disease management platform
  • Graph database
  • Neo4j graph
  • Protein-centric framework
  • Systems medicine

Fingerprint

Dive into the research topics of 'Representing and querying disease networks using graph databases'. Together they form a unique fingerprint.

Cite this