Neo4j Research Center

Welcome to Neo4j Research, where we make science into technology.

At Neo4j, we build graph data products that delight our users and strive to always surpass customer expectations. Our products are built on a rich, collaborative history of computer science research and solid engineering.

Neo4j Research is where we explore future possibilities, both to enhance current products and to discover new opportunities. We work with internal technology teams at Neo4j as well as leading research institutions around the world, with the common goal of accelerating graph technology.

Projects

At Neo4j, we perform systems research on all parts of the graph data stack. We are currently working on a diverse set of projects that target temporal graph use cases, leaderless transaction processing methods, and novel query runtimes based on dynamic programming language technology.

Our aim is to understand how to build graph processing systems for modern cloud environments that are more capable than the current state of the art and that depart from the classic (relational) approach.

Current graph database runtimes are built using the same techniques and principles as relational databases, which can inhibit their performance and functionality. The fundamental issue is that graph runtimes must handle a great deal of irregularity, stemming both from schema optionality and, from the machine's point of view, from irregular workloads and topology.

To solve these problems, we are building a next-generation query runtime inspired by dynamic programming language technology. It allows us to optimize schema-less graphs through dynamic code optimization and to scale processing by adopting new compute paradigms such as disaggregated compute or accelerated computing with specialized hardware.
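As a rough illustration of the kind of technique that dynamic language runtimes use, the Python sketch below speculates that a property on schema-less nodes is usually an integer, guards that assumption, and falls back to a generic path when the guard fails. It is not Neo4j's runtime: the names `SpecializedFilter` and `generic_greater_than`, and the rule that non-numeric values never match, are assumptions made purely for this example.

```python
def generic_greater_than(value, threshold):
    """Generic (slow) path: tolerate missing or non-numeric values."""
    is_number = isinstance(value, (int, float)) and not isinstance(value, bool)
    return is_number and value > threshold


class SpecializedFilter:
    """Hypothetical filter `node[key] > threshold` with a speculative fast path."""

    def __init__(self, key, threshold):
        self.key = key
        self.threshold = threshold
        self.fast_hits = 0   # guard held, specialized comparison used
        self.fallbacks = 0   # guard failed, generic path used

    def matches(self, node):
        value = node.get(self.key)
        if type(value) is int:
            # Guard: the speculation "this property is an int" holds.
            self.fast_hits += 1
            return value > self.threshold
        # Guard failed: fall back to the generic path for this node.
        self.fallbacks += 1
        return generic_greater_than(value, self.threshold)


# Schema-less nodes: the "age" property may be absent or have mixed types.
nodes = [{"age": 41}, {"age": 35.5}, {"name": "x"}, {"age": 12}, {"age": "unknown"}]
age_filter = SpecializedFilter("age", 30)
results = [age_filter.matches(n) for n in nodes]
```

A real runtime would generate and recompile machine code rather than branch in an interpreter; the counters here only make the split between the fast path and the fallback visible.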

Modern graph database management systems (DBMSs) allow users to model real-world interactions as a set of nodes and relationships at a scale of billions to trillions. However, existing systems ignore the temporal dimension of data: how a graph evolved over time. Lacking native temporal support, users resort to ad hoc strategies whose performance depends on how much of the graph the workload touches, from local pattern matching to global graph algorithms.

To tackle this problem, we designed Aion, a transactional temporal graph DBMS that generalizes previous approaches for labeled property graphs (LPGs). Aion is built directly atop Neo4j and adopts a hybrid temporal storage approach. For point lookups and small subgraph queries, it uses LineageStore, which indexes graph updates by entity identifier. For queries that require full graph reconstruction at arbitrary time points, it uses TimeStore, which indexes updates by time.
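The hybrid idea can be sketched in a few lines of Python: the same stream of updates is indexed once by entity identifier and once by time. This is a toy under stated assumptions, not Aion's implementation. The class `HybridTemporalStore`, its method names, and the simplification that every update carries an entity's full property map are all hypothetical.

```python
from bisect import bisect_right
from collections import defaultdict


class HybridTemporalStore:
    """Toy model: two indexes over one stream of graph updates."""

    def __init__(self):
        # Lineage-style index: entity id -> [(timestamp, properties)] in time order.
        self.by_entity = defaultdict(list)
        # Time-style index: [(timestamp, entity id, properties)] in time order.
        self.by_time = []

    def apply(self, timestamp, entity_id, properties):
        """Record an update. Assumes timestamps arrive in non-decreasing order."""
        self.by_entity[entity_id].append((timestamp, properties))
        self.by_time.append((timestamp, entity_id, properties))

    def entity_at(self, entity_id, timestamp):
        """Point lookup: latest version of one entity at or before `timestamp`."""
        history = self.by_entity.get(entity_id, [])
        pos = bisect_right([t for t, _ in history], timestamp)
        return history[pos - 1][1] if pos else None

    def snapshot_at(self, timestamp):
        """Full reconstruction: replay all updates up to `timestamp`."""
        end = bisect_right([t for t, _, _ in self.by_time], timestamp)
        state = {}
        for _, entity_id, properties in self.by_time[:end]:
            state[entity_id] = properties  # later updates overwrite earlier ones
        return state


store = HybridTemporalStore()
store.apply(1, "n1", {"name": "Ada"})
store.apply(2, "n2", {"name": "Bob"})
store.apply(3, "n1", {"name": "Ada L."})
```

The point lookup touches only one entity's history, while the snapshot scans the time-ordered log, which is why each index suits a different class of query.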

To enable incremental graph computations for improved latency, Aion introduces a compute-efficient in-memory LPG representation. Our experiments so far show that Aion achieves up to 7x higher throughput than existing non-transactional temporal systems and up to an order of magnitude speedup over Neo4j, with minimal storage overhead.

Transaction protocols have historically been decoupled from the data models they support. Consequently, graph databases are left with one of two suboptimal choices: protocols that are too strict, which preserve correctness but sacrifice performance, or protocols that are too loose, which perform better but can corrupt data in normal operation.

In the long term, we need better options. We are investigating one such approach, called “Conjunction of Majorities”, in which transaction messages carry metadata about their predecessors. Participants compare this metadata against their local state to determine compatibility. For single-shard transactions, if a majority of participants find that a transaction is compatible, it can proceed through a conventional two-phase consensus protocol. For multi-shard transactions, every shard must reach a majority, hence a “conjunction of majorities” in the general case.

We have undertaken a theoretical investigation to establish the limits of the approach with respect to correctness (specifically reciprocal consistency for graphs) and global constraints. We are also building a prototype system to evaluate the performance of the approach in real-world conditions.
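The decision rule described above can be sketched in Python as follows. The compatibility test used here is an assumption made for illustration only: a participant counts as compatible when, for every key the transaction touches, its last applied transaction id equals the predecessor id carried in the message. The function names and data shapes are hypothetical, and the consensus protocol that follows a positive decision is not modelled.

```python
def is_compatible(local_state, predecessors):
    """Assumed rule: local last-applied ids match the claimed predecessors."""
    return all(local_state.get(key) == tx_id for key, tx_id in predecessors.items())


def shard_has_majority(participants, predecessors):
    """True if strictly more than half of the shard's participants are compatible."""
    votes = [is_compatible(state, predecessors) for state in participants]
    return sum(votes) > len(votes) / 2


def can_proceed(participants_by_shard, predecessors_by_shard):
    """Conjunction of majorities: every shard the transaction touches must agree."""
    return bool(predecessors_by_shard) and all(
        shard_has_majority(participants_by_shard[shard], predecessors)
        for shard, predecessors in predecessors_by_shard.items()
    )


# Each participant's local state maps a key to the last transaction applied to it.
participants_by_shard = {
    "a": [{"x": "t1"}, {"x": "t1"}, {"x": "t0"}],  # 2 of 3 agree on x -> t1
    "b": [{"y": "t7"}, {"y": "t6"}, {"y": "t6"}],  # only 1 of 3 has seen t7
}

single_shard = can_proceed(participants_by_shard, {"a": {"x": "t1"}})
multi_shard = can_proceed(participants_by_shard, {"a": {"x": "t1"}, "b": {"y": "t7"}})
```

In this example the single-shard transaction may proceed because shard `a` has a majority, while the multi-shard transaction may not, because shard `b` does not.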

Publications

Neo4j has a strong publication history, and often collaborates with universities and other industrial researchers.

2024

Seraph: Continuous Queries on Property Graph Streams

Stefan Plantikow and Hannes Voigt

Aion: Efficient Temporal Graph Data Management

2023

Analysis of an epoch commit protocol for distributed processing systems

2022

A Performance Study of Epoch-based Commit Protocols in Distributed OLTP Databases

Pick & Mix Isolation Levels: Mixed Serialization Graph Testing

2021

A GraphBLAS implementation in pure Java.

PG-Keys: Keys for Property Graphs

The Future is Big Graphs! A Community View on Graph Processing Systems

2020

Modeling the Gradual Degradation of Eventually-Consistent Distributed Graph Databases

The Future is Big Graphs! A Community View on Graph Processing Systems

2019

Big Graph Processing Systems

Efficient Query Processing for Dynamically Changing Datasets

Schema Validation and Evolution for Graph Databases

Period Index: A Learned 2D Hash Index for Range and Duration Queries

Understanding Trolls with Efficient Analytics of Large Graphs in Neo4j

Graph Query Languages

Updating graph databases with Cypher

Approximate querying for the property graph language Cypher

2018

Cypher: An Evolving Query Language for Property Graphs

Declarative and distributed graph analytics with GRADOOP

openCypher: New Directions in Property Graph Querying

2017

ACTiCLOUD: Enabling the Next Generation of Cloud Applications

2016

Investigations on path indexing for graph databases

2012

A programmatic introduction to Neo4j

Funding

M.Sc. Dissertations

Ph.D. scholarships

Post-Doctoral funding

Collaborations

Newcastle University (UK)

LIRIS (France)

ACTiCLOUD

LDBC

Contact Us