229+ Distributed Databases Ranked & Compared

Name: 229+ Distributed Databases Ranked & Compared
Creator: 1bench
License: https://creativecommons.org/licenses/by/4.0/

Compare distributed databases ranked by GitHub stars, fault tolerance, and global scalability.

Last updated: July 11, 2026

207 databases

All Relational Document Key-Value Vector Graph Time-Series Wide-Column Search Engine Analytics Distributed NoSQL

ElasticsearchSA

Distributed search and analytics engine built on Apache Lucene for full-text search, observability, and security

Search·Elastic-2.0·2010·Java

77.6k

+197

+624

today

Elasticsearch

77.6k+624 30d

Distributed search and analytics engine built on Apache Lucene for full-text search, observability, and security

Search·2010·Elastic-2.0·Java

etcd

Distributed reliable key-value store for the most critical data of a distributed system

Key-Value·Apache-2.0·2013·Go

52.0k

+29

+165

etcd

52.0k+165 30d

Distributed reliable key-value store for the most critical data of a distributed system

Key-Value·2013·Apache-2.0·Go

ClickHouse

Blazing-fast open-source column-oriented database for real-time analytics and OLAP

Analytics·Apache-2.0·2016·C++

48.6k

+131

+641

today

ClickHouse

48.6k+641 30d

Blazing-fast open-source column-oriented database for real-time analytics and OLAP

Analytics·2016·Apache-2.0·C++

Milvus

High-performance cloud-native vector database built for scalable similarity search and AI applications

Vector·Apache-2.0·2019·Go, C++

45.2k

+115

+454

today

Milvus

45.2k+454 30d

High-performance cloud-native vector database built for scalable similarity search and AI applications

Vector·2019·Apache-2.0·Go, C++

Apache Spark SQL

Distributed SQL query engine within Apache Spark for structured data processing at scale

Analytics·Apache-2.0·2014·Scala, Java, Python, R

43.6k

+52

+164

today

Apache Spark SQL

43.6k+164 30d

Distributed SQL query engine within Apache Spark for structured data processing at scale

Analytics·2014·Apache-2.0·Scala, Java, Python, R

TiDB

MySQL-compatible distributed SQL database for hybrid transactional and analytical workloads

Relational·Apache-2.0·2015·Go

40.3k

+23

+125

TiDB

40.3k+125 30d

MySQL-compatible distributed SQL database for hybrid transactional and analytical workloads

Relational·2015·Apache-2.0·Go

Qdrant

High-performance open-source vector database for next-generation AI applications

Vector·Apache-2.0·2021·Rust

33.1k

+221

+1.1k

Qdrant

33.1k+1.1k 30d

High-performance open-source vector database for next-generation AI applications

Vector·2021·Apache-2.0·Rust

SurrealDBSA

Multi-model database combining documents, graphs, vectors, and time-series with built-in API layer and real-time capabilities

Multi-Model·BSL-1.1·2022·Rust

32.7k

+50

+301

SurrealDB

32.7k+301 30d

Multi-model database combining documents, graphs, vectors, and time-series with built-in API layer and real-time capabilities

Multi-Model·2022·BSL-1.1·Rust

CockroachDBSA

Distributed SQL database built for cloud-native global applications with serializable isolation

Relational·BSL-1.1·2015·Go, C++

32.3k

+19

+65

CockroachDB

32.3k+65 30d

Distributed SQL database built for cloud-native global applications with serializable isolation

Relational·2015·BSL-1.1·Go, C++

InfluxDB

Scalable time-series database built in Rust for metrics, events, and real-time analytics

Time-Series·Apache-2.0·2013·Rust

31.6k

+12

+75

InfluxDB

31.6k+75 30d

Scalable time-series database built in Rust for metrics, events, and real-time analytics

Time-Series·2013·Apache-2.0·Rust

MongoDBSA

The most popular document database for modern applications

Document·SSPL·2009·C++, JavaScript, Python

28.4k

+15

+90

today

MongoDB

28.4k+90 30d

The most popular document database for modern applications

Document·2009·SSPL·C++, JavaScript, Python

RethinkDB

Open-source document database designed for real-time push updates to applications

Document·Apache-2.0·2012·C++

27.0k

−7

−2

3mo

RethinkDB

27.0k−2 30d

Open-source document database designed for real-time push updates to applications

Document·2012·Apache-2.0·C++

Valkey

Open-source high-performance key-value database forked from Redis, backed by the Linux Foundation

Key-Value·BSD-3-Clause·2024·C

26.5k

+62

+408

today

Valkey

26.5k+408 30d

Open-source high-performance key-value database forked from Redis, backed by the Linux Foundation

Key-Value·2024·BSD-3-Clause·C

Apache Flink

Stateful stream processing framework for real-time and batch data at any scale

Streaming·Apache-2.0·2011·Java, Scala

26.2k

+19

+98

Apache Flink

26.2k+98 30d

Stateful stream processing framework for real-time and batch data at any scale

Streaming·2011·Apache-2.0·Java, Scala

TDengine

High-performance open-source time-series database designed for Industrial IoT and real-time analytics

Time-Series·AGPL-3.0·2019·C

25.0k

+56

TDengine

25.0k+56 30d

High-performance open-source time-series database designed for Industrial IoT and real-time analytics

Time-Series·2019·AGPL-3.0·C

Neon

Serverless PostgreSQL with separated storage and compute, branching, and scale-to-zero

Relational·Apache-2.0·2022·Rust, C

22.5k

+75

+326

1mo

Neon

22.5k+326 30d

Serverless PostgreSQL with separated storage and compute, branching, and scale-to-zero

Relational·2022·Apache-2.0·Rust, C

Dgraph

Distributed graph database with native GraphQL support built for horizontal scale

Graph·Apache-2.0·2017·Go

21.7k

+11

+45

Dgraph

21.7k+45 30d

Distributed graph database with native GraphQL support built for horizontal scale

Graph·2017·Apache-2.0·Go

Vitess

Cloud-native database clustering system for horizontal scaling of MySQL through transparent sharding

Relational·Apache-2.0·2012·Go

21.1k

+32

+109

Vitess

21.1k+109 30d

Cloud-native database clustering system for horizontal scaling of MySQL through transparent sharding

Relational·2012·Apache-2.0·Go

Apache ShardingSphere

Distributed SQL middleware providing sharding, encryption, and read-write splitting across any database

Relational·Apache-2.0·2016·Java

20.8k

+16

Apache ShardingSphere

20.8k+16 30d

Distributed SQL middleware providing sharding, encryption, and read-write splitting across any database

Relational·2016·Apache-2.0·Java

rqlite

Lightweight, fault-tolerant, distributed relational database built on SQLite and Raft consensus

Relational·MIT·2014·Go

17.6k

+38

rqlite

17.6k+38 30d

Lightweight, fault-tolerant, distributed relational database built on SQLite and Raft consensus

Relational·2014·MIT·Go

VictoriaMetrics

Fast, cost-effective time-series database and monitoring solution compatible with Prometheus

Time-Series·Apache-2.0·2018·Go

17.3k

+31

+170

VictoriaMetrics

17.3k+170 30d

Fast, cost-effective time-series database and monitoring solution compatible with Prometheus

Time-Series·2018·Apache-2.0·Go

Neo4j

Native graph database with Cypher query language for connected data at scale

Graph·GPL-3.0·2007·Java, Scala

16.9k

+37

+168

Neo4j

16.9k+168 30d

Native graph database with Cypher query language for connected data at scale

Graph·2007·GPL-3.0·Java, Scala

TiKV

Distributed transactional key-value database providing ACID guarantees at scale

Key-Value·Apache-2.0·2016·Rust

16.8k

+46

TiKV

16.8k+46 30d

Distributed transactional key-value database providing ACID guarantees at scale

Key-Value·2016·Apache-2.0·Rust

Presto

Distributed SQL query engine for running interactive analytic queries against data sources of all sizes

Analytics·Apache-2.0·2012·Java, C++

16.7k

−1

today

Presto

16.7k+3 30d

Distributed SQL query engine for running interactive analytic queries against data sources of all sizes

Analytics·2012·Apache-2.0·Java, C++

Weaviate

AI-native vector database with hybrid search and built-in model integration

Vector·BSD-3-Clause·2019·Go

16.6k

+70

+252

Weaviate

16.6k+252 30d

AI-native vector database with hybrid search and built-in model integration

Vector·2019·BSD-3-Clause·Go

FoundationDB

Distributed, transactional key-value store with multi-model layers and strict serializability

Key-Value·Apache-2.0·2013·C++, Flow

16.5k

+24

+97

today

FoundationDB

16.5k+97 30d

Distributed, transactional key-value store with multi-model layers and strict serializability

Key-Value·2013·Apache-2.0·C++, Flow

ScyllaDBSA

High-performance NoSQL wide-column database compatible with Apache Cassandra and Amazon DynamoDB

Wide-Column·ScyllaDB Source Available License·2015·C++

15.6k

+47

ScyllaDB

15.6k+47 30d

High-performance NoSQL wide-column database compatible with Apache Cassandra and Amazon DynamoDB

Wide-Column·2015·ScyllaDB Source Available License·C++

Apache Doris

High-performance real-time analytical database for sub-second queries on large-scale data

Analytics·Apache-2.0·2017·Java, C++

15.6k

+19

+133

Apache Doris

15.6k+133 30d

High-performance real-time analytical database for sub-second queries on large-scale data

Analytics·2017·Apache-2.0·Java, C++

ArangoDBSA

Multi-model database unifying document, graph, and key-value in a single engine with AQL

Multi-Model·BUSL-1.1·2012·C++, JavaScript

14.2k

+11

+33

ArangoDB

14.2k+33 30d

Multi-model database unifying document, graph, and key-value in a single engine with AQL

Multi-Model·2012·BUSL-1.1·C++, JavaScript

Memcached

High-performance distributed memory caching system for speeding up dynamic web applications

Key-Value·BSD-3-Clause·2003·C

14.2k

+34

Memcached

14.2k+34 30d

High-performance distributed memory caching system for speeding up dynamic web applications

Key-Value·2003·BSD-3-Clause·C

Thanos

Highly available Prometheus setup with unlimited long-term storage on object storage

Time-Series·Apache-2.0·2017·Go

14.1k

+47

Thanos

14.1k+47 30d

Highly available Prometheus setup with unlimited long-term storage on object storage

Time-Series·2017·Apache-2.0·Go

Apache Druid

High-performance real-time analytics database for sub-second OLAP queries at scale

Analytics·Apache-2.0·2012·Java

14.0k

+12

today

Apache Druid

14.0k+12 30d

High-performance real-time analytics database for sub-second OLAP queries at scale

Analytics·2012·Apache-2.0·Java

OpenSearch

Community-driven open-source search and analytics engine forked from Elasticsearch

Search·Apache-2.0·2021·Java

13.5k

+155

+312

OpenSearch

13.5k+312 30d

Community-driven open-source search and analytics engine forked from Elasticsearch

Search·2021·Apache-2.0·Java

Trino

Fast distributed SQL query engine for big data analytics across heterogeneous data sources

Analytics·Apache-2.0·2019·Java

13.0k

+15

+92

Trino

13.0k+92 30d

Fast distributed SQL query engine for big data analytics across heterogeneous data sources

Analytics·2019·Apache-2.0·Java

Citus

Distributed PostgreSQL as an extension for multi-tenant SaaS and real-time analytics at scale

Relational·AGPL-3.0·2016·C

12.6k

+16

+50

Citus

12.6k+50 30d

Distributed PostgreSQL as an extension for multi-tenant SaaS and real-time analytics at scale

Relational·2016·AGPL-3.0·C

KeyDB

Multithreaded Redis fork with higher throughput and active replication

Key-Value·BSD-3-Clause·2019·C++, C

12.5k

KeyDB

12.5k+4 30d

Multithreaded Redis fork with higher throughput and active replication

Key-Value·2019·BSD-3-Clause·C++, C

Mnesia

Distributed real-time database management system built into Erlang/OTP for telecom-grade fault tolerance

Key-Value·Apache-2.0·1999·Erlang

12.4k

+107

+146

Mnesia

12.4k+146 30d

Distributed real-time database management system built into Erlang/OTP for telecom-grade fault tolerance

Key-Value·1999·Apache-2.0·Erlang

NebulaGraph

Distributed graph database built for billion-scale graphs with millisecond latency

Graph·Apache-2.0·2019·C++

12.3k

+15

+61

1mo

NebulaGraph

12.3k+61 30d

Distributed graph database built for billion-scale graphs with millisecond latency

Graph·2019·Apache-2.0·C++

ConvexSA

Reactive backend database with real-time sync, TypeScript-native queries, and built-in serverless functions

Document·BUSL-1.1·2022·Rust, TypeScript

12.1k

+55

+223

Convex

12.1k+223 30d

Reactive backend database with real-time sync, TypeScript-native queries, and built-in serverless functions

Document·2022·BUSL-1.1·Rust, TypeScript

Manticore Search

Fast open-source search database with SQL and JSON interfaces

Search·GPL-3.0·2017·C++

11.9k

+12

+71

today

Manticore Search

11.9k+71 30d

Fast open-source search database with SQL and JSON interfaces

Search·2017·GPL-3.0·C++

StarRocks

High-performance MPP analytics engine for real-time and batch data warehousing

Analytics·Apache-2.0·2021·Java, C++

11.9k

+25

+99

StarRocks

11.9k+99 30d

High-performance MPP analytics engine for real-time and batch data warehousing

Analytics·2021·Apache-2.0·Java, C++

Quickwit

Cloud-native search engine for observability, built on object storage with sub-second latency

Search·Apache-2.0·2021·Rust

11.4k

+16

+89

Quickwit

11.4k+89 30d

Cloud-native search engine for observability, built on object storage with sub-second latency

Search·2021·Apache-2.0·Rust

YugabyteDB

PostgreSQL-compatible distributed SQL database with high resilience and geo-distribution

Relational·Apache-2.0·2017·C++, Java

10.4k

+65

today

YugabyteDB

10.4k+65 30d

PostgreSQL-compatible distributed SQL database with high resilience and geo-distribution

Relational·2017·Apache-2.0·C++, Java

OceanBase

Distributed relational database for high-performance transactional, analytical, and AI workloads at scale

Relational·Mulan PubL v2·2010·C++

10.2k

+13

+36

OceanBase

10.2k+36 30d

Distributed relational database for high-performance transactional, analytical, and AI workloads at scale

Relational·2010·Mulan PubL v2·C++

Cassandra

Distributed wide-column database designed for high availability and linear scalability across data centers

Wide-Column·Apache-2.0·2008·Java

9.9k

+105

+140

Cassandra

9.9k+140 30d

Distributed wide-column database designed for high availability and linear scalability across data centers

Wide-Column·2008·Apache-2.0·Java

Databend

Cloud-native data warehouse built in Rust for analytics, search, and AI on object storage

Analytics·Apache-2.0·2021·Rust

9.4k

+55

Databend

9.4k+55 30d

Cloud-native data warehouse built in Rust for analytics, search, and AI on object storage

Analytics·2021·Apache-2.0·Rust

Deep Lake

GPU-native vector and multimodal data lake for AI agents with deep learning integrations

Vector·Apache-2.0·2020·Python, C++

9.2k

+36

1mo

Deep Lake

9.2k+36 30d

GPU-native vector and multimodal data lake for AI agents with deep learning integrations

Vector·2020·Apache-2.0·Python, C++

RisingWave

Postgres-compatible streaming database for real-time event processing and analytics

Streaming·Apache-2.0·2022·Rust

9.2k

+10

+80

today

RisingWave

9.2k+80 30d

Postgres-compatible streaming database for real-time event processing and analytics

Streaming·2022·Apache-2.0·Rust

Vespa

Open-source big data serving engine combining search, recommendation, and real-time AI at scale

Search·Apache-2.0·2017·Java, C++

7.0k

+29

+57

today

Vespa

7.0k+57 30d

Open-source big data serving engine combining search, recommendation, and real-time AI at scale

Search·2017·Apache-2.0·Java, C++

Apache CouchDB

Seamless multi-master sync with an intuitive HTTP/JSON API

Document·Apache-2.0·2005·Erlang, JavaScript, C

6.9k

+19

Apache CouchDB

6.9k+19 30d

Seamless multi-master sync with an intuitive HTTP/JSON API

Document·2005·Apache-2.0·Erlang, JavaScript, C

Hazelcast

Unified real-time data platform combining in-memory data grid with stream processing

Key-Value·Apache-2.0·2008·Java

6.6k

+13

Hazelcast

6.6k+13 30d

Unified real-time data platform combining in-memory data grid with stream processing

Key-Value·2008·Apache-2.0·Java

GreptimeDB

Open-source unified observability database for metrics, logs, and traces built in Rust

Time-Series·Apache-2.0·2022·Rust

6.5k

+19

+115

GreptimeDB

6.5k+115 30d

Open-source unified observability database for metrics, logs, and traces built in Rust

Time-Series·2022·Apache-2.0·Rust

Apache IoTDB

High-performance time-series database for IoT data with lightweight architecture and high compression

Time-Series·Apache-2.0·2019·Java

6.4k

+16

today

Apache IoTDB

6.4k+16 30d

High-performance time-series database for IoT data with lightweight architecture and high compression

Time-Series·2019·Apache-2.0·Java

Apache Pinot

Real-time distributed OLAP datastore for ultra low-latency analytics at high throughput

Analytics·Apache-2.0·2015·Java

6.1k

today

Apache Pinot

6.1k+7 30d

Real-time distributed OLAP datastore for ultra low-latency analytics at high throughput

Analytics·2015·Apache-2.0·Java

Apache Hive

Data warehouse software for reading, writing, and managing large datasets in distributed storage using SQL

Analytics·Apache-2.0·2010·Java

6.0k

+14

Apache Hive

6.0k+14 30d

Data warehouse software for reading, writing, and managing large datasets in distributed storage using SQL

Analytics·2010·Apache-2.0·Java

AliSQL

Alibaba's battle-tested MySQL branch with built-in DuckDB analytics and vector search

Relational·GPL-2.0·2016·C++, C

5.8k

−4

+18

2mo

AliSQL

5.8k+18 30d

Alibaba's battle-tested MySQL branch with built-in DuckDB analytics and vector search

Relational·2016·GPL-2.0·C++, C

Cortex

Horizontally scalable, multi-tenant long-term storage for Prometheus metrics

Time-Series·Apache-2.0·2016·Go

5.8k

+15

Cortex

5.8k+15 30d

Horizontally scalable, multi-tenant long-term storage for Prometheus metrics

Time-Series·2016·Apache-2.0·Go

KurrentDBSA

Event-native database for event sourcing and event-driven architectures with built-in streaming, formerly EventStoreDB

Streaming·Kurrent License·2012·C#

5.8k

KurrentDB

5.8k+7 30d

Event-native database for event sourcing and event-driven architectures with built-in streaming, formerly EventStoreDB

Streaming·2012·Kurrent License·C#

JanusGraph

Scalable open-source distributed graph database optimized for storing and querying billions of vertices and edges

Graph·Apache-2.0·2017·Java

5.8k

JanusGraph

5.8k+9 30d

Scalable open-source distributed graph database optimized for storing and querying billions of vertices and edges

Graph·2017·Apache-2.0·Java

Apache HBase

Distributed wide-column store for random real-time read/write access to big data

Wide-Column·Apache-2.0·2008·Java

5.5k

+10

Apache HBase

5.5k+10 30d

Distributed wide-column store for random real-time read/write access to big data

Wide-Column·2008·Apache-2.0·Java

Apache Ignite

Distributed in-memory database with ACID transactions, SQL, and compute capabilities

Multi-Model·Apache-2.0·2015·Java, C++, C#

5.1k

−2

Apache Ignite

5.1k+6 30d

Distributed in-memory database with ACID transactions, SQL, and compute capabilities

Multi-Model·2015·Apache-2.0·Java, C++, C#

OpenTSDB

Distributed, scalable time-series database built on top of HBase for monitoring at massive scale

Time-Series·LGPL-2.1·2010·Java

5.1k

OpenTSDB

5.1k+1 30d

Distributed, scalable time-series database built on top of HBase for monitoring at massive scale

Time-Series·2010·LGPL-2.1·Java

MarqoSA

AI-native tensor search engine with built-in embedding generation for multimodal vector search

Vector·Apache-2.0·2022·Python

5.0k

−1

−4

Marqo

5.0k−4 30d

AI-native tensor search engine with built-in embedding generation for multimodal vector search

Vector·2022·Apache-2.0·Python

OrientDB

Multi-model database combining graph, document, key-value, and object models with SQL support and ACID transactions

Multi-Model·Apache-2.0·2010·Java

5.0k

OrientDB

5.0k+6 30d

Multi-model database combining graph, document, key-value, and object models with SQL support and ACID transactions

Multi-Model·2010·Apache-2.0·Java

M3DB

Distributed time-series database built by Uber for large-scale metrics with Prometheus and Graphite compatibility

Time-Series·Apache-2.0·2018·Go

4.9k

−4

−2

24d

M3DB

4.9k−2 30d

Distributed time-series database built by Uber for large-scale metrics with Prometheus and Graphite compatibility

Time-Series·2018·Apache-2.0·Go

YDB

Open-source distributed SQL database combining high availability, scalability, strong consistency, and ACID transactions

Relational·Apache-2.0·2022·C++

4.7k

+15

today

YDB

4.7k+15 30d

Open-source distributed SQL database combining high availability, scalability, strong consistency, and ACID transactions

Relational·2022·Apache-2.0·C++

CrateDB

Distributed SQL database for real-time analytics on massive datasets with PostgreSQL compatibility

Multi-Model·Apache-2.0·2014·Java

4.4k

CrateDB

4.4k+7 30d

Distributed SQL database for real-time analytics on massive datasets with PostgreSQL compatibility

Multi-Model·2014·Apache-2.0·Java

TypeDB

Polymorphic database with a conceptual data model, strong type system, and symbolic reasoning engine

Graph·MPL-2.0·2016·Rust

4.4k

+10

+32

TypeDB

4.4k+32 30d

Polymorphic database with a conceptual data model, strong type system, and symbolic reasoning engine

Graph·2016·MPL-2.0·Rust

Kvrocks

Distributed Redis-compatible key-value NoSQL database built on RocksDB for cost-effective persistent storage

Key-Value·Apache-2.0·2019·C++

4.4k

+30

Kvrocks

4.4k+30 30d

Distributed Redis-compatible key-value NoSQL database built on RocksDB for cost-effective persistent storage

Key-Value·2019·Apache-2.0·C++

dqlite

Lightweight distributed SQLite with Raft consensus for fault-tolerant edge and IoT deployments

Relational·LGPL-3.0·2017·C

4.4k

+19

dqlite

4.4k+19 30d

Lightweight distributed SQLite with Raft consensus for fault-tolerant edge and IoT deployments

Relational·2017·LGPL-3.0·C

RavenDBSA

ACID document database with integrated full-text search, time series, and distributed counters

Document·AGPL-3.0 / Commercial·2010·C#

4.0k

+14

RavenDB

4.0k+14 30d

ACID document database with integrated full-text search, time series, and distributed counters

Document·2010·AGPL-3.0 / Commercial·C#

Apache Kylin

Distributed OLAP engine with sub-second query performance via pre-calculated cubes on Hadoop

Analytics·Apache-2.0·2015·Java

3.8k

Apache Kylin

3.8k+2 30d

Distributed OLAP engine with sub-second query performance via pre-calculated cubes on Hadoop

Analytics·2015·Apache-2.0·Java

Netflix Atlas

In-memory dimensional time-series database built for operational metrics at Netflix scale

Time-Series·Apache-2.0·2014·Scala, Java

3.6k

Netflix Atlas

3.6k+3 30d

In-memory dimensional time-series database built for operational metrics at Netflix scale

Time-Series·2014·Apache-2.0·Scala, Java

Olric

Distributed, in-memory key/value store and cache with Redis-compatible protocol support

Key-Value·Apache-2.0·2018·Go

3.5k

+12

15d

Olric

3.5k+12 30d

Distributed, in-memory key/value store and cache with Redis-compatible protocol support

Key-Value·2018·Apache-2.0·Go

Roshi

Large-scale CRDT set implementation for timestamped events backed by Redis

Time-Series·BSD-2-Clause·2014·Go

3.2k

23d

Roshi

3.2k+1 30d

Large-scale CRDT set implementation for timestamped events backed by Redis

Time-Series·2014·BSD-2-Clause·Go

PolarDB for PostgreSQL

Alibaba's cloud-native PostgreSQL with shared-storage architecture and elastic scaling

Relational·Apache-2.0·2021·C

3.2k

PolarDB for PostgreSQL

3.2k+8 30d

Alibaba's cloud-native PostgreSQL with shared-storage architecture and elastic scaling

Relational·2021·Apache-2.0·C

Apache HugeGraph

High-performance graph database supporting hundreds of billions of vertices and edges

Graph·Apache-2.0·2017·Java

3.1k

+14

Apache HugeGraph

3.1k+14 30d

High-performance graph database supporting hundreds of billions of vertices and edges

Graph·2017·Apache-2.0·Java

LinDB

Scalable, high-performance distributed time-series database with multi-IDC replication

Time-Series·Apache-2.0·2019·Go

3.1k

18d

LinDB

3.1k+7 30d

Scalable, high-performance distributed time-series database with multi-IDC replication

Time-Series·2019·Apache-2.0·Go

Apache HoraeDB

High-performance distributed cloud-native time-series database for analytics and time-series workloads

Time-Series·Apache-2.0·2022·Rust

2.8k

−1

5mo

Apache HoraeDB

2.8k0 30d

High-performance distributed cloud-native time-series database for analytics and time-series workloads

Time-Series·2022·Apache-2.0·Rust

GridDB

IoT-optimized time-series database with hybrid in-memory and disk storage from Toshiba

Time-Series·AGPL-3.0·2013·C++, Java

2.5k

−1

3mo

GridDB

2.5k+2 30d

IoT-optimized time-series database with hybrid in-memory and disk storage from Toshiba

Time-Series·2013·AGPL-3.0·C++, Java

Apache Geode

In-memory data grid providing real-time, consistent access to data-intensive applications at massive scale

Key-Value·Apache-2.0·2015·Java

2.4k

1mo

Apache Geode

2.4k+1 30d

In-memory data grid providing real-time, consistent access to data-intensive applications at massive scale

Key-Value·2015·Apache-2.0·Java

Apache Sedona

Cluster computing framework for large-scale geospatial data processing on Spark, Flink, and Snowflake

Analytics·Apache-2.0·2017·Scala, Java, Python, Rust

2.3k

+12

Apache Sedona

2.3k+12 30d

Cluster computing framework for large-scale geospatial data processing on Spark, Flink, and Snowflake

Analytics·2017·Apache-2.0·Scala, Java, Python, Rust

Graph Engine

Distributed in-memory graph processing engine with strongly-typed key-value store, formerly Trinity

Graph·MIT·2015·C#, C++

2.3k

Graph Engine

2.3k+2 30d

Distributed in-memory graph processing engine with strongly-typed key-value store, formerly Trinity

Graph·2015·MIT·C#, C++

YTsaurus

Exabyte-scale distributed storage and processing platform for big data from Yandex

Multi-Model·Apache-2.0·2023·C++

2.2k

today

YTsaurus

2.2k+3 30d

Exabyte-scale distributed storage and processing platform for big data from Yandex

Multi-Model·2023·Apache-2.0·C++

VictoriaLogs

Fast and easy-to-use open-source log management database by VictoriaMetrics

Search·Apache-2.0·2023·Go

2.0k

+23

+83

VictoriaLogs

2.0k+83 30d

Fast and easy-to-use open-source log management database by VictoriaMetrics

Search·2023·Apache-2.0·Go

Apache Drill

Schema-free SQL query engine for Hadoop, NoSQL, and cloud storage with dynamic schema discovery

Analytics·Apache-2.0·2015·Java

2.0k

−1

15d

Apache Drill

2.0k+2 30d

Schema-free SQL query engine for Hadoop, NoSQL, and cloud storage with dynamic schema discovery

Analytics·2015·Apache-2.0·Java

ActorDB

Distributed SQL database using the actor model with Raft consensus on SQLite

Relational·MPL-2.0·2014·Erlang, C

1.9k

−1

ActorDB

1.9k−1 30d

Distributed SQL database using the actor model with Raft consensus on SQLite

Relational·2014·MPL-2.0·Erlang, C

MatrixOne

Cloud-native HTAP database with MySQL compatibility, Git-style data versioning, and AI-native capabilities

Relational·Apache-2.0·2021·Go

1.9k

+14

today

MatrixOne

1.9k+14 30d

Cloud-native HTAP database with MySQL compatibility, Git-style data versioning, and AI-native capabilities

Relational·2021·Apache-2.0·Go

KairosDB

Fast distributed scalable time-series database built on top of Apache Cassandra

Time-Series·Apache-2.0·2013·Java

1.8k

4mo

KairosDB

1.8k0 30d

Fast distributed scalable time-series database built on top of Apache Cassandra

Time-Series·2013·Apache-2.0·Java

CnosDB

Cloud-native open-source distributed time-series database with high performance and compression

Time-Series·AGPL-3.0·2022·Rust

1.8k

−1

−3

9mo

CnosDB

1.8k−3 30d

Cloud-native open-source distributed time-series database with high performance and compression

Time-Series·2022·AGPL-3.0·Rust

Elassandra

Apache Cassandra distribution with tightly integrated Elasticsearch for combined NoSQL storage and search

Wide-Column·Apache-2.0·2015·Java

1.7k

−1

1mo

Elassandra

1.7k−1 30d

Apache Cassandra distribution with tightly integrated Elasticsearch for combined NoSQL storage and search

Wide-Column·2015·Apache-2.0·Java

Vald

Highly scalable distributed vector search engine built on Cloud-Native architecture with NGT

Vector·Apache-2.0·2019·Go

1.7k

today

Vald

1.7k+3 30d

Highly scalable distributed vector search engine built on Cloud-Native architecture with NGT

Vector·2019·Apache-2.0·Go

OpenMLDB

Open-source machine learning database providing consistent feature engineering for training and inference

Time-Series·Apache-2.0·2021·C++, Java, Python

1.7k

OpenMLDB

1.7k+6 30d

Open-source machine learning database providing consistent feature engineering for training and inference

Time-Series·2021·Apache-2.0·C++, Java, Python

PolarDB-X

Cloud-native distributed SQL database for high concurrency and massive storage with MySQL compatibility

Relational·Apache-2.0·2020·Java

1.7k

−2

7mo

PolarDB-X

1.7k−2 30d

Cloud-native distributed SQL database for high concurrency and massive storage with MySQL compatibility

Relational·2020·Apache-2.0·Java

Apache Solr

Blazing-fast, open-source multi-modal search platform built on Apache Lucene

Search·Apache-2.0·2004·Java

1.6k

+15

today

Apache Solr

1.6k+15 30d

Blazing-fast, open-source multi-modal search platform built on Apache Lucene

Search·2004·Apache-2.0·Java

CovenantSQL

Decentralized SQL database built on blockchain with immutable query history and GDPR compliance

Relational·Apache-2.0·2018·Go

1.5k

−1

CovenantSQL

1.5k−1 30d

Decentralized SQL database built on blockchain with immutable query history and GDPR compliance

Relational·2018·Apache-2.0·Go

Comdb2

Bloomberg's clustered RDBMS built on optimistic concurrency with high availability SQL

Relational·Apache-2.0·2004·C

1.5k

Comdb2

1.5k+4 30d

Bloomberg's clustered RDBMS built on optimistic concurrency with high availability SQL

Relational·2004·Apache-2.0·C

EloqKVSA

High-performance distributed Redis-compatible database with ACID transactions, tiered storage and predictable tail latency.

Key-Value·source-available·2025

1.5k

today

EloqKV

1.5k+1 30d

High-performance distributed Redis-compatible database with ACID transactions, tiered storage and predictable tail latency.

Key-Value·2025·source-available

GeoMesa

Distributed spatio-temporal indexing on top of Accumulo, HBase, Cassandra, and Kafka

Multi-Model·Apache-2.0·2014·Scala, Java

1.5k

GeoMesa

1.5k0 30d

Distributed spatio-temporal indexing on top of Accumulo, HBase, Cassandra, and Kafka

Multi-Model·2014·Apache-2.0·Scala, Java

100

AerospikeSA

Flash-optimized distributed NoSQL database for real-time applications at massive scale

Key-Value·AGPL-3.0·2012·C

1.4k

+11

100

Aerospike

1.4k+11 30d

Flash-optimized distributed NoSQL database for real-time applications at massive scale

Key-Value·2012·AGPL-3.0·C

101

Infinispan

Open-source distributed in-memory data grid with multi-protocol access and cross-site replication

Key-Value·Apache-2.0·2009·Java

1.3k

101

Infinispan

1.3k+7 30d

Open-source distributed in-memory data grid with multi-protocol access and cross-site replication

Key-Value·2009·Apache-2.0·Java

102

Apache Impala

Native analytic SQL engine for Apache Hadoop and open data formats with low-latency queries

Analytics·Apache-2.0·2012·C++, Java

1.3k

−2

102

Apache Impala

1.3k+1 30d

Native analytic SQL engine for Apache Hadoop and open data formats with low-latency queries

Analytics·2012·Apache-2.0·C++, Java

103

Apache Cloudberry

Advanced open-source MPP analytics database forked from Greenplum with a modern PostgreSQL kernel

Analytics·Apache-2.0·2023·C, C++

1.2k

+14

103

Apache Cloudberry

1.2k+14 30d

Advanced open-source MPP analytics database forked from Greenplum with a modern PostgreSQL kernel

Analytics·2023·Apache-2.0·C, C++

104

openGemini

Cloud-native distributed time-series database by Huawei for IoT and observability at massive scale

Time-Series·Apache-2.0·2022·Go

1.2k

−1

today

104

openGemini

1.2k+4 30d

Cloud-native distributed time-series database by Huawei for IoT and observability at massive scale

Time-Series·2022·Apache-2.0·Go

105

Apache Accumulo

Sorted, distributed key-value store built on Hadoop with cell-level security

Wide-Column·Apache-2.0·2011·Java

1.2k

105

Apache Accumulo

1.2k+4 30d

Sorted, distributed key-value store built on Hadoop with cell-level security

Wide-Column·2011·Apache-2.0·Java

106

Apache Phoenix

Massively parallel SQL engine on top of Apache HBase for low-latency OLTP queries

Relational·Apache-2.0·2014·Java

1.1k

−1

106

Apache Phoenix

1.1k+1 30d

Massively parallel SQL engine on top of Apache HBase for low-latency OLTP queries

Relational·2014·Apache-2.0·Java

107

MyScale

SQL vector database built on ClickHouse for high-performance AI applications with filtered search

Vector·Apache-2.0·2023·C++

1.0k

107

MyScale

1.0k+2 30d

SQL vector database built on ClickHouse for high-performance AI applications with filtered search

Vector·2023·Apache-2.0·C++

108

ArcadeDB

Multi-model database supporting graphs, documents, key-value, vectors, time-series, and search in one engine

Multi-Model·Apache-2.0·2021·Java

1.0k

+17

+77

today

108

ArcadeDB

1.0k+77 30d

Multi-model database supporting graphs, documents, key-value, vectors, time-series, and search in one engine

Multi-Model·2021·Apache-2.0·Java

109

openGauss

Huawei's open-source PostgreSQL-derived enterprise RDBMS optimized for ARM and high concurrency

Relational·Mulan PSL v2·2020·C, C++

782

18d

109

openGauss

782+4 30d

Huawei's open-source PostgreSQL-derived enterprise RDBMS optimized for ARM and high concurrency

Relational·2020·Mulan PSL v2·C, C++

110

NCache

Open-source distributed in-memory cache for .NET and Java with pub/sub messaging

Key-Value·Apache-2.0·2005·C#, .NET

664

110

NCache

664+3 30d

Open-source distributed in-memory cache for .NET and Java with pub/sub messaging

Key-Value·2005·Apache-2.0·C#, .NET

111

SiriDB

Highly scalable and super fast open-source time-series database with dynamic grouping

Time-Series·MIT·2017·C

513

2mo

111

SiriDB

5130 30d

Highly scalable and super fast open-source time-series database with dynamic grouping

Time-Series·2017·MIT·C

112

OpenTenBase

Enterprise-level distributed HTAP database based on PostgreSQL for hybrid transactional and analytical workloads

Relational·BSD-3-Clause·2019·C

508

+60

8mo

112

OpenTenBase

508+60 30d

Enterprise-level distributed HTAP database based on PostgreSQL for hybrid transactional and analytical workloads

Relational·2019·BSD-3-Clause·C

113

Oracle Coherence

In-memory data grid with fault-tolerant caching, transactions, and event processing for enterprise Java applications

Key-Value·UPL-1.0·2001·Java

470

1mo

113

Oracle Coherence

470+1 30d

In-memory data grid with fault-tolerant caching, transactions, and event processing for enterprise Java applications

Key-Value·2001·UPL-1.0·Java

114

Fluree

Immutable, ledger-backed semantic graph database with native RDF and JSON-LD support

Graph·EPL-2.0·2016·Clojure

423

+32

114

Fluree

423+32 30d

Immutable, ledger-backed semantic graph database with native RDF and JSON-LD support

Graph·2016·EPL-2.0·Clojure

115

Warp 10

Advanced open-source time-series platform with native geo-temporal support and WarpScript analytics

Time-Series·Apache-2.0·2015·Java

415

4mo

115

Warp 10

4150 30d

Advanced open-source time-series platform with native geo-temporal support and WarpScript analytics

Time-Series·2015·Apache-2.0·Java

116

Gnocchi

Scalable time-series database with pre-computed aggregations for cloud metrics and resource indexing

Time-Series·Apache-2.0·2017·Python

322

2mo

116

Gnocchi

3220 30d

Scalable time-series database with pre-computed aggregations for cloud metrics and resource indexing

Time-Series·2017·Apache-2.0·Python

117

Percona Server for MongoDB

Enhanced open-source MongoDB drop-in replacement with enterprise-grade security and backup features

Document·SSPL·2016·C++, JavaScript

247

117

Percona Server for MongoDB

247+2 30d

Enhanced open-source MongoDB drop-in replacement with enterprise-grade security and backup features

Document·2016·SSPL·C++, JavaScript

118

Riak KV

Distributed NoSQL key-value database with masterless architecture for high availability and fault tolerance

Key-Value·Apache-2.0·2009·Erlang

118

Riak KV

50+6 30d

Distributed NoSQL key-value database with masterless architecture for high availability and fault tolerance

Key-Value·2009·Apache-2.0·Erlang

119

1010dataP

Cloud-based columnar analytics platform for massive-scale data discovery and ad hoc analysis

Analytics·proprietary·2000·K

—

119

1010data

Cloud-based columnar analytics platform for massive-scale data discovery and ad hoc analysis

Analytics·2000·proprietary·K

120

Actian NoSQL DatabaseP

Object-oriented database for complex data models with native language integration

Document·proprietary·1988·C, C++, Java

—

120

Actian NoSQL Database

Object-oriented database for complex data models with native language integration

Document·1988·proprietary·C, C++, Java

121

AlgoliaP

AI-powered search and discovery API delivering sub-millisecond results with typo tolerance and real-time indexing

Search·proprietary·2012·C++

—

121

Algolia

AI-powered search and discovery API delivering sub-millisecond results with typo tolerance and real-time indexing

Search·2012·proprietary·C++

122

Alibaba Cloud AnalyticDB for MySQLP

Cloud-native real-time data warehouse with MySQL compatibility for petabyte-scale analytics

Analytics·proprietary·2017·C++

—

122

Alibaba Cloud AnalyticDB for MySQL

Cloud-native real-time data warehouse with MySQL compatibility for petabyte-scale analytics

Analytics·2017·proprietary·C++

123

Alibaba Cloud AnalyticDB for PostgreSQLP

MPP cloud data warehouse with PostgreSQL compatibility and vector search capabilities

Analytics·proprietary·2016·C, C++

—

123

Alibaba Cloud AnalyticDB for PostgreSQL

MPP cloud data warehouse with PostgreSQL compatibility and vector search capabilities

Analytics·2016·proprietary·C, C++

124

Alibaba Cloud Log ServiceP

Cloud-native observability platform for PB-scale log collection, analysis, and visualization

Analytics·proprietary·2016

—

124

Alibaba Cloud Log Service

Cloud-native observability platform for PB-scale log collection, analysis, and visualization

Analytics·2016·proprietary

125

Alibaba Cloud MaxComputeP

Fully managed petabyte-scale data warehouse with serverless SQL, MapReduce, and graph computation

Analytics·proprietary·2010

—

125

Alibaba Cloud MaxCompute

Fully managed petabyte-scale data warehouse with serverless SQL, MapReduce, and graph computation

Analytics·2010·proprietary

126

Alibaba Cloud PolarDBP

Cloud-native relational database with MySQL, PostgreSQL, and Oracle compatibility and HTAP capabilities

Relational·proprietary·2018·C, C++

—

126

Alibaba Cloud PolarDB

Cloud-native relational database with MySQL, PostgreSQL, and Oracle compatibility and HTAP capabilities

Relational·2018·proprietary·C, C++

127

Alibaba Cloud Table StoreP

Serverless NoSQL wide-column and time-series storage with auto-scaling to 10 PB

Wide-Column·proprietary·2016

—

127

Alibaba Cloud Table Store

Serverless NoSQL wide-column and time-series storage with auto-scaling to 10 PB

Wide-Column·2016·proprietary

128

AllegroGraphP

Neuro-symbolic AI platform combining RDF knowledge graphs, vector store, and SPARQL in a transactional graph database

Graph·Commercial (free edition available)·2004·Common Lisp, C

—

128

AllegroGraph

Neuro-symbolic AI platform combining RDF knowledge graphs, vector store, and SPARQL in a transactional graph database

Graph·2004·Commercial (free edition available)·Common Lisp, C

129

Amazon AuroraP

MySQL and PostgreSQL-compatible relational database with up to 5x throughput and 99.999% availability

Relational·proprietary·2014

—

129

Amazon Aurora

MySQL and PostgreSQL-compatible relational database with up to 5x throughput and 99.999% availability

Relational·2014·proprietary

130

Amazon CloudSearchP

Managed search service with auto-scaling, faceted search, and support for 34 languages

Search·proprietary·2012

—

130

Amazon CloudSearch

Managed search service with auto-scaling, faceted search, and support for 34 languages

Search·2012·proprietary

131

Amazon DocumentDBP

Fully managed MongoDB-compatible document database with fast performance and up to 10 global regions

Document·proprietary·2019

—

131

Amazon DocumentDB

Fully managed MongoDB-compatible document database with fast performance and up to 10 global regions

Document·2019·proprietary

132

Amazon DynamoDBP

Serverless, fully managed NoSQL key-value and document database with single-digit millisecond performance at any scale

Key-Value·proprietary·2012

—

132

Amazon DynamoDB

Serverless, fully managed NoSQL key-value and document database with single-digit millisecond performance at any scale

Key-Value·2012·proprietary

133

Amazon KeyspacesP

Serverless, fully managed Apache Cassandra-compatible database service on AWS

Wide-Column·proprietary·2020

—

133

Amazon Keyspaces

Serverless, fully managed Apache Cassandra-compatible database service on AWS

Wide-Column·2020·proprietary

134

Amazon NeptuneP

Fully managed graph database service supporting Gremlin, openCypher, and SPARQL

Graph·proprietary·2018

—

134

Amazon Neptune

Fully managed graph database service supporting Gremlin, openCypher, and SPARQL

Graph·2018·proprietary

135

Amazon RedshiftP

Petabyte-scale cloud data warehouse with columnar storage and massively parallel processing

Analytics·proprietary·2013

—

135

Amazon Redshift

Petabyte-scale cloud data warehouse with columnar storage and massively parallel processing

Analytics·2013·proprietary

136

Amazon TimestreamP

Serverless time-series database for IoT and operational applications with built-in analytics

Time-Series·proprietary·2020

—

136

Amazon Timestream

Serverless time-series database for IoT and operational applications with built-in analytics

Time-Series·2020·proprietary

137

AnzoGraph DBP

Massively parallel graph OLAP database for W3C standards-based analytics at scale

Graph·proprietary·2018·C++

—

137

AnzoGraph DB

Massively parallel graph OLAP database for W3C standards-based analytics at scale

Graph·2018·proprietary·C++

138

Axibase Time Series DatabaseP

Special-purpose time-series database for IT infrastructure, industrial equipment, and financial market data

Time-Series·proprietary·2004·Java

—

138

Axibase Time Series Database

Special-purpose time-series database for IT infrastructure, industrial equipment, and financial market data

Time-Series·2004·proprietary·Java

139

Azure Cosmos DBP

Globally distributed, multi-model database service with turnkey multi-region replication and single-digit millisecond latency

Multi-Model·proprietary·2017·C++, C#

—

139

Azure Cosmos DB

Globally distributed, multi-model database service with turnkey multi-region replication and single-digit millisecond latency

Multi-Model·2017·proprietary·C++, C#

140

Cloudflare Workers KVP

Global edge key-value store with low-latency reads across 330+ locations

Key-Value·proprietary·2018

—

140

Cloudflare Workers KV

Global edge key-value store with low-latency reads across 330+ locations

Key-Value·2018·proprietary

141

CloudKitP

Apple's cloud database service for seamless data sync across all Apple platforms

Document·proprietary·2014

—

141

CloudKit

Apple's cloud database service for seamless data sync across all Apple platforms

Document·2014·proprietary

142

CouchbaseSA

Multi-model NoSQL database for enterprise applications with SQL++ support

Multi-Model·BSL 1.1 / Apache-2.0 (Community)·2011·C++, Go, Erlang, C

—

142

Couchbase

Multi-model NoSQL database for enterprise applications with SQL++ support

Multi-Model·2011·BSL 1.1 / Apache-2.0 (Community)·C++, Go, Erlang, C

143

CoveoP

AI-powered enterprise search and relevance platform with machine learning recommendations

Search·proprietary·2005

—

143

Coveo

AI-powered enterprise search and relevance platform with machine learning recommendations

Search·2005·proprietary

144

DatabricksP

Unified data lakehouse platform combining the best of data warehouses and data lakes with Delta Lake

Analytics·proprietary·2013

—

144

Databricks

Unified data lakehouse platform combining the best of data warehouses and data lakes with Delta Lake

Analytics·2013·proprietary

145

DataStax EnterpriseP

Enterprise-grade distributed database built on Apache Cassandra with integrated analytics, search, and graph

Wide-Column·Commercial·2010·Java

—

145

DataStax Enterprise

Enterprise-grade distributed database built on Apache Cassandra with integrated analytics, search, and graph

Wide-Column·2010·Commercial·Java

146

DolphinDBP

High-performance time-series database with built-in analytics for finance and IoT

Time-Series·Proprietary·2018·C++

—

146

DolphinDB

High-performance time-series database with built-in analytics for finance and IoT

Time-Series·2018·Proprietary·C++

147

ExasolP

High-performance in-memory MPP analytics database delivering up to 1000x faster analytical queries

Analytics·Commercial·2000·C++

—

147

Exasol

High-performance in-memory MPP analytics database delivering up to 1000x faster analytical queries

Analytics·2000·Commercial·C++

148

FireboltP

Sub-second analytics cloud data warehouse built for high-concurrency, data-intensive applications

Analytics·proprietary·2021·C++

—

148

Firebolt

Sub-second analytics cloud data warehouse built for high-concurrency, data-intensive applications

Analytics·2021·proprietary·C++

149

FirestoreP

Serverless, fully managed NoSQL document database with real-time sync and offline support for mobile and web apps

Document·proprietary·2017

—

149

Firestore

Serverless, fully managed NoSQL document database with real-time sync and offline support for mobile and web apps

Document·2017·proprietary

150

GalaxybaseP

High-performance native distributed graph database for HTAP workloads at trillion-edge scale

Graph·proprietary·2018·Java

—

150

Galaxybase

High-performance native distributed graph database for HTAP workloads at trillion-edge scale

Graph·2018·proprietary·Java

151

GBaseP

Chinese enterprise database platform with leading analytical and transactional database products

Analytics·proprietary·2004·C, C++

—

151

GBase

Chinese enterprise database platform with leading analytical and transactional database products

Analytics·2004·proprietary·C, C++

152

GemStone/SP

Smalltalk-based object database for scalable, transactional multi-tier business applications

Multi-Model·Proprietary·1986·Smalltalk, C

—

152

GemStone/S

Smalltalk-based object database for scalable, transactional multi-tier business applications

Multi-Model·1986·Proprietary·Smalltalk, C

153

GigaSpaces XAPP

In-memory computing platform for real-time analytics and extreme transaction processing

Key-Value·Proprietary·2000·Java

—

153

GigaSpaces XAP

In-memory computing platform for real-time analytics and extreme transaction processing

Key-Value·2000·Proprietary·Java

154

Google BigQueryP

Serverless, highly scalable multi-cloud data warehouse with built-in ML and real-time analytics

Analytics·proprietary·2010

—

154

Google BigQuery

Serverless, highly scalable multi-cloud data warehouse with built-in ML and real-time analytics

Analytics·2010·proprietary

155

Google Cloud BigtableP

Fully managed, low-latency wide-column NoSQL database for massive analytical and operational workloads

Wide-Column·Proprietary·2015·C++

—

155

Google Cloud Bigtable

Fully managed, low-latency wide-column NoSQL database for massive analytical and operational workloads

Wide-Column·2015·Proprietary·C++

156

Google Cloud DatastoreP

Highly scalable NoSQL document database with automatic sharding and ACID transactions

Document·Proprietary·2013

—

156

Google Cloud Datastore

Highly scalable NoSQL document database with automatic sharding and ACID transactions

Document·2013·Proprietary

157

Google Cloud SpannerP

Globally distributed, strongly consistent relational database with unlimited scale and 99.999% availability

Relational·Proprietary·2017·C++

—

157

Google Cloud Spanner

Globally distributed, strongly consistent relational database with unlimited scale and 99.999% availability

Relational·2017·Proprietary·C++

158

GraphBaseP

Enterprise graph database for knowledge graphs and Graph RAG with distributed cloud-native architecture

Graph·proprietary

—

158

GraphBase

Enterprise graph database for knowledge graphs and Graph RAG with distributed cloud-native architecture

Graph·proprietary

159

GreenplumP

Massively parallel processing analytics database built on PostgreSQL for large-scale data warehousing

Analytics·proprietary·2005·C, C++, Python

—

159

Greenplum

Massively parallel processing analytics database built on PostgreSQL for large-scale data warehousing

Analytics·2005·proprietary·C, C++, Python

160

GridGainP

In-memory computing platform built on Apache Ignite for real-time transactions and analytics

Multi-Model·proprietary·2007·Java, C++, C#

—

160

GridGain

In-memory computing platform built on Apache Ignite for real-time transactions and analytics

Multi-Model·2007·proprietary·Java, C++, C#

161

HPE Ezmeral Data FabricP

Converged data platform with integrated NoSQL database, file system, and event streams for hybrid cloud

Multi-Model·proprietary·2009·C++, Java

—

161

HPE Ezmeral Data Fabric

Converged data platform with integrated NoSQL database, file system, and event streams for hybrid cloud

Multi-Model·2009·proprietary·C++, Java

162

IBM CloudantP

Fully managed CouchDB-compatible JSON document database with global distribution and serverless scaling

Document·proprietary·2010·Erlang, JavaScript, C

—

162

IBM Cloudant

Fully managed CouchDB-compatible JSON document database with global distribution and serverless scaling

Document·2010·proprietary·Erlang, JavaScript, C

163

IBM Db2P

Enterprise-grade relational database with AI-powered optimization and hybrid cloud deployment

Relational·proprietary·1983·C, C++, Assembly

—

163

IBM Db2

Enterprise-grade relational database with AI-powered optimization and hybrid cloud deployment

Relational·1983·proprietary·C, C++, Assembly

164

InfiniteGraphP

Distributed graph database for large-scale relationship analytics and deep link analysis

Graph·Commercial·2010·Java, C++

—

164

InfiniteGraph

Distributed graph database for large-scale relationship analytics and deep link analysis

Graph·2010·Commercial·Java, C++

165

KineticaP

GPU-accelerated real-time analytics database for spatial, temporal, graph, and AI workloads at scale

Analytics·proprietary·2016·C++

—

165

Kinetica

GPU-accelerated real-time analytics database for spatial, temporal, graph, and AI workloads at scale

Analytics·2016·proprietary·C++

166

Kyligence EnterpriseP

AI-augmented OLAP analytics platform delivering sub-second queries on petabyte-scale data, built on Apache Kylin

Analytics·proprietary·2016·Java

—

166

Kyligence Enterprise

AI-augmented OLAP analytics platform delivering sub-second queries on petabyte-scale data, built on Apache Kylin

Analytics·2016·proprietary·Java

167

LeanXcaleP

Ultra-scalable distributed SQL database with full ACID compliance and NoSQL-speed ingestion

Relational·proprietary·2015·Java

—

167

LeanXcale

Ultra-scalable distributed SQL database with full ACID compliance and NoSQL-speed ingestion

Relational·2015·proprietary·Java

168

MarkLogicP

Enterprise multi-model database combining documents, graph, and search with government-grade security

Multi-Model·proprietary·2001·C++

—

168

MarkLogic

Enterprise multi-model database combining documents, graph, and search with government-grade security

Multi-Model·2001·proprietary·C++

169

Microsoft Azure AI SearchP

Enterprise cloud search service with vector search, semantic ranking, and AI-powered agentic retrieval

Search·proprietary·2014

—

169

Microsoft Azure AI Search

Enterprise cloud search service with vector search, semantic ranking, and AI-powered agentic retrieval

Search·2014·proprietary

170

Microsoft Azure Data ExplorerP

Fast and scalable data analytics service for real-time analysis of streaming and time-series data using Kusto Query Language

Analytics·proprietary·2019

—

170

Microsoft Azure Data Explorer

Fast and scalable data analytics service for real-time analysis of streaming and time-series data using Kusto Query Language

Analytics·2019·proprietary

171

Microsoft Azure SQL DatabaseP

Fully managed cloud relational database built on the Microsoft SQL Server engine with intelligent performance

Relational·proprietary·2010

—

171

Microsoft Azure SQL Database

Fully managed cloud relational database built on the Microsoft SQL Server engine with intelligent performance

Relational·2010·proprietary

172

Microsoft Azure Synapse AnalyticsP

Integrated analytics platform combining data warehousing, big data, and data integration with serverless and provisioned options

Analytics·proprietary·2019

—

172

Microsoft Azure Synapse Analytics

Integrated analytics platform combining data warehousing, big data, and data integration with serverless and provisioned options

Analytics·2019·proprietary

173

Microsoft Azure Table StorageP

Schemaless NoSQL key-value store for massive volumes of semi-structured data with strong consistency

Key-Value·proprietary·2009

—

173

Microsoft Azure Table Storage

Schemaless NoSQL key-value store for massive volumes of semi-structured data with strong consistency

Key-Value·2009·proprietary

174

Microsoft FabricP

Unified analytics platform converging data warehousing, engineering, science, and real-time intelligence on a single SaaS foundation

Analytics·proprietary·2023

—

174

Microsoft Fabric

Unified analytics platform converging data warehousing, engineering, science, and real-time intelligence on a single SaaS foundation

Analytics·2023·proprietary

175

NetezzaP

Purpose-built analytics appliance for high-performance data warehousing and advanced analytics

Analytics·Proprietary·2002·C, C++

—

175

Netezza

Purpose-built analytics appliance for high-performance data warehousing and advanced analytics

Analytics·2002·Proprietary·C, C++

176

NonStop SQLP

Fault-tolerant relational database for mission-critical OLTP on HPE NonStop systems

Relational·proprietary·1986·C, C++

—

176

NonStop SQL

Fault-tolerant relational database for mission-critical OLTP on HPE NonStop systems

Relational·1986·proprietary·C, C++

177

NuoDBP

Cloud-native distributed SQL database with ACID compliance and elastic scale-out

Relational·proprietary·2010·C++

—

177

NuoDB

Cloud-native distributed SQL database with ACID compliance and elastic scale-out

Relational·2010·proprietary·C++

178

Ontotext GraphDBP

Enterprise RDF triplestore with real-time semantic inferencing at billion-statement scale

Graph·proprietary·2000·Java

—

178

Ontotext GraphDB

Enterprise RDF triplestore with real-time semantic inferencing at billion-statement scale

Graph·2000·proprietary·Java

179

OracleP

Enterprise-grade multi-model database with AI-native capabilities

Relational·Oracle Commercial License·1979·C, C++

—

179

Oracle

Enterprise-grade multi-model database with AI-native capabilities

Relational·1979·Oracle Commercial License·C, C++

180

Oracle NoSQL Database

Distributed NoSQL database providing key-value, table, and document data models with ACID transactions

Key-Value·Apache-2.0 (Community Edition)·2011·Java

—

180

Oracle NoSQL Database

Distributed NoSQL database providing key-value, table, and document data models with ACID transactions

Key-Value·2011·Apache-2.0 (Community Edition)·Java

181

OushuDBP

Cloud-native MPP data warehouse built on Apache HAWQ for petabyte-scale interactive analytics

Analytics·proprietary·2016·C, C++

—

181

OushuDB

Cloud-native MPP data warehouse built on Apache HAWQ for petabyte-scale interactive analytics

Analytics·2016·proprietary·C, C++

182

PieCloudDBP

Cloud-native virtual data warehouse with elastic massive parallel processing (eMPP) architecture

Analytics·proprietary·2022·C++

—

182

PieCloudDB

Cloud-native virtual data warehouse with elastic massive parallel processing (eMPP) architecture

Analytics·2022·proprietary·C++

183

PineconeP

Fully managed vector database built for high-performance AI applications at scale

Vector·proprietary·2021

—

183

Pinecone

Fully managed vector database built for high-performance AI applications at scale

Vector·2021·proprietary

184

PlanetScaleP

Serverless MySQL-compatible cloud database platform powered by Vitess with branching and deploy requests

Relational·proprietary·2021·Go

—

184

PlanetScale

Serverless MySQL-compatible cloud database platform powered by Vitess with branching and deploy requests

Relational·2021·proprietary·Go

185

QuasarDBP

High-performance distributed column-oriented time-series database with native transactional support

Time-Series·proprietary·2009·C++

—

185

QuasarDB

High-performance distributed column-oriented time-series database with native transactional support

Time-Series·2009·proprietary·C++

186

RizhiyiP

Enterprise log analytics platform with proprietary Beaver search engine for PB-level log management

Search·proprietary·2014·C++

—

186

Rizhiyi

Enterprise log analytics platform with proprietary Beaver search engine for PB-level log management

Search·2014·proprietary·C++

187

SAP HANAP

In-memory relational database for real-time analytics and transactional processing in enterprise environments

Relational·proprietary·2010·C++

—

187

SAP HANA

In-memory relational database for real-time analytics and transactional processing in enterprise environments

Relational·2010·proprietary·C++

188

ScaleOut StateServerP

In-memory data grid with distributed caching, parallel query, and high availability for .NET and Java

Key-Value·Commercial·2005·C++, C#

—

188

ScaleOut StateServer

In-memory data grid with distributed caching, parallel query, and high availability for .NET and Java

Key-Value·2005·Commercial·C++, C#

189

SciDBSA

Array database for multidimensional data management and complex analytics in scientific computing

Analytics·AGPL-3.0·2010·C++

—

189

SciDB

Array database for multidimensional data management and complex analytics in scientific computing

Analytics·2010·AGPL-3.0·C++

190

SingleStoreP

Distributed SQL database for data-intensive applications combining transactions, analytics, and AI workloads

Relational·SingleStore Commercial License·2013·C++

—

190

SingleStore

Distributed SQL database for data-intensive applications combining transactions, analytics, and AI workloads

Relational·2013·SingleStore Commercial License·C++

191

SnowflakeP

Cloud-native data warehouse with automatic scaling, separation of storage and compute, and near-zero administration

Analytics·proprietary·2014

—

191

Snowflake

Cloud-native data warehouse with automatic scaling, separation of storage and compute, and near-zero administration

Analytics·2014·proprietary

192

SpaceTimeP

Spatiotemporal relational database optimized for analytical workloads on moving objects with JIT compilation

Analytics·proprietary·2020·C++

—

192

SpaceTime

Spatiotemporal relational database optimized for analytical workloads on moving objects with JIT compilation

Analytics·2020·proprietary·C++

193

SplunkP

Platform for searching, monitoring, and analyzing machine-generated data via a web-style interface

Search·proprietary·2003·C++, Python

—

193

Splunk

Platform for searching, monitoring, and analyzing machine-generated data via a web-style interface

Search·2003·proprietary·C++, Python

194

StardogP

Enterprise knowledge graph platform built on RDF standards for data unification and AI-powered insights

Graph·proprietary·2012·Java

—

194

Stardog

Enterprise knowledge graph platform built on RDF standards for data unification and AI-powered insights

Graph·2012·proprietary·Java

195

TDSQL for MySQLP

Tencent's distributed MySQL-compatible database with strong consistency and horizontal scaling

Relational·proprietary·2012·C, C++

—

195

TDSQL for MySQL

Tencent's distributed MySQL-compatible database with strong consistency and horizontal scaling

Relational·2012·proprietary·C, C++

196

TeradataP

Enterprise-grade parallel data warehouse for large-scale analytics and business intelligence

Analytics·proprietary·1984·C, C++

—

196

Teradata

Enterprise-grade parallel data warehouse for large-scale analytics and business intelligence

Analytics·1984·proprietary·C, C++

197

TigerGraphP

High-performance native graph analytics platform for AI, fraud detection, and real-time insights on connected data

Graph·proprietary·2017·C++

—

197

TigerGraph

High-performance native graph analytics platform for AI, fraud detection, and real-time insights on connected data

Graph·2017·proprietary·C++

198

Transwarp ArgoDBP

Distributed analytical database replacing Hadoop+MPP with unified SQL analytics and real-time data processing

Analytics·proprietary

—

198

Transwarp ArgoDB

Distributed analytical database replacing Hadoop+MPP with unified SQL analytics and real-time data processing

Analytics·proprietary

199

Transwarp HippoP

Enterprise cloud-native distributed vector database with GPU acceleration and multi-model support

Vector·proprietary

—

199

Transwarp Hippo

Enterprise cloud-native distributed vector database with GPU acceleration and multi-model support

Vector·proprietary

200

Transwarp KunDBP

Financial-grade distributed relational database with strong consistency and Oracle/MySQL compatibility

Relational·proprietary

—

200

Transwarp KunDB

Financial-grade distributed relational database with strong consistency and Oracle/MySQL compatibility

Relational·proprietary

201

Transwarp StellarDBP

Enterprise distributed graph database with native graph storage and deep link analysis at PB scale

Graph·proprietary

—

201

Transwarp StellarDB

Enterprise distributed graph database with native graph storage and deep link analysis at PB scale

Graph·proprietary

202

turbopufferP

Serverless vector and full-text search database built on object storage for low-cost high-scale workloads

Vector·Commercial·2024·Rust

—

202

turbopuffer

Serverless vector and full-text search database built on object storage for low-cost high-scale workloads

Vector·2024·Commercial·Rust

203

UltipaP

Ultra-high-performance 4th-generation graph database with deep traversal and GQL compliance

Graph·proprietary·2019·C++

—

203

Ultipa

Ultra-high-performance 4th-generation graph database with deep traversal and GQL compliance

Graph·2019·proprietary·C++

204

VerticaP

High-performance columnar analytics database for petabyte-scale real-time analytics and machine learning

Analytics·Proprietary·2005·C++

—

204

Vertica

High-performance columnar analytics database for petabyte-scale real-time analytics and machine learning

Analytics·2005·Proprietary·C++

205

VMware Tanzu GemFireP

Enterprise-grade distributed in-memory data grid for sub-millisecond, low-latency applications

Key-Value·proprietary·2002·Java

—

205

VMware Tanzu GemFire

Enterprise-grade distributed in-memory data grid for sub-millisecond, low-latency applications

Key-Value·2002·proprietary·Java

206

XtremeDataP

Cloud-scale parallel SQL analytics engine with vectorized execution for complex analytical workloads

Analytics·proprietary·2005·C++

—

206

XtremeData

Cloud-scale parallel SQL analytics engine with vectorized execution for complex analytical workloads

Analytics·2005·proprietary·C++

207

YellowbrickP

Modern enterprise cloud data warehouse built on Kubernetes for extreme speed and concurrency

Analytics·proprietary·2014·C++

—

207

Yellowbrick

Modern enterprise cloud data warehouse built on Kubernetes for extreme speed and concurrency

Analytics·2014·proprietary·C++

What is a Distributed Database?

A distributed database spreads data across multiple machines (nodes) that work together as a single logical system. Data is partitioned (sharded) and replicated across nodes to provide horizontal scalability, fault tolerance, and geographic distribution. When one node fails, others continue serving requests. Distributed databases solve the fundamental limitation of single-server databases: they scale beyond what one machine can handle. They span multiple data models — distributed SQL (CockroachDB, TiDB, YugabyteDB), distributed NoSQL (Cassandra, ScyllaDB), distributed key-value (etcd, FoundationDB), and distributed document stores (MongoDB with sharding).

When to Use a Distributed Database

Use a distributed database when a single server can't meet your requirements for storage capacity, write throughput, read scalability, or availability. Specific scenarios include: applications serving users across multiple regions with low-latency requirements, workloads needing 99.999% uptime, datasets too large for a single machine, and write-heavy systems that exceed single-node throughput. Distributed SQL databases (CockroachDB, TiDB) offer familiar SQL with automatic sharding. Distributed NoSQL (Cassandra) offers extreme write scalability. Consider single-node databases when your data fits on one machine — they're simpler to operate, debug, and reason about.

Frequently Asked Questions

What is the difference between a distributed database and a regular database?

A regular (centralized) database runs on a single server — all data lives on one machine. A distributed database runs across multiple servers, with data partitioned and replicated across them. Distributed databases offer horizontal scalability (add more nodes to handle more load), fault tolerance (survive node failures), and geographic distribution (serve users from nearby nodes). The tradeoff is operational complexity — distributed systems are harder to deploy, monitor, debug, and reason about than single-node databases.

What is the CAP theorem?

The CAP theorem states that a distributed system can provide at most two of three guarantees simultaneously: Consistency (every read returns the latest write), Availability (every request gets a response), and Partition tolerance (the system works despite network failures). Since network partitions are inevitable, the real choice is between consistency and availability during failures. CockroachDB and YugabyteDB choose consistency (CP). Cassandra and DynamoDB choose availability (AP). In practice, modern databases offer tunable consistency — you can adjust the tradeoff per query.

What is the difference between CockroachDB and TiDB?

Both are distributed SQL databases but with different compatibility targets. CockroachDB is PostgreSQL wire-compatible — existing Postgres drivers and many Postgres queries work unchanged. TiDB is MySQL wire-compatible — existing MySQL applications can migrate with minimal changes. CockroachDB uses a Raft-based consensus protocol and is written in Go. TiDB separates compute (TiDB) from storage (TiKV, built on RocksDB). Both support horizontal scaling and ACID transactions across distributed nodes.

Is MongoDB a distributed database?

MongoDB supports distribution through built-in sharding — you can partition data across multiple servers based on a shard key. However, MongoDB is not distributed by default. A single-node MongoDB deployment is a standard centralized database. You enable distribution by configuring a sharded cluster with config servers, shard nodes, and mongos routers. Purpose-built distributed databases like CockroachDB, TiDB, and Cassandra are distributed from the ground up and handle sharding automatically without manual configuration.

Do I need a distributed database?

Most applications don't. A well-tuned single-node PostgreSQL or MySQL instance handles millions of rows and thousands of concurrent connections. You need a distributed database when: your dataset exceeds single-server storage, your write throughput exceeds single-server capacity, you need multi-region low-latency access, or you require 99.999%+ uptime. If your data fits on one machine and you're not serving globally, a single-node database with read replicas is simpler and often faster than a distributed system.

Browse by Category

Explore databases organized by type, data model, and architecture.

Relational Document Key-Value Vector Time-Series NoSQL Graph Search Engine Analytics Wide-Column In-Memory Embedded Streaming

Browse by Compatibility

Drop-in replacements and alternatives that speak an existing database's protocol.

PostgreSQL-Compatible MySQL-Compatible Redis-Compatible MongoDB-Compatible Elasticsearch-Compatible

Manage Distributed Databases Visually

1bench is a modern GUI client that supports all major distributed databases and many more.

Get Started