246+ Distributed Databases Ranked & Compared

Compare distributed databases ranked by GitHub stars, fault tolerance, and global scalability.

Last updated: April 14, 2026
213 databases
1Elasticsearch
Elasticsearch
76.5k+403 30d

Distributed search and analytics engine built on Apache Lucene for full-text search, observability, and security

Search·2010·Elastic-2.0·Java
2etcd
etcd
51.6k+135 30d

Distributed reliable key-value store for the most critical data of a distributed system

Key-Value·2013·Apache-2.0·Go
3ClickHouse
ClickHouse
46.8k+541 30d

Blazing-fast open-source column-oriented database for real-time analytics and OLAP

Analytics·2016·Apache-2.0·C++
4Milvus
Milvus
43.8k+898 30d

High-performance cloud-native vector database built for scalable similarity search and AI applications

Vector·2019·Apache-2.0·Go, C++
5Apache Spark SQL
Apache Spark SQL
43.1k+185 30d

Distributed SQL query engine within Apache Spark for structured data processing at scale

Analytics·2014·Apache-2.0·Scala, Java, Python, R
6TiDB
TiDB
40.0k+187 30d

MySQL-compatible distributed SQL database for hybrid transactional and analytical workloads

Relational·2015·Apache-2.0·Go
7CockroachDB
CockroachDB
32.0k+148 30d

Distributed SQL database built for cloud-native global applications with serializable isolation

Relational·2015·BSL-1.1·Go, C++
8SurrealDB
SurrealDB
31.8k+275 30d

Multi-model database combining documents, graphs, vectors, and time-series with built-in API layer and real-time capabilities

Multi-Model·2022·BSL-1.1·Rust
9InfluxDB
InfluxDB
31.4k+182 30d

Scalable time-series database built in Rust for metrics, events, and real-time analytics

Time-Series·2013·Apache-2.0·Rust
10Qdrant
Qdrant
30.3k+743 30d

High-performance open-source vector database for next-generation AI applications

Vector·2021·Apache-2.0·Rust
11MongoDB
MongoDB
28.2k+124 30d

The most popular document database for modern applications

Document·2009·SSPL·C++, JavaScript, Python
12RethinkDB
RethinkDB
27.0k+19 30d

Open-source document database designed for real-time push updates to applications

Document·2012·Apache-2.0·C++
13Apache Flink
Apache Flink
25.9k+109 30d

Stateful stream processing framework for real-time and batch data at any scale

Streaming·2011·Apache-2.0·Java, Scala
14Valkey
Valkey
25.4k+359 30d

Open-source high-performance key-value database forked from Redis, backed by the Linux Foundation

Key-Value·2024·BSD-3-Clause·C
15TDengine
TDengine
24.8k+84 30d

High-performance open-source time-series database designed for Industrial IoT and real-time analytics

Time-Series·2019·AGPL-3.0·C
16Dgraph
Dgraph
21.7k+97 30d

Distributed graph database with native GraphQL support built for horizontal scale

Graph·2017·Apache-2.0·Go
17Neon
Neon
21.5k+278 30d

Serverless PostgreSQL with separated storage and compute, branching, and scale-to-zero

Relational·2022·Apache-2.0·Rust, C
18Vitess
Vitess
20.9k+141 30d

Cloud-native database clustering system for horizontal scaling of MySQL through transparent sharding

Relational·2012·Apache-2.0·Go
19rqlite
rqlite
17.4k+75 30d

Lightweight, fault-tolerant, distributed relational database built on SQLite and Raft consensus

Relational·2014·MIT·Go
20VictoriaMetrics
VictoriaMetrics
16.7k+263 30d

Fast, cost-effective time-series database and monitoring solution compatible with Prometheus

Time-Series·2018·Apache-2.0·Go
21Presto
Presto
16.7k+28 30d

Distributed SQL query engine for running interactive analytic queries against data sources of all sizes

Analytics·2012·Apache-2.0·Java, C++
22TiKV
TiKV
16.6k+78 30d

Distributed transactional key-value database providing ACID guarantees at scale

Key-Value·2016·Apache-2.0·Rust
23Neo4j
Neo4j
16.3k+193 30d

Native graph database with Cypher query language for connected data at scale

Graph·2007·GPL-3.0·Java, Scala
24FoundationDB
FoundationDB
16.2k+65 30d

Distributed, transactional key-value store with multi-model layers and strict serializability

Key-Value·2013·Apache-2.0·C++, Flow
25Weaviate
Weaviate
16.0k+191 30d

AI-native vector database with hybrid search and built-in model integration

Vector·2019·BSD-3-Clause·Go
26ScyllaDB
ScyllaDB
15.5k+78 30d

High-performance NoSQL wide-column database compatible with Apache Cassandra and Amazon DynamoDB

Wide-Column·2015·ScyllaDB Source Available License·C++
27Apache Doris
Apache Doris
15.2k+125 30d

High-performance real-time analytical database for sub-second queries on large-scale data

Analytics·2017·Apache-2.0·Java, C++
28Memcached
Memcached
14.2k+32 30d

High-performance distributed memory caching system for speeding up dynamic web applications

Key-Value·2003·BSD-3-Clause·C
29ArangoDB
ArangoDB
14.1k+44 30d

Multi-model database unifying document, graph, and key-value in a single engine with AQL

Multi-Model·2012·BUSL-1.1·C++, JavaScript
30Thanos
Thanos
14.0k+47 30d

Highly available Prometheus setup with unlimited long-term storage on object storage

Time-Series·2017·Apache-2.0·Go
31Apache Druid
Apache Druid
14.0k+23 30d

High-performance real-time analytics database for sub-second OLAP queries at scale

Analytics·2012·Apache-2.0·Java
32OpenSearch
OpenSearch
12.7k+170 30d

Community-driven open-source search and analytics engine forked from Elasticsearch

Search·2021·Apache-2.0·Java
33Trino
Trino
12.7k+90 30d

Fast distributed SQL query engine for big data analytics across heterogeneous data sources

Analytics·2019·Apache-2.0·Java
34KeyDB
KeyDB
12.5k+21 30d

Multithreaded Redis fork with higher throughput and active replication

Key-Value·2019·BSD-3-Clause·C++, C
35Citus
Citus
12.4k+74 30d

Distributed PostgreSQL as an extension for multi-tenant SaaS and real-time analytics at scale

Relational·2016·AGPL-3.0·C
36NebulaGraph
NebulaGraph
12.1k+64 30d

Distributed graph database built for billion-scale graphs with millisecond latency

Graph·2019·Apache-2.0·C++
37Mnesia
Mnesia
12.1k+44 30d

Distributed real-time database management system built into Erlang/OTP for telecom-grade fault tolerance

Key-Value·1999·Apache-2.0·Erlang
38Manticore Search
Manticore Search
11.7k+58 30d

Fast open-source search database with SQL and JSON interfaces

Search·2017·GPL-3.0·C++
39StarRocks
StarRocks
11.6k+110 30d

High-performance MPP analytics engine for real-time and batch data warehousing

Analytics·2021·Apache-2.0·Java, C++
40Convex
Convex
11.2k+496 30d

Reactive backend database with real-time sync, TypeScript-native queries, and built-in serverless functions

Document·2022·BUSL-1.1·Rust, TypeScript
41Quickwit
Quickwit
11.1k+103 30d

Cloud-native search engine for observability, built on object storage with sub-second latency

Search·2021·Apache-2.0·Rust
42YugabyteDB
YugabyteDB
10.2k+64 30d

PostgreSQL-compatible distributed SQL database with high resilience and geo-distribution

Relational·2017·Apache-2.0·C++, Java
43OceanBase
OceanBase
10.1k+51 30d

Distributed relational database for high-performance transactional, analytical, and AI workloads at scale

Relational·2010·Mulan PubL v2·C++
44Apache Cassandra
Apache Cassandra
9.7k+49 30d

Distributed wide-column database designed for high availability and linear scalability across data centers

Wide-Column·2008·Apache-2.0·Java
45Databend
Databend
9.2k+69 30d

Cloud-native data warehouse built in Rust for analytics, search, and AI on object storage

Analytics·2021·Apache-2.0·Rust
46Deep Lake
Deep Lake
9.1k+48 30d

GPU-native vector and multimodal data lake for AI agents with deep learning integrations

Vector·2020·Apache-2.0·Python, C++
47RisingWave
RisingWave
8.9k+88 30d

Postgres-compatible streaming database for real-time event processing and analytics

Streaming·2022·Apache-2.0·Rust
48Vespa
Vespa
6.9k+49 30d

Open-source big data serving engine combining search, recommendation, and real-time AI at scale

Search·2017·Apache-2.0·Java, C++
49Apache CouchDB
Apache CouchDB
6.9k+24 30d

Seamless multi-master sync with an intuitive HTTP/JSON API

Document·2005·Apache-2.0·Erlang, JavaScript, C
50Hazelcast
Hazelcast
6.6k−11 30d

Unified real-time data platform combining in-memory data grid with stream processing

Key-Value·2008·Apache-2.0·Java
51Apache IoTDB
Apache IoTDB
6.3k+39 30d

High-performance time-series database for IoT data with lightweight architecture and high compression

Time-Series·2019·Apache-2.0·Java
52GreptimeDB
GreptimeDB
6.1k+97 30d

Open-source unified observability database for metrics, logs, and traces built in Rust

Time-Series·2022·Apache-2.0·Rust
53Apache Pinot
Apache Pinot
6.1k+14 30d

Real-time distributed OLAP datastore for ultra low-latency analytics at high throughput

Analytics·2015·Apache-2.0·Java
54Apache Hive
Apache Hive
6.0k−11 30d

Data warehouse software for reading, writing, and managing large datasets in distributed storage using SQL

Analytics·2010·Apache-2.0·Java
55AliSQL
AliSQL
5.8k+37 30d

Alibaba's battle-tested MySQL branch with built-in DuckDB analytics and vector search

Relational·2016·GPL-2.0·C++, C
56Cortex
Cortex
5.8k+20 30d

Horizontally scalable, multi-tenant long-term storage for Prometheus metrics

Time-Series·2016·Apache-2.0·Go
57KurrentDB
KurrentDB
5.8k+26 30d

Event-native database for event sourcing and event-driven architectures with built-in streaming, formerly EventStoreDB

Streaming·2012·Kurrent License·C#
58JanusGraph
JanusGraph
5.8k+36 30d

Scalable open-source distributed graph database optimized for storing and querying billions of vertices and edges

Graph·2017·Apache-2.0·Java
59Apache HBase
Apache HBase
5.6k−13 30d

Distributed wide-column store for random real-time read/write access to big data

Wide-Column·2008·Apache-2.0·Java
60Apache Ignite
Apache Ignite
5.1k+12 30d

Distributed in-memory database with ACID transactions, SQL, and compute capabilities

Multi-Model·2015·Apache-2.0·Java, C++, C#
61OpenTSDB
OpenTSDB
5.1k+2 30d

Distributed, scalable time-series database built on top of HBase for monitoring at massive scale

Time-Series·2010·LGPL-2.1·Java
62Marqo
Marqo
5.0k+7 30d

AI-native tensor search engine with built-in embedding generation for multimodal vector search

Vector·2022·Apache-2.0·Python
63OrientDB
OrientDB
5.0k+11 30d

Multi-model database combining graph, document, key-value, and object models with SQL support and ACID transactions

Multi-Model·2010·Apache-2.0·Java
64M3DB
M3DB
4.9k+7 30d

Distributed time-series database built by Uber for large-scale metrics with Prometheus and Graphite compatibility

Time-Series·2018·Apache-2.0·Go
65YDB
YDB
4.7k+22 30d

Open-source distributed SQL database combining high availability, scalability, strong consistency, and ACID transactions

Relational·2022·Apache-2.0·C++
66CrateDB
CrateDB
4.4k+16 30d

Distributed SQL database for real-time analytics on massive datasets with PostgreSQL compatibility

Multi-Model·2014·Apache-2.0·Java
67dqlite
dqlite
4.3k+16 30d

Lightweight distributed SQLite with Raft consensus for fault-tolerant edge and IoT deployments

Relational·2017·LGPL-3.0·C
68Apache Kvrocks
Apache Kvrocks
4.3k+34 30d

Distributed Redis-compatible key-value NoSQL database built on RocksDB for cost-effective persistent storage

Key-Value·2019·Apache-2.0·C++
69TypeDB
TypeDB
4.3k+46 30d

Polymorphic database with a conceptual data model, strong type system, and symbolic reasoning engine

Graph·2016·MPL-2.0·Rust
70RavenDB
RavenDB
3.9k+12 30d

ACID document database with integrated full-text search, time series, and distributed counters

Document·2010·AGPL-3.0 / Commercial·C#
71Apache Kylin
Apache Kylin
3.8k+2 30d

Distributed OLAP engine with sub-second query performance via pre-calculated cubes on Hadoop

Analytics·2015·Apache-2.0·Java
72Netflix Atlas
Netflix Atlas
3.5k+8 30d

In-memory dimensional time-series database built for operational metrics at Netflix scale

Time-Series·2014·Apache-2.0·Scala, Java
73Olric
Olric
3.4k+9 30d

Distributed, in-memory key/value store and cache with Redis-compatible protocol support

Key-Value·2018·Apache-2.0·Go
74Roshi
Roshi
3.2k+2 30d

Large-scale CRDT set implementation for timestamped events backed by Redis

Time-Series·2014·BSD-2-Clause·Go
75Alibaba Cloud PolarDB

Cloud-native relational database with MySQL, PostgreSQL, and Oracle compatibility and HTAP capabilities

Relational·2018·proprietary·C, C++
76LinDB
LinDB
3.1k+3 30d

Scalable, high-performance distributed time-series database with multi-IDC replication

Time-Series·2019·Apache-2.0·Go
77Apache HugeGraph
Apache HugeGraph
3.0k+74 30d

High-performance graph database supporting hundreds of billions of vertices and edges

Graph·2017·Apache-2.0·Java
78Apache HoraeDB
Apache HoraeDB
2.8k+3 30d

High-performance distributed cloud-native time-series database for analytics and time-series workloads

Time-Series·2022·Apache-2.0·Rust
79GridDB
GridDB
2.5k+7 30d

IoT-optimized time-series database with hybrid in-memory and disk storage from Toshiba

Time-Series·2013·AGPL-3.0·C++, Java
80Apache Geode
Apache Geode
2.4k+3 30d

In-memory data grid providing real-time, consistent access to data-intensive applications at massive scale

Key-Value·2015·Apache-2.0·Java
81Apache Sedona
Apache Sedona
2.3k+11 30d

Cluster computing framework for large-scale geospatial data processing on Spark, Flink, and Snowflake

Analytics·2017·Apache-2.0·Scala, Java, Python, Rust
82YTsaurus
YTsaurus
2.2k+20 30d

Exabyte-scale distributed storage and processing platform for big data from Yandex

Multi-Model·2023·Apache-2.0·C++
83Apache Drill
Apache Drill
2.0k+4 30d

Schema-free SQL query engine for Hadoop, NoSQL, and cloud storage with dynamic schema discovery

Analytics·2015·Apache-2.0·Java
84ActorDB
ActorDB
1.9k+4 30d

Distributed SQL database using the actor model with Raft consensus on SQLite

Relational·2014·MPL-2.0·Erlang, C
85MatrixOne
MatrixOne
1.8k−19 30d

Cloud-native HTAP database with MySQL compatibility, Git-style data versioning, and AI-native capabilities

Relational·2021·Apache-2.0·Go
86KairosDB
KairosDB
1.8k+2 30d

Fast distributed scalable time-series database built on top of Apache Cassandra

Time-Series·2013·Apache-2.0·Java
87VictoriaLogs
VictoriaLogs
1.8k+133 30d

Fast and easy-to-use open-source log management database by VictoriaMetrics

Search·2023·Apache-2.0·Go
88CnosDB
CnosDB
1.7k+5 30d

Cloud-native open-source distributed time-series database with high performance and compression

Time-Series·2022·AGPL-3.0·Rust
89Elassandra
Elassandra
1.7k0 30d

Apache Cassandra distribution with tightly integrated Elasticsearch for combined NoSQL storage and search

Wide-Column·2015·Apache-2.0·Java
90Vald
Vald
1.7k+9 30d

Highly scalable distributed vector search engine built on Cloud-Native architecture with NGT

Vector·2019·Apache-2.0·Go
91OpenMLDB
OpenMLDB
1.7k−1 30d

Open-source machine learning database providing consistent feature engineering for training and inference

Time-Series·2021·Apache-2.0·C++, Java, Python
92PolarDB-X
PolarDB-X
1.7k+2 30d

Cloud-native distributed SQL database for high concurrency and massive storage with MySQL compatibility

Relational·2020·Apache-2.0·Java
93Apache Solr
Apache Solr
1.6k+17 30d

Blazing-fast, open-source multi-modal search platform built on Apache Lucene

Search·2004·Apache-2.0·Java
94CovenantSQL
CovenantSQL
1.5k+1 30d

Decentralized SQL database built on blockchain with immutable query history and GDPR compliance

Relational·2018·Apache-2.0·Go
95Comdb2
Comdb2
1.5k+4 30d

Bloomberg's clustered RDBMS built on optimistic concurrency with high availability SQL

Relational·2004·Apache-2.0·C
96GeoMesa
GeoMesa
1.5k+3 30d

Distributed spatio-temporal indexing on top of Accumulo, HBase, Cassandra, and Kafka

Multi-Model·2014·Apache-2.0·Scala, Java
97Aerospike
Aerospike
1.3k+10 30d

Flash-optimized distributed NoSQL database for real-time applications at massive scale

Key-Value·2012·AGPL-3.0·C
98Infinispan
Infinispan
1.3k+2 30d

Open-source distributed in-memory data grid with multi-protocol access and cross-site replication

Key-Value·2009·Apache-2.0·Java
99Apache Impala
Apache Impala
1.3k+4 30d

Native analytic SQL engine for Apache Hadoop and open data formats with low-latency queries

Analytics·2012·Apache-2.0·C++, Java
100Apache Cloudberry

Advanced open-source MPP analytics database forked from Greenplum with a modern PostgreSQL kernel

Analytics·2023·Apache-2.0·C, C++
101openGemini

Cloud-native distributed time-series database by Huawei for IoT and observability at massive scale

Time-Series·2022·Apache-2.0·Go
102Apache Accumulo
Apache Accumulo
1.1k+5 30d

Sorted, distributed key-value store built on Hadoop with cell-level security

Wide-Column·2011·Apache-2.0·Java
103Apache Phoenix
Apache Phoenix
1.1k+4 30d

Massively parallel SQL engine on top of Apache HBase for low-latency OLTP queries

Relational·2014·Apache-2.0·Java
104MyScale
MyScale
1.0k+5 30d

SQL vector database built on ClickHouse for high-performance AI applications with filtered search

Vector·2023·Apache-2.0·C++
105Tigris
Tigris
971+2 30d

Open-source serverless NoSQL database and search platform built on FoundationDB

Document·2022·Apache-2.0·Go
106ArcadeDB
ArcadeDB
798+62 30d

Multi-model database supporting graphs, documents, key-value, vectors, time-series, and search in one engine

Multi-Model·2021·Apache-2.0·Java
107openGauss
openGauss
771+5 30d

Huawei's open-source PostgreSQL-derived enterprise RDBMS optimized for ARM and high concurrency

Relational·2020·Mulan PSL v2·C, C++
108NCache
NCache
6580 30d

Open-source distributed in-memory cache for .NET and Java with pub/sub messaging

Key-Value·2005·Apache-2.0·C#, .NET
109SiriDB
SiriDB
511+1 30d

Highly scalable and super fast open-source time-series database with dynamic grouping

Time-Series·2017·MIT·C
110Oracle Coherence

In-memory data grid with fault-tolerant caching, transactions, and event processing for enterprise Java applications

Key-Value·2001·UPL-1.0·Java
111Warp 10
Warp 10
412+1 30d

Advanced open-source time-series platform with native geo-temporal support and WarpScript analytics

Time-Series·2015·Apache-2.0·Java
112Fluree
Fluree
3750 30d

Immutable, ledger-backed semantic graph database with native RDF and JSON-LD support

Graph·2016·EPL-2.0·Clojure
113Gnocchi
Gnocchi
321+1 30d

Scalable time-series database with pre-computed aggregations for cloud metrics and resource indexing

Time-Series·2017·Apache-2.0·Python
114Percona Server for MongoDB

Enhanced open-source MongoDB drop-in replacement with enterprise-grade security and backup features

Document·2016·SSPL·C++, JavaScript
115OpenTenBase
OpenTenBase
208+11 30d

Enterprise-level distributed HTAP database based on PostgreSQL for hybrid transactional and analytical workloads

Relational·2019·BSD-3-Clause·C
116Firebolt
Firebolt
197+3 30d

Sub-second analytics cloud data warehouse built for high-concurrency, data-intensive applications

Analytics·2021·proprietary·C++
117Newts
Newts
1950 30d

Scalable time-series data store built on Apache Cassandra with late aggregation for network monitoring

Time-Series·2014·Apache-2.0·Java
118Splice Machine

Dual-engine HTAP database combining HBase transactions with Spark analytics and ANSI SQL

Relational·2014·AGPL-3.0·Java, Scala
119Greenplum
Greenplum
112+2 30d

Massively parallel processing analytics database built on PostgreSQL for large-scale data warehousing

Analytics·2005·proprietary·C, C++, Python
120Harper
Harper
69+2 30d

Distributed Node.js platform unifying database, cache, application, and messaging in one process

Document·2018·Apache-2.0·JavaScript, C++
121DolphinDB
DolphinDB
570 30d

High-performance time-series database with built-in analytics for finance and IoT

Time-Series·2018·Proprietary·C++
122NSDb
NSDb
540 30d

Distributed open-source time-series database with streaming orientation built on Scala and Akka

Time-Series·2017·Apache-2.0·Scala
123Riak KV
Riak KV
40+1 30d

Distributed NoSQL key-value database with masterless architecture for high availability and fault tolerance

Key-Value·2009·Apache-2.0·Erlang
124Oracle NoSQL Database

Distributed NoSQL database providing key-value, table, and document data models with ACID transactions

Key-Value·2011·Apache-2.0 (Community Edition)·Java
125VoltDB
VoltDB
18+1 30d

In-memory NewSQL database for sub-millisecond ACID transactions at massive scale

Relational·2010·AGPL-3.0·Java, C++
126SWC-DB
SWC-DB
160 30d

Super Wide Column Database designed for high-performance scalable storage at yottabyte scale

Wide-Column·2019·GPLv3·C++
127Exasol
Exasol
60 30d

High-performance in-memory MPP analytics database delivering up to 1000x faster analytical queries

Analytics·2000·Commercial·C++
128SequoiaDB
SequoiaDB
60 30d

Distributed multi-model NewSQL database for financial and enterprise applications in China

Multi-Model·2012·AGPL-3.0·C++
129JaguarDB
JaguarDB
30 30d

Distributed vector database with ZeroMove scaling for AI and similarity search workloads

Vector·2017·MIT·C++
1301010data

Cloud-based columnar analytics platform for massive-scale data discovery and ad hoc analysis

Analytics·2000·proprietary·K
131Actian NoSQL Database

Object-oriented database for complex data models with native language integration

Document·1988·proprietary·C, C++, Java
132Algolia

AI-powered search and discovery API delivering sub-millisecond results with typo tolerance and real-time indexing

Search·2012·proprietary·C++
133Alibaba Cloud AnalyticDB for MySQL

Cloud-native real-time data warehouse with MySQL compatibility for petabyte-scale analytics

Analytics·2017·proprietary·C++
134Alibaba Cloud AnalyticDB for PostgreSQL

MPP cloud data warehouse with PostgreSQL compatibility and vector search capabilities

Analytics·2016·proprietary·C, C++
135Alibaba Cloud Log Service

Cloud-native observability platform for PB-scale log collection, analysis, and visualization

Analytics·2016·proprietary
136Alibaba Cloud MaxCompute

Fully managed petabyte-scale data warehouse with serverless SQL, MapReduce, and graph computation

Analytics·2010·proprietary
137Alibaba Cloud Table Store

Serverless NoSQL wide-column and time-series storage with auto-scaling to 10 PB

Wide-Column·2016·proprietary
138AllegroGraph

Neuro-symbolic AI platform combining RDF knowledge graphs, vector store, and SPARQL in a transactional graph database

Graph·2004·Commercial (free edition available)·Common Lisp, C
139Amazon Aurora

MySQL and PostgreSQL-compatible relational database with up to 5x throughput and 99.999% availability

Relational·2014·proprietary
140Amazon CloudSearch

Managed search service with auto-scaling, faceted search, and support for 34 languages

Search·2012·proprietary
141Amazon DocumentDB

Fully managed MongoDB-compatible document database with fast performance and up to 10 global regions

Document·2019·proprietary
142Amazon DynamoDB

Serverless, fully managed NoSQL key-value and document database with single-digit millisecond performance at any scale

Key-Value·2012·proprietary
143Amazon Keyspaces

Serverless, fully managed Apache Cassandra-compatible database service on AWS

Wide-Column·2020·proprietary
144Amazon Neptune

Fully managed graph database service supporting Gremlin, openCypher, and SPARQL

Graph·2018·proprietary
145Amazon Redshift

Petabyte-scale cloud data warehouse with columnar storage and massively parallel processing

Analytics·2013·proprietary
146Amazon Timestream

Serverless time-series database for IoT and operational applications with built-in analytics

Time-Series·2020·proprietary
147AntDB

Chinese distributed HTAP database serving telecom-scale workloads with hyper-converged multi-engine architecture

Multi-Model·2014·Apache-2.0·C
148AnzoGraph DB

Massively parallel graph OLAP database for W3C standards-based analytics at scale

Graph·2018·proprietary·C++
149Axibase Time Series Database

Special-purpose time-series database for IT infrastructure, industrial equipment, and financial market data

Time-Series·2004·proprietary·Java
150Azure Cosmos DB

Globally distributed, multi-model database service with turnkey multi-region replication and single-digit millisecond latency

Multi-Model·2017·proprietary·C++, C#
151Microsoft Azure Data Explorer

Fast and scalable data analytics service for real-time analysis of streaming and time-series data using Kusto Query Language

Analytics·2019·proprietary
152Microsoft Azure Table Storage

Schemaless NoSQL key-value store for massive volumes of semi-structured data with strong consistency

Key-Value·2009·proprietary
153Cloud Firestore

Serverless, fully managed NoSQL document database with real-time sync and offline support for mobile and web apps

Document·2017·proprietary
154Cloudflare Workers KV

Global edge key-value store with low-latency reads across 330+ locations

Key-Value·2018·proprietary
155CloudKit

Apple's cloud database service for seamless data sync across all Apple platforms

Document·2014·proprietary
156Couchbase

Multi-model NoSQL database for enterprise applications with SQL++ support

Multi-Model·2011·BSL 1.1 / Apache-2.0 (Community)·C++, Go, Erlang, C
157DataStax Enterprise

Enterprise-grade distributed database built on Apache Cassandra with integrated analytics, search, and graph

Wide-Column·2010·Commercial·Java
158Dydra

Cloud-native RDF graph database with versioned SPARQL queries and streaming graph support

Graph·2010·proprietary·Ruby
159Galaxybase

High-performance native distributed graph database for HTAP workloads at trillion-edge scale

Graph·2018·proprietary·Java
160GBase

Chinese enterprise database platform with leading analytical and transactional database products

Analytics·2004·proprietary·C, C++
161VMware Tanzu GemFire

Enterprise-grade distributed in-memory data grid for sub-millisecond, low-latency applications

Key-Value·2002·proprietary·Java
162GemStone/S

Smalltalk-based object database for scalable, transactional multi-tier business applications

Multi-Model·1986·Proprietary·Smalltalk, C
163GigaSpaces XAP

In-memory computing platform for real-time analytics and extreme transaction processing

Key-Value·2000·Proprietary·Java
164Google BigQuery

Serverless, highly scalable multi-cloud data warehouse with built-in ML and real-time analytics

Analytics·2010·proprietary
165Google Cloud Bigtable

Fully managed, low-latency wide-column NoSQL database for massive analytical and operational workloads

Wide-Column·2015·Proprietary·C++
166Google Cloud Datastore

Highly scalable NoSQL document database with automatic sharding and ACID transactions

Document·2013·Proprietary
167Google Cloud Firestore

Serverless document database with real-time sync, offline support, and global scalability

Document·2017·Proprietary
168Google Cloud Spanner

Globally distributed, strongly consistent relational database with unlimited scale and 99.999% availability

Relational·2017·Proprietary·C++
169GraphBase

Enterprise graph database for knowledge graphs and Graph RAG with distributed cloud-native architecture

Graph·proprietary
170GridGain

In-memory computing platform built on Apache Ignite for real-time transactions and analytics

Multi-Model·2007·proprietary·Java, C++, C#
171HPE Ezmeral Data Fabric

Converged data platform with integrated NoSQL database, file system, and event streams for hybrid cloud

Multi-Model·2009·proprietary·C++, Java
172IBM Cloudant

Fully managed CouchDB-compatible JSON document database with global distribution and serverless scaling

Document·2010·proprietary·Erlang, JavaScript, C
173IBM Db2

Enterprise-grade relational database with AI-powered optimization and hybrid cloud deployment

Relational·1983·proprietary·C, C++, Assembly
174InfiniteGraph

Distributed graph database for large-scale relationship analytics and deep link analysis

Graph·2010·Commercial·Java, C++
175Kinetica

GPU-accelerated real-time analytics database for spatial, temporal, graph, and AI workloads at scale

Analytics·2016·proprietary·C++
176Kyligence Enterprise

AI-augmented OLAP analytics platform delivering sub-second queries on petabyte-scale data, built on Apache Kylin

Analytics·2016·proprietary·Java
177LeanXcale

Ultra-scalable distributed SQL database with full ACID compliance and NoSQL-speed ingestion

Relational·2015·proprietary·Java
178MarkLogic

Enterprise multi-model database combining documents, graph, and search with government-grade security

Multi-Model·2001·proprietary·C++
179Microsoft Azure AI Search

Enterprise cloud search service with vector search, semantic ranking, and AI-powered agentic retrieval

Search·2014·proprietary
180Microsoft Azure Cosmos DB

Globally distributed, multi-model NoSQL database with guaranteed single-digit millisecond latency

Multi-Model·2017·proprietary
181Microsoft Azure SQL Database

Fully managed cloud relational database built on the Microsoft SQL Server engine with intelligent performance

Relational·2010·proprietary
182Microsoft Azure Synapse Analytics

Integrated analytics platform combining data warehousing, big data, and data integration with serverless and provisioned options

Analytics·2019·proprietary
183Netezza

Purpose-built analytics appliance for high-performance data warehousing and advanced analytics

Analytics·2002·Proprietary·C, C++
184NonStop SQL

Fault-tolerant relational database for mission-critical OLTP on HPE NonStop systems

Relational·1986·proprietary·C, C++
185NuoDB

Cloud-native distributed SQL database with ACID compliance and elastic scale-out

Relational·2010·proprietary·C++
186Ontotext GraphDB

Enterprise RDF triplestore with real-time semantic inferencing at billion-statement scale

Graph·2000·proprietary·Java
187Oracle Database

Enterprise-grade multi-model database with AI-native capabilities

Relational·1979·Oracle Commercial License·C, C++
188OushuDB

Cloud-native MPP data warehouse built on Apache HAWQ for petabyte-scale interactive analytics

Analytics·2016·proprietary·C, C++
189PieCloudDB

Cloud-native virtual data warehouse with elastic massive parallel processing (eMPP) architecture

Analytics·2022·proprietary·C++
190Pinecone

Fully managed vector database built for high-performance AI applications at scale

Vector·2021·proprietary
191PlanetScale

Serverless MySQL-compatible cloud database platform powered by Vitess with branching and deploy requests

Relational·2021·proprietary·Go
192PolarDB for PostgreSQL

Alibaba's cloud-native PostgreSQL with shared-storage architecture and elastic scaling

Relational·2021·Apache-2.0·C
193QuasarDB

High-performance distributed column-oriented time-series database with native transactional support

Time-Series·2009·proprietary·C++
194Rasdaman

Array database management system for massive multidimensional raster data and datacubes

Multi-Model·1996·GPL-3.0·C++, Java
195SAP HANA

In-memory relational database for real-time analytics and transactional processing in enterprise environments

Relational·2010·proprietary·C++
196ScaleOut StateServer

In-memory data grid with distributed caching, parallel query, and high availability for .NET and Java

Key-Value·2005·Commercial·C++, C#
197SciDB

Array database for multidimensional data management and complex analytics in scientific computing

Analytics·2010·AGPL-3.0·C++
198SingleStore

Distributed SQL database for data-intensive applications combining transactions, analytics, and AI workloads

Relational·2013·SingleStore Commercial License·C++
199Snowflake

Cloud-native data warehouse with automatic scaling, separation of storage and compute, and near-zero administration

Analytics·2014·proprietary
200SpaceTime

Spatiotemporal relational database optimized for analytical workloads on moving objects with JIT compilation

Analytics·2020·proprietary·C++
201Stardog

Enterprise knowledge graph platform built on RDF standards for data unification and AI-powered insights

Graph·2012·proprietary·Java
202TDSQL for MySQL

Tencent's distributed MySQL-compatible database with strong consistency and horizontal scaling

Relational·2012·proprietary·C, C++
203Teradata

Enterprise-grade parallel data warehouse for large-scale analytics and business intelligence

Analytics·1984·proprietary·C, C++
204TigerGraph

High-performance native graph analytics platform for AI, fraud detection, and real-time insights on connected data

Graph·2017·proprietary·C++
205Transwarp ArgoDB

Distributed analytical database replacing Hadoop+MPP with unified SQL analytics and real-time data processing

Analytics·proprietary
206Transwarp Hippo

Enterprise cloud-native distributed vector database with GPU acceleration and multi-model support

Vector·proprietary
207Transwarp KunDB

Financial-grade distributed relational database with strong consistency and Oracle/MySQL compatibility

Relational·proprietary
208Transwarp StellarDB

Enterprise distributed graph database with native graph storage and deep link analysis at PB scale

Graph·proprietary
209turbopuffer

Serverless vector and full-text search database built on object storage for low-cost high-scale workloads

Vector·2024·Commercial·Rust
210Ultipa

Ultra-high-performance 4th-generation graph database with deep traversal and GQL compliance

Graph·2019·proprietary·C++
211Vertica

High-performance columnar analytics database for petabyte-scale real-time analytics and machine learning

Analytics·2005·Proprietary·C++
212XtremeData

Cloud-scale parallel SQL analytics engine with vectorized execution for complex analytical workloads

Analytics·2005·proprietary·C++
213Yellowbrick

Modern enterprise cloud data warehouse built on Kubernetes for extreme speed and concurrency

Analytics·2014·proprietary·C++

What is a Distributed Database?

A distributed database spreads data across multiple machines (nodes) that work together as a single logical system. Data is partitioned (sharded) and replicated across nodes to provide horizontal scalability, fault tolerance, and geographic distribution. When one node fails, others continue serving requests. Distributed databases solve the fundamental limitation of single-server databases: they scale beyond what one machine can handle. They span multiple data models — distributed SQL (CockroachDB, TiDB, YugabyteDB), distributed NoSQL (Cassandra, ScyllaDB), distributed key-value (etcd, FoundationDB), and distributed document stores (MongoDB with sharding).

When to Use a Distributed Database

Use a distributed database when a single server can't meet your requirements for storage capacity, write throughput, read scalability, or availability. Specific scenarios include: applications serving users across multiple regions with low-latency requirements, workloads needing 99.999% uptime, datasets too large for a single machine, and write-heavy systems that exceed single-node throughput. Distributed SQL databases (CockroachDB, TiDB) offer familiar SQL with automatic sharding. Distributed NoSQL (Cassandra) offers extreme write scalability. Consider single-node databases when your data fits on one machine — they're simpler to operate, debug, and reason about.

Frequently Asked Questions

What is the difference between a distributed database and a regular database?
A regular (centralized) database runs on a single server — all data lives on one machine. A distributed database runs across multiple servers, with data partitioned and replicated across them. Distributed databases offer horizontal scalability (add more nodes to handle more load), fault tolerance (survive node failures), and geographic distribution (serve users from nearby nodes). The tradeoff is operational complexity — distributed systems are harder to deploy, monitor, debug, and reason about than single-node databases.
What is the CAP theorem?
The CAP theorem states that a distributed system can provide at most two of three guarantees simultaneously: Consistency (every read returns the latest write), Availability (every request gets a response), and Partition tolerance (the system works despite network failures). Since network partitions are inevitable, the real choice is between consistency and availability during failures. CockroachDB and YugabyteDB choose consistency (CP). Cassandra and DynamoDB choose availability (AP). In practice, modern databases offer tunable consistency — you can adjust the tradeoff per query.
What is the difference between CockroachDB and TiDB?
Both are distributed SQL databases but with different compatibility targets. CockroachDB is PostgreSQL wire-compatible — existing Postgres drivers and many Postgres queries work unchanged. TiDB is MySQL wire-compatible — existing MySQL applications can migrate with minimal changes. CockroachDB uses a Raft-based consensus protocol and is written in Go. TiDB separates compute (TiDB) from storage (TiKV, built on RocksDB). Both support horizontal scaling and ACID transactions across distributed nodes.
Is MongoDB a distributed database?
MongoDB supports distribution through built-in sharding — you can partition data across multiple servers based on a shard key. However, MongoDB is not distributed by default. A single-node MongoDB deployment is a standard centralized database. You enable distribution by configuring a sharded cluster with config servers, shard nodes, and mongos routers. Purpose-built distributed databases like CockroachDB, TiDB, and Cassandra are distributed from the ground up and handle sharding automatically without manual configuration.
Do I need a distributed database?
Most applications don't. A well-tuned single-node PostgreSQL or MySQL instance handles millions of rows and thousands of concurrent connections. You need a distributed database when: your dataset exceeds single-server storage, your write throughput exceeds single-server capacity, you need multi-region low-latency access, or you require 99.999%+ uptime. If your data fits on one machine and you're not serving globally, a single-node database with read replicas is simpler and often faster than a distributed system.

Manage Distributed Databases Visually

1bench is a modern GUI client that supports all major distributed databases and many more.

Get Started