Apache Druid

High-performance real-time analytics database for OLAP workloads

13.9k stars
3.8k forks
Last commit: 1d ago
Repo age: 14 years
Apache Druid is a high-performance real-time analytics database designed for interactive OLAP queries on large, high-cardinality datasets. It supports both streaming and batch ingestion and is optimized for low-latency queries under high concurrency.

Key Features

  • Sub-second interactive query engine optimized for high-dimensional, high-cardinality data
  • Native streaming ingestion designed for query-on-arrival use cases
  • Columnar storage with time indexing, dictionary encoding, bitmap indexes, and compression
  • SQL API plus native query APIs over HTTP, including JDBC connectivity
  • Built-in web console for ingestion setup, query exploration, and cluster visibility
  • Elastic, loosely coupled architecture separating ingestion, query, and coordination services
  • Tiering and quality-of-service controls to prioritize mixed workloads
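The dictionary encoding and bitmap indexes listed above can be illustrated with a small toy model. This is a sketch of the general technique only, not Druid's actual segment format: the column names and values are made up, and plain Python integers stand in for the compressed bitmaps a real engine would use.

```python
from collections import defaultdict

def dictionary_encode(values):
    """Map each distinct string to a small integer ID, in first-seen order."""
    dictionary = {}
    encoded = []
    for v in values:
        if v not in dictionary:
            dictionary[v] = len(dictionary)
        encoded.append(dictionary[v])
    return dictionary, encoded

def build_bitmaps(encoded):
    """For each ID, build an integer bitmap whose bit i is set if row i holds that ID."""
    bitmaps = defaultdict(int)
    for row, value_id in enumerate(encoded):
        bitmaps[value_id] |= 1 << row
    return dict(bitmaps)

def rows_matching(bitmap):
    """Decode a bitmap back into the list of row numbers it selects."""
    return [i for i in range(bitmap.bit_length()) if (bitmap >> i) & 1]

# Hypothetical example columns, one entry per row.
country = ["US", "DE", "US", "FR", "DE", "US"]
device = ["ios", "ios", "web", "web", "ios", "ios"]

country_dict, country_ids = dictionary_encode(country)
device_dict, device_ids = dictionary_encode(device)
country_bm = build_bitmaps(country_ids)
device_bm = build_bitmaps(device_ids)

# A filter like country = 'US' AND device = 'ios' becomes a bitwise AND of two bitmaps.
us_ios = country_bm[country_dict["US"]] & device_bm[device_dict["ios"]]
matching_rows = rows_matching(us_ios)
```

Because the filter is resolved on the bitmaps alone, only the matching rows need to be read from the other columns, which is one reason this layout suits filtered aggregations on high-cardinality dimensions.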

Use Cases

  • Powering real-time analytics dashboards and embedded analytics in user-facing applications
  • Ad-hoc operational analytics on event, clickstream, and observability-style data
  • High-concurrency OLAP analytics on time-series and event data from streaming platforms
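As a rough sketch of how a dashboard panel might query Druid, the example below builds a request for the SQL-over-HTTP endpoint (POST to /druid/v2/sql with a JSON body containing a "query" field). The datasource name clickstream, the page column, and the localhost:8888 address are assumptions for illustration; the request is constructed but not sent, so the snippet runs without a cluster.

```python
import json
import urllib.request

def build_druid_sql_request(base_url, sql):
    """Build (but do not send) a POST request for Druid's SQL-over-HTTP endpoint."""
    body = json.dumps({"query": sql}).encode("utf-8")
    return urllib.request.Request(
        url=base_url.rstrip("/") + "/druid/v2/sql",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )

# Hypothetical datasource and column names; __time is Druid's timestamp column.
sql = """
SELECT TIME_FLOOR(__time, 'PT1M') AS "minute", page, COUNT(*) AS views
FROM clickstream
WHERE __time >= CURRENT_TIMESTAMP - INTERVAL '1' HOUR
GROUP BY 1, 2
ORDER BY views DESC
LIMIT 10
"""

# Assumed address of a local Druid router; adjust for a real deployment.
request = build_druid_sql_request("http://localhost:8888", sql)

# To run it against a live cluster:
# with urllib.request.urlopen(request) as resp:
#     rows = json.load(resp)
```

Bucketing by minute with TIME_FLOOR and restricting the query to a recent time window is a common shape for real-time dashboard queries, since the time filter limits how much data has to be scanned.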

Limitations and Considerations

  • Operates as a distributed system with multiple service types, which can increase operational complexity compared to single-node databases
  • Designed primarily for analytics workloads; it is not a general-purpose OLTP database

Apache Druid is well-suited for organizations that need fast, consistent analytical queries on continuously arriving data. Its storage format and distributed architecture make it effective for high-scale, high-concurrency real-time analytics applications.

Similar Services

Netdata

Real-time, per‑second full‑stack monitoring and observability

77.4k stars
6.3k forks
Last commit: 16h ago

Open-source, agent-based monitoring platform delivering per-second metrics, edge ML anomaly detection, tiered time-series storage and centralized cloud UI.

Alternative to: Datadog Cloud Monitoring, +16 more

Prometheus

Monitoring system and time-series database for metrics and alerting

62.2k stars
10.1k forks
Last commit: 1d ago

Prometheus is an open-source monitoring and time-series database for collecting metrics, querying with PromQL, and alerting on system and application health.

Alternative to: Datadog Cloud Monitoring, +17 more

ClickHouse

Fast open-source columnar database for real-time analytics.

45.2k stars
8k forks
Last commit: 18h ago

Open-source OLAP database designed for real-time analytics at scale.

Alternative to: ClickHouse Cloud, +11 more

InfluxDB

Time-series database for metrics, events, and real-time analytics

31.1k stars
3.7k forks
Last commit: 2d ago

InfluxDB is an open source time-series database for high-ingest metrics and event data, enabling fast queries for monitoring, dashboards, and real-time analytics.

Alternative to: InfluxDB Cloud, +5 more

VictoriaMetrics

High-performance time series database for monitoring and observability

16k stars
1.5k forks
Last commit: 21h ago

Fast, resource-efficient time series database compatible with Prometheus and Grafana, for scalable monitoring and long-term metrics storage.

Alternative to: Grafana Cloud, +9 more

Speedtest Tracker

Self-hosted internet performance and uptime monitoring tool

5.1k stars
199 forks
Last commit: 4d ago

Laravel-based app that schedules Ookla Speedtest checks, records download/upload/ping/packet-loss metrics, visualizes historical performance, and sends alerts.

Alternative to: Speedtest by Ookla, +3 more