Best Self-hosted Data Integration & ETL/ELT tools in 2026
18 self-hosted open-source alternatives in this category
See also: Business Intelligence & Dashboards, NoSQL Databases, Relational Databases, Search & Indexing Engines, Time-Series Databases, Web & Product Analytics

Huginn
Open-source platform for self-hosted automation agents
Huginn is an open-source automation platform that runs agents to monitor web data, process events, and trigger actions — self-hosted and extensible.


Medusa
Open-source, headless ecommerce backend built for customization.
Open-source, API-first commerce backend with modular architecture for custom storefronts and marketplaces.


Hasura GraphQL Engine
Open-source GraphQL engine providing instant, realtime APIs on your data
Hasura is an open-source GraphQL engine that instantly exposes realtime, secure GraphQL APIs over databases and other data sources with fine-grained access control.


Kestra
Open-source, event-driven workflow orchestration and scheduling platform
Declarative, API-first orchestration platform for scheduled and event-driven workflows with a plugin ecosystem, UI editor, CI/CD and Terraform integration.

Node-RED
Flow-based low-code tool for building event-driven automations
Open-source, browser-based low-code platform and Node.js runtime for wiring devices, APIs and services into event-driven flows for automation, IoT and integrations.


Vector
High-performance observability data pipeline written in Rust
Open-source observability pipeline to collect, transform, and route logs and metrics with a single, high-performance binary and programmable transforms.

Apprise
Unified notifications library for 120+ services via a single API.
A Python-based notification library and CLI that routes messages to 120+ services via URL-based configurations, enabling self-hosted cross-platform alerts.


Quickwit
Cloud-native, sub-second search on cloud storage for logs and traces.
Open-source cloud-native search engine for observability data on object storage with an Elasticsearch/OpenSearch-compatible API.

Countly
Privacy-first product analytics and customer engagement platform
Open-source product analytics platform with SDKs for mobile, web and desktop; provides dashboards, events, crash reporting, messaging, A/B testing and APIs.

Livebook
Interactive Elixir notebooks for code, data and automation
Web-based interactive and collaborative notebooks for Elixir with data visualizations, integrations, reproducible workflows, and automation.

Nominatim
Open-source geocoding and reverse-geocoding using OpenStreetMap data
Nominatim provides geocoding (name/address → coordinates) and reverse geocoding (coordinates → address) powered by OpenStreetMap, with import tooling and a public API.
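As a rough illustration of the two directions described above, the sketch below builds request URLs for Nominatim's `/search` and `/reverse` endpoints. The base URL `http://localhost:8080` and the helper names `search_url` and `reverse_url` are assumptions made for this example, not part of Nominatim; a real deployment will have its own host and port. No request is sent; the code only shows how the query strings are formed.

```python
from urllib.parse import urlencode

# Assumed address of a self-hosted Nominatim instance (hypothetical).
BASE_URL = "http://localhost:8080"


def search_url(query: str, limit: int = 1) -> str:
    """Build a forward-geocoding URL (name/address -> coordinates)."""
    params = {"q": query, "format": "jsonv2", "limit": limit}
    return f"{BASE_URL}/search?{urlencode(params)}"


def reverse_url(lat: float, lon: float) -> str:
    """Build a reverse-geocoding URL (coordinates -> address)."""
    params = {"lat": lat, "lon": lon, "format": "jsonv2"}
    return f"{BASE_URL}/reverse?{urlencode(params)}"


if __name__ == "__main__":
    print(search_url("Brandenburg Gate, Berlin"))
    print(reverse_url(52.5163, 13.3777))
```

Fetching either URL with any HTTP client should return JSON from the instance, assuming it is running and has OpenStreetMap data imported.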

OpenTripPlanner
Multimodal trip planning and transit routing server
OpenTripPlanner (OTP) is an open-source multimodal routing engine that builds networks from GTFS and OpenStreetMap to produce itineraries and real-time transit trip plans...

Open Archiver
Open-source platform for legally compliant email archiving
Self-hosted email archiving platform for ingesting, storing, indexing and searching emails from Gmail, Microsoft 365, IMAP, PST and more.


Panora
AI-powered back-office automation for purchase order entry
Automates purchase order ingestion, validation, and ERP posting for distributors, manufacturers and wholesalers using AI-driven item matching and configurable workflows.

CrossWatch
Synchronize media metadata between servers and trackers
CrossWatch synchronizes watchlists, history, ratings and live scrobbles between Plex, Jellyfin, Emby and trackers like Trakt, SIMKL, AniList and MDBlist.

Wavelog
Web-based amateur radio logging and QSO management system
Self-hosted PHP web application for logging amateur radio contacts with mapping, analytics, awards tracking and integration with common QSO services.

Mere Medical
Self-hosted, offline-first personal health record aggregator
Self-hosted personal health record (PHR) that aggregates and syncs medical records from multiple patient portals into a local, privacy-first web app.

Apache Flink
Distributed stream and batch data processing engine
Apache Flink is a distributed engine for stateful stream processing and batch analytics with event-time semantics, fault tolerance, and scalable deployment on clusters.