Self-hosted projects tagged “Data Ingestion

24 open source projects with this tag

24 services found

Huginn

Huginn

Open-source platform for self-hosted automation agents

48.8k
4.2k
Last commit: 5d ago

Huginn is an open-source automation platform that runs agents to monitor web data, process events, and trigger actions — self-hosted and extensible.

Alternative to:
IFTTT
IFTTT
+17
Kestra

Kestra

Open-source, event-driven workflow orchestration and scheduling platform

26.4k
2.5k
Last commit: 9h ago

Declarative, API-first orchestration platform for scheduled and event-driven workflows with a plugin ecosystem, UI editor, CI/CD and Terraform integration.

Alternative to:
Dagster Cloud
Dagster Cloud
+16
Vector

Vector

High-performance observability data pipeline written in Rust

21.4k
2k
Last commit: 8h ago

Open-source observability pipeline to collect, transform, and route logs and metrics with a single, high-performance binary and programmable transforms.

Alternative to:
Elastic Logstash
Elastic Logstash
+13
ZincSearch

ZincSearch

A lightweight open-source search engine for full-text indexing.

17.7k
770
Last commit: 1mo ago

ZincSearch is a Go-based, lightweight search engine for full-text indexing with Elasticsearch API-compatible ingestion, a Vue UI, and a schema-less document model.

Alternative to:
Elastic Cloud (Elasticsearch Service)
Elastic Cloud (Elasticsearch Service)
+7
Graylog

Graylog

Centralized log management and analysis platform

8k
1.1k
Last commit: 8h ago

Graylog is an open source platform for collecting, indexing, searching, and alerting on logs and machine data from many sources in one place.

Alternative to:
Graylog Cloud
Graylog Cloud
+11
TeslaMate

TeslaMate

Self-hosted data logger and analytics for Tesla vehicles.

7.7k
904
Last commit: 4d ago

Open-source Tesla telemetry logger that records driving, charging and location data to PostgreSQL and provides Grafana dashboards plus MQTT integration.

Alternative to:
TeslaFi
TeslaFi
+2
Open Source Routing Machine (OSRM)

Open Source Routing Machine (OSRM)

High-performance routing engine for OpenStreetMap data

7.5k
3.9k
Last commit: 4d ago

OSRM is a high-performance routing engine for OpenStreetMap data, providing an HTTP API for routing, map matching, distance tables, and more.

Alternative to:
Mapbox
Mapbox
+1
Scrutiny

Scrutiny

Web UI for SMART monitoring of hard drives

7.4k
255
Last commit: 3d ago

Self-hosted S.M.A.R.T monitoring dashboard that collects SMART data, visualizes historical trends, and alerts on drive health.

Alternative to:
Hard Disk Sentinel
Hard Disk Sentinel
+11
Nominatim

Nominatim

Open-source geocoding and reverse-geocoding using OpenStreetMap data

4.1k
808
Last commit: 3d ago

Nominatim provides geocoding (name/address → coordinates) and reverse geocoding (coordinates → address) powered by OpenStreetMap, with import tooling and a public API.

Alternative to:
Mapbox
Mapbox
+1
Chartbrew

Chartbrew

Open-source self-hosted data visualization dashboards

3.7k
412
Last commit: 3d ago

Chartbrew is an open-source platform to build live dashboards and reports by connecting SQL/NoSQL databases and REST APIs.

Alternative to:
Looker
Looker
+14
OpenTripPlanner

OpenTripPlanner

Multimodal trip planning and transit routing server

2.6k
1.1k
Last commit: 17h ago

OpenTripPlanner (OTP) is an open source multimodal routing engine that builds networks from GTFS and OpenStreetMap to produce itineraries and real-time transit trip plans...

Alternative to:
ArcGIS Online
ArcGIS Online
+2
Aleph

Aleph

Document and data indexing, entity search, and investigative analysis

2.3k
332
Last commit: 2mo ago

Aleph indexes documents and structured datasets to enable fast search, entity extraction, and cross-referencing for investigative research and OSINT workflows.

Alternative to:
Glean
Glean
+6
Parseable

Parseable

An observability platform for predictive insights across MELT telemetry.

2.3k
159
Last commit: 9h ago

Parseable ingests, analyzes, and extracts insights from MELT telemetry data with predictive analytics and a unified SQL/NL querying interface.

Alternative to:
Datadog
Datadog
+15
Open Archiver

Open Archiver

Open-source platform for legally compliant email archiving

1.7k
79
Last commit: 1d ago

Self-hosted email archiving platform for ingesting, storing, indexing and searching emails from Gmail, Microsoft 365, IMAP, PST and more.

Alternative to:
MailStore Server
MailStore Server
+4
Emoncms

Emoncms

Energy and environmental time-series logging and visualization

1.3k
534
Last commit: 1mo ago

Open-source web app to collect, process, store, and visualize energy, temperature, and other environmental time-series data with dashboards, graphs, and an API.

Alternative to:
InfluxDB Cloud
InfluxDB Cloud
+14
Panora

Panora

AI-powered back-office automation for purchase order entry

1k
201
Last commit: 4mo ago

Automates purchase order ingestion, validation, and ERP posting for distributors, manufacturers and wholesalers using AI-driven item matching and configurable workflows.

Alternative to:
Microsoft Power Automate
Microsoft Power Automate
+8
Fitbit Fetch Script and InfluxDB Grafana Integration

Fitbit Fetch Script and InfluxDB Grafana Integration

Fetch Fitbit API data into InfluxDB and visualize it with Grafana

828
66
Last commit: 3d ago

Python service that pulls Fitbit health metrics via the Fitbit Web API, stores them in InfluxDB, and provides Grafana dashboards for long-term trend visualization.

Alternative to:
Grafana Cloud
Grafana Cloud
+9
Riven

Riven

VFS-based automated media management and streaming platform

744
97
Last commit: 1mo ago

Open-source media management system that exposes a FUSE-based virtual filesystem, automates discovery/scraping/downloading, and integrates with Plex/Jellyfin/Emby.

Alternative to:
Plex
Plex
+3
Wishlist

Wishlist

Sharable, self-hosted wishlist application for friends and family

485
36
Last commit: 1d ago

Self-hosted SvelteKit wishlist app that scrapes product metadata, supports groups, registry mode, PWA, OpenID Connect, and Docker deployment.

Alternative to:
Wishlistr
Wishlistr
Minne

Minne

Minne: a graph-powered read-it-later and personal knowledge base.

213
8
Last commit: 10d ago

Self-hosted graph-powered personal knowledge base with AI search, chat, and multi-format ingestion.

Alternative to:
Readwise Reader
Readwise Reader
+12
Hyrax

Hyrax

Ruby on Rails engine for building digital repository applications

194
133
Last commit: 2d ago

Open-source repository engine from the Samvera community for building institutional digital repositories with flexible metadata, workflows, and search integration.

Alternative to:
DSPACE Direct
DSPACE Direct
+19
Mantium

Mantium

Self-hosted manga tracker aggregating metadata from multiple sources.

127
6
Last commit: 22h ago

Mantium is a self-hosted manga tracker that collects manga metadata (not images) from multiple sources and provides a dashboard and iFrame for embedding.

Alternative to:
AniList
AniList
+7
Mistborn

Mistborn

Multi-source threat intelligence and IOC aggregation platform

Mistborn aggregates threat intelligence from multiple sources to enrich, normalize, and distribute IOCs for security analysis and incident response workflows.

Alternative to:
VirusTotal
VirusTotal
+2
Spotizerr

Spotizerr

Self-hosted Spotify/Deezer music download manager

Self-hosted music download manager that fetches Spotify content and falls back to Deezer for lossless sources; FastAPI backend, Celery tasks and Redis caching.