Best Self-Hosted Alternatives to Databricks

A curated collection of the two best self-hosted alternatives to Databricks.

Databricks provides a unified data and AI lakehouse platform, built on Apache Spark, for data engineering, analytics, collaborative notebooks, and end-to-end machine learning. It supports scalable ETL, BI, model training, and deployment across cloud providers.

Alternatives List

#1
Livebook

Livebook is a web-based notebook for running Elixir code, building data workflows, and deploying notebooks as interactive apps, with real-time collaboration.

Livebook is a web application for authoring and running interactive notebooks backed by the Elixir runtime. It combines executable code cells with rich text, visuals, and integrations to support data exploration, automation, and reproducible computational documents. Notebooks can be shared, collaborated on in real time, and turned into interactive “apps” for non-developers.

Key Features

  • Interactive notebooks with Elixir code cells, Markdown, and rich outputs (tables, charts, etc.)
  • Real-time multi-user collaboration with presence and synchronized editing
  • Runtime management: connect local, remote, and clustered runtimes; attach to running nodes
  • Package/dependency management inside notebooks (Mix-based) and reusable “Smart cells”
  • Integrations for data work (e.g., Kino-based visual components; common I/O and visualization patterns)
  • Secrets management and environment-based configuration for notebooks/apps
  • Export and sharing options: notebook files (Live Markdown, .livemd), shareable links, and runnable artifacts, depending on setup
  • Deploy notebooks as Livebook Apps with interactive inputs and controlled execution

Use Cases

  • Data exploration, ETL prototyping, and reproducible analysis in Elixir
  • Team runbooks and operational notebooks for debugging/observability workflows
  • Internal tools: ship a notebook as an interactive app for stakeholders

Limitations and Considerations

  • Primarily optimized for the Elixir ecosystem; other languages require external bridging
  • Some features (collaboration/deployments) may depend on specific runtime/network configuration

Livebook is well-suited for teams that want notebooks with strong concurrency and distribution primitives from BEAM/Elixir. It bridges documentation and execution, making it practical both for exploratory work and for operationalizing notebooks as interactive applications.

5.7k stars
483 forks

#2
CloudBeaver

Self-hosted web UI for SQL database administration and querying with role-based access, connection management, and a browser SQL editor based on DBeaver.

CloudBeaver is a web-based database manager from the DBeaver project designed for teams to access, browse, and query databases from a browser. It provides a centralized way to manage database connections and permissions while using a familiar DBeaver-style SQL editor and object navigator.

Key Features

  • Browser-based SQL editor with results grid, filtering, and basic data editing
  • Database object navigator to browse schemas, tables, views, columns, indexes, etc.
  • Centralized connection management (create/manage shared connections)
  • Role-based access control for users/teams and per-connection permissions
  • Authentication options for team deployments (available methods differ between the community and enterprise editions)
  • Multi-database support via DBeaver drivers/extensions (JDBC-based)
  • Admin console for server configuration and workspace management
  • Deployment via Docker for quick setup and upgrades

Use Cases

  • Provide a shared, browser-accessible SQL client for operations and analysts
  • Centralize database connection credentials and control who can access what
  • Lightweight alternative to installing desktop SQL clients across a team

Limitations and Considerations

  • Some advanced DBeaver desktop features (deep ER modeling, extensive plugins) may not be available in the web UI
  • Exact auth/SSO and enterprise management features depend on edition and configuration

CloudBeaver fits teams that want a web database client with centralized governance and broad JDBC connectivity. It is especially useful when you need controlled access to databases without requiring local tooling on every workstation.

4.5k stars
504 forks

Why choose an open-source alternative?

  • Data ownership: Keep your data on your own servers
  • No vendor lock-in: Freedom to switch or modify at any time
  • Cost savings: Reduce or eliminate subscription fees
  • Transparency: Audit the code and know exactly what's running