Self-hosted projects tagged “RAG”
16 open source projects with this tag

Ollama
Run and manage large language models locally with an API
Ollama is a local LLM runtime that lets you pull, run, and customize models, offering a CLI and REST API for chat, generation, and embeddings.

AnythingLLM
All-in-one AI chat app with RAG, agents, and multi-model support
AnythingLLM is an all-in-one desktop and Docker app for chatting with documents using RAG, running AI agents, and connecting to local or hosted LLMs and vector databases.

LocalAI
OpenAI-compatible local AI inference server and API
Run LLMs, image models, and audio models locally with an OpenAI-compatible API, optional GPU acceleration, and a built-in web UI for managing and testing models.

LibreChat
Self-hosted multi-provider AI chat UI with agents and tools
LibreChat is a self-hosted AI chat platform that supports multiple LLM providers, custom endpoints, agents/tools, file and image chat, conversation search, and presets.

Perplexica
Privacy-focused AI answering engine with web search and citations
Self-hosted AI answering engine that combines web search with local or hosted LLMs to generate cited answers, with search history and file uploads.

Langfuse
Open-source platform for LLM observability, evals, and prompt management
Langfuse is an open-source LLM engineering platform for tracing, metrics, evaluations, datasets, and prompt management to debug and improve AI applications.

Opik
LLM observability and evaluation platform for traces, tests, and dashboards
Opik is an open-source platform to trace, evaluate, and monitor LLM apps, RAG pipelines, and agent workflows with automated evaluations and production dashboards.

Onyx Community Edition
Self-hosted AI chat and enterprise search for any LLM
Open-source platform for AI chat, RAG, agents, and enterprise search across your team’s connected knowledge sources, compatible with hosted and local LLMs.

Blinko
Privacy-focused AI note-taking app with RAG-powered search
Open-source, self-hosted AI note-taking app for fast capture and organization, with Markdown notes and RAG-based natural language search.

Paperless-AI
AI extension for Paperless‑ngx providing automated analysis and RAG
Extension for Paperless‑ngx that uses OpenAI-compatible backends and Ollama to auto-classify, tag, index, and enable RAG-powered document chat and semantic search.

Agenta
Open-source LLMOps platform for prompts, evals, and observability
Agenta is an open-source LLMOps platform with a prompt playground, prompt/version management, LLM evaluation, and production observability for LLM apps.

Basic Memory
Local-first persistent memory for AI chats using Markdown notes
Basic Memory gives AI assistants durable, local-first memory by reading and writing structured Markdown notes, enabling reusable context across conversations and tools.

Scriberr
Offline AI audio and video transcription with transcript chat
Scriberr is a self-hosted, privacy-focused AI transcription app for audio and video, with speaker diarization, word-level timestamps, summaries, and transcript chat.

SecureAI Tools
Self-hosted private AI chat and document Q&A with local inference
Self-hosted private AI tools for chat and document Q&A, supporting local Ollama inference or OpenAI-compatible APIs, with built-in authentication and user management.

StreamyStats
Analytics and AI recommendations for Jellyfin libraries
StreamyStats is a Jellyfin statistics and analytics service with dashboards, watch history insights, and optional AI chat and recommendations powered by embeddings.

Tiledesk
Open-source platform for AI chatbots and multichannel live chat
Open-source conversational platform to build AI chatbots, multichannel live chat, and human-in-the-loop customer support with knowledge base and RAG capabilities.