
Khoj
Self-hostable personal AI 'second brain' for chat, semantic search, custom agents, automations and integration with local or cloud LLMs.

Khoj is an open-source personal AI platform that combines chat, semantic document search, custom agents and scheduled automations. It can run locally or as a cloud-hosted service and integrates with local or remote LLMs to answer questions, generate content and automate research.
Key Features
- Multi-client access: web, desktop, Obsidian, Emacs, mobile (PWA) and chat integrations (e.g., WhatsApp).
- Model-agnostic LLM support: connect local GGUF models or remote OpenAI-compatible, Anthropic and Google-compatible endpoints; supports on-device and cloud models.
- Semantic search and embeddings: document ingestion (PDF, Markdown, Word, org-mode, Notion, images) with vector storage and retrieval for fast, contextual search.
- Custom agents and automations: build agents with distinct personas, tools and knowledge bases; schedule research tasks and email newsletters.
- Document processing and code tools: built-in extractors, simple code execution sandbox support (local Terrarium or remote sandboxes) and image generation features.
- Enterprise & self-hosting options: deploy via Docker or pip, use Postgres with pgvector for embeddings, and configure authentication and domains.
Use Cases
- Personal knowledge management: query a private document corpus and get grounded answers across notes, PDFs and files.
- Research automation: schedule recurring research queries and receive summarized results by email.
- Team/private deployments: host a private assistant for a team with custom agents, model selection and on-premise data control.
Limitations and Considerations
- Some optional integrations require extra setup or external services (e.g., code sandboxes, email providers); self-hosting needs correct environment configuration.
- A few plugins/integrations may be unmaintained or platform-specific; users should check the chosen connectors and follow the docs for compatibility and maintenance status.
Khoj is designed to be extensible and model-agnostic, emphasizing private data control and flexible deployment. It is suited for individuals and teams who need a searchable, automatable assistant that can run with local or cloud language models.
