Willow
Open-source, privacy-focused voice assistant platform
Willow is an open-source, privacy-focused voice assistant platform designed for low-cost ESP32-S3 hardware. It provides fast on-device wake-word and command recognition and can optionally integrate with a self-hosted inference server for high-quality speech-to-text, TTS, and LLM tasks. (heywillow.io)
Key Features
- On-device wake-word engine and voice-activity detection with configurable wake words and up to hundreds of on-device commands. (heywillow.io)
- Integration with Home Assistant, openHAB and generic REST endpoints for home automation and custom workflows. (heywillow.io)
- Willow Inference Server (WIS) option: a performance-optimized server that supports ASR/STT (Whisper models), TTS, and optional LLM inference over REST, WebRTC, and WebSocket transports. WIS targets CUDA GPUs for low-latency workloads and includes deployment scripts and Docker Compose support. (github.com)
- Device management and OTA flashing via the Willow Application Server (WAS) with a provided Docker image to simplify onboarding. (heywillow.io)
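Since WIS exposes a REST transport for ASR/STT, a client can submit audio over plain HTTP. The sketch below shows the general shape of such a call; the `/api/asr` route, `model` parameter, and `wis.local:19000` host are illustrative assumptions, not the documented WIS API, so check your server's docs for the real routes.

```python
# Minimal sketch of submitting WAV audio to a self-hosted Willow Inference
# Server over REST. Endpoint path, query parameter, and host are ASSUMPTIONS
# for illustration -- consult the WIS documentation for the actual API.
import urllib.request


def build_asr_request(wis_base: str, wav_bytes: bytes,
                      model: str = "whisper-base") -> urllib.request.Request:
    """Build a POST request that submits raw WAV audio for transcription."""
    url = f"{wis_base.rstrip('/')}/api/asr?model={model}"  # assumed route
    return urllib.request.Request(
        url,
        data=wav_bytes,
        headers={"Content-Type": "audio/wav"},
        method="POST",
    )


if __name__ == "__main__":
    req = build_asr_request("http://wis.local:19000", b"\x00" * 16)
    # urllib.request.urlopen(req)  # uncomment against a live WIS instance
    print(req.full_url)
```

Separating request construction from the network call keeps the transcription endpoint easy to swap once the real WIS route is known.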
Use Cases
- Privacy-first smart-home voice control: local wake-word and command recognition that triggers Home Assistant automations without cloud transcription.
- On-premises speech processing: self-hosted WIS for low-latency ASR/STT and TTS for accessibility, transcription, or edge assistant applications.
- Developer integrations: embed Willow devices into custom REST/WebRTC workflows or use WIS to add LLM-powered assistants to local networks. (github.com)
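For the custom REST workflow above, a device that posts recognized text to a generic REST endpoint needs only a small HTTP listener on the local network. This is a minimal sketch of such an endpoint; the plain-text payload shape handled here is an assumption, not Willow's documented wire format.

```python
# Minimal sketch of a custom REST endpoint that a Willow device could POST
# recognized command text to. The plain-text body format is an ASSUMPTION
# for illustration, not Willow's documented payload shape.
from http.server import BaseHTTPRequestHandler, HTTPServer


class CommandHandler(BaseHTTPRequestHandler):
    def do_POST(self):
        length = int(self.headers.get("Content-Length", 0))
        body = self.rfile.read(length).decode("utf-8")
        print(f"received command text: {body}")  # e.g. "turn on the lights"
        self.send_response(200)
        self.end_headers()
        self.wfile.write(b"ok")


def make_server(port: int = 8000) -> HTTPServer:
    """Bind the command endpoint on all interfaces at the given port."""
    return HTTPServer(("0.0.0.0", port), CommandHandler)


if __name__ == "__main__":
    make_server().serve_forever()
```

From here the handler can forward the text to Home Assistant, openHAB, or any other automation backend.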
Limitations and Considerations
- Advanced WIS features (LLM, high-quality TTS) expect CUDA-capable GPUs and NVIDIA drivers; CPU-only setups are supported but significantly slower and may disable some features. (github.com)
- Primary device target is the ESP32-S3-BOX family; other hardware may require additional porting or tuning. (heywillow.io)
Willow combines a small-footprint device runtime with an optional, high-performance inference server to enable private, low-latency voice assistants and on-premises speech workflows. It is actively developed with documentation, Docker deployment options, and community discussion channels for support. (heywillow.io)
Similar Services

Open WebUI
Extensible, offline-capable web interface for LLM interactions
Feature-rich, self-hosted AI interface that integrates Ollama and OpenAI-compatible APIs, offers RAG, vector DB support, image tools, RBAC and observability.


AnythingLLM
All-in-one AI chat app with RAG, agents, and multi-model support
AnythingLLM is an all-in-one desktop and Docker app for chatting with documents using RAG, running AI agents, and connecting to local or hosted LLMs and vector databases.

LibreChat
Self-hosted multi-provider AI chat UI with agents and tools
LibreChat is a self-hosted AI chat platform that supports multiple LLM providers, custom endpoints, agents/tools, file and image chat, conversation search, and presets.


Netron
Visualizer for neural network and machine learning models
Netron is a model graph viewer for inspecting neural network and ML formats such as ONNX, TensorFlow Lite, PyTorch, Keras, Core ML, and more.

Khoj
Open-source personal AI for chat, semantic search and agents
Self-hostable personal AI 'second brain' for chat, semantic search, custom agents, automations and integration with local or cloud LLMs.

Perplexica
Privacy-focused AI answering engine with web search and citations
Self-hosted AI answering engine that combines web search with local or hosted LLMs to generate cited answers, with search history and file uploads.
Tech Stack: Docker, Python, WebRTC, C