Self-hosted projects tagged “AI Model Serving”
9 open source projects with this tag
9 services found

Ollama
Run and manage large language models locally with an API
Ollama is a local LLM runtime that lets you pull, run, and customize models, offering a CLI and REST API for chat, generation, and embeddings.
LocalAI
OpenAI-compatible local AI inference server and API
Run LLMs, image, and audio models locally with an OpenAI-compatible API, optional GPU acceleration, and a built-in web UI for managing and testing models.

Netron
Visualizer for neural network and machine learning models
Netron is a model graph viewer for inspecting neural network and ML formats such as ONNX, TensorFlow Lite, PyTorch, Keras, Core ML, and more.

Jina
Cloud-native Python framework for serving multimodal AI services
Open-source Python framework to build, scale, and deploy multimodal AI services and pipelines with gRPC/HTTP/WebSocket support and Kubernetes/Docker integration.
Willow
Open-source, privacy-focused voice assistant platform
Self-hosted voice assistant platform for ESP32 devices with on-device wake-word and command recognition, Home Assistant integration, and an optional inference server for...


Speaches
OpenAI API-compatible server for speech-to-text and text-to-speech
Self-hosted, OpenAI API-compatible server for streaming transcription, translation, and speech generation using faster-whisper and TTS engines like Piper and Kokoro.

Unblink
AI camera monitoring with federated vision workers
Open-source AI camera monitoring that routes camera streams through a relay/node proxy and broadcasts frames to federated AI workers for detections, summaries, and alerts...
withoutBG
Open-source image background removal with local models and hosted API
Open-source background-removal toolkit offering Focus/Snap local models, a Docker web app and Python SDK, plus a Pro API (Inferentia‑accelerated) for production use.

WhoRang
AI-powered doorbell intelligence with face recognition and analytics
WhoRang turns doorbell camera events into AI-analyzed visitor insights with face recognition, multi-provider AI integration, real-time notifications, and mobile-first das...