Best Self-hosted Document Processing & PDF Tools tools in 2026
43 self-hosted open source alternatives in this category
See also:
Automation & Workflow Builders (Low-code)Bookmarking, RSS & Read-laterFile Transfer & SharingNotes & Personal Knowledge ManagementPersonal Dashboards & StartpagesRemote Access & Remote DesktopTasks, Habits & Time TrackingURL Shorteners & Link ToolsWhiteboards, Diagrams & Mind Maps43 services found

Signature PDF
Open-source web app to sign, organize, edit metadata, and compress PDFs
Self-hosted web app for signing PDFs (single or multi-signer), organizing pages (merge/rotate/extract), editing PDF metadata, and compressing files.

Fast Music Remover
Web-based background music and noise removal for media
Self-hosted web app that removes background music and reduces noise from videos or audio (including URLs), using FFmpeg and DeepFilterNet-based enhancement.

I, Librarian
Web-based PDF and reference manager for collaborative research
Web application to manage, annotate, and share academic PDFs with full-text search, OCR, citation import, and team collaboration.

EveryDocs
Self-hosted Ruby on Rails document management server for PDFs
Ruby on Rails document management server for uploading, organizing, encrypting and full-text searching PDF documents. Provides a REST API and mobile-friendly web UI.

OpenReader WebUI
Web-based text-to-speech document reader for EPUB, PDF, DOCX, MD and TXT
Next.js web app that reads EPUB, PDF, DOCX, MD and TXT using pluggable TTS providers, offering real-time read-along highlighting, word timestamps, and audiobook export.

Mere Medical
Self-hosted, offline-first personal health record aggregator
Self-hosted personal health record (PHR) that aggregates and syncs medical records from multiple patient portals into a local, privacy-first web app.
URL to PNG
HTTP service that generates PNG screenshots from URLs using Playwright
Self-hosted HTTP API to render web pages to PNG with configurable viewport, caching, Playwright-based parallel rendering, and S3/CouchDB/filesystem storage.
Autocaliweb
Web interface for browsing, reading and managing Calibre libraries
Fork of Calibre-Web providing a Bootstrap-based web UI to browse, read, convert and serve eBooks, comics and PDFs from a Calibre database with Docker support.


Receipt Wrangler
Receipt capture, extraction, indexing, and organization platform
Multi-platform receipt management suite with an API, desktop and mobile apps for capture, OCR extraction, indexing, and searchable archives.

Ackify
Cryptographic proof-of-read and document acknowledgment system
Ackify provides Ed25519-based proof-of-read for documents with immutable audit trails, OAuth2/MagicLink authentication, embeddable widgets and an admin dashboard.

Webarchive
Simple web archive: save pages as PDF, headers, or single-file HTML
Go-based self-hosted web archiver for personal/home use. Saves pages as PDF, captures HTTP headers, and stores single-file HTML. Provides a REST API and optional web UI.


Mayan EDMS
Open source electronic document management system (EDMS)
Mayan EDMS is an open source document management system for ingesting, indexing, organizing, and securing documents with workflows, OCR, and audit trails.
SANE (Scanner Access Now Easy)
Standard API and driver collection for raster image scanners
SANE provides a portable API, a collection of scanner backends and frontends, and network scanning support (saned/scanimage) for Unix-like systems.