Best Self-hosted Document Processing & PDF Tools tools in 2026
43 self-hosted open source alternatives in this category
See also:
Automation & Workflow Builders (Low-code)Bookmarking, RSS & Read-laterFile Transfer & SharingNotes & Personal Knowledge ManagementPersonal Dashboards & StartpagesRemote Access & Remote DesktopTasks, Habits & Time TrackingURL Shorteners & Link ToolsWhiteboards, Diagrams & Mind Maps43 services found

Stirling PDF
Self-hosted PDF editing, conversion, OCR, and automation platform
Open-source PDF platform to edit, convert, OCR, sign, redact, and automate PDF workflows via a web UI and REST API.

Paperless-ngx
Document management system with OCR, search, and automated filing
Paperless-ngx is an open-source document management system that ingests scans and files, runs OCR, and turns them into a searchable, taggable document archive.

Reactive Resume
Privacy-focused, open-source resume builder
Open-source resume builder for creating, customizing, exporting and publishing resumes with templates, PDF export, public sharing and optional OpenAI assistance.

CyberChef
Browser-based toolkit for data decoding, encoding and analysis
CyberChef is a web-based “cyber” toolkit for encoding/decoding, encryption/decryption, compression, hashing, parsing, and data transformation using drag-and-drop recipes.

ArchiveBox
Open-source self-hosted web archiving and snapshotting tool
Self-hosted tool to collect and preserve webpages, media, and bookmarks in durable formats (HTML, PDF, WARC, MP4) with a CLI, web UI, and search.
ebook2audiobook
Convert eBooks into audiobooks with TTS and optional voice cloning
Self-hostable tool to convert non-DRM eBooks into audiobooks with chapter support, metadata, multilingual TTS engines, and optional voice cloning via a web UI or CLI.


ConvertX
Self-hosted web-based file converter for 1000+ formats
ConvertX is a self-hosted web file converter supporting 1000+ formats across documents, images, audio/video, ebooks, and 3D assets, with multi-file processing and account...

LanguageTool
Multilingual grammar, style, and spell checker
Open-source proofreading and writing assistant that checks spelling, grammar, punctuation, and style across 30+ languages, with an HTTP API for integrations.
google-webfonts-helper
Download and self-host Google Fonts with CSS snippets
Self-host Google Fonts easily: select variants and subsets, download font files (woff2, woff, ttf, eot, svg) and generate matching CSS snippets via web UI or API.

Documenso
Open-source document signing platform and DocuSign alternative
Open-source e-signature platform for creating, sending, embedding, and automating legally compliant digital signatures with API and developer tooling.


BentoPDF
Privacy-first, browser-based PDF toolkit for editing and conversion
Self-hostable, privacy-first PDF toolkit that runs fully in the browser for editing, merging, converting, and processing PDFs without server-side uploads.

DocuSeal
Open-source platform for filling and signing documents online
Self-hostable DocuSign alternative to create fillable forms and collect legally binding eSignatures with API and webhooks.

Gotenberg
Containerized API for document conversion and PDF generation
Gotenberg is a containerized HTTP API that converts HTML, Markdown, and Office documents to PDF using engines like Chromium and LibreOffice, with options to merge and aut...


imgproxy
On-the-fly image resizing, processing, and format conversion server
Fast, security-focused image processing server to resize, transform, optimize, and convert images on demand via URL-based HTTP requests.
Thumbor
On-demand image resizing, cropping, filters, and smart focal-point detection
Thumbor is an open-source image processing server for on-demand resizing, smart cropping, format conversion, and filter pipelines via URL-based HTTP requests.

OmniTools
Self-hosted web-based utilities for file, text, and data tasks
OmniTools is a self-hosted web app providing browser-based utilities for images, video, PDF, text, date/time, math, and data formats, with client-side processing.

Paperless-AI
AI extension for Paperless‑ngx providing automated analysis and RAG
Extension for Paperless‑ngx that uses OpenAI-compatible backends and Ollama to auto-classify, tag, index, and enable RAG-powered document chat and semantic search.


IronCalc
Open-source spreadsheet engine with XLSX import/export
IronCalc is a modern open-source spreadsheet engine for building and embedding spreadsheets, with XLSX reading/writing and language bindings including WebAssembly.

Papermerge
Open-source document management system for scanned documents
Open-source DMS that OCRs, indexes, and manages scanned PDFs, TIFFs and images with tagging, versioning, metadata and full-text search support.
Speakr
Self-hosted AI transcription and intelligent note-taking app
Speakr is a self-hosted web app for recording or uploading audio, transcribing with AI (including diarization), and turning conversations into searchable, shareable notes...

MAZANOKE
Privacy-focused in-browser image optimizer for self-hosting
Self-hosted image optimizer that compresses and converts images locally in your browser, works offline, and keeps files private with on-device processing.

Matchering
Reference-based audio matching and mastering toolkit
Open-source Python library and Dockerized web app for reference-based audio mastering, matching loudness, EQ and stereo width to a reference track.

Docspell
Personal document management system with OCR and metadata suggestions
Docspell is a self-hosted document management system that imports scanned files and email attachments, runs OCR, and helps organize documents with tags, metadata, and sea...
Scriberr
Offline AI audio and video transcription with transcript chat
Scriberr is a self-hosted, privacy-focused AI transcription app for audio and video, with speaker diarization, word-level timestamps, summaries, and transcript chat.

PdfDing
Self-hosted PDF manager, viewer, and editor
Self-hosted PDF manager to organize, view, annotate, sign, and share PDFs with multi-device reading progress, tagging, and optional access-controlled links.

Warracker
Open-source warranty tracker to monitor expirations and store receipts
Self-hosted web app to organize product warranties, track expirations, manage claims, store receipts, and send customizable notifications.


YAMLResume
Resume-as-code tool that renders YAML resumes to PDF and more
Write resumes as YAML, validate with a schema, and generate professional outputs like pixel-perfect PDFs plus Markdown and HTML via a developer-friendly CLI.

Flyimg
On-the-fly image resizing, cropping, and format optimization API
Dockerized image processing service that fetches, resizes, crops, compresses, caches, and serves optimized images (AVIF, WebP, MozJPEG, PNG, GIF, optional JXL).

docassemble
Expert system for guided interviews and document assembly
Open-source platform for guided web interviews that collect user input, run logic, and generate documents (PDF, DOCX, RTF) or integrate with external services via APIs.
File Wizard
Browser-based file conversion, OCR, and audio transcription UI
Self-hosted web UI for file conversion, OCR for PDFs/images, and local Whisper-based audio transcription, wrapping common CLI tools with background jobs and history.