Best Self-hosted Document Processing & PDF Tools tools in 2026

43 self-hosted open source alternatives in this category

43 services found

Stirling PDF

Stirling PDF

Self-hosted PDF editing, conversion, OCR, and automation platform

74.6k
6.3k
Last commit: 7h ago

Open-source PDF platform to edit, convert, OCR, sign, redact, and automate PDF workflows via a web UI and REST API.

Alternative to:
Adobe Acrobat
Adobe Acrobat
+19
Paperless-ngx

Paperless-ngx

Document management system with OCR, search, and automated filing

36.9k
2.3k
Last commit: 1d ago

Paperless-ngx is an open-source document management system that ingests scans and files, runs OCR, and turns them into a searchable, taggable document archive.

Alternative to:
DocuWare
DocuWare
+6
Reactive Resume

Reactive Resume

Privacy-focused, open-source resume builder

35.4k
3.9k
Last commit: 1d ago

Open-source resume builder for creating, customizing, exporting and publishing resumes with templates, PDF export, public sharing and optional OpenAI assistance.

Alternative to:
Resume.io
Resume.io
+5
CyberChef

CyberChef

Browser-based toolkit for data decoding, encoding and analysis

34.1k
3.9k
Last commit: 1d ago

CyberChef is a web-based “cyber” toolkit for encoding/decoding, encryption/decryption, compression, hashing, parsing, and data transformation using drag-and-drop recipes.

ArchiveBox

ArchiveBox

Open-source self-hosted web archiving and snapshotting tool

26.9k
1.5k
Last commit: 1d ago

Self-hosted tool to collect and preserve webpages, media, and bookmarks in durable formats (HTML, PDF, WARC, MP4) with a CLI, web UI, and search.

Alternative to:
Internet Archive Wayback Machine
Internet Archive Wayback Machine
+3
ebook2audiobook

ebook2audiobook

Convert eBooks into audiobooks with TTS and optional voice cloning

18.3k
1.5k
Last commit: 5d ago

Self-hostable tool to convert non-DRM eBooks into audiobooks with chapter support, metadata, multilingual TTS engines, and optional voice cloning via a web UI or CLI.

Alternative to:
Speechify
Speechify
+7
ConvertX

ConvertX

Self-hosted web-based file converter for 1000+ formats

16k
874
Last commit: 2d ago

ConvertX is a self-hosted web file converter supporting 1000+ formats across documents, images, audio/video, ebooks, and 3D assets, with multi-file processing and account...

Alternative to:
CloudConvert
CloudConvert
+7
LanguageTool

LanguageTool

Multilingual grammar, style, and spell checker

14.1k
1.5k
Last commit: 15h ago

Open-source proofreading and writing assistant that checks spelling, grammar, punctuation, and style across 30+ languages, with an HTTP API for integrations.

Alternative to:
Grammarly
Grammarly
+4
google-webfonts-helper

google-webfonts-helper

Download and self-host Google Fonts with CSS snippets

12.9k
444
Last commit: 4mo ago

Self-host Google Fonts easily: select variants and subsets, download font files (woff2, woff, ttf, eot, svg) and generate matching CSS snippets via web UI or API.

Documenso

Documenso

Open-source document signing platform and DocuSign alternative

12.4k
2.4k
Last commit: 15h ago

Open-source e-signature platform for creating, sending, embedding, and automating legally compliant digital signatures with API and developer tooling.

Alternative to:
Docusign
Docusign
+15
BentoPDF

BentoPDF

Privacy-first, browser-based PDF toolkit for editing and conversion

11.7k
908
Last commit: 15h ago

Self-hostable, privacy-first PDF toolkit that runs fully in the browser for editing, merging, converting, and processing PDFs without server-side uploads.

Alternative to:
Adobe Acrobat
Adobe Acrobat
+19
DocuSeal

DocuSeal

Open-source platform for filling and signing documents online

11.5k
964
Last commit: 2d ago

Self-hostable DocuSign alternative to create fillable forms and collect legally binding eSignatures with API and webhooks.

Alternative to:
Docusign
Docusign
+13
Gotenberg

Gotenberg

Containerized API for document conversion and PDF generation

11.4k
738
Last commit: 1d ago

Gotenberg is a containerized HTTP API that converts HTML, Markdown, and Office documents to PDF using engines like Chromium and LibreOffice, with options to merge and aut...

Alternative to:
PDFShift
PDFShift
+9
imgproxy

imgproxy

On-the-fly image resizing, processing, and format conversion server

10.5k
727
Last commit: 1d ago

Fast, security-focused image processing server to resize, transform, optimize, and convert images on demand via URL-based HTTP requests.

Alternative to:
CloudConvert
CloudConvert
+4
Thumbor

Thumbor

On-demand image resizing, cropping, filters, and smart focal-point detection

10.5k
863
Last commit: 2mo ago

Thumbor is an open-source image processing server for on-demand resizing, smart cropping, format conversion, and filter pipelines via URL-based HTTP requests.

Alternative to:
CloudConvert
CloudConvert
+4
OmniTools

OmniTools

Self-hosted web-based utilities for file, text, and data tasks

8.7k
543
Last commit: 2d ago

OmniTools is a self-hosted web app providing browser-based utilities for images, video, PDF, text, date/time, math, and data formats, with client-side processing.

Alternative to:
iLovePDF
iLovePDF
+7
Paperless-AI

Paperless-AI

AI extension for Paperless‑ngx providing automated analysis and RAG

5.3k
259
Last commit: 3mo ago

Extension for Paperless‑ngx that uses OpenAI-compatible backends and Ollama to auto-classify, tag, index, and enable RAG-powered document chat and semantic search.

Alternative to:
AskYourPDF
AskYourPDF
+19
IronCalc

IronCalc

Open-source spreadsheet engine with XLSX import/export

3.8k
127
Last commit: 3d ago

IronCalc is a modern open-source spreadsheet engine for building and embedding spreadsheets, with XLSX reading/writing and language bindings including WebAssembly.

Alternative to:
Google Docs
Google Docs
+3
Papermerge

Papermerge

Open-source document management system for scanned documents

2.9k
303
Last commit: 3mo ago

Open-source DMS that OCRs, indexes, and manages scanned PDFs, TIFFs and images with tagging, versioning, metadata and full-text search support.

Alternative to:
DocuWare
DocuWare
+7
Speakr

Speakr

Self-hosted AI transcription and intelligent note-taking app

2.8k
220
Last commit: 17h ago

Speakr is a self-hosted web app for recording or uploading audio, transcribing with AI (including diarization), and turning conversations into searchable, shareable notes...

Alternative to:
Otter.ai
Otter.ai
+14
MAZANOKE

MAZANOKE

Privacy-focused in-browser image optimizer for self-hosting

2.5k
127
Last commit: 8mo ago

Self-hosted image optimizer that compresses and converts images locally in your browser, works offline, and keeps files private with on-device processing.

Alternative to:
Convertio
Convertio
+6
Matchering

Matchering

Reference-based audio matching and mastering toolkit

2.4k
255
Last commit: 3mo ago

Open-source Python library and Dockerized web app for reference-based audio mastering, matching loudness, EQ and stereo width to a reference track.

Docspell

Docspell

Personal document management system with OCR and metadata suggestions

2.2k
170
Last commit: 14d ago

Docspell is a self-hosted document management system that imports scanned files and email attachments, runs OCR, and helps organize documents with tags, metadata, and sea...

Alternative to:
DocuWare
DocuWare
+5
Scriberr

Scriberr

Offline AI audio and video transcription with transcript chat

2.1k
152
Last commit: 1mo ago

Scriberr is a self-hosted, privacy-focused AI transcription app for audio and video, with speaker diarization, word-level timestamps, summaries, and transcript chat.

Alternative to:
Otter.ai
Otter.ai
+6
PdfDing

PdfDing

Self-hosted PDF manager, viewer, and editor

1.6k
89
Last commit: 16h ago

Self-hosted PDF manager to organize, view, annotate, sign, and share PDFs with multi-device reading progress, tagging, and optional access-controlled links.

Alternative to:
Adobe Acrobat
Adobe Acrobat
+14
Warracker

Warracker

Open-source warranty tracker to monitor expirations and store receipts

1.3k
39
Last commit: 3mo ago

Self-hosted web app to organize product warranties, track expirations, manage claims, store receipts, and send customizable notifications.

Alternative to:
Encircle Home Inventory
Encircle Home Inventory
YAMLResume

YAMLResume

Resume-as-code tool that renders YAML resumes to PDF and more

1.2k
56
Last commit: 9d ago

Write resumes as YAML, validate with a schema, and generate professional outputs like pixel-perfect PDFs plus Markdown and HTML via a developer-friendly CLI.

Alternative to:
Resume.io
Resume.io
+5
Flyimg

Flyimg

On-the-fly image resizing, cropping, and format optimization API

1.2k
121
Last commit: 15d ago

Dockerized image processing service that fetches, resizes, crops, compresses, caches, and serves optimized images (AVIF, WebP, MozJPEG, PNG, GIF, optional JXL).

Alternative to:
CloudConvert
CloudConvert
+6
docassemble

docassemble

Expert system for guided interviews and document assembly

926
300
Last commit: 28d ago

Open-source platform for guided web interviews that collect user input, run logic, and generate documents (PDF, DOCX, RTF) or integrate with external services via APIs.

Alternative to:
Formstack
Formstack
+19
File Wizard

File Wizard

Browser-based file conversion, OCR, and audio transcription UI

818
50
Last commit: 3mo ago

Self-hosted web UI for file conversion, OCR for PDFs/images, and local Whisper-based audio transcription, wrapping common CLI tools with background jobs and history.

Alternative to:
CloudConvert
CloudConvert
+15