Self-hosted projects tagged “Document OCR

11 open source projects with this tag

11 services found

Stirling PDF

Stirling PDF

Self-hosted PDF editing, conversion, OCR, and automation platform

74.6k
6.3k
Last commit: 7h ago

Open-source PDF platform to edit, convert, OCR, sign, redact, and automate PDF workflows via a web UI and REST API.

Alternative to:
Adobe Acrobat
Adobe Acrobat
+19
Paperless-ngx

Paperless-ngx

Document management system with OCR, search, and automated filing

36.9k
2.3k
Last commit: 1d ago

Paperless-ngx is an open-source document management system that ingests scans and files, runs OCR, and turns them into a searchable, taggable document archive.

Alternative to:
DocuWare
DocuWare
+6
Papermerge

Papermerge

Open-source document management system for scanned documents

2.9k
303
Last commit: 3mo ago

Open-source DMS that OCRs, indexes, and manages scanned PDFs, TIFFs and images with tagging, versioning, metadata and full-text search support.

Alternative to:
DocuWare
DocuWare
+7
File Wizard

File Wizard

Browser-based file conversion, OCR, and audio transcription UI

818
50
Last commit: 3mo ago

Self-hosted web UI for file conversion, OCR for PDFs/images, and local Whisper-based audio transcription, wrapping common CLI tools with background jobs and history.

Alternative to:
CloudConvert
CloudConvert
+15
Recipya

Recipya

Clean, family-focused recipe manager web application

389
26
Last commit: 3mo ago

Recipya is a simple recipe manager for collecting, importing and organizing recipes into cookbooks, digitizing paper recipes, converting units, and calculating nutrition.

Alternative to:
Paprika Recipe Manager
Paprika Recipe Manager
+6
EveryDocs

EveryDocs

Self-hosted Ruby on Rails document management server for PDFs

326
20
Last commit: 7d ago

Ruby on Rails document management server for uploading, organizing, encrypting and full-text searching PDF documents. Provides a REST API and mobile-friendly web UI.

Alternative to:
DocuWare
DocuWare
+7
I, Librarian

I, Librarian

Web-based PDF and reference manager for collaborative research

325
31
Last commit: 2mo ago

Web application to manage, annotate, and share academic PDFs with full-text search, OCR, citation import, and team collaboration.

Alternative to:
Mendeley
Mendeley
+12
Receipt Wrangler

Receipt Wrangler

Receipt capture, extraction, indexing, and organization platform

202
12
Last commit: 1mo ago

Multi-platform receipt management suite with an API, desktop and mobile apps for capture, OCR extraction, indexing, and searchable archives.

Alternative to:
Expensify
Expensify
+7
Flare

Flare

Modern file sharing platform with screenshot tool integrations

103
5
Last commit: 9d ago

Modern, self-hosted file sharing platform built with Next.js. Integrates with ShareX, Flameshot, and KDE Spectacle; supports S3/local storage, OCR, previews, and admin to...

Alternative to:
WeTransfer
WeTransfer
+13
PlikShare

PlikShare

Self-hosted secure file sharing and collaboration platform

88
4
Last commit: 5mo ago

Self-hosted file sharing platform with box-based access controls, S3 or local storage, file previews, ZIP browsing, OCR and optional AI integrations.

Alternative to:
Dropbox
Dropbox
+19
Mayan EDMS

Mayan EDMS

Open source electronic document management system (EDMS)

Mayan EDMS is an open source document management system for ingesting, indexing, organizing, and securing documents with workflows, OCR, and audit trails.

Alternative to:
DocuWare
DocuWare
+6