Self-hosted projects tagged “Document OCR”
11 open source projects with this tag
11 open source projects with this tag
11 services found

Self-hosted PDF editing, conversion, OCR, and automation platform
Open-source PDF platform to edit, convert, OCR, sign, redact, and automate PDF workflows via a web UI and REST API.

Document management system with OCR, search, and automated filing
Paperless-ngx is an open-source document management system that ingests scans and files, runs OCR, and turns them into a searchable, taggable document archive.

Open-source document management system for scanned documents
Open-source DMS that OCRs, indexes, and manages scanned PDFs, TIFFs and images with tagging, versioning, metadata and full-text search support.
Browser-based file conversion, OCR, and audio transcription UI
Self-hosted web UI for file conversion, OCR for PDFs/images, and local Whisper-based audio transcription, wrapping common CLI tools with background jobs and history.

Clean, family-focused recipe manager web application
Recipya is a simple recipe manager for collecting, importing and organizing recipes into cookbooks, digitizing paper recipes, converting units, and calculating nutrition.

Web-based PDF and reference manager for collaborative research
Web application to manage, annotate, and share academic PDFs with full-text search, OCR, citation import, and team collaboration.

Self-hosted Ruby on Rails document management server for PDFs
Ruby on Rails document management server for uploading, organizing, encrypting and full-text searching PDF documents. Provides a REST API and mobile-friendly web UI.

Receipt capture, extraction, indexing, and organization platform
Multi-platform receipt management suite with an API, desktop and mobile apps for capture, OCR extraction, indexing, and searchable archives.
Modern file sharing platform with screenshot tool integrations
Modern, self-hosted file sharing platform built with Next.js. Integrates with ShareX, Flameshot, and KDE Spectacle; supports S3/local storage, OCR, previews, and admin to...
Self-hosted secure file sharing and collaboration platform
Self-hosted file sharing platform with box-based access controls, S3 or local storage, file previews, ZIP browsing, OCR and optional AI integrations.

Open source electronic document management system (EDMS)
Mayan EDMS is an open source document management system for ingesting, indexing, organizing, and securing documents with workflows, OCR, and audit trails.