File Wizard is a browser-based utility for converting files, running OCR on PDFs/images, and transcribing audio. It provides a simple web UI that orchestrates common command-line tools and local ML models, with job tracking and a persistent history.

Key Features

Convert between many document, image, audio, and video formats by wrapping external tools (configurable via a YAML settings file)
OCR for PDFs and images using Tesseract and OCRmyPDF, including generating searchable PDFs
Audio transcription using local Whisper models (via faster-whisper), with subtitle-style output where the Whisper tooling supports it
Drag-and-drop web interface with a responsive, dark-themed UI
Background job processing with real-time status updates and stored job history
Optional OAuth/OIDC-based access control (can run without authentication in local-only mode)
Optional CUDA-enabled container image for GPU-accelerated transcription
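
As a rough illustration of the tool-wrapping idea above, the sketch below expands a command template into an argument list and runs it without a shell. The TOOLS dictionary, its key names (command_template, formats), and the helper names are hypothetical stand-ins for whatever the parsed YAML settings actually contain; this is not File Wizard's real schema or code.

```python
import shlex
import subprocess
from pathlib import Path

# Hypothetical tool definition, shaped like one parsed entry of a YAML
# settings file. The key names here are assumptions for illustration.
TOOLS = {
    "pandoc": {
        "command_template": "pandoc {input} -o {output}",
        "formats": ["docx", "epub", "html"],
    },
}


def build_command(tool: str, input_path: Path, output_path: Path) -> list[str]:
    """Expand a tool's template into an argument list (no shell involved)."""
    template = TOOLS[tool]["command_template"]
    # Split the template first, then substitute per token, so a filename
    # containing spaces or shell metacharacters stays a single argument.
    return [
        token.format(input=str(input_path), output=str(output_path))
        for token in shlex.split(template)
    ]


def convert(tool: str, input_path: Path, output_path: Path, timeout: int = 300) -> Path:
    """Run the external converter and return the output path on success."""
    cmd = build_command(tool, input_path, output_path)
    # Passing a list (and leaving shell=False) avoids shell interpretation.
    subprocess.run(cmd, check=True, timeout=timeout)
    return output_path
```

Splitting the template before substituting paths is a deliberate choice in this sketch: a name such as "my file.docx" ends up as one argument rather than two.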

Use Cases

Convert office documents and ebooks into consistent archival formats (PDF, EPUB, DOCX)
Turn scanned PDFs into searchable documents with OCR
Create transcripts/subtitles from meeting recordings and other audio files
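
For the transcript and subtitle use case, the following sketch shows one way timestamped segments could be turned into SRT text. The function names and the (start, end, text) tuple shape are assumptions made for this example, not File Wizard's own code.

```python
from typing import Iterable, Tuple


def format_timestamp(seconds: float) -> str:
    """Render seconds as an SRT timestamp: HH:MM:SS,mmm."""
    millis = round(seconds * 1000)
    hours, millis = divmod(millis, 3_600_000)
    minutes, millis = divmod(millis, 60_000)
    secs, millis = divmod(millis, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{millis:03d}"


def segments_to_srt(segments: Iterable[Tuple[float, float, str]]) -> str:
    """Build SRT text from (start_seconds, end_seconds, text) tuples."""
    blocks = []
    for index, (start, end, text) in enumerate(segments, start=1):
        blocks.append(
            f"{index}\n"
            f"{format_timestamp(start)} --> {format_timestamp(end)}\n"
            f"{text.strip()}\n"
        )
    # A blank line separates consecutive subtitle blocks.
    return "\n".join(blocks)
```

With faster-whisper, the segments returned by WhisperModel.transcribe expose start, end, and text attributes, so they could be passed in as ((s.start, s.end, s.text) for s in segments).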

Limitations and Considerations

Not safe to expose publicly without strong authentication and isolation; because conversions run external command-line tools, a misconfigured tool definition can allow arbitrary command execution
Conversion fidelity and supported formats depend on the installed external tools and their build options
Transcription performance varies significantly by model size and whether GPU acceleration is available
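
One common mitigation for the command-execution risk noted above is to normalize uploaded filenames before they reach any external tool. The sketch below is a generic hardening pattern, not File Wizard's implementation; the ALLOWED_EXTENSIONS set and the safe_upload_name helper are hypothetical.

```python
import re
from pathlib import PurePosixPath

# Assumed allow-list for illustration; a real deployment would derive this
# from the formats its configured tools actually accept.
ALLOWED_EXTENSIONS = {"pdf", "png", "jpg", "docx", "mp3", "wav"}


def safe_upload_name(raw_name: str) -> str:
    """Return a filename that is safe to place on a command line."""
    # Keep only the final path component to block directory traversal.
    name = PurePosixPath(raw_name.replace("\\", "/")).name
    # Replace anything outside a conservative character set.
    name = re.sub(r"[^A-Za-z0-9._-]", "_", name)
    # Strip leading dots and dashes so the name is neither a hidden file
    # nor something a tool could parse as a command-line option.
    name = name.lstrip(".-")
    if not name:
        raise ValueError("empty filename after sanitizing")
    extension = name.rsplit(".", 1)[-1].lower() if "." in name else ""
    if extension not in ALLOWED_EXTENSIONS:
        raise ValueError(f"unsupported file type: {extension!r}")
    return name
```

Filename checks like this complement, rather than replace, authentication and container-level isolation.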

File Wizard is a good fit for homelabs and internal teams that want a single, lightweight web interface for conversions, OCR workflows, and local speech-to-text processing. Its tool-based architecture makes it extensible, but it should be deployed with careful security controls when used beyond local environments.

Similar Services

Stirling PDF

Paperless-ngx

Reactive Resume

CyberChef

ArchiveBox

ebook2audiobook