ebook2audiobook
Convert eBooks into audiobooks with TTS and optional voice cloning
ebook2audiobook is a tool for generating audiobooks from non-DRM, legally acquired eBooks using multiple text-to-speech (TTS) engines. It can run with a Gradio web interface or in headless/CLI mode, and supports multilingual narration with optional voice cloning.
Key Features
- Converts many input formats including EPUB, MOBI/AZW3, FB2, PDF, DOC/DOCX, HTML, RTF, TXT, and image-based documents
- OCR support for scanned pages and image-based eBooks
- Multiple TTS engine options (including XTTSv2 and others) with broad language coverage
- Optional voice cloning using a provided reference voice file
- Supports custom XTTSv2 model uploads (e.g., zipped model artifacts)
- Outputs common audiobook/audio formats including MP3, M4B, M4A, AAC, FLAC, OGG, WAV, and WebM
- Runs on CPU or accelerators (CUDA and other backends depending on environment)
Use Cases
- Converting personal eBook libraries into listenable audiobooks with chapters and metadata
- Producing multilingual narration for accessibility, language learning, or travel
- Creating custom-voice narration for personal use using voice cloning
Limitations and Considerations
- Intended for non-DRM, legally acquired eBooks; DRM-protected sources require separate lawful handling
- OCR quality and document structure (especially EPUB chapter boundaries) can affect chapter splitting and narration results
It is well-suited for users who want a local web UI and batch-capable CLI for audiobook generation, while keeping flexibility in TTS engines, languages, and output formats. With GPU acceleration and suitable TTS models, it can significantly improve throughput and audio quality for larger books.
Categories:
Tags:
Tech Stack:
Similar Services

Stirling PDF
Self-hosted PDF editing, conversion, OCR, and automation platform
Open-source PDF platform to edit, convert, OCR, sign, redact, and automate PDF workflows via a web UI and REST API.

Paperless-ngx
Document management system with OCR, search, and automated filing
Paperless-ngx is an open-source document management system that ingests scans and files, runs OCR, and turns them into a searchable, taggable document archive.

Reactive Resume
Privacy-focused, open-source resume builder
Open-source resume builder for creating, customizing, exporting and publishing resumes with templates, PDF export, public sharing and optional OpenAI assistance.

CyberChef
Browser-based toolkit for data decoding, encoding and analysis
CyberChef is a web-based “cyber” toolkit for encoding/decoding, encryption/decryption, compression, hashing, parsing, and data transformation using drag-and-drop recipes.

ArchiveBox
Open-source self-hosted web archiving and snapshotting tool
Self-hosted tool to collect and preserve webpages, media, and bookmarks in durable formats (HTML, PDF, WARC, MP4) with a CLI, web UI, and search.

ConvertX
Self-hosted web-based file converter for 1000+ formats
ConvertX is a self-hosted web file converter supporting 1000+ formats across documents, images, audio/video, ebooks, and 3D assets, with multi-file processing and account...
Docker
Python
PyTorch