OpenReader v3.0.0, an open-source, self-hosted text-to-speech document reader, launched on May 14, 2026. It converts PDF, EPUB, DOCX, TXT, and Markdown files into synchronized read-along sessions or exported audiobooks, with support for multiple TTS providers including local options via Kokoro-FastAPI and commercial APIs. It deploys via Docker, Vercel, or local development and requires no third-party subscription to run.

What Happened

Developer Richard Remmington published OpenReader v3.0.0 on GitHub on May 14, 2026, after 467 commits across the project's lifetime. The release introduces a rewritten TTS engine with multi-provider support, sentence-aware narration for PDFs and EPUBs, and cross-device library sync. Audiobook export lands in M4B and MP3 formats with resumable processing so large documents do not require a single uninterrupted session.

Why It Matters

Creators who consume large volumes of research papers, technical documentation, or long-form articles gain a practical passive learning tool that does not require a paid subscription like Speechify or NaturalReader. Because OpenReader is self-hosted, your documents stay on your own server. The multi-provider design means you can use a free local TTS engine during development and switch to a higher-quality commercial voice for final export, without changing the rest of your workflow.

For teams with accessibility requirements, synchronized word highlighting via whisper.cpp timestamps makes OpenReader viable as a reading-assistance tool, not just an audiobook exporter.

Key Details

  • Supported formats: EPUB, PDF, DOCX, TXT, Markdown
  • TTS providers: Kokoro-FastAPI, KittenTTS-FastAPI, and OpenAI-compatible endpoints (covers most commercial voice APIs)
  • Export: M4B and MP3 audiobook export with chapter markers and resumable processing
  • Sync: Cross-device library and reading position sync
  • Storage: Embedded SeaweedFS or S3-compatible backends (self-hosted or cloud)
  • Auth: Optional authentication, suitable for personal or team deployment
  • Deploy: Docker, Vercel, or local with automatic database migrations
  • License: Open source on GitHub

Creator Outcome: How to Integrate

The most immediate use case for AI creators is turning research papers and technical documentation into listening material. Pull the Docker image, point OpenReader at a local Kokoro-FastAPI instance for zero-cost TTS, and load PDFs from arXiv or tool documentation directly into the reader. Use the M4B export feature to create offline audiobook files you can sync to a phone and listen to away from your desk. For creators building workflows around voice synthesis, OpenReader's OpenAI-compatible endpoint support means any TTS API that follows the OpenAI speech format drops in without code changes, including ElevenLabs and similar providers.

What to Do Next

  • Clone or star the OpenReader repository and run the Docker quickstart
  • Review the releases page for Docker Compose examples and configuration reference
  • For local TTS, pair with a Kokoro-FastAPI instance to keep processing fully offline
  • Test the M4B export on a long PDF before committing to a workflow; large documents with many images can take several minutes to process