PDFs and e-books are cousins. Both deliver formatted text, but PDFs are fixed-layout and e-books are reflowable. Converting a PDF manuscript into a publishable EPUB or MOBI is one of the most common authorial chores in 2026. The good news is that the tools have matured: clean PDFs convert in minutes; messy PDFs take patience but are manageable. This guide walks through the practical workflow.
Why convert PDF to e-book
A finished PDF (from Word, InDesign, LaTeX) is ready for print and screen reading on a desktop. For Kindle, Kobo, Apple Books, and most e-readers, you need EPUB or KPF (Kindle Publishing Format). The benefits:
- Reflowable text for different screen sizes.
- Adjustable font size, choice of font.
- Better search and highlight experience on small devices.
- Lower file sizes (often).
- Wider distribution: Amazon KDP, Apple Books, Kobo Writing Life, Smashwords, draft2digital.
A PDF is fine for some distribution (academic monographs, design-heavy works), but for trade publishing EPUB is the standard.
What "EPUB" really is
An EPUB is a zip file containing:
- HTML files (one per chapter, typically).
- CSS for styling.
- An OPF (Open Packaging Format) manifest.
- A spine (reading order).
- Metadata (title, author, ISBN, language).
- Embedded fonts (optional).
- Images.
It is essentially a small website packaged as a single file. Reflowable means the reader app decides how to lay out the text.
EPUB 3 (the current version) supports rich features: SVG, MathML, audio/video, scripting. Most readers support a core subset.
The conversion path
Three broad approaches.
Approach 1: don't start with PDF. If you have access to the source (Word, Markdown, LaTeX, InDesign), convert from there. PDF is the last resort of converters because it has already lost structure.
- Word: Save As, EPUB. Or use Calibre.
- Markdown: Pandoc to EPUB. Excellent quality.
- LaTeX: Pandoc or tex4ht.
- InDesign: built-in EPUB export.
- Scrivener: built-in compile to EPUB.
If you must start with PDF, proceed.
Approach 2: convert PDF with Calibre. Calibre is free, cross-platform, and the de facto standard:
- Open Calibre, add the PDF.
- Click "Convert books" and pick EPUB as the output format.
- Tune settings (heuristic processing, font, structure detection).
- Review the output thoroughly.
Calibre's PDF-to-EPUB is "reasonable" for clean text PDFs. For complex layouts, it's a starting point that needs cleanup.
For more on the conversion, see how to convert PDF to EPUB.
Approach 3: convert PDF to Word, fix, then convert to EPUB. A two-step path that often works better than direct conversion:
- Convert PDF to Word (with how to convert PDF to Word).
- Clean up the Word (fix headings, remove headers/footers, normalize styles).
- Save Word as EPUB or pipe through Calibre.
The Word intermediate step is editable in a familiar tool, so cleanup is easier.
Common conversion problems
PDFs convert imperfectly. Common issues:
- Headings lost. The PDF showed a big bold font, but the converter treated it as just bold text. Fix: in the source or intermediate, mark headings explicitly.
- Page headers and footers leak in. Every page has "Chapter Three" as a header, and the converter included those as inline text. Strip systematically.
- Hyphenated words at line ends. "exam-\nple" becomes "exam- ple" in the output. Run a dehyphenation pass.
- Footnotes scattered. Sometimes converted into the body inline. Manual fix: move footnotes to end-of-chapter notes.
- Image placement. Centered images may flow left; captions may attach to wrong image.
- Tables. PDF tables convert to mangled HTML tables. May need rebuilding by hand.
- Equations. Math in PDFs is hard. Best to re-author equations as MathML or images.
- Fonts. PDF font substitutions don't carry over. Re-style in CSS for EPUB.
For an academic monograph, expect days of cleanup. For a 200-page novel, an afternoon.
Cleaning EPUB output
After conversion, open the EPUB in:
- Sigil: free EPUB editor; raw HTML/CSS access.
- Calibre's editor: built into Calibre; less powerful than Sigil but adequate.
- Vellum (Mac, paid): aimed at trade authors; polishes output.
Inside the editor:
- Check chapter structure (table of contents matches actual chapters).
- Verify heading hierarchy (h1 for chapter, h2 for section).
- Strip stray text from headers/footers.
- Fix paragraph breaks; collapse hard line breaks within paragraphs.
- Validate with Sigil's or Pagina's EPUB validator.
Cover image
E-books need a cover. Standard:
- JPEG or PNG, 1600 x 2560 px (Amazon's recommended size).
- Cover separately from interior (most retailers want a separate file).
- Embed in EPUB metadata for in-reader cover display.
If you start with a PDF that has a cover page, extract that page and use it as the cover image. Tools: any PDF converter, or how to convert PDF to image.
Validation
Before publishing:
- EPUBCheck: official validator; free; available standalone or via Calibre.
- Run validation on every EPUB before upload.
- Fix every error; warnings are usually safe to leave.
Retailers (Amazon, Apple, Kobo) all run EPUBCheck internally; failing validation delays publication.
Distribution
Once you have a clean EPUB:
- Amazon KDP: upload EPUB; Amazon converts to KPF/KFX for Kindle. Direct EPUB upload supported since 2022.
- Apple Books: EPUB upload via Apple Books for Authors or via an aggregator.
- Kobo Writing Life: EPUB upload; covers Kobo and many international stores.
- Google Play Books: EPUB upload.
- Smashwords / Draft2Digital: aggregators that distribute to multiple stores from one upload.
Each platform has its quirks. Verify the rendered book in their preview tools before going live.
Metadata
EPUB metadata that matters for sales:
- Title and subtitle.
- Author (with multiple-author support).
- Description: 200-2,500 characters (varies).
- Categories / BISAC codes.
- Keywords.
- Language.
- ISBN (recommended; required by some retailers).
Edit in your EPUB editor or Calibre's metadata pane.
Special cases
Heavily illustrated books. EPUB can include images, but reflowable layout limits placement. For art books, cookbooks, graphic novels: consider fixed-layout EPUB (FLE) or stick with PDF for direct distribution.
Math-heavy books. EPUB supports MathML, but reader support varies. Consider rendering equations as SVG with text alternatives.
Comic books and manga. CBZ or CBR, or fixed-layout EPUB.
Academic with footnotes and references. EPUB supports linked footnotes and endnotes; cite-management plugins help during authoring. See citation management with PDF papers.
Multi-language editions. One EPUB per language; metadata flags the locale.
From manuscript to print PDF too
Many authors publish both EPUB and print. Print versions are PDFs sent to a print-on-demand service (KDP Print, IngramSpark). The PDF requirements:
- Specific trim size (5x8, 6x9, etc.).
- Embedded fonts.
- Bleeds where artwork goes to the edge.
- PDF/X-1a or similar print standard.
See PDF/X print format explained.
For going both directions, the cleanest pattern: maintain a single source (Word, Markdown, or InDesign) and export to both EPUB and print PDF separately.
Tools summary
For PDF-to-EPUB workflows in 2026:
- Calibre: free conversion and library.
- Pandoc: command-line; excellent for Markdown-based sources.
- Sigil: free EPUB editor.
- Vellum: Mac, paid; trade-author polish.
- Adobe InDesign: paid; professional layout plus EPUB export.
- Reedsy Book Editor: web-based; free to use.
- Scrivener: writing tool with EPUB compile.
For PDF-side prep (cropping, splitting, redacting before conversion), Docento.app handles operations locally.
Common gotchas
Track changes leaking in. Word documents with track changes accepted/rejected create messy EPUBs. Clean track changes before conversion.
Smart quotes vs straight quotes. Inconsistency is jarring. Normalize.
Em-dashes and en-dashes. EPUB displays them fine, but if you stripped them from the PDF stage, they may be missing.
Drop caps. Look great in PDF, often fail in EPUB. Test in multiple readers.
Embedded fonts. Some readers ignore them. Use widely supported font families.
Lost ToC. EPUB's logical table of contents requires explicit chapter markers. Generate from headings.
File size from images. Heavy images bloat EPUB. Compress images before embedding.
Practical recipe
For a clean PDF-to-EPUB conversion:
- Start from source if possible (Word, Markdown, LaTeX). PDF is the last resort.
- If from PDF, convert to Word first, clean, then convert to EPUB.
- Use Calibre or Sigil to inspect and clean.
- Validate with EPUBCheck.
- Preview in multiple readers (Calibre's preview, Kindle Previewer, Apple Books).
- Add a cover image.
- Set metadata.
- Upload to retailers.
Takeaway
PDF-to-EPUB conversion is solvable in 2026 with mature tools, but starting from PDF is always harder than starting from a structured source. If you have the choice, author in Word, Markdown, or LaTeX; export to both PDF and EPUB separately. For PDF prep steps before conversion, Docento.app keeps the file local. See also how to convert PDF to EPUB, PDF vs EPUB, and EPUB vs MOBI ebook formats.