How to Fix EPUB Formatting Issues After Conversion
You converted a file to EPUB, opened it on your reader, and the formatting is a disaster. Images are missing, bold text is gone, paragraphs have merged into walls of text, and hyperlinks are broken. Sound familiar? Here’s how to fix every common EPUB formatting issue.
Why EPUB Formatting Breaks During Conversion
EPUB formatting issues almost always come from the conversion step, not the EPUB format itself. EPUB is a well-designed format that supports rich formatting, images, hyperlinks, and structured content. The problem is that many converters do a poor job of extracting and mapping the source file’s formatting into proper EPUB structure.
The most common source of problems is PDF to EPUB conversion. PDFs use fixed positioning for every element, while EPUB is reflowable. Bridging this gap requires sophisticated analysis of fonts, spacing, images, and document structure — which many basic converters skip entirely.
The 8 Most Common EPUB Formatting Issues (and How to Fix Them)
Issue 1: Bold and Italic Text Disappeared
What happened: Your source file had bold headings, italic emphasis, and monospace code, but the EPUB shows everything in the same plain font.
Why it happens: Many converters extract raw text from the source file without reading font metadata. In a PDF, bold text isn’t flagged with a “bold” attribute — it’s rendered in a different font (e.g., “TimesNewRoman-Bold” vs “TimesNewRoman”). Converters that don’t analyze font names will lose all styling information.
How to fix it:
- Use a better converter: CheersPDF’s PDF to EPUB converter reads font name metadata for every text span and applies
<strong>,<em>, and<code>tags automatically. - Manual fix in Calibre: Open the EPUB in Calibre’s editor (Edit Book → select the EPUB), then manually add bold/italic tags to the XHTML files.
- Manual fix with Sigil: Sigil is a free EPUB editor. Open your EPUB, find the chapter files, and add
<strong>or<em>tags around text that should be styled.
Issue 2: Missing or Broken Images
What happened: The source file had images, but the EPUB shows broken image placeholders or no images at all.
Why it happens: Image extraction from PDFs requires rendering each image object to a canvas and exporting it. Some converters skip images entirely, extract them at tiny resolutions, or fail to properly reference the image files in the EPUB’s XHTML content.
How to fix it:
- Use a converter with proper image extraction: CheersPDF extracts images at their original resolution, renders them to a canvas, and exports at 92% JPEG quality for large images or lossless PNG for smaller graphics.
- Check the EPUB structure: Open the EPUB as a ZIP file (rename .epub to .zip). Look in the OEBPS/images/ folder. If images are there but not showing, the XHTML files may have broken
<img>references. - Re-add images manually: In Sigil, you can add images to the EPUB and insert
<img>tags in the appropriate chapter files.
Issue 3: Paragraphs Merged Into Walls of Text
What happened: In the source file, paragraphs were clearly separated. In the EPUB, everything runs together with no spacing between paragraphs.
Why it happens: PDF text extraction returns individual text items with X/Y coordinates, but no explicit “paragraph break” marker. Basic converters treat every line as continuous text, only breaking at page boundaries. Without analyzing the vertical gap between lines, paragraph breaks are lost.
How to fix it:
- Use a converter with gap analysis: CheersPDF measures the Y-position distance between consecutive text lines and compares it to the statistical median line gap. A gap exceeding 1.8x the typical line spacing triggers a new paragraph.
- Manual fix: Open the EPUB in Sigil and add
<p>tags to separate paragraphs. This is tedious for long documents but effective. - Calibre heuristic processing: In Calibre, enable “Heuristic processing” and the “Unwrap lines” option during conversion. This can help detect paragraph boundaries.
Issue 4: Broken or Missing Hyperlinks
What happened: The source PDF had clickable URLs, but in the EPUB they’re just plain text.
Why it happens: Hyperlinks in PDFs are stored as “annotations” — invisible rectangles overlaid on the text, each with a target URL. These annotations are separate from the text content. Converters that only extract text will miss the link annotations entirely.
How to fix it:
- Use CheersPDF: CheersPDF reads PDF link annotations, matches them to text spans by position, and wraps the linked text in
<a>tags with the original URLs. Links are sanitized to only allow http, https, and mailto protocols. - Manual fix: In Sigil, find URL text in the XHTML and wrap it in
<a href="URL">...</a>tags. - Auto-link URLs: Some EPUB editors can detect URL patterns and auto-convert them to clickable links.
Issue 5: Wrong Image Sizes
What happened: Images appear in the EPUB but are either tiny or stretched to fill the entire page, ruining the reading experience.
Why it happens: If the converter doesn’t preserve the original image dimensions, the EPUB reader has to guess the display size. Without explicit width and height attributes, images may be scaled unpredictably.
How to fix it:
- Use a converter that preserves dimensions: CheersPDF stores the original width and height of every extracted image and includes these as attributes in the EPUB’s
<img>tags, ensuring correct display. - Edit the CSS: In the EPUB’s stylesheet, add
img { max-width: 100%; height: auto; }to prevent images from overflowing while maintaining their aspect ratio. - Fix individual images: In Sigil, edit the
<img>tags to add explicit width/height attributes or inline styles.
Issue 6: No Chapter Structure or Table of Contents
What happened: The EPUB has no chapters in the table of contents — it’s just one long, continuous document.
Why it happens: The converter didn’t detect chapter boundaries in the source file, or didn’t create separate XHTML files for each chapter. Without a proper table of contents (NCX and navigation documents), ebook readers can’t show a chapter list.
How to fix it:
- Look for page-break converters: CheersPDF creates chapter breaks based on the PDF’s page count, generating a navigable chapter structure with both EPUB 3 navigation and EPUB 2 NCX compatibility.
- Add chapters in Sigil: Split the content into separate XHTML files at chapter boundaries, then update the navigation document.
- Add chapters in Calibre editor: Use Calibre’s “Edit Book” feature to split content and regenerate the table of contents.
Issue 7: Garbled Characters or Encoding Issues
What happened: Some characters appear as question marks, boxes, or completely wrong symbols in the EPUB.
Why it happens: Character encoding mismatches. The source file uses one encoding (e.g., Windows-1252) but the converter outputs UTF-8 without proper mapping. This is most common with non-English text and special characters.
How to fix it:
- Verify source encoding: If converting with Calibre, set the correct input encoding in the conversion settings.
- Check XHTML declaration: Ensure the EPUB’s XHTML files have
<?xml version="1.0" encoding="UTF-8"?>at the top. - Re-convert with UTF-8: CheersPDF outputs UTF-8 EPUB files by default, which handles all Unicode characters correctly.
Issue 8: Lists Lost Their Formatting
What happened: Bulleted or numbered lists in the source file appear as plain paragraphs in the EPUB, with bullet characters or numbers mixed into the text.
Why it happens: PDF files don’t have a “list” element. Bullets and numbers are just rendered text characters. Converters must detect list patterns (lines starting with •, –, numbers) and convert them to proper <ul>/<ol>/<li> HTML tags.
How to fix it:
- Use a smart converter: CheersPDF detects common list prefixes (bullets, dashes, numbers, letters) and converts them to proper HTML lists with the prefix character removed.
- Manual fix: In Sigil, replace paragraph tags containing list items with proper
<ul><li>...</li></ul>or<ol><li>...</li></ol>markup.
Prevention Is Better Than Cure
The best way to avoid EPUB formatting issues is to use a high-quality converter from the start. CheersPDF’s PDF to EPUB converter is specifically designed to preserve formatting by:
- Analyzing font metadata for every text span (bold, italic, monospace detection)
- Reading PDF link annotations and creating clickable hyperlinks
- Extracting images at original resolution with preserved dimensions
- Detecting paragraph breaks through Y-position gap analysis
- Recognizing list patterns and converting to proper HTML lists
- Detecting bold short lines as subheadings for better document structure
All of this happens in your browser with zero file upload, making it both the highest-quality and most private option available.
Quick tip: If you’ve already converted a file and the EPUB has issues, try re-converting the original source file with CheersPDF before attempting manual fixes. A better converter often solves the problem without any editing.
Tools for Editing EPUB Files
If you still need to manually edit your EPUB after conversion, here are the best tools:
- Sigil: Free, open-source EPUB editor for Windows, macOS, and Linux. The best option for hands-on EPUB editing with full access to XHTML, CSS, and images.
- Calibre’s Edit Book: Built into Calibre, it provides a code editor for EPUB content with preview. Good for quick fixes.
- Any text editor: Since EPUB files are ZIP archives containing XHTML files, you can rename .epub to .zip, edit the files with any text editor, and re-zip.
Get Clean EPUB Conversions from the Start
Free, private, and instant. No sign-up required.