You exported a report, received a contract, or scanned a form — and the resulting PDF is enormous. A file that should be a few megabytes is somehow 80 MB. Before you can fix the problem, you need to understand what is actually inside that file.
PDFs are not single-purpose containers. They can hold text, embedded fonts, high-resolution images, colour profiles, revision history, audio, video, attachments, and dozens of other data types. Each element adds to the final file size — and some add far more than you would expect.
This guide covers the six most common causes of unexpectedly large PDF files, how to identify which one is responsible for your document, and the fastest way to fix each one.
The Six Main Causes of Large PDF Files
1. High-Resolution Embedded Images
Images are almost always the primary reason a PDF is large. When a document is created in Word, InDesign, or any design tool, images are embedded at their original resolution — sometimes 300 DPI or higher. A single full-page photograph at print resolution can exceed 10 MB on its own.
The problem is compounded when images are exported without downsampling. A 5000 × 4000 pixel photo embedded in a document that will only ever be read on screen carries resolution data that serves no purpose and adds significant weight.
How to spot it: Open the PDF and check the file size against the number of pages. If a 10-page document exceeds 5 MB, embedded images are almost certainly the cause.
2. Embedded Fonts
PDFs embed fonts to ensure the document looks identical on every device, regardless of what fonts are installed. This is a feature, not a bug — but it comes with a cost.
Each unique font family used in a document can add 50–300 KB to the file. A professionally designed brochure using six custom fonts may add 1–2 MB of font data alone. Some older PDF generators embed the complete font file instead of just the characters used in the document (called font subsetting), which inflates the file further.
3. Revision History and Undo Data
Every time you save a PDF using certain tools — particularly Adobe Acrobat — the application appends a new revision to the end of the file rather than overwriting the previous version. This incremental saving model preserves undo history but causes file size to grow with each save.
A document that has been edited and saved 20 times may contain 19 complete revision snapshots inside the file, even though only the final version is visible. The solution is straightforward: use Save As (not Save) to flatten the history and write a clean, single-version file.
4. Uncompressed or Poorly Compressed Content Streams
PDF content streams — the instructions that describe how text and graphics are drawn on each page — can be stored either compressed (using Deflate/ZIP compression) or uncompressed. Documents created by older software, or exported from certain CAD and engineering applications, often use uncompressed streams.
Reprocessing the file through a modern PDF optimiser applies compression to these streams and can reduce file size by 20–40% even when images are not involved.
5. Embedded Metadata, Thumbnails, and Attachments
PDFs can carry:
- Embedded thumbnails for each page (generated by some applications for fast preview)
- XMP metadata describing author, software, creation date, and copyright information
- Embedded file attachments — spreadsheets, images, or other documents attached inside the PDF
- Colour profile data used for print production
Print-production PDFs frequently include ICC colour profiles that add 500 KB to 2 MB per profile. If the document will only be read on screen, these profiles serve no purpose.
6. Scanned Documents Without Compression
A scanned PDF is essentially a series of photographs, one per page. If the scanning software saves each page as an uncompressed or lightly compressed TIFF, even a simple 10-page text document can exceed 50 MB.
Modern scanners and scanning apps usually apply some compression, but the default settings are often conservative to preserve quality for archiving. For documents that only need to be readable — not archival-quality — aggressive recompression can reduce scanned PDF sizes by 70–85%.
How to Check What Is Making Your PDF Large
Before applying any compression, it helps to know which factor is dominant:
- Open the PDF in a viewer and zoom in on images. If they appear at high resolution on screen (sharp and detailed at 200% zoom), large embedded images are likely the cause.
- Right-click the file and check properties. Compare file size to page count. More than 500 KB per page with no images usually indicates fonts, colour profiles, or revision history.
- Check if the file was edited multiple times. If yes, Save As in Acrobat or re-export from the source application before compressing.
The Fastest Way to Fix a Large PDF
For most users, the quickest path to a smaller PDF is an online compressor. ShellPDF offers two compression modes to address different types of large files:
Free Compression (browser-based): Reorganises the PDF structure, removes redundant metadata, and applies stream compression without touching image quality. No upload required — processing happens entirely on your device. Best for documents where file structure is the primary issue.
Stronger Compression (Ghostscript): Resamples embedded images at a lower DPI using Ghostscript, one of the most reliable PDF rendering engines available. This is the right choice for scanned documents, image-heavy reports, and presentations where visual content is the dominant cause of file size.
When Compression Is Not Enough
If you need a smaller file for a specific purpose — such as an email attachment limit or a form upload cap — consider the following additional steps:
- Remove pages you do not need. If the document includes appendices or blank pages, removing them before compressing reduces the base file size.
- Flatten annotations and form fields. Interactive form fields and sticky note annotations add data. Flattening converts them to static content.
- Convert to greyscale. Colour PDFs are larger than black-and-white equivalents. For text-heavy documents, converting to greyscale can reduce size by 20–40% with no loss of readability.
- Split the document. If the compressed file still exceeds a limit, splitting it into two smaller files is a reliable workaround.
Summary
Large PDF files most commonly result from high-resolution embedded images, full font files, accumulated revision history, or uncompressed content streams. Identifying the specific cause helps you choose the right fix — but for most documents, running the file through a quality compressor resolves the issue in under 30 seconds.
If your PDF is still too large after compression, the problem is likely scanned image content at very high DPI, and Ghostscript-based compression (Stronger mode) is the appropriate tool for the job.


