Why Is My PDF File So Large? (And How to Fix It)

You exported a report, received a contract, or scanned a form — and the resulting PDF is enormous. A file that should be a few megabytes is somehow 80 MB. Before you can fix the problem, you need to understand what is actually inside that file.

PDFs are not single-purpose containers. They can hold text, embedded fonts, high-resolution images, colour profiles, revision history, audio, video, attachments, and dozens of other data types. Each element adds to the final file size — and some add far more than you would expect.

This guide covers the six most common causes of unexpectedly large PDF files, how to identify which one is responsible for your document, and the fastest way to fix each one.

The Six Main Causes of Large PDF Files

1. High-Resolution Embedded Images

Images are almost always the primary reason a PDF is large. When a document is created in Word, InDesign, or any design tool, images are embedded at their original resolution — sometimes 300 DPI or higher. A single full-page photograph at print resolution can exceed 10 MB on its own.

The problem is compounded when images are exported without downsampling. A 5000 × 4000 pixel photo embedded in a document that will only ever be read on screen carries resolution data that serves no purpose and adds significant weight.

How to spot it: Open the PDF and check the file size against the number of pages. If a 10-page document exceeds 5 MB, embedded images are almost certainly the cause.

2. Embedded Fonts

PDFs embed fonts to ensure the document looks identical on every device, regardless of what fonts are installed. This is a feature, not a bug — but it comes with a cost.

Each unique font family used in a document can add 50–300 KB to the file. A professionally designed brochure using six custom fonts may add 1–2 MB of font data alone. Some older PDF generators embed the complete font file instead of just the characters used in the document (called font subsetting), which inflates the file further.

Tip: If you control how the PDF is created, enable font subsetting in your export settings. This embeds only the specific characters used in your document rather than the full font file.

3. Revision History and Undo Data

Every time you save a PDF using certain tools — particularly Adobe Acrobat — the application appends a new revision to the end of the file rather than overwriting the previous version. This incremental saving model preserves undo history but causes file size to grow with each save.

A document that has been edited and saved 20 times may contain 19 complete revision snapshots inside the file, even though only the final version is visible. The solution is straightforward: use Save As (not Save) to flatten the history and write a clean, single-version file.

4. Uncompressed or Poorly Compressed Content Streams

PDF content streams — the instructions that describe how text and graphics are drawn on each page — can be stored either compressed (using Deflate/ZIP compression) or uncompressed. Documents created by older software, or exported from certain CAD and engineering applications, often use uncompressed streams.

Reprocessing the file through a modern PDF optimiser applies compression to these streams and can reduce file size by 20–40% even when images are not involved.

5. Embedded Metadata, Thumbnails, and Attachments

PDFs can carry:

Embedded thumbnails for each page (generated by some applications for fast preview)
XMP metadata describing author, software, creation date, and copyright information
Embedded file attachments — spreadsheets, images, or other documents attached inside the PDF
Colour profile data used for print production

Print-production PDFs frequently include ICC colour profiles that add 500 KB to 2 MB per profile. If the document will only be read on screen, these profiles serve no purpose.

6. Scanned Documents Without Compression

A scanned PDF is essentially a series of photographs, one per page. If the scanning software saves each page as an uncompressed or lightly compressed TIFF, even a simple 10-page text document can exceed 50 MB.

Modern scanners and scanning apps usually apply some compression, but the default settings are often conservative to preserve quality for archiving. For documents that only need to be readable — not archival-quality — aggressive recompression can reduce scanned PDF sizes by 70–85%.

How to Check What Is Making Your PDF Large

Before applying any compression, it helps to know which factor is dominant:

Open the PDF in a viewer and zoom in on images. If they appear at high resolution on screen (sharp and detailed at 200% zoom), large embedded images are likely the cause.
Right-click the file and check properties. Compare file size to page count. More than 500 KB per page with no images usually indicates fonts, colour profiles, or revision history.
Check if the file was edited multiple times. If yes, Save As in Acrobat or re-export from the source application before compressing.

Note: A PDF with only text and no images should be under 100 KB for most documents under 20 pages. If yours is significantly larger, embedded fonts, colour profiles, or revision data are the likely culprits.

The Fastest Way to Fix a Large PDF

For most users, the quickest path to a smaller PDF is an online compressor. ShellPDF offers two compression modes to address different types of large files:

Free Compression (browser-based): Reorganises the PDF structure, removes redundant metadata, and applies stream compression without touching image quality. No upload required — processing happens entirely on your device. Best for documents where file structure is the primary issue.

Stronger Compression (Ghostscript): Resamples embedded images at a lower DPI using Ghostscript, one of the most reliable PDF rendering engines available. This is the right choice for scanned documents, image-heavy reports, and presentations where visual content is the dominant cause of file size.

Try ShellPDF Compress PDF — free, no signup required →

When Compression Is Not Enough

If you need a smaller file for a specific purpose — such as an email attachment limit or a form upload cap — consider the following additional steps:

Remove pages you do not need. If the document includes appendices or blank pages, removing them before compressing reduces the base file size.
Flatten annotations and form fields. Interactive form fields and sticky note annotations add data. Flattening converts them to static content.
Convert to greyscale. Colour PDFs are larger than black-and-white equivalents. For text-heavy documents, converting to greyscale can reduce size by 20–40% with no loss of readability.
Split the document. If the compressed file still exceeds a limit, splitting it into two smaller files is a reliable workaround.

Summary

Large PDF files most commonly result from high-resolution embedded images, full font files, accumulated revision history, or uncompressed content streams. Identifying the specific cause helps you choose the right fix — but for most documents, running the file through a quality compressor resolves the issue in under 30 seconds.

If your PDF is still too large after compression, the problem is likely scanned image content at very high DPI, and Ghostscript-based compression (Stronger mode) is the appropriate tool for the job.

Frequently Asked Questions

When Word exports to PDF, it embeds full font files, high-resolution images, and sometimes metadata that the source .docx did not contain. Vector graphics from Office shapes can also expand significantly in PDF format. Using a PDF compressor after export typically reduces the file by 30–60%.

It depends on the method. Browser-based compression (like ShellPDF's Free Compression) reorganises the file structure without touching image pixels, so quality is identical. Server-based Ghostscript compression resamples images at a lower DPI, which reduces visual quality slightly but produces much smaller files. For documents that will only be read on screen, the difference is rarely noticeable.

ShellPDF supports files up to 200 MB per batch for server compression. Browser compression handles files up to 10 MB locally on your device.

Yes. Scanned PDFs are essentially large JPEG or TIFF images inside a PDF container. Ghostscript compression (ShellPDF's Stronger Compression mode) re-encodes those images at a lower resolution, which typically cuts scanned PDF sizes by 50–80%.

Acrobat adds revision history each time you save. Over multiple saves, this history accumulates hidden data. Using File > Save As instead of Save removes this history and often produces a noticeably smaller file.

Free Tool

Compress PDF

Smaller PDFs, same important content.

Try Compress PDF

pdf sizelarge pdfreduce pdfpdf troubleshooting

Back to all articles

Why Is My PDF File So Large? (And How to Fix It)

The Six Main Causes of Large PDF Files

1. High-Resolution Embedded Images

2. Embedded Fonts

3. Revision History and Undo Data

4. Uncompressed or Poorly Compressed Content Streams

5. Embedded Metadata, Thumbnails, and Attachments

6. Scanned Documents Without Compression

How to Check What Is Making Your PDF Large

The Fastest Way to Fix a Large PDF

When Compression Is Not Enough

Summary

Frequently Asked Questions

Compress PDF

Related Articles

How to Compress a PDF Without Losing Quality