Why are computer users so inefficient
Why are PDFs generated from MS Word so large?
I made a simple MS Word document containing just this sentence:
This is a small document.
Nothing else. Then I saved this document as a DOCX and as a PDF. Here are the file sizes:
DOCX: 12 kB
PDF: 89 kB
That difference is technical huge and bothers me a lot when mainly using text documents Dozens of KB start in DOCX to create PDFs with a Size by hundreds of KB too create. What is so inefficient about PDF format? Or is just Word using a terrible output algorithm?
By the way, the PDF output settings have been set so that the smallest possible file is created:
If you open the PDF in Notepad ++ you will find:
and this object is referenced here at the end of the / FontFile2 instruction:
The fonts used by the Word document are embedded in the PDF file so that the PDF file is self-contained.
I used this slide deck to decipher the PDF instructions.
If you want to prevent the fonts from being embedded in the PDF file, make sure that one of the 14 standard fonts available in PDF viewers is used in your Word document (source Wikipedia).
- Times New Roman> Times (v3) (regular, italic, bold and bold italic)
- Courier New> Courier (regular, oblique, bold and bold oblique)
- Arial> Helvetica (v3) (regular, oblique, bold and bold oblique)
- Symbol> symbol
- Wingdings> Zapf Dingbats
This has happened to me many times in Microsoft Word when trying to export a simple manuscript as a PDF. A 5 to 8-page Word document that is approximately 50 KB in size will become a PDF file over 10 MB that is far too large to email anyone.
Rene's answer is on the right track - the problem is fonts are embedded in the document - but using any of the standard fonts won't necessarily solve the problem.
All of my documents were in Times New Roman and were in bold and italics. At least I thought. It turns out that in my default template I use the automatic kerning activated (for obvious reasons). When exporting to PDF, Word actually embed each of these ligatures as a separate font object in the document, making it incredibly bloated.
The update is easy, you just have to remember to do it every time:
- Select all of the text in the document.
- Format → Font → Advanced
- Disable "Kerning for fonts"
Interestingly, you can leave ligatures, contextual alternatives, and other advanced typography features enabled. They have no noticeable impact on the size of the resulting PDF file.
Re-export the document as PDF and it will be approximately 100KB in size. Unfortunately the kerning is below average so I wouldn't recommend printing, but it works fine for emailing a document.
A less technical answer might be for PDF files to use vectors (i.e. math equations) to describe everything you see. All curves and lines are defined by mathematical equations. As a result, there is a lot of information to have on hand, especially if your documents contain images.
This has the advantage that you can theoretically zoom infinitely close without losing resolution or detail, since the lines and curves have no width and can therefore be scaled with your zoom.
Just like the recent change to the Google font reduced the logo size from ~ 14KB to ~ 300KB, simpler fonts are likely to help reduce the file size.
- Which laptops have an i7MQ CPU
- Why is negativity more attractive than positivity
- Which planet moon is called Titan
- How did you learn to trade stocks
- Does the weather affect satellite TV reception
- What are the challenges of conflict reporting
- What is a rating system for ghee
- What are some classic greek love songs
- Wealthy people become philanthropists to get attention
- How are the prices for carpooling regulated
- What makes a drug addictive or not
- What is Facebook's design mantra
- Is JUET Guna better for CSE
- What is the Regression Suite Automation Tool
- What is the name of Canada in French
- What are fashion categories
- How much sun does rosemary need
- How would one explain insecticides?
- Will Daenerys and Jon get together
- Smoking versus vaping
- Which herbs grow best indoors
- Why do petunias turn yellow
- There is a maximum wage
- How do you get 1,000 dollars
- U Haul has a locator
- How can we resolve social injustice
- There are capsules for weight gain
- Are Sony headphones good
- Which dataset topics interact well
- What is chess a good metaphor for?
- What is the Proteus good for?
- Is green tea as healthy as advertised
- Can you solve math