Converted Document Contains Strange Characters

Some PDF documents use an embedded font set. When trying to convert embedded characters, the preview and the resulting target will look like gibberish. Sometimes, only some of the characters are replaced. 

There are two ways to determine whether or not a font is embedded: 

Method 1

  1. Open the PDF with Acrobat reader. 
  2. Go to File > Properties and select the “Fonts” tab. 
  3. Look for the term “(Emdedded Subset)”. This implies that there are embedded fonts in the document. 

Method 2

  1. Open the PDF with Acrobat reader. 
  2. Try to copy the text and paste it into another application such as notepad. 
  3. If the text was not copied correctly from Acrobat, it is an embedded font problem. 
  4. If the text was copied correctly with Acrobat, this is a PDF2XL bug. 

Embedded fonts are more likely to happen in a foreign language such as Hebrew, Arabic, Japanese, and so on. 

In order to convert a document with embedded fonts, you will ned the OCR capability available in PDF2XL OCR or Enterprise. There is no guarantee that the OCR will be able to convert your embedded fonts 100% of the time.

Still need help? Contact Us Contact Us