Tag Archives: PDF Transformer 3.0

Tag removed

When converting a PDF file to a DOC/DOCX file for translation, you may end up with a bunch of “Optional Hyphens”—or as SDL Trados 2009 refers to them, “softbreakhyphens”.  Academic publications are particularly affected.  These symbols in your source text can trigger some annoying tag verification errors in Trados.

image

What’s so irritating?  Beyond the useless tag verification errors, the hyphens can also confuse tools like Lingvo X3 and prevent matching within translation memories. There can literally be hundreds of these irritations in an article.  There are no less than five (highlighted in yellow) in the abstract below. 

The annoying hyphens in this PDF are highlighted in yellow.

They don’t go away, even after converting a PDF to DOC/DOCX.

After converting to DOC/DOCX, the hyphens remain.

Here’s a trick to quickly get rid of them before parsing the converted DOC/DOCX file with translation software such as SDL Trados.

  1. Bring up the Find and Replace dialog. (CTRL+H)
  2. For the “Find what” field, select “Optional Hyphen” from the Special drop down menu.
  3. Leave the “Replace with” field empty.
  4. Click “Replace All”.

Say goodbye to the unhelpful hyphens.

image

Friends don’t let friends use ABBYY PDF Transformer 3.0 on academic papers

I’m translating an academic paper on image registration.  The paper is published online as a PDF by E. V. Mjasnikov at Samara State Aerospace University, but I what I wanted was a Word document.  I thought I’d give ABBYY PDF Transformer 3.0 a whirl.  Fail.  It destroys mathematical symbols and formulae.  Not the right tool for the job.  For example, this equation for the mutual correlation between two images

image

becomes this:

II /i (ti, t2 ) / 2 (ti -Ti, t2-T2 ).

Of course, that was before I realized that I needed to add Greek to the list of PDF Document Languages:

image

While that improved the results, it did not satisfy me.  The improved-yet-unsatisfactory results follow:

ΣΣ7ι (‘ι,’2)12 (‘ι-τι,h-Т2)

C (τι,

ΣΣ722 (‘ι -τι,’2 -τ2)

If ABBYY’s PDF Transformer 3.0 is not the right tool for the job, what is?  Any suggestions?