When converting a PDF file to a DOC/DOCX file for translation, you may end up with a bunch of “Optional Hyphens”—or as SDL Trados 2009 refers to them, “softbreakhyphens”. Academic publications are particularly affected. These symbols in your source text can trigger some annoying tag verification errors in Trados.
What’s so irritating? Beyond the useless tag verification errors, the hyphens can also confuse tools like Lingvo X3 and prevent matching within translation memories. A single article can contain hundreds of them. There are no fewer than five (highlighted in yellow) in the abstract below.
They survive the PDF to DOC/DOCX conversion and stay in the converted file until you remove them yourself.
Here’s a trick to quickly get rid of them in Word before parsing the converted DOC/DOCX file with translation software such as SDL Trados.
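- Bring up the Find and Replace dialog (Ctrl+H).
- Click in the “Find what” field and select “Optional Hyphen” from the Special drop-down menu (click “More >>” first if the Special button is hidden). This enters the code ^- in the field.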
- Leave the “Replace with” field empty.
- Click “Replace All”.
Say goodbye to the unhelpful hyphens.
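If you have a whole batch of converted files, the same cleanup can be scripted instead of clicked through. What follows is only a rough sketch in Python, not something Word or Trados provides. It assumes the optional hyphens are stored either as `<w:softHyphen/>` elements or as the Unicode soft hyphen character (U+00AD), and that the XML parts under `word/` are UTF-8 encoded. The function names `strip_optional_hyphens` and `clean_docx` are made up for this example, and it only handles DOCX, not the older DOC format.

```python
import re
import zipfile

# Assumption: optional hyphens appear as a <w:softHyphen/> element
# or as the raw Unicode soft hyphen character (U+00AD).
SOFT_HYPHEN_TAG = re.compile(r"<w:softHyphen\s*/>")
SOFT_HYPHEN_CHAR = "\u00ad"


def strip_optional_hyphens(xml: str) -> str:
    """Remove optional hyphens from a WordprocessingML string."""
    xml = SOFT_HYPHEN_TAG.sub("", xml)
    return xml.replace(SOFT_HYPHEN_CHAR, "")


def clean_docx(src_path, dst_path):
    """Copy a DOCX, stripping optional hyphens from the XML parts under word/."""
    with zipfile.ZipFile(src_path) as src, \
            zipfile.ZipFile(dst_path, "w", zipfile.ZIP_DEFLATED) as dst:
        for item in src.infolist():
            data = src.read(item.filename)
            # Assumption: the XML parts are UTF-8 encoded.
            if item.filename.startswith("word/") and item.filename.endswith(".xml"):
                data = strip_optional_hyphens(data.decode("utf-8")).encode("utf-8")
            dst.writestr(item, data)
```

A call such as `clean_docx("article.docx", "article_clean.docx")` writes a cleaned copy and leaves the original alone, so you can compare the two before feeding the result to Trados.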