Removing Superfluous Tags Using CodeZapper

Removing Superfluous Tags Using CodeZapper

Reading Time: 1 minute

When you open a document in a CAT tool, you might notice too many superfluous tags; such files are usually converted from an PDF sources.

CodeZapper is a macro that prepares Word documents for translation. It is designed to prevent the proliferation of useless formatting tags in translation tools. Really useful for documents converted from PDF.

CodeZapper” is a powerful, easy to use set of Word VBA macros designed to “clean up” Word files before being imported into a standalone translation environment (DVX, memoQ, SDL Studio, TagEditor, Swordfish, OmegaT, Wordfast Pro, etc.).

Word documents are often strewn with “rogue codes” or junk tags (so-called “smart tags”, language tags, track changes tags, spellchecker tags, soft hyphenations, scaling and spacing changes, redundant bookmarks, etc.).

This tagged information shows up in the translation grid as spurious codes{1}around{2}, or even in the mid{3}dle of, words, making sentences difficult to read and translate and generally negating many of the productivity benefits of the program.

OCR’d files or PDF converted files are even worse and often end up as a “code soup”.

CodeZapper attempts to safely remove as many of these unnecessary tags as possible while retaining formatting and layout. It can be run at the press of a button on a single file or all the files in a folder in batch mode. It also contains a number of other useful macros (tidying up PDF converted files, temporarily moving bulky pictures out and back into files to speed up import, etc.). No installation is required as such. The template .dot file can be either copied to Word’s startup folder or loaded from the templates and add-ins folder.

CodeZapper website:


Print Friendly, PDF & Email
Spread Knowledge
  • 2
Comments are closed.