You are here: Home » Localization Engineers » Invalid XML characters in a TMX

Invalid XML characters in a TMX

You can use Okabi Olifant for removing invalid XML characters from a TMX translation memory. Click here to download Okapi Olifant.

Some TMX files may come with control characters that are invalid in XML document (often found in RTF-generated style sheet section). When one or more characters is in the TMX file, an error “hexadecimal value 0xHH is an invalid character” occurs when opening or importing the TMX file.

To open such TMX file in Olifant:

  1. Select the Import command from the File menu.
  2. Select the path of the TMX file to import.
  3. Click Open.
  4. An TMX Import Options dialog box opens.
  5. Make sure the option Check for invalid characters is set.
  6. Click OK.

A temporary copy of the file is created, where each invalid XML character will be replaced by _#xHHHH_ where HHHH is the Unicode hexadecimal value of the character. Then the temporary file is open. Note that these characters are left like this after: Olifant does not convert them back automatically when saving the file. You have to decide yourselves what you want to do with them.

You should use this option only when needed, as it increases the time it takes to open the file.

 

Source: http://okapi.sourceforge.net/Release/Olifant/Help/howtos.htm

 

Print Friendly

Comments are closed.

Scroll To Top