Search code examples
internationalizationlocale

Special Characters in non english languages


How can I generate a text file containing all character codes in a specific locale for example 1029 Czech. I basically want to generate a list of every character that exists in their alphabet?


Solution

  • If this is a Windows locale, just generate a file with all bytes from 32 to 255: The byte codes for the locales are the same; they just interpret each byte differently. See this page for links.

    Example: The byte 0xa5 stands for "Ą" in the Czech codepage (1250) while it's "¥" in the German codepage (1252 a.k.a ISO-Latin-1).

    [EDIT] Note that this only works for pre-Unicode locales where one byte maps to exactly one character. It doesn't work for any Asian locale which need two or more bytes per character.