Common Character Sets

Top  Previous  Next

The following are common character sets that can be used in message scanning to determine messages created in different languages.

 

Western (ISO-8859-1)

Western (Windows-1252)

Central European (ISO-8859-2)

Central European (Windows-1250)

Japanese (Shift_JIS)

Japanese (EUC-JP)

Japanese (ISO-2022-JP)

Traditional Chinese (Big5)

Traditional Chinese (EUC-TW)

Simplified Chinese (GB2312)

Simplified Chinese (HZ)

Korean (EUC-KR)

Korean (ISO-2022-KR)

Cyrillic (KOI8-R)

Cyrillic (ISO-8859-5)

Cyrillic (Windows-1251)

Greek (ISO-8859-7)

Greek (Windows-1253)

Turkish (ISO-8859-9)

Unicode (UTF-8)

Unicode (UCS-2)

Unicode (UTF-7)