You can test below various charsets in use today by many computers & browsers around the world. Most of them will require you to have certain Unicode-compatible fonts like Arial Unicode MS (included with Microsoft Office 2000 & higher) & Tahoma. I also recommend to view most of them under a recent browser like Microsoft Internet Explorer 5.0 or Netscape Navigator 4.7 or higher, which have a wider charset support as shown on the table below; I also recommend that you run a recent operating system like Microsoft Windows 2000 or Apple Macintosh OS 9 or higher, which especially have better support for Unicode, Bidirectionality, & CJK charsets. Some of them will display correctly only on Internet Explorer, while some will only display on Netscape. A few of them will also display on other third-party browsers listed below at the end of this page.
I don’t include here any samples of the 8-bit IBM EBCDIC charsets used by IBM mainframes (and supported by some other operating systems, like Windows 2000), because I don’t have access to such machines or don’t have access to any HTML editor that can save to EBCDIC format instead of ASCII (7-bit single-byte charsets), ANSI (Windows-125x, ISO-8859 series, X-Mac-series, & other 8-bit charsets), or Unicode (7-bit, 8-bit, & 16-bit). (EBCDIC files are much more difficult to show in a browser because they are not derived from ASCII like all ones listed in the table below, so their HTML tags must be written according to EBCDIC, not ASCII as we are accustomed.)
All of these charsets begin with the basic 7-bit US-ASCII area (C0 control bytes 00-1F & character bytes 20-7F), which is not shown because of character repetition. ISO-8859, ISO-IR, & ISCII charsets follow the 7-bit US-ASCII area with the C1 Controls area (bytes 80-9F) & then with their own proprietary characters defined from Unicode. Other charsets (like Windows-125x, X-Mac-series, & IBMxxx) follow US-ASCII immediately with their proprietary characters, with no provision for C1 Controls.
Chinese, Japanese, & Korean charsets (collectively called CJK) are a special case of charsets: because each of them involves thousands of characters, they use special rules to make up characters outside US-ASCII that don’t fit on a standard 8-bit system like ISO-8859. All of them (except ISO-2022 & HZ) follow US-ASCII with a series of double-byte 16-bit characters made up by combining 8-bit Header Bytes with 8-bit Trailer Bytes. EUC (which stands for Extended Unix Code) and MacKorean & MacChineseSimp use bytes A1-FE as both header & trailer bytes (Japanese also adds header 8E for half-width Kana); Big5 & MacChineseTrad use bytes A1-FE as headers and 40-7E & A1-FE as trailers; Shift-JIS & MacJapanese use 81-9F & E0-FC as headers, 40-7E & 80-FC as trailers, and also provide for 8-bit half-width Kana through single bytes A0-DF (with no preceding headers); UHC uses 81-FD (skipping C9) as headers and 41-5A, 61-7A, & 81-FE as trailers (no provision for half-width Hangeul letters); Johab, being more difficult to describe, uses 84-F9 (with several gaps) as headers and 31-7E & 81-FE as trailers; and the GB family, being more character-complete, uses 81-FE as headers and 40-7E & 80-FE as trailers, thus covering all known CJK Ideographs (also known as Hanzi in Chinese, Kanji in Japanese, & Hanja in Korean).
ISO-2022 charsets, intended for use on electronic mail messages (which are mostly handled by 7-bit systems), use 7-bit byte strings & escape/shift sequences to generate 14-bit characters that follow after US-ASCII. Japanese JIS also uses 7-bit strings & escape sequences to generate 7-bit half-width Kana through single bytes 21-5F. Chinese HZ, also intended for e-mail messages, uses 7-bit strings & tilde/brace sequences (~{ & ~}) instead of escape/shift sequences to generate 14-bit characters.
Finally, Unicode involves four distinct transformations: a 7-bit mail-safe one, a standard 8-bit Web-safe one, and two 16-bit versions that differ only in byte-ordering (Little-Endian & Big-Endian). The 7-bit UTF-7 uses Base64 strings (beginning with byte 2B [+] and ending with byte 2D [-]) to generate all characters & controls defined in Unicode; the 8-bit UTF-8 follows US-ASCII characters with a series of double-byte (16-bit) & triple-byte (24-bit) characters using bytes C0-DF as 16-bit headers, E0-EF as 24-bit headers, & 80-BF as 16-bit trailers & 24-bit middles & trailers; and the 16-bit UTF-16 (in its Little- & Big-Endian versions) is purely 16-bit (unlike ASCII, which is 7-bit, and unlike ISO-8859, which is 8-bit), ranging from 0000-FFFF (yes, two bytes per each single character among 65,536 possible ones).
Users of Macintosh OS X: you may also want to test these charsets with the new Apple Safari browser (included with MacOS X Jaguar & higher; also available for download at the link) & then tell me about your results.
Windows & Internet Explorer users: many of the charsets below will require you to have installed the necessary files to view them correctly. These files are mainly NLS files that are installed on your WINDOWS/SYSTEM or WINNT/SYSTEM32 folder, depending on your version. Windows 95, 98 & Me include NLS files for some of them; the rest are available from Internet Explorer’s Language Packs, from Charset Decoding, & from the National Language Support (NLS) Files link on this site. Windows 2000 includes NLS files for all of them except ISO-8859-13, for which there is an NLS file also available at the previous link. Windows XP & Server 2003 include NLS files for all of them.
Character Set Test Pages | MIME Name | Windows Codepage | Supported Browsers | ||||||||
---|---|---|---|---|---|---|---|---|---|---|---|
Internet Explorer | Netscape Navigator | Opera Browser | Apple Safari | Mozilla | Other Browsers | ||||||
Win | Mac | Win | Mac | Win | Mac | ||||||
For each Supported Browser cell below: Version # indicates the minimum version I’m aware of that supports the charset (the + means that later versions still support it); No: indicates that it is not supported by the browser; and ?: indicates that I don’t know whether or not it is supported by the browser. For the case of Other Browsers, the cells may contain notes about certain browsers that support certain charset(s), or simply a ?. If you know of any older versions of each browser that support certain charset(s), or for version or support corrections, you are welcome to tell me. | |||||||||||
American/Western European Charsets | |||||||||||
American/Western European (Latin 1-IBM) | ibm850 | 850 | 5.0+ | 5.1+ | 6.0+ | 6.0+ | No | No | ? | 0.9+ | Ximian Galeon (Linux) |
American/Western European (Latin 1-ISO) | iso-8859-1, latin1 | 28591 | 3.0+ | 5.1+ | 3.0+ | 3.0+ | 7.0+ | 7.0+ | 1.0+ | 0.9+ | Ximian Galeon (Linux), Alis Tango 3.0 (Win3.1/95/98 only) |
American/Western European (Latin 1-Windows) | windows-1252 | 1252 | 3.0+ | 5.1+ | 3.0+ | 4.7+ | 7.0+ | ? | ? | 0.9+ | Ximian Galeon (Linux), Alis Tango 3.0 (Win3.1/95/98 only) |
American/Western European (Roman-Macintosh) | macintosh, x-mac-roman‡ | 10000 | 5.0+ | 5.1+ | 6.0+ | 3.0+ | No | ? | 1.0+ | 0.9+ | Ximian Galeon (Linux) |
Celtic (Latin 8) | iso-8859-14, latin8 | 28604 | No | No | 6.0+ | 6.0+ | 7.0+ | 7.0+ | ? | 0.9+ | Ximian Galeon (Linux) |
French Canadian (IBM) | ibm863 | 863 | 6.0+ | ? | No | No | No | No | ? | No | ? |
Icelandic (IBM) | ibm861 | 861 | 5.0+ | ? | No | No | No | No | ? | No | ? |
Icelandic (Macintosh) | x-mac-icelandic | 10079 | 5.0+ | 5.1+ | 6.0+ | 6.0+ | No | ? | 1.0+ | 0.9+ | Ximian Galeon (Linux) |
New Western European (Latin 9) | iso-8859-15, latin9 | 28605 | 5.0+ | ? | 6.0+ | 6.0+ | 7.0+ | 7.0+ | ? | 0.9+ | Ximian Galeon (Linux) |
New Western European (OEM Latin I) | ibm00858, PC-Multilingual-850+Euro | 858 | 5.0+ (2000+ only) | ? | No | No | No | No | ? | No | ? |
Portuguese (IBM) | ibm860 | 860 | 6.0+ | ? | No | No | No | No | ? | No | ? |
Arabic/Farsi/Urdu Charsets | |||||||||||
Arabic (ASMO) | asmo-708 | 708 | 4.0+ | ? | No | No | 7.0+* | 7.0+* | ? | No | Alis Tango 3.0 (Win3.1/95/98 only) |
Arabic (ASMO-Transparent) | asmo-720, dos-720 | 720 | 4.0+ | ? | No | No | No | No | ? | No | ? |
Arabic (IBM) | ibm864 | 864 | 6.0+ | 5.1+ | 6.0+ | 6.0+ | No | No | ? | 0.9+ | Ximian Galeon (Linux) |
Arabic (ISO) | iso-8859-6, arabic | 28596 | 4.0+ | 5.1+ | 6.0+ | 6.0+ | 7.0+ | 7.0+ | 1.0+ | 0.9+ | Ximian Galeon (Linux), Alis Tango 3.0 (Win3.1/95/98 only) |
Arabic/Farsi/Urdu (Windows) | windows-1256 | 1256 | 4.0+ | 5.1+ | 6.0+ | 6.0+ | 7.0+ | ? | ? | 0.9+ | Ximian Galeon (Linux), Alis Tango 3.0 (Win3.1/95/98 only) |
Arabic/Urdu (Macintosh) | x-mac-arabic | 10004 | 5.0+ | 5.1+ | 6.0+* 7.0+ | 6.0+ | No | ? | 1.0+ | 0.9+ | Ximian Galeon (Linux) |
Farsi (ISIRI) | isiri-3342 | None yet | No | No | No | No | No | No | ? | No | Alis Tango 3.0 (Win3.1/95/98 only) |
Farsi (Macintosh) | x-mac-farsi | 10014? | No | No | 6.0+* | ? | No | No | ? | 0.9+* | ? |
Armenian Charsets | |||||||||||
Armenian (ArmSCII) | armscii-8 | None yet | No | No | 6.0+ | 6.0+ | No | No | ? | 0.9+ | Ximian Galeon (Linux) |
Baltic Charsets | |||||||||||
Baltic (Latin 7-IBM) | ibm775 | 775 | 5.0+ | 5.1+ | No | No | No | No | ? | No | ? |
Baltic (Latin 7-ISO) | iso-8859-13, latin7 | 28603 | 6.0+ | ? | 6.0+ | 6.0+ | 7.0+ | 7.0+ | ? | 0.9+ | Ximian Galeon (Linux) |
Baltic (Latin 7-Windows) | windows-1257 | 1257 | 4.0+ | 5.1+ | 4.7+ | 4.7+ | 7.0+ | ? | ? | 0.9+ | Ximian Galeon (Linux) |
North European/Baltic (Latin 4) | iso-8859-4, latin4 | 28594 | 4.0+ | 5.1+ | 4.7+ | 4.7+ | 7.0+ | 7.0+ | 1.0+ | 0.9+ | Ximian Galeon (Linux) |
Central/Eastern European Charsets | |||||||||||
Baltic/Central European (Macintosh) | x-mac-ce | 10029 | 5.0+ | 5.1+ | 6.0+ | 3.0+ | No | ? | 1.0+ | 0.9+ | Ximian Galeon (Linux) |
Central/Eastern European (Latin 2-IBM) | ibm852 | 852 | 4.0+ | 5.1+ | 6.0+ | 6.0+ | No | No | ? | 0.9+ | Ximian Galeon (Linux) |
Central/Eastern European (Latin 2-ISO) | iso-8859-2, latin2 | 28592 | 3.0+ | 5.1+ | 3.0+ | 3.0+ | 7.0+ | 7.0+ | 1.0+ | 0.9+ | Ximian Galeon (Linux), Alis Tango 3.0 (Win3.1/95/98 only) |
Central/Eastern European (Latin 2-Windows) | windows-1250 | 1250 | 3.0+ | 5.1+ | 3.0+ | 3.0+ | 7.0+ | ? | ? | 0.9+ | Ximian Galeon (Linux), Alis Tango 3.0 (Win3.1/95/98 only) |
Croatian (Macintosh) | x-mac-croatian | 10082 | 6.0+ | 5.1+ | 6.0+ | 6.0+ | No | ? | 1.0+ | 0.9+ | Ximian Galeon (Linux) |
Romanian (Latin 10) | iso-8859-16, latin10 | 28606 | No | No | 7.0+ | 7.0+ | 7.0+ | 7.0+ | ? | 0.9+ | Ximian Galeon (Linux) |
Romanian (Macintosh) | x-mac-romanian | 10010 | 6.0+ | ? | 6.0+ | 6.0+ | No | ? | 1.0+ | 0.9+ | Ximian Galeon (Linux) |
Cyrillic Charsets | |||||||||||
Cyrillic (ECMA) | iso-ir-111 | None yet | No | No | 6.0+ | 6.0+ | No | No | ? | 0.9+ | Ximian Galeon (Linux) |
Cyrillic (ISO) | iso-8859-5, cyrillic | 28595 | 3.0+ | 5.1+ | 3.0+ | 3.0+ | 7.0+ | 7.0+ | 1.0+ | 0.9+ | Ximian Galeon (Linux), Alis Tango 3.0 (Win3.1/95/98 only) |
Cyrillic (Windows) | windows-1251 | 1251 | 3.0+ | 5.1+ | 3.0+ | 3.0+ | 7.0+ | ? | ? | 0.9+ | Ximian Galeon (Linux), Alis Tango 3.0 (Win3.1/95/98 only) |
Cyrillic OEM (IBM) | ibm855 | 855 | 6.0+ | ? | 6.0+ | 6.0+ | No | No | ? | 0.9+ | Ximian Galeon (Linux) |
Russian Cyrillic (IBM) | ibm866 | 866 | 4.0+ | 5.1+ | 4.7+* 6.0+ | 4.7*+ 6.0+ | 7.0+ | ? | ? | 0.9+ | Ximian Galeon (Linux) |
Russian Cyrillic (KOI) | koi8-r | 20866 | 4.0+ | 5.1+ | 3.0+* 6.0+ | 3.0+* 6.0+ | 7.0+ | 7.0+ | ? | 0.9+ | Ximian Galeon (Linux) |
Russian Cyrillic (Macintosh) | x-mac-cyrillic | 10007 | 5.0+ | 5.1+ | 6.0+ | 3.0+ | No | ? | 1.0+ | 0.9+ | Ximian Galeon (Linux) |
Ukrainian Cyrillic (KOI) | koi8-u, koi8-ru | 21866 | 4.0+ | 5.1+ | 6.0+ | 6.0+ | 7.0+ | 7.0+ | ? | 0.9+ | Ximian Galeon (Linux) |
Ukrainian Cyrillic (Macintosh) | x-mac-ukrainian | 10017 | 6.0+ | 5.1+ | 6.0+ | 6.0+ | No | ? | 1.0+ | 0.9+ | Ximian Galeon (Linux) |
Georgian Charsets | |||||||||||
Georgian (GeoSTD) | geostd8 | None yet | No | No | 7.0+* | ? | ? | ? | ? | 0.9+* | ? |
Greek Charsets | |||||||||||
Greek (IBM) | ibm737 | 737 | 5.0+ | 5.1+ | No | No | No | No | ? | No | ? |
Greek (ISO) | iso-8859-7, greek | 28597 | 3.0+ | 5.1+ | 3.0+ | 3.0+ | 7.0+ | 7.0+ | 1.0+ | 0.9+ | Ximian Galeon (Linux) |
Greek (Macintosh) | x-mac-greek | 10006 | 5.0+ | 5.1+ | 6.0+ | 3.0+ | No | ? | 1.0+ | 0.9+ | Ximian Galeon (Linux) |
Greek (Windows) | windows-1253 | 1253 | 3.0+ | 5.1+ | 4.0+ | 4.0+ | 7.0+ | ? | ? | 0.9+ | Ximian Galeon (Linux) |
Greek Modern (IBM) | ibm869 | 869 | 6.0+ | ? | No | No | No | No | ? | No | ? |
Hebrew Charsets | |||||||||||
Hebrew (IBM) | ibm862, dos-862 | 862 | 4.0+ | ? | 6.0+ | 6.0+ | No | No | ? | 0.9+ | Ximian Galeon (Linux) |
Hebrew (ISO-Logical) | iso-8859-8-i, logical | 38598 | 4.0+ | 5.1+ | 6.0+ | 6.0+ | 7.0+ | 7.0+ | 1.0+ | 0.9+ | Ximian Galeon (Linux), Alis Tango 3.0 (Win3.1/95/98 only) |
Hebrew (ISO-Visual) | iso-8859-8, visual | 28598 | 4.0+ | 5.1+ | 6.0+ | 6.0+ | 7.0+ | 7.0+ | 1.0+ | 0.9+ | Ximian Galeon (Linux), Alis Tango 3.0 (Win3.1/95/98 only) |
Hebrew (Macintosh) | x-mac-hebrew | 10005 | 5.0+ | 5.1+ | 6.0+* 7.0+ | 6.0+ | No | ? | 1.0+ | 0.9+ | Ximian Galeon (Linux) |
Hebrew (Windows) | windows-1255 | 1255 | 4.0+ | 5.1+ | 6.0+ | 6.0+ | 7.0+ | ? | ? | 0.9+ | Ximian Galeon (Linux), Alis Tango 3.0 (Win3.1/95/98 only) |
Indic Charsets | |||||||||||
Assamese (ISCII) | x-iscii-as | 57006 | 5.0+ (2000+ only) | No | No | No | No | No | ? | No | ? |
Bengali (ISCII) | x-iscii-be | 57003 | 5.0+ (2000+ only) | No | No | No | No | No | ? | No | ? |
Devanagari (ISCII) | x-iscii-de | 57002 | 5.0+ (2000+ only) | No | No | No | No | No | ? | No | ? |
Devanagari (Macintosh) | x-mac-devanagari | 100?? | No | No | 6.0+* | ? | No | No | ? | 0.9+* | Ximian Galeon (Linux)* |
Gujarati (ISCII) | x-iscii-gu | 57010 | 5.0+ (2000+ only) | No | No | No | No | No | ? | No | ? |
Gujarati (Macintosh) | x-mac-gujarati | 100?? | No | No | 6.0+* | ? | No | No | ? | 0.9+* | Ximian Galeon (Linux)* |
Gurmukhi (ISCII) | x-iscii-pa | 57011 | 5.0+ (2000+ only) | No | No | No | No | No | ? | No | ? |
Gurmukhi (Macintosh) | x-mac-gurmukhi | 100?? | No | No | 6.0+* | ? | No | No | ? | 0.9+* | Ximian Galeon (Linux)* |
Kannada (ISCII) | x-iscii-ka | 57008 | 5.0+ (2000+ only) | No | No | No | No | No | ? | No | ? |
Malayalam (ISCII) | x-iscii-ma | 57009 | 5.0+ (2000+ only) | No | No | No | No | No | ? | No | ? |
Oriya (ISCII) | x-iscii-or | 57007 | 5.0+ (2000+ only) | No | No | No | No | No | ? | No | ? |
Tamil (ISCII) | x-iscii-ta | 57004 | 5.0+ (2000+ only) | No | No | No | No | No | ? | No | ? |
Telugu (ISCII) | x-iscii-te | 57005 | 5.0+ (2000+ only) | No | No | No | No | No | ? | No | ? |
Japanese Charsets | |||||||||||
Japanese (EUC) | euc-jp | 51932 | 4.0+ | 5.1+ | 3.0+ | 3.0+ | 7.0+ | 7.0+ | 1.0+ | 0.9+ | Ximian Galeon (Linux), Alis Tango 3.0 (Win3.1/95/98 only), NJStar Asian Explorer |
Japanese (ISO/JIS) | iso-2022-jp, JIS_X0208-1983 | 50220 | 4.0+† | 5.1+ | 3.0+ | 3.0+ | 7.0+ | 7.0+ | 1.0+ | 0.9+ | Ximian Galeon (Linux), Alis Tango 3.0 (Win3.1/95/98 only), NJStar Asian Explorer |
Japanese (ISO/JIS-2) | iso-2022-jp-2 | None yet | No | No | 6.0+^ | 6.0+^ | No | No | ? | 0.9+^ | ? |
Japanese (JIS-Allow 1-byte Kana) | _ISO-2022-JP$ESC, csISO2022JP | 50221 | 4.0+† | 5.1+ | 6.0+ | 6.0+ | 7.0+^ | 7.0+^ | 1.0+ | 0.9+ | NJStar Asian Explorer |
Japanese (JIS-Allow 1-byte Kana, SO/SI) | _ISO-2022-JP$SIO | 50222 | 4.0+† | 5.1+ | No | No | 7.0+^ | 7.0+^ | 1.0+ | No | NJStar Asian Explorer |
Japanese (JIS-Extended) | JIS_X0212-1990 | 20932 | ** | ** | No | No | No | No | ? | No | ? |
Japanese (Macintosh) | x-mac-japanese | 10001 | 5.0+ | 5.1+ | No | ? | No | ? | 1.0+ | No | ? |
Japanese (ShiftJIS) | shift_jis | 932 | 4.0+ | 5.1+ | 3.0+ | 3.0+ | 7.0+ | 7.0+ | 1.0+ | 0.9+ | Ximian Galeon (Linux), Alis Tango 3.0 (Win3.1/95/98 only), NJStar Asian Explorer |
Korean Charsets | |||||||||||
Korean (EUC) | euc-kr | 51949 | 4.0+ | 5.1+ | 3.0+ | 3.0+ | 7.0+ | 7.0+ | ? | 0.9+ | Ximian Galeon (Linux), NJStar Asian Explorer |
Korean (ISO) | iso-2022-kr | 50225 | 4.0+† | 5.1+ | 4.0+ (not 6.x) | 4.0+ (not 6.x) | No | No | ? | 0.9+ | Ximian Galeon (Linux), Alis Tango 3.0 (Win3.1/95/98 only), NJStar Asian Explorer |
Korean (Johab) | johab, x-johab‡ | 1361 | 5.0+ | ? | 6.0+* 7.0+ | 6.0+* 7.0+ | No | No | ? | 0.9+ | Ximian Galeon (Linux) |
Korean (Macintosh) | x-mac-korean | 10003 | 5.0+ | 5.1+ | No | ? | No | ? | 1.0+ | No | ? |
Korean (UHC) | ks_c_5601-1987, ksc5601 | 949 | 4.0+ | 5.1+ | 3.0+ | 3.0+ | 7.0+ | 7.0+ | 1.0+ | 0.9+ | Ximian Galeon (Linux), Alis Tango 3.0 (Win3.1/95/98 only), NJStar Asian Explorer |
North European/Nordic Charsets | |||||||||||
Lappish (Sami) | windows-sami-2 | 1259? | No | No | No | No | 7.0+ | ? | ? | No | ? |
Nordic (IBM) | ibm865 | 865 | 6.0+ | ? | No | No | No | No | ? | No | ? |
North European/Nordic (Latin 6) | iso-8859-10, latin6 | 28600 | No | No | 6.0+ | 6.0+ | 7.0+ | 7.0+ | ? | 0.9+ | Ximian Galeon (Linux) |
Simplified Chinese Charsets | |||||||||||
Simplified Chinese (EUC) | euc-cn | 51936 | 5.0+ | 5.1+ | No | No | 7.0+ | 7.0+ | ? | No | NJStar Asian Explorer |
Simplified Chinese (GB18030) | gb18030 | 54936 | 6.0+ | ? | 6.0+ | 6.0+ | 7.0+ | 7.0+ | ? | 0.9+ | Ximian Galeon (Linux) |
Simplified Chinese (GB2312) | gb2312 | 20936 | 5.0+ | ? | 3.0+ | 3.0+ | 7.0+ | 7.0+ | ? | 0.9+ | Ximian Galeon (Linux), Alis Tango 3.0 (Win3.1/95/98 only), NJStar Asian Explorer |
Simplified Chinese (GBK) | gbk | 936 | 4.0+ | 5.1+ | 6.0+ | 6.0+ | 7.0+ | 7.0+ | 1.0+ | 0.9+ | Ximian Galeon (Linux), Alis Tango 3.0 (Win3.1/95/98 only), NJStar Asian Explorer |
Simplified Chinese (HZ) | hz-gb-2312 | 52936 | 4.0+¶ | 5.1+ | 4.7+¶ | 4.7+ | 7.0+ | 7.0+ | 1.0+ | 0.9+ | Ximian Galeon (Linux), Alis Tango 3.0 (Win3.1/95/98 only), NJStar Asian Explorer |
Simplified Chinese (ISO) | iso-2022-cn | 50227 | **† | ** | 7.0+ | 7.0+ | 7.0+ | 7.0+ | ? | 1.0+ | NJStar Asian Explorer |
Simplified Chinese (ISO-Extended) | iso-2022-cn-ext | None yet | No | No | 7.0+^ | 7.0+^ | 7.0+^ | 7.0+^ | ? | 1.0+^ | ? |
Simplified Chinese (Macintosh) | x-mac-chinesesimp | 10008 | 5.0+ | 5.1+ | No | ? | No | ? | 1.0+ | No | ? |
South European/Turkish Charsets | |||||||||||
South European (Latin 3) | iso-8859-3, latin3 | 28593 | 4.0+ | 5.1+ | 6.0+ | 6.0+ | 7.0+ | 7.0+ | ? | 0.9+ | Ximian Galeon (Linux) |
Turkish (Latin 5-IBM) | ibm857 | 857 | 5.0+ | 5.1+ | 6.0+ | 6.0+ | No | No | ? | 0.9+ | Ximian Galeon (Linux) |
Turkish (Latin 5-ISO) | iso-8859-9, latin5 | 28599 | 4.0+ | 5.1+ | 3.0+ | 3.0+ | 7.0+ | 7.0+ | 1.0+ | 0.9+ | Ximian Galeon (Linux) |
Turkish (Latin 5-Macintosh) | x-mac-turkish | 10081 | 5.0+ | 5.1+ | 6.0+ | 3.0+ | No | ? | 1.0+ | 0.9+ | Ximian Galeon (Linux) |
Turkish (Latin 5-Windows) | windows-1254 | 1254 | 4.0+ | 5.1+ | 3.0+ | 4.7+ | 7.0+ | 7.0+ | ? | 0.9+ | Ximian Galeon (Linux) |
Thai Charsets | |||||||||||
Thai (ISO) | iso-8859-11 | 28601 | 4.0+ | 5.1+ | No | No | 7.0+ | 7.0+ | ? | No | ? |
Thai (Macintosh) | x-mac-thai | 10021 | 6.0+ | 5.1+ | No | ? | No | ? | 1.0+ | No | ? |
Thai (TIS/Windows) | tis-620, windows-874 | 874 | 4.0+ | 5.1+ | 6.0+ | 6.0+ | 7.0+ | 7.0+ | 1.0+ | 0.9+ | Ximian Galeon (Linux) |
Traditional Chinese Charsets | |||||||||||
Traditional Chinese (Big5) | big5 | 950 | 4.0+ | 5.1+ | 3.0+ | 3.0+ | 7.0+ | 7.0+ | 1.0+ | 0.9+ | Ximian Galeon (Linux), Alis Tango 3.0 (Win3.1/95/98 only), NJStar Asian Explorer |
Traditional Chinese (EUC) | euc-tw, x-euc-tw‡ | 51950 | ** | ** | 3.0+* 6.0+ | 3.0+* 6.0+ | 7.0+ | 7.0+ | ? | 0.9+ | Ximian Galeon (Linux), NJStar Asian Explorer |
Traditional Chinese (HKSCS) | big5-hkscs | 54950 | ** | ** | 6.0+ | 6.0+ | 7.0+* | 7.0+* | ? | 0.9+ | Ximian Galeon (Linux) |
Traditional Chinese (ISO) | iso-2022-tw | 50229 | **† | ** | No | No | No | No | ? | No | ? |
Traditional Chinese (Macintosh) | x-mac-chinesetrad | 10002 | 5.0+ | 5.1+ | No | ? | No | ? | 1.0+ | No | ? |
Unicode Charsets (Further information about Unicode is available at its official site.) | |||||||||||
United States Charsets | |||||||||||
United States (ASCII) | iso-646-us, us-ascii | 20127 | 3.0+• | 3.0+ | 3.0+ | 3.0+ | 7.0+ | 7.0+ | 1.0+ | 0.9+ | Ximian Galeon (Linux), Alis Tango 3.0 (Win3.1/95/98 only) |
United States (OEM) | ibm437 | 437 | 5.0+ | ? | No | No | No | No | ? | No | ? |
Vietnamese Charsets | |||||||||||
Vietnamese (TCVN) | x-viet-tcvn5712 | None yet | No | No | 6.0+ | 6.0+ | No | No | ? | 0.9+ | Ximian Galeon (Linux) |
Vietnamese (VISCII) | viscii | None yet | No | No | 6.0+ | 6.0+ | 7.0+ | 7.0+ | ? | 0.9+ | Ximian Galeon (Linux) |
Vietnamese (VPS) | x-viet-vps, x-vps‡ | None yet | No | No | 6.0+ | 6.0+ | 7.0+ | 7.0+ | ? | 0.9+ | Ximian Galeon (Linux) |
Vietnamese (Windows) | windows-1258 | 1258 | 4.0+ | 5.1+ | 6.0+ | 6.0+ | 7.0+ | ? | ? | 0.9+ | Ximian Galeon (Linux) |
Run by: Leroy Vargas. For feedback related to charsets or this website, Leroy can be contacted through his Lycos address.