Convert Chinese characters to Unicode
This tool will convert Chinese characters (and just about everything else, including Japanese hiragana, katakana, and kanji; tonal Hanyu Pinyin; and Cyrillic script) into the decimal (not hex) form of Unicode numerical character references (NCRs). These NCRs can then be used in Web pages, with greater ease than the Chinese characters themselves for those whose operating systems or other software don't handle double-byte text well.
For example, 台北, the characters for "Taipei"
(Táiběi), can also be written as
台北. For this to work, the "charset" of
the Web page should be set to Unicode:
<meta http-equiv="content-type" content="text/html; charset=utf-8"
/>
It's generally better, however, to use the characters themselves rather than their Unicode NCRs in cases where a Web page has a lot of Chinese text, because Chinese characters take up less file space than their NCRs.
Sample text: 台北 Táiběi.
Input your text:
Adapted from a script by Steve Minutillo.