Convert Chinese characters to Unicode

This tool converts Chinese characters (and just about everything else, including Japanese hiragana, katakana, and kanji; tonal Hanyu Pinyin; and Cyrillic script) into Unicode numeric character references (NCRs) in decimal (not hexadecimal) form. Because NCRs consist only of ASCII characters, they can be easier to use in Web pages than the Chinese characters themselves for those whose operating systems or other software don't handle double-byte text well.

For example, 台北, the characters for "Taipei" (Táiběi), can also be written as &#21488;&#21271;. For this to work, the "charset" of the Web page should be set to Unicode:
<meta http-equiv="content-type" content="text/html; charset=utf-8" />
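The conversion itself is simple enough to sketch in a few lines of Python. This is only an illustration, not the script behind this page: the function name to_decimal_ncrs and the choice to leave plain ASCII untouched are my assumptions.

```python
import html
import unicodedata


def to_decimal_ncrs(text: str) -> str:
    """Replace every non-ASCII character with its decimal NCR (&#N;).

    Assumption: plain ASCII is passed through unchanged. The text is
    normalized to NFC first so that an accented letter such as "á"
    becomes one reference rather than a letter plus a combining mark.
    """
    text = unicodedata.normalize("NFC", text)
    pieces = []
    for ch in text:
        code_point = ord(ch)  # the Unicode code point as an integer
        pieces.append(ch if code_point < 128 else f"&#{code_point};")
    return "".join(pieces)


print(to_decimal_ncrs("台北 Táiběi"))
# &#21488;&#21271; T&#225;ib&#283;i

# Python's built-in "xmlcharrefreplace" error handler does the same job.
print("台北".encode("ascii", "xmlcharrefreplace").decode("ascii"))
# &#21488;&#21271;

# html.unescape() goes the other way, from NCRs back to characters.
print(html.unescape("&#21488;&#21271;"))
# 台北
```

The numbers are simply the code points in base 10: 台 is U+53F0, which is 21488 in decimal, and 北 is U+5317, which is 21271.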

It's generally better, however, to use the characters themselves rather than their Unicode NCRs in cases where a Web page has a lot of Chinese text, because Chinese characters take up less file space than their NCRs: in UTF-8 each of these characters occupies three bytes, whereas an NCR such as &#21488; occupies eight.
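To see the difference in file space for yourself, here is a rough Python check. The helper name byte_sizes is made up for this example, and it assumes the page is saved as UTF-8.

```python
def byte_sizes(text: str) -> tuple:
    """Return (bytes as raw UTF-8, bytes as decimal NCRs) for text."""
    raw = len(text.encode("utf-8"))
    # "xmlcharrefreplace" turns each non-ASCII character into &#N;
    ncr = len(text.encode("ascii", "xmlcharrefreplace"))
    return raw, ncr


print(byte_sizes("台北"))
# (6, 16)
```

For these two characters the NCR form is 16 bytes against 6 for the characters themselves, which adds up quickly on a page with a lot of Chinese text.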

Sample text: 台北 Táiběi.


Adapted from a script by Steve Minutillo.