Cantonese input method for Chinese characters

There’s a new Unicode-based phonetic input method for inputting Chinese characters … using Cantonese: Canto Input.

Here’s the author’s description:

What is it?
CantoInput is a freely available, Unicode-based Chinese input method (IME) which allows you to type both traditional and simplified characters using Cantonese romanization. Both the Yale and Jyutping methods are supported. A Mandarin Pinyin mode is also available.

Why does the world need another Chinese input method?
While there already exist excellent phonetic input methods based on Mandarin Pinyin pronunciation, there is a general lack of support for Cantonese. As a Cantonese learner, I was frustrated by the difficulty of typing Chinese, especially Cantonese-specific colloquial characters. Most existing Cantonese input methods require a Chinese version of Windows and operate using non-Unicode encodings such as BIG5 or GB, while non-phonetic methods such as Cangjie have a very steep learning curve. I originally wrote this program for my own personal use but decided to make it freely available since I felt that other Cantonese speakers and learners might also find it useful. It’s still really basic at this time, but hopefully I’ll have time to impove the interface and add more features in the future.

Those interested in trying this out might find the comments on Chinese Forums useful.

May Fourth remembered

Today is the 87th anniversary of the demonstrations in Beijing that marked the beginning of what is now called the May Fourth Movement. What concerns me here is not the surge in Chinese nationalism (something the present-day PRC — and some would say Taiwan, too — could use rather less of) but the literary revolution that largely overthrew the use of Literary Sinitic (Classical Chinese).

This revolution, though, swift and remarkable as it was, unfortunately remains incomplete today. As Yin Binyong put it:

Ever since the beginnings of the May Fourth movement, many scholars — especially those who support the use of alphabetized writing for Chinese — have all advocated as the main goal of the modern Chinese language standardization movement that spoken and written Chinese should be the same. Unfortunately, this goal has remained primarily a subjective aspiration; as long as Chinese characters continue to be the sole writing system in China, this goal can never be realized. Despite the fact that literary Chinese is no longer used, nevertheless it has been replaced by a half-literary, half-vernacular style of writing, rather than a style based solely on the spoken language.

Even so, the literary movement should not be underestimated. The changes brought — for well or ill — by the introduction several decades later of “simplified” Chinese characters are practically nothing compared with the impact of the overall change from Literary Sinitic to vernacular Mandarin.

A good source of information on the literary aspect of the May Fourth Movement is The Chinese Renaissance, by Hu Shih (Hú Shì, 胡適), one of the main figures in this movement.

Finally, I’d like to direct people to Languagehat’s post yesterday on the somewhat analagous situation with classical Arabic and Arabic vernaculars, a subject I’d love to learn more about.

new MRT signage

David has posted on the inconsistent use of Tongyong Pinyin in the Taipei-area MRT system. I’ve already put a comment there, so I’ll not duplicate everything here.

I spend a lot of time complaining about signage, and my experiences in trying to get some errors in the MRT system corrected have, predictably, been frustrasting. But there is something I do really like: the font for the MRT signage. (See the photos with David’s post.) Does anyone recognize it?

For those of you not in Taiwan, the MRT is the Metropolitan Rapid Transit system for the Taipei area. Most of the system takes the form of a subway. One line, however, is elevated, as is a section of a different line (which also runs on ground level for several miles).

surname-spelling scrap

Danwei has picked up on a story of someone in China with the surname of Xiè being issued an air ticket under the name Jiě. The reason behind the mixup is that the character used for this woman’s name, 解, is most often pronounced “jie,” as in jiěfàng (liberate; emancipate), jiějué (solve; resolve; settle), liǎojiě (understand; comprehend; find out; acquaint oneself with), and jiěshì (expound; interpret; analyze). Thus, it is but one of the many Chinese characters that has more than one pronunciation.

When she and some of her relatives went to the travel agency to get the matter cleared up, however, an argument broke out. Before long, people from the travel agency were using poles to beat the family.

(Maybe not my strongest entry, but there was no way I was going to pass up a chance to post on a story titled “Is personal safety another argument for Chinese romanization?”)

sources:

early ‘universal’ romanization system

No-Sword brings up Karl Richard Lepsius’s early, IPA-like system, with Matt linking to Google Print’s online edition of Standard Alphabet for Reducing Unwritten Languages and Foreign Graphic Systems to a Uniform Orthography in European Letters.

The book groups Taiwanese, Hakka, and Mandarin — or Hok-lo, Hak-ka, and Mandarinic (my favorite), as it refers to them — under “monosyllabic languages” (grr). OTOH, Tibetan is given as an “isolated language.” Interestingly, Mandarin pronunciation is given following the practice of Nanjing, not Beijing; a similar choice made a couple of hundred years ealier is also part of what’s behind the “Peking” spelling for what is now referred to as Beijing (1 MB PDF).

Taiwanese and alphabetical abbreviations

I’d been working on a post about the cards and miniature magnets given away at Family Mart (Quánjiā / 全家) convenience stores with purchases of at least NT$75 (about US$2). But Jason at Wandering to Tamshui beat me to it yesterday with a post showing all of the cards, so I’ll keep this short.

These are particularly interesting because of the use of Taiwanese as well as several other languages, though everything here is labeled “Yingwen” (English). As Jason wrote, “That faint sound you hear is a thousand foreign English teachers slapping their foreheads in despair.”

The series, labeled Quánmín pīn Yīngwén (全民拼英文), is probably meant to counter rival 7-Eleven’s popular Hello Kitty button series. Although few take on Hello Kitty and live to tell the tale, I think the alphabet cards are doing fairly well.

Below is an example. On the left is the wrapper (pun not intended). Top right shows the front and back of the magnet that comes with this particular card. And at bottom right is the card itself. (I say card; but it’s really just glossy paper.)

Here MG is meant to stand for mai3 ke2 sian1 (as always, help with my spelling would be appreciated), which, despite the use of Chinese characters (嘜假仙), is Taiwanese, not Mandarin. Reading 嘜假仙 as Mandarin yields only nonsense. (So much for the “universality” and “ideographic” myths of Chinese characters.)

photo of promotional item from a convenience store; it uses the Roman alphabet to indicate abbreviations of phrases in Taiwanese

resources:

Ma Ying-jeou speaks of Taipei County adopting Hanyu Pinyin

In another sign that Tongyong Pinyin’s days in Taipei County are numbered (not that Tongyong was ever used here much in the first place), Taipei Mayor (and KMT Chairman) Ma Ying-jeou said this on Tuesday:

mùqián Táiběi Shì yǔ Táiběi Xiàn lù míngzi de pīnfǎ bùyī, Táiběi Shì cǎi Hànyǔ Pīnyīn, Táiběi Xiàn cǎi Tōngyòng Pīnyīn, wèilái kěyǐ tǒngyī yòng Hànyǔ Pīnyīn (目前台北市與台北縣路名的拼法不一,北市採「漢語拼音」,北縣採「通用拼音」,未來可以統一用「漢語拼音」)

Although he didn’t state specifically that Taipei County most definitely will use Hanyu Pinyin (which wouldn’t be his announcement to make), he certainly seems to back that happening. Of course, that’s no big surprise; but I like to chronicle such things anyway.

source: Mǎ Yīngjiǔ: Táiběi Xiàn-Shì hézuò; Yīngwén lù míngzi kěyǐ cǎi Hànyǔ Pīnyīn (馬英九:台北縣市合作 英文路名可採漢語拼音), CNA, April 18, 2006