Taiwan to adopt Hanyu Pinyin!

I don’t have time to write a proper post on this now. But the big news is that Taiwan will adopt Hanyu Pinyin, with Tongyong Pinyin left to wither away.

Here’s the story: Zhōngwén yìyīn cǎi Hànyǔ Pīnyīn, bù bǔzhù Tōngyòng Pīnyīn (??????? ?????), United Daily News, September 17, 2008.

The Forumosa thread on this topic supplies some information for those of you who want to read something on this in English: MOE Approves Hanyu Pinyin as Taiwan Transliteration Policy?

(Thanks, Jidanni, for the heads up.)

software for Shanghainese

Professor Qián Nǎiróng (Qian Nairong / 钱乃荣) of Shanghai University has just issued free software to help with the writing of Shanghainese (上海话). People may now download the 1.3 MB zip file of the program.

Some examples:

shanghe ??
shanghehhehho ???/???????
whangpugang ???
suzouhhu ???
shyti ??????
makshy ??????
bhakxiang ?????
dangbhang ???????
ghakbhangyhou ????????
cakyhangxiang ???????????
linfhakqin ???(????)
dhaojiangwhu ??????
aoshaoxhin ?????????????
ghe ????
kang ????
yin ??????
dia ?
whakji ??

The program offers two flavors of romanization. Here are some examples of the differences between the two styles:

New Folk          Old Timers
makshy            mekshy
bhakxiang         bhekxian
dangbhang         danbhan
ghakbhangyhou     ghakbhanyhou
cakyhangxiang     cekyhanxian
linfhakqin        linfhekqin

Here’s a brief story on this:

Xiànzài, wǒmen zài wǎngluò zhōng liáotiān de shíhou yuèláiyuè duō de péngyou dōu kāishǐ xǐhuan yòng Shànghǎihuà. Dànshì yǒushíhou shìbushì juéde xiǎng biǎodá dehuà bùzhīdào zěnme dǎ, nòng de yǒudiǎn bùlúnbùlèi ne? Xiànzài, yī ge kěyǐ qīngsōng dǎchū Shànghǎihuà de chéngxù chūlai le.

Jīngguò liǎng nián nǔlì, Shànghǎi dàxué Zhōngwénxì Qián Nǎiróng jiàoshòu jí tā de yánjiūshēng hé dādàng zhōngyú yú běnyuè wánchéng le Shànghǎihuà shūrùfǎ de zhìzuò. Zhíde guānzhù de shì, zhè tào shūrùfǎ hái bāokuò xīn-lǎo liǎng ge bǎnběn, 45 suì yǐshàng de lǎo Shànghǎirén hé niánqīng yī dài de Shànghǎirén dōu kěyǐ zhǎodào zìjǐ de “dǎfǎ.”

Háishi tóngyàng 26 ge zìmǔ de jiànpán, 8 yuè 1 rì qǐ xiàzài le Shànghǎihuà shūrùfǎ zhīhòu, nín jiù kěyǐ tōngguò shūrù “linfhakqin” dǎchū “līn wù qīng,” shūrù “dhaojiangwhu” dǎchū “táo jiànghu” děng yuánzhī yuán wèi de Shànghǎihuà le. Zuótiān, jìzhě tíqián xiàzài dào gāi ruǎnjiàn. Ànzhào shǐyòng shuōmíng, yòng quánpīn de fāngshì chángshì shūrù “laoselaosy” zhèxiē zìmǔ, píngmù shàng, lìjí chūxiàn le “lǎo sān lǎo sì” (Shànghǎihuà, yìsi shì “màilǎo, chōng lǎochéng de yàngzi”).

Jùxī, yóuyú Shànghǎihuà yǔ Pǔtōnghuà de dúfǎ yǒusuǒbùtóng, suǒyǐ zài pīnyīn pīnxiě fāngshì shàng háishi xūyào shǐyòng shuōmíng de bāngzhù. Bǐrú jìzhě fāxiàn, fánshì yǔ Pǔtōnghuà shēngmǔ, yùnmǔ xiāngtóng de zì, zài Shànghǎihuà shūrùfǎ zhōng zuìzhōng yòng de háishi Pǔtōnghuà pīnyīn, bùtóng de zé cǎiyòng Shànghǎihuà shūrùfǎ de pīnxiě fāngshì. Rú “chénguāng” de “chén,” “huātou” de “tóu” dōu fāchéng zhuóyīn, Shànghǎihuà pīnyīn shūrùfǎ zhōng yào zài shēngmǔ zhōng jiā yī ge zìmǔ h, pīnchéng “shen,” “dhou;” fánshì rùshēng zì, zé zài pīnyīn hòu jiā zìmǔ k, rú “báixiāng” de “bái” jiù pīnchéng bhek.

Bùguò, dàjiā bùyào juéde tài nán. Jìzhě fāxiàn, Shànghǎihuà shūrùfǎ yǔ Pǔtōnghuà de shūrùfǎ zuìdà xiāngtóng zhī chù zàiyú, zhǐyào liánxù shūrù shēngmǔ hé yùnmǔ jiù kěyǐ, bùxū shūrù shēngdiào. Cǐwài, Shànghǎihuà pīnyīn shūrù xìtǒng háiyǒu lèisì “zhìnéng” yōudiǎn, kěyòng suōlüè fāngshì bǎ cíyǔ pīnxiě chūlai.

Zhǔchí Shànghǎihuà shūrùfǎ kāifā de Shànghǎi dàxué Zhōngwénxì Qián Nǎiróng jiàoshòu gàosu jìzhě, zhè tào shūrùfǎ bùjǐn néng dǎchū Shànghǎihuà dà cídiǎn zhōng 15,000 duō ge cítiáo, érqiě hái néng yòng Shànghǎihuà pīnyīn dǎchū Shànghǎihuà zhōng shǐyòng zhe de, yǔ Pǔtōnghuà cíyì xiāngtóng dàn yǔyīn bùtóng de chángyòng cíyǔ. Rú “Huángpǔ Jiāng” shūrù “whangpugang,” “lǐxiǎng” zéshì lixiang děng, gòngjì 10,000 duō ge cítiáo.

sources:

separating Pinyin syllables: PHP code

A few weeks ago I had someone write to ask if I had a script that can divide Pinyin texts into their individual syllables. It so happens that I do have something that does just that. Since I sent out that bit of code, I might as well make it available to everyone (GNU GPL, and links back to Pinyin.Info are always appreciated).

It has lots of regular expressions, to make the code nice and compact. I’ve added comments for clarity.

##############################
### SEPARATE THE SYLLABLES
##############################
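// In the patterns below, \s matches any whitespace character
// This program assumes that ü is written as v
// The i after the closing delimiter makes a pattern case insensitive
// \W matches a single non-word character (e.g., punctuation)
// Each pattern is wrapped in single quotes, which serve as the regex delimiters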

$search = array ("'([aeiouv])([^aeiounr\W\s])'i", // This line does most of the work
"'(\w)([csz]h)'i", // double-consonant initials
"'(n)([^aeiouvg\W\s])'i", // cleans up most n compounds
"'([aeiuov])([^aeiou\W\s])([aeiuov])'i", // assumes correct Pinyin (i.e., no missing apostrophes)
"'([aeiouv])(n)(g)([aeiouv])'i", // assumes correct Pinyin, i.e. changan = chan + gan
"'([gr])([^aeiou\W\s])'i", // fixes -ng and -r finals not followed by vowels
"'([^e\W\s])(r)'i", // r an initial, except in er
);

$replace = array ("\\1 \\2",
"\\1 \\2",
"\\1 \\2",
"\\1 \\2\\3",
"\\1\\2 \\3\\4",
"\\1 \\2",
"\\1 \\2",
);

// $document holds the Pinyin text to be split; the patterns are applied in order
$usertext = preg_replace($search, $replace, $document);

##############################
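
For anyone who wants to try the patterns quickly, here is a minimal sketch that wraps the same arrays in a function. The function name separate_syllables is my own label for illustration, not something from the original script, and the input is assumed to be correctly spelled Pinyin without tone marks and with ü typed as v.

```php
<?php
// Hypothetical wrapper around the search/replace arrays shown above.
// Assumes toneless, correctly spelled Pinyin with ü written as v.
function separate_syllables(string $document): string
{
    $search = array(
        "'([aeiouv])([^aeiounr\W\s])'i",         // does most of the work
        "'(\w)([csz]h)'i",                       // double-consonant initials
        "'(n)([^aeiouvg\W\s])'i",                // most n compounds
        "'([aeiuov])([^aeiou\W\s])([aeiuov])'i", // vowel-consonant-vowel
        "'([aeiouv])(n)(g)([aeiouv])'i",         // changan = chan + gan
        "'([gr])([^aeiou\W\s])'i",               // -ng and -r finals
        "'([^e\W\s])(r)'i",                      // r as an initial, except in er
    );
    $replace = array(
        "\\1 \\2",
        "\\1 \\2",
        "\\1 \\2",
        "\\1 \\2\\3",
        "\\1\\2 \\3\\4",
        "\\1 \\2",
        "\\1 \\2",
    );
    // With array arguments, preg_replace applies the patterns one after another.
    return preg_replace($search, $replace, $document);
}

echo separate_syllables("Hanyu Pinyin"), "\n"; // Han yu Pin yin
echo separate_syllables("zhongwen"), "\n";     // zhong wen
echo separate_syllables("changan"), "\n";      // chan gan
```

The last call shows the caveat noted in the comments: without an apostrophe, changan is read as chan + gan rather than chang + an.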

Since I’m always going on about the need for word parsing and not separating Pinyin into single syllables, some of you are probably wondering just why I of all people would have ever written such code. The answer is that it’s part of my Pinyin spell-checker, which is only a very basic utility in that it functions by checking for theoretically correct groups of syllables rather than real words (i.e., anything composed of correctly spelled groups of syllables, minus tone marks, will pass even if that word isn’t found in a dictionary).
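
The syllable check described above could be sketched roughly as follows. This is only an illustration under stated assumptions: the function name invalid_syllables and the tiny list of valid syllables are made up for the example, and a real checker would need the full inventory of toneless Mandarin syllables (about 400 of them). The input is assumed to be text that has already been separated into syllables.

```php
<?php
// Hypothetical helper: returns every syllable that is not in the valid set.
// $spaced is text already divided into syllables; $valid maps syllable => anything.
function invalid_syllables(string $spaced, array $valid): array
{
    $bad = array();
    // Split on runs of non-word characters (spaces, punctuation, apostrophes).
    $syllables = preg_split('/\W+/', strtolower($spaced), -1, PREG_SPLIT_NO_EMPTY);
    foreach ($syllables as $syllable) {
        if (!isset($valid[$syllable])) {
            $bad[] = $syllable;
        }
    }
    return $bad;
}

// Tiny illustrative subset only, not the real syllable inventory.
$valid = array_flip(array('han', 'yu', 'pin', 'yin', 'zhong', 'wen', 'tai', 'wan'));

echo implode(', ', invalid_syllables('han yu pim yin', $valid)), "\n"; // pim
```

As with the real utility, anything built from valid syllables passes, whether or not it is an actual word.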

Suggestions for improvements are always welcome.

Ovid Tzeng reiterates backing for Hanyu Pinyin

Earlier this week Ovid Tzeng, a former minister of education and current minister without portfolio, reaffirmed his support for Taiwan adopting Hanyu Pinyin and said that this is an important issue the government will need to deal with sooner or later.

Zēng Zhìlǎng Jiàoyùbù zhǎng rènnèi, jiānchí cǎiyòng Hànyǔ Pīnyīn, shì tā bèi huàn xiàlái de zhǔyīn zhīyī. Tā zuótiān réng bù gǎi qí zhì, qiángdiào guówài bùguǎn Zhōngwén jiàoxué huò xuéshù qīkān, hěn duō yǐjing gǎiyòng Hànyǔ Pīnyīn, Táiwān bùnéng shìruòwúdǔ, zhè suī fēi xīn zhèngfǔ zuì yōuxiān shīzhèng xiàngmù, dànshì yǐ lièwéi wèilái zhòngdà jiǎntǎo shìxiàng.

Most of the source article for this discusses poet and academic Zheng Chouyu’s backing for Hanyu Pinyin. He stresses his view that this is a practical matter, not a political one.

source: Zhèng Chóuyǔ jiànyì: Zhōngwén yìyīn kěyǐ cǎi Hànyǔ Pīnyīn (?????????? ??????), United Daily News, June 9, 2009

further reading: Hanyu Pinyin backer to return to Taiwan’s Cabinet, Pinyin News, April 29, 2008

Burger King, romanization, and a Taiwanese morpheme

Last week I was at my neighborhood shopping mall and saw an interesting ad in the Burger King there. Toward the bottom we find the following:

買套餐, A大獎
(Mǎi tàocān, A dà jiǎng)
(Buy a set meal, score a big prize.)

Here, the Roman letter “A” is used to represent a Taiwanese verb that means something like “get in an easy manner” or “make off with” — though the fine print says that customers just have a chance to get a prize, not that they necessarily will win one.

A is often used in A-qián (“A錢”: to A money), a mixed Taiwanese and Mandarin term that means embezzle/embezzlement.

Perhaps the Ministry of Education has issued an official Chinese character for this morpheme. But even if it has, most people would have no idea how to read it, and it would probably be of spurious origin to boot, just like most of the other characters the ministry has issued. Where a Taiwanese morpheme sounds like the English name of a Roman letter, the romanized form is likely to prevail over the Chinese character.

There are other interesting things about this ad. But I’ll get to those in another post.

Mandarin-language papers in Indonesia

The Jakarta Post recently ran an article titled “Chinese papers attracting younger readers.” That would indeed be encouraging news, given the considerable damage done in Indonesia to Sinitic languages and Chinese culture during their suppression there for most of the second half of the twentieth century.

But the specifics of the article are a little less encouraging.

Since controls against newspapers in Mandarin were eased in 2000, Jakarta has had up to six so-called Chinese newspapers. The number is now down to three, at least one of which is operating in the red.

Bambang Suryono (a.k.a. Lie Zuo Hui), editor in chief of the Yìnní Guójì Rìbào (Indonesian International Daily News, ??????) said that Chinese newspapers in general did not get many young readers or contributors who could write well.

After the reformation era in 2001, Sino-Mandarin courses started to sprout in the city. However, Bambang said new Chinese language students were not yet apt to read Chinese newspapers.

“They can write simple articles about their daily activities, but not on complex subjects.”

For that matter, not many people on his own staff can write in Hanzi.

Out of his editorial staff of 20, there are only four journalists that can write in Chinese, and the youngest is 58 years old. “It’s so hard to find young people that can write in Chinese,” Bambang said.

Sunardi Mulia, the editor-in-chief of a business daily Indonesia Shang Bao, said most of his paper’s readers were above 55 years old.

“They are those who once went to local Chinese schools,” he said. “More and more of those aging readers are passing away.”

Sunardi’s paper has few reporters. “I have to really roll up my sleeves to report large events,” he said. And about 90 percent of the paper’s editorial team are over 50….

“It’s true more young people have begun learning the language. However, it will take time for them to improve because their cultural surrounding is completely different than previous generations.”

“A lot of their parents cannot speak Mandarin or read Chinese anymore,” said Sunardi.

“So their opportunities to practice and communicate with the language are very limited. They probably need 15 years or so before they can read fluently,” said Sunardi, 59.

Sunardi said his paper’s present marketing target was mostly foreigners, specifically businesspeople from China and Taiwan. But he was optimistic about the future for his paper — again, though, not because of people born and bred in Indonesia but because of new readers arriving as immigrants from China.

No. of copies printed per day, according to the editors:

  • Guoji Ribao: 60,000 copies per day, about one-third of those distributed in Jakarta
  • Shang Bao: 10,000 copies per day, most distributed in Jakarta
  • Harian Indonesia Yinni Sin Chew Ribao (formerly government controlled): 55,000 per day

source: Chinese papers attracting younger readers, Jakarta Post, May 15, 2008 (alternate source)

online texts in Hanyu Pinyin

Chris recently wrote and asked for a list of texts in Pinyin. This site, of course, has at least a few things in Pinyin. Unfortunately, however, they can be a bit difficult to find. So having a list is indeed a good idea.

Here are some readings in Hanyu Pinyin:

Some song lyrics

I should probably figure out a way to incorporate this into the recommended readings section. One of the problems with this site is that it has grown much, much larger than I ever expected, which has resulted in some pages not fitting well within the structure I initially established for Pinyin Info. Over the years various additional readings have been added to the site, a few of which are even in Hanyu Pinyin. But since Pinyin Info’s recommended readings section is set up for books rather than essays, songs, etc., this will involve a rethinking of that page.

I very much hope people can help expand the list by providing links to readings elsewhere in Pinyin. But before listing something in the comments, please make sure it is in real Hanyu Pinyin (e.g., with word parsing instead of bro ken syl la bles, with tone marks instead of tone numbers, and with proper capitalization and punctuation). Alas, most texts that are supposedly in Pinyin do not follow those rules.
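
Before submitting a link, a rough automated first pass is possible for two of those criteria. The sketch below is only a heuristic and all of its names are hypothetical: it flags text that appears to use tone numbers or that contains no tone-marked vowels at all. It cannot judge word parsing, capitalization, or punctuation, and it will produce false positives on things like "MP3." The file is assumed to be saved as UTF-8.

```php
<?php
// Hypothetical heuristic: returns a list of warnings for a candidate text.
function pinyin_warnings(string $text): array
{
    $warnings = array();
    // Letters immediately followed by a single digit 1-5, e.g. "pin1yin1".
    if (preg_match('/[a-zü]+[1-5](?![0-9])/iu', $text)) {
        $warnings[] = 'tone numbers';
    }
    // No vowel carrying a tone mark anywhere in the text.
    if (!preg_match('/[āáǎàēéěèīíǐìōóǒòūúǔùǖǘǚǜ]/iu', $text)) {
        $warnings[] = 'no tone marks';
    }
    return $warnings;
}

echo implode(', ', pinyin_warnings('Han4yu3 Pin1yin1')), "\n"; // tone numbers, no tone marks
echo implode(', ', pinyin_warnings('Hanyu Pinyin')), "\n";     // no tone marks
```

A text that passes both checks still needs a human look to confirm that it is written in real words rather than separate syllables.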