Spreading the good news

Behold, I bring you good tidings.

As I keep having to note, most of the things that are supposedly in Pinyin are terrible. This is not because Pinyin itself is inherently poor or difficult. It’s because most people who produce such things have a fundamental lack of understanding of Pinyin as a system. (And, yes, that includes most users in China.) So it is with amazement that I report today on a journal that not only offers dozens of pages in Hanyu Pinyin — good Hanyu Pinyin — but does so twice every month. It’s also well worth noting that the journal is aimed primarily at adult native speakers of Mandarin, not foreigners trying to pick up the language, though certainly it could also be read by people in the latter group.

From what I’ve seen so far, this journal gets right the things most commonly written incorrectly elsewhere, including:

And it doesn’t use the atrocious ɑ that some people mistakenly believe is required either.

Unfortunately, punctuation and alphanumerics are not included in the Pinyin. But other than that there’s very little that doesn’t follow standard Pinyin orthography, the main exception being the indication of the tone sandhi related to the special cases of and , (e.g., the journal gives “bú shì” and “búdà” instead of the standard “bù shì” and “bùdà,” and “yìhuíshì” and “yí wèi” instead of the standard “yīhuíshì” and “yī wèi“). That said, though, tone changes related to yi and bu can be something of a pain. So although this isn’t standard, I can see why it was done and am not entirely unsympathetic to this approach.

Here are a few sample lines (click to enlarge):
screenshot of some text in the journal, showing text in simplified Chinese characters with word-parsed Hanyu Pinyin above the Hanzi. Note: Yifusuoshu/以弗所書 = Ephesians

It would be nice if this were in Unicode, to help aid searches and cutting and pasting. The text, however, appears to have been made in a system devised years ago by the people at the journal. Regardless, I’m happy to see the Pinyin.

Overall, despite the lamentable absence of punctuation and Arabic numerals in the Pinyin, this is quality work, which is perhaps all the more remarkable in that the Pinyin and simplified Hanzi edition of this journal is not truly free to circulate in the land of its target audience. That’s because its publishers are Jehovah’s Witnesses, a group suppressed by the PRC (though it appears that at least at the moment their sites are not blocked by the great firewall). The journal, Shǒuwàngtái, may be more familiar to you by its English name: Watchtower. Whatever you might think of Jehovah’s Witnesses, I hope you’ll recognize the considerable accomplishment of those who put together this publication.

Getting to the Jehovah’s Witnesses Web pages that link to Shǒuwàngtái can be tricky. (Go to the magazines page, select “Chinese (Simplified)” for the language; then choose the month and file with Pinyin.) So I’m providing direct links to some documents below:

I haven’t found any Pinyin editions other than those. Perhaps old ones are taken offline.

Rénrén Dōu Xūyào Zhīdao De Hǎo Xiāoxi (I'd prefer 'de' instead of 'De' -- but that's no big deal) 人人都需要知道的好消息

With thanks to Victor Mair.

Bing Maps for Taiwan

The maps of Taiwan put out by GooGle are plagued with errors in their use of Pinyin. But what about that other big company with deep pockets? You know: Microsoft. How good a job does Microsoft’s Bing do with its maps of Taiwan?
map of Taiwan from Bing, showing Wade-Giles place names

I won’t keep y’all waiting: After examining Bing’s maps of Taiwan the two words that came first to mind were incompetent and atrocious.

The country-level map is odd, offering Wade-Giles. And although the use of the hyphen is irregular, I will give Bing points for getting at least Wade-Giles’ apostrophes right. So, although some place names on the map are decades out of date (e.g., Hsin-chuang, Chungli, Chunan, Kuang-fu), at least they’re not horribly misspelled within that system.

It’s at the street level that Bing’s weirdness becomes most apparent. For example, below is part of Bing’s map of Banqiao.

I added the highlighting.

click for larger map

This tiny but representative fragment of the map has not one but four romanization systems:

  • MPS2: Gung Guang, Min Chiuan, Shin Fu (Even within MPS2, none of those should have spaces or extra capital letters.)
  • Hanyu Pinyin: Banqiao (This is the only properly written place name on this map fragment.)
  • Tongyong Pinyin: Jhancian, Sianmin, Sin Jhan
  • Gwoyeu Romatzyh(!): Shinjann (This is the same road as the one marked “Sin Jhan”. In Hanyu Pinyin, which is what officially should be used here, this is written “Xinzhan”.)

A few more points about this small fragment of the map:

  • Wen Hua could be either MPS2 or Hanyu Pinyin, but not Tongyong Pinyin. And it should be Wenhua.
  • Minan is missing an apostrophe. (It should be Min’an.)
  • Banchiao is just wrong, regardless of the system. They were probably going for MPS2 but erroneously used an o instead of a u: Banchiau.
  • Sec 1 Rd should be Rd Sec. 1.
  • Mrt should be MRT.

So that’s four systems, plus additional errors.

There’s much, much more that’s wrong with this than is right. That’s even more evident on a larger map — and that’s without me bothering to mark orthographic problems in the Pinyin (e.g., Wen Hua instead of the correct Wenhua).
click for larger view

Here bastardized Wade-Giles (e.g., “Mrt-Hsinpu” at top, center — and, FWIW, in the wrong location) has been added to the mix, making a total of five different romanization systems, as well as some weird spellings, e.g., U Nung, Win De, Bah De, Ying Sh — and that’s without including my favorite, JRLE, because that one is correct in MPS2 (“Zhile” in Hanyu Pinyin).

The main point is that vast majority of names are spelled wrong. And among the few that are spelled correctly, those that are written with correct orthography can be counted on one hand. So, to the words above (incompetent and atrocious) let me add FUBAR.

The copyright statement lists not only Microsoft but also Navteq. The Taiwan maps on the latter company’s site, however, are different from those on Bing. Navteq’s are generally in Hanyu Pinyin, though almost invariably improperly written (e.g., Tai bei Shi, Ban Qiao Shi). And despite the prevalence of Hanyu Pinyin, they still contain other romanization systems (e.g., Jhong Shan) and outright errors (e.g., Shin Jahn).

So an update from Navteq wouldn’t be nearly enough to fix Bing’s problems, which are fundamental.

Wenlin releases major upgrade (4.0)

Wenlin logoOne of my favorite programs, Wenlin (which bills itself as “software for learning Chinese”), has just released a major upgrade for both Mac and Windows versions. This doesn’t happen often; it has been three-and-a-half years since the most recent big change was issued (Wenlin 3.4) and heaven only knows how long since 3.0 came out. So, yes, this release has many substantial improvements.

One of the features nearest and dearest to my heart is that Wenlin 4.0 features greatly improved handling of Pinyin. I was among the field testers for the new version, so I’ve already spent a lot of time examining this feature. Here are a few important aspects of this:

  • Conversions from Chinese characters follow Hanyu Pinyin orthography much more closely than before. This is a major change for the better. (There’s still some room for improvement. But I don’t think we’ll have to wait years for this.)
  • In the past, using Wenlin to convert long texts in Chinese characters into Pinyin could be a real chore, with users having to examine example after example of Chinese characters with multiple pronunciations in order to select the proper pronunciation for that particular context. But now users may, if they so desire, tell Wenlin not to ask users for disambiguation input. Of course, that doesn’t mean that Wenlin will always guess right; but many users will be happy that this trade-off allows them to skip the frustration of, for example, having to tell the program over and over and over that, yes, in this case 說 is pronounced shuō rather than shuì.
  • Relative newcomers to Mandarin may appreciate that for common words tone sandhi is indicated in Wenlin with additional marks (a dot or line below the vowel). This feature can also be turned off, for those who want standard Pinyin.

There are, of course, many improvements beyond the area of Pinyin. Here are a few:

  • One limitation of Wenlin 3.x was that its English dictionary wasn’t very large. But Wenlin 4.0 includes not only the ABC Chinese-English Comprehensive Dictionary but also the excellent new ABC English-Chinese, Chinese-English Dictionary (now finally in stock in the printed version).
  • The flashcards are now set up to handle not just individual characters but polysyllabic words.
  • There’s full Unicode Unihan 6.0 support for more than 75,000 Chinese characters.
  • And for those who think 75,000 just isn’t enough, users can now access Wenlin’s CDL technology. Through this, users can create new, variant, and rare characters; moreover, these can be published and shared with other Wenlin users or CDL-friendly devices.
  • Seal script versions of more than 11,000 characters are provided.
  • Wenlin contains an e-edition of the Shuowen Jiezi (Shuōwén Jiězì / 說文解字 / 说文解字).
  • Coders will be interested to know that Wenlin appears to be headed toward becoming open-source.
  • Both Mandarin and English entries are marked with grade levels, which aids learners by indicating relative frequency of use. The levels for Mandarin words are based on the Hanyu Shuiping Kaoshi (Hànyǔ Shǔipíng Kǎoshì / 汉语水平考试 / 漢語水平考試 / HSK).

The full version (i.e., the CD with the program comes in a box and is likely packaged with a hard copy of the manual) is US$199, or US$179 if you download it from the Wenlin Web store. Upgrades from 3.x cost US$49.

For more information, see the summary of features and outline of what’s new in Wenlin 4.0.

screenshot from Wenlin 4.0 -- click for larger version

Xin Tang no. 1: articles in Gwoyeu Romatzyh

click to view the PDFI’ve just put up another issue of Xin Tang.

As you may have noticed already, the name on the cover is given not as Xin Tang but as Shin Tarng. That’s because the journal started out being published in the Gwoyeu Romatzyh romanization system. But using the Hanyu Pinyin spelling here helps me keep track of these better.

Almost all of this issue is in Mandarin written in Gwoyeu Romatzyh. One article also has an en face translation into English. And as is the case with the other issues of Xin Tang, a variety of topics are covered.

Shin Tarng no. 1 (September/Jiǔyuè 1982)

Hanyu Pinyin Cihui

image of the cover of this book, which gives 'HANYU PINYIN CIHUI', followed on the next line in larger characters by '汉语拼音词汇', followed on the next line, in smaller letters, by '增訂稿' -- the text is white against a blue backgroundToday, for all you orthography junkies (Hello? Hello? Anybody there?), I have added a selection from the 1963 edition of Hanyu Pinyin Cihui (汉语拼音词汇 / Hànyǔ Pīnyīn Cíhuì).

The book, which is fully alphabetized by Hanyu Pinyin (i.e., like the ABC dictionary series, not like the Hanzi-by-Hanzi Pinyin ordering seen in most dictionaries published in the PRC), is a long list of Mandarin words as written in Hanyu Pinyin and Chinese characters. It’s meant as a reference for word division and other such orthographic concerns. It’s the sort of thing that just cried out to have been made into a full dictionary (especially since that’s what it looks like, minus definitions); but, unfortunately, it never was. But it was an important influence on the ABC series.

One can see some interesting instances of differences between Pinyin orthography then and now. For example, in this old edition of Hanyu Pinyin Cihui de tends to be appended to words and written as d, e.g. ái’áid, rather than the current ái’ái de (皚皚的). Similarly, zi is written z at the end of a word, e.g. ǎigèz, rather than the current ǎigèzi (矮个子).

Also interesting is the mixed use of simplified and traditional Chinese characters. (It will be easier to see what I’m referring to if you open the PDF file of the introduction and A’s of Hanyu Pinyin Cihui.) The title on the cover is given as 汉语拼音词汇 in Chinese characters — perfectly standard. But below this is 增訂稿 (zēngdìng gǎo / revised edition); note how dìng is written as 訂 rather than as 订.

More striking, though, for the modern reader is the script in the foreword. Here, what was written 汉语拼音词汇 on the cover is written 汉拼音汇, mixing traditional and simplified forms. The full traditional version of this would be written 漢語拼音詞彙. The text of the introduction is similarly mixed. This is because this was published before many simplified forms that are now standard were fully accepted officially.

The selection from this book here on Pinyin.info comprises the introduction and all of the entries beginning with the letter a.

image of a few entries

ABC English-Chinese, Chinese-English Dictionary out soon

front cover of the ABC English-Chinese, Chinese-English DictionaryThe ABC Chinese-English Dictionary was published ten years ago. It was revolutionary in that, for the first time, a Mandarin-English dictionary was ordered entirely by the headwords’ pronunciation as written in pinyin. (Stroke and radical indexes are also there to aid finding a character when its shape is known but not its pronunciation.) Other dictionaries in the DeFrancis ABC series have followed. But up to now there been no ABC dictionary with an English to Mandarin section as well as a Mandarin to English one.

At the end of this month the University of Hawai`i Press is releasing the ABC English-Chinese, Chinese-English Dictionary. The new dictionary, which is 1,252 pages long, has 29,670 entries in its English-Mandarin section and 37,963 entries for Mandarin-English (total 67,633 entries). (The much larger ABC Chinese-English Comprehensive Dictionary has some 196,000 entries — all Mandarin-English).

This is a big year for Mandarin-English dictionaries, with the forthcoming release of the ABC ECCE and the release three months ago of the massive Oxford Chinese Dictionary. From the standpoint of Pinyin, however, the Oxford dictionary is a disappointment. For example, the Oxford dictionary has no Pinyin in the English-Mandarin section, just Chinese characters; in some other places tone marks are missing from some of the Pinyin, where it appears at all. Perhaps this will be rectified in the online edition, which has yet to appear. At the moment, though, the Oxford looks like a fairly traditional dictionary — albeit a huge one — aimed mainly at English learners in China, which isn’t necessarily a bad thing if you happen to be among that very large group of people. For more on the Oxford, see the video at Danwei and the entries at Chinese Forums (with some images) and Language Log.

Unlike the Oxford dictionary, the ABC ECCE offers both Pinyin and Chinese characters for all entries and sample sentences. (See samples below. Click on those for more extensive examples in PDF files.)

From what I’ve seen so far of the ABC English-Chinese, Chinese-English Dictionary, I expect it to become the dictionary for English-speaking students of Mandarin. I’ll write more about this once I’m able to see a hard copy.

The ABC English-Chinese, Chinese-English Dictionary retails for only US$20, compared to US$75 for the Oxford.

From the Mandarin-English section. But don’t expect the text in the printed edition to be this large. I’ve enlarged the image to make it easier to read on the Web.
examples of entries in the Mandarin-English section of the ABC English-Chinese, Chinese-English Dictionary

From the English-Mandarin section:
examples of entries in the English-Mandarin section of the ABC English-Chinese, Chinese-English Dictionary

(ISBN-10: 0824834852; ISBN-13: 978-0824834852)

See also:

Xin Tang 4

cover of issue number 4 of the journal 'Xin Tang (New China)'The fourth issue of Xin Tang is now online.

For those of you wondering why Xin Tang is spelled Xin Talng on the cover, that’s because parts of this particular issue use a tonal-spelling variation of Hanyu Pinyin, as follows.

Simple rules for tonal spelling

  1. ma (媽) / ling (拎)
  2. mal (麻) / lilng (零)
  3. maa (馬) / liing (領)
  4. mah (罵) / lihng (另)
  5. “‘” biaaoshih qingsheng, kee’shi “‘de” dou –> “d”.

Here, for example, is a message from the publisher.

Colng zheih yihqi qii SHIN TARNG gaai weil XIN TALNG, shiiyohng d welnzih yii Pin Yin (jiaan xiee PY) weil jichuu. Duobahn d welnzhang yohng yooudiaoh PY xiee. Biaodiaoh faa qiing kahn fengmiahn erh xiah’tou d jiaandan shuomilng.

The same passage in Pinyin with tone marks:

Cóng zhèi yì qī qǐ SHIN TARNG gǎi wéi XIN TANG, shǐyòng d wénzì yǐ Pīn Yīn (jiǎn xiě PY) wéi jīchǔ. Duōbàn d wénzhāng yòng yǒudiào PY xiě. Biāodiào fǎ qǐng kàn fēngmiàn èr xià’tou d jiǎndān shuōmíng.

Not all of the romanization in this issue follows that form. Some has no special spellings but instead uses tone marks. Some has no tone marks. Give ‘em all a try and see what you think.

Xin Tang 4 (PDF)

new book in Pinyin

image of the cover of the printed edition of Pinyin Riji DuanwenI’m very pleased to announce the publication of a new book, Pīnyīn Rìjì Duǎnwén, by Zhāng Lìqīng. Other than one introductory letter in English, the work is entirely in Mandarin.

This is perhaps the world’s first Mandarin-language book to be published in Hanyu Pinyin without so much as one Chinese character. Thus, it is of historic importance. But it’s also a wonderful collection of stories. The author generously granted Pinyin.info the right to release all of this book online.

The work will also soon be available in an inexpensive printed edition.

Some of you will recall Zhang’s lovely story Dàshuǐ Guòhòu (“After the Flood”), which first appeared here three years ago. It leads the new collection. The remaining twelve memoirs/stories are mainly in the same vein, recalling a childhood in China and Taiwan.

Zhè shì yī gè lǎo gùshi. Shìqing fāshēng zài 1946 nián xiàtiān. Nà nián wǒ jiāngjìn shí suì, zhù zài Sìchuān Chéngdū jiāoqū d Bǎihuā Qiáo. Zhōngguó Kōngjūn Tōngxìn Xuéxiào d jīdì zài nàli. Wǒ bàba shì nà ge xuéxiào d jūnguān….

The author died earlier this year. She was able to view proofs of the work, though her illness prevented her from making any corrections herself. Fortunately, several people stepped in, contributing substantially to the checking of the Pinyin and other aspects of the work. I’d like especially to thank the following people: David W. Goodrich, Jiao Liwei, Kuo Hsin-chun, Melvin Lee, and Victor H. Mair. Any errors found in the book should be considered my own.

Please report any divergences from the Pinyin orthography established by Yin Binyong and the spellings used in the ABC Chinese-English Comprehensive Dictionary (Zhang was, after all, one of the associate editors of that massive work) to me. I’ve made very few intentional departures from those.

Please note that the use of “d” (where most authors would use “de”) is intentional. This is not a bug but a feature, something I came to understand better the more time I spent with this text. The use of “d” is explained in the second introductory letter (Liǎng Fēng Gěi Biānzhě d Xìn: 2).