Aiyo! OED fails to use Pinyin for some new entries

The Oxford English Dictionary has just added some new entries, including several from Sinitic languages.

A lot of these come by way of Singapore and so reflect the Hokkien language. For example, among the new entries is “ang pow,” which is Hokkien’s equivalent of Mandarin’s “hongbao,” which also made the list.

A few of the entries, however, come from Mandarin, for example two common interjections for surprise. Oddly, though, the OED uses “aiyoh” and “aiyah” instead of their proper Pinyin spellings of “aiyo” and “aiya.”

“Ah,” you say, “but maybe the aiyoh and aiyah spellings are more common in English.”

Nope.

Even in Singapore domains (.sg), the Pinyin spellings are more common than those the OED calls for. As the tables below show, in every instance the Pinyin spellings are also more common in Hong Kong, China, and Taiwan. Throughout the world, the Pinyin spellings are more common — the vast majority of the time by a factor of at least two.

Google search results for “aiyo” (Pinyin) and “aiyoh” (spelling used in the OED)

  aiyo aiyoh
.sg 12,200 5,680
.hk 2,570 187
.cn 6,040 984
.tw 4,690 196
all domains 1,250,000 137,000
all domains  + “chinese” 97,700 77,100
all domains  + “mandarin” 51,800 14,100

Google search results for “aiya” (Pinyin) and “aiyah” (spelling used in the OED)

  aiya aiyah
.sg 17,600 8,310
.hk 6,400 2,360
.cn 13,200 1,860
.tw 5,910 1,710
all domains 3,370,000 332,000
all domains  + “chinese” 238,000 63,200
all domains  + “mandarin” 36,500 22,800

Searching Google Books also reveals that the Pinyin forms are more common.

In short, I do not see any good reason for the OED to have adopted ad hoc spellings rather than the Pinyin standard. They must have their reasons, but it looks like they botched this.

Popularity of Chinese character country code TLDs

Yesterday we looked at the popularity of the Chinese character TLD for Singapore Internet domains. Today we’re going to examine the Chinese character ccTLDs (country code top-level domains) for those places that use Chinese characters and compare the figures with those for the respective Roman alphabet TLDs.

In other words, how, for example, does the use of taiwan in traditional Chinese characters   .台灣 domains compare with the use of .tw domains?

Since, unlike the case with Singapore, I don’t have the registration figures, I’m having to make do with Google hits, which is a different measure. For this purpose, Google is unfortunately a bit of a blunt instrument. But at least it should be a fairly evenhanded blunt instrument and will be useful in establishing baselines for later comparisons.

A few notes before we get started:

  • Japan has yet to bother with completing the process for its own name in kanji (Japan, as written in kanji / Chinese characters), so it is omitted here.
  • Macau only recently asked for aomen in simplified Chinese characters    
  .澳门 and aomen in traditional Chinese characters    
  .澳門, so those figures are still at zero.
  • Oddly enough, there’s no taiwan_super in traditional Chinese characters   
  .臺灣 ccTLD, even though the Ma administration, which was in power when Taiwan’s ccTLDs went into effect, officially prefers the more complex form of taiwan_super in traditional Chinese characters   
  .臺灣 to taiwan in traditional Chinese characters   .台灣 — not to mention prefering it to taiwan in simplified Chinese characters    
  .台湾.
  Google Hits Percent of Total
MACAU    
.mo 18400000 100.00
aomen in simplified Chinese characters    
  .澳门 0 0.00
aomen in traditional Chinese characters    
  .澳門 0 0.00
TAIWAN    
.tw 206000000 99.86
taiwan in simplified Chinese characters    
  .台湾 67600 0.03
taiwan_super in traditional Chinese characters   
  .臺灣 0 0.00
taiwan in traditional Chinese characters   .台灣 230000 0.11
HONG KONG    
.hk 193000000 99.94
xianggang  in Chinese characters 
  .香港 118000 0.06
SINGAPORE    
.sg 97800000 100.00
xinjiapo  in Chinese characters 
  .新加坡 2 0.00
CHINA    
.cn 315000000 99.61
zhongguo in simplified Chinese characters  
  .中国 973000 0.31
zhongguo in traditional Chinese characters   
  .中國 251000 0.08

So in no instance does the Chinese character ccTLD reach even one half of one percent of the total for any given place.

Here are the results in a chart.

Graph showing that although China leads in domains in Chinese characters, they do not reach even one half of one percent of the total for China

Note that the ratio of simplified:traditional forms in China and Taiwan are roughly mirror images of each other, as is perhaps to be expected.

See also Platform on Tai, Pinyin News, December 30, 2011

Popularity of the Chinese character TLD for Singapore Internet domains

For quite a few years Singapore has had several choices for those wishing to register Singapore-specific domain names, including .com.sg, .net.sg,, .org.sg, .edu.sg, .gov.sg, .per.sg, and just .sg.

Of those, .sg is a top-level domain (TLD), whereas .com.sg, .net.sg,, .org.sg, .edu.sg, .gov.sg, and .per.sg are second-level domains. This post is mainly concerned with TLDs; but when I’m giving totals I also include .com.sg, .net.sg,, .org.sg, .edu.sg, .gov.sg, and .per.sg but exclude specific domains such as groupon.sg. OK, now back to the post.

Although English is the dominant language of Singapore, it is but one of four official languages there, along with Mandarin, Malay, and Tamil, with Mandarin (along with other Sinitc languages) being the most common of the latter three. Some three-quarters of the city-state’s population is ethnic Chinese, and around half of that group speak Mandarin as the main language in their homes. In addition, for decades Singapore has promoted its campaign to Strike Hard Against Hoklo, Cantonese, and Other Languages that Your Government Says Are Puny and Insignificant Because They Have Only Tens of Millions of Speakers Apiece Speak Mandarin.

So you might think that four years ago, when Singapore introduced Singapore’s name in Chinese characters ('Singapore' (Xinjiapo) in Chinese characters) as a top-level Internet domain (TLD), many in that multilingual society might jump at the chance to pick up some domain names ending with “Singapore” in Chinese characters. (Oh, it hurts me to use images instead of real text there; but until I get the hack fixed, that’s what I’m stuck with.)

Let’s take a look at what happened when the gates opened.

In September 2011, the first month that dot-Xinjiapo (.'Singapore' (Xinjiapo) in Chinese characters) domains became available, a total of 86 were registered. That’s not much of a land rush. The next month and the month after that saw no new registrations. But, OK, maybe they had a sunrise period limiting things. What happened later?

In December 2011 the number jumped to 218. This figure grew over the year 2012 to an all-time high that October of … 247 domains using the .'Singapore' (Xinjiapo) in Chinese characters TLD. Just 247. During the same month, Singapore had 143,887 registered domains, meaning that at the high point those with the Chinese character TLD were less than one fifth of one percent of the total. Since then, the number has fallen to a mere 210, with the percentage dropping to less than one eighth of one percent of the total.

Let’s look at this over time:
dot_xinjiapo_singapore_domains

A Google search for the .'Singapore' (Xinjiapo) in Chinese characters domains reveals that those domains are even less used than the already astonishingly low registration numbers might indicate.

results of a Google search for  .??? domains

So that’s a total of two active dot-Xinjiapo domains, one of which is for sale. In other words, basically there’s just one being used. Ouch. That’s about as close to utter insignificance as a Singapore TLD can get.

Indeed, the only sort of Singapore-related domain that is of even less interest to the netizens of Singapore is one within the dot-Cinkappur TLD, with Singapore written in the Tamil script: 'Singapore' as written in Tamil

Dot-Cinkappur (.'Singapore' as written in Tamil) domains have been available since December 2011, which is just a few months after the introduction of dot-Xinjiapo domains. The middle of 2015 saw the all-time record high in dot-Cinkappur domain registrations: sixteen. Since then the number has dropped to just fifteen.

A search on Google for dot-Cinkappur domains reveals zero active sites.

source: Registration Statistics, Singapore Network Information Centre (SGNIC), accessed October 27, 2015

See also: sg domain names in Chinese characters lag, Pinyin News, June 23, 2010.

China and U.S. study abroad programs

China remained the fifth most popular destination for U.S. students studying abroad during the 2008/09 school year, and it continued to account for 5 percent of U.S. study abroad.

In the previous academic year, growth for the PRC as a destination increased 19.0 percent, while study abroad as a whole increased 8.5 percent. But for 2008/09 growth for China was a much smaller 3.9 percent, while the total worldwide figure declined -0.8 percent. Figures for the top four destinations also dropped.

The order of the top 10 remained the same as in the previous year, except Mexico and Germany switched places.

Top 10 destinations for study abroad by U.S. students in the 2006-07, 2007-08, and 2008-09 school years
China shown as the fifth most popular destination for study abroad. The top destination is the U.K., followed by Italy, Spain, and France. See the link to my source material for the actual numbers.

Some other figures of possible interest:

  • Japan was in 11th place with 5,784 students, a 1.3 percent increase over the previous year.
  • Taiwan’s total grew 3.3 percent to 597.
  • Hong Kong grew 5.7 percent to 1,155.
  • South Korea grew a dramatic 29.1 percent to 2,062.
  • Singapore grew 7.7 percent to 612.

Study in Asia increased slightly.

Percent of study abroad performed in Asia
chart showing percentage of study abroad in Asia flat at about 6% from 1996-2000, with growth increasing since 2003 to the present 11.1% for the 2008-09 school year

source: Open Doors data portal

Previous posts on this subject:

sg domain names in Chinese characters lag

Between November, 23, 2009, when Singapore first began registering .sg names in Chinese characters, and June 10, 2010, when registrations of Chinese-character .sg domain names opened to all without any additional fee, only 1,024 such names were registered, or just 0.88 percent of all .sg domain names. This apparently includes not just second-level domains (e.g., 中心.sg) but also third-level domains (e.g., 中心.com.sg).

The percentage will likely rise in the coming months, as the process has only recently opened to everyone on a first-come, first-served basis. But, still, demand for such names in Singapore has so far been underwhelming.

A bit more information:

Registrations were accepted in phases, with registrations for government organizations starting on Nov. 23, 2009. Beginning in January, SGNIC began accepting domain name registrations from trademark holders.

During the third phase, the general public was allowed to register domain names starting on March 25, but applicants were charged a “priority fee” of S$100 (US$72) for each domain name, with domain names sought by several applicants awarded to the highest bidder.

In all three phases, applicants could apply for a domain name made up of Chinese numbers or a name with just one Chinese character for a fee of S$500 [US$360]….

The fourth and final phase began on June 10, with SGNIC accepting domain name applications on a first-come, first-served basis. The S$100 priority fee is no longer required, but applicants are no longer allowed to register domain names using Chinese numbers or names with just one Chinese character….

When IDA announced the introduction of Chinese-language domain names last year, SGNIC said the effort was partly intended to help Singaporean businesses target the Chinese market.

source: Singapore registers 1,000 Chinese-language domain names, IDG News Service, June 23, 2010

‘dialects’ wasting ‘important neurons’ needed for Mandarin, English: Lee Kuan Yew

In 1979 Singapore launched its campaign for people there to “Strike Hard Against Hoklo, Cantonese, and Other Languages that Your Government Says Are Puny and Insignificant Because They Have Only Tens of Millions of Speakers Apiece,” er, “Speak Mandarin” (Jiǎng Huáyǔ Yùndòng / 讲华语运动). The city-state has been marking the the 30th anniversary of this with some speeches, such as one a couple of weeks ago by former prime minister Lee Kuan Yew (Lǐ Guāngyào), now “minister mentor.”

Lee described the situation:

Thirty years ago I launched the Speak Mandarin campaign. [Singaporean] Chinese students learned Mandarin at school. Unfortunately, they used to speak dialects amongst themselves, at home, and with their friends — a variety of dialects.

Here, “dialects” is of course the standard misnomer for Sinitic languages other than Mandarin.

Lee said that he himself was setting a bad example during the 1960s and 1970s by doing such highly irresponsible things as giving speeches in the native language of the majority of Singapore’s citizens. So he stopped all that. And he had the government shut down almost all broadcasts in Hokkien (Hoklo) and other such languages.

Lee said that although he understands “the strong emotional ties to one’s mother tongue … the trend is clear. In two generations, Mandarin will become our mother tongue.”

Actually, no, that’s not clear at all. Rather, a very different trend is apparent. During his speech Lee displayed the graph below, with data taken from surveys conducted by Singapore’s Ministry of Education.

Dominant Home Language of Singaporean Chinese Primary-1 Students (1980 to 2009)
graph showing English in a steady climb from 10% -- all numbers are approximate -- (1980) to 60% (2009); 'Chinese dialects' in steep decline from 1980 (62%) to 1988 (9%) and continuing to decline to only 1% or 2% in 2009; and Mandarin, which begins in 1980 at 28% and quickly tops 60% in 1985, with slower growth until 1988 (69%), after which it enters a steady decline to 39% (2009)

As the primary language of the home for young students, Mandarin has dropped steadily since the late 1980s, while English has risen steadily since 1980, with English surpassing Mandarin in 2004. (Language data for the whole population is more complicated. See, for example, the 2005 General Household Survey.)

Of course the government and Lee recognize this. But they don’t want to fight against English, which is crucial to Singapore’s success. So what Lee is proposing is that parents — both parents — speak Mandarin, not English, to their children.

(I see from my stats that this site gets lots of visitors from Singapore. Can any of you comment on how well you think the public will respond to Lee’s proposal.)

Lee explained in his speech that people have only so many “gigabytes” in their brains to devote to languages.

Though stating that speaking “dialects” in some situations can provide “extra warmth,” he warns that by using such languages: “You are losing important neurons with data which should not be there. And like the computer, when you delete it, it doesn’t really go away. It’s there at the back, and you’ve got to go to the rubbish channel and say ‘destroy.’ And it’s still disturbing your hard disk.”

Thus, those rubbish languages must be destroyed “dialects” must be let go, he said.

On March 8 a linguist at Nanyang Technological University (NTU) in Singapore was quoted in the newspaper as saying, “Although Singaporeans are still multilingual, 40 years ago, we were even more multilingual. Young children are not speaking some of these languages at all any more…. All it takes is one generation for a language to die.” But even after all these years, with Sinitic languages other than Mandarin fading fast there, this is apparently still no time to be slacking off, as Lee’s principal private secretary, Chee Hong Tat, promptly responded, “It would be stupid for any Singapore agency or NTU to advocate the learning of dialects, which must be at the expense of English and Mandarin.”

Part of the reason behind Lee’s call, however, is a basic misunderstanding. Setting aside the matters of educating children in a language not native to them and how many languages most people are capable of speaking effectively, the main difficulty with learning Mandarin is not the language itself (especially for those who speak other Sinitic languages) but Chinese characters as its near-exclusive script.

If Singapore is smart about promoting Mandarin, sooner rather than later it will develop a two-track system, with most students studying how to read and write Mandarin exclusively in Hanyu Pinyin, while those who wish become more specialized can go on to study Chinese characters as well. For this to work, Singapore will need to produce plenty of material to read in Pinyin. (A newspaper, for example, would be a must — and one with real news, not just cute stories for kids.) The city-state certainly has the means and motive for this. But does it have the imagination? If it does, most students could save their precious neurons and gigabytes for other things — perhaps even their families’ traditional native languages.

SOURCES:
Lee Kuan Yew speech:

Some Singapore blog posts:

newspaper stories:

letter to the editor:

additional:

Singapore to allow electronic Chinese dictionaries in exams

Students in Singapore will be able to use certain government-approved handheld electronic Mandarin dictionaries in national exams beginning in 2007. Some printed dictionaries are already allowed for ‘O’ level mother tongue composition exams and, from next year, will also be permitted in PSLE mother tongue composition exams.

The electronic dictionaries will be allowed in the Chinese language composition part of the PSLE and GCE ‘N’, ‘O’ and ‘A’ levels.

I’ve written to the company that makes one of the approved electronic dictionaries, the Hansvision Handheld Dictionary PX2051, for details but have not received a response. The product costs S$24 (about US$14).

The move to allow the electronic devices is in response to a report issued last year by the Chinese Language Review Committee that recommended their use.

Basically, students are finding Chinese characters just too much trouble, so Singapore, wisely, has changed its approach to teaching Mandarin to focus more on speaking and listening. Now if it would just place more emphasis on Pinyin….

source: Electronic Chinese dictionaries to be allowed in national exams, Channel News Asia, October 25, 2005.

Singapore encourages trilingualism

Singapore Encourages Learning Of A Third Language

SINGAPORE, Aug 27 (Bernama) — Bilingual Singapore is gearing up towards “trilingualism” among its population by encouraging students to learn a third language, apart from English and their own mother tongue, even if it is only for conversational purposes.

“The ability to speak a third language is useful, and will help young Singaporeans of all races operate effectively in the region and beyond,” Education Minister Tharman Shanmugaratnam said Saturday.

Proficiency in non-native mother-tongue languages would also help nurture increased interaction among the ethnic communities, friendships between students and ties between neighbours, across race and religion, Tharman said at the inaugural Mother Tongue News Writing Competition jointly organised by Innova Junior College and Berita Harian.

“We have to find every way to interest our children in our various races and cultures, give them as many opportunities as possible to interact in school, and give them confidence to talk to people of other races outside the school,” he said….

More than 40 schools, both primary and secondary, and junior colleges, have started to conduct conversational Malay as enrichment classes, with more than 200 other schools having expressed interest in starting similar programmes.

The ministry recently distributed a “Guide to Conversational Malay” to all schools to help them introduce such programmes.

“The Ministry of Education is currently developing a resource booklet for conversational Mandarin to help schools implement the enrichment programme for non-Chinese students,” Tharman said.

English is the medium of instruction in Singapore schools but it is also compulsory for students to take up a mother tongue language — either Chinese, Malay or Tamil.