Japan to switch official romanization from Kunrei-shiki to Hepburn

Japanese newspapers are reporting that Japan will officially switch from Kunrei-shiki romanization to Hepburn romanization.

In a front-page column last week, the Asahi Shimbun said, “A draft report recently published by the Council of Cultural Affairs pointed out that the Hepburn system is more widely used than the Kunrei system, and it is expected that the notation will be adjusted to reflect this. It is surprising because the writing system has not changed for about 70 years, but if confusion can be avoided, the change is to be welcomed.”

Some examples of differences:

Kunrei Hepburn
Aiti Aichi
Atugi Atsugi
Gihu Gifu
Hukusima Fukushima
Sinzyuku Shinjuku
Titibu Chichibu
Tukizi Tsukiji
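
The pairs above follow a handful of regular spelling rules, which can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the helper name kunrei_to_hepburn and its rule table are hypothetical, not part of any official standard, and the sketch ignores long-vowel marks and other finer points of both systems.

```python
import re

# Kunrei-shiki spellings that differ from Hepburn.
# Partial, illustrative list (an assumption, not an official table).
KUNREI_TO_HEPBURN = {
    "sy": "sh",   # sya -> sha
    "ty": "ch",   # tya -> cha
    "zy": "j",    # zya -> ja
    "si": "shi",
    "ti": "chi",
    "tu": "tsu",
    "hu": "fu",
    "zi": "ji",
}

# One pattern, applied in a single left-to-right pass, so that text
# produced by one rule is never rewritten by another rule.
_PATTERN = re.compile("|".join(KUNREI_TO_HEPBURN))


def kunrei_to_hepburn(word: str) -> str:
    """Convert a single Kunrei-shiki word to Hepburn (hypothetical helper)."""
    converted = _PATTERN.sub(
        lambda match: KUNREI_TO_HEPBURN[match.group()], word.lower()
    )
    # Restore an initial capital, as used in place names.
    if word[:1].isupper():
        converted = converted[:1].upper() + converted[1:]
    return converted


for name in ["Aiti", "Atugi", "Gihu", "Hukusima", "Sinzyuku", "Titibu", "Tukizi"]:
    print(name, "->", kunrei_to_hepburn(name))
```

The single-pass substitution matters: applying the rules one after another would turn "syu" into "shu" and then wrongly into "sfu".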


Japan likely to regulate pronunciations of personal names

“No, no, no. It’s spelled ‘Raymond Luxury Yacht,’ but it’s pronounced ‘Throatwobbler Mangrove.’” — Monty Python’s Flying Circus

On February 17, Japan’s Legislative Council presented the country’s justice minister with an outline that would mandate that any kanji in names of newborns entered in official family registers include phonetic readings in kana. It would also restrict some readings.

Readings would also be added to names already in registers.

The changes would likely be enforced starting in the 2024 fiscal year (April 1, 2024, to March 31, 2025).

From a news article:

Currently, family registers do not have a field to indicate phonetic readings. After the law revision, family registers will include phonetic readings of kanji in kana characters.

According to the outlines, certain restrictions will be set on “colorful names” whose phonetic readings in kana characters deviate from the original meanings of the kanji characters.

Not only will newborn babies have pronunciations of their names entered in their family registers, but children and adults whose names already appear in family registers will be allowed to add phonetic readings.

Such people will be allowed to register different readings from ones already in their resident registers — records that are distinct from family registers — but the Justice Ministry calls for careful consideration when registering name readings in family registers.

The government plans to submit the revision bill during the current ordinary Diet session, aiming for it to be enforced in fiscal 2024.

The outlines say that “phonetic readings generally accepted as names” will be allowed in family registers.

A supplementary document to the outlines also calls for flexible management of the new system, given the historical and cultural reality that there have been some phonetic readings that are used only for names.

However, the government plans not to accept phonetic readings of names “that would confuse society.”

Examples of this restriction include readings with a meaning opposite to the kanji’s meaning, those that are difficult to distinguish from misreadings or misspellings, and those with no relation to the meaning of the kanji….

Discriminatory and obscene phonetic readings of names will not be accepted. Nor will names of characters from comics, anime and other fictitious works that would cause discomfort if used as the names of real people.

As current family registers do not have a section to indicate phonetic readings of names, people listed in Japanese family registers do not officially have phonetic reading of names under the Family Register Law.

In contrast, phonetic readings are written on resident registers. However, according to the ministry, those phonetics readings are not legally official but exist for administrative convenience. Currently, phonetic readings on birth registrations are used for resident registration purposes, but not for family registers.

After the law revision goes into force, kana characters for phonetic readings in birth registrations of newborns will also be used in family registers. Those who already have family registers can submit phonetic reading of their names to municipalities within one year after the revised law goes into force.

In particular, people with concerns such as the frequent mispronunciation of their names by others may find it necessary to have their resident registers revised to include the desired phonetic readings of their names. However, as changing the submitted names will require permission from a family court, the ministry urges careful consideration in deciding the name readings to be submitted.

For those who do not submit phonetic readings of names within one year after the enforcement of the revision, the official phonetic reading will be decided based on readings indicated in resident registers after municipal mayors send notifications to their respective residents.

I’m still wondering about the “cause discomfort” part. Discomfort to whom? How?

Japan to add romanization to names on My Number cards

The Japanese government has reportedly decided to add romanization for names on My Number cards, starting next year (2024). My Number cards, also known as Individual Number cards (kojin bangō kādo / 個人番号カード), are a form of national ID.

Here’s basically what they look like now (without a space for romanization):
[Image: blank My Number card]

But I haven’t been able to find any more specific information yet.

I wrote to the authorities responsible for My Number cards for clarification. I wanted to know what romanization system My Number cards will use: Hepburn, Kunrei-shiki, or something else? Or will people be able to choose any system they want, or to choose from a list of government-approved systems?

I also requested links to any articles/announcements about this in English or Japanese.

Unfortunately, the person who politely responded did not have any information about this beyond what I submitted.

Source: one small mention at the end of this article: Pronunciation of Japanese Personal Names to be Regulated by Planned Law Revision, Japan News (from the Yomiuri Shimbun), February 18, 2023.

More Americans studying in Japan

The number of U.S. students studying abroad in Japan is continuing to increase, having recovered from a sharp decline in the 2010–2011 school year.

This is in contrast to the situation in China, which has been seeing fewer and fewer U.S. students.

[Graph showing a steady increase in U.S. students studying in Japan from 2000, with a 33% decline in 2010, followed by a recovery that now surpasses the 2009 level.]

I’m not sure what accounts for the sharp drop in 2010–2011. It occurred before the March 2011 earthquake and tsunami.

source: IIE Open Doors Study Abroad Destinations

China attracting fewer and fewer U.S. study-abroad students

China is continuing to decline as a destination for U.S. study-abroad students, slipping from fifth place to sixth (behind Britain, Spain, Italy, France, and Germany; with Ireland, Australia, Costa Rica, and Japan completing the top ten).

This likely indicates that the craze for learning Mandarin has already peaked. Greater awareness of the unhealthy levels of pollution in China may also be a factor.

[Chart showing how U.S. enrollments in study-abroad programs in China were low in the 1990s (about 2,000 students), grew sharply in the 2000s (to almost 15,000 in 2011), and have been declining ever since.]
Note: The dip in the 2002–2003 school year was a result of worries about the outbreak of SARS.

Meanwhile, almost all other parts of East Asia saw increases in 2015–2016 over 2014–2015:

Destination    Students in 2014-15    Students in 2015-16    % Change
China                       12,790                 11,688        -8.6
Hong Kong                    1,508                  1,612         6.9
Japan                        6,053                  7,145        18.0
Macau                            3                      4        33.3
Mongolia                        71                     71         0.0
South Korea                  3,520                  3,622         2.9
Taiwan                         880                    980        11.4
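
For anyone who wants to check the last column, here is a minimal Python sketch of the arithmetic. The enrollment figures are copied from the table; the names enrollments and percent_change are hypothetical, chosen only for this illustration.

```python
# (students in 2014-15, students in 2015-16), copied from the table above.
enrollments = {
    "China": (12_790, 11_688),
    "Hong Kong": (1_508, 1_612),
    "Japan": (6_053, 7_145),
    "Macau": (3, 4),
    "Mongolia": (71, 71),
    "South Korea": (3_520, 3_622),
    "Taiwan": (880, 980),
}


def percent_change(old: int, new: int) -> float:
    """Year-over-year change as a percentage, rounded to one decimal place."""
    return round((new - old) / old * 100, 1)


for destination, (before, after) in enrollments.items():
    print(f"{destination}: {percent_change(before, after):+.1f}%")
```

With only three and four students, Macau's 33.3% rise reflects a single additional student, so the percentages for the smallest destinations should be read with care.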


Popularity of Chinese character country code TLDs

Yesterday we looked at the popularity of the Chinese character TLD for Singapore Internet domains. Today we’re going to examine the Chinese character ccTLDs (country code top-level domains) for those places that use Chinese characters and compare the figures with those for the respective Roman alphabet TLDs.

In other words, how, for example, does the use of .台灣 domains compare with the use of .tw domains?

Since, unlike the case with Singapore, I don’t have the registration figures, I’m having to make do with Google hits, which is a different measure. For this purpose, Google is unfortunately a bit of a blunt instrument. But at least it should be a fairly evenhanded blunt instrument and will be useful in establishing baselines for later comparisons.

A few notes before we get started:

  • Japan has yet to bother with completing the process for its own name in kanji (.日本), so it is omitted here.
  • Macau only recently asked for .澳门 (simplified Chinese characters) and .澳門 (traditional Chinese characters), so those figures are still at zero.
  • Oddly enough, there’s no .臺灣 ccTLD, even though the Ma administration, which was in power when Taiwan’s ccTLDs went into effect, officially prefers the more complex form .臺灣 to .台灣, not to mention preferring it to the simplified .台湾.

Domain                                   Google Hits    Percent of Total
MACAU
.mo                                       18,400,000              100.00
.澳门 (simplified)                                 0                0.00
.澳門 (traditional)                                0                0.00
TAIWAN
.tw                                      206,000,000               99.86
.台湾 (simplified)                            67,600                0.03
.臺灣 (traditional, more complex form)             0                0.00
.台灣 (traditional)                          230,000                0.11
HONG KONG
.hk                                      193,000,000               99.94
.香港                                        118,000                0.06
SINGAPORE
.sg                                       97,800,000              100.00
.新加坡                                            2                0.00
CHINA
.cn                                      315,000,000               99.61
.中国 (simplified)                           973,000                0.31
.中國 (traditional)                          251,000                0.08

So in no instance does the Chinese character ccTLD reach even one half of one percent of the total for any given place.
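
The percentages can be reproduced with a short Python sketch. The hit counts are the Google figures given above; the names hits and percent_shares are hypothetical, and telling the Chinese character domains apart by checking for non-ASCII text is just a convenience for this example.

```python
# Google hit counts per domain, copied from the figures above.
hits = {
    "Macau": {".mo": 18_400_000, ".澳门": 0, ".澳門": 0},
    "Taiwan": {".tw": 206_000_000, ".台湾": 67_600, ".臺灣": 0, ".台灣": 230_000},
    "Hong Kong": {".hk": 193_000_000, ".香港": 118_000},
    "Singapore": {".sg": 97_800_000, ".新加坡": 2},
    "China": {".cn": 315_000_000, ".中国": 973_000, ".中國": 251_000},
}


def percent_shares(domains: dict[str, int]) -> dict[str, float]:
    """Each domain's share of its place's total hits, in percent."""
    total = sum(domains.values())
    return {name: round(count / total * 100, 2) for name, count in domains.items()}


for place, domains in hits.items():
    shares = percent_shares(domains)
    # Treat any domain containing non-ASCII characters as a Chinese character ccTLD.
    largest = max(share for name, share in shares.items() if not name.isascii())
    print(f"{place}: largest Chinese character ccTLD share = {largest:.2f}%")
```

The largest share in any place is that of .中国, at roughly 0.31 percent, which is what the "one half of one percent" remark rests on.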

Here are the results in a chart.

[Graph showing that although China leads in domains in Chinese characters, they do not reach even one half of one percent of the total for China.]

Note that the ratios of simplified to traditional forms in China and Taiwan are roughly mirror images of each other, as is perhaps to be expected.

See also Platform on Tai, Pinyin News, December 30, 2011

China and U.S. study-abroad programs

The top 10 destinations for U.S. students studying abroad were unchanged in the 2009–2010 school year compared to the year before. China remained in fifth place, with its numbers up only 1.7% over the previous year.

Number of U.S. students studying abroad, by destination and year

By far the largest gains of destinations in the top 25 were those by Israel (60.7% — up to 3,146 visiting students) and India (44.4% — up to 3,884). Though not in the top 25, Taiwan also experienced very strong growth at 42.4% (850 students) — far higher than any other country in East Asia.

In second place for growth in East Asia was Japan (6.6%), which will soon replace Costa Rica in the top 10 if trends continue.

For places of origin of international students studying in the United States, China was by far the leader, with 157,558 students, about 50% more than India’s 103,895 students in the States. Third and fourth places were held by South Korea and Canada, respectively. Taiwan was fifth with 24,818 students.


Google Translate and romaji revisited

OK, Google has improved its Pinyin converter some, though it still fails in important areas. So that’s the present situation for Google and Mandarin.

How about for Google and Japanese?

Professor J. Marshall Unger of the Ohio State University’s Department of East Asian Languages and Literatures generously agreed to reexamine Google’s performance in conversions to rōmaji (Japanese written in romanization).

Below is his latest evaluation.

For his initial analysis (in December 2009), see Google Translate and rōmaji.

I ran the test passage through Google Translate again. There’s some improvement, but it’s still pretty mediocre.

Original:

6日午後4時35分ごろ、東京都千代田区皇居外苑の都道(内堀通り)の二重橋前交差点で、中国からの観光客の40代の男性が乗用車にはねられ、全身を強く打って間もなく死亡した。車は歩道に乗り上げて歩いていた男性(69)もはね、男性は頭を強く打って意識不明の重体。丸の内署は、運転していた東京都港区白金3丁目、会社役員高橋延拓容疑者(24)を自動車運転過失傷害の疑いで現行犯逮捕し、容疑を同致死に切り替えて調べている。

同署によると、死亡した男性は横断歩道を歩いて渡っていたところを直進してきた車にはねられた。車は左に急ハンドルを切り、車道と歩道の境に置かれた仮設のさくをはね上げ、歩道に乗り上げたという。さくは歩道でランニングをしていた男性(34)に当たり、男性は両足に軽いけが。

同署は、死亡した男性の身元確認を進めるとともに、当時の交差点の信号の状況を調べている。

現場周辺は東京観光のスポットの一つだが、最近はジョギングを楽しむ人も増えている。

Google Translate:

6-Nichi gogo 4-ji 35-fun-goro, Tōkyō-to Chiyoda-ku Kōkyogaien no todō (uchibori-dōri) no Nijūbashi zen kōsaten de, Chūgoku kara no kankō kyaku no 40-dai no dansei ga jōyōsha ni hane rare, zenshin o tsuyoku Utte mamonaku shibō shita. Kuruma wa hodō ni noriagete aruite ita dansei (69) mo hane, dansei wa atama o tsuyoku utte ishiki fumei no jūtai. Marunouchi-sho wa, unten shite ita Tōkyō-to Minato-ku hakkin 3-chōme, kaisha yakuin Takahashi nobe Tsubuse yōgi-sha (24) o jidōsha unten kashitsu shōgai no utagai de genkō-han taiho shi, yōgi o dō chishi ni kirikaete shirabete iru.

Dōsho ni yoru to, shibō shita dansei wa ōdan hodō o aruite watatte ita tokoro o chokushin shite kita kuruma ni hane rareta. Kuruma wa hidari ni kyū handoru o kiri, shadō to hodō no sakai ni oka reta kasetsu no saku o haneage, hodō ni noriageta toyuu. Saku wa hodō de ran’ningu o shite ita dansei (34) niatari, dansei wa ryōashi ni karui kega.

Dōsho wa, shibō shita dansei no mimoto kakunin o susumeru totomoni, tōji no kōsaten no shingō no jōkyō o shirabete iru.

Genba shūhen wa Tōkyō kankō no supotto no hitotsudaga, saikin wa jogingu o tanoshimu hito mo fuete iru.

Notes:

  • The use of numerals dodges a plethora of errors, but “6-Nichi” is still wrong for Muika.
  • Lots of correct capitalizations have been added, but “uchibori” was missed and “Utte” capitalized by mistake.
  • Some false spaces or lack of spaces persist: “hane rare”, “oka reta”; “hitotsudaga” and “niatari” were correctly hitotsu da ga and ni atari in the original test.
  • Names still get butchered (“hakkin” for Shirogane, “nobe Tsubuse” for Nobuhiro).
  • The needless apostrophe in “ran’ningu” is still there.
  • Interestingly, “toyuu” is a new error: it should be to iu.
  • There’s evidence of some attempt to use hyphens, but why not in “kankō kyaku” or “Nijūbashi zen”?

So, to update: Google gets kudos for conscientiousness, but I stick by my original comments.

For more by Prof. Unger, see Pinyin.info’s recommended readings, which include selections from The Fifth Generation Fallacy: Why Japan Is Betting Its Future on Artificial Intelligence, Literacy and Script Reform in Occupation Japan: Reading Between the Lines, and Ideogram: Chinese Characters and the Myth of Disembodied Meaning.