{"id":6098,"date":"2012-02-14T22:15:33","date_gmt":"2012-02-14T14:15:33","guid":{"rendered":"https:\/\/pinyin.info\/news\/?p=6098"},"modified":"2015-12-08T15:30:12","modified_gmt":"2015-12-08T07:30:12","slug":"new-database-of-cross-strait-differences-in-mandarin-goes-online","status":"publish","type":"post","link":"https:\/\/pinyin.info\/news\/2012\/new-database-of-cross-strait-differences-in-mandarin-goes-online\/","title":{"rendered":"New database of cross-strait differences in Mandarin goes online"},"content":{"rendered":"\n<p>Last week, on the same day President Ma Ying-jeou <a href=\"http:\/\/www.taiwannews.com.tw\/etn\/news_content.php?id=1835921\">accepted the resignation of a minister<\/a> who made some <a href=\"http:\/\/udn.com\/NEWS\/NATIONAL\/NAT1\/6889097.shtml\">drunken lewd remarks<\/a> at a <span class=\"py\">w&#283;iy&#225;<\/span> (year-end office party), Ma was joking to the media about blow jobs. <\/p>\n<p><em>Classy<\/em>.<\/p>\n<p><a href=\"http:\/\/www.youtube.com\/watch?v=Sk88H67KTX8\" title=\"link to video\"><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/pinyin.info\/news\/news_photos\/2012\/02\/laba_news.jpg\" alt=\"screenshot from a video of a news story on this\" title=\"laba_news\" width=\"500\" height=\"363\" class=\"alignnone size-full wp-image-6152\" srcset=\"https:\/\/pinyin.info\/news\/news_photos\/2012\/02\/laba_news.jpg 500w, https:\/\/pinyin.info\/news\/news_photos\/2012\/02\/laba_news-300x217.jpg 300w\" sizes=\"auto, (max-width: 500px) 100vw, 500px\" \/><\/a><\/p>\n<p>But it was all <em>for a good cause<\/em>, of course. You see, the Mandarin expression <span class=\"py\">chu&#299; l&#462;ba<\/span>, when not referring to the literal playing of a trumpet, is usually taken in Taiwan to refer to a blow job. But in China, Ma explained, <span class=\"py\">chu&#299; l&#462;ba<\/span> means the same thing as the idiom <span class=\"py\">p&#257;i m&#462;p&#236;<\/span> (pat\/kiss the horse&#8217;s ass &#8212; i.e., flatter). And now that we have the handy-dandy <span class=\"py\">Zh&#333;nghu&#225; Y&#468;w&#233;n Zh&#299;shik&#249;<\/span> (Chinese Language Database), which Ma was announcing, we can look up how Mandarin differs in Taiwan and China, and thus not get tripped up by such misunderstandings. Or at least that&#8217;s supposed to be the idea. <\/p>\n<p>The database, which is the result of cross-strait cooperation, can be accessed via two sites: <a href=\"http:\/\/chinese-linguipedia.org\/\">one in Taiwan<\/a>, the other <a href=\"http:\/\/zhonghuayuwen.org\/\">in China<\/a>. <\/p>\n<p>It&#8217;s clear that a lot of money has been spent on this. For example, many entries are accompanied by well-documented, precise explanations by distinguished lexicographers. Ha! Just kidding! Many entries are really accompanied by videos &mdash; some <em>two hundred<\/em> of them &mdash; of cutesy puppets gabbing about cross-strait differences in Mandarin expressions. But if there&#8217;s a video in there of the panda in the skirt explaining to the sheep in the vest that a useful skill for getting ahead in Chinese society is <span class=\"py\">chu&#299; l&#462;ba<\/span>, I haven&#8217;t found it yet. Will <a href=\"http:\/\/www.youtube.com\/user\/NMAWorldEdition?ob=0\">NMA<\/a> will take up the challenge? <\/p>\n<p><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/pinyin.info\/news\/news_photos\/2012\/02\/laba.jpg\" alt=\"\" title=\"laba\" width=\"476\" height=\"354\" class=\"alignnone size-full wp-image-6149\" srcset=\"https:\/\/pinyin.info\/news\/news_photos\/2012\/02\/laba.jpg 476w, https:\/\/pinyin.info\/news\/news_photos\/2012\/02\/laba-300x223.jpg 300w\" sizes=\"auto, (max-width: 476px) 100vw, 476px\" \/><\/p>\n<p>Much of the site emphasizes not so much language as Chinese characters. For example, another expensively produced video feeds the <a href=\"https:\/\/pinyin.info\/readings\/texts\/ideographic_myth.html\" title=\"Read a chapter from the book _The Chinese Language: Fact and Fantasy_, by John DeFrancis.\">ideographic myth<\/a> by showing off obscure Hanzi, such as the one for <span class=\"py\">ch&#283;ng<\/span>. <\/p>\n<p>WARNING: The screenshot below links to a video that contains scenes with intense <em>wawa<\/em>-ing and thus may not be suitable for anyone who thinks it&#8217;s not really cute for grown women to try to sound like they&#8217;re only thwee-and-a-half years old. <\/p>\n<p><a href=\"http:\/\/www.youtube.com\/watch?feature=player_detailpage&amp;v=64IjOCcGIxI#t=358s\"><img decoding=\"async\" src=\"https:\/\/pinyin.info\/news\/news_photos\/2012\/02\/three_ma_hanzi_cheng3.jpg\" alt=\"cheng3\" title=\"three_ma_hanzi_cheng3\" width=\"500\" class=\"alignnone size-full wp-image-6184\" srcset=\"https:\/\/pinyin.info\/news\/news_photos\/2012\/02\/three_ma_hanzi_cheng3.jpg 852w, https:\/\/pinyin.info\/news\/news_photos\/2012\/02\/three_ma_hanzi_cheng3-300x179.jpg 300w\" sizes=\"(max-width: 852px) 100vw, 852px\" \/><\/a><\/p>\n<p>In a welcome bit of synchronicity, Victor Mair posted on Language Log earlier the same week on <a href=\"http:\/\/languagelog.ldc.upenn.edu\/nll\/?p=3750\">the unpredictability of Chinese character formation and pronunciation<\/a>, briefly discussing just such patterns of duplication, triplication, etc.<\/p>\n<p>Mair notes: <\/p>\n<blockquote>\n<p>Most of these characters are of relatively low frequency and, except for a few of them, neither their meanings nor their pronunciations are known by persons of average literacy.<\/p>\n<p>Many more such characters consisting or two, three, or four repetitions of the same character exist, and their sounds and meanings are in most cases equally or more opaque.<\/p>\n<\/blockquote>\n<p>The Hanzi for <span class=\"py\">ch&#283;ng<\/span> (which looks like &#39340;&#39340;&#39340; run together as one character) in the video above is sufficiently obscure that it likely won&#8217;t be shown correctly in many browsers on most systems when written in real text: &#170466;. But never fear: <a href=\"http:\/\/www.unicode.org\/cgi-bin\/GetUnihanData.pl?codepoint=299e2\" title=\"link to the Unicode page on the Hanzi &#170466;\">It&#8217;s already in Unicode<\/a> and so should be appearing one of these years in a massively bloated system font.<\/p>\n<p>Further reinforcing the impression that the focus is on Chinese characters, <span class=\"py\">Li&#250; Zh&#224;oxu&#225;n<\/span>, who is the head of the association in charge of the project on the Taiwan side, equated traditional Chinese characters with Chinese culture itself and declared that getting the masses in China to recognize them is an important mission. (Liu really needs to read L&#252; Shuxiang&#8217;s &#8220;<a href=\"https:\/\/pinyin.info\/readings\/lyu_shuxiang\/\">Comparing Chinese Characters and a Chinese Spelling Script &#8212; an evening conversation on the reform of Chinese characters<\/a>.&#8221;)<\/p>\n<p>Then he went on about how <a href=\"https:\/\/pinyin.info\/readings\/texts\/visible\/index.html\">Chinese characters<\/a> are a great system because, supposedly, they have a one-to-one correspondence with language that other scripts cannot match and people can know what they mean by looking at them (!) and that they therefore have a high degree of artistic quality (<span class=\"py\">g&#257;od&#249; de y&#236;sh&#249;x&#236;ng<\/span>). Basically, the person in charge of this project seems to have a bad case of the <a href=\"https:\/\/pinyin.info\/news\/2010\/chinese-characters-like-wow\/\">Like Wow syndrome<\/a>, which is not a reassuring trait for someone in charge of producing a dictionary. <\/p>\n<div style=\"float: right; width: 235px; height: 310px;\"><a href=\"https:\/\/pinyin.info\/news\/news_photos\/2012\/02\/cross_strait_words2.png\"><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/pinyin.info\/news\/news_photos\/2012\/02\/cross_strait_words2-223x300.png\" alt=\"\" title=\"cross_strait_words2\" width=\"223\" height=\"300\" class=\"alignnone size-medium wp-image-6134\" srcset=\"https:\/\/pinyin.info\/news\/news_photos\/2012\/02\/cross_strait_words2-223x300.png 223w, https:\/\/pinyin.info\/news\/news_photos\/2012\/02\/cross_strait_words2.png 502w\" sizes=\"auto, (max-width: 223px) 100vw, 223px\" \/><\/a><\/div>\n<p>The same cooperation that built the Web sites led to a new book, <a href=\"http:\/\/chinese-linguipedia.org\/clk\/news\/find.do?id=13555f7c50e00000b600\"><span class=\"py\">Li&#462;ng&#8217;&#224;n M&#283;ir&#236; Y&#299; C&#237;<\/span><\/a> (&#12298;&#20841;&#23736;&#27599;&#26085;&#19968;&#35422;&#12299; \/ Roughly: <em>Cross-Strait Term-a-Day Book<\/em>), which was also touted at the press conference. <\/p>\n<p>The book contains Hanyu Pinyin, as well as <a href=\"https:\/\/pinyin.info\/romanization\/bopomofo\/basic.html\" title=\"table comparing zhuyin fuhao and Hanyu Pinyin\"><em>zhuyin fuhao<\/em><\/a>. But, alas, the book <a href=\"https:\/\/pinyin.info\/news\/2012\/some-fonts-not-to-use-for-pinyin\/\">makes the Pinyin look ugly<\/a> <em>and<\/em> fails completely at <a href=\"https:\/\/pinyin.info\/readings\/zyg\/rules.html\" title=\"Basic rules for writing Hanyu Pinyin\">the first rule of Pinyin: use word parsing<\/a>. (In the online images from the book, such as the one below, all of the words are se pa ra ted in to syl la bles.)<\/p>\n<div style=\"clear: both;\"><a href=\"https:\/\/pinyin.info\/news\/news_photos\/2012\/02\/didao_daodi.png\"><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/pinyin.info\/news\/news_photos\/2012\/02\/didao_daodi.png\" alt=\"\" title=\"didao_daodi\" width=\"500\" height=\"333\" class=\"alignnone size-full wp-image-6209\" srcset=\"https:\/\/pinyin.info\/news\/news_photos\/2012\/02\/didao_daodi.png 600w, https:\/\/pinyin.info\/news\/news_photos\/2012\/02\/didao_daodi-300x200.png 300w\" sizes=\"auto, (max-width: 500px) 100vw, 500px\" \/><\/a><\/div>\n<p>The Web site also has ugly Pinyin, with the <a href=\"http:\/\/chinese-linguipedia.org\/clk\/css\/in2.css\">CSS file for the Taiwan site<\/a> calling for Pinyin to be shown in SimSun, which is one of the <a href=\"https:\/\/pinyin.info\/news\/2012\/some-fonts-not-to-use-for-pinyin\/\">fonts it&#8217;s better <em>not<\/em> to use for Pinyin<\/a>. But the word parsing on the Web site is at least not <em>always<\/em> wrong. Here are a few examples. <\/p>\n<ul>\n<li>&#8220;&#36305;&#31070;&#20818;&#8221; is given as <span class=\"py\">p&#462;osh&#233;nr<\/span> (good).<\/li>\n<li>And apostrophes appear to be used correctly: e.g., <span class=\"py\">f&#224;n&#8217;&#257;n<\/span> (&#36009;&#23433;), <span class=\"py\">ch&#363;n&#8217;&#257;n<\/span> (&#26149;&#23433;), and <span class=\"py\">f&#275;i&#8217;&#257;n<\/span> (&#39131;&#23433;). <\/li>\n<li>But &#8220;&#31532;&#20108;&#26149;&#8221; is run together as &#8220;<span class=\"py\">d&#236;&#232;rch&#363;n<\/span>&#8221; (no <a href=\"https:\/\/pinyin.info\/news\/2008\/when-to-use-hyphens-in-hanyu-pinyin\/\">hyphen<\/a>) rather than as shown correctly as <span class=\"py\">d&#236;-&#232;r ch&#363;n<\/span>.<\/li>\n<li>And &#8220;&#19968;&#20491;&#38957;&#20841;&#20491;&#22823;&#8221; is given as <span class=\"py\">y&#237;&#609;e t&#243;u li&#462;n&#609;&#609;e d&#224;<\/span> (for Taiwan) and <span class=\"py\">y&#299;&#609;e t&#243;u li&#462;n&#609;&#609;e d&#224;<\/span> (for China). But <a href=\"https:\/\/pinyin.info\/news\/2011\/how-to-write-numbers-and-measure-words-in-hanyu-pinyin\/\" title=\"How to write measure words in Hanyu Pinyin\"><em>ge<\/em> is supposed to be written separately<\/a>. (The variation of tone for <em>yi<\/em> is in this case useful.)<\/li>\n<\/ul>\n<p>Still, my general impression from this is that we should not expect the forthcoming cross-strait dictionary to be very good. <\/p>\n<p>Further reading: <\/p>\n<ul>\n<li><a href=\"http:\/\/news.chinatimes.com\/wantdaily\/11052101\/112012020900212.html\"><span class=\"py\">M&#462; z&#466;ngt&#466;ng: f&#257;zh&#462;n li&#462;ng&#8217;&#224;n t&#333;ngy&#242;ng w&#233;nz&#236;<\/span><\/a> (&#39340;&#32317;&#32113;&#65306;&#30332;&#23637;&#20841;&#23736;&#36890;&#29992;&#25991;&#23383;), <em>China Times<\/em>, February 9, 2012<\/li>\n<li><a href=\"http:\/\/taipeitimes.com\/News\/front\/archives\/2012\/02\/09\/2003525026\">Ma applauds cross-strait online Chinese dictionary<\/a>, <em>Taipei Times<\/em>, February 9, 2012<\/li>\n<li><a href=\"http:\/\/www.chinapost.com.tw\/taiwan\/china-taiwan-relations\/2012\/02\/09\/331057\/p2\/Online-databases.htm\">Online databases detail cross-strait languages<\/a>, <em>China Post<\/em>, February 9, 2012<\/li>\n<li><a href=\"http:\/\/n.yam.com\/newtalk\/politics\/201202\/20120208262469.html\"><span class=\"py\">Z&#466;ngt&#466;ng: Zh&#333;nggu&#243; &#8220;chu&#299; l&#462;ba&#8221; d&#283;ngy&#250; T&#225;iw&#257;n &#8220;p&#257;i m&#462;p&#236;&#8221;<\/span><\/a> (&#32317;&#32113;&#65306;&#20013;&#22283;&#12300;&#21561;&#21895;&#21485;&#12301;&#31561;&#26044;&#21488;&#28771;&#12300;&#25293;&#39340;&#23617;&#12301;), Yam News, February 8, 2012<\/li>\n<\/ul>\n","protected":false},"excerpt":{"rendered":"<p>Last week, on the same day President Ma Ying-jeou accepted the resignation of a minister who made some drunken lewd remarks at a w&#283;iy&#225; (year-end office party), Ma was joking to the media about blow jobs. Classy. But it was &hellip; <a href=\"https:\/\/pinyin.info\/news\/2012\/new-database-of-cross-strait-differences-in-mandarin-goes-online\/\">Continue reading <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[4,12,15,92,41,106,28,95,885,32,138,20,3,122],"tags":[],"class_list":["post-6098","post","type-post","status-publish","format-standard","hentry","category-china","category-chinese","category-chinese-characters","category-dictionary","category-english","category-hanyu","category-languages","category-linguistics","category-ma-ying-jeou","category-mandarin","category-philology","category-pinyin","category-taiwan","category-victor-h-mair"],"_links":{"self":[{"href":"https:\/\/pinyin.info\/news\/wp-json\/wp\/v2\/posts\/6098","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/pinyin.info\/news\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/pinyin.info\/news\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/pinyin.info\/news\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/pinyin.info\/news\/wp-json\/wp\/v2\/comments?post=6098"}],"version-history":[{"count":91,"href":"https:\/\/pinyin.info\/news\/wp-json\/wp\/v2\/posts\/6098\/revisions"}],"predecessor-version":[{"id":6955,"href":"https:\/\/pinyin.info\/news\/wp-json\/wp\/v2\/posts\/6098\/revisions\/6955"}],"wp:attachment":[{"href":"https:\/\/pinyin.info\/news\/wp-json\/wp\/v2\/media?parent=6098"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/pinyin.info\/news\/wp-json\/wp\/v2\/categories?post=6098"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/pinyin.info\/news\/wp-json\/wp\/v2\/tags?post=6098"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}