China’s Andy Rooney Has Some Funny Opinions About How Great The Chinese Government Is
Monthly Archives: July 2009
The Quintessential Japanese Problem
I should be happy about the job I got today. It is simple; build a few Regexes, and simplify the interface to a CMS.
Simple – yet stupid.
First, why is this difficult system being used? (Politics of a sunk cost)
Second, why is the design of the interface so utterly bad? (Japan only product therefore it doesn’t have to compete in an unbiased market)
Finally, is it morally right to sell this band-aid as a “solution” when the problem is actually a deeper malaise? (No)
I feel dirty.
Google, Bi-Culturalism, and Japan.
I was having an interesting conversation with Paul P on his blog about Google and short term vs. long term hiring practices.
My belief comes down to cross-cultural pollination being inevitable in Japan. As a result, bilingual / bi-cultural Japanese would be far more likely to have insight into future trends than hiring monolingual Japanese or monolingual Gaikokujin.
It’s festival season in Japan.
Like everything else, the festivals run exactly on time; 6:00-7:40. I still think there is something perverse about scheduling enjoyment, but maybe that is just me…
How many radicals are there for the basic 2000 Japanese Kanji anyway?
I am working on this Kanji dictionary program that helps sort out similar-looking (but different-meaning) words. I found an open source dictionary (kradfile) and was doing some analysis on it when I came across a total of 252 radicals (shown below). However, a popular online list has 214 radicals. What’s up?
[0] => 匕 [1] => ノ [2] => 勹 [3] => ヨ [4] => äº [5] => 厶 [6] => å· [7] => 一 [8] => | [9] => 二 [10] => ム[11] => è¡£ [12] => ⺅ [13] => å£ [14] => 大 [15] => 矢 [16] => 廾 [17] => æ–‡ [18] => æ–‰ [19] => å¹¶ [20] => 木 [21] => 王 [22] => 羊 [23] => 耒 [24] => æ—¥ [25] => 白 [26] => å…¥ [27] => 冂 [28] => 凵 [29] => å„¿ [30] => 冖 [31] => 冫 [32] => 刀 [33] => å·¥ [34] => 丶 [35] => 力 [36] => ä¹ [37] => 生 [38] => å [39] => åˆ [40] => å°¸ [41] => å·¾ [42] => 父 [43] => ç”° [44] => 彑 [45] => æ°´ [46] => 月 [47] => 西 [48] => é–€ [49] => 亅 [50] => å½³ [51] => 金 [52] => 土 [53] => é•· [54] => 女 [55] => 宀 [56] => 耳 [57] => 禸 [58] => 虫 [59] => 夂 [60] => â»– [61] => 人 [62] => 寸 [63] => 厂 [64] => å±± [65] => å¹² [66] => 而 [67] => 鬼 [68] => å° [69] => 示 [70] => ç¬ [71] => é³¥ [72] => å‡ [73] => 夕 [74] => 豆 [75] => 心 [76] => å¿„ [77] => 爪 [78] => 扌 [79] => 手 [80] => 臼 [81] => éš¹ [82] => åœ [83] => å [84] => 幺 [85] => 攵 [86] => 至 [87] => 戈 [88] => 刂 [89] => 𠆢 [90] => ç©´ [91] => è°· [92] => å±® [93] => 爿 [94] => ç›® [95] => ç½’ [96] => 自 [97] => 竹 [98] => 高 [99] => æ¬ [100] => æ°µ [101] => 广 [102] => ç¾½ [103] => é© [104] => ç« [105] => å© [106] => 士 [107] => 牛 [108] => ä¹™ [109] => çŠ [110] => 屯 [111] => ⺾ [112] => ç–’ [113] => è² [114] => 禾 [115] => é [116] => ç¼¶ [117] => 見 [118] => 毛 [119] => 亀 [120] => 世 [121] => ç±³ [122] => 戸 [123] => 糸 [124] => è¿ [125] => ⺌ [126] => 匚 [127] => 臣 [128] => ⺹ [129] => æ° [130] => 舌 [131] => 舟 [132] => é’ [133] => 風 [134] => 乃 [135] => 馬 [136] => éš¶ [137] => æ¢ [138] => â» [139] => çš¿ [140] => è¡€ [141] => 衤 [142] => 雨 [143] => è§’ [144] => 言 [145] => 豸 [146] => è¶³ [147] => 車 [148] => é…‰ [149] => 食 [150] => 骨 [151] => 比 [152] => 黽 [153] => 弓 [154] => ç–‹ [155] => æ–¤ [156] => 彡 [157] => 井 [158] => ä¹… [159] => å»´ [160] => 也 [161] => ç«‹ [162] => è¾› [163] => マ [164] => 五 [165] => 亡 [166] => æ›° [167] => å›— [168] => 弋 [169] => æ–¹ [170] => åŠ [171] => 支 [172] => 犬 [173] => 用 [174] => 艮 [175] => 玄 [176] => æ¹ [177] => ユ [178] => 冊 [179] => æ¯ [180] => 毋 [181] => è¾° [182] => å·² [183] => å·´ [184] => 里 [185] => å… [186] => éž [187] => 奄 [188] => è™ [189] => 甘 [190] => èµ° [191] => 韋 [192] => å‹¿ [193] => 音 [194] => é¢ [195] => 舛 [196] => 豕 [197] => å“ [198] => 黄 [199] => ç™¶ [200] => å°¢ [201] => å°¤ [202] => æ— [203] => 齊 [204] => 辶 [205] => ç„¡ [206] => é»’ [207] => 鹿 [208] => å…ƒ [209] => 牙 [210] => å·› [211] => 岡 [212] => 矛 [213] => é¼» [214] => 石 [215] => 身 [216] => 殳 [217] => 瓜 [218] => 肉 [219] => 行 [220] => 鬲 [221] => 麻 [222] => æ¯ [223] => 首 [224] => 赤 [225] => éš [226] => 竜 [227] => 爻 [228] => æ–— [229] => çš® [230] => 片 [231] => æ»´ [232] => éŸ [233] => 釆 [234] => å·¨ [235] => é‚‘ [236] => æ°” [237] => é¼ [238] => 礻 [239] => 色 [240] => 香 [241] => é¹µ [242] => é¾ [243] => 瓦 [244] => 鼓 [245] => é» [246] => 飛 [247] => 髟 [248] => 鬥 [249] => 鬯 [250] => 麦 [251] => 黹 [252] => 鼎
If you want to duplicate the result yourself, here is the PHP code to use.
$krad_input = file_get_contents(dirname(__FILE__) . "/krad.txt");
$krad_array = explode("\n", $krad_input);
unset($krad_input);
$total = count($krad_array);
$i = 0;
$radicalarray = array();
$stored = array();
while ($i < $total){
$string = $krad_array[$i];
if (substr($string, 0, 1) != "#"){
$kanji_and_radicals = extract_kanji_and_radicals($string);
//print_r($kanji_and_radicals);
foreach($kanji_and_radicals['radicals'] as $radical){
if (!isset($stored[$radical])){
$stored[$radical] = 1;
$radicalarray[] = $radical;
}
}
// We know the Kanji. We know its radicals. We can insert it into the database.
}
$i++;
}
print_r($radicalarray);
\n
function extract_kanji_and_radicals($string){
$regex = "`(\s:\s|\s+)`imus";
$matches = preg_split($regex, $string);
$return['kanji'] = $matches[0];
array_shift($matches);
$return['radicals'] = $matches;
return $return;
}
You can get the dictionary file from Kanjicafe.com.
-edit-
Curiouser and curiouser, it seems that the classic number is 212. There are a bunch of radicals in there that simply do not exist classically; perhaps they are an amalgamation of the radicals available in all the various Japanese-English dictionaries? (In some cases the radicals were altered to more closely suit the kanji itself)