Character Code for Japanese Text Processing
Abstract
The Japanese standard character set JIS X0208 has been widely used in Japanese computer systems for over ten years. It contains 6877 characters, including 6353 kanji (Chinese characters), and seems nearly sufficient for everyday Japanese text processing. However, the total number of kanji exceeds fifty thousand, so the missing-character problem is inevitable in some application fields. This paper discusses problems surrounding character codes from four viewpoints: the Japanese writing system, the scripts of the Japanese writing system, interchange character sets, and the internal codes of operating systems. First, the Japanese writing system is briefly introduced, and the kanji script and its variant forms are discussed. Then topics surrounding JIS X0208 and the new standard character set are introduced, and the internal code designs of JEF, Shift-JIS, UNIX EUC-JAE, and TRON TAD are described. The problems lie mainly in two points. The vast number of kanji and the ambiguity of their graphic forms cause problems in character set control operation, and the growing number of character sets may complicate internal code design. Resolving these problems will require efforts in both linguistics and computer science.
- Paper of the Information Processing Society of Japan (IPSJ)
- 1990-03-31
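As an illustration of the internal codes named in the abstract, the following is a minimal sketch of how one JIS X0208 row/cell position can be mapped to two-byte Shift-JIS and EUC forms. It uses the commonly documented byte arithmetic, not code taken from the paper; the function names `kuten_to_jis`, `kuten_to_euc`, and `kuten_to_sjis` are hypothetical, and JEF and TRON TAD are not covered.

```python
def kuten_to_jis(row, cell):
    """Map a JIS X0208 row/cell pair (each 1..94) to the two 7-bit JIS bytes."""
    if not (1 <= row <= 94 and 1 <= cell <= 94):
        raise ValueError("row and cell must both be in 1..94")
    return row + 0x20, cell + 0x20


def kuten_to_euc(row, cell):
    """EUC form: the two JIS bytes with the high bit set on each."""
    j1, j2 = kuten_to_jis(row, cell)
    return bytes([j1 | 0x80, j2 | 0x80])


def kuten_to_sjis(row, cell):
    """Shift-JIS form: two rows share one lead byte; the trail byte is shifted."""
    j1, j2 = kuten_to_jis(row, cell)
    # Lead byte falls in 0x81..0x9F for the lower rows and 0xE0.. for the rest.
    s1 = ((j1 + 1) >> 1) + (0x70 if j1 < 0x5F else 0xB0)
    if j1 & 1:
        # Odd JIS lead byte: trail byte in 0x40..0x9E, skipping 0x7F.
        s2 = j2 + (0x1F if j2 < 0x60 else 0x20)
    else:
        # Even JIS lead byte: trail byte in 0x9F..0xFC.
        s2 = j2 + 0x7E
    return bytes([s1, s2])


if __name__ == "__main__":
    # Row 16, cell 1 is the first kanji of JIS X0208 (亜).
    print(kuten_to_euc(16, 1).hex())   # b0a1
    print(kuten_to_sjis(16, 1).hex())  # 889f
```

The sketch covers only positions inside the 94 x 94 grid; a kanji outside JIS X0208 has no row/cell position at all, which is the missing-character problem the abstract refers to.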
Authors
- MIYAZAWA Akira, National Institute of Advanced Industrial Science and Technology (AIST)
- Miyazawa A, National Inst. Materials And Chemical Res. Ibaraki
Related papers
- Microwave-Assisted Reduction of Acetophenones Using Ni-Al Alloy in Water
- Intermolecular [2+2] Photocycloaddition of Formyl- and Cyano-Substituted Diphenylhexatrienes in the Solid State
- Character Code for Japanese Text Processing
- Microwave-Assisted Direct H/D Exchange Reactions of Dimetridazole and Metronidazole in Alkaline D_2O