Modification of LZSS by Using Structures of Hangul Characters for Hangul Text Compression
スポンサーリンク
概要
- 論文の詳細を見る
This paper suggests modified LASS which is suitable for compressing Hangul data by Hangul character token and the string token with small size based on Hangul properties.The Hangul properties can be described in 2 ways. 1) The structure of a Hangul character consists of 3 1etters: The first sound letter, the middle sound letter, and the last sound letter which are called Cho-seong, Jung-seong, and Jong-seong, respectively. 2) The code of Hangul is represented by 2 bytes. The first property is used for making the character token processing Hangul characters which occupies most of the unmatched characters. That is, the unmatched Hangul characters are replaced with one Hangul character token represented by Huffman codes of Cho-seong, Jung-seong, and Jong-seong in regular sequence,instead of 2 character tokens. The second property is used to shorten the size of the string token processing matched string. In other words, since more than 75% of Hangul data are Hangul and Hangul codes are constructed in 2 bytes, the addresses of the window of LZSS can be assigned in 2-byte unit. As a result, the distance field and the length field of the string token can be lessened by one bit each. After compressing Hangul data through these tokens, about 3% of improvement could be made in compression ratio.information theory, coding theory, text compression,hangul processing
- 社団法人電子情報通信学会の論文
- 1996-11-25
著者
-
Lee Jae
The Department Of Computer Science Hallym University
-
SUNG Keong
the Department of Electronics Engineering, Seoul National University
-
Sung Keong
The Department Of Electronics Engineering Seoul National University
関連論文
- Improvement of Recognition Performance for the Fuzzy ARTMAP Using Average Learning and Slow Learning
- Blind Algorithm for Decision Feedback Equalizer
- Microwave Properties of Sapphire Resonators with a Gap and Their Applicability for Measurements of the Intrinsic Surface Impedance of Thin Superconductor Films( Superconducting High-frequency Devices)
- Delayed Hyponatremia Following Transsphenoidal Surgery for Pituitary Adenoma
- Modification of LZSS by Using Structures of Hangul Characters for Hangul Text Compression
- Adaptive Resource Allocation for a Two-Way OFDM Relay Network with Fairness Constraints