Authorship Attribution Using Compression Programs
Abstract
Original paper
Benedetto et al. recently confirmed the validity of a method for measuring similarity using data compression software. Despite its potential, this method has not yet been applied to the field of information science. The present study proposes CIR, a modified method that uses an improved compression ratio, and describes two experiments on authorship attribution using data from modern Japanese literature. The first experiment compares the results of applying CIR and Benedetto's method to test collections of modified (fixed-length) data, using a procedure similar to that described by Matsuura et al. The second experiment is based on original (variable-length) data. The first experiment showed an average precision of 97.7% for CIR and 90.5% for Benedetto's method. CIR thus improves on the best method described by Matsuura et al. The second experiment confirmed the e
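The general idea of compression-based similarity that the abstract refers to can be sketched as follows. This is only a minimal illustration, assuming Python's `zlib` as the compressor: it measures how many extra compressed bytes a fragment costs when appended to each candidate author's reference text, and picks the author with the smallest cost. It is not the CIR method, whose definition is not given in the abstract, and the names `compressed_size`, `extra_cost`, and `attribute` are hypothetical.

```python
import zlib
from typing import Mapping


def compressed_size(data: bytes) -> int:
    """Size in bytes of `data` after zlib compression at the highest level."""
    return len(zlib.compress(data, 9))


def extra_cost(reference: bytes, fragment: bytes) -> int:
    """Extra compressed bytes needed when `fragment` is appended to `reference`.

    A small value suggests the fragment shares many substrings with the reference.
    """
    return compressed_size(reference + fragment) - compressed_size(reference)


def attribute(unknown: str, corpora: Mapping[str, str]) -> str:
    """Return the author whose reference text encodes `unknown` most cheaply.

    `corpora` maps an author name to that author's reference text
    (an assumed data layout for this sketch, not taken from the paper).
    """
    fragment = unknown.encode("utf-8")
    scores = {
        author: extra_cost(text.encode("utf-8"), fragment)
        for author, text in corpora.items()
    }
    return min(scores, key=scores.get)


if __name__ == "__main__":
    corpora = {
        "A": "alpha beta gamma delta " * 50,
        "B": "one two three four " * 50,
    }
    print(attribute("alpha beta gamma delta " * 5, corpora))
```

One caveat on the choice of compressor: DEFLATE, which `zlib` uses, only looks back over a 32 KB window, so with long reference texts only the tail of the reference influences the score. Keeping the fragment short relative to the reference also matters, since a long fragment starts compressing against itself.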
Authors
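Related papers
- Application of Optimal Foraging Theory to Information-Seeking Behavior: A Proposal for Measuring the Amount of Information
- User Interfaces for Searching DIALOG via the Internet: Focusing on the Relevance-Ranked Output Function
- Authorship Attribution Using Compression Programs
- Evaluation of Combined Information Retrieval Methods Through Retrieval Experiments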