Fast n-gram Assortment Construction for Filtering Hazardous Information
スポンサーリンク
概要
- 論文の詳細を見る
Document filtering systems based on pattern matching require well-chosen features to provide high accuracy. Using high level features such as bi-grams can boost accuracy, but require large amount of calculation time to find the optimal bi-gram set. In this paper, we propose a method to find an approximate optimal bi-gram set from a given set of uni-grams, while ensuring the selected assortments are independent from one another. Our method drastically reduced the amount of calculation time in comparison of using exhaustive calculation methods, while maintaining a much higher accuracy in terms of precision.
- 2009-11-09
著者
-
Tadashi Yanagihara
KDDI R&D Laboratories
-
Kazushi Ikeda
KDDI R&D Laboratories
-
Kazunori Matsumoto
KDDI R&D Laboratories
-
Yasuhiro Takishima
KDDI R&D Laboratories
-
Kazushi Ikeda
Kddi R&d Laboratories
-
Takishima Yasuhiro
Kddi R&d Laboratories
-
Yasuhiro Takishima
Kddi R&d Laboratories
-
Tadashi Yanagihara
Kddi R&d Laboratories
-
Kazunori Matsumoto
Kddi R&d Laboratories
関連論文
- Fast n-gram Assortment Construction for Filtering Hazardous Information
- Automatic query expansion and classification for television related tweet collection