Additional Selection of Extracted Terms for a Specific Area
スポンサーリンク
概要
- 論文の詳細を見る
In text mining of documents of a specific area, especially for generating a map of concepts or terms and a summary of concepts or terms, the quality of keywords strongly affects the results of analysis. A list of technical terms is available as keyword candidates. We can recognize terms in the corpus automatically using a scoring method based on statistics of compound nouns. However, because fractions of words or meaningless strings are also included in those term candidates, further selections are necessary. For such further selection, we consider a method to obtain overlapping terms between the two groups of terms that are extracted from two independent corpora of the same area. For the experimental selection of terms, three target areas are specified: livestock raising, fruit farming, and vegetable gardening. For each area, two groups of documents are collected. The term candidates are extracted from these corpora using a scoring method based on statistics of compound nouns. The terms overlapping the two groups are extracted. After this selection procedure, the proportion of unsuitable terms is lower. From an efficiency viewpoint, the selection procedure improves selection. In addition, the procedure provides the advantage that it is independent from subjective decisions related to manual selection.
- 農業情報学会の論文
著者
-
Ninomiya Seishi
National Agricultural Research Center
-
Ninomiya Seishi
National Agricultural Res. Center
-
Horyu Daisuke
National Agricultural Research Center
関連論文
- CROWIS: A System for Sharing and Integrating Crop and Weather Data
- Diallel Analysis of Leaf Shape Variations of Citrus Varieties Based on Elliptic Fourier Descriptors
- Statistical Models for Prediction of Dry Weight and Nitrogen Accumulation Based on Visible and Near-Infrared Hyper-Spectral Reflectance of Rice Canopies
- AntMap : Constructing Genetic Linkage Maps Using an Ant Colony Optimization Algorithm
- Additional Selection of Extracted Terms for a Specific Area
- Plant Shape Discrimination of Several Taxa without Shape Feature Extraction Using Neural Networks with Image Input
- RIAS: an Approach to Provide Internet-Accessible Image Analysis Service