Optimal Local Dimension Analysis of Latent Semantic Indexing on Query Neighbor Space(Special Issue on Text Processing for Information Access)
スポンサーリンク
概要
- 論文の詳細を見る
In this paper, we present our investigation of Latent Semantic Indexing (LSI) on the local query regions for solving the computation restrictions of the LSI on the global information space. Through the experiments with different SVD dimensionality on the local query regions, the results show that low-dimensional LSI can achieve much better precision than VSM and similar precision to global LSI. Such small SVD factors indicate that there is an almost linear surface in the local query regions. The largest or the two largest singular vectors have the ability to capture such a linear surface and benefit the particular query. In spite of the fact that Local LSI analysis needs to perform the Singular Value Decomposition (SVD) computation for each query, the surprisingly small requirements of the SVD dimension resolve the computation restrictions of LSI for large scale IR tasks. Moreover, on the condition that several relevant sample documents are available, application of low dimensional LSI for these documents can obtain comparable precision with the Local RF in a different manner.
- 社団法人電子情報通信学会の論文
- 2003-09-01
著者
-
Umemura K
Dept. Of Information And Computer Sciences Toyohashi University Of Technology
-
Umemura Kyoji
Dept. Of Information And Computer Sciences Toyohashi University
-
XU Yinghui
Dept. of Information and Computer Sciences, Toyohashi University of Technology
-
Xu Yinghui
Dept. Of Information And Computer Sciences Toyohashi University Of Technology
-
Umemura Kyoji
Dept. Of Computer Science And Eng. Toyohashi University Of Tech.
関連論文
- Optimal Local Dimension Analysis of Latent Semantic Indexing on Query Neighbor Space(Special Issue on Text Processing for Information Access)
- Traffic Anomaly Analysis and Characteristics on a Virtualized Network Testbed
- Factor Controlled Hierarchical SOM Visualization for Large Set of Data(Special Issue on Text Processing for Information Access)
- Analytical Modeling of Network Throughput Prediction on the Internet