Effectiveness of Passage-Based Document Retrieval for Short Queries (Special Issue on Text Processing for Information Access)
Abstract
Document retrieval is a fundamental task for intelligent access to the huge amount of information stored in documents. Despite its long research history, it remains a hard task, especially when lengthy documents are retrieved with very short queries (a few keywords). For the retrieval of long documents, methods called passage-based document retrieval have proven to be effective. In this paper, we experimentally show that a passage-based method based on window passages is also effective for dealing with short queries, provided that documents are not too short. We employ a method called "density distributions" as the window-passage method and compare it with three conventional methods: the simple vector space model, pseudo relevance feedback, and latent semantic indexing. We also compare it with a passage-based method based on discourse passages.
- Paper of the Institute of Electronics, Information and Communication Engineers (IEICE)
- 2003-09-01
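The abstract names "density distributions" as a window-passage method but does not say how the score is computed. The following is a minimal sketch of one common way to score a document by window passages, not the authors' implementation. It assumes a Hann-shaped window, a fixed window width, and a score equal to the peak density of weighted query terms. The names `hann_window`, `density_score`, `query_weights`, and `width` are hypothetical.

```python
import math


def hann_window(width):
    """Hann weights for offsets -width//2 .. width//2 (assumed window shape)."""
    half = width // 2
    return [0.5 * (1.0 + math.cos(2.0 * math.pi * x / width))
            for x in range(-half, half + 1)]


def density_score(doc_terms, query_weights, width=20):
    """Score a document by the peak density of query terms.

    doc_terms     -- list of tokens in document order
    query_weights -- dict mapping each query term to a weight (e.g. idf)
    width         -- window width in tokens (an assumption, not from the paper)
    """
    # Weight of the query term at each position, 0.0 for non-query terms.
    hits = [query_weights.get(term, 0.0) for term in doc_terms]
    window = hann_window(width)
    half = width // 2

    best = 0.0
    for center in range(len(hits)):
        # Smooth the hit sequence with the window centered at this position.
        total = 0.0
        for k, weight in enumerate(window):
            pos = center + k - half
            if 0 <= pos < len(hits):
                total += weight * hits[pos]
        best = max(best, total)
    return best
```

With this sketch, a document in which the query terms occur close together receives a higher score than one in which the same terms are scattered far apart, which is the intuition behind using passages rather than whole documents for short queries.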
Authors
- Kise Koichi, Dept. of Computer and Systems Sciences, Graduate School of Engineering, Osaka Prefecture University
- Dengel Andreas, German Research Center for Artificial Intelligence (DFKI)
- Junker Markus, German Research Center for Artificial Intelligence (DFKI)
- Matsumoto Keinosuke, Dept. of Computer and Systems Sciences, Graduate School of Engineering, Osaka Prefecture University
Related papers
- 20 Years of the German Research Center for Artificial Intelligence (DFKI): The Path to Success and the People Who Made It Possible
- Position detection for a camera pen using LLAH and dot patterns (Pattern Recognition and Media Understanding)
- Object detection in images with cluttered background by using local features and their configuration (Pattern Recognition and Media Understanding)
- Position detection for a camera pen using LLAH and dot patterns (Human Information Processing)
- Effectiveness of Passage-Based Document Retrieval for Short Queries (Special Issue on Text Processing for Information Access)
- Automatic Word Ground Truth Generation for Camera Captured Documents