Simple Weighting Techniques for Query Expansion in Biomedical Document Retrieval(Contents Technology and Web Information Systems)
Abstract
In this paper, we propose two weighting techniques to improve the performance of query expansion in biomedical document retrieval, especially when a short biomedical term in a query is expanded with its synonymous multi-word terms. When a query contains synonymous terms of different lengths, a traditional IR model ranks documents containing a longer term more highly, because a longer term has more chances to match the query. However, such a preference is clearly inappropriate and often yields unsatisfactory results. To alleviate this biased weighting problem, we devise a method of normalizing the weights of the query terms in a long multi-word biomedical term, and a method of discriminating between terms by using inverse terminology frequency, a novel statistic estimated in the query domain. Experimental results on the MEDLINE corpus show that our two simple techniques improve retrieval performance by correcting the undue preference for long multi-word terms in an expanded query.
- Paper of the Institute of Electronics, Information and Communication Engineers (IEICE)
- 2007-11-01
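The two ideas in the abstract can be illustrated with a small sketch. The abstract does not give the actual formulas, so everything below is an assumption made for illustration: the function names are hypothetical, the length normalization simply splits a total weight of 1 across the words of a term, and the inverse terminology frequency is defined by analogy with inverse document frequency as log(N / n_w), where N is the number of terminologies in a domain list and n_w is the number of them containing the word. It is not the authors' method.

```python
import math
from collections import Counter


def length_normalized_weights(term):
    """Split a total weight of 1 evenly over the words of one term.

    Assumption: a multi-word synonym should contribute the same total
    weight as a single-word synonym, so each word gets count / length.
    """
    words = term.lower().split()
    counts = Counter(words)
    return {w: c / len(words) for w, c in counts.items()}


def inverse_terminology_frequency(terminologies):
    """Assumed IDF-like statistic computed over a list of domain terminologies.

    itf(w) = log(N / n_w), where n_w is the number of terminologies
    that contain the word w at least once.
    """
    n = len(terminologies)
    containing = Counter()
    for term in terminologies:
        containing.update(set(term.lower().split()))
    return {w: math.log(n / c) for w, c in containing.items()}


def itf_weighted_term(term, itf):
    """Distribute a total weight of 1 over a term's words in proportion to itf.

    Falls back to an even split when no word has a positive itf value.
    """
    words = term.lower().split()
    raw = [itf.get(w, 0.0) for w in words]
    total = sum(raw)
    weights = Counter()
    for w, r in zip(words, raw):
        weights[w] += r / total if total > 0 else 1.0 / len(words)
    return dict(weights)


# Hypothetical toy data: a short term and its multi-word synonym.
domain_terms = ["AD", "alzheimer disease", "parkinson disease", "heart disease"]
itf = inverse_terminology_frequency(domain_terms)

even = length_normalized_weights("alzheimer disease")
weighted = itf_weighted_term("alzheimer disease", itf)
short = itf_weighted_term("AD", itf)
```

In this toy setup both synonyms carry the same total weight, so the longer term no longer gains an advantage from matching more words, and the common word "disease" receives less weight than the more specific "alzheimer".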
Authors
-
RIM Hae-Chang
Korea University
-
Park So‐young
Sangmyung Univ. Seoul Kor
-
Park So-young
Sangmyoung University
-
Rim Hae‐chang
Korea Univ. Kor
-
Rim Hae
Department Of Computer Science Korea University
-
Kim Sang-bum
Nhn Co.
-
Song Young-in
Korea University
-
Rim Hae-chang
Department Of Computer Science Engineering Korea University
-
HAN Kyoung-Soo
Div. of Computer Engineering, Sungkyul University
-
HAN Kyoung-Soo
SK Telecom
-
Song Young-in
Dept. Of Computer And Radio Communications Engineering Korea Univ.
-
Han Kyoung-soo
Div. Of Computer Engineering Sungkyul University
-
Kim Sang
Department Of Computer Science Korea University
Related papers
- Three-Phase Text Error Correction Model for Korean SMS Messages
- Automatic Acronym Dictionary Construction Based on Acronym Generation Types
- Utilizing the Web for Automatic Word Spacing
- Computing Word Semantic Relatedness for Question Retrieval in Community Question Answering
- Incorporating Frame Information to Semantic Role Labeling
- Simple Weighting Techniques for Query Expansion in Biomedical Document Retrieval(Contents Technology and Web Information Systems)
- A Definitional Question Answering System Based on Phrase Extraction Using Syntactic Patterns(Natural Language Processing)
- Topic Document Model Approach for Naive Bayes Text Classification(Natural Language Processing)
- Minimizing Human Intervention for Constructing Korean Part-of-Speech Tagged Corpus