A Method for Extracting Sentences Describing Protein Interactions from Literature Using Protein 3D Structure Data
Abstract
Because a protein expresses its function through interaction with other substances, it is vital to build a database of protein interactions. Information on protein interactions is spread across thousands of papers, so extracting all of it manually is nearly impossible. Extraction systems based on template matching have already been developed, but they cannot match every sentence that carries interaction information because sentence structures are too complex.

We propose a method of extracting sentences with interaction information that is independent of sentence structure. In a protein-compound complex structure, an interacting residue lies close to its partner. The distance between them can be calculated from the structure data in the PDB database, and a short distance suggests that the sentences mentioning them may describe an interaction. In a free-protein structure, the distance cannot be calculated because the coordinates of the protein's partner are not registered in the structure data. Hence, we use the structure data of a homologous protein that is complexed with the protein's partner.

The proposed method was applied to seven papers on protein-compound complexes and four papers on free proteins, obtaining F-measures of 71% and 72%, respectively.
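The distance criterion described in the abstract can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration, not the authors' implementation: the function names, the toy coordinates, and the 4.0 Å cutoff are hypothetical, and atoms are given as plain (x, y, z) tuples rather than parsed from a PDB file. The F-measure helper uses the standard definition as the harmonic mean of precision and recall.

```python
import math

def min_distance(residue_atoms, partner_atoms):
    """Smallest Euclidean distance between any residue atom and any partner atom."""
    return min(math.dist(a, b) for a in residue_atoms for b in partner_atoms)

def is_near(residue_atoms, partner_atoms, cutoff=4.0):
    """True if the residue lies within `cutoff` of the partner.

    The 4.0 cutoff (in angstroms) is an assumed value, not taken from the paper.
    """
    return min_distance(residue_atoms, partner_atoms) <= cutoff

def f_measure(precision, recall):
    """Harmonic mean of precision and recall (0.0 if both are zero)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)

# Toy coordinates (hypothetical), in angstroms.
residue_atoms = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
partner_atoms = [(4.0, 0.0, 0.0), (10.0, 0.0, 0.0)]

print(min_distance(residue_atoms, partner_atoms))
print(is_near(residue_atoms, partner_atoms))
```

Under this sketch, a sentence that mentions both the residue and the partner would be kept as a candidate interaction sentence when `is_near` returns true. For a free protein, the partner coordinates would have to come from a homologous complex structure, as the abstract describes.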
- Paper of The Institute of Electrical Engineers of Japan (IEEJ)
- 2005-05-01
Authors
Related papers
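- An Efficient Method for Searching Similar Proteins Using Distances Between Molecular Surface Data
- A Protein Molecular Surface Comparison Method Using Attributed Normal Vectors
- A Method for Identifying Named Entities in Literature Using GenomeNet Search
- A Method for Extracting Sentences Describing Protein Interactions from Literature Using Protein 3D Structure Data
- A Method for Extracting Protein Active-Site Information from Literature Using Template Matching and Anaphora Resolution (Bioinformatics) (Information Systems Papers)
- A Method for Extracting Protein Interaction Information from Literature Using Interatomic Distance Information Based on Protein 3D Structure Data (Information Extraction)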