General-Purpose Search Techniques for Genomic Text
スポンサーリンク
概要
- 論文の詳細を見る
Fast and accurate techniques for searching large genomic text collections are becoming increasingly important. While Information Retrieval is well-established for general-purpose text retrieval tasks, less is known about retrieval techniques for genomic text data. In this paper, we investigate and propose general-purpose search techniques for genomic text. In particular, we show that significant improvements can result from manual term expansion, where additional words are added to queries and documents. We also show that collection partitioning, where documents are included in or excluded from the search space, is highly effective for some tasks. We experiment with our techniques on four text collections and show, for example, that the collection partitioning scheme can improve effectiveness by almost 9.5% over a standard retrieval baseline. We conclude by recommending techniques that can be considered for most genomic search tasks.
- 日本バイオインフォマティクス学会の論文
日本バイオインフォマティクス学会 | 論文
- Performance Improvement in Protein N-Myristoyl Classification by BONSAI with Insignificant Indexing Symbol
- A combined pathway to simulate CDK-dependent phosphorylation and ARF-dependent stabilization for p53 transcriptional activity
- A versatile petri net based architecture for modeling and simulation of complex biological processes
- XML documentation of biopathways and their simulations in Genomic Object Net
- Prediction of debacle points for robustness of biological pathways by using recurrent neural networks