A Combined Approach for de novo DNA Sequence Assembly of Very Short Reads
スポンサーリンク
概要
- 論文の詳細を見る
De novo DNA sequence assembly is very important in genome sequence analysis. In this paper, we investigated two of the major approaches for de novo DNA sequence assembly of very short reads: overlap-layout-consensus (OLC) and Eulerian path. From that investigation, we developed a new assembly technique by combining the OLC and the Eulerian path methods in a hierarchical process. The contigs yielded by these two approaches were treated as reads and were assembled again to yield longer contigs. We tested our approach using three real very-short-read datasets generated by an Illumina Genome Analyzer and four simulated very-short-read datasets that contained sequencing errors. The sequencing errors were modeled based on Illuminas sequencing technology. As a result, our combined approach yielded longer contigs than those of Edena (OLC) and Velvet (Eulerian path) in various coverage depths and was comparable to SOAPdenovo, in terms of N50 size and maximum contig lengths. The assembly results were also validated by comparing contigs that were produced by assemblers with their reference sequence from an NCBI database. The results show that our approach produces more accurate results than Velvet, Edena, and SOAPdenovo alone. This comparison indicates that our approach is a viable way to assemble very short reads from next generation sequencers.
論文 | ランダム
- ポリビニルアルコ-ルゲルの引張り試験〔英文〕
- Assessment of Groundwater Quality for Irrigation Use in Jamalpur and Sherpur Districts of Bangladesh
- Pneumoretroperitoneumによる腎周園炎の診断
- 783. 瀬戸川層群の石灰岩層から産出した中・後期始新世の浮遊性有孔虫群
- 2 コガタアカイエカを捕食するクモ類の水田における分布様式