5. Computational Issues in Statistical Data Analysis:EVALUATION OF EXECUTION TIME ON DATA ANALYSIS WITH PARALLEL VIRTUAL MACHINE
スポンサーリンク
概要
- 論文の詳細を見る
We have to analyze enormous data in many cases. A personal computer can handle them, however, it would take a lot of time even if today's personal computer would have good specifications. Anyway, we have to seek a faster analysis environment. A parallel computer which has large computing power will satisfy us.Parallel Virtual Machine (PVM) is one of the popular computer libraries to make many computers, connected via computer network, one (virtual) parallel one. If we could use thousands of connected computers concurrently, we would analyze various data quickly with PVM.We have inve s tigated PVM features through many simulations and found some interesting ones. Accordingly we construct a generic experimental model of execution time in PVM. This model is applicable for most methods on data analysis which can be implemented with master-slave style, in other words, which can be divided into one main part and some sub parts.In terms of this model, we evaluate turn-around time, related to amount of transferred data, load (described by execution time) on each slave computer and number of (part-) jobs. Our model is so generic that we can estimate execution time for such analysis methods as Bootstrap, κ-means, etc. We can also derive how many computers are required if we analyze data in time.In this paper, we summarize our work with numerical examples and discuss some points to use our framework in practice.
- 日本計算機統計学会の論文
日本計算機統計学会 | 論文
- MCMC法に基づく多変量階層線形データの分析(セッション5A(学生研究発表賞セッションII))
- コンテンツ・アプリケーション連動型複合統計教材開発と授業への展開(セッション4A)
- 初等中等教育を支援するコンテンツ・アプリケーション連動型複合統計教材の開発(セッション6A)
- 非対称可変分類法のシミュレーションによる評価
- 級内相関係数に関するパーミュテーションテストについて