Fast Normalization-Transformed Subsequence Matching in Time-Series Databases(Data Mining)
スポンサーリンク
概要
- 論文の詳細を見る
Normalization transform is known to be very useful for finding the overall trend of time-series data since it enables finding sequences with similar fluctuation patterns. Previous subsequence matching methods with normalization transform, however, would incur index overhead both in storage space and in update maintenance since they should build multiple indexes for supporting query sequences of arbitrary length. To solve this problem, we adopt a single-index approach in the normalization-transformed subsequence matching that supports query sequences of arbitrary length. For the single-index approach, we first provide the notion of inclusion-normalization transform by generalizing the original definition of normalization transform. To normalize a window, the inclusion-normalization transform uses the mean and the standard deviation of a subsequence that includes the window while the original transform uses those of the window itself. Next, we formally prove the correctness of the proposed normalization-transformed subsequence matching method that uses the inclusion-normalization transform. We then propose subsequence matching and index-building algorithms to implement the proposed method. Experimental results for real stock data show that our method improves performance by up to 2.5〜2.8 times compared with the previous method.
- 社団法人電子情報通信学会の論文
- 2007-12-01
著者
-
MOON Yang-Sae
Department of Computer Science, Kangwon National University
-
KIM Jinho
Department of Computer Science, Kangwon National University
-
Kim Jinho
Department Of Computer Science Kangwon National University
-
Moon Yang‐sae
Kangwon National Univ. Kor
-
Moon Yang-sae
Department Of Computer Science Kangwon National University
関連論文
- Efficient Storage and Querying of Horizontal Tables Using a PIVOT Operation in Commercial Relational DBMSs
- Physical Database Design for Efficient Time-Series Similarity Search
- Fourier Magnitude-Based Privacy-Preserving Clustering on Time-Series Data
- Hybrid Lower-Dimensional Transformation for Similar Sequence Matching
- Fast Normalization-Transformed Subsequence Matching in Time-Series Databases(Data Mining)
- A Data Cleansing Method for Clustering Large-Scale Transaction Databases
- Linear Detrending Subsequence Matching in Time-Series Databases
- A Fast Divide-and-Conquer Algorithm for Indexing Human Genome Sequences