Skip to main content

Research Repository

Advanced Search

Efficient Shapelet Discovery for Time Series Classification

Li, Guozhong; Choi, Byron; Xu, Jianliang; Bhowmick, Sourav S.; Chun, Kwok Pan; Wong, Grace Lai Hung

Efficient Shapelet Discovery for Time Series Classification Thumbnail


Authors

Guozhong Li

Byron Choi

Jianliang Xu

Sourav S. Bhowmick

Profile Image

Dr Kwok Chun Kwok.Chun@uwe.ac.uk
Lecturer in Environmental Managment

Grace Lai Hung Wong



Abstract

Time-series shapelets are discriminative subsequences, recently found effective for time series classification (tsc). It is evident that the quality of shapelets is crucial to the accuracy of tsc. However, major research has focused on building accurate models from some shapelet candidates. To determine such candidates, existing studies are surprisingly simple, e.g., enumerating subsequences of some fixed lengths, or randomly selecting some subsequences as shapelet candidates. The major bulk of computation is then on building the model from the candidates. In this paper, we propose a novel efficient shapelet discovery method, called bspcover, to discover a set of high-quality shapelet candidates for model building. Specifically, bspcover generates abundant candidates via Symbolic Aggregate approXimation with sliding window, then prunes identical and highly similar candidates via Bloom filters, and similarity matching, respectively. We next propose a $p$p-Cover algorithm to efficiently determine discriminative shapelet candidates that maximally represent each time-series class. Finally, any existing shapelet learning method can be adopted to build a classification model. We have conducted extensive experiments with well-known time-series datasets and representative state-of-the-art methods. Results show that bspcover speeds up the state-of-the-art methods by more than 70 times, and the accuracy is often comparable to or higher than existing works.

Citation

Li, G., Choi, B., Xu, J., Bhowmick, S. S., Chun, K. P., & Wong, G. L. H. (2022). Efficient Shapelet Discovery for Time Series Classification. IEEE Transactions on Knowledge and Data Engineering, 34(3), 1149-1163. https://doi.org/10.1109/TKDE.2020.2995870

Journal Article Type Article
Acceptance Date Jun 2, 2022
Online Publication Date May 19, 2022
Publication Date 2022-03
Deposit Date Jun 14, 2022
Publicly Available Date Jun 16, 2022
Journal IEEE Transactions on Knowledge and Data Engineering
Print ISSN 1041-4347
Electronic ISSN 1558-2191
Publisher Institute of Electrical and Electronics Engineers
Peer Reviewed Peer Reviewed
Volume 34
Issue 3
Pages 1149-1163
DOI https://doi.org/10.1109/TKDE.2020.2995870
Keywords Time series classification; shapelet discovery; efficiency; accuracy
Public URL https://uwe-repository.worktribe.com/output/9431475
Publisher URL https://ieeexplore.ieee.org/document/9096567

Files

Efficient shapelet discovery for time series classification (1.5 Mb)
PDF

Licence
http://www.rioxx.net/licenses/all-rights-reserved

Publisher Licence URL
http://www.rioxx.net/licenses/all-rights-reserved

Copyright Statement
This is the author’s accepted manuscript. The final published version is available here: 10.1109/TKDE.2020.2995870
© 2020 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.






You might also like



Downloadable Citations