International Journal of Computational Intelligence Systems

Volume 12, Issue 1, November 2018, Pages 342 - 350

A New Algorithm of Mining High Utility Sequential Pattern in Streaming Data

Authors
Huijun Tang, Yangguang Liu*, Le Wang
School of Information Engineering, Ningbo Dahongying University, N0. 899, XueYuan Road, YinZhou District Ningbo, Zhejiang, P.R. China, 315175
*Corresponding author. Email: y.g.liu@foxmail.com
Corresponding Author
Yangguang Liu
Received 3 January 2019, Accepted 11 January 2019, Available Online 28 January 2019.
DOI
10.2991/ijcis.2019.125905650How to use a DOI?
Keywords
High utility sequential pattern; Data streaming; Sliding windows; Tree structure; Header table
Abstract

High utility sequential pattern (HUSP) mining has emerged as a novel topic in data mining, its computational complexity increases compared to frequent sequences mining and high utility itemsets mining. A number of algorithms have been proposed to solve such problem, but they mainly focus on mining HUSP in static databases and do not take streaming data into account, where unbounded data come continuously and often at a high speed. The efficiency of mining algorithms is still the main research topic in this field. In view of this, this paper proposes an efficient HUSP mining algorithm named HUSP-UT (utility on Tail Tree) based on tree structure over data stream. Substantial experiments on real datasets show that HUSP-UT identifies high utility sequences efficiently. Comparing with the state-of-the-art algorithm HUSP-Stream (HUSP mining over data streams) in our experiments, the proposed HUSP-UT outperformed its counterpart significantly, especially for time efficiency, which was up to 1 order of magnitude faster on some datasets.

Copyright
© 2019 The Authors. Published by Atlantis Press SARL.
Open Access
This is an open access article distributed under the CC BY-NC 4.0 license (http://creativecommons.org/licenses/by-nc/4.0/).

Download article (PDF)
View full text (HTML)

Journal
International Journal of Computational Intelligence Systems
Volume-Issue
12 - 1
Pages
342 - 350
Publication Date
2019/01/28
ISSN (Online)
1875-6883
ISSN (Print)
1875-6891
DOI
10.2991/ijcis.2019.125905650How to use a DOI?
Copyright
© 2019 The Authors. Published by Atlantis Press SARL.
Open Access
This is an open access article distributed under the CC BY-NC 4.0 license (http://creativecommons.org/licenses/by-nc/4.0/).

Cite this article

TY  - JOUR
AU  - Huijun Tang
AU  - Yangguang Liu
AU  - Le Wang
PY  - 2019
DA  - 2019/01/28
TI  - A New Algorithm of Mining High Utility Sequential Pattern in Streaming Data
JO  - International Journal of Computational Intelligence Systems
SP  - 342
EP  - 350
VL  - 12
IS  - 1
SN  - 1875-6883
UR  - https://doi.org/10.2991/ijcis.2019.125905650
DO  - 10.2991/ijcis.2019.125905650
ID  - Tang2019
ER  -