Discovering patterns with great significance is an important problem in data mining discipline. An episode is defined to be a partially ordered set of events for consecutive and fixed-time intervals in a sequence. Most of previous studies on episodes consider only frequent episodes in a sequence of events (called simple sequence). In real world, we may find a set of events at each time slot in terms of various intervals (hours, days, weeks, etc.). We refer to such sequences as complex sequences. Mining frequent episodes in complex sequences has more extensive applications than that in simple sequences. In this paper, we discuss the problem on mining frequent episodes in a complex sequence. We extend previous algorithm MINEPI to MINEPI+ for episode mining from complex sequences. Furthermore, a memory-anchored algorithm called EMMA is introduced for the mining task. Experimental evaluation on both real-world and synthetic data sets shows that EMMA is more efficient than MINEPI+.
In this paper, we introduce a diamond episode of the form s1 -> E -> s2, where s1 and s2 are events and E is a set of events. The diamond episode s1 -> E -> s2 means that every event of E follows an event s1 and is followed by an event s2. Then, by formulating the support of diamond episodes, in this paper, we design the algorithm FreqDmd to extract all of the frequent diamond episodes from a given event sequence. Finally, by applying the algorithm FreqDmd to bacterial culture data,we extract diamond episodes representing replacement of bacteria.
E. Cem, and O. Ozkasap. Proceedings of the 25th International Symposium on Computer and Information Sciences, page 199-202. Springer Netherlands, (2011)
V. Wahler, D. Seipel, J. Gudenberg, and G. Fischer. Proceedings of the Source Code Analysis and Manipulation, Fourth IEEE International Workshop, page 128--135. Washington, DC, USA, IEEE Computer Society, (2004)
D. Ignatov, and S. Kuznetsov. Proceedings of the 17th International Conference on Conceptual Structures (ICCS 2009), volume 5662 of Lecture Notes in Computer Science, page 185-200. Springer, (2009)
X. Yan, P. Yu, and J. Han. Proceedings of the 2004 ACM SIGMOD International Conference on Management of Data, page 335--346. New York, NY, USA, ACM, (2004)
P. Kalaivani, D. Hanirex, and D. Kaliyamurthie. International Journal on Recent and Innovation Trends in Computing and Communication, 3 (3):
1142--1144(March 2015)
Y. Ye, Y. Zheng, Y. Chen, J. Feng, and X. Xie. International Conference on Mobile Data Management: Systems, Services and Middleware (MDM), page 1--10. IEEE, (2009)