Repository | Book | Chapter

176357

Finding best patterns practically

Ayumi Shinohara Setsuo Arikawa

pp. 307-317

Abstract

Finding a pattern which separates two sets is a critical task in discovery. Given two sets of strings, consider the problem to find a subsequence that is common to one set but never appears in the other set. The problem is known to be NP-complete. Episode pattern is a generalized concept of subsequence pattern where the length of substring containing the subsequence is bounded. We generalize these problems to optimization problems, and give practical algorithms to solve them exactly. Our algorithms utilize some pruning heuristics based on the combinatorial properties of strings, and efficient data structures which recognize subsequence and episode patterns.

Publication details

Published in:

Arikawa Setsuo, Shinohara Ayumi (2002) Progress in discovery science: final report of the Japanese discovery science project. Dordrecht, Springer.

Pages: 307-317

DOI: 10.1007/3-540-45884-0_21

Full citation:

Shinohara Ayumi, Arikawa Setsuo (2002) „Finding best patterns practically“, In: S. Arikawa & A. Shinohara (eds.), Progress in discovery science, Dordrecht, Springer, 307–317.