Bit-parallel approximate string matching algorithms with transposition |
| |
Authors: | Heikki Hyyr |
| |
Affiliation: | Department of Computer Sciences, University of Tampere, Finland |
| |
Abstract: | Using bit-parallelism has resulted in fast and practical algorithms for approximate string matching under Levenshtein edit distance, which permits a single edit operation to insert, delete or substitute a character. Depending on the parameters of the search, currently the fastest non-filtering algorithms in practice are the O(km/wn) algorithm of Wu and Manber, the O((k+2)(m−k)/wn) algorithm of Baeza-Yates and Navarro, and the O(m/wn) algorithm of Myers, where m is the pattern length, n is the text length, k is the error threshold and w is the computer word size. In this paper we discuss a uniform way of modifying each of these algorithms to permit also a fourth type of edit operation: transposing two adjacent characters in the pattern. This type of edit distance is also known as Damerau edit distance. In the end we also present an experimental comparison of the resulting algorithms. |
| |
Keywords: | Levenshtein edit distance Damerau edit distance Approximate string matching Bit-parallelism |
本文献已被 ScienceDirect 等数据库收录! |
|