首页 | 本学科首页   官方微博 | 高级检索  
     检索      


Transposition invariant string matching
Authors:Veli Mkinen  Gonzalo Navarro  Esko Ukkonen
Institution:aDepartment of Computer Science, PO Box 26 (Teollisuuskatu 23), FIN-00014, University of Helsinki, Finland;bCenter for Web Research, Department of Computer Science, University of Chile, Blanco Encalada 2120, Santiago, Chile
Abstract:Given strings A=a1a2am and B=b1b2bn over an alphabet , where is some numerical universe closed under addition and subtraction, and a distance function d(A,B) that gives the score of the best (partial) matching of A and B, the transposition invariant distance is , where A+t=(a1+t)(a2+t)…(am+t). We study the problem of computing the transposition invariant distance for various distance (and similarity) functions d, including Hamming distance, longest common subsequence (LCS), Levenshtein distance, and their versions where the exact matching condition is replaced by an approximate one. For all these problems we give algorithms whose time complexities are close to the known upper bounds without transposition invariance, and for some we achieve these upper bounds. In particular, we show how sparse dynamic programming can be used to solve transposition invariant problems, and its connection with multidimensional range-minimum search. As a byproduct, we give improved sparse dynamic programming algorithms to compute LCS and Levenshtein distance.
Keywords:Edit distance  Music sequence comparison  Transposition invariance  Sparse dynamic programming  Range-minimum searching
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号