-
Notifications
You must be signed in to change notification settings - Fork 2
/
README
56 lines (38 loc) · 1.8 KB
/
README
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
Copyright (c) David Powell <[email protected]>
This package is provided under he GNU Public License v2 and
comes with ABSOLUTELY NO WARRANTY, or details see file COPYRIGHT
This package contains 4 programs for aligning sequences
under linear gap costs. Thats is where gaps of length
k have a cost = a + b * k. The two dpa_ programs can
have the costs set from the commandline. While the two ukk_
programs must have the costs set at compile time. Note that
ukk_ programs must have integer costs, and a match must cost
0.
dpa_linear:
Calculates the edit cost, and displays an optimal
alignment between two sequences. Uses the standard DPA.
Has time and space complexity of O(n*n).
dpa_lcheckp:
Calculates the edit cost, and displays an optimal
alignment between two sequences. Uses the
check-pointing(1) to recover the alignment. Has time
complexity O(n*n), and space complexity O(n).
ukk_linear:
Calculates the edit cost between two strings, _but_
does not recover an alignment. This program uses a
modification of Ukkonen's algorithm(2). Has average time
complexity of O(d*d + n), and space complexity O(d)
(where d is the edit distance)
ukk_lcheckp:
Calculates the edit cost between two strings, and
displays an optimal alignment. This program uses a
modification of Ukkonen's algorithm(2) with
check-pointing(1) to recover the alignment. The average
time complexity is O(n*log(d) + d*d), and space complexity
is O(d) (where d is the edit distance)
1: D. R. Powell, L. Allison and T. I. Dix,
"A Versatile Divide and Conquer Technique for Optimal String Alignment",
Information Processing Letters, 1999, 70:3, pp 127-139
2: E. Ukkonen, "On Approximate String Matching",
Foundations of Computation Theory, 1983, 158, pp 487-495
-- David Powell <[email protected]>