Levenshtein Edit Distance Calculator (with Operations Trace)
Levenshtein distance is the minimum number of single-character edits (insertions, deletions, substitutions) needed to change one string into another. "kitten" → "sitting" is distance 3 (sub k→s, sub e→i, insert g). This calculator shows the distance plus the actual sequence of operations, color-coded.
How to use the Levenshtein Edit Distance Calculator (with Operations Trace)
Type the two strings to compare. The output shows the edit distance and the operations one-by-one. Custom insert / delete / substitute costs let you replicate variants (Damerau-Levenshtein adds transposition; some applications weight substitution differently).
About Levenshtein Edit Distance Calculator (with Operations Trace)
Levenshtein distance (named after Vladimir Levenshtein, 1965) is the foundational edit-distance metric. It's the answer to "what's the minimum number of character changes I need to make one string into another?" The three allowed operations are insert (add a character), delete (remove a character), substitute (replace one character with another). Each operation has a cost (usually 1); the distance is the minimum total cost of an edit sequence.
The standard algorithm is dynamic programming: build an (|A|+1) × (|B|+1) matrix where D[i][j] is the distance between the first i characters of A and the first j characters of B. Each cell is the minimum of (delete + D[i-1][j]), (insert + D[i][j-1]), (substitute (if needed) + D[i-1][j-1]). Time and space complexity: O(|A|·|B|). The actual operations can be recovered by walking back through the matrix from D[|A|][|B|].
Variants:
- Damerau-Levenshtein — adds transposition (swap of two adjacent characters) as a single operation. Better for typo detection.
- Hamming — distance counting only substitutions (requires equal-length strings).
- Jaro / Jaro-Winkler — similarity metrics weighted toward common prefixes; used by record linkage.
Common use cases
- Spell correction — finding dictionary words close to a misspelling.
- DNA / protein sequence alignment — bioinformatics edit-distance variants.
- Diff algorithms — character-level diffs use Levenshtein under the hood.
- Search autocomplete — ranking suggestions by edit distance to the partial query.
- Record linkage — matching "John Smith" to "Jonathan Smith" across noisy databases.