Problem Description:
Given two strings S and T, determine if they are both one edit distance apart.
To solve this problem, you first need to know what is edit distance. You may refer to this wikipedia article for more information.
For this problem, it implicitly assumes to use the classic Levenshtein distance, which involvesinsertion, deletion and substitution operations and all of them are of the same cost. Thus, if S
is one edit distance apart from T
, T
is automatically one edit distance apart from S
.
Now let's think about the possible cases for two strings to be one edit distance apart. Well, that means, we can transform S
to T
by using exactly one edit operation. There are three possible cases:
- We insert a character into
S
to getT
. - We delete a character from
S
to getT
. - We substitute a character of
S
to getT
.
For cases 1 and 2, S
and T
will be one apart in their lengths. For cases 3, they are of the same length.
It is relatively easy to handle case 3. We simply traverse both of them and compare the characters at the corresponding position. If we find exactly one mismatch during the traverse, they are one edit distance apart.
Now let's move on to cases 1 and 2. In fact, they can be merged into one case, that is, to delete a character from the long string to get the short one, or equivalently, to insert a character into the short string to get the long one.
We will handle cases 1 and 2 using the short string as the reference. We traverse the two strings, once we find a mismatch. We know this position is where the deletion in the long string happens. For example, suppose S = "kitten"
and T = "kiten"
, we meet the first mismatch in the 4
-th position, which corresponds to the deleted character below, shown in between *
. We then continue to compare the remaining string of T
(en) with the remaining string of S
(en) and find them to be the same. So they are one edit distance apart.
S: k i t t e n
T: k i t *t* e n
In fact, cases 1, 2 and 3 can be further handled using the same piece of code. For strings of the same length, once we find a mismatch, we just substitute one to be another and check whether they are now the same. For strings of one apart in lengths, we insert the deleted character of the long string into the short one and compare whether they are the same.
The code is as follows. If you find the first half of the return statement (!mismatch && (n - m == 1)
) hard to understand, run the code on cases that the mismatch only occurs at the last character of the long string, like S = "ab"
and T = "abc"
.
1 class Solution { 2 public: 3 bool isOneEditDistance(string s, string t) { 4 int m = s.length(), n = t.length(); 5 if (m > n) return isOneEditDistance(t, s); 6 if (n - m > 1) return false; 7 bool mismatch = false; 8 for (int i = 0, j = 0; i < m; i++, j++) { 9 if (s[i] != t[j]) { 10 if (m == n) s[i] = t[j]; 11 else s.insert(i, 1, t[j]); 12 mismatch = true; 13 break; 14 } 15 } 16 return (!mismatch && (n - m == 1)) || (mismatch && s == t); 17 } 18 };