trekhleb
diff --git a/‎src/algorithms/string/hamming-distance/README.en-EN.md‎
Lines changed: 24 additions & 0 deletions b/‎src/algorithms/string/hamming-distance/README.en-EN.md‎
Lines changed: 24 additions & 0 deletions
diff --git a/‎src/algorithms/string/hamming-distance/README.md‎
Lines changed: 12 additions & 16 deletions b/‎src/algorithms/string/hamming-distance/README.md‎
Lines changed: 12 additions & 16 deletions
diff --git a/‎src/algorithms/string/knuth-morris-pratt/README.en-EN.md‎
Lines changed: 20 additions & 0 deletions b/‎src/algorithms/string/knuth-morris-pratt/README.en-EN.md‎
Lines changed: 20 additions & 0 deletions
diff --git a/‎src/algorithms/string/knuth-morris-pratt/README.md‎
Lines changed: 9 additions & 12 deletions b/‎src/algorithms/string/knuth-morris-pratt/README.md‎
Lines changed: 9 additions & 12 deletions
diff --git a/‎src/algorithms/string/levenshtein-distance/README.en-EN.md‎
Lines changed: 116 additions & 0 deletions b/‎src/algorithms/string/levenshtein-distance/README.en-EN.md‎
Lines changed: 116 additions & 0 deletions
@@ -1,2 +1,26 @@
+# Hamming Distance
+
 _Read this in other languages:_
 [_Tiếng Việt_](README.md)
+
+the Hamming distance between two strings of equal length is the
+number of positions at which the corresponding symbols are
+different. In other words, it measures the minimum number of
+substitutions required to change one string into the other, or
+the minimum number of errors that could have transformed one
+string into the other. In a more general context, the Hamming
+distance is one of several string metrics for measuring the
+edit distance between two sequences.
+
+## Examples
+
+The Hamming distance between:
+
+- "ka**rol**in" and "ka**thr**in" is **3**.
+- "k**a**r**ol**in" and "k**e**r**st**in" is **3**.
+- 10**1**1**1**01 and 10**0**1**0**01 is **2**.
+- 2**17**3**8**96 and 2**23**3**7**96 is **3**.
+
+## References
+
+[Wikipedia](https://en.wikipedia.org/wiki/Hamming_distance)
@@ -1,23 +1,19 @@
-# Hamming Distance
+# Khoảng cách Hamming
 
-the Hamming distance between two strings of equal length is the 
-number of positions at which the corresponding symbols are 
-different. In other words, it measures the minimum number of
-substitutions required to change one string into the other, or 
-the minimum number of errors that could have transformed one 
-string into the other. In a more general context, the Hamming 
-distance is one of several string metrics for measuring the 
-edit distance between two sequences.
+_Nhấn vào đây để đọc bằng ngôn ngữ khác:_
+[_English_](README.en-EN.md)
 
-## Examples
+Khoảng cách Hamming giữa hai chuỗi có cùng độ dài là số lượng vị trí mà các ký tự tương ứng khác nhau. Nó đo lường số lượng thay thế tối thiểu cần thiết để biến đổi một chuỗi thành chuỗi khác, hoặc số lượng lỗi tối thiểu có thể đã biến đổi một chuỗi thành chuỗi khác. Trong ngữ cảnh tổng quát hơn, khoảng cách Hamming là một trong các phương pháp đo lường khoảng cách chỉnh sửa giữa hai chuỗi.
 
-The Hamming distance between:
+## Ví dụ
 
-- "ka**rol**in" and "ka**thr**in" is **3**.
-- "k**a**r**ol**in" and "k**e**r**st**in" is **3**.
-- 10**1**1**1**01 and 10**0**1**0**01 is **2**.
-- 2**17**3**8**96 and 2**23**3**7**96 is **3**.
+Khoảng cách Hamming giữa:
 
-## References
+- "ka**rol**in" và "ka**thr**in" là **3**.
+- "k**a**r**ol**in" và "k**e**r**st**in" là **3**.
+- 10**1**1**1**01 và 10**0**1**0**01 là **2**.
+- 2**17**3**8**96 và 2**23**3**7**96 là **3**.
+
+## Tham khảo
 
 [Wikipedia](https://en.wikipedia.org/wiki/Hamming_distance)
@@ -1,2 +1,22 @@
+# Knuth–Morris–Pratt Algorithm
+
 _Read this in other languages:_
 [_Tiếng Việt_](README.md)
+
+The Knuth–Morris–Pratt string searching algorithm (or
+KMP algorithm) searches for occurrences of a "word" `W`
+within a main "text string" `T` by employing the
+observation that when a mismatch occurs, the word itself
+embodies sufficient information to determine where the
+next match could begin, thus bypassing re-examination
+of previously matched characters.
+
+## Complexity
+
+-**Time:**`O(|W| + |T|)` (much faster comparing to trivial `O(|W| * |T|)`)
+-**Space:**`O(|W|)`
+
+## References
+
+-[Wikipedia](https://en.wikipedia.org/wiki/Knuth%E2%80%93Morris%E2%80%93Pratt_algorithm)
+-[YouTube](https://www.youtube.com/watch?v=GTJr8OvyEVQ&list=PLLXdhg_r2hKA7DPDsunoDZ-Z769jWn4R8)
@@ -1,19 +1,16 @@
-# Knuth–Morris–Pratt Algorithm
+# Thuật toán Knuth–Morris–Pratt
 
-The Knuth–Morris–Pratt string searching algorithm (or 
-KMP algorithm) searches for occurrences of a "word" `W`
-within a main "text string" `T` by employing the 
-observation that when a mismatch occurs, the word itself 
-embodies sufficient information to determine where the 
-next match could begin, thus bypassing re-examination 
-of previously matched characters.
+_Nhấn vào đây để đọc bằng ngôn ngữ khác:_
+[_English_](README.en-EN.md)
 
-## Complexity
+Thuật toán tìm kiếm chuỗi Knuth–Morris–Pratt (hoặc thuật toán KMP) tìm kiếm các lần xuất hiện của một "từ" `W` trong một "chuỗi văn bản chính" `T` bằng cách sử dụng quan sát rằng khi một không khớp xảy ra, từ chính nó cung cấp đủ thông tin để xác định nơi mà sự khớp tiếp theo có thể bắt đầu, qua đó bỏ qua việc kiểm tra lại các ký tự đã khớp trước đó.
 
--**Time:**`O(|W| + |T|)` (much faster comparing to trivial `O(|W| * |T|)`)
--**Space:**`O(|W|)`
+## Độ phức tạp
 
-## References
+-**Thời gian:**`O(|W| + |T|)` (nhanh hơn nhiều so với phương pháp trực tiếp `O(|W| * |T|)`)
+-**Không gian:**`O(|W|)`
+
+## Tham khảo
 
 -[Wikipedia](https://en.wikipedia.org/wiki/Knuth%E2%80%93Morris%E2%80%93Pratt_algorithm)
 -[YouTube](https://www.youtube.com/watch?v=GTJr8OvyEVQ&list=PLLXdhg_r2hKA7DPDsunoDZ-Z769jWn4R8)
@@ -1,2 +1,118 @@
+# Levenshtein Distance
+
 _Read this in other languages:_
 [_Tiếng Việt_](README.md)
+
+The Levenshtein distance is a string metric for measuring the
+difference between two sequences. Informally, the Levenshtein
+distance between two words is the minimum number of
+single-character edits (insertions, deletions or substitutions)
+required to change one word into the other.
+
+## Definition
+
+Mathematically, the Levenshtein distance between two strings
+`a` and `b` (of length `|a|` and `|b|` respectively) is given by
+![Levenshtein](https://wikimedia.org/api/rest_v1/media/math/render/svg/4cf357d8f2135035207088d2c7b890fb4b64e410)
+where
+
+![Levenshtein](https://wikimedia.org/api/rest_v1/media/math/render/svg/f0a48ecfc9852c042382fdc33c19e11a16948e85)
+
+where
+![Levenshtein](https://wikimedia.org/api/rest_v1/media/math/render/svg/52512ede08444b13838c570ba4a3fc71d54dbce9)
+is the indicator function equal to `0` when
+![Levenshtein](https://wikimedia.org/api/rest_v1/media/math/render/svg/231fda9ee578f0328c5ca28088d01928bb0aaaec)
+and equal to 1 otherwise, and
+![Levenshtein](https://wikimedia.org/api/rest_v1/media/math/render/svg/bdc0315678caad28648aafedb6ebafb16bd1655c)
+is the distance between the first `i` characters of `a` and the first
+`j` characters of `b`.
+
+Note that the first element in the minimum corresponds to
+deletion (from `a` to `b`), the second to insertion and
+the third to match or mismatch, depending on whether the
+respective symbols are the same.
+
+## Example
+
+For example, the Levenshtein distance between `kitten` and
+`sitting` is `3`, since the following three edits change one
+into the other, and there is no way to do it with fewer than
+three edits:
+
+1.**k**itten → **s**itten (substitution of "s" for "k")
+2. sitt**e**n → sitt**i**n (substitution of "i" for "e")
+3. sittin → sittin**g** (insertion of "g" at the end).
+
+## Applications
+
+This has a wide range of applications, for instance, spell checkers, correction
+systems for optical character recognition, fuzzy string searching, and software
+to assist natural language translation based on translation memory.
+
+## Dynamic Programming Approach Explanation
+
+Let’s take a simple example of finding minimum edit distance between
+strings `ME` and `MY`. Intuitively you already know that minimum edit distance
+here is `1` operation, which is replacing `E` with `Y`. But
+let’s try to formalize it in a form of the algorithm in order to be able to
+do more complex examples like transforming `Saturday` into `Sunday`.
+
+To apply the mathematical formula mentioned above to `ME → MY` transformation
+we need to know minimum edit distances of `ME → M`, `M → MY` and `M → M` transformations
+in prior. Then we will need to pick the minimum one and add _one_ operation to
+transform last letters `E → Y`. So minimum edit distance of `ME → MY` transformation
+is being calculated based on three previously possible transformations.
+
+To explain this further let’s draw the following matrix:
+
+![Levenshtein Matrix](https://cdn-images-1.medium.com/max/1600/1*aTunSUoy0BJyYBVn4tWGrA.png)
+
+- Cell `(0:1)` contains red number 1. It means that we need 1 operation to
+ transform `M` to an empty string. And it is by deleting `M`. This is why this number is red.
+- Cell `(0:2)` contains red number 2. It means that we need 2 operations
+ to transform `ME` to an empty string. And it is by deleting `E` and `M`.
+- Cell `(1:0)` contains green number 1. It means that we need 1 operation
+ to transform an empty string to `M`. And it is by inserting `M`. This is why this number is green.
+- Cell `(2:0)` contains green number 2. It means that we need 2 operations
+ to transform an empty string to `MY`. And it is by inserting `Y` and `M`.
+- Cell `(1:1)` contains number 0. It means that it costs nothing
+ to transform `M` into `M`.
+- Cell `(1:2)` contains red number 1. It means that we need 1 operation
+ to transform `ME` to `M`. And it is by deleting `E`.
+- And so on...
+
+This looks easy for such small matrix as ours (it is only `3x3`). But here you
+may find basic concepts that may be applied to calculate all those numbers for
+bigger matrices (let’s say a `9x7` matrix for `Saturday → Sunday` transformation).
+
+According to the formula you only need three adjacent cells `(i-1:j)`, `(i-1:j-1)`, and `(i:j-1)` to
+calculate the number for current cell `(i:j)`. All we need to do is to find the
+minimum of those three cells and then add `1` in case if we have different
+letters in `i`'s row and `j`'s column.
+
+You may clearly see the recursive nature of the problem.
+
+![Levenshtein Matrix](https://cdn-images-1.medium.com/max/1600/1*w8UB4DSvBnAK6mBXRGQDjw.png)
+
+Let's draw a decision graph for this problem.
+
+![Minimum Edit Distance Decision Graph](https://cdn-images-1.medium.com/max/1600/1*8jD0qvr5B9PwRFM_9z7q9A.png)
+
+You may see a number of overlapping sub-problems on the picture that are marked
+with red. Also there is no way to reduce the number of operations and make it
+less than a minimum of those three adjacent cells from the formula.
+
+Also you may notice that each cell number in the matrix is being calculated
+based on previous ones. Thus the tabulation technique (filling the cache in
+bottom-up direction) is being applied here.
+
+Applying this principle further we may solve more complicated cases like
+with `Saturday → Sunday` transformation.
+
+![Levenshtein distance](https://cdn-images-1.medium.com/max/2600/1*497gMaFErzJpCXG7kS_7dw.png)
+
+## References
+
+-[Wikipedia](https://en.wikipedia.org/wiki/Levenshtein_distance)
+-[YouTube](https://www.youtube.com/watch?v=We3YDTzNXEk&list=PLLXdhg_r2hKA7DPDsunoDZ-Z769jWn4R8)
+-[ITNext](https://itnext.io/dynamic-programming-vs-divide-and-conquer-2fea680becbe)