divide and conquer multiplication

(rev. 2019/03/26)

Divide and Conquer Multiplication

Suppose we want to multiply two 8-bit binary numbers 0 1 0 0 0 0 1 1 x 1 1 0 1 1 1 1 1 ------------------- We can rewrite 0 1 0 0 0 0 1 1 as 0 1 0 0 0 0 1 1 = 2⁴(0 1 0 0) + (0 0 1 1) = 2⁴x₁ + x₀ and similarly 1 1 0 1 1 1 1 1 = 2⁴(1 1 0 1) + (1 1 1 1) = 2⁴y₁ + y₀ 0 1 0 0 0 0 1 1 x 1 1 0 1 1 1 1 1 ------------------- is the same as [2⁴(0 1 0 0) + (0 0 1 1)][2⁴(1 1 0 1) + (1 1 1 1)] = [2⁴x₁ + x₀][2⁴y₁ + y₀] = [2⁸][x₁][y₁] + [2⁴][x₁y₀+y₁x₀] + [x₀y₀] = [2⁸][(0 1 0 0)][(1 1 0 1)] + [2⁴][(0 1 0 0)(1 1 1 1)+(1 1 0 1)(0 0 1 1)] + [(0 0 1 1)(1 1 1 1)] In other words, a multiplication of two 8-bit numbers can be reduced to four multiplications of numbers half the length (4-bit), plus some additional work that is proportional to the number of bits - splitting the numbers, doing shifts, and additions. This idea works generally for two n-bit numbers, if n is even. In other words, we could do the same kind of trick to reduce, say, the multiplication of two 256-bit numbers to four multiplications of 128-bit numbers. We can see the algorithm has a work function that satisfies this relation: W(n) < 4W(n/2) + Cn, where C is some constant. (The idea is that we recursively perform the smaller multiplications using the same divide-and-conquer strategy, until reaching some base-case size that is handled non-recursively, say by just using a computer hardware multiplication instruction. Thus the relation above would apply to all problem sizes that are greater than the base case.) However, as we will see proved later, this work function W is Θ(n²), and therefore working the multiplication problem with this form of divide-and-conquer strategy does not make things any more efficient than the "traditional" Θ(n²) multiplication algorithm (in the "Big-O sense"). The "trick" of Karatsuba-Ofman multiplication (1960) is to perform these three multiplications: [x₁][y₁], [x₀][y₀], and [(x₁+x₀)(y₁+y₀)], to get the cross term [x₁y₀+y₁x₀] by subtraction as [(x₁+x₀)(y₁+y₀)] - [x₁][y₁] - [x₀][y₀] = [x₁y₁ + x₁y₀ + x₀y₁ + x₀y₀ - x₁y₁ - x₀y₀ = [x₁y₀ + y₁x₀], and then to just do everything else as described before to get the result of the multiplication as = [2⁸][x₁][y₁] + [2⁴][x₁y₀+y₁x₀] + [x₀y₀] Using the Karatsuba-Ofman method, we get a work function that satisfies this relation: W(n) < 3W(n/2) + Kn, where K is some constant. (Actually, the number of bits in (x₁+x₀) and/or (y₁+y₀) could be as much as (n/2)+1, so the relation above is not quite precise, but a deeper analysis shows that nevertheless we can use the relation above to find the big-Θ() of the work function of Karatsuba-Ofman multiplication.