Matrix chain multiplication

Matrix chain multiplication is a classic optimization problem that involves determining the most efficient way to compute the product of a sequence of two or more matrices by finding the order of multiplication that minimizes the total number of scalar multiplications required.
Matrix multiplication is associative, allowing different parenthesizations of the product, such as ((A₁A₂)A₃) versus (A₁(A₂A₃)), but the computational cost varies significantly with the dimensions of the matrices involved, since multiplying two matrices of sizes p×q and q×r requires pqr scalar multiplications.
For example, multiplying four matrices with dimensions 30×1, 1×40, 40×10, and 10×25 can cost as few as 1,400 or as many as 41,200 multiplications depending on the order chosen.
The problem is efficiently solved using dynamic programming, which leverages the optimal substructure property: the optimal cost for multiplying a subchain of matrices can be reused to compute the cost for larger chains.
Let M[i,j] denote the minimum number of scalar multiplications needed to compute the product of matrices Aᵢ through Aⱼ; the recurrence is M[i,j] = min_{i ≤ k < j} (M[i,k] + M[k+1,j] + d_{i-1} · d_k · d_j), where d is the array of matrix dimensions.
A bottom-up dynamic programming algorithm fills an n×n table in O(n³) time by iterating over chain lengths and possible split points, while also tracking the optimal parenthesization for reconstruction.
As a foundational example of dynamic programming, matrix chain multiplication illustrates key concepts like overlapping subproblems and memoization, and it has practical applications in areas such as compiler optimization for expression evaluation, database query processing, and scientific computing tasks involving chained linear algebra operations like random walks and matrix factorization.

Fundamentals

Matrix Multiplication Basics

Matrix multiplication is an operation that combines two matrices to produce a third, provided the number of columns in the first matrix matches the number of rows in the second. For matrices A of dimensions p \times q and B of dimensions q \times r, the product C = AB is a p \times r matrix. Each element c_{ij} of C is the dot product of the i-th row of A and the j-th column of B:

c_{ij} = \sum_{k=1}^{q} a_{ik} b_{kj}.

The computational cost of this naive algorithm is O(p q r) scalar multiplications and a comparable number of additions, since each of the p r elements requires q multiplications and q-1 additions. For square matrices with p = q = r = n, this simplifies to O(n^3).

A key property of matrix multiplication is its associativity: for compatible matrices A, B, and C, (AB)C = A(BC). Because the intermediate dimensions always align, the result is independent of how the product is parenthesized, even though the cost of computing it is not.

To illustrate, consider the 2×2 matrices

A = \begin{pmatrix} 1 & 2 \\ 3 & 4 \end{pmatrix}, \quad B = \begin{pmatrix} 5 & 6 \\ 7 & 8 \end{pmatrix}.

The product C = AB has elements

c_{11} = 1 \cdot 5 + 2 \cdot 7 = 19, \quad c_{12} = 1 \cdot 6 + 2 \cdot 8 = 22, \quad c_{21} = 3 \cdot 5 + 4 \cdot 7 = 43, \quad c_{22} = 3 \cdot 6 + 4 \cdot 8 = 50.

Thus, C = \begin{pmatrix} 19 & 22 \\ 43 & 50 \end{pmatrix}. Computing this product requires 2^3 = 8 multiplications and 4 additions, consistent with the O(n^3) cost for n = 2.
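A minimal sketch of this definition in Python (the function name and the plain nested-loop structure are illustrative choices, not a reference implementation) makes the pqr cost explicit: the innermost loop runs q times for each of the p·r output entries.

```python
def matmul(A, B):
    """Multiply a p x q matrix A by a q x r matrix B using the definition
    c[i][j] = sum_k a[i][k] * b[k][j]; cost is p*q*r scalar multiplications."""
    p, q, r = len(A), len(B), len(B[0])
    assert all(len(row) == q for row in A), "inner dimensions must match"
    C = [[0] * r for _ in range(p)]
    for i in range(p):
        for j in range(r):
            for k in range(q):
                C[i][j] += A[i][k] * B[k][j]
    return C

# The 2x2 example from the text: the product is [[19, 22], [43, 50]].
print(matmul([[1, 2], [3, 4]], [[5, 6], [7, 8]]))
```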

Chain Multiplication Problem

The chain multiplication problem arises when computing the product of a sequence of two or more matrices, where the order in which the pairwise multiplications are performed can dramatically change the total computational cost because the matrices in the chain generally have different dimensions. Given a chain of n matrices A_1 A_2 \cdots A_n, where A_i has dimensions d_{i-1} \times d_i for i = 1 to n, the objective is to find a parenthesization of the product that minimizes the number of scalar multiplications required. This optimization is possible because matrix multiplication is associative, so the operations can be grouped in many ways, each yielding potentially different intermediate matrix sizes and thus different costs.

A naive strategy is to perform the multiplications strictly from left to right, as in ((A_1 A_2) A_3) \cdots A_n. Under this approach the first multiplication yields a d_0 \times d_1 result, and each subsequent step multiplies the growing d_0 \times d_k intermediate by A_{k+1} (of size d_k \times d_{k+1}), incurring d_0 d_k d_{k+1} scalar multiplications for k = 1 to n-1, for a total cost of \sum_{k=1}^{n-1} d_0 d_k d_{k+1}. This may be far from optimal, since alternative groupings can keep the intermediate dimensions smaller. The total number of possible parenthesizations is exponential: it equals the (n-1)th Catalan number C_{n-1} = \frac{1}{n} \binom{2(n-1)}{n-1}, reflecting the combinatorial explosion created by associativity.

Consider three matrices A (10 \times 30), B (30 \times 5), and C (5 \times 60). The left-associative grouping (AB)C first computes AB (a 10 \times 5 matrix) at a cost of 10 \cdot 30 \cdot 5 = 1500 multiplications, then multiplies the result by C at a cost of 10 \cdot 5 \cdot 60 = 3000, for a total of 4500. In contrast, A(BC) first computes BC (a 30 \times 60 matrix) at 30 \cdot 5 \cdot 60 = 9000, then multiplies by A at 10 \cdot 30 \cdot 60 = 18000, totaling 27000, a factor of six more than the better ordering.

This problem is motivated by practical scenarios in linear algebra and numerical computation, such as successive matrix transformations in computer graphics, robotics, and scientific simulations, where matrix dimensions often vary irregularly across the chain, making an efficient ordering essential for performance. The standard approach to solving it efficiently is dynamic programming.
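The cost comparison above can be checked with a short brute-force sketch (illustrative code, not from the article's sources): it enumerates every parenthesization of a chain with a given dimension array and reports the cheapest and most expensive total costs.

```python
from functools import lru_cache

def chain_costs(d):
    """Enumerate all parenthesizations of matrices A_1..A_n, where A_i is
    d[i-1] x d[i], and return the minimum and maximum scalar-multiplication cost."""
    n = len(d) - 1

    @lru_cache(maxsize=None)
    def costs(i, j):
        # All possible costs of computing A_i ... A_j (1-indexed, inclusive).
        if i == j:
            return (0,)
        out = []
        for k in range(i, j):
            for left in costs(i, k):
                for right in costs(k + 1, j):
                    out.append(left + right + d[i - 1] * d[k] * d[j])
        return tuple(out)

    all_costs = costs(1, n)
    return min(all_costs), max(all_costs)

# A (10x30), B (30x5), C (5x60): best grouping costs 4500, worst costs 27000.
print(chain_costs([10, 30, 5, 60]))
# The four matrices 30x1, 1x40, 40x10, 10x25 from the introduction: (1400, 41200).
print(chain_costs([30, 1, 40, 10, 25]))
```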

Dynamic Programming Solution

Problem Formulation

The matrix chain multiplication problem is formally defined for a sequence of n matrices A_1, A_2, \dots, A_n, where each matrix A_i has dimensions d_{i-1} \times d_i for i = 1, 2, \dots, n, with the dimension sequence given by d_0, d_1, \dots, d_n. The goal is to determine the optimal parenthesization of the product A_1 A_2 \dots A_n that minimizes the total number of scalar multiplications required. This optimization arises because matrix multiplication is associative, permitting multiple ways to group the operations, while the computational cost varies with the grouping because of the dimensional constraints. This dynamic programming approach was first described by Godbole in 1973.

Let m(i,j) denote the minimum number of scalar multiplications needed to compute the matrix product of the subchain from A_i to A_j, for 1 \leq i \leq j \leq n. The base case is m(i,i) = 0 for all i, since no multiplication is required for a single matrix. The recurrence relation for the optimal cost is

m(i,j) = \min_{k=i}^{j-1} \left[ m(i,k) + m(k+1,j) + d_{i-1} \cdot d_k \cdot d_j \right]

for j > i, where the term d_{i-1} \cdot d_k \cdot d_j is the cost of multiplying the matrices produced by the subchains A_i \dots A_k and A_{k+1} \dots A_j. This formulation captures the optimal substructure of the problem: the minimum cost for a subchain depends only on the minimum costs of its partitions.

To reconstruct the optimal parenthesization corresponding to this minimum cost, an auxiliary array s(i,j) records the optimal split point, defined as s(i,j) = k^*, where k^* is the value of k that achieves the minimum in the recurrence for m(i,j). The full parenthesization can then be derived recursively from the s values, starting from s(1,n). This tracking ensures that the solution not only computes the minimum cost but also specifies the exact grouping of matrix multiplications.
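To make the recurrence concrete, the following worked instance (a small illustration added here, using the three-matrix chain from the previous section with dimension sequence 10, 30, 5, 60) evaluates m(1,3):

```latex
% Worked instance of the recurrence for A_1 (10x30), A_2 (30x5), A_3 (5x60),
% with dimension sequence d_0..d_3 = 10, 30, 5, 60 (requires amsmath).
\begin{aligned}
m(1,2) &= d_0 d_1 d_2 = 10 \cdot 30 \cdot 5 = 1500,\\
m(2,3) &= d_1 d_2 d_3 = 30 \cdot 5 \cdot 60 = 9000,\\
m(1,3) &= \min\begin{cases}
m(1,1) + m(2,3) + d_0 d_1 d_3 = 0 + 9000 + 18000 = 27000 & (k=1)\\
m(1,2) + m(3,3) + d_0 d_2 d_3 = 1500 + 0 + 3000 = 4500 & (k=2)
\end{cases} = 4500.
\end{aligned}
```

So s(1,3) = 2 and the optimal grouping is (A_1 A_2) A_3, matching the 4500-multiplication ordering found in the earlier example.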

Recursive Solution

The recursive solution employs a top-down approach to compute the minimum number of scalar multiplications required to multiply a chain of matrices from index i to j, denoted m[i, j]. A recursive function tries every possible split point k between i and j-1, recursively solves the subchains A_i \cdots A_k and A_{k+1} \cdots A_j, and adds the cost of multiplying the two resulting matrices, which is p_{i-1} \times p_k \times p_j, where p is the array of matrix dimensions. The function returns the minimum over all such k:

$$m[i, j] = \begin{cases} 0 & \text{if } i = j \\ \min_{i \leq k < j} \left( m[i, k] + m[k+1, j] + p_{i-1} p_k p_j \right) & \text{if } i < j \end{cases}$$

This naive recursive implementation has exponential running time, $\Omega(2^n)$, because it recomputes the same subproblems many times due to the overlapping substructure of the recursion tree. To optimize, memoization is introduced using a 2D table $m[1..n, 1..n]$ initialized to undefined values; each computed $m[i, j]$ is stored and retrieved on subsequent calls, avoiding redundant recursion. This reduces the time complexity to $O(n^3)$ while using $O(n^2)$ space, since there are $O(n^2)$ unique subproblems, each requiring $O(n)$ work. The following pseudocode illustrates the memoized recursive function:

```
MATRIXCHAIN(i, j)
    if m[i, j] ≠ undefined
        return m[i, j]
    if i == j
        m[i, j] ← 0
    else
        m[i, j] ← ∞
        for k ← i to j-1
            q ← MATRIXCHAIN(i, k) + MATRIXCHAIN(k+1, j) + p_{i-1} * p_k * p_j
            if q < m[i, j]
                m[i, j] ← q
    return m[i, j]
```

To invoke, call `MATRIXCHAIN(1, n)` for $n$ matrices.

Consider an example with four matrices $A(10 \times 20)$, $B(20 \times 30)$, $C(30 \times 40)$, $D(40 \times 30)$, so the dimension array is $p = [10, 20, 30, 40, 30]$. The initial call `MATRIXCHAIN(1,4)` tries splits at $k=1,2,3$:

- For $k=1$: `MATRIXCHAIN(1,1)` $= 0$ plus `MATRIXCHAIN(2,4)` plus $10 \times 20 \times 30 = 6000$. Computing `MATRIXCHAIN(2,4)`: for $k=2$, the cost is $0 +$ `MATRIXCHAIN(3,4)` $+ 20 \times 30 \times 30 = 18000$, where `MATRIXCHAIN(3,4)` $= 30 \times 40 \times 30 = 36000$, giving $36000 + 18000 = 54000$; for $k=3$, the cost is `MATRIXCHAIN(2,3)` $+ 0 + 20 \times 40 \times 30 = 24000$, where `MATRIXCHAIN(2,3)` $= 20 \times 30 \times 40 = 24000$, giving $24000 + 24000 = 48000$. So $m[2,4] = 48000$ and the total for $k=1$ is $0 + 48000 + 6000 = 54000$.
- For $k=2$: `MATRIXCHAIN(1,2)` $+$ `MATRIXCHAIN(3,4)` $+ 10 \times 30 \times 30 = 9000$, with $m[1,2] = 10 \times 20 \times 30 = 6000$ and $m[3,4] = 36000$ as above, for a total of $6000 + 36000 + 9000 = 51000$.
- For $k=3$: `MATRIXCHAIN(1,3)` $+ 0 + 10 \times 40 \times 30 = 12000$. Computing `MATRIXCHAIN(1,3)`: for $k=1$, $0 + m[2,3] + 10 \times 20 \times 40 = 0 + 24000 + 8000 = 32000$; for $k=2$, $m[1,2] + 0 + 10 \times 30 \times 40 = 6000 + 0 + 12000 = 18000$; so $m[1,3] = 18000$ and the total for $k=3$ is $18000 + 0 + 12000 = 30000$.

With [memoization](/page/Memoization), subproblems such as $(2,3)$ and $(3,4)$ are computed once and reused (for example, $(3,4)$ appears on several paths), yielding the overall minimum of 30000 for the full chain. The [recursion](/page/Recursion) tree prunes redundant branches via table lookups, filling the table progressively.
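The memoized recursion above translates directly into Python; the sketch below is an illustrative implementation (with functools.lru_cache standing in for the explicit table) and reproduces the 30000 result for the four-matrix example.

```python
from functools import lru_cache

def matrix_chain_cost(p):
    """Minimum scalar multiplications for A_1..A_n, where A_i is p[i-1] x p[i]."""
    n = len(p) - 1

    @lru_cache(maxsize=None)
    def m(i, j):
        if i == j:
            return 0
        # Try every split point k, mirroring MATRIXCHAIN(i, j) in the pseudocode.
        return min(m(i, k) + m(k + 1, j) + p[i - 1] * p[k] * p[j]
                   for k in range(i, j))

    return m(1, n)

# A(10x20), B(20x30), C(30x40), D(40x30): optimal cost is 30000.
print(matrix_chain_cost([10, 20, 30, 40, 30]))
```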
Bottom-Up Algorithm

The bottom-up dynamic programming algorithm for the matrix chain multiplication problem computes the minimum multiplication cost and the corresponding optimal parenthesization by iteratively filling two two-dimensional tables: one for the costs and one for the split points. Let the input be a sequence of $n$ matrices $A_1, A_2, \dots, A_n$ with dimensions given by the [array](/page/ARRAY) $p = [p_0, p_1, \dots, p_n]$, where $A_i$ is $p_{i-1} \times p_i$. The algorithm initializes an $n \times n$ cost table $m$ and a split table $s$ such that $m[i,i] = 0$ for $i = 1$ to $n$ (no cost for a single matrix) and all entries of $s$ are [undefined](/page/Undefined).[](https://ieeexplore.ieee.org/document/5009182)

The tables are filled in order of increasing chain length $l$, starting from $l = 2$ up to $l = n$. For each $l$, the algorithm iterates over all possible starting indices $i = 1$ to $n - l + 1$, setting $j = i + l - 1$. For each subchain from $A_i$ to $A_j$, it computes the minimum cost by trying all possible split points $k$ from $i$ to $j-1$:

$$m[i,j] = \min_{i \leq k < j} \left( m[i,k] + m[k+1,j] + p_{i-1} \cdot p_k \cdot p_j \right)$$

and records the optimal $k$ in $s[i,j]$. This ensures that shorter subchains are solved before longer ones, building up the solution without [recursion](/page/Recursion).[](https://ieeexplore.ieee.org/document/5009182)

Once the tables are filled, the optimal parenthesization for the full chain (from $i=1$ to $j=n$) is reconstructed from the $s$ table by a recursive procedure PRINT-OPTIMAL-PARENTHESIZATION($i, j$): if $i = j$, print "$A_i$"; otherwise, let $k = s[i,j]$, recursively print the parenthesization for $(i, k)$, print a [multiplication sign](/page/Multiplication_sign), recursively print the parenthesization for $(k+1, j)$, and enclose the result in parentheses. This traces back the optimal splits to produce the fully parenthesized expression.[](https://ieeexplore.ieee.org/document/5009182)

To illustrate, consider the example with dimensions $p = [30, 35, 15, 5, 10, 20, 25]$ for six matrices $A_1$ to $A_6$. The completed cost table $m$ (rows give $i$, columns give $j$) is:

|       | 1 | 2     | 3    | 4    | 5     | 6     |
|-------|---|-------|------|------|-------|-------|
| **1** | 0 | 15750 | 7875 | 9375 | 11875 | 15125 |
| **2** |   | 0     | 2625 | 4375 | 7125  | 10500 |
| **3** |   |       | 0    | 750  | 2500  | 5375  |
| **4** |   |       |      | 0    | 1000  | 3500  |
| **5** |   |       |      |      | 0     | 5000  |
| **6** |   |       |      |      |       | 0     |

The split table $s$ records values such as $s[1,2] = 1$, $s[1,3] = 1$, $s[1,4] = 3$, $s[1,5] = 3$, $s[1,6] = 3$, and $s[4,6] = 5$. The minimum cost for the full chain is $m[1,6] = 15125$, and the optimal parenthesization is $((A_1 (A_2 A_3)) ((A_4 A_5) A_6))$.[](http://cs.bme.hu/thalg/dynamic2.pdf)

This bottom-up method corresponds to the memoized recursive approach but eliminates function-call overhead by computing all subproblems directly in a predetermined order.[](https://ieeexplore.ieee.org/document/5009182)
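A compact Python rendering of this bottom-up procedure, including the reconstruction step, is sketched below (illustrative code, not taken from the cited sources); run on the dimension array above it reports the cost 15125 and the parenthesization ((A1 (A2 A3)) ((A4 A5) A6)).

```python
def matrix_chain_order(p):
    """Bottom-up DP: A_i has dimensions p[i-1] x p[i], for i = 1..n.
    Returns (m, s): minimum costs m[i][j] and optimal split points s[i][j]."""
    n = len(p) - 1
    m = [[0] * (n + 1) for _ in range(n + 1)]
    s = [[0] * (n + 1) for _ in range(n + 1)]
    for length in range(2, n + 1):          # chain length l
        for i in range(1, n - length + 2):  # start index
            j = i + length - 1              # end index
            m[i][j] = float("inf")
            for k in range(i, j):           # split point
                q = m[i][k] + m[k + 1][j] + p[i - 1] * p[k] * p[j]
                if q < m[i][j]:
                    m[i][j], s[i][j] = q, k
    return m, s

def parenthesization(s, i, j):
    """Rebuild the optimal grouping from the split table, as in PRINT-OPTIMAL-PARENTHESIZATION."""
    if i == j:
        return f"A{i}"
    k = s[i][j]
    return f"({parenthesization(s, i, k)} {parenthesization(s, k + 1, j)})"

p = [30, 35, 15, 5, 10, 20, 25]
m, s = matrix_chain_order(p)
print(m[1][6])                       # 15125
print(parenthesization(s, 1, 6))     # ((A1 (A2 A3)) ((A4 A5) A6))
```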
Complexity Analysis

The dynamic programming solution for the matrix chain multiplication problem runs in $O(n^3)$ time, where $n$ is the number of matrices, stemming from the three nested loops in the bottom-up implementation: one over chain lengths $l$ from 2 to $n$, one over starting indices $i$ from 1 to $n-l+1$, and one over split points $k$ from $i$ to $i+l-2$, with each loop bounded by $O(n)$ iterations.[](https://ieeexplore.ieee.org/document/5009182/) This cubic dependence makes the computation efficient for moderate chain lengths.

The [space complexity](/page/Space_complexity) is $O(n^2)$, primarily due to the two-dimensional arrays $m[1\dots n, 1\dots n]$ storing minimum multiplication costs and $s[1\dots n, 1\dots n]$ recording optimal parenthesization splits.[](https://ieeexplore.ieee.org/document/5009182/) Unlike the naive recursive formulation, which incurs exponential [time complexity](/page/Time_complexity) by repeatedly solving identical subproblems without [memoization](/page/Memoization), the dynamic programming approach exploits [optimal substructure](/page/Optimal_substructure) (optimal solutions to subchains compose into global optima) and [overlapping subproblems](/page/Overlapping_subproblems) to eliminate redundancy, yielding the [polynomial](/page/Polynomial) bound.[](https://arxiv.org/abs/2104.01777) In practice, for $n = 100$ the $O(n^3)$ operations total roughly $10^6$, which is easily feasible on standard hardware. The method is [polynomial](/page/Polynomial) in $n$; the cost entries themselves involve products of potentially large dimensions, though arithmetic operations are typically assumed constant-time.[](https://www.cs.umd.edu/class/spring2025/cmsc451-0101/Lects/lect10-dp-mat-mult.pdf) Space optimizations exist, such as in-place updates or a one-dimensional array for costs alone (reducing to $O(n)$ space when the parenthesization itself is not needed), though the standard implementation retains $O(n^2)$ for full reconstruction.[](https://www.cs.utexas.edu/~vlr/papers/soda06.pdf)

Advanced Algorithms

Hu and Shing Method

The Hu and Shing method provides an $O(n \log n)$ [algorithm](/page/Algorithm) for optimally parenthesizing a [chain](/page/Chain) of matrices. The cost function satisfies the convex quadrangle inequality, which guarantees the existence of a unique optimal solution structure amenable to efficient computation.[](https://epubs.siam.org/doi/10.1137/0211028) A central [insight](/page/Insight) is that optimal multiplication splits are non-crossing, mirroring the triangulation of a [convex polygon](/page/Convex_polygon) whose vertices correspond to the dimensions. The [algorithm](/page/Algorithm) models the chain as a [convex polygon](/page/Convex_polygon) and uses efficient data structures, such as priority queues, to select and process candidate splits, merging subchains by repeatedly choosing the lowest-cost compatible merge and updating costs accordingly. This continues until the full [triangulation](/page/Triangulation) is obtained, yielding the optimal parenthesization.[](https://epubs.siam.org/doi/10.1137/0211028) The [time complexity](/page/Time_complexity) is $O(n \log n)$, dominated by priority queue operations (insertions and extractions) using a [binary heap](/page/Binary_heap) to manage up to $O(n)$ candidate merges.[](https://epubs.siam.org/doi/10.1137/0211028)

**Example**: Consider the dimensions $[1, 2, 3, 5, 8]$ for matrices $A_1(1 \times 2)$, $A_2(2 \times 3)$, $A_3(3 \times 5)$, $A_4(5 \times 8)$. The algorithm identifies the initial lowest-cost merge as $A_1$ and $A_2$ (cost $1 \cdot 2 \cdot 3 = 6$), producing a $1 \times 3$ submatrix. Next, it merges this with $A_3$ (cost $1 \cdot 3 \cdot 5 = 15$), yielding a $1 \times 5$ submatrix.
Finally, it merges the result with $A_4$ (cost $1 \cdot 5 \cdot 8 = 40$), for a total cost of 61 and the parenthesization $((A_1 A_2) A_3) A_4$.[](https://epubs.siam.org/doi/10.1137/0211028)

Other O(n log n) Algorithms

In addition to the Hu and Shing method, which applies specifically to cases with [convex](/page/Convex) cost structures, alternative exact algorithms achieve $O(n \log n)$ [time complexity](/page/Time_complexity) for the general matrix chain multiplication problem using divide-and-conquer paradigms that exploit the inherent properties of the [cost function](/page/Cost_function).[](https://epubs.siam.org/doi/10.1137/18M1195401) A prominent example is the simplified algorithm developed by [Wang](/page/Wang), Zhu, and [Tian](/page/Tian), which recursively partitions the chain and identifies the optimal split point $k$ for each subchain via binary search guided by the quadrangle [inequality](/page/Inequality).[](https://ieeexplore.ieee.org/document/6553999) The quadrangle [inequality](/page/Inequality) states that the [cost function](/page/Cost_function) satisfies $C(i,j) + C(i',j') \leq C(i,j') + C(i',j)$ for $i \leq i' \leq j \leq j'$, implying monotonicity of the optimal split positions, which allows binary search to narrow down the range of candidate $k$ values efficiently.[](https://epubs.siam.org/doi/10.1137/18M1195401) This approach evaluates only $O(\log n)$ candidate splits per recursion level, with a [recursion](/page/Recursion) depth of $O(\log n)$, yielding the overall $O(n \log n)$ complexity without requiring assumptions beyond the standard [matrix multiplication](/page/Matrix_multiplication) costs.[](https://ieeexplore.ieee.org/document/6553999)

To further optimize, these methods precompute tight bounds on the optimal $k$ for each subchain by leveraging the monotonicity of costs, specifically the fact that the optimal split $k(i,j)$ is non-decreasing in $j$ for fixed $i$, reducing the search space and avoiding exhaustive enumeration.[](https://epubs.siam.org/doi/10.1137/18M1195401) This technique applies to arbitrary dimension sequences, including non-monotonic ones where dimensions do not increase steadily. For illustration, consider a short chain with dimensions $p = [10, 1, 100, 50]$, corresponding to matrices $A$ (10×1), $B$ (1×100), and $C$ (100×50). The two possible groupings are $A(BC)$ with cost $1 \times 100 \times 50 + 10 \times 1 \times 50 = 5000 + 500 = 5500$ and $(AB)C$ with cost $10 \times 1 \times 100 + 10 \times 100 \times 50 = 100 + 50000 = 50100$. In a recursive divide-and-conquer setting for longer chains containing this subchain, binary search on $k$ (from 1 to 2) quickly identifies the minimum at $k=1$ by evaluating midpoints and using monotonicity to prune, demonstrating how the method avoids linear scans even when dimensions drop sharply, as from 10 to 1 here.[](https://epubs.siam.org/doi/10.1137/18M1195401) Compared to earlier methods like Hu and Shing, these divide-and-conquer variants offer cleaner implementations with equivalent asymptotic performance but incorporate explicit logarithmic overhead from binary search, making them more accessible while preserving exact optimality for the general case.[](https://ieeexplore.ieee.org/document/6553999)

Approximation Techniques

Approximation algorithms for the matrix chain multiplication problem provide efficient ways to find near-optimal parenthesizations when the exact dynamic programming solution, which runs in $O(n^3)$ time, is impractical for very large $n$. These methods trade optimality for speed, offering multiplicative guarantees on the computation cost relative to the optimal solution.
Seminal contributions include early heuristics that achieve constant-factor approximations in linear time, making them suitable for scenarios such as [streaming data](/page/Streaming_data) processing or preliminary computations where speed is prioritized over precision.[](https://arxiv.org/pdf/2303.17352)

One of the earliest [approximation](/page/Approximation) algorithms is due to [Chandra](/page/Chandra), who in 1975 proposed an O(n) method that guarantees a cost at most twice the optimal. The algorithm identifies the position m of the smallest [dimension](/page/Dimension) p_m among the chain dimensions p_0, p_1, ..., p_n. It then computes the product by parenthesizing the prefix chain from the first [matrix](/page/Matrix) to the m-th as a right-associative product (multiplying right-to-left) and the suffix from the (m+1)-th to the last as a left-associative product (left-to-right), before combining the two partial results. This greedy-like strategy prioritizes merging around the smallest dimension to bound the intermediate costs. The [approximation](/page/Approximation) ratio of 2 arises because each partial product cost is at most the optimal cost for that subchain plus an additional factor from the final multiplication, as analyzed in subsequent works.[](https://arxiv.org/pdf/2303.17352)[](https://www.cs.huji.ac.il/~odedsc/papers/SICOMP19-Revisiting.pdf)

Chin improved upon this in 1978 with an O(n) algorithm that achieves a better approximation ratio of at most 1.25. The method involves four phases: initializing data structures and finding the smallest dimension; scanning the chain forward to associate matrices based on conditions that avoid high-cost merges (using lemmas on dimension ratios); nibbling from both ends to further group low-cost pairs; and finally applying Chandra's algorithm to the reduced chain. This preprocessing eliminates many suboptimal cases, leading to tighter bounds. For example, for a chain with dimensions p = [40, 10, 20, 5, 30, 15], corresponding to matrices M₁ (40×10), M₂ (10×20), M₃ (20×5), M₄ (5×30), M₅ (30×15), the optimal parenthesization ((M₁ (M₂ M₃)) (M₄ M₅)) costs 8250 scalar multiplications, and Chin's guarantee ensures the heuristic's output is within 25% of that cost.[](https://dl.acm.org/doi/10.1145/359545.359556)[](https://arxiv.org/pdf/2303.17352)

Hu and Shing reformulated and analyzed Chin's approach in [1981](/page/1981) by reducing the matrix chain problem to optimal triangulation of a [convex polygon](/page/Convex_polygon), where each diagonal cost corresponds to a multiplication cost. Their analysis of the O(n) heuristic proves a tighter approximation ratio of 1.155, showing it is near-optimal in the worst case. Recent analyses as of 2025 have further improved variants to ratios under 1.143.[](https://www.sciencedirect.com/science/article/pii/0196677481900146)[](https://arxiv.org/pdf/2303.17352) This analysis confirms the method's reliability without altering the core steps, and it has influenced subsequent improvements.
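A sketch of Chandra's split-at-the-smallest-dimension heuristic as described above (an illustrative reading of that description, not code from the cited papers): the prefix up to the smallest dimension is multiplied right-to-left, the suffix left-to-right, and the two partial products are then combined.

```python
def chandra_cost(p):
    """Cost of the heuristic parenthesization (sketch): split the chain at the
    position m of the smallest dimension in p, multiply the prefix A_1..A_m
    right-to-left, the suffix A_{m+1}..A_n left-to-right, then combine."""
    n = len(p) - 1
    m = min(range(n + 1), key=lambda i: p[i])   # index of the smallest dimension

    def right_assoc(i, j):
        # Cost of A_i (A_{i+1} ( ... A_j )): each step multiplies into a p[k] x p[j] result.
        return sum(p[k - 1] * p[k] * p[j] for k in range(i, j))

    def left_assoc(i, j):
        # Cost of ((A_i A_{i+1}) ... ) A_j: each step extends a p[i-1] x p[k] result.
        return sum(p[i - 1] * p[k] * p[k + 1] for k in range(i, j))

    cost = right_assoc(1, m) + left_assoc(m + 1, n)
    if 0 < m < n:
        cost += p[0] * p[m] * p[n]              # combine the two partial products
    return cost

# Dimensions from the Chin example above; here the heuristic happens to attain
# the optimum of 8250 scalar multiplications.
print(chandra_cost([40, 10, 20, 5, 30, 15]))
```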
Together, these techniques provide practical alternatives to exact methods, especially in applications like [compiler](/page/Compiler) optimization or large-scale scientific [computing](/page/Computing) where n exceeds thousands and exact solutions exceed time budgets.[](https://arxiv.org/pdf/2303.17352) Other approximation strategies include [greedy](/page/Greedy) heuristics based on dimension ratios, such as repeatedly merging adjacent pairs where the ratio of outer to inner dimensions is minimized in order to keep intermediate sizes small. These can achieve ratios better than 2 in practice but lack formal guarantees without additional preprocessing. [Linear programming](/page/Linear_programming) relaxations have also been explored for related chain optimization problems, yielding ratios like 4/3 in restricted cases, though they are less common for the standard matrix chain problem because of its structure. These methods are particularly useful in streaming settings, where matrices arrive sequentially and full dynamic programming tables cannot be maintained.[](https://arxiv.org/pdf/2303.17352)

Extensions

Generalizations

The matrix chain multiplication problem can be generalized to other associative binary operations beyond standard [scalar multiplication](/page/Scalar_multiplication), such as in algebraic structures where the operation defines a cost for combining elements of a [sequence](/page/Sequence).[](https://arxiv.org/pdf/1804.04021) Constrained variants of the problem incorporate additional [resource](/page/Resource) limitations, such as limited [parallelism](/page/Parallel) or [storage](/page/Storage), to optimize not just scalar operations but also practical [implementation](/page/Implementation) factors. In [parallel](/page/Parallel) settings, the goal shifts to minimizing the depth of the multiplication tree under constraints on the number of processors, where dynamic programming is augmented with scheduling heuristics to balance load and communication overhead, achieving near-optimal performance on distributed systems.[](https://link.springer.com/chapter/10.1007/978-3-031-74430-3_7) For storage-constrained environments, algorithms focus on reducing temporary matrix allocations during computation, using techniques like in-place operations or checkpointing to bound peak memory usage while maintaining minimal multiplication counts, which is particularly useful in [embedded](/page/Embedded) or memory-limited [hardware](/page/Hardware).[](https://umu.diva-portal.org/smash/get/diva2:1766984/FULLTEXT01.pdf)

Higher-dimensional generalizations extend the problem to tensor [chain](/page/chain) multiplication, where sequences of multi-way arrays (tensors) are contracted along compatible modes to minimize computational [cost](/page/Cost), analogous to the matrix case but with costs scaling exponentially in the number of dimensions.
For sparse tensors, [parallel](/page/parallel) algorithms leverage [hardware](/page/Hardware) accelerators like Tensor [Core](/page/Core) Units (TCUs) to optimize mode contractions, reducing [time complexity](/page/time_complexity) from cubic to near-linear in sparsity patterns for applications in [machine learning](/page/machine_learning).[](https://ieeexplore.ieee.org/document/10159508)

The matrix chain problem is isomorphic to optimal polygon triangulation: triangulating a convex polygon with $n+1$ vertices corresponds to parenthesizing $n$ matrices via a dimensional mapping (vertex labels to matrix dimensions), and the cost of adding a diagonal equals the corresponding matrix multiplication cost, allowing the same $O(n^3)$ dynamic programming solution.[](https://arxiv.org/abs/2104.01777) It also relates to constructing optimal binary search trees (BSTs), where the DP formulation mirrors matrix chain ordering by minimizing weighted path lengths instead of multiplication costs, with subproblem overlaps enabling efficient computation from search frequencies.[](https://www.eecs.umich.edu/courses/eecs380/ALG/mat_chain.html)

Theoretical extensions generalize the framework to algebraic [semiring](/page/Semiring)s, replacing standard addition and multiplication with semiring operations (e.g., min-plus for shortest paths or max-product for probabilistic inference), preserving associativity so that the [chain](/page/Chain) ordering [DP](/page/DP) applies to evaluating sequences in tropical or other semirings.[](https://www.ifi.uzh.ch/dam/jcr:06d7fd07-b484-4ce1-b38a-1e129ba513ef/EA21-2-Semirings.pdf) In [quantum computing](/page/Quantum_computing), the problem adapts to subroutine design for quantum [matrix multiplication](/page/Matrix_multiplication) circuits, using amplitude encoding and Chebyshev approximations to achieve faster evaluation of [chain](/page/Chain) products on quantum [hardware](/page/Hardware), with complexity scaling polylogarithmically in [matrix](/page/Matrix) dimensions for fault-tolerant implementations.[](https://www.nature.com/articles/s41598-025-13092-2)

Applications

Matrix chain multiplication finds practical applications across various computational domains, where optimizing the order of matrix products is crucial for reducing computational costs in large-scale calculations. The problem was first systematically formulated in the context of algorithm design in the [1970s](/page/1970s), drawing on [operations research](/page/Operations_research) principles for sequencing operations efficiently, such as the scheduling and [resource allocation](/page/Resource_allocation) tasks prevalent in the [1960s](/page/1960s). This foundational approach has since influenced optimization strategies in diverse fields, emphasizing minimal scalar multiplications for chains of compatible matrices.

In compiler optimization, [matrix](/page/Matrix) chain multiplication is employed to determine the optimal parenthesization of [matrix](/page/Matrix) expression trees during [code generation](/page/Code_generation) for numerical libraries.
For instance, libraries like [Armadillo](/page/Armadillo) use expression templates to automatically reorder chained [matrix](/page/Matrix) multiplications, minimizing temporary allocations and operation counts by choosing an efficient evaluation sequence at [compile time](/page/Compile_time).[](https://arma.sourceforge.net/docs.html) This technique is particularly valuable in scientific computing, where complex linear algebra expressions in user code are rewritten to exploit hardware accelerators like GPUs, achieving significant speedups in high-performance applications.

In [computer graphics](/page/Computer_graphics) and [robotics](/page/Robotics), the method optimizes chains of transformation matrices representing rotations, scalings, and translations applied to vertices or joint configurations. In graphics pipelines, such as rendering scenes with hierarchical models, efficient ordering reduces the number of floating-point operations when composing multiple affine transformations, improving [real-time](/page/Real-time) [performance](/page/Performance). Similarly, in robotic [kinematics](/page/Kinematics), forward chain calculations for manipulator arms benefit from dynamic programming to find the minimal-cost multiplication order, enabling faster trajectory planning and simulation in control systems.[](https://www.researchgate.net/publication/368227120_Matrix_Chain_Multiplication_and_Equivalent_Reduced-Order_Parallel_Calculation_Method_for_a_Robotic_Arm)

Machine learning applications leverage matrix chain multiplication to streamline computations in neural network forward passes and tensor-based models. For example, optimizing the order of layer-wise matrix products in deep networks minimizes intermediate tensor sizes, reducing memory usage during training on large datasets. This extends to tensor contractions in models like transformers, where generalizations of the algorithm, such as those implemented in opt_einsum, find near-optimal contraction paths to accelerate operations in scientific ML workflows.

Implementations of matrix chain multiplication appear in numerical libraries and custom solvers for large-scale simulations. While core libraries like [NumPy](/page/NumPy) rely on BLAS for pairwise multiplications and evaluate chained expressions sequentially by default, extensions and specialized tools incorporate dynamic programming for order optimization in domains like [climate](/page/Climate) modeling or quantum simulations, where chains of hundreds of matrices demand precise cost minimization to handle exascale computations efficiently.[](https://dl.acm.org/doi/10.1145/3654959)
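As a concrete library-level illustration (the specific shapes below reuse the four-matrix example from the introduction), NumPy exposes this ordering optimization through `numpy.linalg.multi_dot`, which runs the chain-ordering dynamic program internally before dispatching the pairwise products to BLAS.

```python
import numpy as np

rng = np.random.default_rng(0)
# The four-matrix chain from the introduction: 30x1, 1x40, 40x10, 10x25.
A1 = rng.standard_normal((30, 1))
A2 = rng.standard_normal((1, 40))
A3 = rng.standard_normal((40, 10))
A4 = rng.standard_normal((10, 25))

# Plain chained @ evaluates left to right, costing 20,700 scalar multiplications.
sequential = A1 @ A2 @ A3 @ A4

# multi_dot applies the chain-ordering DP and evaluates the cheapest grouping,
# A1((A2 A3) A4), costing 1,400 scalar multiplications.
ordered = np.linalg.multi_dot([A1, A2, A3, A4])

print(np.allclose(sequential, ordered))   # True: same product either way
```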