Fact-checked by Grok 2 weeks ago

Multi-index notation

Multi-index notation is a compact mathematical employed in multivariable analysis to denote partial derivatives, powers of variables, and related multi-dimensional operations using a single symbol for a of exponents. A multi-index \alpha is defined as a vector \alpha = (\alpha_1, \alpha_2, \dots, \alpha_n) where each \alpha_i is a non-negative integer, and its order is given by |\alpha| = \alpha_1 + \alpha_2 + \dots + \alpha_n. For a smooth function u: \mathbb{R}^n \to \mathbb{R}, the partial derivative operator D^\alpha u (or \partial^\alpha u) represents \frac{\partial^{|\alpha|} u}{\partial x_1^{\alpha_1} \partial x_2^{\alpha_2} \cdots \partial x_n^{\alpha_n}}, simplifying the expression of higher-order mixed derivatives. Additionally, for powers, x^\alpha = x_1^{\alpha_1} x_2^{\alpha_2} \cdots x_n^{\alpha_n}, and the multi-index factorial is \alpha! = \alpha_1! \alpha_2! \cdots \alpha_n!, which facilitates combinatorial aspects like multinomial expansions. This notation streamlines the generalization of single-variable concepts to multiple dimensions, particularly in Taylor expansions and partial differential equations (PDEs). In the multivariable Taylor theorem, the expansion of a function f(x + y) around x includes terms \frac{y^\alpha}{\alpha!} D^\alpha f(x) summed over multi-indices \alpha with |\alpha| < k, plus a remainder term that can be expressed in integral or Lagrange form using higher-order multi-index derivatives. For instance, the set D^k u = \{D^\alpha u : |\alpha| = k\} collects all partial derivatives of total order k, enabling concise statements of approximation theorems. In PDEs, multi-index notation is essential for defining the order and type of equations, such as classifying the transport equation as first-order (involving D^\alpha with |\alpha| = 1) or the heat equation as second-order (with |\alpha| = 2). It also supports operations like addition of multi-indices, where \alpha + \beta = (\alpha_1 + \beta_1, \dots, \alpha_n + \beta_n), ensuring that mixed partials commute: D^{\alpha + \beta} u = D^\alpha (D^\beta u) = D^\beta (D^\alpha u). Beyond classical analysis, it appears in functional spaces like W^{k,p}(\Omega), where weak derivatives up to order k are required to satisfy \partial^\alpha u \in L^p(\Omega) for all \alpha with |\alpha| \leq k. These applications highlight its role in making complex multi-variable expressions more manageable and intuitive.

Definition and Notation

Formal Definition

In mathematics, particularly in multivariable calculus and analysis, a multi-index \alpha is defined as an n-tuple of non-negative integers, belonging to the set \mathbb{N}_0^n, where \mathbb{N}_0 denotes the non-negative integers \{0, 1, 2, \dots\} and n is the dimension of the underlying space. This structure allows \alpha to systematically represent orders or degrees in multi-dimensional contexts. Explicitly, \alpha = (\alpha_1, \alpha_2, \dots, \alpha_n) with each component \alpha_i \in \mathbb{N}_0 for i = 1, \dots, n. In finite-dimensional settings, all components are specified, but for infinite-dimensional spaces—such as in the theory of distributions or function spaces—multi-indices are typically required to have finite support, meaning only finitely many \alpha_i are non-zero, while the rest are zero. This restriction ensures well-defined operations and convergence in infinite products or sums. Multi-indices serve as a compact notational tool for indexing multi-dimensional quantities, such as the orders of partial derivatives or the exponents in monomials within multivariate polynomials. For instance, they facilitate concise expressions for higher-order derivatives in analysis, as explored in subsequent applications.

Symbolic Conventions

In multi-index notation, multi-indices are commonly denoted by Greek letters such as \alpha or \beta, which are occasionally rendered in boldface (e.g., \boldsymbol{\alpha}) in printed texts to emphasize their vectorial nature. For a vector \mathbf{x} = (x_1, \dots, x_n) \in \mathbb{R}^n and a multi-index \alpha = (\alpha_1, \dots, \alpha_n), the associated power is defined as \mathbf{x}^\alpha = \prod_{i=1}^n x_i^{\alpha_i}. This convention extends the standard exponentiation to multivariable contexts, facilitating compact expressions for monomials in polynomials. Summations involving multi-indices are expressed using sigma notation, such as \sum_\alpha to indicate summation over all relevant multi-indices or \sum_{|\alpha|=k} to restrict to those with total order k = \alpha_1 + \dots + \alpha_n. These forms are standard in expansions like , where the sum aggregates terms weighted by multi-index factorials. The zero multi-index is denoted by \mathbf{0} = (0, \dots, 0), satisfying \mathbf{x}^\mathbf{0} = 1 for any \mathbf{x}, which aligns with the empty product convention in exponentiation.

Algebraic Operations

Arithmetic on Multi-Indices

Multi-indices, denoted as \alpha = (\alpha_1, \dots, \alpha_n) \in \mathbb{N}_0^n where \mathbb{N}_0 is the set of non-negative integers, support vector-like arithmetic operations defined componentwise to preserve their structure in non-negative integer tuples. The addition of two multi-indices \alpha and \beta is given by \alpha + \beta = (\alpha_1 + \beta_1, \dots, \alpha_n + \beta_n), which results in another multi-index since the sum of non-negative integers remains non-negative. This operation is commutative, meaning \alpha + \beta = \beta + \alpha, and associative, (\alpha + \beta) + \gamma = \alpha + (\beta + \gamma) for any multi-indices \alpha, \beta, \gamma. Scalar multiplication by a non-negative integer k \in \mathbb{N}_0 is defined as k\alpha = (k\alpha_1, \dots, k\alpha_n), yielding a multi-index where each component is scaled accordingly. This operation distributes over addition: k(\alpha + \beta) = k\alpha + k\beta, and addition over scalars: (k + m)\alpha = k\alpha + m\alpha for k, m \in \mathbb{N}_0. These properties mirror those of vector spaces but are restricted to the semigroup structure of \mathbb{N}_0^n under componentwise operations. Subtraction \alpha - \beta is defined only when \beta \leq \alpha componentwise, i.e., \beta_i \leq \alpha_i for all i = 1, \dots, n, and takes the form \alpha - \beta = (\alpha_1 - \beta_1, \dots, \alpha_n - \beta_n), ensuring the result is again a multi-index in \mathbb{N}_0^n. This partial operation supports decompositions in contexts like multi-index sums but does not extend to a full group structure.

Combinatorial Coefficients

In multi-index notation, the factorial of a multi-index \alpha = (\alpha_1, \dots, \alpha_n) \in \mathbb{N}_0^n is defined as the product of the factorials of its components: \alpha! = \prod_{i=1}^n \alpha_i!. This definition is used in the denominator of the multinomial coefficient \binom{|\alpha|}{\alpha_1, \dots, \alpha_n} = \frac{|\alpha|!}{\alpha!}, which counts the number of ways to divide |\alpha| distinct objects into n labeled groups of sizes \alpha_1, \dots, \alpha_n. A key combinatorial coefficient derived from this is the multinomial coefficient \binom{\alpha}{\beta}, defined for multi-indices \beta \leq \alpha (componentwise) as \binom{\alpha}{\beta} = \frac{\alpha!}{\beta! \, (\alpha - \beta)!}, and zero otherwise. This expression equals the product \prod_{i=1}^n \binom{\alpha_i}{\beta_i}, reflecting its structure as an independent choice per dimension. Combinatorially, as it equals the product \prod_{i=1}^n \binom{\alpha_i}{\beta_i}, it counts the total number of ways to choose, independently for each i, a subset of \beta_i elements from a set of \alpha_i distinct elements across the dimensions. These coefficients appear prominently in multivariable expansions, such as the multinomial theorem for (x + y)^\alpha, where x = (x_1, \dots, x_n) and y = (y_1, \dots, y_n) are vectors: (x + y)^\alpha = \sum_{\beta \leq \alpha} \binom{\alpha}{\beta} x^\beta y^{\alpha - \beta}. This product form \prod_{i=1}^n (x_i + y_i)^{\alpha_i} expands to the sum above, with each \binom{\alpha}{\beta} weighting the terms according to the ways to allocate exponents between x and y per variable. The construction ensures the expansion captures all possible combinations while preserving the total order per component, essential for applications in series and approximations.

Structural Properties

Length and Magnitude

In multi-index notation, the length or order of a multi-index \alpha = (\alpha_1, \dots, \alpha_n) \in \mathbb{N}_0^n is defined as |\alpha| = \sum_{i=1}^n \alpha_i, which corresponds to the total degree of the multi-index. This quantity plays a central role in multivariable analysis, particularly for classifying homogeneous polynomials, where a monomial x^\alpha is homogeneous of degree k if |\alpha| = k. More generally, measures of magnitude for multi-indices extend to l_p-norms, defined as |\alpha|_p = \left( \sum_{i=1}^n \alpha_i^p \right)^{1/p} for $1 \leq p < \infty, providing weighted assessments of size that generalize the total degree (the case p=1). For p = \infty, the supremum norm is \|\alpha\|_\infty = \max_i \alpha_i, capturing the maximum component of the multi-index. These norms are used to define multi-index sets in polynomial approximation, such as \{ \alpha \in \mathbb{N}_0^n : \|\alpha\|_p \leq m \}, which filter terms in sums by controlling the overall magnitude and help mitigate the curse of dimensionality in high dimensions. Such magnitude measures also facilitate the organization of summation indices in multivariate series expansions, where terms are grouped by total degree |\alpha| to isolate contributions from specific orders.

Partial Ordering

In multi-index notation, the set \mathbb{N}_0^n of n-tuples of nonnegative integers is equipped with a partial order defined componentwise: for multi-indices \alpha = (\alpha_1, \dots, \alpha_n) and \beta = (\beta_1, \dots, \beta_n), one has \alpha \leq \beta if and only if \alpha_i \leq \beta_i for every i = 1, \dots, n. This order makes \mathbb{N}_0^n into a partially ordered set (poset), specifically the product of n copies of the chain poset \mathbb{N}_0. The associated strict partial order is given by \alpha < \beta if \alpha \leq \beta and \alpha \neq \beta. In this poset, a chain is a totally ordered subset, meaning any two elements are comparable under \leq, while an antichain is a subset in which no two distinct elements are comparable. By , the size of the largest antichain equals the minimum number of chains needed to cover the poset, which has applications in combinatorial optimization over multi-indices. To obtain a total order compatible with the partial order, one may use the lexicographic order \leq_{\lex}, defined by \alpha <_{\lex} \beta if, in the leftmost component j where \alpha_j \neq \beta_j, it holds that \alpha_j < \beta_j. This order refines the componentwise partial order and is commonly employed as a monomial ordering in polynomial rings. In applications such as the for higher-order derivatives, the partial order facilitates summation over all \beta \leq \alpha.

Applications in Analysis

Higher-Order Derivatives

In multi-index notation, higher-order partial derivatives of a function f: \mathbb{R}^n \to \mathbb{R} are expressed compactly using a multi-index \alpha = (\alpha_1, \dots, \alpha_n) \in \mathbb{N}_0^n, where each \alpha_i is a non-negative integer denoting the order of differentiation with respect to the variable x_i. The partial derivative operator is defined as \partial^\alpha f = \frac{\partial^{\alpha_1}}{\partial x_1^{\alpha_1}} \cdots \frac{\partial^{\alpha_n}}{\partial x_n^{\alpha_n}} f, which applies the derivatives successively in any order, provided f is sufficiently smooth. The total order of this derivative is given by the magnitude |\alpha| = \alpha_1 + \dots + \alpha_n, which represents the overall degree of differentiation. For instance, derivatives of order k are those where |\alpha| = k. Under appropriate smoothness conditions, such as f \in C^{|\alpha|}(\mathbb{R}^n), the mixed partial derivatives commute, meaning the order of differentiation does not affect the result. This is a multivariable extension of , stating that \partial^\alpha \partial^\beta f = \partial^\beta \partial^\alpha f for multi-indices \alpha and \beta whenever f is twice continuously differentiable in the relevant components. The equality holds more generally for higher orders when f is sufficiently smooth, ensuring that the notation \partial^\alpha f is well-defined independently of the sequence of partials applied. Special cases of this notation include the gradient and the Hessian matrix. The gradient corresponds to first-order derivatives, where |\alpha| = 1 and exactly one \alpha_i = 1 with the rest zero, yielding \nabla f = D^1 f = (\partial f / \partial x_1, \dots, \partial f / \partial x_n). The Hessian arises for second-order derivatives with |\alpha| = 2, forming an n \times n symmetric matrix whose entries are the mixed partials \partial^2 f / \partial x_i \partial x_j, again assuming f \in C^2(\mathbb{R}^n).

Taylor Expansion

The multivariable Taylor theorem utilizes multi-index notation to express the polynomial approximation of a sufficiently smooth function f: \mathbb{R}^n \to \mathbb{R} around a point a \in \mathbb{R}^n. For f of class C^m in a convex open neighborhood of a, the expansion up to order m takes the form f(x) = \sum_{|\alpha| \leq m} \frac{\partial^\alpha f(a)}{\alpha!} (x - a)^\alpha + R_m(x), where the sum collects all partial derivative terms scaled by the multinomial coefficient \alpha! and the corresponding monomial powers. The remainder R_m(x) quantifies the approximation error, and in the Peano form, it satisfies R_m(x) = o(|x - a|^m) as x \to a. This little-o condition holds provided f is C^m near a, ensuring the polynomial part captures the local behavior up to order m. Each component of the Taylor polynomial is a homogeneous polynomial: the terms of exact degree k form \sum_{|\alpha|=k} c_\alpha (x - a)^\alpha, where the coefficients c_\alpha = \partial^\alpha f(a) / \alpha! are determined by the higher-order partial derivatives at a. These homogeneous components generalize the familiar quadratic forms in single-variable expansions to higher dimensions. When f is real analytic at a, meaning it equals its power series locally in some neighborhood, the infinite Taylor series \sum_{|\alpha| = 0}^\infty \frac{\partial^\alpha f(a)}{\alpha!} (x - a)^\alpha converges to f(x) for all x within the radius of convergence, which is the distance from a to the nearest point where f fails to be analytic. This convergence requires growth bounds on the derivatives, such as |\partial^\alpha f(a)| \leq M K^{|\alpha|} |\alpha|! for constants M, K > 0.

Leibniz Product Rule

The Leibniz product rule in multi-index notation generalizes the classical from single-variable to higher-order partial derivatives of products of multivariable functions. For smooth functions u, v: \mathbb{R}^n \to \mathbb{R} and a multi-index \alpha \in \mathbb{N}_0^n, the rule states that \partial^\alpha (u v) = \sum_{\beta \leq \alpha} \binom{\alpha}{\beta} (\partial^\beta u) (\partial^{\alpha - \beta} v), where the sum is over all multi-indices \beta such that \beta_i \leq \alpha_i for each i = 1, \dots, n, and \binom{\alpha}{\beta} = \frac{\alpha!}{\beta! (\alpha - \beta)!} is the multi-index . This formula accounts for the combinatorial ways in which the derivatives can be distributed between u and v across each variable. A proof of this rule can be obtained by on the order |\alpha| of the multi-index. The base case |\alpha| = 1 follows directly from the for first-order partial derivatives: for \alpha = e_j (the vector with 1 in the j-th position and 0 elsewhere), \partial_{x_j} (u v) = (\partial_{x_j} u) v + u (\partial_{x_j} v), which matches the sum with the two terms where \beta = 0 or \beta = e_j. Assuming the formula holds for all multi-indices of order at most k, consider |\alpha| = k+1. , take \alpha = \gamma + e_j for some multi-index \gamma with |\gamma| = k and j \in \{1, \dots, n\}. Applying the first-order to \partial^\gamma (u v) yields \partial^\alpha (u v) = \partial_{x_j} (\partial^\gamma (u v)) = (\partial_{x_j} (\partial^\gamma u)) v + u (\partial_{x_j} (\partial^\gamma v)). By the hypothesis and commutativity of mixed partial derivatives, each term expands into the desired sum over \beta \leq \alpha. The general case follows by iterating over the components of \alpha. When n=1, the multi-index \alpha reduces to a non-negative , \beta \leq \alpha means $0 \leq \beta \leq \alpha, and \binom{\alpha}{\beta} is the standard , so the formula recovers the classical Leibniz rule for the \alpha-th derivative of a product: (u v)^{(\alpha)} = \sum_{\beta=0}^\alpha \binom{\alpha}{\beta} u^{(\beta)} v^{(\alpha - \beta)}. This rule finds application in differentiating powers of functions, such as u^m for positive m, by viewing the power as an m-fold product and iteratively applying the formula, which leads to a multinomial expansion involving . It also extends to more general compositions in certain contexts, though higher-order chain rules require additional tools like Faà di Bruno's formula.

References

  1. [1]
    [PDF] 1 Introduction
    multi-index notation. A multi-index is a vector α = (α1,...,αn) where each αi is a nonnegative integer. The order of the multi-index is |α| = α1 + ... + αn.
  2. [2]
    [PDF] Notes on Partial Differential Equations John K. Hunter - UC Davis Math
    where we use the multi-index notation for partial derivatives explained in Sec- ... multi-indices, we define the multi-index (α + β) by α + β = (α1 + β1,α2 ...
  3. [3]
    [PDF] Notes on Taylor's formula Math 511, Spring 2018
    We define the order of a multi-index to be. |α| = α1 + ··· + αn. We further define the factorial of a multi-index as α! = α1!α2! ··· αn!.
  4. [4]
    Multi-Index Notation -- from Wolfram MathWorld
    Multi-index notation is used to shorten expressions that contain many indices. Let x in R^n and write x=(x_1,...,x_n). A multi-index alpha is an n-tuple of ...
  5. [5]
    [PDF] Multi-index notation - Applied Mathematics Consulting
    Sep 18, 2008 · Multi-index notation makes multi-variable generalizations of familiar one- variable theorems easier to remember. We will give two examples: ...
  6. [6]
    multi-index notation - PlanetMath.org
    Mar 22, 2013 · Multi-indices form a powerful notational device for keeping track of multiple Mathworld Planetmath derivatives Planetmath Planetmath or multiple powers.
  7. [7]
    Multi-Index - an overview | ScienceDirect Topics
    A multi-index is defined as an n-tuple of nonnegative integers, denoted by α = (α₁, …, αₙ), which is used to represent the degree of a monomial xᵃ in the ...
  8. [8]
    [PDF] Multivariable Calculus Michael Taylor
    ... notation xv. Chapter 1 ... This is a text for students with a background in one-variable calculus, who are ready to tackle calculus in several variables.
  9. [9]
    [PDF] A Guide to Advanced Real Analysis - Kufunda.net
    multi-index notation for polynomials and partial derivatives. A multi-index is an n-tuple˛ D .˛1;:::;˛n/ of nonnegative integers. If ˛ is a multi-index and ...
  10. [10]
    [PDF] Introduction to Sobolev Spaces
    Dec 23, 2018 · (c) Multi-index notation. (i) A list α = (α1,...,αn) of integers αi ≥ 0 is a multi-index of order. |α| := α1 + ··· + αn. Set α! := α1! ...αn!
  11. [11]
    None
    **Summary of Multiindices Lecture Notes by Ulrik Skre Fjordholm (February 18, 2021)**
  12. [12]
    [PDF] 1. “Prove (probably by induction), the multi-variable form of Leibniz ...
    (0, 0, ..., 1). Thus, given a multi-index α = (α1, ... αn), we can write α = α1ε1 + ...Missing: mathematics | Show results with:mathematics<|control11|><|separator|>
  13. [13]
    [PDF] Notes on Partial Differential Equations John K. Hunter - UC Davis Math
    where we use the multi-index notation for partial derivatives explained in Sec- tion 1.8. This norm is finite because the derivatives ∂αu are continuous ...
  14. [14]
    [PDF] Multivariate Newton Interpolation in Downward Closed Spaces ...
    Apr 24, 2025 · Consider the multi-index sets Am,n,p = {α ∈ Nm : ∥α∥p ≤ n} ⊆ Nm of bounded lp-norm and the induced polynomial spaces Πm,n,p = span{xα ...Missing: l_p | Show results with:l_p
  15. [15]
    [PDF] PARTIAL DIFFERENTIAL EQUATIONS:AFIRST COURSE
    The multi-index notation is very convenient to denote polynomials in higher dimen- ... This is also convenient to denote higher order partial derivatives in ...
  16. [16]
    [PDF] ADVANCED CALCULUS - UW Math Department
    ... Folland. Department of Mathematics. University of Washington. Seattle, WA 98195 ... Rudin [19].1. At the outset, let us review some standard notation and ...
  17. [17]
    [PDF] Some notes related to partial differential equations
    ... product rule for partial derivatives. Let U be a nonempty open subset of Rn, and let N, ˜N be nonnegative integers. Suppose that for each multi-index α with ...