Combination

In mathematics, particularly within the field of combinatorics, a combination is defined as an unordered selection of r elements from a set, equivalent to choosing a subset of that size without regard to the arrangement of the selected items.^[1] Unlike permutations, where the order of selection matters, combinations focus solely on the identity of the chosen elements, making them essential for counting scenarios where sequence is irrelevant, such as forming committees from a group of people.^[2] The number of combinations of n distinct objects taken k at a time is calculated using the binomial coefficient, denoted \binom{n}{k} or C(n,k), with the formula \frac{n!}{k!(n-k)!}, where n! represents the factorial of n (the product of all positive integers up to n).^[3] Combinations are a cornerstone of enumerative combinatorics, the branch of mathematics dedicated to counting discrete structures, and they underpin the binomial theorem, which expands expressions like (x + y)^n as a sum of terms involving these coefficients.^[4] This theorem, with roots tracing back to ancient mathematicians but formalized in the 17th century by figures like Isaac Newton and Gottfried Wilhelm Leibniz, highlights the algebraic significance of combinations in generating functions and polynomial expansions.^[5] Historically, the systematic study of combinations emerged in the late 16th and early 17th centuries amid developments in probability theory, where mathematicians such as Blaise Pascal and Jacob Bernoulli applied combinatorial counting to games of chance and statistical problems, laying the groundwork for modern applications.^[6] Beyond pure mathematics, combinations find wide-ranging applications across disciplines; in probability and statistics, they determine the total number of possible outcomes in events like drawing cards in poker (e.g., the 3,744 possible full house hands in 5-card poker from a standard deck) or calculating hypergeometric distributions for sampling without replacement.^[7] In computer science, they model algorithm complexity, such as counting subsets in database queries or optimizing search problems in artificial intelligence.^[4] Other practical uses include applications in biology for genetic pairings or chemistry for molecular configurations, and even in security devices like combination locks, which despite the name rely on ordered sequences of digits (permutations); for a 40-number dial lock with 3 distinct numbers entered in order, there are 40 × 39 × 38 = 59,280 possible combinations.^[8] These applications underscore the versatility of combinations in solving real-world enumeration challenges, from engineering networks to economic modeling.^[4]

Fundamentals

Definition

In mathematics, a combination refers to the selection of k distinct objects from a set of n objects, where the order of selection is irrelevant, making it equivalent to choosing a subset of size k from the original set. For instance, selecting 2 fruits from the set \{ \text{apple}, \text{banana}, \text{cherry} \} yields the combinations \{ \text{apple}, \text{banana} \}, \{ \text{apple}, \text{cherry} \}, and \{ \text{banana}, \text{cherry} \}, regardless of the sequence in which they are picked.^[9] This concept builds on foundational ideas from set theory, where a set is a collection of distinct elements and a subset is any collection formed by selecting some or all of those elements.^[10] A key case is the empty combination, corresponding to k=0, which is the empty set \emptyset; there is exactly one way to choose no objects from any set, emphasizing that the empty subset is a valid combination.^[10] The term "combination" derives from the Latin combinatio, meaning a joining or uniting.^[11] It entered mathematical usage in the 17th century as part of the emerging field of combinatorics.^[12] Mathematicians like Blaise Pascal systematically explored combinations during this period, particularly in his 1654 Traité du triangle arithmétique, where they appeared in the context of probability problems and binomial expansions.^[13] Unlike permutations, where order matters, combinations treat selections as unordered collections.^[10]

Distinction from Permutations

A combination is a selection of items from a larger set where the order of selection does not matter, in contrast to a permutation, which is an arrangement where order is significant.^[10] Permutations count the number of ways to arrange k distinct objects from a set of n objects, given by the formula P(n,k) = \frac{n!}{(n-k)!}, which accounts for all possible sequences.^[10] For instance, if three people—A, B, and C—are to be seated in three distinct chairs, there are P(3,3) = 6 permutations, as each seating order (e.g., A-B-C versus B-A-C) is unique.^[8] In combinations, however, the same three people selected for a team yield only one outcome, since arrangements like A-B-C and C-B-A represent the identical group regardless of order.^[8] This distinction arises because each unique combination corresponds to exactly k! permutations, reflecting the number of ways to reorder the k selected items.^[10] Thus, combinations eliminate redundancy by effectively dividing the permutation count by k!, focusing solely on the subset chosen rather than its arrangement.^[10] A common misconception is treating sequences like "ABC" and "CBA" as distinct combinations when they are not, leading students to overcount by applying permutation logic to unordered selection problems.^[14] This error often stems from vague interpretations of whether "order matters," resulting in incorrect mixtures of permutation and combination approaches in scenarios where only selection is relevant.^[14]

Counting Formulas

Without Repetition

In combinatorics, combinations without repetition refer to the selection of k distinct items from a set of n distinct items, where the order of selection does not matter. This contrasts with permutations, where order is considered, providing the foundational basis for deriving the combination formula by adjusting for unordered selections. The number of such combinations is given by the binomial coefficient, denoted \binom{n}{k} or C(n,k), and calculated as:

\binom{n}{k} = \frac{n!}{k!(n-k)!}

for $0 \leq k \leq n, where n! represents the factorial of n, defined as the product of all positive integers up to n (with $0! = 1). This formula arises from the permutation count P(n,k) = \frac{n!}{(n-k)!}, which tallies the ordered selections of k items from n. Since each unordered combination corresponds to k! permutations (the ways to arrange the k selected items), dividing P(n,k) by k! yields the unordered count: \binom{n}{k} = \frac{P(n,k)}{k!}.^[15]^[16] Key properties of binomial coefficients include symmetry, \binom{n}{k} = \binom{n}{n-k}, which follows from the formula by substituting n-k for k, and boundary cases \binom{n}{0} = \binom{n}{n} = 1, reflecting the single way to choose nothing or everything from the set. These hold for all non-negative integers n and k where the expressions are defined.^[15] For computation, especially with large n, direct factorial evaluation risks overflow and inefficiency due to enormous intermediate values. Instead, use the multiplicative formula:

\binom{n}{k} = \prod_{i=1}^{k} \frac{n - k + i}{i},

which builds the value incrementally by multiplying k terms, reducing numerical issues and allowing early termination if needed. Alternatively, construct values via Pascal's triangle, where each entry is the sum of the two above it: \binom{n}{k} = \binom{n-1}{k-1} + \binom{n-1}{k}, enabling table-based computation in O(n^2) time for all coefficients up to n.^[17] The binomial coefficient and its tabular representation were systematically introduced by Blaise Pascal in his 1654 treatise Traité du triangle arithmétique, where he explored the arithmetic triangle's properties, including applications to probability and figurate numbers.^[18]

With Repetition

Combinations with repetition arise when selecting k items from n distinct types allows multiple selections of the same type, resulting in unordered collections where duplicates are permitted. This scenario is modeled using multisets, which are generalizations of sets that accommodate repeated elements by tracking multiplicities. A multiset is formally defined as a pair (X, f), where X is an underlying set and f: X \to \mathbb{N} (with \mathbb{N} denoting positive integers) assigns the multiplicity f(x) to each element x \in X. For instance, the multiset \{ \text{apple}, \text{apple}, \text{banana} \} has X = \{ \text{apple}, \text{banana} \} and f(\text{apple}) = 2, f(\text{banana}) = 1.^[19] The number of such k-multisets from n types is given by the formula \binom{n + k - 1}{k}, also expressible as \binom{n + k - 1}{n - 1}. This counts the non-negative integer solutions to x_1 + x_2 + \cdots + x_n = k, where x_i represents the multiplicity of the i-th type.^[20]^[21] Unlike combinations without repetition, which restrict to distinct items and yield \binom{n}{k}, this allows for repetitions, leading to larger counts for k > 1.^[20] The formula derives from the stars and bars theorem, a combinatorial technique visualizing the problem as arranging k indistinguishable stars (representing selections) and n-1 bars (dividing the stars into n groups for each type). This creates a sequence of n + k - 1 positions, where choosing n-1 positions for the bars (or equivalently k for the stars) determines the multiplicities. The total arrangements equal \binom{n + k - 1}{n - 1}, as each valid placement corresponds to a unique multiset. For example, selecting 3 fruits from types {apple, banana, cherry} yields \binom{3 + 3 - 1}{3} = 10 multisets, including {apple, apple, apple} and {apple, banana, cherry}.^[20]^[21] This counting method finds application in distributing indistinguishable objects into distinct bins, such as allocating resources where types are identical but recipients differ, though detailed uses appear in broader combinatorial contexts.^[20]

Enumeration Techniques

Basic Examples

A fundamental example of combinations involves selecting 3 books from a set of 5 distinct books, labeled A, B, C, D, and E, where the order of selection does not matter. The possible combinations are: ABC, ABD, ABE, ACD, ACE, ADE, BCD, BCE, BDE, and CDE, resulting in a total of 10 unique selections.^[22] In a real-world scenario, consider a small company with 4 employees—Alice, Bob, Charlie, and Dana—needing to select a committee of 2 members. Manual enumeration yields the following combinations: Alice-Bob, Alice-Charlie, Alice-Dana, Bob-Charlie, Bob-Dana, and Charlie-Dana, for a total of 6 possibilities. This approach highlights how combinations account for unordered groups in practical decision-making, such as team assignments.^[23] A common error in counting arises when overlooking that order is irrelevant, leading to double-counting. For instance, in the handshake problem with 4 people at a gathering—W, X, Y, Z—each pair shakes hands exactly once, without regard to who initiates. The unique pairs are: W-X, W-Y, W-Z, X-Y, X-Z, Y-Z, totaling 6 handshakes; treating sequences like W-X and X-W as distinct would incorrectly double the count.^[24] To aid understanding, the combination values for selecting k items from 4 (denoted C(4,k)) form a symmetric sequence, as shown in the table below:

k	C(4,k)
0	1
1	4
2	6
3	4
4	1

These values can be verified using the combination formula from standard counting principles.

Generating All Combinations

Generating all k-combinations from a set of n elements can be achieved through systematic algorithmic approaches that ensure completeness and efficiency. One widely used method is lexicographic ordering, which produces combinations in a sorted sequence based on the numerical or alphabetical order of elements. For instance, the 2-combinations of the set {1, 2, 3, 4} are generated as (1,2), (1,3), (1,4), (2,3), (2,4), (3,4), where each subsequent combination is the next in dictionary order. This approach, detailed as Algorithm L in Knuth's seminal work on combinatorial algorithms, allows for straightforward implementation by advancing through positions like odometer digits, incrementing the rightmost possible index while maintaining the combination size.^[25]^[26] Another common technique is the recursive method, which builds combinations by deciding for each element whether to include or exclude it, subject to reaching exactly k selections. Starting from the first element, the algorithm branches into including it (and recursing on the remaining elements for k-1 more) or excluding it (and recursing for k from the rest), using backtracking to explore all paths without repetition. This depth-first strategy, a form of backtracking, is particularly suitable for generating combinations on-the-fly and is covered extensively in standard texts on combinatorial generation.^[27] The computational complexity of these methods is dominated by the output size, requiring Θ(C(n,k)) time in total to generate all combinations, where C(n,k) denotes the total number of k-combinations from n elements; efficient implementations achieve amortized O(1) time per combination using techniques like revolving doors or minimal change updates between successive outputs.^[27] In programming practice, standard libraries provide optimized iterators for this purpose, such as Python's itertools.combinations, which yields combinations in lexicographic order without storing them all in memory, enabling lazy evaluation for large n and k.^[28]

Generalizations

Across All Subset Sizes

The total number of subsets of a finite set with n elements is given by the sum of the binomial coefficients over all possible subset sizes k, from k = 0 to k = n:

\sum_{k=0}^{n} \binom{n}{k} = 2^n.

This identity follows directly from the binomial theorem, which expands (x + y)^n = \sum_{k=0}^{n} \binom{n}{k} x^{n-k} y^k; substituting x = 1 and y = 1 yields the result.^[15] The inclusion of the k = 0 term accounts for the empty set as a valid subset, ensuring the sum encompasses all possibilities.^[15] Combinatorially, this sum equals $2^n because each of the n elements in the set can independently be either included or excluded from a subset, yielding $2 binary choices per element and thus $2^n total subsets overall.^[29] The power set \mathcal{P}(S) of a set S with |S| = n is precisely the collection of all such subsets, so |\mathcal{P}(S)| = 2^n.^[29] For example, consider a set S = \{a, b, c\} with n = 3. The subsets are: \emptyset, \{a\}, \{b\}, \{c\}, \{a, b\}, \{a, c\}, \{b, c\}, and \{a, b, c\}, totaling 8, which matches $2^3 = 8. The binomial coefficients verify this: \binom{3}{0} + \binom{3}{1} + \binom{3}{2} + \binom{3}{3} = 1 + 3 + 3 + 1 = 8.^[15]

Partitioning into Bins

In combinatorics, partitioning distinct objects into bins extends the concept of combinations by considering how to distribute n distinct objects into k bins such that each bin receives a non-empty subset, forming a partition of the set. When the bins are unlabeled (indistinguishable), the number of ways to do so is given by the Stirling number of the second kind, denoted S(n,k), which counts the partitions of an n-element set into k non-empty unlabeled subsets.^[30] If the bins are labeled (distinct), each such partition can be assigned to the bins in k! ways, yielding a total of k! \, S(n,k) distributions.^[31] This partitioning relates directly to combinations through the lens of surjective (onto) functions: each distribution corresponds to a surjective function from the set of n objects to the set of k bins, where the preimage of each bin forms a non-empty combination (subset) of objects.^[32] The formula k! \, S(n,k) thus enumerates these surjections, ensuring no bin is empty by excluding non-surjective mappings. The explicit expression for the Stirling number is

S(n,k) = \frac{1}{k!} \sum_{j=0}^{k} (-1)^{k-j} \binom{k}{j} j^n,

derived via inclusion-exclusion to count the valid partitions.^[30] For illustration, consider partitioning 3 distinct balls (labeled A, B, C) into 2 distinct non-empty boxes. The possible distributions are: {A} in box 1 and {B,C} in box 2; {B} in box 1 and {A,C} in box 2; {C} in box 1 and {A,B} in box 2; and the reverses ({B,C} in box 1 and {A} in box 2, etc.), totaling 6 ways, computed as $2! \, S(3,2) = 2 \times 3 = 6.^[31] This contrasts with cases involving indistinguishable objects, where the count simplifies to partitions of the integer n into k positive parts, yielding fewer distinct configurations.^[33]

Applications

In Probability

In probability theory, combinations play a fundamental role in calculating probabilities for scenarios involving uniform random selections without replacement, where each possible subset of a fixed size is equally likely. This arises in sampling from finite populations, such as drawing items from a deck or urn, ensuring that the total number of ways to choose the sample serves as the denominator in probability computations.^[34] A key application is the hypergeometric distribution, which models the probability of observing exactly k successes in n draws without replacement from a population of size N containing K successes. The probability mass function is given by

P(K = k) = \frac{\dbinom{K}{k} \dbinom{N - K}{n - k}}{\dbinom{N}{n}},

where \dbinom{N}{n} represents the total number of ways to choose n items from N, and the numerator counts the favorable combinations of k successes and n - k failures. This distribution is particularly useful in finite population sampling, contrasting with the binomial distribution used for with-replacement scenarios.^[34] For instance, consider drawing 5 cards from a standard 52-card deck, where there are 13 hearts (successes) and 39 non-hearts (failures), with N = 52, K = 13, and n = 5. The probability of exactly 3 hearts is

P(K = 3) = \frac{\dbinom{13}{3} \dbinom{39}{2}}{\dbinom{52}{5}} \approx 0.0815,

illustrating how combinations quantify the likelihood of specific compositions in random hands. This example underscores the hypergeometric model's reliance on combinatorial counting to normalize probabilities over all possible outcomes.^[34]^[35] Combinations also underpin the uniform random combination model, where a k-subset is selected uniformly at random from a set of size n, making each possible k-subset equally likely with probability $1 / \dbinom{n}{k}. This uniform distribution over the collection of all k-subsets forms the basis for many probabilistic analyses in combinatorics.^[36] Furthermore, the law of total probability can incorporate combinations by conditioning on subset sizes, allowing decomposition of complex events into manageable parts based on the number of elements in intermediate subsets. For example, to find the probability of an event like a match in a random assignment, one conditions on the size k of a relevant subset and sums over possible k, weighting by the uniform probabilities derived from combinations. This approach leverages combinatorial uniformity to simplify conditional computations.^[37]

In Generating Functions

Generating functions provide a powerful algebraic framework for encoding and manipulating sequences of binomial coefficients arising from combinations. The binomial theorem establishes the foundational connection, stating that for nonnegative integer n and variables x and y,

(x + y)^n = \sum_{k=0}^n \binom{n}{k} x^{n-k} y^k,

where \binom{n}{k} denotes the number of combinations of n items taken k at a time. This expansion directly interprets the coefficients as combinatorial counts: each term \binom{n}{k} x^{n-k} y^k arises from selecting k factors of y and n-k factors of x when multiplying n copies of (x + y), with the coefficient giving the number of such selections. A combinatorial proof follows from this interpretation, confirming the theorem without algebraic manipulation.^[38] The ordinary generating function for the sequence of binomial coefficients \binom{n}{k} with fixed n and varying k is precisely (1 + x)^n = \sum_{k=0}^n \binom{n}{k} x^k. This function encapsulates the total number of subsets of an n-element set when x = 1, yielding $2^n, but more broadly enables operations like differentiation or composition to derive further identities involving combinations. An inductive proof of the generating function proceeds by verifying the base case n=0 (where (1 + x)^0 = 1 = \binom{0}{0}) and assuming it holds for n, then using the Pascal identity \binom{n+1}{k} = \binom{n}{k} + \binom{n}{k-1} to show (1 + x)^{n+1} = (1 + x)^n + x(1 + x)^n, matching the expanded sum. For illustration, consider n=3 and x=2: the expansion (1 + 2)^3 = 27 decomposes as \binom{3}{0} \cdot 1^3 \cdot 2^0 + \binom{3}{1} \cdot 1^2 \cdot 2^1 + \binom{3}{2} \cdot 1 \cdot 2^2 + \binom{3}{3} \cdot 2^3 = 1 + 6 + 12 + 8, aligning with the coefficients.^[15] This framework extends to the multinomial theorem, generalizing combinations to partitions of n items into m labeled groups of sizes n_1, n_2, \dots, n_m where \sum n_i = n. The theorem asserts

(x_1 + x_2 + \dots + x_m)^n = \sum_{n_1 + \dots + n_m = n} \frac{n!}{n_1! n_2! \dots n_m!} x_1^{n_1} x_2^{n_2} \dots x_m^{n_m},

with the multinomial coefficient \frac{n!}{n_1! \dots n_m!} counting the ways to divide n distinct items into such groups. The corresponding generating function (x_1 + \dots + x_m)^n thus encodes these partitioned combinations, facilitating analysis of multisets or colored subsets. Historically, Leonhard Euler advanced the use of such series expansions in the 18th century, particularly in his Introductio in analysin infinitorum (1748), where he explored infinite series involving binomial coefficients to develop early generating function techniques for analytic continuation and interpolation. Euler's contributions, including applications to trigonometric series, laid groundwork for modern enumerative combinatorics.^[39]^[40]