Fact-checked by Grok 2 weeks ago

Multiset

In , a multiset (also known as a or mset) is a of a set that permits multiple instances of , where each has an associated multiplicity or indicating how many times it appears. Unlike standard sets, which enforce uniqueness and disregard order, multisets emphasize the count of occurrences while remaining unordered collections. Formally, a multiset can be constructed as a pair consisting of a base set of distinct elements and a function assigning a non-negative integer multiplicity to each element, such as the multiset represented by {a: 2, b: 3, d: 1}, meaning a appears twice, b three times, and d once. Common operations on multisets include the union, which takes the maximum multiplicity for each element across the two multisets; the intersection, which takes the minimum multiplicity; and the sum (or multiset addition), which adds the multiplicities element-wise. These operations extend set-theoretic concepts to handle repetitions systematically, enabling comparisons and manipulations based on cardinality, where the total cardinality of a multiset is the sum of all multiplicities. The foundational development of multiset theory occurred in the late , with Wayne D. Blizard establishing a rigorous axiomatic framework in 1989 that defines multisets as structures allowing finite repetitions of elements within a setting. Blizard's subsequent 1991 survey traces the evolution of multiset theory from early informal uses in and to a formalized branch of , highlighting its distinctions from related concepts like fuzzy sets or sequences. Multisets play a crucial role in , where they model selections with repetition, such as the number of multisets of size k from n types given by the \binom{n+k-1}{k}, often called "stars and bars." In , they underpin data structures like bags or multisets in libraries (e.g., C++'s std::multiset) for efficient storage and querying of elements with duplicates, and they appear in algorithms for , such as multiset intersection protocols. Further applications include for knowledge representation, software verification with constraints, and in and , where multisets facilitate frequency-based similarity measures like the over repetitions.

Fundamentals

Definition

In mathematics, a multiset is a generalization of the concept of a set that allows multiple instances, or multiplicities, of elements, thereby distinguishing it from standard sets where elements are unique. Formally, a multiset M is defined as an ordered pair (X, \mu), where X is a set (often called the underlying set or the domain) and \mu: X \to \mathbb{N}_0 is a multiplicity function that assigns to each element x \in X a non-negative integer \mu(x) representing the number of times x occurs in the multiset. This definition assumes familiarity with basic set theory, including the notions of sets and functions. Alternative formalizations of a multiset include representing it as a set in which elements may appear more than once, such as \{a, a, b\}, where repetitions are explicitly listed; however, this notation is informal and less suitable for infinite or abstract contexts. Another equivalent approach specifies a multiset via its support—the subset of elements with positive multiplicity—and the corresponding multiplicity map restricted to that support, ensuring \mu(x) > 0 only for x in the support. In a more general transfinite setting, as introduced by Richard Rado, a multiset can be viewed as a function from a set to the class of cardinal numbers, with the domain consisting of elements assigned nonzero cardinals. The of a multiset M = (X, \mu) is defined as the of the multiplicities over its underlying set, given by |M| = \sum_{x \in X} \mu(x), which counts the total number of occurrences, including repetitions. The empty multiset, denoted \emptyset or $0, is the pair (\emptyset, \mu) where the underlying set is empty (equivalently, \mu(x) = 0 for all x in some ), and it has 0. A multiset consists of an underlying set with one x and \mu(x) = 1, yielding 1.

Examples

A shopping containing two apples and one exemplifies a multiset, which can be denoted as {apple, apple, } to indicate repetitions or more compactly as {apple: 2, : 1}, preserving the multiplicity of each item. In comparison, viewing the same through the lens of a standard set yields {apple, }, which eliminates multiplicity information and thus fails to capture the full contents. A of cards, particularly when multiple decks are combined in games allowing duplicates of suits and ranks, represents a multiset where identical cards coexist without regard to order. In , the prime of an provides a clear example: the number 12 decomposes into the multiset {2, 2, 3}, reflecting the repeated prime factor 2. Multisets are commonly notated by listing elements with repetitions, such as {a, a, b}, or by specifying multiplicities, as in {a: 2, b: 1} to denote two instances of a and one of b. The of a multiset, or its total size, sums the multiplicities of its elements; for instance, the multiset {1: 3, 2: 1} has 4.

Properties and Operations

Basic Properties

A multiset M over a base set X is defined by its multiplicity \mu_M: X \to \mathbb{N}_0, where \mathbb{N}_0 denotes the non-negative integers, and the multiplicity \mu_M(x) indicates the number of occurrences of each element x \in X. Two multisets M and N over the same base set are equal, denoted M = N, their multiplicity functions agree for every element, that is, \mu_M(x) = \mu_N(x) for all x \in X. This equality condition directly follows from the definition of multisets via multiplicity functions, as any difference in multiplicities would distinguish the collections. The relation for multisets generalizes set inclusion to account for multiplicities. A multiset M is a submultiset of another multiset N, denoted M \subseteq N, if \mu_M(x) \leq \mu_N(x) for every x \in X. This relation is reflexive (M \subseteq M) and transitive (M \subseteq N and N \subseteq P imply M \subseteq P), forming a partial order on the collection of multisets over X. A proper submultiset holds when the inequality is strict for at least one x, ensuring M \neq N. To verify , one compares multiplicities element-wise; if all \mu_M(x) \leq \mu_N(x) and equality holds everywhere, then M = N, otherwise M is a proper submultiset if the condition holds without equality. The support of a multiset M, denoted \operatorname{supp}(M), is the underlying set consisting of elements with positive multiplicity: \operatorname{supp}(M) = \{ x \in X \mid \mu_M(x) > 0 \}. This support forms an ordinary set without repetitions and determines the distinct elements present in M, relating directly to the base set X as a where multiplicities vanish outside \operatorname{supp}(M). For submultisets, \operatorname{supp}(M) \subseteq \operatorname{supp}(N) if M \subseteq N, since zero multiplicity in M is compatible with positive in N, but the converse does not hold without multiplicity checks. When the base set X is equipped with a \leq, an ordered multiset inherits this order on its elements, allowing identification of based on the order of distinct items in the . The maximal element of M is the largest x \in \operatorname{supp}(M) under \leq, regardless of multiplicity, and similarly the minimal element is the smallest such x. In this totally ordered context, these extremal elements coincide with the greatest and least elements of the , providing endpoints for the ordered structure.

Arithmetic Operations

Multisets support a variety of operations that extend their set-theoretic counterparts by accounting for multiplicities, enabling the of new multisets from existing ones. These operations are defined via the multiplicity \mu: U \to \mathbb{N}_0, where U is the universal set and \mathbb{N}_0 denotes non-negative integers, ensuring that the resulting multiplicity for each is computed based on the input multiplicities. Such operations are fundamental in and for modeling collections with repetitions, like bags in databases or weighted selections. The union of two multisets M and N, denoted M \cup N, is the multiset whose multiplicity function is given by \mu_{M \cup N}(x) = \max(\mu_M(x), \mu_N(x)) for all x \in U. This operation retains the highest occurrence count for each element, analogous to set union but preserving multiplicities. For example, if M = \{a: 2, b: 1\} and N = \{a: 1, c: 3\}, then M \cup N = \{a: 2, b: 1, c: 3\}. Union is commutative, as \max(\mu_M(x), \mu_N(x)) = \max(\mu_N(x), \mu_M(x)); associative, since \max(\max(a, b), c) = \max(a, \max(b, c)) for non-negative integers a, b, c; and idempotent, with M \cup M = M. The intersection M \cap N takes the minimum multiplicity: \mu_{M \cap N}(x) = \min(\mu_M(x), \mu_N(x)) for all x \in U, including only elements common to both with their overlapping counts. Using the prior example, M \cap N = \{a: 1\}. Intersection shares the commutativity and associativity of union, via \min(\mu_M(x), \mu_N(x)) = \min(\mu_N(x), \mu_M(x)) and \min(\min(a, b), c) = \min(a, \min(b, c)), and is idempotent since M \cap M = M. The sum, or disjoint union, M + N adds multiplicities directly: \mu_{M + N}(x) = \mu_M(x) + \mu_N(x) for all x \in U, useful in combinatorial enumeration for combining independent selections. For the example multisets, M + N = \{a: 3, b: 1, c: 3\}. This operation is commutative, as addition of non-negative integers is commutative, and associative, since (a + b) + c = a + (b + c). The difference M - N subtracts multiplicities, flooring at zero: \mu_{M - N}(x) = \max(\mu_M(x) - \mu_N(x), 0) for all x \in U, removing elements from N as much as possible from M. In the example, M - N = \{a: 1, b: 1\}, but N - M = \{c: 3\}, highlighting its non-commutativity. Unlike the others, difference lacks general associativity or idempotence. The M \times N extends pairs across elements, with multiplicity as the product: \mu_{M \times N}((x, y)) = \mu_M(x) \cdot \mu_N(y) for all x, y \in U, forming a multiset over U \times U. For M = \{a: 2\} and N = \{b: 1, c: 2\}, M \times N includes (a, b) with multiplicity 2 and (a, c) with multiplicity 4. This operation is commutative in the sense that M \times N = N \times M up to relabeling of pairs and is associative when extended to multiple factors.

Enumeration

Recurrence Relations

The number of multisets of cardinality k that can be formed from a base set of n distinct types, often denoted b(n, k), counts the combinations where repetitions of types are permitted and the order of elements is irrelevant. A recursive approach computes b(n, k) via the relation b(n, k) = b(n-1, k) + b(n, k-1) for n \geq 1 and k \geq 1, with base cases b(n, 0) = 1 for all n \geq 0 (corresponding to the single empty multiset), b(0, 0) = 1, and b(0, k) = 0 for k > 0 (as no positive-sized multiset is possible without types). This recurrence derives from partitioning multisets based on the highest type label, assuming types are labeled $1 through n. Multisets excluding type n match those of size k from n-1 types, yielding b(n-1, k) cases. Multisets including at least one type n map bijectively to those of size k-1 from n types by removing one instance of n, yielding b(n, k-1) cases. The then combines these disjoint cases. For illustration, the table below shows computed values of b(n, k) for small n and k, obtained by applying the recurrence iteratively from the base cases:
n \setminus k01234
010000
111111
212345
31361015
These values demonstrate the step-by-step buildup, such as b(3, 2) = b(2, 2) + b(3, 1) = 3 + 3 = 6. As a non-recursive intuition, the recurrence aligns with the stars and bars method, visualizing k indistinguishable elements (stars) distributed among n types via n-1 dividers (bars), though the recursive formulation emphasizes iterative construction over direct combinatorial placement.

Generating Functions

Generating functions provide a powerful tool for enumerating multisets by encoding the number of such objects of various sizes into a . For multisets formed from n distinct types, the ordinary arises from the independent choice of multiplicity for each type, where the multiplicity m of a type contributes the \sum_{m=0}^{\infty} x^m = \frac{1}{1-x}. Thus, the overall is the product \prod_{i=1}^n \frac{1}{1-x} = \frac{1}{(1-x)^n}, which counts multisets of any size by the exponent of x. This product form reflects the independence of choices across types, as each factor corresponds to one type and the coefficient of x^k sums over all ways to distribute k indistinguishable elements into n types allowing repetitions. The coefficient of x^k in \frac{1}{(1-x)^n}, denoted [x^k] \frac{1}{(1-x)^n}, gives the number of k-element multisets from n types, which equals the binomial coefficient \binom{n+k-1}{k}. This follows from the generalized , where (1-x)^{-n} = \sum_{k=0}^{\infty} \binom{n+k-1}{k} x^k. For unrestricted multiset size, the full series \sum_{k=0}^{\infty} b(n,k) x^k = \frac{1}{(1-x)^n}, with b(n,k) denoting the number of k-multisets, provides a compact representation for problems. In the multivariate case, when types are labeled and multiplicities tracked separately, the generating function becomes \prod_{i=1}^n \sum_{m=0}^{\infty} x_i^m = \prod_{i=1}^n \frac{1}{1 - x_i}. Here, the of x_1^{a_1} \cdots x_n^{a_n} is 1 if each a_i \geq 0, corresponding to the unique multiset with those multiplicities. This form extends the univariate case and is useful for distinguishing types in applications like partition enumeration. For example, consider integer partitions, which can be viewed as multisets of positive integers. The generating function for unrestricted partitions (allowing repeated parts) is \prod_{i=1}^{\infty} \frac{1}{1 - x^i}, reflecting unlimited repetitions of each part size. In contrast, partitions into distinct parts (equivalent to sets, not multisets) have generating function \prod_{i=1}^{\infty} (1 + x^i), where each part appears at most once, leading to different coefficient growth rates.

Connections to Binomial Expansions

The enumeration of multisets is intimately connected to the generalized binomial theorem, particularly through expansions involving negative exponents. The number of multisets of cardinality k chosen from n distinct elements is given by the binomial coefficient \binom{n + k - 1}{k}, which arises as the coefficient in the series expansion (1 - x)^{-n} = \sum_{k=0}^{\infty} \binom{n + k - 1}{k} x^k. This identity follows from the generalized binomial theorem, where the binomial coefficient for negative upper index is \binom{-n}{k} = (-1)^k \binom{n + k - 1}{k}, linking the algebraic expansion directly to multiset counting upon accounting for the sign alternation. The negative binomial series (1 - x)^{-n} converges for |x| < 1, providing an infinite power series whose coefficients precisely enumerate unrestricted multisets of varying sizes from n types. For instance, the coefficient of x^k counts the ways to select k elements with repetition allowed from n options, equivalent to the number of non-negative integer solutions to x_1 + \cdots + x_n = k. A combinatorial proof of this coefficient uses the stars and bars theorem: represent the k selected elements as k stars (*) and the n-1 dividers between the n types as bars (| ), forming a sequence of n + k - 1 positions where n-1 are chosen for bars, yielding \binom{n + k - 1}{n-1} = \binom{n + k - 1}{k} arrangements. This visualization maps directly to distributing indistinguishable selections into distinct bins, confirming the multiset count. For generalizations, when multiplicities are weighted or bounded, the exponent in the binomial expansion can be adjusted to reflect the structure, such as using fractional or variable exponents in the series to model weighted selections. An example of the negative upper index connection is that the number of multisets of size k from n elements equals (-1)^k \binom{-n}{k}, illustrating how the combinatorial interpretation "negates" the usual subset selection to allow repetitions. The hockey-stick identity further relates multiset counts by summing them: \sum_{k=0}^{m} \binom{n + k - 1}{k} = \binom{n + m}{m}, which enumerates all multisets of cardinality at most m from n elements, as the partial sum of the negative binomial series coefficients. This identity provides a closed form for cumulative multiset enumerations, bridging individual counts to total selections up to a limit.

Applications

Combinatorics and Algebra

In partition theory, multisets provide a natural framework for representing integer partitions, where the parts of the partition correspond to the elements of the multiset, allowing repetitions to indicate multiplicity in the summands. This perspective is particularly useful for typed or colored partitions, where distinct types distinguish otherwise identical parts, extending classical unrestricted partitions to more general combinatorial objects. For instance, a multiset partition algebra has been developed to handle such structures, generalizing the to insertions on multiset partitions. Multisets play a key role in the theory of symmetric functions by extending classical bases like power sums and elementary symmetric polynomials to account for multiplicities. The power sum symmetric functions p_k = \sum_i x_i^k and elementary symmetric polynomials e_k = \sum_{1 \leq i_1 < \cdots < i_k} x_{i_1} \cdots x_{i_k} can be generalized to multisets, where variables are weighted by their multiplicities, enabling the enumeration of symmetric objects with repeated labels. This extension aligns with , where multisets serve as the domain for functors mapping finite sets to combinatorial structures, yielding generating functions that capture multiplicity-adjusted symmetries. Seminal work in this area establishes operations like sum, product, and plethysm for symmetric species, directly corresponding to those on symmetric functions. In graph theory, multisets model the edges of multigraphs, where parallel edges between vertices form a multiset of edge incidences, allowing multiple connections without loops in simple cases. The degree sequence of a multigraph, defined as the nonincreasing list of vertex degrees, is inherently a multiset, as degrees can repeat across vertices, facilitating the study of graphical realizations and extremal properties. For example, realizing a given multiset as a degree sequence in a multigraph requires the sum of degrees to be even and no degree exceeding the possible connections. Combinatorial identities involving multisets often center on permutation counts, where the number of distinct permutations of a multiset with n elements and multiplicities k_1, k_2, \dots for distinct types is given by the multinomial coefficient \frac{n!}{k_1! k_2! \cdots}. This formula arises from dividing the total permutations of a set by the repetitions within each type, providing a core identity for counting arrangements with indistinguishable objects. Such identities underpin broader results, like those in the , and extend to colored multisets for multifactorial generalizations. In algebraic structures, multisets appear in commutative algebra through multiset ideals, which generalize monomial ideals by allowing multiplicities in generators, particularly in zero-dimensional cases where they locally resemble monomial ideals under . These ideals support efficient computations via algorithms like Cerlienco-Mureddu for lexicographic standard monomials. Additionally, in combinatorial species theory, multisets form the basis for species functors, as introduced by , enabling the algebraic manipulation of labeled structures through exponential generating functions that account for relabeling and multiplicity. Multisets are treated via extensions such as symmetric species, facilitating compositions like substitutions for complex enumerative problems. A specific application arises in necklace counting, where multisets encode color repetitions to determine distinct configurations under rotation. Bijections to multisets with divisible subset sums simplify enumeration for bounded colors. This approach equates necklaces with at most q colors to multisets of integers modulo n whose subset sums are divisible by n, providing an explicit combinatorial correspondence.

Computer Science and Data Structures

In computer science, multisets are implemented using data structures that efficiently handle duplicate elements and their multiplicities, enabling operations like insertion, deletion, and querying counts. Balanced binary search trees, such as , are commonly used for ordered multisets, as in the C++ standard library's std::multiset, which maintains elements in sorted order while allowing multiples of the same key. This structure supports logarithmic time complexity for key operations, making it suitable for scenarios requiring sorted access or range queries. Alternatively, hash maps provide an unordered implementation by associating each unique element with a counter for its multiplicity, as seen in libraries like Apache Commons Collections' HashMultiSet, which offers average-case constant-time insertions and lookups at the expense of order preservation. Algorithms for multiset operations, such as and , leverage the underlying structure for efficiency. For sorted multisets, and can be computed in linear time O(n + m) using merge-like procedures on sorted sequences, similar to those in C++'s <algorithm> header for set operations adapted to handle multiplicities by taking maximum or minimum counts, respectively. These algorithms find applications in with duplicates, where multisets preserve order and multiplicity during sorts, avoiding the need for auxiliary tracking structures. Arithmetic operations like multiset and , as defined mathematically, align with these implementations for practical computation. Programming languages provide built-in or library support for multisets through specialized classes. In , the collections.Counter class implements a multiset as a subclass, enabling frequency counting of hashable objects with methods like update() for adding elements and most_common() for retrieving elements by count. Similarly, Java's library offers a Multiset with implementations like HashMultiset and TreeMultiset, supporting operations such as count(), add(), and remove() to manage multiplicities efficiently. In databases, multisets model duplicate records naturally, with SQL's GROUP BY clause combined with COUNT() aggregating rows by key while tracking frequencies, as standardized in and implemented in systems like SQL Server. For example, querying SELECT column, COUNT(*) FROM table GROUP BY column produces a multiset of unique values and their occurrences, facilitating duplicate without explicit multiplicity storage. The time and space complexities of multiset operations vary by implementation. Tree-based multisets, like std::multiset, achieve O(log n) time for insert, delete, and search operations, with O(n) space for n elements, due to the balanced overhead. Hash-based versions offer amortized O(1) time for these operations but may degrade to O(n) in worst-case scenarios from hash collisions, still using O(n) space. As of 2025, modern applications include , where bag-of-words models in use CountVectorizer to represent documents as multisets of terms for frequency-based feature extraction in tasks like text classification. In search engines, term frequencies are handled as multisets to compute relevance scores, such as in TF-IDF weighting, powering inverted indexes in systems like .

Extensions

Finite Variations

A bounded multiset is a generalization of a standard multiset where the multiplicity function μ satisfies μ(x) ≤ m for each element x and some fixed positive integer m, restricting the number of occurrences of any single element. This constraint models scenarios in combinatorics and computer science where repetitions are limited, such as in resource allocation or extremal set theory analogs like intersecting families of k-multisets from ^m. The number of bounded multisets of cardinality k over an n-element universe is given by the coefficient of z^k in the generating function (1 + z + ⋯ + z^m)^n, which expands using the finite geometric series formula \frac{1 - z^{m+1}}{1 - z}. Counting such objects can also employ inclusion-exclusion principles to account for violations of the bound. An m-uniform multiset extends this idea by requiring exactly m occurrences for every element in its , meaning the support has size k and the total is mk for some k. These structures arise in the study of multiset permutations, where arrangements of such multisets exhibit symmetries analyzable via Gray codes or cyclic sieving phenomena. For instance, the multiset M(n, m) consists of m copies each of n distinct symbols, and its permutations form the set P(n, m) with \frac{(nm)!}{\prod_{i=1}^n m!}. Bounded multisets relate closely to combinatorial designs, particularly , where blocks are interpreted as multisets of points with multiplicity constraints reflecting replication numbers. In such representations, the collection of blocks forms a multiset itself, allowing repeated blocks, and the between blocks is computed as the sum of products of point multiplicities. Automorphisms preserve these multiplicities, ensuring the design's symmetry under point permutations that map blocks to equivalent multisets. This framework generalizes classical balanced incomplete designs to accommodate bounded repetitions, useful in statistical and applications. Properties of bounded multisets require adjustments to standard arithmetic operations to maintain the multiplicity bound. As an example, the bounded reformulates the input as a bounded multiset of item types, where each type i has multiplicity at most u_i (the availability bound), weights w_i, and values v_i. The goal is to select a submultiset S with total weight ∑{x ∈ S} w_x ≤ W (capacity) that maximizes ∑{x ∈ S} v_x, solvable via dynamic programming in O(n W ∑ u_i) time. This multiset perspective highlights the constraints on repetitions, distinguishing it from unbounded variants.

Infinite and Fuzzy Multisets

Infinite multisets extend the standard multiset framework to scenarios where the support—the set of elements with positive multiplicity—may be infinite. Formally, given a countable universe X, an infinite multiset is defined by a multiplicity function \mu: X \to \mathbb{N} \cup \{0, \infty\} (or more generally to cardinals), where the total cardinality \sum_{x \in X} \mu(x) may be infinite. Cases with finite support, where \mu(x) > 0 for only finitely many x, align with finite multisets but highlight the boundary with infinite structures. These constructions appear in measure theory, for instance, as discrete atomic measures where multiplicities represent point masses. A key property of infinite multisets is that the may be , with the m-cardinality providing a unified measure incorporating both the count of distinct elements and their multiplicities, yielding transfinite m-cardinals. In , infinite multisets can model local multiplicity in spaces with accumulation points, though operations like may not extend unambiguously. Counting infinite multisets poses challenges beyond finite cases, with cardinality addressed through the m-cardinality, a unified measure incorporating both the of distinct elements and their multiplicities, yielding transfinite m-cardinals strictly less than \aleph_0. Fuzzy multisets further generalize by replacing multiplicities with degrees of membership in the unit interval, defined as a \mu: X \to [0,1] for each element, capturing partial or vague repetitions beyond crisp counts. This formulation extends Zadeh's s—where membership degrees model uncertainty—by incorporating multiset-like repetition through aggregated or sequenced degrees within [0,1], as formalized by Yager. Basic operations follow fuzzy set conventions: intersection via pointwise minimum, \mu_{A \cap B}(x) = \min(\mu_A(x), \mu_B(x)), and union via maximum, \mu_{A \cup B}(x) = \max(\mu_A(x), \mu_B(x)), preserving Zadeh's extension principle for propagation under vagueness. For fuzzy multisets, properties include \alpha-cuts to convert to crisp multisets: for \alpha \in [0,1], the \alpha-cut [\mu]_\alpha is the multiset where each x has multiplicity equal to the number of membership degrees \geq \alpha, yielding nested crisp structures as \alpha increases. Inverse \alpha-cuts, defined as multiplicities for degrees < \alpha, are monotonic increasing and facilitate decomposition, though the ground set does not fully decompose into \alpha-cut and inverse pairs unlike in standard fuzzy sets. Convergence for infinite fuzzy multisets requires the integral or sum of degrees to be finite, ensuring bounded total membership akin to probability measures. These extensions tease applications in for handling vague collections, with fuller details in dedicated sections on uses. For example, in , a fuzzy multiset might represent a as a collection of terms where each term t has a relevance score \mu(t) \in [0,1] reflecting contextual importance or weighted frequency, enabling nuanced similarity computations.

Historical Context

Origins and Early Concepts

The concept of multisets, which permit multiple occurrences of elements unlike traditional sets, traces its implicit origins to early combinatorial problems involving repetitions. In the , the Indian mathematician addressed permutations and combinations with repetition in his treatise Lilavati, effectively employing multiset-like structures to count arrangements where identical items could recur, such as in poetic meters or selections from limited types. This approach anticipated formal multiset theory by treating repetitions as inherent to the collection rather than distinct entities. In the , Leonhard Euler's work on integer during the further exemplified implicit multiset usage as a precursor to 19th-century . Euler analyzed ways to express numbers as sums of positive integers, disregarding order but allowing repeated parts, as seen in his derivations for counts; this treated the parts as a multiset whose elements could repeat without violating the structure. Such explorations in highlighted the utility of repetitions, influencing later combinatorial methods. The formal distinction of multisets from standard sets emerged in the mid-20th century amid the development of axiomatic . The Bourbaki group's Théorie des Ensembles (starting 1939, English 1968) rigorously defined sets via , prohibiting duplicate elements and emphasizing unique membership, which underscored the need for a separate construct to handle multiplicities in applications like algebra and geometry. By the 1950s, N. G. de Bruijn utilized multiset concepts in on sequences, paving the way for formalization. then popularized the idea in his 1968 text , Volume 1, referring to multisets as "bags" in discussions of data structures and combinatorial algorithms, where repetitions were essential for unordered collections.

Key Developments and Contributors

The term "multiset" was coined by mathematician in the 1970s through private correspondence with , who subsequently popularized the concept and defined key operations on multisets in his seminal 1973 work, , Volume 3: Sorting and Searching. Knuth's formalization integrated multisets into algorithmic analysis, particularly for and problems, establishing them as a fundamental structure in . De Bruijn further advanced multiset theory in during the 1970s, developing generating functions to count multisets and their partitions. These methods linked multisets to broader combinatorial identities, influencing subsequent work on cycle indices and in algebra. In the late , Wayne D. Blizard established a rigorous axiomatic framework for multiset theory in , allowing finite repetitions of elements. Blizard's 1991 survey traced the evolution of multiset theory from early informal uses in and statistics to a formalized branch of , highlighting its distinctions from related concepts like fuzzy sets or sequences. The foundations for generalizations like fuzzy multisets were laid by Lotfi A. Zadeh's 1965 introduction of fuzzy sets, which allowed graded memberships and inspired extensions to handle multiplicities. formalized fuzzy multisets in 1986, defining them as structures where elements have multivalued fuzzy memberships, enabling applications in uncertain reasoning. In parallel, Zdzisław Pawlak's 1982 rough set theory incorporated multiset-like approximations for handling incomplete data, bridging multisets with granular computing in the 1980s. Modern developments include integrations of multisets into , such as multiset-enriched categories for modeling nondeterministic computations and the . Milestones encompass the formal inclusion of multisets as data types in ISO/IEC 15909-1 (2004) for high-level Petri nets, standardizing their use in .

References

  1. [1]
    [2110.12902] An Introduction to Multisets - arXiv
    Oct 21, 2021 · Multisets are sets that allow repetition of elements. As such, multisets pave the way to a number of interesting possibilities of theoretical and applied ...
  2. [2]
    [PDF] Lecture 1.5: Multisets and multichoosing
    A set with repetition is called a multiset. M. Macauley (Clemson). Lecture 1.5: Multisets and multichoosing. Discrete Mathematical Structures. 2 / 1 ...
  3. [3]
    Sets and MultiSets
    Oct 23, 2023 · For a multiset, this means that instead of just asking “is K in this set?” we can now ask “how many K's are in this set?”. For a multimap, ...
  4. [4]
    Multiset theory. - Project Euclid
    Multiset theory. Wayne D. Blizard. Download PDF + Save to my library. Notre Dame J. Formal Logic 30(1): 36-66 (Winter 1989).
  5. [5]
    The development of multiset theory - Project Euclid
    Blizard. "The development of multiset theory." Mod. Log. 1 (4) 319 - 352, Spring 1991. Information. Published: Spring 1991. First available in Project Euclid ...
  6. [6]
    [PDF] Secure Multiset Intersection Cardinality and its Application to ...
    Sep 1, 2016 · In this paper, we concentrate on comput- ing the Jaccard Coefficient over multisets (SJCM) [7] since it is a more general and practical problem ...
  7. [7]
    [PDF] Description Logics over Multisets
    Thus, from a practical point of view multisets are very useful structures as they arise in many areas of mathematics and computer science [8][9][19][22][23][27] ...
  8. [8]
    [PDF] Decision Procedures for Multisets with Cardinality Constraints
    Applications in software verification and interactive theorem proving often involve reasoning about sets of objects. Cardinality con- straints on such ...
  9. [9]
    THE DEVELOPMENT OF MULTISET THEORY * - Project Euclid
    135). Working within a set theory that admits classes, Rado defines a multiset to be any cardinal-valued function whose non-trivial domain (the collection of ...
  10. [10]
  11. [11]
    Multiset | Brilliant Math & Science Wiki
    A multiset (a.k.a. bag, mset) is a generalization of a set where repetition of elements matters.
  12. [12]
    Multiset -- from Wolfram MathWorld
    A set-like object in which order is ignored, but multiplicity is explicitly significant. Therefore, multisets {1,2,3} and {2,1,3} are equivalent.
  13. [13]
    Predicting Cards Using a Fuzzy Multiset Clustering of Decks
    Aug 18, 2020 · Especially the advantages of using fuzzy multisets instead of crisp multisets are highlighted based on some explanatory examples. We further ...
  14. [14]
    [PDF] Properties of Multisets
    Mar 8, 2019 · In this paper sub msets of a mset and sub msets of mset topological spaces are characterized. Definition 1.1: An mset M is a pair (X, C) where X ...Missing: equality | Show results with:equality
  15. [15]
    [PDF] A new look at multisets - School of Mathematics & Statistics | Science
    Oct 15, 2003 · Now let us define a fourth multiset D to consist of the third, second and first elements of X. Thus A = B = C = D = [2 1 1]. Since each ...
  16. [16]
    [PDF] Multisets | HAL
    May 13, 2022 · Abstract. Multisets are sets that allow repetition of elements, therefore accounting for their frequency, or multiplicity of observation.
  17. [17]
    [PDF] General relations between partially ordered multisets and their ...
    If a pomset has a least element, then this is also the minimal element, analogously the greatest element is the maximal element. In a linearly ordered mset the ...
  18. [18]
    [PDF] arXiv:2301.04635v2 [math.NT] 18 Jan 2023
    Jan 18, 2023 · Cartesian product: The Cartesian product A × B ∈ M(X × X) is defined as. µA×B((x1,x2)) = µA(x1)µB(x2). Difference: If A ⊆ B, the difference ...
  19. [19]
    [PDF] Enumerative Combinatorics Volume 1 second edition - Mathematics
    What is Enumerative Combinatorics? Enumerative combinatorics has undergone enormous development since the publication of the first edition of this book in 1986.
  20. [20]
    [PDF] chapter 2: counting finite sets, mainly by recursion - garsia at york
    The multisets of n with k elements which contain at least one n are in bijection with the multisets of n with k −1 elements (and hence there are n k − 1.<|control11|><|separator|>
  21. [21]
    [PDF] Notes on Partitions and their Generating Functions - UC Berkeley math
    To choose an arbitrary partition λ of unrestricted n, we can decide independently for each positive integer i how many times to include i as a part of λ.
  22. [22]
    Negative Binomial Series -- from Wolfram MathWorld
    The series which arises in the binomial theorem for negative integer -n ,. (x+a)^(-n), = sum_(k=0)^(infty)(-n; k). (1). = sum_(k=0)^(infty)(-1)^k.
  23. [23]
    7.2: The Generalized Binomial Theorem - Mathematics LibreTexts
    Jul 12, 2021 · We are going to present a generalised version of the special case of Theorem 3.3.1, the Binomial Theorem, in which the exponent is allowed to be negative.Missing: multiset | Show results with:multiset
  24. [24]
    Stars and Bars - Discrete Mathematics - An Open Introduction
    This requires stars and bars. Use a star to represent each of the 5 digits in the number, and use their position relative to the bars to say what numeral fills ...Missing: source paper
  25. [25]
    Hockey Stick Identity | Brilliant Math & Science Wiki
    The hockey stick identity is an identity regarding sums of binomial coefficients. The hockey stick identity gets its name by how it is represented in ...Missing: multiset | Show results with:multiset
  26. [26]
    [1905.02071] An insertion algorithm on multiset partitions ... - arXiv
    May 6, 2019 · Abstract:We generalize the Robinson-Schensted-Knuth algorithm to the insertion of two row arrays of multisets. This generalization leads to ...
  27. [27]
    [PDF] Chapter 1. Basic Graph Theory
    Dec 23, 2020 · A multigraph G is a pair of multisets (in a multiset, elements can be repeated more than once) (V,E) where V is nonempty, and E is a (possibly.
  28. [28]
    DLMF: §26.16 Multiset Permutations ‣ Properties ‣ Chapter 26 ...
    The number of elements in 𝔖 S is the multinomial coefficient (§26.4) ( a 1 + a 2 + ⋯ + a n a 1 , a 2 , … , a n ) .Missing: source | Show results with:source
  29. [29]
    [PDF] Gröbner theory of zero dimensional ideals
    Jan 2, 2014 · The next theorem claims that finite multiset ideals are exactly the zero dimensional ideals which locally look like monomial ideals. Theorem ...
  30. [30]
    [PDF] Introduction to the Theory of Species of Structures
    Nov 25, 2013 · This chapter contains the basic concepts of the combinatorial theory of species ... Méndez, Multisets and the Combinatorics of Symmetric Functions ...
  31. [31]
  32. [32]
    None
    ### Summary of Bounded Multisets from arXiv:2303.06647
  33. [33]
    [PDF] AC.pdf - Analytic Combinatorics
    Analytic combinatorics aims to enable precise quantitative predictions of the proper- ties of large combinatorial structures. The theory has emerged over ...<|separator|>
  34. [34]
    Combinations of multisets with finite multiplicities - MathOverflow
    Jul 25, 2010 · ... combinatorics, I had to ask it somewhere: What are the most ... mi denotes multiplicities of n different elements in the multiset):.An Operation on Multisets - MathOverflowProbability of unique elements in each of 'S' multisets sampled with ...More results from mathoverflow.netMissing: bounded | Show results with:bounded
  35. [35]
    [PDF] A Middle Levels Conjecture for Multiset Permutations with Uniform ...
    Dec 15, 2015 · There exists a star-transposition Gray code for the uniform-frequency multiset permutations. P(n, m) for all n, m ≥ 0. Conjecture 3 holds for n ...
  36. [36]
    [PDF] Block designs: Representations, automorphisms, duality
    May 30, 2003 · Each block is a multiset of points, and the blocks form a multiset. It is quite tricky to say what the plots are in this model; but, if the ...
  37. [37]
    [PDF] Bounded Multi-HyperSet Theory and Polynomial Computability
    Jul 25, 2006 · In general, we should take the maximum of multiplicities ni for repeating vi. Another known natural version of the union operation for multisets ...
  38. [38]
    A well-solvable special case of the bounded knapsack problem
    The bounded knapsack problem (BKP) is a variant of the classical knapsack problem in which every single item is available in a certain limited quantity. There ...
  39. [39]
  40. [40]
    [PDF] An Introduction to Measure Theory - Terry Tao
    In the fall of 2010, I taught an introductory one-quarter course on graduate real analysis, focusing in particular on the basics of mea- sure and integration ...Missing: multisets | Show results with:multisets
  41. [41]
  42. [42]
  43. [43]
    (PDF) An extension of the properties of inverse α-cuts to fuzzy multisets
    ### Summary of α-Cuts and Inverse α-Cuts in Fuzzy Multisets
  44. [44]
    Fuzzy Logic in Natural Language Processing – A Closer View
    The examples are chosen carefully to illustrate and demonstrate the applications of fuzzy logic in natural language processing environment for every reader.
  45. [45]
    [PDF] The theory of partitions - Jonathan Love
    Apr 13, 2023 · In 1740, french mathematician Philip Naude (1684-1747) raised the following question in a letter to Leonard Euler (1707-1783) [4].
  46. [46]
    Theory of Sets - N. Bourbaki - Google Books
    Oct 20, 2004 · This is a softcover reprint of the English translation of 1968 of N. Bourbaki's, Théorie des Ensembles (1970).Missing: distinction | Show results with:distinction
  47. [47]
    Combinatorial geometry in the plane : Hadwiger, Hugo
    Aug 3, 2019 · Publication date: 1964. Topics: Combinatorial geometry, Convex domains. Publisher: New York, Holt, Rinehart and Winston.Missing: multiset 1940s
  48. [48]
    The Art of Computer Programming (TAOCP)
    These books were named among the best twelve physical-science monographs of the century by American Scientist, along with: Dirac on quantum mechanics, Einstein ...Missing: multiset | Show results with:multiset
  49. [49]
    The art of computer programming, volume 3: (2nd ed.) sorting and ...
    The art of computer programming, volume 3: (2nd ed.) sorting and searchingJanuary 1998. Author: Author Picture Donald E. Knuth. Stanford University, CA.
  50. [50]
    Mathematics of Multisets | Request PDF - ResearchGate
    According to DeBruijin [2], the concept of multiset was introduced to D. E. Knuth by N. G. de Bruijn in a private message, and since then, the word has been ...
  51. [51]
    [PDF] Poset-valued sets or How to build models for Linear Logics
    May 2, 2002 · We describe a method for constructing models of linear logic based on the category of sets and relations. The resulting categories are.