Fact-checked by Grok 2 weeks ago

Composite data type

In computer science, a composite data type, also referred to as an aggregate or compound data type, is a data structure that combines multiple individual data items—typically primitive types such as integers or strings—into a single cohesive entity to represent more complex information.^[1]^[2] This grouping enables programmers to model real-world objects or relationships efficiently, contrasting with simple scalar types that hold a single value.^[3] Common examples of composite data types include arrays, which collect elements of the same type accessible via indices, often stored contiguously in memory for efficient access; records (or structures), which aggregate fields of potentially different types under named components; and unions, which overlay multiple types in shared memory space to optimize storage.^[1]^[3] More advanced forms, such as classes in object-oriented languages, extend records by incorporating methods and supporting encapsulation, while recursive types like linked lists allow self-referential structures for dynamic data organization.^[1]^[3] These types are implemented differently across programming languages—for instance, C uses structs and arrays directly, while higher-level languages like Python provide built-in support for lists and dictionaries as composites.^[1] Composite data types are fundamental to software design, facilitating modular code, data abstraction, and efficient memory management, as they allow operations on entire collections rather than individual elements.^[2] Their use underpins abstract data types (ADTs), where the logical interface is separated from physical implementation details, enhancing reusability and maintainability in programs.^[2] Key considerations in their design include type safety, memory layout (e.g., field offsets in records), and support for operations like slicing or variant forms to handle polymorphism.^[3]

Introduction

Definition

A composite data type is a data structure in computer science formed by combining multiple scalar or atomic data types—such as integers, floats, or booleans—to create more complex structures that aggregate related data elements into a single unit.^[4] This composition enables the representation of multifaceted entities beyond simple values, distinguishing composites from primitive types, which handle individual, indivisible data.^[5] Key characteristics of composite data types include heterogeneity, where components can be of differing primitive or composite types; variability in size, which may be fixed (as in structures) or dynamic (as in certain arrays); and support for structured access mechanisms, such as indexing or field selection, to retrieve or manipulate individual elements efficiently.^[4] These traits facilitate modular organization of data, enhancing code readability and maintainability in programming.^[6] A basic example is a 2D point representing spatial coordinates, composed of two integer fields: x and y, allowing the point to encapsulate both values as a cohesive entity.^[4] The concept of composite data types evolved from early computing practices and was formalized in the 1960s through languages like ALGOL 60, which advanced data representation via blocks and arrays, and PL/I, which introduced explicit structures for ordered collections of related items.^[6]^[7]

Historical Context

The origins of composite data types trace back to the 1950s, when low-level programming in assembly languages began supporting rudimentary forms of aggregation through hardware features like indexed addressing. This allowed programmers to treat sequences of memory locations as arrays by using index registers to modify addresses dynamically during execution. The IBM 704, introduced in 1954, was a pioneering machine in this regard, featuring three index registers that enabled efficient array manipulation for scientific computations, marking a shift from manual address calculations to more structured data handling in early stored-program computers.^[8] By the mid-1960s, high-level languages formalized these concepts into robust composite structures. PL/I, developed by IBM and released in 1964, was the first widely influential high-level language to incorporate comprehensive support for composites, including nested structures, multi-dimensional arrays, and qualified name access, blending features from FORTRAN's arrays and COBOL's records to serve scientific, commercial, and systems programming needs. ALGOL 68, finalized in 1968, further advanced the paradigm by introducing records as a core mode for defining structured types, proposed by C. A. R. Hoare in 1965–1966 to enable orthogonal combinations of fields with type safety and expressiveness.^[9] These developments were facilitated by evolving hardware, particularly the transition from word-addressable to byte-addressable memory in machines like the IBM System/360 (1964), which allowed flexible allocation of variable-sized components within composites without rigid word boundaries.^[10] In the 1970s, composite types gained broader adoption through systems programming languages. The C language, developed by Dennis Ritchie at Bell Labs starting in 1972, popularized structs as a mechanism for aggregating heterogeneous data fields with compile-time offsets, evolving from earlier B language influences to support pointer arithmetic and assignment, which became essential for Unix kernel development. The 1980s saw composites extend into object-oriented paradigms, with Smalltalk-80 (released 1980) treating classes as blueprints for objects that encapsulate both data fields and methods, building on Alan Kay's 1972 vision at Xerox PARC to model real-world entities as dynamic composites. Concurrently, Ada (standardized in 1983) introduced variant records—discriminated unions allowing conditional field inclusion—to address safety-critical systems, standardizing flexible subtypes within records for embedded and defense applications.^[11]

Primitive vs Composite

Primitive Data Types

Primitive data types are the fundamental building blocks in programming languages, consisting of basic, indivisible units of data that are predefined and natively supported by the language or underlying hardware. These types represent simple values that cannot be decomposed into simpler components without losing their meaning, serving as the atomic elements from which more complex data structures are constructed.^[12]^[13] Common examples of primitive data types include integers, floating-point numbers, booleans, and characters. Integers, such as the int type in C, typically represent signed 32-bit whole numbers, while unsigned variants like unsigned int handle non-negative values. Floating-point types, like float and double, adhere to the IEEE 754 standard for binary floating-point arithmetic, enabling representation of approximate real numbers with single (32-bit) or double (64-bit) precision.^[14] Booleans capture logical values of true or false, often implemented as a single bit or byte, and characters, such as char in C, store individual symbols using encodings like ASCII, with a typical size of 8 bits. Strings are sometimes treated as primitive types, as in JavaScript where they are immutable sequences of characters without object overhead, but in languages like C or Java, they are generally composite structures built from character arrays or objects.^[15]^[12] These types exhibit key properties that distinguish them as primitives: an atomic nature, meaning they are treated as single, indivisible units during operations; a fixed size determined by the language implementation, such as 8 bits for char or at least 16 bits for int in C; direct mapping to hardware representations like CPU registers or memory words for efficient processing; and the absence of internal structure, preventing nested access or modification of components.^[13] In standards like ISO/IEC 9899 for the C programming language, these primitives are defined with minimum size guarantees to ensure portability across implementations, while allowing flexibility for hardware-specific optimizations.

Key Distinctions

Primitive data types, such as integers and booleans, are scalar in nature, representing single, indivisible values without internal structure or components.^[16]^[17] In contrast, composite data types are aggregate structures that combine multiple fields or elements, allowing for the organization of related data into a cohesive unit.^[16]^[18] This structural distinction enables primitives to handle atomic information directly, while composites facilitate the representation of multifaceted entities. In terms of usage, primitive data types are primarily employed for straightforward operations like arithmetic calculations or logical comparisons, where simplicity and direct manipulation suffice.^[16] Composite data types, however, are suited for modeling complex real-world objects, such as an employee record encompassing fields for name, ID, and salary, thereby promoting modular and intuitive program design.^[17] This contrast highlights how primitives support basic computational tasks, whereas composites enhance abstraction for intricate data relationships. Performance-wise, primitive data types offer superior efficiency for direct operations due to their alignment with hardware-level instructions and fixed memory footprints, minimizing processing overhead.^[16] Composites, by aggregating multiple elements, introduce additional overhead in access and manipulation, such as through field selection or indexing mechanisms, which can impact execution speed in resource-intensive scenarios.^[16] Regarding type safety, primitive data types inherently lack mechanisms for encapsulation, exposing their values directly to operations without built-in protections against misuse.^[16] Composite data types, through their structural composition, enable encapsulation by bundling data with associated behaviors, thereby supporting stronger type checking and reducing errors from unintended access.^[19]^[20]

Types of Composites

Arrays

An array is an ordered, fixed-size collection of elements, all of the same data type, that can be accessed via indices.^[21] This structure allows for efficient storage and retrieval of multiple related values under a single identifier.^[22] Arrays exhibit several key properties that define their utility in programming. They are homogeneous, meaning every element must share the identical type, ensuring type safety and uniform memory allocation per element.^[23] In most implementations, arrays occupy contiguous blocks of memory, which facilitates rapid access through direct address calculations and optimizes cache performance. Indexing typically begins at zero, a convention that simplifies offset computations in languages like C and Java, where the address of an element at index i is base address plus i times element size.^[22] Basic operations on arrays include declaration, access, and length determination. Declaration specifies the element type and size, as in int arr[10]; for a static array of ten integers in C.^[24] Access occurs via bracket notation, such as arr[i] to retrieve or modify the element at index i, with bounds checking often omitted for performance but risking undefined behavior if exceeded.^[25] Length is obtained through language-specific mechanisms, like the length property in Java or sizeof(arr)/sizeof(arr[0]) in C for compile-time sizes.^[26] Arrays come in variants to accommodate different needs. Static arrays have a fixed size determined at compile time, providing constant-time access but no resizing capability.^[27] Dynamic arrays, by contrast, allow runtime resizing, often via reallocation and copying when capacity is exceeded, as seen in structures like Java's ArrayList or C++'s std::vector, balancing flexibility with amortized efficiency.^[28] Multi-dimensional arrays extend this to higher dimensions, typically implemented as arrays of arrays; for instance, a 2D array represents a matrix where rows are one-dimensional arrays, accessed as matrix[row][col].^[22]

Structures and Records

Structures and records are heterogeneous composite data types that group related variables of potentially different types into a single unit, with each variable identified by a unique name for access. This allows for the logical organization of data, such as coordinates in a point or attributes of an entity, without relying on positional indexing. Unlike homogeneous composites like arrays, which store elements of the same type, structures and records support varied field types to model complex, real-world entities more naturally.^[29]^[30] In the C programming language, a structure is declared using the struct keyword followed by a tag name and a list of member declarations enclosed in braces. For example:

c
struct Point {
    int x;
    int y;
};
struct Point {
    int x;
    int y;
};

This defines a type where x and y are named integer fields. The members are allocated in the order declared, and the structure's size is at least the sum of its members' sizes, potentially including padding bytes inserted between or after members to satisfy alignment requirements—typically aligning each member to a boundary matching its type's natural size (e.g., 4 bytes for an int on a 32-bit system). The overall structure alignment is determined by the strictest member requirement, ensuring efficient memory access.^[29] Initialization of structures in C can occur at declaration using brace-enclosed values in member order, such as struct Point p = {1, 2};, which sets p.x to 1 and p.y to 2. Designated initializers, introduced in C99, allow out-of-order assignment like struct Point p = {.y = 2, .x = 1}; for clarity. Members are accessed via the dot operator for direct variables (p.x) or arrow operator for pointers (pt->x), enabling read or write operations on individual fields. These features facilitate data aggregation, such as bundling employee details (ID, name, salary) into one entity for processing.^[31] The term "record" is synonymous with structure in several languages, emphasizing its role in data grouping. In Pascal, records are defined with the record keyword, supporting heterogeneous fields and optional variants for conditional structures, as in:

pascal
type
  TPerson = record
    name: [string](/page/String);
    age: [integer](/page/Integer);
  end;
type
  TPerson = record
    name: [string](/page/String);
    age: [integer](/page/Integer);
  end;

Access uses dot notation (person.name), and entire records can be assigned or compared directly, promoting structured programming. Pascal records, originating from the language's design for readability, are used to aggregate data like customer profiles.^[32] In COBOL, records form the basis of hierarchical data description in the Data Division, where level-01 items define record names grouping subordinate elementary or group items of varying types (e.g., PIC clauses for display or numeric formats). For instance, a record might aggregate invoice details under a level-01 INVOICE-RECORD. This structure supports fixed-length or variable-length formats, essential for business data processing and file I/O.^[33] Records are widely employed for data aggregation analogous to database rows, where each record encapsulates a set of named fields representing a tuple in a relational table, such as a row with columns for ID, name, and status. This mapping enables seamless transfer between in-memory structures and persistent storage, as seen in database APIs that treat rows as record-like objects.^[30]

Unions

A union is a composite data type in which all members share the same contiguous block of memory, with the overall size of the union determined by the size of its largest member, potentially including padding for alignment purposes.^[34] This memory overlap allows for efficient storage when only one member needs to be active at a time, but it introduces risks if multiple members are accessed simultaneously.^[35] Key properties of unions include type reinterpretation, where writing to one member overwrites the data of others, rendering only the written member valid for subsequent reads.^[36] Misuse, such as accessing a non-active member, typically results in undefined behavior, making unions type-unsafe without additional safeguards.^[34] To mitigate this, unions are often used in variant data representations or combined with tags to form discriminated (or tagged) unions, ensuring only the correct member is accessed based on an indicator field.^[37] Operations on unions involve declaration using a keyword like union followed by a member list, such as union { int i; float f; } u;, which allocates memory sufficient for the larger type (typically float here).^[34] Access occurs via member selectors (e.g., u.i or u.f), but programmers must ensure the accessed member matches the last written one to avoid undefined behavior; in some languages, such access requires explicit unsafe constructs.^[37] Unions originated in the development of the C programming language during the 1970s, where they were added to support low-level memory manipulation and bit-level operations, such as type punning for efficient data handling in systems programming.^[38] In modern languages like Rust, raw unions remain for interoperability (e.g., with C code) but are considered unsafe, with discriminated unions implemented safely via enums to enforce variant exclusivity at compile time, as in enum Variant { Int(i32), Float(f32) }.^[37] These constructs are particularly useful for representing heterogeneous data, such as protocol messages with varying payloads, while minimizing memory footprint.^[37]

Classes and Objects

In object-oriented programming, classes and objects form an advanced composite data type that integrates data fields and associated methods, thereby supporting key principles such as encapsulation—which bundles data and operations while restricting direct access to internal state—and polymorphism, which allows objects of different classes to be treated uniformly through a common interface.^[39]^[40] A class acts as a template or blueprint defining the structure and behavior for creating multiple instances, combining attributes (data) with functions (methods) into a cohesive unit that models real-world entities or abstract concepts.^[41]^[42] Objects are runtime instances of classes, each maintaining its own unique data values while sharing the class's methods, thus representing the dynamic realization of the composite type.^[43] Core properties include instance variables, which store the object's state as data members; methods, which encapsulate behaviors and operations on that state; constructors, special methods automatically invoked upon object creation to initialize instance variables; and destructors, methods called when an object is destroyed to release resources and perform cleanup.^[44]^[45]^[46] Instantiation, the process of creating an object from a class, allocates memory for the instance and invokes the constructor, as illustrated in this pseudocode example:

java
[class](/page/Class) Point {
    [int](/page/INT) x, y;  // instance variables
    Point([int](/page/INT) initX, [int](/page/INT) initY) {  // constructor
        x = initX;
        y = initY;
    }
    void move([int](/page/INT) dx, [int](/page/INT) dy) {  // [method](/page/Method)
        x += dx;
        y += dy;
    }
}
Point p = new Point(0, 0);  // [instantiation](/page/Instantiation)
[class](/page/Class) Point {
    [int](/page/INT) x, y;  // instance variables
    Point([int](/page/INT) initX, [int](/page/INT) initY) {  // constructor
        x = initX;
        y = initY;
    }
    void move([int](/page/INT) dx, [int](/page/INT) dy) {  // [method](/page/Method)
        x += dx;
        y += dy;
    }
}
Point p = new Point(0, 0);  // [instantiation](/page/Instantiation)

This operation binds the object's data to the class's behavioral template, enabling modular and reusable code.^[44]^[47] A hallmark of classes as composites in object-oriented programming is inheritance, where a derived class extends a base class, acquiring its data fields and methods while potentially adding or overriding elements to form hierarchical relationships that enhance code reusability and model "is-a" associations between types.^[48]^[49] Abstract classes further refine this by providing a partial implementation that cannot be instantiated directly, serving as a foundation for subclasses to complete through required method implementations, thus enforcing common interfaces across related composites.^[50]^[51] Unlike simpler records that merely aggregate data without behavioral integration, classes emphasize dynamic interaction between state and operations.^[52]

Implementation Details

Memory Representation

Composite data types are allocated in contiguous blocks of memory to enable efficient sequential access and cache utilization. Arrays store their elements in adjacent memory locations, ensuring that the entire collection occupies a single, unbroken region starting from the base address. Structures follow a similar contiguous layout for their members, arranged in the order of declaration, with each member's starting address determined by the offset from the structure's base, calculated as the sum of preceding members' sizes plus any necessary padding. To optimize performance on hardware that requires data alignment—such as placing multi-byte types on boundaries that are multiples of their size—compilers insert padding bytes between structure members or after the last member. For instance, a 4-byte integer (int) is typically aligned to a 4-byte boundary, so if preceded by a 1-byte character (char), three padding bytes may be added to ensure proper alignment. This padding minimizes access overhead but can increase the overall memory footprint of the composite type. The size of a structure, obtainable via the sizeof operator, equals the total of all member sizes plus any inserted padding, including trailing padding to satisfy the structure's own alignment requirements, which is often the strictest among its members. In contrast, a union's size is determined by the largest member, as all members overlap at the same memory location, with possible additional padding to meet alignment needs. For example, a union containing a 4-byte int and an 8-byte double would have a size of 8 bytes plus any padding for the double's alignment. Endianness, the ordering of bytes within multi-byte fields, influences the interpretation of data in composite types but does not alter the relative layout of fields themselves. On little-endian systems, the least significant byte of a multi-byte field like an int is stored at the lowest address, while big-endian systems store the most significant byte first; this affects serialization or cross-platform data exchange involving composites with such fields.

Access and Manipulation

Accessing elements within composite data types typically involves language-specific syntax that enables direct or indirect referencing of components. For arrays, which store homogeneous elements in contiguous memory, access is achieved through indexing operations, such as array[i] in languages like C and Java, where i is an integer offset starting from 0. This method computes the memory address of the element by adding the offset to the base address of the array. Structures and records, which aggregate heterogeneous fields, use dot notation for member selection, exemplified by structVar.fieldName in C, allowing retrieval or modification of named components without explicit offset calculations. Pointers and references provide indirect access; in C, dereferencing a pointer to an array or structure uses *ptr or ptr->field for pointed-to members, while C++ references act as aliases (e.g., ref.field) that behave like direct access but bind to existing objects. These mechanisms ensure type-safe navigation, though pointers introduce risks like null dereferencing if not managed properly.^[53]^[54]^[55] Manipulation of composite data types includes assignment, copying, and iteration to alter or traverse contents. Assignment directly binds a new value to the composite or its elements, such as array[i] = value for arrays or structVar.[field](/page/Field) = value for structures, overwriting existing data in place. Copying distinguishes between shallow and deep operations, particularly for nested composites; a shallow copy creates a new container with references to the original elements (e.g., shallow = original[:] in Python lists), while a deep copy recursively duplicates all nested objects (e.g., via copy.deepcopy()), preventing shared mutations. This is crucial for mutable composites like arrays of structures, where shallow copies can lead to unintended side effects. Iteration enables systematic traversal, often using for loops for arrays (e.g., for (int i = 0; i < length; i++) in C or Java), processing each element sequentially and supporting operations like summation or search. These techniques maintain data integrity during runtime modifications.^[56]^[57]^[58] Error handling for access and manipulation varies by language safety guarantees. In safe languages like Java, bounds checking is enforced at runtime for array indexing; attempting access beyond the array's length (e.g., array[10] on a 5-element array) throws an ArrayIndexOutOfBoundsException, preventing invalid memory reads. Conversely, unsafe languages like C omit automatic bounds checking to prioritize performance, resulting in undefined behavior for out-of-bounds access, which may cause crashes, data corruption, or security exploits without guaranteed error signals. For structures, field access errors are rarer but can occur via invalid pointers, leading to segmentation faults in unsafe contexts. Programmers in unsafe languages must implement manual checks (e.g., if (i < length)) to mitigate risks.^[59]^[60] Performance considerations in accessing and manipulating composites hinge on memory access patterns and hardware cache behavior. Contiguous composites like arrays exhibit strong spatial locality, as sequential indexing loads adjacent elements into the CPU cache, reducing misses and enabling vectorized operations for significant speed gains in tight loops compared to non-contiguous access. Scattered structures or pointer-based traversals (e.g., linked lists or dynamically allocated record arrays) suffer from poor locality, incurring frequent cache misses due to unpredictable jumps, which can degrade performance by factors of 2-10 on modern processors. Optimizing involves aligning structures contiguously or using cache-aware iteration to minimize these penalties.^[61]^[62]

Language-Specific Examples

In Procedural Languages

In procedural languages such as C and Pascal, composite data types enable the grouping of related data elements without incorporating object-oriented features like inheritance or polymorphism. These languages emphasize explicit control over data structures, allowing programmers to define and manipulate composites through straightforward syntax that prioritizes efficiency and portability.^[29]^[63] In C, structures (structs) provide a fundamental way to create composite types by aggregating variables of different types into a single unit. For instance, a struct can represent an employee record as follows:

c
struct Employee {
    char name[50];
    int id;
};
struct Employee {
    char name[50];
    int id;
};

This declaration defines a new type where members are accessed using the dot operator, such as employee.name or employee.id. Arrays serve as another core composite, declared statically like int nums[100]; for fixed-size collections or dynamically using malloc for runtime sizing. Unions in C allow overlapping storage for variant data, mimicking variant records by sharing memory among members of different types; an example is:

c
union Variant {
    int i;
    float f;
};
union Variant {
    int i;
    float f;
};

Here, only one member is active at a time, with the union's size matching its largest member, useful for space-efficient handling of alternative data representations.^[29]^[64] Pascal employs records to achieve similar aggregation, defining them with a type declaration that includes named fields of varying types. A comparable employee example is:

pascal
type
  Employee = record
    name: string;
    id: integer;
  end;
type
  Employee = record
    name: string;
    id: integer;
  end;

Field access occurs via the dot notation, like emp.name or emp.id, enabling structured data handling akin to C structs but with Pascal's emphasis on type safety and readability. Records in Pascal can include variant parts for conditional structures, though basic fixed records suffice for most procedural needs.^[65]^[63] Composites in these languages are commonly passed as function parameters, either by value for small structs (copying the entire structure) or by reference using pointers in C to avoid overhead, as in void process_employee(struct Employee *emp);. Dynamic allocation enhances flexibility; in C, arrays or structs can be allocated at runtime with malloc, such as int *nums = malloc(100 * sizeof(int));, requiring explicit deallocation via free to prevent memory leaks. Pascal supports dynamic allocation through new and dispose for heap-based records, maintaining procedural discipline.^[66]^[63] A key limitation of composites in procedural languages like C and Pascal is the absence of built-in object-oriented mechanisms, necessitating manual implementation of any advanced behaviors through pointers and functions. Memory management remains entirely under programmer control, with no automatic garbage collection, which demands careful allocation and deallocation to manage resources effectively but exposes risks like dangling pointers if mishandled.^[67]^[63]

In Object-Oriented Languages

In object-oriented programming (OOP) languages, composite data types are primarily embodied through classes and objects, which encapsulate multiple data members (fields) and behaviors (methods) into a single unit, promoting modularity and abstraction. Classes serve as blueprints for creating objects, allowing programmers to define complex structures that combine primitive types and other composites. This approach contrasts with simpler records by integrating access controls, constructors, and inheritance mechanisms to manage state and interactions. In Java, a class is defined using the class keyword, with fields typically declared as private for encapsulation, and public methods for controlled access. For instance, a Point class representing a 2D coordinate might be implemented as follows:

java
[public](/page/Public) [class](/page/Class) Point {
    [private](/page/Private) int x;
    [private](/page/Private) int y;
    
    [public](/page/Public) void setX(int val) {
        x = val;
    }
    
    [public](/page/Public) int getX() {
        return x;
    }
    
    // Similar methods for y
}
[public](/page/Public) [class](/page/Class) Point {
    [private](/page/Private) int x;
    [private](/page/Private) int y;
    
    [public](/page/Public) void setX(int val) {
        x = val;
    }
    
    [public](/page/Public) int getX() {
        return x;
    }
    
    // Similar methods for y
}

Objects are instantiated using the new keyword, such as Point p = new Point();, which allocates memory on the heap and invokes the default constructor. This structure allows composites to model real-world entities, like a Point combining integer coordinates with setter and getter methods for manipulation. In C++, classes and structs provide similar functionality, with structs allowing public members by default while classes enforce private access unless specified otherwise. A Point class example includes data members and methods, often with constructors for initialization:

cpp
class Point {
private:
    int x;
    int y;
public:
    Point(int xVal = 0, int yVal = 0) : x(xVal), y(yVal) {}
    
    void move(int dx) {
        x += dx;
    }
    
    int getX() const {
        return x;
    }
};
class Point {
private:
    int x;
    int y;
public:
    Point(int xVal = 0, int yVal = 0) : x(xVal), y(yVal) {}
    
    void move(int dx) {
        x += dx;
    }
    
    int getX() const {
        return x;
    }
};

Instantiation occurs via Point p(5, 10);, leveraging constructors to set initial values and supporting operator overloading for intuitive operations on composites. C++ classes enable fine-grained control over memory layout, such as padding for alignment in multi-member composites. Key features of composites in OOP include inheritance and polymorphism, which extend basic classes into hierarchies. In C++, inheritance is declared as class Derived : public Base {}, allowing a derived class to inherit members from a base, reducing redundancy while composing more complex types. Polymorphism is achieved through virtual methods, enabling runtime method resolution based on object type, as in virtual void draw() = 0; for abstract base classes. Java supports similar mechanisms with extends for inheritance, e.g., class Circle extends Shape {}, and interfaces for polymorphic behavior via method overriding. Memory management for OOP composites differs significantly between languages. Java employs automatic garbage collection to reclaim memory for unreferenced objects, handled by the JVM without explicit deallocation, which simplifies development but may introduce pauses. In contrast, C++ requires manual memory management using new and delete, or smart pointers like std::unique_ptr for automatic deallocation, providing efficiency but risking leaks or dangling pointers if mismanaged. This manual approach allows precise control over composite layouts in memory, essential for performance-critical applications.

In Scripting Languages

In scripting languages such as Python and JavaScript, composite data types emphasize flexibility and dynamic behavior, allowing developers to create and manipulate structured data without compile-time enforcement of types or sizes.^[68]^[69] These languages support built-in composites like lists and dictionaries in Python, or arrays and objects in JavaScript, which facilitate rapid development by enabling runtime adjustments and loose typing.^[68]^[70] Python provides lists as ordered, mutable sequences that function as dynamic arrays, capable of holding elements of mixed types and resizing automatically during operations like appending or extending.^[71] For example, a list can be initialized and modified as follows:

python
fruits = ['orange', 'apple']
fruits.append('pear')
print(fruits)  # Output: ['orange', 'apple', 'pear']
fruits = ['orange', 'apple']
fruits.append('pear')
print(fruits)  # Output: ['orange', 'apple', 'pear']

Dictionaries in Python serve as unordered collections of key-value pairs, akin to record-like structures, where keys are immutable and values can be accessed or updated dynamically without predefined schemas.^[72] An illustrative dictionary resembles a simple record:

python
point = {'x': 1, 'y': 2}
print(point['x'])  # Output: 1
point['z'] = 3  # Adds a new key-value pair
point = {'x': 1, 'y': 2}
print(point['x'])  # Output: 1
point['z'] = 3  # Adds a new key-value pair

Classes in Python enable the creation of user-defined composite types that bundle attributes and methods, supporting object-oriented patterns while allowing instance attributes to be added or modified at runtime.^[73] A basic class example is:

python
[class](/page/Class) Point:
    def __init__(self, x, y):
        [self](/page/Self).x = x
        [self](/page/Self).y = y

p = Point(2, 3)
print(p.x, p.y)  # Output: 2 3
p.z = 4  # Runtime addition of attribute
[class](/page/Class) Point:
    def __init__(self, x, y):
        [self](/page/Self).x = x
        [self](/page/Self).y = y

p = Point(2, 3)
print(p.x, p.y)  # Output: 2 3
p.z = 4  # Runtime addition of attribute

In JavaScript, arrays act as dynamic, resizable collections that store elements of any type, indexed from zero, and support methods for manipulation without fixed size constraints.^[70] For instance:

javascript
let fruits = ['apple', 'banana'];
fruits.push('orange');
console.log(fruits);  // Output: ['apple', 'banana', 'orange']
let fruits = ['apple', 'banana'];
fruits.push('orange');
console.log(fruits);  // Output: ['apple', 'banana', 'orange']

Objects in JavaScript represent composite structures through key-value pairs, where properties can include data or functions, and support prototypal inheritance for sharing behaviors across instances.^[69] A simple object example is:

javascript
let point = {x: 1, y: 2};
console.log(point.x);  // Output: 1
point.z = 3;  // Adds property at runtime
let point = {x: 1, y: 2};
console.log(point.x);  // Output: 1
point.z = 3;  // Adds property at runtime

Key features in these scripting languages include duck typing, where compatibility is determined by attributes and methods rather than explicit type declarations—as seen in Python's interchangeable use of objects with similar interfaces—and the ability to modify composites at runtime without strict size or type enforcement.^[74] This dynamic nature, exemplified by JavaScript's prototypal model where objects inherit from prototypes, enhances flexibility for rapid prototyping and iterative development in scripting environments.^[69]

References

[1]
Programming Concepts Basic Data Types - Computer Science
Composite data types are collections or groupings of multiple data items into a single entity. Different languages provide support for different kinds of ...
[2]
1.2. Abstract Data Types — CSci 2101 Data Structures - OpenDSA
Such a record is an example of an aggregate type or composite type. A data item is a piece of information or a record whose value is drawn from a type. A data ...
[3]
Chapter 8 Composite Types - FSU Computer Science
Composite types: records, arrays, and recursive types. Records: whole-record operations, variant records/unions, type safety, and memory layout. Arrays: memory ...
[4]
Composite Data - an overview | ScienceDirect Topics
Composite data types in computer science are formed by combining scalar, or atomic, data types to create more complex structures. 1. Examples of composite data ...
[5]
Composite Data Types - Visual Basic - Microsoft Learn
Sep 15, 2021 · Data Types. A composite type is different from the data type of any of its components. For example, an array of Integer elements is not of ...Missing: computer science
[6]
[PDF] Block-structured procedural languages Algol and Pascal
with emphasis on scientific and numerical applications. ♢ Compared with FORTRAN, Algol 60 provided better ways to represent data structures and, like LISP, ...
[7]
[PDF] A PL/I Primer - Bitsavers.org
The purpose of this publication is to provide tutorial material not only for the person with some knowledge of computer programming, but also for the novice ...
[8]
Index register - Computer History Wiki
Dec 15, 2018 · In early computers with index registers (such as the IBM 704 and descendants), they were special registers dedicated to this function; in ...
[9]
A history of ALGOL 68 | History of programming languages---II
ALGOL 68 is a language with a lot of history. The reader will hear of discord, resignations, unreadable documents, a minority report, and all manner of ...
[10]
What was the minimum amount of addressable memory? When and ...
Oct 19, 2025 · Byte addressable only means that the smallest common data unit that can be named (addressed) in all (most *2) instructions is less than a memory ...Did any computer use a 7-bit byte? - Retrocomputing Stack ExchangeWhy did IBM 7030 or IBM 360 use byte and word addressing ...More results from retrocomputing.stackexchange.com
[11]
[PDF] Rationale for the Design of the ADA (Tradename) Programming ...
Oct 2, 2025 · This document is the rationale for the design of the Ada 1986 programming language, including goals, structure, and classical programming.
[12]
Primitive Data Types - Java™ Tutorials - Oracle Help Center
A primitive type is predefined by the language and is named by a reserved keyword. Primitive values do not share state with other primitive values. The eight ...
[13]
What is meant by a primitive data type?
Mar 14, 2012 · A primitive data type is a basic, built-in type that cannot be decomposed into simpler components, representing a single value.
[14]
IEEE 754-2019 - IEEE SA
Jul 22, 2019 · This standard specifies interchange and arithmetic formats and methods for binary and decimal floating-point arithmetic in computer programming environments.
[15]
Primitive - Glossary - MDN Web Docs
Jul 11, 2025 · In JavaScript, a primitive is data that is not an object and has no methods or properties. There are 7 primitive data types.BigInt · String · Number · Null
[16]
[PDF] Basic Data Types - Purdue Computer Science
In C, basic data types include int, float, double, char, void, and enum. Composite types include pointers, functions, arrays, structs, and unions.
[17]
Basic Java Data Structures
Atomic or primitive type: A data type whose elements are single, nondecomposable data items (as primitives in Java) · Composite type A data type whose elements ...
[18]
None
### Atomic and Composite Data Types
[19]
1. Abstraction & Encapsulation - CS2030 Programming Methodology II
The concept of keeping all the data and functions operating on the data related to a composite data type together within an abstraction barrier is called ...
[20]
Reading 10: Abstract Data Types - MIT
Encapsulation: The local variables of a method are encapsulated, since only the method itself can use or modify them. Contrast with global variables, which are ...
[21]
1.10. Arrays — OpenDSA Complete Catalog
An array is a named collection of contiguous storage locations–storage locations that are next to each other–that contain data items of the same type.
[22]
1.4 Arrays - Computer Science: An Interdisciplinary Approach
A one-dimensional array (or array) is a data structure that stores a sequence of (references to) objects. We refer to the objects within an array as its ...
[23]
[PDF] 4. Arrays - cs.Princeton
Arrays: A basic building block in programming. • They enable storage of large amounts of data (values all of the same type). • With an index, a program can ...<|control11|><|separator|>
[24]
Array Basics
Declaring Arrays: An array declaration is similar to the form of a normal declaration (typeName variableName), but we add on a size: typeName variableName[size];.
[25]
[PDF] Lecture 4: Elementary Data Structures - Steven Skiena
Constant-time access given the index. • Arrays consist purely of data, so no space is wasted with links or other formatting information. • Physical continuity ( ...Missing: properties | Show results with:properties
[26]
[PDF] Arrays - Purdue Computer Science
The array length can be read with the instance variable length, e.g. the following code displays the number 20 (the size, or length of the. Species array, entry):.
[27]
Data Structures in Real Life: Arrays - Sites@Duke Express
Jun 28, 2021 · You can think of an array as a row of boxes with each box capable of storing a single element. The type of element that is stored in each box ...Missing: definition | Show results with:definition
[28]
[PDF] Lecture 4: Elementary Data Structures Steven Skiena Department of ...
With dynamic arrays we start with an array of size 1, and double its size ... The relative advantages of linked lists over static arrays include: 1 ...
[29]
https://en.cppreference.com/w/c/language/struct
[30]
Records - Ada Computer Science
A record is a data structure that consists of a fixed number of variables, called fields. Every field has an identifier (field name) and a data type.
[31]
Struct and union initialization - cppreference.com - C++ Reference
Jan 26, 2023 · When initializing a struct, the first initializer in the list initializes the first declared member (unless a designator is specified)(since C99) ...
[32]
Record - Free Pascal wiki
Nov 28, 2022 · A record is a structured data type in Pascal used to group data logically. It can contain different types of elements, each called a field.Missing: programming | Show results with:programming
[33]
https://www.ibm.com/docs/en/cobol-zos/6.3.0?topic=cobol-data-division
[34]
[PDF] Programming languages — C - Open Standards
May 6, 2016 · Its purpose is to promote portability, reliability, maintainability, and efficient execution of C language programs on a variety of computing ...
[35]
union - Microsoft Learn
Apr 4, 2023 · A union is a user-defined type where all members share the same memory location, and only one object can be stored at a time.
[36]
Unions (GNU C Language Manual)
A union type defines alternative ways of looking at the same piece of memory. Each alternative view is defined with a data type, and identified by a name.
[37]
Unions - The Rust Reference
The key property of unions is that all fields of a union share common storage. As a result, writes to one field of a union can overwrite its other fields.
[38]
[PDF] The development of the C programming language - Brent Hailpern
During 1973-1980, the language grew a bit: the type structure gained unsigned, long, union, and enumeration types, and structures became nearly first-class ...<|control11|><|separator|>
[39]
Object-Oriented Terminology
Object-oriented methodology relies on three characteristics that define object-oriented languages: encapsulation, polymorphism, and inheritance.Messages And Receivers · Polymorphism And Overloading · Classes
[40]
4.2. Introduction to Object-Oriented Programming - OpenDSA
Run-time polymorphism. Base classes may define and implement abstract, or virtual methods, and derived classes can override them, which means they provide their ...
[41]
Objects & Classes - CS106A
May 25, 2020 · Object Oriented Programming is built upon the idea of classes, which act like blueprints for objects that need to be represented in a program.Defining A Class · Giving Our Class Properties · Adding Behaviours To A Class
[42]
Object-Oriented Programming in Python - BYU CS 111
A class is a blueprint or template that defines the attributes (data) and behaviors (methods) of objects. An object, on the other hand, is an instance of a ...
[43]
Understanding classes and objects
A class is a template for defining objects, specifying properties and behavior. An object is an instance of a class, with a state of properties.
[44]
Classes and Objects
A class is a user-defined type that describes what a certain type of object will look like. A class description consists of a declaration and a definition.
[45]
[PDF] Constructors and Destructors
Sep 28, 2007 · Whenever a class definition omits the destructor from the interface, the compiler synthesizes a public destructor with an empty body.
[46]
Object-Oriented Programming and Classes - RC Learning Portal
An object is a data structure which has associated data (variables) and behaviors (procedures). Objects work on their own data, communicating with outside units ...
[47]
[PDF] Chapter 7 Object-Oriented Programming
To describe objects and classes, and use classes to model objects (§7.2). To define classes (§7.2.1). To construct an object using a constructor that ...
[48]
Inheritance
Inheritance is an object-oriented programming concept used to model an "is-a" relationship between two classes.
[49]
[PDF] Object-Oriented Programming: Inheritance - User pages
Inheritance promotes software reusability, where a subclass extends a superclass, inheriting its data and behaviors, and can customize with new capabilities.
[50]
Inheritance, Polymorphism, and Abstract Classes
An abstract class is one that is not used to construct objects, but only as a basis for making subclasses. An abstract class exists only to express the common ...
[51]
[PDF] Inheritance & Abstract Classes
Abstract classes can have constructors that initialize the fields defined in the abstract class. Abstract classes can also have concrete (implemented) methods. ...
[52]
Chapter 10: Data Abstraction and Object Orientation - CS, FSU
Generally, initialization in an object-oriented paradigm has been called a "constructor"; some languages have also allowed for "destructors", though this is ...
[53]
[PDF] Composite Types
Both data structures perm it individual characters to be accessed using double subscripts, but the memory layout (and corresponding address arithmetic) is quite ...
[54]
C Structures - GeeksforGeeks
Oct 25, 2025 · The struct keyword is used to define a structure. The items in the structure are called its members and they can be of any valid data type.
[55]
Pointers and References in C++ - GeeksforGeeks
Aug 21, 2025 · Pointers are used to store the memory address of another variable whereas references are used to create an alias for an already existing variable.Arrays · Call by Reference · C++ Function Call By Value<|separator|>
[56]
copy — Shallow and deep copy operations ... - Python documentation
A copy is sometimes needed so one can change one copy without changing the other. This module provides generic shallow and deep copy operations (explained ...
[57]
Iteration Statements in Programming - GeeksforGeeks
Jul 23, 2025 · For loops iterate over a range of values or elements in a collection. These are mainly used where the number of iterations is known beforehand ...
[58]
12.1 — Introduction to compound data types - Learn C++
Jan 18, 2022 · Pointer to data member ... In this chapter, we'll cover some of the more straightforward compound types, including l-value references , and ...
[59]
Bounds Checking
When your program is running and it tries to access an element of an array, the Java virtual machine checks that the array element actually exists.
[60]
A Guide to Undefined Behavior in C and C++, Part 1
Jul 9, 2010 · In a safe programming language, errors are trapped as they happen. Java, for example, is largely safe via its exception system. In an unsafe ...
[61]
Cache-Friendly Code | Baeldung on Computer Science
Mar 18, 2024 · The reason is that arrays take up contiguous memory, so are more cache-friendly. That isn't the case with the other two structures. 4.6 ...
[62]
Why Arrays have better cache locality than Linked list?
Aug 28, 2023 · Arrays have better cache locality than linked lists because they store their elements contiguously in memory, while linked lists store their ...
[63]
[PDF] Pascal User Manual and Report
Throughout the example programs in this User Manual, the predeclared procedures Read, Readln, Write, and Writeln are used. This section describes how to ...
[64]
https://en.cppreference.com/w/c/language/union
[65]
Record types - Free Pascal
Free Pascal supports fixed records and records with variant parts. The syntax diagram for a record type isMissing: documentation | Show results with:documentation
[66]
malloc - cppreference.com
### Summary of malloc Example for Dynamic Array Allocation in C
[67]
The C Programming Language: | Guide books | ACM Digital Library
This book is meant to help the reader learn how to program in C. The book assumes some familiarity with basic programming concepts like variables, assignment ...
[68]
5. Data Structures
### Summary of Lists and Dictionaries as Composite Data Types
[69]
https://developer.mozilla.org/en-US/docs/Web/JavaScript/Guide/Working_with_Objects
[70]
Array - JavaScript - MDN Web Docs
The Array object, as with arrays in other programming languages, enables storing a collection of multiple items under a single variable name.
[71]
https://docs.python.org/3/tutorial/datastructures.html#more-on-lists
[72]
https://docs.python.org/3/tutorial/datastructures.html#dictionaries
[73]
https://docs.python.org/3/tutorial/classes.html#class-definition-syntax
[74]
https://docs.python.org/3/tutorial/classes.html#random-remarks