Append-only
Append-only storage is a core principle in computer science and data management, first proposed in the context of log-structured file systems in the early 1990s.[1] It characterizes systems where new data is added exclusively by appending it to the existing structure, while all prior data remains immutable and cannot be modified or deleted.[2] This design ensures tamper resistance and chronological integrity, making it ideal for applications requiring verifiable historical records.[2] In database systems, append-only mechanisms leverage sequential writes to achieve high performance, particularly in time-series databases where data arrives in ordered streams, such as sensor telemetry or market feeds.[2] The immutability supports robust audit trails, simplifies backups and recovery processes, and enables efficient compaction to organize data without altering originals.[2] For instance, write-ahead logging in databases often employs append-only files to record changes durably before applying them. 
Blockchain technology exemplifies append-only storage on a distributed scale, functioning as a decentralized ledger where transactions are appended in sequential blocks, each cryptographically linked to the previous one to prevent alterations.[3] This property underpins the security and consensus mechanisms in systems like Bitcoin and Ethereum, ensuring a shared, append-only record across untrusted networks.[3] Beyond blockchains, append-only interfaces have been adapted for semiconductor storage, such as NAND flash, to optimize for log-structured filesystems and avoid costly in-place rewrites.[4] Advanced variants, like attested append-only memory, extend the concept to distributed computing by providing cryptographic proofs of log consistency, enhancing fault tolerance in Byzantine environments.[5] Overall, append-only approaches prioritize durability and verifiability, influencing modern data architectures in finance, logging, and high-velocity analytics.[2]
Definition and Principles
Core Concept
Append-only storage refers to a data management paradigm in computing where new information is exclusively added to the end of an existing dataset, rendering all prior data immutable and prohibiting any modifications or deletions.[6] This design treats the storage as a linear, chronologically ordered sequence, often implemented as a log, ensuring that the historical record remains intact and tamper-proof.[6] The concept emerged in the 1970s and 1980s within early Unix-like operating systems, where it was employed in file systems and logging mechanisms to preserve data integrity amid growing system complexity.[7] For instance, Unix introduced support for append-mode file operations through flags like O_APPEND in versions such as 4.2BSD around 1983, allowing writes to reliably extend files without overwriting existing content.[7] Under append-only rules, write operations function by sequentially appending new records to the storage, capitalizing on the performance advantages of contiguous disk writes. Read operations, in turn, typically involve traversing the sequence from a designated offset—often the end for tailing recent additions—or relying on secondary indexes to access specific entries without full scans.[6] A representative example is a basic log file that accumulates timestamped event records, such as system alerts, in strict chronological order, with each addition preserving the unaltered history.[6] This approach underpins the broader immutability principle, ensuring once-written data endures unchanged.
Immutability and Append Operations
In append-only systems, the immutability principle dictates that once data is written to storage, it cannot be modified or overwritten, creating a permanent and tamper-evident record that preserves historical integrity.[1] This contrasts sharply with mutable storage systems, such as traditional file systems or B-tree-based databases, where updates replace existing data in place, potentially complicating recovery and auditability.[8] By enforcing write-once semantics, append-only approaches ensure that all subsequent changes are recorded as new entries, fostering reliability in environments requiring verifiable data lineages, as exemplified in log-structured storage designs.[9] Append operations in these systems involve writing new data sequentially to the end of the storage medium, typically using offsets or pointers to reference positions without altering prior content.[1] For instance, in a log-structured file system, all modifications—including file data, attributes, and indices—are appended to a contiguous log structure, enabling efficient indexing for random access while avoiding in-place edits.[1] This mechanic leverages the storage device's natural affinity for sequential I/O, where pointers in an inode map or similar structure track the latest valid versions of data blocks.[9] To manage logical deletions or updates without violating immutability, append-only systems append new versions of the data and mark obsolete entries as invalid, often through metadata like version numbers or tombstones, deferring physical removal.[1] Compaction and garbage collection then reorganize storage by copying live data to fresh segments and reclaiming space from invalid portions, as seen in segment cleaning processes that use cost-benefit policies to minimize overhead.[1] In log-structured merge-trees, this involves rolling merges that batch and consolidate entries across components, preserving the append-only nature while controlling storage growth.[9] The performance 
implications of these append operations stem from sequential writes, which reduce mechanical seek times on spinning disks and minimize write amplification on solid-state drives by aligning with their parallel access patterns.[1] On hard disk drives, this can achieve 65-75% of raw disk bandwidth for writes, compared to 5-10% in traditional systems with random updates.[1] For SSDs, sequential appends further optimize endurance and throughput by enabling efficient garbage collection at the device level, transforming random workloads into batched, linear operations.[10]
Advantages and Limitations
Key Benefits
Append-only storage systems provide robust data integrity and tamper resistance through their immutable nature, where data can only be added sequentially without modification or deletion of existing records. This design prevents accidental overwrites or malicious alterations, ensuring a verifiable historical record that is essential for regulatory compliance and auditing purposes. For instance, the append-only log structure in archival systems like Venti eliminates many software errors that could lead to data corruption by simplifying the storage abstraction and avoiding complex update mechanisms.[11] Similarly, in secure cloud environments, append-only ledgers maintain tamper-proof properties via cryptographic commitments, allowing auditors to verify the completeness and authenticity of records without risking unauthorized changes.[12] A key advantage is the high write performance achieved by sequential append operations, which minimize random input/output (I/O) patterns and leverage the efficiency of modern storage devices like SSDs. In log-structured file systems (LFS), all modifications are written to disk in a contiguous log, amortizing the cost of metadata updates and enabling sustained throughput even under heavy write loads, such as in real-time data ingestion pipelines. This approach can deliver orders-of-magnitude improvements in write speeds compared to traditional in-place update systems, particularly for workloads involving frequent small writes.[13] For example, systems employing write-ahead logging (WAL) further optimize this by ensuring logs are appended durably before in-memory changes, supporting high-throughput scenarios without performance degradation from seek times.[14] Simplified crash recovery is another benefit, as append-only logs serve as a complete, ordered history that can be replayed from the last consistent checkpoint to restore system state without needing to undo partial updates. 
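The checkpoint-plus-replay recovery just described can be sketched in a few lines of Python. The JSON-lines record format, the `op` field, and the `recover` helper below are hypothetical illustrations, not drawn from any particular database:

```python
import json

def recover(log_path, checkpoint_state, checkpoint_offset):
    """Rebuild in-memory state by replaying log entries written after
    the last checkpoint (hypothetical JSON-lines record format)."""
    state = dict(checkpoint_state)
    with open(log_path, "r") as log:
        log.seek(checkpoint_offset)       # skip entries already checkpointed
        for line in log:
            entry = json.loads(line)
            if entry["op"] == "set":      # reapply committed operations in order
                state[entry["key"]] = entry["value"]
            elif entry["op"] == "delete":
                state.pop(entry["key"], None)
    return state
```

A real WAL record would additionally carry sequence numbers and checksums so that torn or partial writes at the tail can be detected and discarded during replay.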
This replay mechanism, common in WAL-based recovery algorithms like ARIES,[15] ensures atomicity and durability by reapplying committed operations sequentially, reducing recovery time significantly compared to systems requiring complex undo logs. In LFS designs, the log's structure allows for rapid merging with the file system during recovery, avoiding the need to reconstruct fragmented data and minimizing downtime after failures.[13] Such efficiency is particularly valuable in high-availability environments where quick restoration is critical. In distributed environments, append-only storage enhances scalability through linear replication of logs across nodes, as append operations are inherently sequential and can be made idempotent using offsets or sequence numbers to handle duplicates without state corruption. This facilitates easier partitioning and sharding, enabling systems to scale horizontally by distributing write loads while maintaining consistency via log replay. Log-structured databases like LogBase demonstrate this by using append-only storage to eliminate write bottlenecks in cloud settings, supporting massive parallelism and fault-tolerant replication across clusters.[16] Replicated WAL systems further bolster this by coordinating appends in a fault-tolerant manner, ensuring high availability without the overhead of synchronized updates.[17]
Potential Drawbacks
Append-only storage systems, while promoting immutability by permitting only additions to data structures, inherently lead to storage inefficiency through continuous accumulation of records without in-place modifications. Each update or deletion operation results in the appending of new entries, such as revised versions or tombstone markers, causing the overall data volume to grow unbounded over time and necessitating eventual space management interventions like archiving. This redundancy can significantly inflate storage requirements, as historical versions persist alongside current ones, leading to wasted space in scenarios where data evolution is frequent.[18] Read operations in append-only designs often face increased complexity, as retrieving the current state or specific records may require scanning multiple appended segments or versions rather than direct access to a single location. Without additional indexing mechanisms, queries must filter through superseded entries, amplifying the number of disk accesses and potentially degrading latency for point lookups or historical reconstructions. This scan-heavy approach contrasts with mutable systems' efficient in-place retrieval, making append-only storage less suitable for read-intensive workloads without supplementary optimizations.[18][19] Handling updates and deletions logically in append-only environments complicates state management, as these operations are simulated by appending new records that override prior ones, requiring full log replay to derive the effective current view. For instance, reconstructing an entity's state involves sequentially processing all relevant appends from the log's inception, which becomes progressively more cumbersome as the log lengthens. 
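A toy in-memory log illustrates how updates and deletes both become appends, and why deriving the effective current view requires replaying from the beginning. The record fields here are invented for illustration:

```python
def append(log, key, value=None, deleted=False):
    """Updates and deletes are both plain appends; the 'deleted' flag
    acts as a tombstone that supersedes earlier versions."""
    log.append({"key": key, "value": value, "deleted": deleted})

def current_state(log):
    """Derive the effective view by replaying every entry from the
    start of the log; later entries win over earlier ones."""
    state = {}
    for entry in log:
        if entry["deleted"]:
            state.pop(entry["key"], None)   # tombstone hides prior versions
        else:
            state[entry["key"]] = entry["value"]
    return state

log = []
append(log, "x", 1)
append(log, "x", 2)             # logical update: a new version is appended
append(log, "y", 3)
append(log, "x", deleted=True)  # logical delete: a tombstone is appended
```

Note that after these four appends the effective state holds a single key, yet all four records persist, which is precisely the storage growth and replay cost discussed above.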
This process not only hinders real-time consistency checks but also introduces challenges in maintaining a coherent, up-to-date representation without dedicated replay tooling.[20][18] The resource demands of append-only systems are elevated due to the need for merging or filtering operations during reads and maintenance, often resulting in higher CPU and memory usage compared to traditional mutable storage. Write amplification arises from repeated data copying in versioned appends, while read amplification from multi-segment scans further strains computational resources, particularly in large-scale deployments where frequent reconstructions are required. These overheads can limit scalability in resource-constrained settings, underscoring the trade-offs of immutability for performance-critical applications.[18]
Applications
Logging and Audit Trails
Append-only logs serve as a foundational mechanism in system logging by recording events such as errors, user actions, and transactions in a sequential, time-ordered manner, enabling effective debugging, performance monitoring, and incident response. These logs function as an immutable sequence where new entries are added to the end without altering prior records, ensuring a complete historical record of system behavior. This approach facilitates real-time analysis and historical review, as the ordered structure allows administrators to reconstruct event timelines accurately.[6][21] In audit trail applications, particularly within financial systems, append-only mechanisms provide immutable records that support regulatory compliance, such as the Sarbanes-Oxley Act (SOX), by demonstrating non-repudiation of actions through tamper-evident logging. SOX mandates robust internal controls over financial reporting, including the maintenance of detailed audit logs that capture changes to financial data, user identities, and timestamps, with immutability ensuring that records cannot be retroactively modified to conceal discrepancies. This integrity is critical for proving accountability during audits and investigations, reducing fraud risks, and meeting retention requirements that often span several years.[21] Log rotation and retention strategies in append-only systems manage growth by appending entries until predefined size limits or time intervals are reached, at which point a new file is created while older files are archived and preserved to comply with legal holds and regulatory mandates. Rotation utilities close the active log file, compress archived versions for storage efficiency, and maintain access controls to prevent unauthorized alterations, ensuring that historical data remains available for forensic analysis or compliance verification without disrupting ongoing logging. 
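Size-based rotation of an append-mode log can be sketched with Python's standard logging.handlers.RotatingFileHandler, which appends to the active file until a size limit is reached and then rolls it over. The file name, size limit, and backup count below are arbitrary, and compression or long-term archiving of rolled files would be handled by separate tooling:

```python
import logging
from logging.handlers import RotatingFileHandler

# Append until maxBytes, then roll the active file to app.log.1,
# app.log.2, ... keeping backupCount archived files (names illustrative).
handler = RotatingFileHandler("app.log", maxBytes=1_000_000, backupCount=7)
handler.setFormatter(logging.Formatter("%(asctime)s %(levelname)s %(message)s"))

logger = logging.getLogger("audit")
logger.setLevel(logging.INFO)
logger.addHandler(handler)

logger.info("user alice updated ledger entry 42")  # appended, never overwritten
```

For compliance workloads, backupCount would be set (or supplemented by archiving jobs) so that rolled files are retained rather than eventually deleted.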
Retention policies, guided by standards like those in NIST guidelines, typically dictate storage durations based on organizational needs, such as seven years for SOX-related records, with secure offsite or encrypted storage to protect against loss or tampering.[21] Common tools leveraging append-only patterns include syslog for centralized system event collection, where configurations enforce sequential writes to files with append-only permissions to maintain integrity. In modern stacks like the ELK (Elasticsearch, Logstash, Kibana) ecosystem, data streams support append-only ingestion of timestamped logs, optimizing for high-volume event tracking while integrating rotation via index lifecycle policies that automate archiving and deletion after retention periods. These implementations emphasize efficient sequential write patterns to minimize overhead and enhance reliability in production environments.[22]
Databases and Storage Systems
In time-series databases, append-only mechanisms enable efficient sequential ingestion of metrics and events, particularly in high-volume scenarios such as IoT device monitoring and system performance tracking. For instance, InfluxDB employs an append-only storage engine optimized for time-series workloads, where data is written sequentially to leverage the predominantly immutable nature of such data, achieving high ingestion rates without the overhead of in-place updates.[23] Similarly, QuestDB utilizes append-only ingestion protocols like the InfluxDB Line Protocol to handle real-time streams from IoT sensors, supporting ingestion rates exceeding 800,000 rows per second while maintaining constant performance across varying data cardinalities in monitoring applications.[24][25] Append-only tables are a key feature in columnar storage systems, where new rows are added to the end of data structures without modifying existing schemas or records, facilitating scalable analytics on large datasets. In systems like ClickHouse, inserts generate independent data parts in a columnar format, allowing concurrent writes and efficient compression for analytical queries without locking or rewriting prior data, which supports workloads in business intelligence and data warehousing.[26] This design preserves schema stability while accommodating evolving data streams, making it ideal for online analytical processing (OLAP) environments where historical data integrity is paramount.[27] Write-ahead logging (WAL) represents a foundational append-only technique in relational databases for ensuring transaction durability. 
PostgreSQL, for example, records all database changes as sequential entries in an append-only WAL file before applying them to the main data files, guaranteeing crash recovery and atomicity even if the system fails mid-transaction.[28] This log-structured approach minimizes random I/O by batching writes, enhancing reliability in production environments handling mixed transactional and analytical loads. The adoption of append-only paradigms in databases evolved from early NoSQL systems in the mid-2000s, which emphasized log-structured storage to scale horizontally for web-scale data, to contemporary OLAP architectures that integrate these principles for append-heavy analytics. Influential NoSQL designs, such as those in Bigtable-inspired systems, prioritized sequential appends to boost write throughput on distributed clusters, as articulated in foundational discussions on log-based storage abstractions.[6] By the 2010s, this evolved into modern OLAP tools like ClickHouse and Druid, which optimize columnar append-only tables for real-time ingestion and ad-hoc querying, bridging the gap between NoSQL scalability and SQL compatibility in cloud-native data platforms.[29]
Distributed and Blockchain Systems
In distributed and blockchain systems, append-only structures form the foundation for maintaining immutable records across multiple nodes, enabling consensus and fault tolerance without centralized control. Blockchain ledgers exemplify this principle, where new blocks containing transactions are sequentially appended to the chain, with each block's header including a cryptographic hash of the previous block to enforce integrity and prevent tampering.[30] This design, first introduced in Bitcoin in 2009, ensures that altering any historical block would require recomputing the proof-of-work for all subsequent blocks, a computationally infeasible task under the majority honest node assumption.[30] The resulting structure provides a tamper-resistant, decentralized ledger that supports applications like cryptocurrencies and supply chain tracking. Distributed logging systems extend append-only principles to handle high-throughput event streaming in fault-tolerant environments. Apache Kafka, for instance, organizes data into append-only topics partitioned across a cluster of brokers, where producers append messages sequentially to maintain order, and these logs are replicated to multiple nodes for durability and availability.[31] This replication occurs at the partition level, typically with a factor of three copies, allowing the system to survive broker failures by reassigning leadership without data loss.[31] Kafka's design thus supports geo-replicated deployments for real-time analytics and microservices, where consumers can read from any offset to process events asynchronously. Consensus protocols in distributed systems leverage append-only logs to achieve agreement on replicated state machines. 
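The cryptographic linking described for blockchain ledgers above can be made concrete with a toy hash-chained log. This sketch deliberately omits proof-of-work, signatures, and networking, and its block layout is invented for illustration:

```python
import hashlib
import json

def append_block(chain, transactions):
    """Append a block whose contents commit to the previous block's hash,
    so altering any earlier block invalidates every later hash."""
    prev_hash = chain[-1]["hash"] if chain else "0" * 64
    body = json.dumps({"prev": prev_hash, "txs": transactions}, sort_keys=True)
    chain.append({"prev": prev_hash, "txs": transactions,
                  "hash": hashlib.sha256(body.encode()).hexdigest()})

def verify(chain):
    """Recompute every hash and check the prev-links; any tampering breaks
    the chain and verification fails."""
    prev_hash = "0" * 64
    for block in chain:
        body = json.dumps({"prev": block["prev"], "txs": block["txs"]},
                          sort_keys=True)
        if (block["prev"] != prev_hash
                or hashlib.sha256(body.encode()).hexdigest() != block["hash"]):
            return False
        prev_hash = block["hash"]
    return True
```

Rewriting an old block here would require recomputing every subsequent hash, which is the cheap analogue of the proof-of-work recomputation that makes tampering infeasible in deployed blockchains.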
In the Raft algorithm, leaders append new entries to their logs before issuing AppendEntries RPCs to followers, enforcing the leader append-only property that prohibits overwriting or deleting existing entries.[32] This mechanism ensures log matching and completeness, where committed entries—those replicated to a majority—are applied in the same order across nodes, providing linearizability for operations like database updates.[32] By restricting elections to nodes with up-to-date logs, Raft guarantees that all servers execute the same sequence of commands, enhancing safety in failure-prone clusters. Scaling append-only operations in geo-distributed systems introduces challenges, particularly in balancing latency and consistency. In setups spanning multiple data centers, appends must propagate across high-latency links, often relying on eventual consistency models where replicas converge over time rather than immediately.[33] Systems like those using causal consistency with append-only logs, as in geo-replicated stores, mitigate conflicts through timestamping and vector clocks, but face issues like increased coordination overhead and potential for temporary divergences during partitions.[33] These trade-offs prioritize availability, allowing reads and writes to continue locally while background replication resolves inconsistencies, though they require careful tuning to avoid amplifying tail latencies in global deployments.
Data Structures
Append-Only Files and Logs
Append-only files and logs serve as fundamental data structures for sequential, immutable storage, where new records are added exclusively to the end of the file without altering prior content. These structures typically consist of binary or text files opened in append mode, which ensures that writes always occur at the current end-of-file position. In Python, this is achieved using the open() function with the 'a' mode flag, which creates the file if it does not exist and appends new data while preserving existing content.[34] Similarly, in C++, the std::ofstream class supports append mode via the std::ios::app flag in its constructor, positioning the stream at the file's end before each output operation.[35] To handle variable-length records, entries are commonly prefixed with their byte lengths (e.g., a fixed-size integer indicating size) or separated by delimiters like newlines for text-based logs, enabling reliable sequential parsing without requiring random access.[36]
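A minimal sketch of length-prefixed records in Python, assuming a 4-byte big-endian length header (the framing format here is illustrative, not a standard):

```python
import struct

def append_record(path, payload: bytes):
    """Append one record, prefixed by a 4-byte big-endian length,
    so variable-length entries can be parsed back sequentially."""
    with open(path, "ab") as f:              # 'ab': binary append mode
        f.write(struct.pack(">I", len(payload)))
        f.write(payload)

def read_records(path):
    """Sequentially parse every record from the start of the file.
    (A torn final record would raise here; real formats add checksums.)"""
    records = []
    with open(path, "rb") as f:
        while header := f.read(4):           # b'' at EOF ends the loop
            (length,) = struct.unpack(">I", header)
            records.append(f.read(length))
    return records
```

Because each record carries its own length, a reader never needs to guess boundaries, and appends never touch previously written bytes.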
For efficient read operations on large append-only files, secondary indexing mechanisms are essential to avoid exhaustive linear scans from the beginning. Offset indexes, often implemented as in-memory hash maps mapping keys to byte positions within the file, allow quick seeking to specific records by directly jumping to the relevant offset.[37] Bloom filters can also be employed as compact, probabilistic auxiliaries to test for the potential presence of keys, filtering out non-matches before offset lookups and thus reducing I/O overhead in scenarios with sparse data distribution.[38] These indexes are typically maintained separately and updated incrementally during appends, ensuring the primary log remains a pure, flat sequence.
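An in-memory offset index over an append-only file can be sketched as follows. The record layout, class name, and NUL-separated key encoding are illustrative assumptions; a production system would persist the index, guard against keys containing the separator, and might place a Bloom filter in front of lookups as described above:

```python
import struct

class IndexedLog:
    """Append-only record log with an in-memory offset index, so reads
    seek directly to a key's latest record instead of scanning."""

    def __init__(self, path):
        self.path = path
        self.index = {}                  # key -> byte offset of latest record
        open(path, "ab").close()         # ensure the file exists

    def put(self, key: str, value: bytes):
        # Keys must not contain the NUL separator in this toy format.
        with open(self.path, "ab") as f:
            f.seek(0, 2)                 # end of file: where the append lands
            self.index[key] = f.tell()   # remember the record's start offset
            payload = key.encode() + b"\x00" + value
            f.write(struct.pack(">I", len(payload)) + payload)

    def get(self, key: str):
        offset = self.index.get(key)
        if offset is None:
            return None
        with open(self.path, "rb") as f:
            f.seek(offset)               # jump straight to the record
            (length,) = struct.unpack(">I", f.read(4))
            _, value = f.read(length).split(b"\x00", 1)
            return value
```

The log itself stays a pure, flat sequence: overwriting a key simply appends a newer record, and only the index entry moves to the new offset.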
In Unix-like systems, append-only behavior can be enforced at the filesystem level using the chattr command with the +a flag, which sets the append-only attribute on a file or directory. This restriction allows writes only in append mode and prevents truncation, deletion, or modification of existing content, but it requires superuser privileges or the CAP_LINUX_IMMUTABLE capability to set or unset.[39] For simple log implementations, Python's standard logging module uses FileHandler in append mode by default to record events sequentially, while C++ applications can leverage std::ofstream with std::ios::app for basic journaling, often combined with synchronization calls like fsync() for durability. These approaches prioritize straightforward, low-overhead writing over complex querying.
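A durable append of the kind described pairs a buffer flush with an explicit fsync, as in this minimal Python sketch (real systems typically batch or group-commit fsync calls to amortize their cost):

```python
import os

def durable_append(path, line: str):
    """Append one log line and force it to stable storage before
    returning (minimal sketch; one fsync per line is the slow,
    maximally safe variant)."""
    with open(path, "a") as f:       # 'a': append mode, never overwrites
        f.write(line + "\n")
        f.flush()                    # push Python's user-space buffer to the OS
        os.fsync(f.fileno())         # ask the OS to flush its cache to the device
```

Without the fsync, an acknowledged write could still be lost in the OS page cache on power failure, which is why write-ahead logs sync before reporting a transaction as committed.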
Append-only files find application in scenarios where immutability and simplicity outweigh advanced retrieval needs, such as backup archives and transaction journals. In deduplicating backup systems like Borg, repository segments are strictly append-only files that log PUT, DELETE, and COMMIT operations for chunks, ensuring transactional consistency and historical recoverability without overwriting prior states.[40] Transaction journals, such as those in immutable accounting ledgers, use append-only structures to record double-entry operations sequentially, maintaining an unalterable audit trail for financial reconciliation and error recovery.[41]