Distributed lock manager
A distributed lock manager (DLM) is a software component in distributed systems that coordinates access to shared resources, such as files, databases, or network services, across multiple nodes by granting and managing locks to ensure mutual exclusion and prevent conflicts like data corruption or race conditions.[1] DLMs typically operate by maintaining a shared lock database or state replicated across cluster nodes, allowing processes on different machines to request, acquire, convert, or release locks in a coordinated manner.[1] This mechanism is essential for high-availability clusters, enabling applications like clustered file systems (e.g., GFS2) or transaction processing to function reliably without centralized bottlenecks.[1]

In operation, a DLM supports various lock modes—ranging from shared read access (e.g., concurrent reads) to exclusive write access (e.g., single writer)—and handles lock conversions, such as promoting a shared lock to exclusive, while incorporating protocols for deadlock detection and recovery.[2] Communication between nodes often relies on protocols like TCP/IP or SCTP, with the system requiring cluster membership awareness, quorum for decisions, and fault tolerance to survive node failures as long as a majority remains operational.[1] Popular implementations include Redlock, which uses multiple independent Redis masters to acquire locks from a majority quorum for enhanced safety against single points of failure—though its safety guarantees in asynchronous networks have been debated[3][4]—and the DynamoDB Lock Client, which leverages conditional writes in Amazon DynamoDB tables for atomic lock acquisition with lease-based expiration.[5]

DLMs address key challenges in distributed environments, such as network partitions, clock skew, and asynchronous failures, by incorporating time-to-live (TTL) leases on locks to enable automatic release if a holder crashes, thus avoiding indefinite deadlocks.[3] For instance, in Redlock, locks are valid only if acquired within a specified time window accounting for potential clock drift, ensuring liveness while preserving mutual exclusion under the assumption that no more than a minority of nodes fail simultaneously.[3] These systems prioritize scalability and performance, often using fine-grained locking for efficiency, but trade-offs exist, such as increased latency from distributed coordination or the need for persistent storage to survive restarts.[5] Overall, DLMs form a foundational building block for reliable distributed applications.[6]

Overview
Definition and Purpose
A distributed lock manager (DLM) is a software component that operates across multiple nodes in a cluster, providing locking services to coordinate access to shared resources while maintaining a replicated, cluster-wide lock database on each participating machine.[7] This replication ensures that all nodes have a consistent view of lock states, enabling mutual exclusion even in the presence of concurrent requests from different processes.[8] The primary purpose of a DLM is to prevent concurrent access conflicts that could lead to data corruption or inconsistencies when multiple nodes attempt to modify shared resources, such as files or storage blocks, in a clustered environment.[7] By enforcing synchronized access, it supports high availability through active-active configurations and enhances scalability by distributing lock management responsibilities across nodes, allowing systems to handle increased workloads without centralized bottlenecks.[9] In essence, the DLM acts as a coordination mechanism to maintain data integrity in distributed setups where traditional single-node locking is insufficient.[10]

DLMs are commonly employed in cluster file systems for resource coordination, such as in the Global File System 2 (GFS2), where they synchronize metadata access across nodes sharing block devices to ensure immediate visibility of changes.[10] Similarly, the Oracle Cluster File System 2 (OCFS2) relies on a DLM to manage concurrent read-write operations on shared disks, preventing any single node from altering a file block while another accesses it.[11] Beyond file systems, DLMs facilitate resource allocation in high-availability clusters and synchronization in distributed databases, enforcing consistency for globally replicated data.[9] These use cases highlight the DLM's role in enabling reliable, multi-node operations in environments prone to challenges like network latency and node failures, which can disrupt communication and require mechanisms such as fencing to isolate faulty nodes.[8][9]

Historical Development
The origins of distributed lock managers (DLMs) trace back to the mid-1990s, when they emerged as a critical component for coordinating access in early cluster file systems. Development of the Global File System (GFS), one of the first scalable shared-disk file systems for Linux clusters, began in 1995 at the University of Minnesota, which sponsored the work until 2000.[12] Sistina Software, founded in 1997 to commercialize this research, integrated a proprietary DLM into GFS to manage locks across nodes, enabling concurrent access to shared storage without a central coordinator.[13] This marked an early milestone in distributed locking for open-source environments, addressing the growing demand for high-availability clustering in enterprise computing during the late 1990s.[14]

Proprietary systems like Veritas Cluster Server (VCS), introduced in the late 1990s for Unix platforms, further advanced distributed locking concepts by providing fault-tolerant resource management across nodes, including lock coordination for failover scenarios.[15] VCS's architecture influenced subsequent designs by emphasizing scalability and integration with storage solutions, setting a standard for commercial high-availability clusters before the widespread adoption of open-source alternatives.[16]

A pivotal shift occurred in December 2003, when Red Hat acquired Sistina for approximately $31 million in stock, gaining control of GFS and its DLM to bolster enterprise Linux offerings.[17] Following the acquisition's closure in early 2004, Red Hat open-sourced key components, including the DLM as OpenDLM, under the GPL on June 24, 2004, alongside GFS, LVM2 clustering extensions, and other infrastructure tools.[18] Integration of the DLM into the Linux kernel began with patches proposed in May 2005 against version 2.6.12, enabling native support for distributed locking in filesystems like GFS2 and OCFS2.[19] This upstreaming effort, led by Red Hat engineers such as Patrick Caulfield and David Teigland, standardized DLM for broader Linux adoption, transitioning it from proprietary roots to a core kernel module by the mid-2000s.[14] Red Hat played a central role in this standardization, embedding DLM into Red Hat Enterprise Linux (RHEL) clusters to support enterprise-grade high availability.[17]

In the 2010s, DLMs evolved to meet demands for larger-scale deployments, driven by the growth of virtualization and cloud computing.[20] Advancements in Pacemaker, a resource manager integrated with Corosync for cluster communication, enhanced DLM support for clusters exceeding dozens of nodes, introducing features like improved quorum handling and multi-site fencing for high-availability storage.[21] These developments, building on open-source foundations, extended DLM's applicability from traditional on-premises clusters to hybrid environments, influencing broader distributed systems paradigms. In the 2020s, DLMs have integrated further with cloud-native technologies, such as container orchestration platforms like Kubernetes and service meshes, providing distributed locking for microservices as of 2025.[22][6]

Core Concepts
Resources
In distributed lock managers (DLMs), resources refer to shared entities that require coordinated access across cluster nodes to maintain consistency and prevent conflicts. These resources can include physical objects such as shared files, devices, and volumes, as well as logical elements like database records, cache buffers, and abstract identifiers such as IP addresses in clustered environments.[23][7] For instance, in OpenVMS clusters, the DLM protects files, records within files, and network devices, while Linux-based DLMs like those in Red Hat Enterprise Linux safeguard shared storage objects such as filesystems and logical volumes.[24][25]

Resources are represented in the cluster-wide lock database through unique identifiers, typically names or IDs that all nodes recognize and maintain consistently. Each resource maintains an associated lock queue, which tracks pending and granted locks, managed centrally by the DLM across the cluster.[7][23] In systems like OpenDLM, resources are identified by names up to 31 characters in length, with an optional lock value block (LVB) for storing small amounts of application-specific data, such as sequence numbers or status flags, limited to 16 or 32 bytes.[25] This representation ensures that every node holds an identical view of the lock database, enabling atomic updates and conflict resolution.[24]

The lock space for resources employs either a flat or hierarchical namespace to organize identifiers globally. In flat namespaces, resources use simple unique names without structure, suitable for uniform entities, whereas hierarchical namespaces allow tree-like organization, such as nesting records under files or files under volumes.[23][7] For example, OpenVMS DLMs use tree-structured names with common prefixes (e.g., SYS$ for executive resources), facilitating conversion from local node-specific names to cluster-wide global ones during lock requests.[24] This namespace design supports scalability by distributing resource mastery across nodes based on activity.[25]

Lock granularity determines the scope of protection, balancing concurrency with overhead; finer granularity permits more parallel access but increases management complexity. DLMs support levels from coarse (e.g., entire databases or filesystems) to fine (e.g., individual records or byte ranges within files).[7] In Linux DLMs, granularity can extend to specific regions within resources, defined by offsets and lengths, allowing overlapping locks of compatible modes.[25] OpenVMS implementations achieve this through hierarchical naming, enabling locks on sub-resources like pages or buckets within a database while allowing concurrent operations on unrelated parts.[23][24]
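These concepts translate naturally into a per-resource record. The C sketch below is illustrative only and does not reproduce the actual Linux DLM structures: the type names, queue layout, and constants are assumptions chosen to mirror the ideas of named resources, per-resource lock queues, and an optional LVB.

```c
/* Illustrative sketch of one entry in a cluster-wide lock database;
 * not the real Linux DLM structures. */
#include <stdint.h>

#define RES_NAME_MAX 64   /* Linux DLM resource names may be up to 64 bytes */
#define LVB_LEN      32   /* LVBs are small: commonly 16 or 32 bytes */

struct lock_request {
    uint32_t lock_id;              /* handle returned to the requester */
    int      node_id;              /* cluster node that issued the request */
    int      mode;                 /* NL, CR, CW, PR, PW, or EX */
    struct lock_request *next;     /* next request in the same queue */
};

struct resource_entry {
    char name[RES_NAME_MAX];       /* cluster-wide unique identifier */
    int  master_node;              /* node currently mastering the resource */
    struct lock_request *granted;  /* locks currently held */
    struct lock_request *convert;  /* granted locks awaiting a mode change */
    struct lock_request *waiting;  /* blocked requests, served FIFO */
    unsigned char lvb[LVB_LEN];    /* optional application metadata */
};
```

In this layout, every node replicating the lock database holds identical resource_entry records, so granting a lock reduces to checking the requested mode against the granted queue and appending to the appropriate queue.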
Lock Modes
Distributed lock managers (DLMs) support multiple lock modes to control concurrent access to shared resources, balancing concurrency and exclusivity based on the intended operation.[2] The standard modes include shared locks for read-only access, which permit multiple holders, and exclusive locks for write access, which allow only a single holder.[2] These modes ensure data consistency in distributed environments by defining compatibility rules that prevent conflicting operations.[2]

In systems like the Linux DLM, six primary lock modes are defined, ranging from no restrictions to full exclusivity.[2] The null (NL) mode indicates no interest in the resource and is compatible with all other modes, serving as a placeholder.[2] Concurrent read (CR) mode allows read-only access and is compatible with all modes except exclusive (EX).[2] Concurrent write (CW) mode permits both read and write operations but is restricted to compatibility with NL, CR, and other CW modes.[2] Protected read (PR) mode provides read access while blocking writes from others, compatible with NL, CR, and other PR modes.[2] Protected write (PW) mode allows read and write but excludes other writes, compatible only with NL and CR.[2] Exclusive (EX) mode grants full read/write access to a single holder, compatible solely with NL.[2]

The interaction between these modes is governed by a compatibility matrix, which determines whether a requested lock can be granted alongside existing locks on the same resource.[2] The matrix below illustrates this for the Linux DLM, where "Yes" indicates compatibility and "No" indicates blocking:

| Requested \ Granted | NL | CR | CW | PR | PW | EX |
|---|---|---|---|---|---|---|
| NL | Yes | Yes | Yes | Yes | Yes | Yes |
| CR | Yes | Yes | Yes | Yes | Yes | No |
| CW | Yes | Yes | Yes | No | No | No |
| PR | Yes | Yes | No | Yes | No | No |
| PW | Yes | Yes | No | No | No | No |
| EX | Yes | No | No | No | No | No |
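Because the matrix is small and static, an implementation can encode it as a constant lookup table. The C sketch below does exactly that; the enum ordering, the compat table, and the grantable() helper are illustrative names, not taken from any particular DLM.

```c
#include <stdbool.h>
#include <stdio.h>

/* Lock modes in the order used by the compatibility matrix above. */
enum lock_mode { NL, CR, CW, PR, PW, EX };

/* compat[requested][granted] is true when the requested mode can coexist
 * with an already-granted lock in the granted mode. */
static const bool compat[6][6] = {
    /*            NL     CR     CW     PR     PW     EX  */
    /* NL */ { true,  true,  true,  true,  true,  true  },
    /* CR */ { true,  true,  true,  true,  true,  false },
    /* CW */ { true,  true,  true,  false, false, false },
    /* PR */ { true,  true,  false, true,  false, false },
    /* PW */ { true,  true,  false, false, false, false },
    /* EX */ { true,  false, false, false, false, false },
};

/* A request is grantable only if compatible with every granted lock. */
static bool grantable(enum lock_mode requested,
                      const enum lock_mode *granted, int n)
{
    for (int i = 0; i < n; i++)
        if (!compat[requested][granted[i]])
            return false;
    return true;
}

int main(void)
{
    enum lock_mode held[] = { CR, PR };  /* two readers hold the resource */
    printf("EX grantable: %d\n", grantable(EX, held, 2)); /* 0: blocked */
    printf("PR grantable: %d\n", grantable(PR, held, 2)); /* 1: granted */
    return 0;
}
```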
Lock Operations
Obtaining a Lock
In distributed lock managers (DLMs), the process of obtaining a lock generally involves a client application requesting access to a shared resource, with the DLM coordinating across nodes to ensure mutual exclusion. In master-based systems like the Linux DLM, this begins when a client on a node issues a request to its local DLM instance, typically through an API call such as dlm_lock(). This request specifies the resource identifier (a unique name up to 64 bytes), the desired lock mode (e.g., shared read or exclusive write), and optional flags that influence behavior, such as whether to attach a lock value block for additional data. The local DLM instance then communicates the request across the cluster, relying on cluster infrastructure such as Corosync (which uses UDP multicast for membership) and on reliable messaging over TCP/IP or SCTP to coordinate the lock with the relevant nodes.[27][2]
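As a concrete illustration, the sketch below issues an asynchronous lock request through libdlm, the userspace API referred to above. It is a minimal sketch, assuming a running corosync/dlm cluster stack, installed libdlm headers, and sufficient privileges; the lockspace name "demo" and the resource name are invented for the example, and error handling is abbreviated.

```c
/* Minimal libdlm example: request an exclusive lock asynchronously.
 * Compile (assumption): gcc acquire.c -o acquire -ldlm -lpthread */
#include <libdlm.h>
#include <stdio.h>
#include <string.h>
#include <unistd.h>

static struct dlm_lksb lksb;       /* lock status block: status + lock ID */

static void grant_ast(void *arg)   /* AST: runs when the request completes */
{
    (void)arg;
    printf("lock completion: status=%d lkid=%u\n",
           lksb.sb_status, lksb.sb_lkid);
}

int main(void)
{
    const char *name = "example-resource";   /* illustrative resource name */

    dlm_lshandle_t ls = dlm_create_lockspace("demo", 0600);
    if (ls == NULL) {
        perror("dlm_create_lockspace");
        return 1;
    }
    dlm_ls_pthread_init(ls);       /* helper thread that delivers ASTs */

    /* Request an exclusive (EX) lock on the named resource; the grant (or
     * failure) is reported later through grant_ast and lksb.sb_status. */
    if (dlm_ls_lock(ls, LKM_EXMODE, &lksb, 0, name, strlen(name), 0,
                    grant_ast, NULL, NULL, NULL) != 0) {
        perror("dlm_ls_lock");
        return 1;
    }
    sleep(5);                      /* crude wait for the AST in this demo */
    return 0;
}
```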
In such systems, the lock master—the designated node responsible for the specific resource—evaluates compatibility with existing locks held by other nodes, based on predefined mode hierarchies that prevent conflicts (e.g., exclusive modes block all others). If the request is compatible, the lock is granted immediately, and the master updates its internal state before notifying the requesting node via an asynchronous callback function (AST), which provides a lock handle (an opaque identifier) for future reference and tracking. If the request is incompatible, it is queued: pending mode conversions enter a convert queue, which is processed with priority, while blocked new requests join a wait queue served in first-in-first-out (FIFO) order; the master re-evaluates both queues as locks are converted or released, granting through the same AST notification mechanism. Lock handles, or cookies, serve as unique tokens to identify and manage the lock throughout its lifecycle across nodes.[2][7]
The granting process relies on protocols that ensure cluster-wide agreement, often involving the lock master probing current holders for compatibility and then notifying the requester to maintain consistent views. In the Linux DLM, a master node is dynamically elected for each resource (initially the requester), and all operations route through it via reliable cluster messaging over TCP/IP or SCTP to handle potential packet loss; this master-slave model facilitates global consistency without full consensus overhead. Node joins or leaves during a request trigger recovery protocols: if the master fails, locks are re-mastered to another node using cluster membership views, potentially suspending and replaying pending requests to avoid loss, while new joins receive the current lock state via synchronization messages.[2][7]
Alternative DLMs, such as Redlock, use a quorum-based approach where clients attempt to acquire locks on a majority of independent nodes (e.g., Redis masters) within a time window, incorporating time-to-live (TTL) leases to handle failures without a central master. Similarly, the DynamoDB Lock Client employs atomic conditional writes to a distributed table for lock acquisition, with automatic expiration via leases. To manage contention and failures, DLMs incorporate configurable timeouts and retry mechanisms during lock acquisition. Requests can be non-blocking, returning immediately with an error (e.g., -EAGAIN) if the lock cannot be granted without waiting, via flags like LKF_NOQUEUE (in Linux DLM), allowing applications to implement custom polling or exponential backoff retries. For blocking requests, a timeout parameter (in centiseconds) can be set to abandon the wait after a specified period, triggering the AST callback with a failure status; this prevents indefinite hangs in high-contention scenarios, with typical defaults balancing responsiveness and overhead in cluster environments.[3][5][27]
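To make the non-blocking path concrete, the following sketch layers an exponential-backoff retry over an LKF_NOQUEUE request using libdlm's synchronous wrapper. The helper name and backoff constants are invented, and the exact error conventions (return value versus sb_status, and the sign of EAGAIN) can differ between the kernel interface and libdlm versions, so treat this as a sketch rather than a reference.

```c
/* Try-lock with exponential backoff, a sketch assuming libdlm; LKF_NOQUEUE
 * makes an ungrantable request fail immediately instead of queueing. */
#include <errno.h>
#include <libdlm.h>
#include <string.h>
#include <unistd.h>

static int try_lock_backoff(dlm_lshandle_t ls, const char *name,
                            struct dlm_lksb *lksb, int max_tries)
{
    useconds_t delay = 1000;                     /* start at 1 ms */

    for (int i = 0; i < max_tries; i++) {
        int rv = dlm_ls_lock_wait(ls, LKM_EXMODE, lksb, LKF_NOQUEUE,
                                  name, strlen(name), 0,
                                  NULL, NULL, NULL);
        if (rv != 0)
            return -1;                           /* request itself failed */
        if (lksb->sb_status == 0)
            return 0;                            /* granted */
        if (lksb->sb_status != EAGAIN)
            return -1;                           /* unexpected completion */
        usleep(delay);                           /* contended: back off */
        delay *= 2;
    }
    return -1;                                   /* still contended */
}
```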
Releasing a Lock
In distributed lock managers (DLMs), releasing a lock involves the holder notifying the system to free the resource, allowing waiting requests to proceed and updating the shared state. In master-based systems like the Linux DLM, this begins when the lock holder invokes an unlock operation on its local DLM instance, typically using an API call such as dlm_unlock or dlm_ls_unlock with the lock handle or ID as input. This call removes the lock from the grant queue if it is currently held, or returns it to the grant queue from the convert or wait queue if the unlock cancels a pending request. The local DLM then updates the cluster-wide lock resource database to reflect the change in lock state; the unlocking caller's asynchronous callback (AST) routine completes with an appropriate status, such as -EUNLOCK, and any waiting requesters whose locks can now be granted are notified through their own AST routines, allowing them to proceed with their lock acquisition attempts.[2]
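A corresponding release through libdlm can look like the sketch below, continuing the earlier examples. It uses the synchronous *_wait wrapper so the call returns once the unlock has completed; the helper name is illustrative and error handling is abbreviated.

```c
/* Release a previously granted lock, a sketch assuming libdlm; sb_lkid
 * was filled in by the DLM when the lock was granted. */
#include <libdlm.h>

static int release_lock(dlm_lshandle_t ls, struct dlm_lksb *lksb)
{
    /* On success the lock leaves the resource's grant queue and any newly
     * compatible waiters are granted and notified through their ASTs. */
    return dlm_ls_unlock_wait(ls, lksb->sb_lkid, 0, lksb);
}
```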
To ensure consistency across the cluster, the unlock operation propagates the state change to all nodes via the DLM's inter-node communication protocol, with the master node for the resource coordinating the updates to maintain a synchronized view of the lock database. In cases of node failure, surviving nodes initiate forced releases of locks held by the failed node as part of the recovery process; this involves suspending lock-related activities until the failed node is fenced—isolated through mechanisms like power-off or network disconnection—to prevent split-brain scenarios, after which the locks are evicted and resources are re-mastered on healthy nodes.[2][28]
Cleanup following a successful release includes removing the lock entry from all relevant queues (grant, convert, or block), performing any necessary mode demotions for compatible locks, and updating the resource's overall state in the database; if the released lock is the last one on the resource, the resource itself is destroyed along with any associated lock value block (LVB). For performance optimization, DLMs support optional asynchronous release modes, where the unlock call returns immediately and dispatches the AST notification later via a work queue, reducing blocking in high-throughput environments. If the release fails due to network issues or other errors, the DLM returns an error code (e.g., EFAULT) in the lock status block, prompting the application to retry or handle the failure; in distributed setups, such errors may trigger broader recovery actions, including re-fencing and lock re-acquisition to restore consistency. In lease-based DLMs like Redlock, release involves explicitly deleting the lock key on the acquired nodes, but failure to do so results in automatic expiration via TTL to prevent deadlocks.[2][3]
Advanced Features
Lock Value Block
In some distributed lock managers, such as the Linux DLM used in Red Hat Enterprise Linux clusters, a lock value block (LVB) is an optional fixed-size data structure associated with a lock resource, enabling the atomic storage and retrieval of application-specific metadata tied to the resource's state.[2][29] This feature allows processes across distributed nodes to share concise global data, such as resource version numbers or synchronization flags, without requiring additional inter-process communication mechanisms. The block is allocated upon the creation of the lock resource and persists as long as at least one lock on that resource is held, ensuring consistent visibility to all contending participants.[2][29]

In usage, the lock value block facilitates efficient coordination for tasks like maintaining cache consistency or tracking resource modifications in clustered environments, where it might store details such as file offsets or update timestamps to avoid redundant synchronization. For instance, applications can embed metadata indicating local changes to a shared resource, allowing new lock holders to quickly assess and propagate state updates.[7][2]

Operations on the lock value block occur atomically during lock conversions or releases, where it can be read into the requesting process's buffer or written from it, ensuring thread-safe updates across the cluster. Upon releasing a lock in an exclusive or protected-write mode, any modifications to the block are propagated automatically to the distributed lock manager's database, making the updated value available to subsequent holders without explicit messaging. This integration streamlines lock obtain and release processes by embedding data handling directly into the locking protocol.[2][29]

Limitations of lock value blocks include inherent size constraints, typically designed to minimize storage and network overhead in distributed systems, which restrict their use to essential, compact data rather than arbitrary payloads. They are not intended as a general-purpose distributed database, focusing solely on lock-associated metadata to prevent performance degradation from excessive data transfer. Additionally, abrupt termination of a lock-holding process may invalidate the block's contents, necessitating application-level recovery mechanisms.[2][7]
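The sketch below shows this pattern through libdlm: LKF_VALBLK asks the DLM to copy the LVB into the caller's buffer when the lock is granted and to write it back when a write-mode lock is released. The version-counter scheme and helper name are illustrative assumptions, not drawn from a specific application.

```c
/* Read-modify-write of a lock value block, a sketch assuming libdlm. */
#include <libdlm.h>
#include <stdint.h>
#include <string.h>

#define LVB_LEN 32                        /* Linux DLM LVBs are 32 bytes */

static char lvb[LVB_LEN];
static struct dlm_lksb lksb = { .sb_lvbptr = lvb };

/* Take EX on the resource, bump a version counter kept in the LVB, and
 * release; the updated value becomes visible to the next lock holder. */
static int bump_version(dlm_lshandle_t ls, const char *name)
{
    if (dlm_ls_lock_wait(ls, LKM_EXMODE, &lksb, LKF_VALBLK,
                         name, strlen(name), 0, NULL, NULL, NULL) != 0
        || lksb.sb_status != 0)
        return -1;

    uint32_t version;
    memcpy(&version, lvb, sizeof(version));   /* value left by the prior
                                                 holder (zero if new) */
    version++;
    memcpy(lvb, &version, sizeof(version));

    /* Releasing an EX lock with LKF_VALBLK writes the LVB back. */
    return dlm_ls_unlock_wait(ls, lksb.sb_lkid, LKF_VALBLK, &lksb);
}
```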
Deadlock Detection and Resolution
Deadlocks in distributed lock managers arise from cycles in the wait-for graph, where processes on different nodes each hold locks required by others, creating cross-node dependencies that prevent progress. These distributed cycles differ from local deadlocks by spanning multiple nodes, often involving resources mastered by different lock managers. For instance, a process on node A may hold a shared lock on resource X while waiting for an exclusive lock on resource Y mastered by node B, while a process on node B holds the exclusive lock on Y and waits for X, forming a classic two-way cycle. Such deadlocks are exacerbated in high-concurrency environments due to network latencies and asynchronous lock requests.[30]

Detection algorithms for these deadlocks typically build a global wait-for graph by querying lock statuses across nodes, either periodically or on demand, to identify cycles through graph traversal. A seminal distributed approach is the edge-chasing algorithm by Chandy, Misra, and Haas, which propagates probe messages along wait-for edges from blocked processes; if a probe returns to its initiator, a cycle is confirmed without requiring a central coordinator. Timeout-based detection complements this by flagging prolonged waits as potential deadlocks, while some systems employ centralized detectors that aggregate local wait-for graphs from all nodes for analysis. These methods balance detection accuracy with communication overhead, as full graph construction can be costly in large clusters.[30][31][2]

Upon detection, resolution involves selecting a victim transaction—often based on criteria like least recently used or longest wait time—and aborting its locks to break the cycle, followed by asynchronous callbacks to notify the affected client for retry. In cluster environments, fencing may isolate the victim node's resources to prevent propagation of the conflict. Prevention strategies mitigate deadlocks proactively through lock ordering hierarchies, where resources are assigned unique global identifiers and locks are acquired in ascending order to eliminate cycles; timeouts on lock requests further break potential loops by forcing releases. Wait-for graphs serve as a key tool for monitoring dependency buildup and preempting cycles before they form, with incompatibilities between lock modes, such as exclusive versus shared, forming the wait-for edges in these graphs.[30][2]
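The centralized detection variant described above reduces to cycle detection on the aggregated wait-for graph. The C sketch below runs a depth-first search over an adjacency matrix; the process IDs and edges reproduce the two-node example from the text and are otherwise illustrative.

```c
/* Cycle detection on an aggregated wait-for graph (centralized variant). */
#include <stdbool.h>
#include <stdio.h>

#define NPROC 4

/* waits_for[a][b] is true when process a is blocked on a lock held by b. */
static bool waits_for[NPROC][NPROC];

static bool dfs(int p, bool *on_path, bool *visited)
{
    if (on_path[p]) return true;          /* back edge: cycle found */
    if (visited[p]) return false;
    visited[p] = on_path[p] = true;
    for (int q = 0; q < NPROC; q++)
        if (waits_for[p][q] && dfs(q, on_path, visited))
            return true;
    on_path[p] = false;
    return false;
}

int main(void)
{
    /* Two-node example from the text: A waits for B, B waits for A. */
    waits_for[0][1] = true;   /* P0 (node A) waits on P1 (node B) */
    waits_for[1][0] = true;   /* P1 waits on P0 */

    bool on_path[NPROC] = { false }, visited[NPROC] = { false };
    for (int p = 0; p < NPROC; p++)
        if (dfs(p, on_path, visited)) {
            printf("deadlock cycle detected involving process %d\n", p);
            return 0;
        }
    printf("no deadlock\n");
    return 0;
}
```

The edge-chasing approach avoids building this global graph: instead of a coordinator collecting all edges, each blocked process forwards a probe along its outgoing wait-for edges, and a cycle is declared only if the probe returns to its originator.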
Implementations
Linux Clustering
The Distributed Lock Manager (DLM) in Linux operates as a kernel module (fs/dlm/ in the kernel source tree) that enables symmetric distributed locking across cluster nodes, serving as the foundational component for coordinating access to shared resources in high-availability environments. It integrates closely with Corosync, which handles cluster messaging and membership via low-latency multicast protocols like Totem, and Pacemaker, which manages resource allocation, monitoring, and node fencing to ensure cluster integrity during failures. This kernel-level DLM has been a core element of open-source Linux clustering since its open-sourcing in 2004 and subsequent upstream integration, aligning with early efforts in the Open Cluster Framework for standardized cluster services.[32][33][34]
Configuration of DLM centers on defining lock spaces—logical namespaces that partition locks to prevent interference between applications or subsystems—which were traditionally specified in the /etc/cluster/cluster.conf file in legacy Cluster Manager (cman) setups, including parameters for lock space names, node participation, and tuning options like queue lengths. In modern Pacemaker/Corosync stacks (since RHEL 7/SLE 12), lock spaces are configured dynamically using tools like pcs or crm, such as creating a cloned DLM resource with pcs resource create dlm ocf:pacemaker:controld to ensure it runs on all nodes. These configurations support clusters of up to 16 nodes in RHEL and up to 32 nodes in SLE, leveraging Corosync's multicast for sub-millisecond message delivery in local networks, though scalability is constrained by kernel limits on lock tables and communication overhead.[7][35][8]
DLM finds primary application in shared-disk file systems like GFS2 (Global File System 2) and OCFS2 (Oracle Cluster File System 2), where it manages glocks (GFS locks) or cluster locks to allow concurrent read/write access across nodes while enforcing consistency on block devices. For instance, in a GFS2 setup, DLM coordinates metadata and journal locks to enable active-active storage sharing without single-node bottlenecks. It also supports high-availability configurations for databases, such as Pacemaker-orchestrated MySQL clusters using shared GFS2 storage for data directories, ensuring failover by releasing locks from failed nodes and recovering via fencing.[36][37][38]
Key performance enhancements in DLM include asynchronous lock operations via its user-space API (e.g., dlm_lock() with callbacks), which allow non-blocking requests to minimize contention in I/O-intensive workloads, and built-in journaling support for lock state recovery, integrated with file system journals like those in GFS2 to replay operations after node crashes in under a second for typical configurations. As of Linux kernel 6.x series (released starting 2022), DLM received stability fixes, such as improved mid-communication handling in multi-node setups and better address management to prevent kernel panics during cluster reconfiguration.[39]