Deferred Procedure Call
A Deferred Procedure Call (DPC) is a kernel-mode mechanism in the Microsoft Windows operating system that enables device drivers to postpone non-time-critical interrupt processing from an Interrupt Service Routine (ISR), which runs at a high Interrupt Request Level (IRQL), to a lower-priority execution context at DISPATCH_LEVEL IRQL.[1] This deferral ensures that ISRs complete quickly to minimize system latency, while allowing deferred tasks, such as completing I/O operations or updating device state, to execute later without blocking higher-priority interrupts.[1] DPCs are queued by the system when an ISR calls a routine such as IoRequestDpc, associating the call with a DPC object tied to the device's functional device object, which is initialized during driver startup.[1] Once queued, the DPC routine executes, by default, on the same processor that handled the interrupt, in the context of an arbitrary thread at DISPATCH_LEVEL, where it cannot access pageable memory or paged pool and must avoid operations that could block or introduce high latency.[1] Drivers may also create custom DPC objects for non-interrupt-related deferred work, such as timer expirations, using routines like KeInitializeDpc and KeInsertQueueDpc.[1]
In Windows kernel architecture, DPCs play a critical role in balancing responsiveness and efficiency, particularly for hardware drivers handling high-volume interrupts from devices like network cards or storage controllers.[1] Excessive DPC execution time can create system bottlenecks, measurable via tools like Event Tracing for Windows (ETW), and is a frequent optimization target for preventing audio glitches or input lag in real-time applications.[2] Unlike Asynchronous Procedure Calls (APCs), which are delivered to and execute in the context of a specific thread, DPCs operate strictly in kernel mode and are not bound to particular threads, making them suitable for interrupt-driven workloads.[3]
Overview
Definition
A Deferred Procedure Call (DPC) is a kernel-mode mechanism in the Microsoft Windows operating system designed to defer the execution of procedures from high-priority contexts, such as Interrupt Service Routines (ISRs), to a lower Interrupt Request Level (IRQL) called DISPATCH_LEVEL after the initial high-IRQL processing completes.[1] This deferral minimizes the time spent at elevated IRQLs during interrupt handling, improving overall system responsiveness and efficiency.[4] A DPC is represented by an opaque kernel structure known as KDPC. Drivers initialize a KDPC object using routines like KeInitializeDpc, which associates a callback routine and optional context with the DPC before it can be queued for execution.[5] DPCs execute exclusively at DISPATCH_LEVEL IRQL, positioned below the higher IRQLs of ISRs but above PASSIVE_LEVEL. At this level, interrupts at or below DISPATCH_LEVEL are masked and thread preemption by the scheduler is disabled, so a running DPC cannot be preempted by the scheduler or by other DPCs.[4] This IRQL ensures that DPC processing remains protected from routine software interrupts but can still be interrupted by higher-priority hardware interrupts when necessary.[6]
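The following minimal sketch illustrates the initialization just described, assuming a hypothetical driver with a device extension named MY_DEVICE_EXTENSION and a callback named MyDpcRoutine; it is illustrative only, not code from any shipping driver.

    #include <wdm.h>

    typedef struct _MY_DEVICE_EXTENSION {    // hypothetical device extension
        KDPC  CustomDpc;                     // the KDPC must live in nonpaged memory
        ULONG PendingEvents;
    } MY_DEVICE_EXTENSION, *PMY_DEVICE_EXTENSION;

    // KDEFERRED_ROUTINE callback: runs at IRQL = DISPATCH_LEVEL.
    VOID MyDpcRoutine(
        _In_ PKDPC Dpc,
        _In_opt_ PVOID DeferredContext,      // value passed to KeInitializeDpc
        _In_opt_ PVOID SystemArgument1,      // values passed to KeInsertQueueDpc
        _In_opt_ PVOID SystemArgument2)
    {
        UNREFERENCED_PARAMETER(Dpc);
        UNREFERENCED_PARAMETER(SystemArgument1);
        UNREFERENCED_PARAMETER(SystemArgument2);

        PMY_DEVICE_EXTENSION ext = (PMY_DEVICE_EXTENSION)DeferredContext;
        ext->PendingEvents = 0;              // deferred, non-pageable work only
    }

    // Called once during driver startup, at IRQL = PASSIVE_LEVEL.
    VOID MyInitializeDpc(PMY_DEVICE_EXTENSION ext)
    {
        KeInitializeDpc(&ext->CustomDpc, MyDpcRoutine, ext);
    }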
Purpose
Deferred Procedure Calls (DPCs) serve as a critical mechanism in the Windows kernel to minimize the execution time of Interrupt Service Routines (ISRs), which operate at high Interrupt Request Levels (IRQLs) and must complete rapidly to avoid blocking subsequent interrupts. By allowing ISRs to queue non-urgent tasks for later processing at the lower DISPATCH_LEVEL IRQL, DPCs enable drivers to offload work such as completing interrupt servicing or handling secondary operations, thereby reducing overall interrupt latency and preventing potential system hangs from prolonged ISR execution.[1][7] This deferral improves kernel performance by executing tasks in a more appropriate context, where resources such as memory and processing time are less constrained, and facilitates better resource management, including automatic switching to a dedicated per-processor DPC stack to prevent overflows of the limited ISR stack. In interrupt-driven environments, particularly for device drivers managing hardware events, DPCs ensure that high-priority interrupts are not unduly delayed by lengthy processing, which is essential for maintaining real-time responsiveness and avoiding disruptions in time-sensitive operations.[7][2]
However, misuse of DPCs, such as queuing excessive or inefficient routines, can cause backlogs in the per-processor DPC queues, and the resulting DPC latency degrades system performance. Microsoft guidelines recommend that individual DPC routines complete within roughly 100 microseconds; longer execution times become observable through tools like the Windows Performance Toolkit, which analyzes trace data to identify problematic drivers and quantify delays in DPC processing.[2][7]
Historical Development
Origins in Operating System Design
The concept of deferred procedure calls emerged in the 1970s and 1980s as operating systems grappled with the need to handle hardware interrupts efficiently without prolonging the time spent in interrupt service routines (ISRs), which could otherwise degrade system responsiveness. Early systems recognized that ISRs should perform only urgent acknowledgment and state saving, deferring non-critical processing to avoid blocking higher-priority interrupts or extending interrupt-disable periods excessively. This separation addressed fundamental limitations in interrupt architectures, such as those in the PDP-11 family, where vectored interrupts provided fast entry points but context switches, often taking hundreds of microseconds, were expensive enough that ISR execution had to be minimized to maintain throughput in time-sharing environments.[8]
One of the earliest structured approaches appeared in Multics, a pioneering time-sharing system operational from 1969, which implemented a dedicated interrupt interceptor to route hardware signals to appropriate handlers while supporting deferred processing through supervisor-level mechanisms for faults such as page faults. In Multics, interrupts triggered supervisor intervention to resolve conditions asynchronously, allowing the main program to resume quickly while deferred actions, such as memory management, were queued for later execution outside the immediate interrupt context. This design emphasized modularity and influenced subsequent kernels by demonstrating how interrupt handling could integrate with higher-level abstractions like event channels for non-urgent work.[9][10]
In Unix, particularly with Version 7 released in 1979, the bottom-half mechanism formalized interrupt deferral by using software interrupts to schedule post-ISR work at a lower priority, keeping ISRs brief. The top half of an interrupt handler would quickly acknowledge the event and set flags or queue data, while the bottom half, executed via a software interrupt, handled deferred tasks such as I/O completion without re-enabling interrupts prematurely. This approach, detailed in contemporary documentation, suited the PDP-11's constraints by reducing ISR latency to tens of instructions and promoted kernel modularity in multiprogrammed settings.[11][12]
Similarly, the Virtual Memory System (VMS), introduced in 1978, employed asynchronous system traps (ASTs) and fork procedures to defer execution of routines outside the primary thread, delivering notifications at specified priority levels after interrupt acknowledgment. ASTs allowed processes to queue callbacks for events such as resource availability, executed asynchronously to prevent ISR bloat, while fork procedures queued lightweight kernel contexts to run deferred driver computation at a lower interrupt priority level, enhancing real-time capabilities on VAX hardware. These mechanisms addressed early multiprocessor needs by enabling deferred work to run on idle CPUs, laying groundwork for structured deferral in later operating systems, including Windows NT.[13]
Implementation in Windows NT
The implementation of Deferred Procedure Calls (DPCs) in Windows NT originated with the release of Windows NT 3.1 in 1993, where DPCs were introduced as a core kernel mechanism for deferring interrupt-related work. Developed by the Windows NT team led by Dave Cutler, a veteran of Digital Equipment Corporation's VMS operating system, DPCs adapted concepts such as VMS fork procedures to enable efficient handling of time-sensitive tasks in a multiprocessor environment. This design choice was integral to the NT kernel and executive, facilitating asynchronous I/O operations and device driver management by allowing interrupt service routines (ISRs) to queue callbacks for execution at a lower interrupt request level (IRQL).[14][15][16]
In Windows NT versions prior to Vista, DPCs relied on Inter-Processor Interrupts (IPIs) to dispatch high-importance routines across multiple processors, ensuring prompt execution but incurring overhead from cross-processor communication. With the advent of Windows Vista in 2007, Microsoft refined DPC dispatching around configurable importance levels, adding a medium-high level alongside the existing low, medium, and high levels, which optimized queue placement and reduced IPI usage in multiprocessor systems for better scalability and latency control. These changes allowed drivers to specify DPC urgency via APIs like KeSetImportanceDpc, placing high-importance DPCs at the front of per-processor queues to prioritize critical tasks without always triggering expensive IPIs.[7][17][18]
DPCs have remained a foundational element of the Windows NT kernel lineage, deeply integrated with the executive for managing I/O request packets (IRPs) and device interactions, and continuing to evolve through subsequent releases up to Windows 11 as of 2025. Key milestones include refinements to per-processor queue management in Windows NT 4.0 (1996) to handle growing system complexity, and the introduction of threaded DPCs in Windows Vista, which enabled low-priority DPCs to execute in dedicated kernel threads rather than directly in the DPC dispatch context, mitigating latency in multimedia and real-time scenarios. This persistence underscores the role of DPCs in balancing responsiveness and efficiency across the NT kernel's three-decade evolution.[1][19]
Mechanism and Implementation
DPC Objects and Initialization
In the Windows kernel, Deferred Procedure Call (DPC) objects are represented by the opaque KDPC structure, which drivers allocate from resident (nonpaged) memory such as a device extension or nonpaged pool but do not directly manipulate.[5] The KDPC structure includes fields such as Type (indicating the object type, e.g., DpcObject or ThreadedDpcObject), Importance (specifying priority levels like MediumImportance or HighImportance), DpcListEntry (a LIST_ENTRY or SINGLE_LIST_ENTRY used for queuing), DeferredRoutine (a pointer to the callback function), DeferredContext (a driver-supplied context value), SystemArgument1 and SystemArgument2 (additional parameters passed to the callback), and TargetProcessor (for processor affinity targeting).[20] These fields are managed internally by the kernel, and drivers interact with them solely through documented APIs to maintain system stability.[21]
To initialize a custom DPC object, drivers call the KeInitializeDpc routine, providing a pointer to the allocated KDPC structure, a pointer to the DeferredRoutine callback, and an optional DeferredContext value.[5] This routine prepares the DPC for later queuing via kernel functions like KeInsertQueueDpc, allowing drivers to defer non-urgent processing from high-IRQL contexts such as interrupt service routines.
For DPCs associated with specific device objects, the system automatically provides one pre-allocated DPC object per DEVICE_OBJECT, which drivers initialize by calling IoInitializeDpcRequest with the device object pointer and a pointer to an IO_DPC_ROUTINE (also known as the DpcForIsr routine).[22] This associates the callback with the device's DPC, enabling queuing through IoRequestDpc from the driver's ISR.[23] Drivers often create additional custom DPCs beyond the system-supplied one per device object, storing the KDPC in driver-allocated nonpaged memory for specialized deferral needs, such as timer callbacks or multi-device handling.[24] However, drivers have no direct access to the internal contents of any DPC object, whether system-provided or custom, as all configuration and management occur via kernel APIs like KeInitializeDpc or IoInitializeDpcRequest.[1]
The callback routine specified during initialization follows the KDEFERRED_ROUTINE signature: VOID (*KDEFERRED_ROUTINE)(IN PKDPC Dpc, IN PVOID DeferredContext OPTIONAL, IN PVOID SystemArgument1 OPTIONAL, IN PVOID SystemArgument2 OPTIONAL).[25] Here, the Dpc parameter points to the KDPC object, DeferredContext carries driver-specific data from initialization, and SystemArgument1 and SystemArgument2 provide optional system- or driver-supplied arguments, enabling flexible handling of deferred tasks at DISPATCH_LEVEL IRQL.[25]
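A hedged sketch of the device-associated pattern described above might look as follows; the names MyDpcForIsr and MySetupDeviceDpc are hypothetical, and only IoInitializeDpcRequest and the IO_DPC_ROUTINE signature come from the documented interface.

    #include <wdm.h>

    // IO_DPC_ROUTINE (DpcForIsr): queued via IoRequestDpc, runs at DISPATCH_LEVEL.
    VOID MyDpcForIsr(
        _In_ PKDPC Dpc,
        _In_ PDEVICE_OBJECT DeviceObject,
        _Inout_ PIRP Irp,
        _In_opt_ PVOID Context)
    {
        UNREFERENCED_PARAMETER(Dpc);
        UNREFERENCED_PARAMETER(Context);
        UNREFERENCED_PARAMETER(DeviceObject);
        UNREFERENCED_PARAMETER(Irp);
        // Finish the deferred portion of interrupt processing here,
        // e.g. complete the transfer described by Irp for DeviceObject.
    }

    // During device setup, associate MyDpcForIsr with the device object's
    // built-in DPC object; the I/O manager supplies the KDPC itself.
    NTSTATUS MySetupDeviceDpc(PDEVICE_OBJECT DeviceObject)
    {
        IoInitializeDpcRequest(DeviceObject, MyDpcForIsr);
        return STATUS_SUCCESS;
    }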
Queuing and Execution
In the Windows kernel, the queuing of a Deferred Procedure Call (DPC) typically occurs from an Interrupt Service Routine (ISR) running at an Interrupt Request Level (IRQL) higher than DISPATCH_LEVEL. For device-associated DPCs, the ISR calls IoRequestDpc, providing the device object, IRP, and context to queue the associated DPC routine. For custom DPCs, the ISR invokes KeInsertQueueDpc, passing a pointer to the initialized KDPC structure along with optional system arguments for context. Both mechanisms insert the DPC into the target processor's per-processor DPC queue, located within the processor control block (PRCB), if the DPC is not already queued; KeInsertQueueDpc returns TRUE upon successful insertion and FALSE otherwise.[26][23][18]
The kernel executes DPCs at DISPATCH_LEVEL IRQL, which is lower than typical ISR levels but higher than thread execution levels, ensuring they preempt normal kernel code while remaining interruptible by higher-priority hardware interrupts. To prevent stack overflow from the limited ISR stack space, the kernel switches execution to a dedicated per-processor DPC stack during processing. The DPC queue is drained through mechanisms such as a software interrupt raised at DISPATCH_LEVEL or by the system idle thread on the processor, and each callback routine specified in a KDPC runs synchronously until completion.[1][27]
Dispatching of queued DPCs begins when the processor's IRQL drops below DISPATCH_LEVEL, often immediately after the ISR returns or at the end of the current thread's time quantum. For DPCs targeted at a remote processor via KeSetTargetProcessorDpc, the kernel may send an Inter-Processor Interrupt (IPI) to the target if the DPC importance is high, prompting it to drain its queue promptly; otherwise, execution awaits natural opportunities such as quantum end or idling. The DPC dispatcher removes items from the queue in order, executing each routine to completion until the queue is empty.[18][28]
Queue management in the Windows kernel relies on per-processor lists to minimize contention in multiprocessor systems, with each PRCB maintaining a separate DPC queue ordered by importance: normal DPCs are appended to the end, while high-importance DPCs (set via KeSetImportanceDpc) are inserted at the front for earlier execution. If the queue depth or insertion rate exceeds system thresholds, the kernel accelerates draining to prevent backlog; low-importance DPCs may be deferred during high load to prioritize urgent work, though the system does not reject a queuing request outright unless the DPC is already pending.[18]
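The ISR-side queuing described above could be sketched roughly as follows; it assumes a custom KDPC named g_CustomDpc that was initialized elsewhere with KeInitializeDpc, and the hardware-specific acknowledgment is omitted.

    #include <wdm.h>

    extern KDPC g_CustomDpc;   // initialized earlier with KeInitializeDpc (assumed)

    // KSERVICE_ROUTINE: runs at DIRQL, so it only acknowledges the device
    // and defers the remaining work to the DPC.
    BOOLEAN MyInterruptService(
        _In_ PKINTERRUPT Interrupt,
        _In_ PVOID ServiceContext)
    {
        UNREFERENCED_PARAMETER(Interrupt);

        // ... read and clear the hardware interrupt status here (device-specific) ...

        // Queue the DPC; FALSE means it was already pending, in which case the
        // single pending instance will also handle this event.
        if (!KeInsertQueueDpc(&g_CustomDpc, ServiceContext, NULL)) {
            // Already queued; typically record the extra event in shared state.
        }
        return TRUE;   // the interrupt belonged to this device
    }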
Types of DPCs
Ordinary DPCs
Ordinary DPCs, also known as normal DPCs, represent the standard type of deferred procedure call in the Windows kernel. They execute at DISPATCH_LEVEL interrupt request level (IRQL) in kernel mode and operate at the default medium importance level unless otherwise specified. By default, they are queued to the processor currently executing the queuing routine, such as through a call to KeInsertQueueDpc without a pre-set target, so execution occurs on the same CPU to maintain locality.[26][29] This default affinity simplifies implementation for single-processor scenarios or when work is inherently tied to the interrupting CPU.
Drivers can optionally designate a target processor for an ordinary DPC using KeSetTargetProcessorDpc after initialization with KeInitializeDpc but before queuing. The target can be a specific zero-based processor number, the current processor ((CCHAR)-1), or any available processor ((CCHAR)-2). This feature facilitates load balancing in symmetric multiprocessing (SMP) environments by allowing work to be offloaded to less busy CPUs. If targeted to a different processor, the kernel may issue an inter-processor interrupt (IPI) to prompt execution, depending on the DPC's importance level and system state.[30]
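A brief, hypothetical example of such targeting, assuming a DPC that was already initialized with KeInitializeDpc:

    #include <wdm.h>

    // Direct a custom DPC at processor 1 (zero-based) before it is queued.
    VOID MyTargetDpc(PKDPC MyDpc)
    {
        KeSetTargetProcessorDpc(MyDpc, 1);   // subsequent KeInsertQueueDpc calls
                                             // place it on processor 1's queue
    }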
Ordinary DPCs are particularly suited for quick, local deferrals of non-urgent tasks that must be postponed from higher IRQLs, such as completing I/O operations initiated by an interrupt service routine (ISR). For instance, after an ISR acknowledges an interrupt and performs minimal hardware handling, an ordinary DPC can dequeue the next I/O request packet (IRP) for processing, complete the current IRP if feasible, or reprogram the device for subsequent transfers or error retries.[4] Such use cases exploit the DPC's ability to run at DISPATCH_LEVEL IRQL, where it can call kernel routines unavailable at higher IRQLs (though it still cannot access pageable memory), while keeping interrupt latency low.
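A typical DpcForIsr of this kind might be sketched as follows, assuming the driver uses the system-supplied device queue (IoStartPacket/IoStartNextPacket); the name MyTransferDpcForIsr and the zero byte count are placeholders.

    #include <wdm.h>

    // Finishes the current request and starts the next one from the device queue.
    VOID MyTransferDpcForIsr(
        _In_ PKDPC Dpc,
        _In_ PDEVICE_OBJECT DeviceObject,
        _Inout_ PIRP Irp,
        _In_opt_ PVOID Context)
    {
        UNREFERENCED_PARAMETER(Dpc);
        UNREFERENCED_PARAMETER(Context);

        Irp->IoStatus.Status = STATUS_SUCCESS;
        Irp->IoStatus.Information = 0;             // bytes transferred (assumed)

        IoStartNextPacket(DeviceObject, FALSE);    // begin the next queued IRP
        IoCompleteRequest(Irp, IO_NO_INCREMENT);   // complete the finished one
    }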
In terms of behavior, ordinary DPCs are inserted into the target processor's queue and processed in first-in, first-out (FIFO) order among DPCs of similar importance. If the queue was previously empty, queuing an ordinary DPC triggers immediate processing at DISPATCH_LEVEL upon return from the ISR, provided the importance is not set to low. The KeInsertQueueDpc routine returns TRUE if the DPC is successfully queued (indicating it was not already pending) or FALSE if it was already in the queue, preventing duplicate insertions.[18][26]
Ordinary DPCs support configurable importance levels via KeSetImportanceDpc, which influence queue positioning and dispatch timing: LowImportance places the DPC at the end of the queue without triggering immediate processing; MediumImportance (default) appends to the end but initiates queue processing promptly; MediumHighImportance, introduced in Windows Vista, appends to the end while enabling more aggressive dispatching; and HighImportance positions the DPC at the queue's head and forces immediate execution. These levels evolved from the basic low/medium/high scheme in early Windows NT implementations to provide finer control in modern multiprocessor kernels.[29][31]
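For illustration, a driver might configure two hypothetical DPCs (ErrorDpc and HousekeepingDpc) with different importance levels as in the sketch below; only KeSetImportanceDpc and the importance values are part of the documented interface.

    #include <wdm.h>

    // Mark a latency-sensitive DPC as high importance so it is inserted at the
    // head of the target processor's queue and dispatched immediately, and mark
    // a background DPC as low importance so it waits for a convenient moment.
    VOID MyConfigureDpcImportance(PKDPC ErrorDpc, PKDPC HousekeepingDpc)
    {
        KeSetImportanceDpc(ErrorDpc, HighImportance);
        KeSetImportanceDpc(HousekeepingDpc, LowImportance);
    }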
Despite their simplicity, ordinary DPCs have limitations in SMP environments, as default queuing to the current processor can result in uneven load distribution across CPUs, potentially leading to bottlenecks on heavily interrupted processors without built-in balancing for inter-processor deferral. Additionally, since they run at DISPATCH_LEVEL, they cannot perform operations requiring PASSIVE_LEVEL, such as accessing pageable code or handling page faults, which may limit their use for more complex deferred work.[18]
Threaded DPCs
Threaded DPCs represent an advanced variant of deferred procedure calls introduced in Windows Vista and available in subsequent Windows versions. Unlike ordinary DPCs, threaded DPCs are designed to execute at PASSIVE_LEVEL IRQL in the context of a dedicated high-priority system thread, so they can be preempted by more time-critical work and do not add to DISPATCH_LEVEL latency, making them suitable for longer-running deferred tasks that would be impractical as ordinary DPCs. Threaded DPCs are enabled by default, but the feature can be disabled system-wide, so drivers cannot rely on their routines running at PASSIVE_LEVEL.[32]
To set up a threaded DPC, a driver initializes a DPC object using KeInitializeThreadedDpc instead of KeInitializeDpc. Like ordinary DPCs, threaded DPCs support targeting a specific processor via KeSetTargetProcessorDpc (or KeSetTargetProcessorDpcEx for processor groups in Windows 7 and later) and importance levels via KeSetImportanceDpc. They are queued using KeInsertQueueDpc or related routines, and if targeted to a different processor, an IPI may be used to schedule execution on the target. Because they run in a thread context, however, threaded DPCs incur slightly higher dispatch latency than ordinary DPCs while reducing the risk of DPC queue buildup and system-wide delays.[30][31][29][7]
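A hedged sketch of threaded DPC setup follows, using the hypothetical names g_ThreadedDpc, MyThreadedDpcRoutine, and MySetupThreadedDpc; the routine checks the IRQL it actually runs at because threaded DPCs can fall back to ordinary DISPATCH_LEVEL execution.

    #include <wdm.h>

    KDPC g_ThreadedDpc;    // must reside in nonpaged memory (assumed global)

    // The usual KDEFERRED_ROUTINE signature is used; because threaded DPCs can
    // be disabled system-wide, the routine must tolerate either IRQL.
    VOID MyThreadedDpcRoutine(
        _In_ PKDPC Dpc,
        _In_opt_ PVOID DeferredContext,
        _In_opt_ PVOID SystemArgument1,
        _In_opt_ PVOID SystemArgument2)
    {
        UNREFERENCED_PARAMETER(Dpc);
        UNREFERENCED_PARAMETER(DeferredContext);
        UNREFERENCED_PARAMETER(SystemArgument1);
        UNREFERENCED_PARAMETER(SystemArgument2);

        if (KeGetCurrentIrql() == PASSIVE_LEVEL) {
            // Running in the dedicated DPC thread: preemptible work is allowed.
        } else {
            // Fallback: running at DISPATCH_LEVEL as an ordinary DPC.
        }
    }

    VOID MySetupThreadedDpc(PVOID Context)
    {
        KeInitializeThreadedDpc(&g_ThreadedDpc, MyThreadedDpcRoutine, Context);
        // Queued later with KeInsertQueueDpc, exactly like an ordinary DPC.
    }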
When queued, a threaded DPC is placed in a separate per-processor queue (distinct from the ordinary DPC queue) maintained in the processor control block (PRCB). The system processes these queues when dropping to DISPATCH_LEVEL or during idle time, scheduling the threaded DPC for execution by a dedicated per-processor thread at PASSIVE_LEVEL. If threaded DPC execution is disabled, the DPC instead runs at DISPATCH_LEVEL as an ordinary DPC. High-importance threaded DPCs can trigger immediate IPIs for faster dispatching, while lower-importance ones wait for idle periods to minimize interference with active workloads.[18]
The primary advantages of threaded DPCs lie in their ability to handle resource-intensive deferred work without blocking kernel dispatch, improving overall system responsiveness in multiprocessor environments. They are commonly used for tasks like complex I/O completions, network packet processing, or timer callbacks that benefit from full kernel API access. However, drivers must ensure the DPC routine is written to execute safely at DISPATCH_LEVEL as a fallback and avoid recursive queuing to prevent stack overflows or deadlocks. This approach enhances efficiency for modern hardware drivers while maintaining compatibility with legacy behaviors.[32][18]
Usage in Device Drivers
Common Scenarios
In network device drivers, the interrupt service routine (ISR) quickly acknowledges the receipt of incoming packets, typically disabling further interrupts from the hardware, and queues a deferred procedure call (DPC) to process the packet data and complete the associated I/O request packet (IRP). This division ensures that the ISR executes briefly at device IRQL (DIRQL), deferring resource-intensive tasks like data copying or protocol processing to the DPC at DISPATCH_LEVEL, thereby reducing system interrupt latency.[33][34]
DPCs also integrate with kernel timers in device drivers, serving as the callback mechanism invoked by functions such as KeSetTimer or KeSetTimerEx to execute periodic operations. For instance, a driver might set a recurring timer to poll hardware registers for status updates, with the associated DPC handling the polling logic, error checking, or resource management without requiring constant high-priority interrupt handling.[35][25]
In storage device drivers built on the StorPort framework, the ISR responds to hardware interrupts by performing minimal acknowledgment and queuing a DPC using StorPortIssueDpc for deferred execution. The resulting DPC routine, such as HwStorDpcRoutine, then manages post-interrupt tasks including buffer synchronization, data transfer completion, or logging of I/O events, allowing the driver to handle disk operations efficiently while keeping time spent at DIRQL short.[36][37]
Audio device drivers use DPCs to address buffer underruns detected during interrupt handling: the ISR identifies the event but defers buffer refilling or stream adjustment to the DPC to avoid prolonging the high-IRQL phase. This approach enables real-time audio processing without blocking in the ISR, mitigating playback glitches by shifting buffer management to DISPATCH_LEVEL execution.[1]
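The timer-driven polling scenario above might be sketched as follows, with hypothetical names (g_PollTimer, g_PollDpc, MyPollDpcRoutine, MyStartPolling) and an assumed 10-millisecond period.

    #include <wdm.h>

    KTIMER g_PollTimer;    // nonpaged (assumed globals for brevity)
    KDPC   g_PollDpc;

    // Runs at DISPATCH_LEVEL each time the periodic timer expires.
    VOID MyPollDpcRoutine(
        _In_ PKDPC Dpc,
        _In_opt_ PVOID DeferredContext,
        _In_opt_ PVOID SystemArgument1,
        _In_opt_ PVOID SystemArgument2)
    {
        UNREFERENCED_PARAMETER(Dpc);
        UNREFERENCED_PARAMETER(SystemArgument1);
        UNREFERENCED_PARAMETER(SystemArgument2);
        UNREFERENCED_PARAMETER(DeferredContext);
        // Poll hardware status registers via DeferredContext (device-specific).
    }

    // Start polling every 10 ms; the due time is relative (negative) and in
    // 100-nanosecond units, while the period argument is in milliseconds.
    VOID MyStartPolling(PVOID DeviceContext)
    {
        LARGE_INTEGER dueTime;
        dueTime.QuadPart = -10 * 1000 * 10;    // first expiration 10 ms from now

        KeInitializeTimerEx(&g_PollTimer, SynchronizationTimer);
        KeInitializeDpc(&g_PollDpc, MyPollDpcRoutine, DeviceContext);
        KeSetTimerEx(&g_PollTimer, dueTime, 10, &g_PollDpc);   // 10 ms period
    }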
Best Practices and Limitations
When implementing Deferred Procedure Calls (DPCs) in Windows device drivers, developers must adhere to strict guidelines to ensure system stability and performance. DPC routines should execute quickly, ideally completing within 100 microseconds, to minimize interference with other kernel operations and prevent delays in system responsiveness.[27] For tasks exceeding this threshold, routines should instead queue work items using IoQueueWorkItem or ExQueueWorkItem so that the deferred processing runs at PASSIVE_LEVEL, avoiding prolonged execution at DISPATCH_LEVEL.[27] Synchronization with interrupt service routines (ISRs) or shared resources requires spin locks or KeSynchronizeExecution with a critical-section routine, as these mechanisms can be used safely at DISPATCH_LEVEL.[27]
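As an illustration of handing long-running work to a system worker thread, the following hypothetical sketch assumes the driver allocated a PIO_WORKITEM at PASSIVE_LEVEL (for example with IoAllocateWorkItem), passed it as the DPC's DeferredContext, and frees it later with IoFreeWorkItem.

    #include <wdm.h>

    // IO_WORKITEM_ROUTINE: runs at PASSIVE_LEVEL in a system worker thread.
    VOID MySlowPathWorker(
        _In_ PDEVICE_OBJECT DeviceObject,
        _In_opt_ PVOID Context)
    {
        UNREFERENCED_PARAMETER(DeviceObject);
        UNREFERENCED_PARAMETER(Context);
        // Lengthy or pageable work goes here, well past the ~100 us DPC budget.
    }

    // DPC routine: does only the brief time-critical part, then defers the rest.
    VOID MyOffloadDpcRoutine(
        _In_ PKDPC Dpc,
        _In_opt_ PVOID DeferredContext,    // assumed to be a PIO_WORKITEM
        _In_opt_ PVOID SystemArgument1,
        _In_opt_ PVOID SystemArgument2)
    {
        UNREFERENCED_PARAMETER(Dpc);
        UNREFERENCED_PARAMETER(SystemArgument1);
        UNREFERENCED_PARAMETER(SystemArgument2);

        // Assumes the work item is not already queued.
        IoQueueWorkItem((PIO_WORKITEM)DeferredContext,
                        MySlowPathWorker,
                        DelayedWorkQueue,
                        NULL);
    }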
DPC routines must avoid operations that could block or sleep, such as acquiring mutexes, performing paging I/O, or accessing pageable code and data, since these are prohibited at DISPATCH_LEVEL and can lead to deadlocks or system crashes.[27] Developers should also refrain from using KeStallExecutionProcessor for delays longer than 100 microseconds, opting instead for timer-based DPCs to schedule follow-up work.[27] To verify compliance, the Windows Driver Kit (WDK) provides tools like ETW-based tracing (e.g., via tracelog) for measuring DPC execution times during development and testing.[27]
Key limitations of DPCs include their inability to block, which restricts them to non-waiting operations, and the reliance on the kernel stack, limited to approximately 12 KB on x86 systems (or 24 KB on x64), posing a risk of stack overflow from deep call chains, large local variables, or recursive calls.[38] High volumes of queued DPCs can result in latency spikes, as they preempt lower-priority threads and accumulate in per-processor queues, potentially disrupting real-time applications like audio or video processing.[39] Troubleshooting such issues involves analyzing traces with Windows Performance Analyzer (WPA), which visualizes DPC/ISR durations, queue depths, and offending modules to identify problematic drivers.[39]
In modern Windows versions (Windows Vista and later), threaded DPCs offer an evolution for lower-priority work, executing at PASSIVE_LEVEL in dedicated kernel threads to reduce the impact on real-time latency, though they introduce slight overhead compared to traditional DPCs; they are enabled by default but can be disabled system-wide.[32] For non-time-critical tasks, system work items remain preferable over DPCs to further mitigate stack and latency risks.[27]