Dynamic program analysis
Dynamic program analysis is the examination of a software program's properties and behavior during its execution, typically through instrumentation or monitoring of one or more specific runs, to infer runtime characteristics such as control flow, data dependencies, and resource usage.[1] Unlike static program analysis, which inspects source or object code without running the program to reason about all possible executions, dynamic analysis provides precise insights into observed behaviors but may miss unexecuted paths.[1] This approach is fundamental in software engineering for tasks requiring empirical evidence of program operation, including error detection and optimization.[2]
Key applications of dynamic program analysis span debugging, performance profiling, program comprehension, testing, and security verification, where it helps identify issues like memory leaks, race conditions, or vulnerabilities that manifest only at runtime.[2][3] For instance, in security contexts, techniques such as fuzzing generate inputs to explore program states and uncover flaws, while runtime verification monitors compliance with specifications during execution.[3] In performance analysis, it enables the construction and calibration of models by measuring actual execution frequencies and bottlenecks.[2] These uses have grown with the complexity of modern software systems, including distributed and mobile applications.[4] As of 2025, dynamic analysis increasingly incorporates artificial intelligence and machine learning to enhance input generation, anomaly detection, and tool automation.[5]
Common techniques in dynamic program analysis include instrumentation, where additional code is inserted to collect data—often at the bytecode or binary level for efficiency—and sampling, which periodically captures program states to reduce overhead.[6][7] Advanced methods combine dynamic execution with symbolic reasoning, as in dynamic symbolic execution, to extend coverage beyond concrete inputs.[3] Frameworks like DiSL facilitate the development of custom analyses by blending aspect-oriented programming for expressiveness with low-level manipulation for performance.[2] However, challenges persist, including the significant runtime overhead introduced by monitoring and the incompleteness of results limited to tested scenarios.[1]
Fundamentals
Definition and Principles
Dynamic program analysis is the examination of a program's behavior through its execution on real or simulated inputs, capturing runtime states including variable values, control flow, and interactions with the environment. This approach enables the observation of actual software execution, revealing properties that may not be apparent from code inspection alone, such as the impact of dynamic features like polymorphism or threading.[8][9]
Key principles underlying dynamic program analysis include observability, which involves monitoring and recording execution traces to gather relevant data while managing overhead from instrumentation and logging; context-sensitivity, recognizing that program behavior is highly dependent on specific inputs, environmental factors, and runtime conditions; and the exploration of execution paths, where individual runs reveal one path through the program's possible control flows, necessitating multiple executions with varied inputs to achieve broader coverage. These principles ensure that the analysis reflects real-world usage but also highlight limitations, such as the inability to observe all paths in a single execution and the need for representative test cases to mitigate input dependency.[9][10]
The basic workflow of dynamic program analysis consists of generating or selecting inputs to drive the program, executing it under controlled conditions to produce runtime data, collecting observations such as traces or metrics during or after execution, and interpreting the gathered information to infer properties like coverage or errors. This process emphasizes post-execution analysis for efficiency, where raw data is processed to identify patterns or anomalies.[9][10]
Unlike dynamic programming algorithms, which focus on optimizing recursive problem-solving through memoization and subproblem reuse, dynamic program analysis pertains to software verification, debugging, and performance understanding by observing live executions rather than algorithmic computation. It is distinct from static analysis, which examines code without running it, though the two are complementary in software engineering practices.[8][9]
Static vs. Dynamic Analysis
Static analysis examines a program's source code or binary without executing it, employing techniques such as abstract interpretation or model checking to reason about potential behaviors and detect issues like syntax errors or type mismatches prior to runtime.[11] In contrast, dynamic analysis requires actual program execution, often with specific inputs or test cases, to observe and analyze runtime behaviors, thereby uncovering phenomena such as race conditions, memory leaks, or bugs that manifest only under particular execution paths.[11]
The trade-offs between these approaches are significant. Static analysis provides exhaustive coverage over all possible execution paths, ensuring soundness in its conclusions, but it often produces false positives due to conservative approximations needed to handle complex state spaces and path explosion.[11] Dynamic analysis, while precise for the observed executions—offering high accuracy in detecting path-specific issues—suffers from incompleteness, as it only covers the paths actually traversed during testing, potentially missing latent bugs.[8] In terms of bug detection metrics, dynamic analysis typically achieves higher precision (fewer false alarms) but lower recall (missing some bugs) compared to static methods, whereas static analysis excels in recall at the cost of precision.[8]
Due to these complementary strengths, static and dynamic analyses are frequently integrated in hybrid approaches to balance exhaustiveness with precision.[11] In modern development pipelines, such as continuous integration systems, static analysis is applied early for broad vulnerability scanning, while dynamic analysis follows during testing to validate runtime issues, enhancing overall software reliability and fix rates—for instance, combining them has been shown to achieve near-100% actionable bug fixes in large-scale environments.[12] Dynamic analysis often incorporates coverage metrics, such as the percentage of executed code paths, to quantify the thoroughness of testing efforts.[8]
Historical Development
The roots of dynamic program analysis trace back to the 1960s, when core dumps emerged as a fundamental debugging technique on early batch-processing and standalone computer systems, allowing programmers to capture the state of a crashed program for post-execution examination without tying up expensive hardware resources. This manual approach relied on memory snapshots to identify faults, laying the groundwork for execution-based analysis. By the 1970s, with the advent of time-sharing systems like UNIX, tools such as the adb debugger—introduced in the Sixth Edition of Research UNIX in 1975—enabled interactive tracing and manual inspection of running processes, marking an early shift toward more structured runtime observation.[13]
The 1980s and 1990s saw the maturation of dynamic analysis through specialized profilers and coverage tools, driven by the growing complexity of software in engineering practices. The gprof call graph execution profiler, released in 1982 as part of BSD UNIX, introduced automated sampling to measure function call frequencies and execution times, significantly advancing performance analysis by providing actionable insights into program hotspots.[14] Concurrently, coverage tools like tcov, developed for C and Fortran in the mid-1980s under SunOS, enabled statement-level tracking of executed code paths during testing, formalizing dynamic testing as a key software engineering discipline to ensure comprehensive validation.[15] This era emphasized execution monitoring to complement static methods, with dynamic techniques gaining prominence for revealing runtime behaviors unattainable through code inspection alone.
In the 2000s, dynamic analysis expanded with the proliferation of binary instrumentation frameworks, enabling flexible runtime modifications without source access. The PIN framework, introduced by Intel in 2005, revolutionized this area by providing a dynamic binary instrumentation system for IA-32 and x86-64 architectures, facilitating the creation of custom analysis tools for tasks like profiling and security auditing.[16] Fuzzing also integrated deeply with dynamic methods around this time, exemplified by early coverage-guided approaches in 2007 that used runtime feedback to evolve test inputs, paving the way for tools like American Fuzzy Lop (AFL) in 2013.[17]
The 2010s and 2020s brought sophisticated advances, particularly in dynamic symbolic execution and AI integration, enhancing automation and scalability. KLEE, unveiled in 2008, pioneered bit-precise symbolic execution on LLVM bitcode, automatically generating high-coverage tests for C programs by exploring path constraints during execution.[18] Extensions in the 2020s refined these techniques for larger systems, while machine learning emerged for smarter input generation in fuzzing, as seen in Learn&Fuzz (2017), which used neural networks to infer grammars from samples for more targeted mutations.[19] By 2025, trends include AI-assisted runtime monitoring, leveraging machine learning for anomaly detection in production environments to preemptively address issues like vulnerabilities.[20] Influential surveys, such as the 2014 overview of dynamic techniques, highlighted these evolutions, while post-2015 adoption integrated dynamic analysis into CI/CD pipelines for continuous testing and security scanning.[21][22]
Techniques
Instrumentation
Instrumentation is a core technique in dynamic program analysis that involves the insertion of additional code, known as probes or hooks, into a program to collect runtime data on events such as function calls, variable assignments, or control flow changes, without modifying the program's core logic.[23] This process enables the monitoring of execution behavior by logging relevant information during program runs, often at compile-time, load-time, or runtime stages.[24] For instance, probes can record timestamps for performance metrics or trace data dependencies for debugging purposes.[23]
Instrumentation can be categorized into two primary types: source-level and binary-level. Source-level instrumentation modifies the program's source code before compilation, such as by inserting logging statements or counters directly into the text, which allows for high-level abstractions but requires access to the original code.[25] In contrast, binary-level instrumentation alters the compiled executable after compilation, enabling analysis of proprietary or legacy software without source availability, though it may introduce challenges in handling optimized code.[24] These types differ in granularity and applicability, with source-level approaches often being more straightforward for custom modifications.[23]
Several mechanisms facilitate the insertion of instrumentation code. Just-in-time (JIT) compilation supports dynamic insertion at runtime by recompiling portions of the code on-the-fly to embed probes, optimizing for transparency and efficiency across architectures.[24] Aspect-oriented programming (AOP) provides a modular alternative, using language constructs like aspects to weave monitoring code into the program at specified join points, such as method entries or field accesses, without scattering instrumentation throughout the base code.[26] Compile-time and load-time mechanisms, meanwhile, integrate probes earlier in the process, reducing runtime costs but limiting adaptability to execution paths.[23]
Managing the overhead introduced by instrumentation is essential, as added code can slow execution by factors of 2-10x or more, depending on probe complexity.[24] Techniques for overhead reduction include selective instrumentation, which targets only critical code regions identified via prior static analysis, thereby omitting probes from irrelevant paths to minimize intrusion.[27] Approximation methods further alleviate costs by sampling events rather than logging every occurrence or using lightweight summaries instead of full traces.[23]
A simple example illustrates source-level instrumentation for counting loop iterations. The original code might be:
for (i = 0; i < n; i++) {
    // loop body
}
The instrumented version inserts a counter increment:
counter = 0;
for (i = 0; i < n; i++) {
    counter++;
    // loop body
}
This allows runtime tracking of iterations without altering the program's functionality.[23] Such insertions are foundational for analyses like code coverage, where probes mark executed lines during testing.[25]
Sampling and Profiling
Sampling and profiling techniques in dynamic program analysis approximate program behavior by collecting data at discrete intervals, such as every 10 milliseconds, rather than monitoring every instruction, to estimate metrics like CPU time usage or event frequencies. This approach enables efficient analysis of large-scale executions by capturing snapshots of the program's state, typically the program counter (PC) or stack traces, during runtime. Unlike exhaustive methods, sampling trades precision for low overhead, making it suitable for production environments where continuous monitoring would be prohibitive.[28]
Statistical sampling methods rely on random interrupts generated via operating system signals, such as timer-based SIGPROF or SIGVTALRM, to suspend the program periodically and record its execution state. These interrupts, often controlled through facilities like setitimer() and handled via debuggers or kernel hooks, build histograms of PC locations to infer time distribution across code regions. Hardware-assisted sampling uses performance monitoring units (PMUs) in modern CPUs, which count low-level events like instruction retirements or branch mispredictions without software intervention; samples are buffered in hardware and retrieved via kernel interrupts or user-space interfaces like JNI in virtual machines. PMUs provide finer-grained data, such as cycle-accurate event counts, enhancing the accuracy of dynamic profiling in multi-threaded or heterogeneous systems.[28][29]
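A minimal sketch of such timer-driven sampling on a POSIX system follows; the busy_work workload is illustrative, and a real profiler would read the interrupted program counter from the ucontext argument rather than merely counting ticks:

#include <signal.h>
#include <stdio.h>
#include <string.h>
#include <sys/time.h>

static volatile unsigned long samples = 0;

/* Invoked on every SIGPROF tick; a real profiler would inspect the
   ucontext argument to recover the interrupted program counter and
   add it to a histogram keyed by code address. */
static void on_profile_tick(int sig, siginfo_t *info, void *ucontext) {
    (void)sig; (void)info; (void)ucontext;
    samples++;                           /* record one sample */
}

static double busy_work(long n) {        /* illustrative CPU-bound workload */
    double acc = 0.0;
    for (long i = 1; i <= n; i++)
        acc += 1.0 / (double)i;
    return acc;
}

int main(void) {
    struct sigaction sa;
    memset(&sa, 0, sizeof sa);
    sa.sa_sigaction = on_profile_tick;
    sa.sa_flags = SA_SIGINFO | SA_RESTART;
    sigaction(SIGPROF, &sa, NULL);

    /* Fire SIGPROF every 10 ms of consumed CPU time. */
    struct itimerval timer;
    timer.it_interval.tv_sec = 0;
    timer.it_interval.tv_usec = 10000;
    timer.it_value = timer.it_interval;
    setitimer(ITIMER_PROF, &timer, NULL);

    double r = busy_work(50000000L);
    printf("result=%f, samples collected=%lu\n", r, samples);
    return 0;
}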
Profiling variants include call-graph profiling, which measures function invocation times and call frequencies by sampling stack traces to construct dynamic call graphs attributing time to callers and callees. This technique, pioneered in tools like gprof, uses PC sampling at clock intervals (e.g., 1/60 second) to approximate execution profiles, assuming uniform call durations for simplicity. Cache profiling tracks miss rates and types (compulsory, capacity, or conflict) by simulating cache behavior on dynamic address traces or leveraging PMU events for memory access metrics, identifying hot spots at the source-line or data-structure level to guide optimizations like loop fusion.[30][31]
Adaptive sampling algorithms adjust collection frequency dynamically based on observed variance in metrics, increasing resolution in high-variability regions while reducing samples elsewhere to maintain efficiency. For instance, in distributed environments, sampling rates can adapt to inter-thread sharing patterns, preserving locality during migrations. The accuracy of these estimates follows binomial statistics, with the fractional error bound approximated as \frac{1}{\sqrt{N}}, where N is the number of samples; larger N reduces aliasing but increases overhead.[32][28]
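As a worked example of this bound, a profile built from N = 10^4 samples has an expected fractional error of roughly \frac{1}{\sqrt{10^4}} = 1\%, while quadrupling the sample count to 4 \times 10^4 only halves the error to about 0.5%, illustrating the diminishing returns of longer sampling runs.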
These methods offer low runtime overhead, typically 1-5% of execution time, compared to instrumentation-based alternatives that provide higher precision at greater cost. However, sampling introduces potential attribution errors, such as aliasing between adjacent instructions, which can skew estimates in short or irregular functions.[28]
Tracing and Execution Monitoring
Tracing and execution monitoring in dynamic program analysis involves the passive or active capture of detailed sequences of events during a program's runtime, such as branches taken, memory accesses, and system calls, to enable subsequent analysis or deterministic replay.[33] These traces provide a comprehensive record of the program's dynamic behavior, distinguishing them from aggregated statistics in sampling-based approaches by preserving the full execution history for replayability.[33]
Key techniques include record-replay systems, which log inputs, thread scheduling, and other sources of non-determinism to allow identical re-execution, facilitating debugging of hard-to-reproduce bugs in concurrent or parallel programs.[34] For instance, user-space record-replay mechanisms, such as those leveraging modern CPU features and operating system schedulers, record non-deterministic events like thread interleavings without requiring kernel modifications or hardware changes.[35] Kernel-level tracing complements this by using frameworks like eBPF in Linux, which attach lightweight, sandboxed programs to kernel hook points (e.g., tracepoints or kprobes) to monitor system-wide events such as I/O operations or memory allocations with minimal runtime overhead.[36][37]
Execution traces are typically represented as ordered sequences of tuples, such as (program counter address, value) pairs for instructions, memory reads/writes, or register states, often augmented with timestamps or thread identifiers to maintain ordering in multi-threaded contexts.[33] To manage their size, compression methods like delta encoding—where differences between consecutive values are stored instead of full addresses—along with value predictors (e.g., last-value or finite context models), can reduce trace volumes significantly; for example, one algorithm achieves up to 18x compression on instruction traces while processing in a single pass.[38]
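A minimal sketch of delta encoding over an address trace, assuming 64-bit addresses and omitting the value-predictor stages described above:

#include <stdint.h>
#include <stdio.h>

/* Encode a trace of addresses as differences from the previous entry.
   Consecutive instruction addresses are usually close together, so the
   deltas are small and compress far better than the raw 64-bit values. */
static void delta_encode(const uint64_t *addrs, int64_t *deltas, size_t n) {
    uint64_t prev = 0;
    for (size_t i = 0; i < n; i++) {
        deltas[i] = (int64_t)(addrs[i] - prev);
        prev = addrs[i];
    }
}

/* Recover the original addresses from the deltas (used at replay time). */
static void delta_decode(const int64_t *deltas, uint64_t *addrs, size_t n) {
    uint64_t prev = 0;
    for (size_t i = 0; i < n; i++) {
        prev += (uint64_t)deltas[i];
        addrs[i] = prev;
    }
}

int main(void) {
    uint64_t trace[] = { 0x400000, 0x400004, 0x400008, 0x400040, 0x400044 };
    int64_t deltas[5];
    uint64_t decoded[5];
    delta_encode(trace, deltas, 5);
    delta_decode(deltas, decoded, 5);
    for (int i = 0; i < 5; i++)
        printf("addr=%#llx delta=%lld decoded=%#llx\n",
               (unsigned long long)trace[i], (long long)deltas[i],
               (unsigned long long)decoded[i]);
    return 0;
}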
In debugging, these techniques support deterministic replay, where the recorded execution is reproduced exactly to isolate faults, and reverse execution, allowing developers to step backward through the trace to examine prior states without re-running from the start.[34][35] This is particularly valuable for concurrent software, where non-deterministic interleavings can mask bugs during live testing.[39]
Major challenges include handling non-determinism from sources like thread scheduling, hardware instruction reordering, and external inputs (e.g., timestamps or I/O), which requires comprehensive logging to ensure faithful replay but can introduce synchronization overheads of 3x to 6x or more in runtime.[34][39] Storage overhead poses another issue, as uncompressed traces can exceed the program's binary size by factors of 10 or more for long executions, though compression mitigates this to sub-instruction-bit levels in optimized cases.[33][38]
Hybrid Methods
Hybrid methods in dynamic program analysis integrate dynamic execution techniques with static abstractions or complementary dynamic modes to address limitations such as incomplete path coverage or high computational overhead in pure dynamic approaches. These methods typically combine concrete execution paths observed during runtime with static approximations of program behavior, allowing for more efficient exploration of the program's state space. For instance, static analysis can prune infeasible paths that dynamic execution might otherwise pursue exhaustively, while dynamic runs provide concrete data to refine static models. This integration enhances the precision and scalability of analysis without sacrificing soundness in many cases.[40]
A prominent example is concolic testing, which merges concrete execution with symbolic analysis by running the program on concrete inputs while simultaneously tracking symbolic constraints along the execution path. This approach generates new test inputs by solving path constraints to explore alternative branches, effectively combining the path-guided exploration of dynamic testing with the constraint-solving power of static symbolic execution. Concolic testing was introduced in the CUTE framework for unit testing C programs, demonstrating its ability to automatically generate high-coverage test suites for complex software. Another example is feedback-directed fuzzing, where dynamic feedback from program executions—such as coverage metrics—informs static mutation strategies to generate targeted inputs, as seen in directed greybox fuzzing tools like AFLGo. In AFLGo, static distance metrics from target locations guide the dynamic fuzzing process to prioritize inputs that bring the execution closer to specific code regions, improving efficiency for tasks like patch testing.[41][42]
The benefits of hybrid methods include improved code coverage by leveraging static analysis to identify and prioritize feasible paths that dynamic execution alone might miss, thereby reducing false negatives in bug detection. For example, static pruning of infeasible branches mitigates path explosion in dynamic symbolic execution, allowing deeper exploration within resource constraints. Additionally, these methods can accelerate analysis by using dynamic observations to validate and refine static approximations, leading to faster convergence on relevant program behaviors. In the context of loop invariant inference, hybrid approaches like mutation-based generation combined with dynamic checking and static verification have shown effectiveness in automating the discovery of invariants for verifying loop correctness, outperforming purely dynamic or static techniques in terms of both accuracy and efficiency on benchmarks.[40][43]
Frameworks such as the mutation-dynamic-static inference system for loop invariants exemplify practical implementations, where postconditions are mutated to candidate invariants, dynamically tested on execution traces, and statically checked for sufficiency. This hybrid process iteratively strengthens invariants until verification succeeds, handling non-trivial loops that challenge single-method approaches. Similarly, AFLGo integrates static call-graph analysis with dynamic instrumentation to compute reachability distances, directing fuzzing efforts toward user-specified targets.[43][42]
Despite these advantages, hybrid methods face challenges including the complexity of synchronizing dynamic and static components, such as aligning concrete traces with abstract models to avoid inconsistencies. Ensuring seamless integration often requires careful handling of approximations, as overly aggressive static unsoundness can propagate errors into dynamic phases. A key metric for evaluating hybrid effectiveness is the combined coverage achieved, formally expressed as:
\text{combined\_coverage} = \text{dynamic\_paths} \cup \text{static\_feasible\_paths}
This union captures the expanded exploration enabled by merging the two, though practical implementations must account for overlaps and verification overheads.[40]
Types of Analysis
Code Coverage and Testing
Code coverage in dynamic program analysis refers to the measurement of the extent to which source code is exercised during program execution under test inputs, providing insights into the thoroughness of testing efforts.[44] This approach relies on runtime instrumentation or monitoring to track executed elements, such as statements or control flow paths, helping developers identify untested portions of the code.[45] Structural metrics like statement coverage, which assesses the percentage of executable statements run by tests, offer a basic gauge of testing completeness, while more advanced ones evaluate control structures.[46]
Key metrics include statement coverage, branch coverage, and path coverage. Statement coverage measures the proportion of code lines executed, providing a simple indicator of breadth.[47] Branch coverage, also known as decision coverage, evaluates whether all possible outcomes of conditional branches (e.g., true and false paths in if-statements) are tested, calculated as (number of executed branches / total branches) × 100%.[48] Path coverage extends this by aiming to exercise all feasible execution paths through the program, though it is computationally intensive due to the potential exponential number of paths.[49]
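As an illustration of branch-coverage accounting, the sketch below instruments a small function with one counter per branch outcome; a single test input exercises three of the four outcomes, giving 75% branch coverage (the probe placement is illustrative of what coverage tools insert automatically):

#include <stdio.h>

/* One counter per branch outcome: {if-true, if-false, loop-taken, loop-exit}. */
static unsigned long branch_hits[4];

int clamp_sum(const int *a, int n, int limit) {
    int sum = 0;
    for (int i = 0; i < n; i++) {        /* probes 2 (taken) and 3 (exit) */
        branch_hits[2]++;
        sum += a[i];
    }
    branch_hits[3]++;
    if (sum > limit) {                   /* probes 0 (true) and 1 (false) */
        branch_hits[0]++;
        sum = limit;
    } else {
        branch_hits[1]++;
    }
    return sum;
}

int main(void) {
    int data[] = {1, 2, 3};
    clamp_sum(data, 3, 100);             /* hits loop-taken, loop-exit, if-false */
    int covered = 0;
    for (int i = 0; i < 4; i++)
        if (branch_hits[i] > 0) covered++;
    printf("branch coverage: %d/4 = %.0f%%\n", covered, 100.0 * covered / 4);
    return 0;
}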
Dynamic test generation techniques leverage execution feedback to automatically create inputs that maximize coverage. Concolic testing, a hybrid of concrete and symbolic execution, systematically explores paths by negating conditions to uncover new branches, significantly improving coverage over random testing. Mutation testing enhances this by introducing small, syntactic changes (mutants) to the code and verifying if tests detect these faults, thereby assessing and guiding test suite improvements.
Coverage data integrates seamlessly into continuous integration/continuous deployment (CI/CD) pipelines, where tools collect metrics during automated builds to prioritize tests for uncovered areas or enforce thresholds before merges.[50] For instance, in unit testing, executing a suite on a function might reveal 80% branch coverage, highlighting untested error-handling branches for targeted additions.[51]
Despite its value, code coverage has limitations; it measures execution but does not ensure logical correctness or absence of bugs, as tests may exercise code without validating outputs.[52] In safety-critical software, modified condition/decision coverage (MC/DC) addresses this by requiring each condition in a decision to independently affect the outcome, a standard mandated for high-assurance systems like avionics.[53] This criterion demands more rigorous testing than basic branch coverage but still falls short of proving bug-free software.[54]
Memory Error Detection
Dynamic program analysis plays a crucial role in detecting memory errors, which are prevalent in languages like C and C++ that provide low-level memory management. Common memory errors include buffer overflows, where a program writes beyond the allocated bounds of an array or buffer, and dangling pointers, which arise from use-after-free scenarios where memory is accessed after deallocation. These errors can lead to data corruption, crashes, or security vulnerabilities. Detection often relies on shadow memory techniques, which maintain a parallel memory region to track the state of every byte in the program's address space, such as whether it is allocated, freed, or uninitialized. For instance, shadow memory maps each byte of application memory to a smaller set of bytes (typically 1:8 ratio) that encode validity information, allowing runtime checks on memory accesses.[55][56]
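A minimal sketch of the 1:8 shadow mapping, assuming the AddressSanitizer-style layout on 64-bit Linux in which the shadow byte for an address is located by shifting the application address and adding a fixed offset (the constants below are the commonly documented x86-64 values, shown only for illustration):

#include <stdint.h>
#include <stdio.h>

/* Each 8-byte word of application memory maps to one shadow byte that,
   in the ASan encoding, records how many of its bytes are addressable
   (0 = all eight, k = only the first k, special values = poisoned). */
#define SHADOW_SCALE   3                     /* divide addresses by 8       */
#define SHADOW_OFFSET  0x7fff8000ULL         /* typical x86-64 Linux offset */

static uint64_t shadow_of(uint64_t addr) {
    return (addr >> SHADOW_SCALE) + SHADOW_OFFSET;
}

int main(void) {
    uint64_t app = 0x602000000010ULL;        /* hypothetical heap address */
    printf("application byte %#llx -> shadow byte %#llx\n",
           (unsigned long long)app, (unsigned long long)shadow_of(app));
    return 0;
}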
Methods for memory error detection incorporate runtime checks to enforce memory safety during execution. Bounds verification, a key runtime check, instruments pointer arithmetic and array accesses to ensure they stay within allocated regions, flagging violations immediately. Leak detectors, on the other hand, operate at program exit by scanning the heap to identify unreleased allocations; they trace from roots like global variables and the stack to mark reachable objects, then report any allocated but unmarked memory as leaks. These approaches add overhead but provide precise diagnostics without requiring source code changes in some tools.[57][58]
Algorithms for leak detection commonly employ a mark-and-sweep strategy adapted for non-garbage-collected environments. In the mark phase, the tool traverses references from the roots to mark reachable objects; the sweep phase then reports any allocated blocks left unmarked. Conceptually, the leaked size is the total bytes allocated minus the combined size of the marked (reachable) allocations, though tools may approximate per-block sizes when reporting for efficiency. This method covers both allocations that were simply never freed and blocks that became unreachable, for example through lost or cyclic references.[58]
AddressSanitizer (ASan), introduced in 2012, is a widely adopted standard for C/C++ memory error detection, combining compiler instrumentation with a runtime library to enable shadow memory tracking for bounds checks, use-after-free, and other errors with low overhead (typically 2x slowdown). It integrates seamlessly into build systems like those for the Chromium browser, where it has detected numerous bugs. Valgrind's Memcheck tool complements such efforts by providing binary-level instrumentation with shadow memory for similar detections, often used in tandem for validation though they operate independently.[55][59]
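For example, a one-element out-of-bounds write is caught at the faulty store when the program is built with the -fsanitize=address compiler flag (a minimal sketch; the build command in the comment is illustrative):

/* Build (illustrative): clang -g -fsanitize=address overflow.c -o overflow */
#include <stdlib.h>

int main(void) {
    int *buf = malloc(8 * sizeof(int));
    buf[8] = 42;          /* heap-buffer-overflow: one element past the end */
    free(buf);
    return 0;
}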
A notable case study involves ASan's application in browser development, such as Google Chrome, where it uncovered double-free errors—where the same memory block is deallocated twice, corrupting heap metadata. During integration into Chromium's build process, ASan identified several such issues in the test suite, enabling fixes before release; for example, it flagged double-frees in rendering components by poisoning freed memory and detecting subsequent invalid frees, preventing potential exploits. This has contributed to ongoing security hardening in large-scale applications.[55][60]
Fuzzing
Fuzzing is an automated dynamic analysis technique that systematically generates and feeds malformed, unexpected, or random inputs to a program to trigger faults, crashes, or vulnerabilities during execution.[61] This approach monitors runtime behavior to identify issues such as memory errors or assertion failures, making it particularly effective for uncovering defects that are difficult to detect through manual testing or static methods.[61]
Fuzzing variants are classified by the level of program knowledge used. Black-box fuzzing operates without access to the program's internals, relying solely on external interfaces and random input generation to probe for failures.[61] White-box fuzzing incorporates source code analysis to guide input creation, often integrating symbolic execution for path exploration.[61] Grey-box fuzzing, a hybrid that emerged prominently in the 2010s, uses lightweight instrumentation to gather partial runtime feedback, such as code coverage, without requiring full source access.[61]
Core algorithms in fuzzing draw from evolutionary computation to iteratively refine inputs. Genetic fuzzing employs mutation and crossover operations on seed inputs—initial valid or semi-valid test cases—to produce variants that explore new execution paths.[62] Coverage-guided fuzzing enhances this by prioritizing mutations based on observed code coverage; for instance, American Fuzzy Lop (AFL), introduced in 2013, uses a power scheduling mechanism to allocate more mutations to seeds that uncover rare branches, employing a 64 KB bitmap to track edge transitions during execution.[62] This feedback loop favors inputs that increase branch diversity, improving efficiency over purely random mutation.
Key metrics evaluate fuzzing effectiveness, including crash reproduction rates and code coverage gains. Fuzzers like AFL achieve high crash reproducibility by minimizing and deterministically replaying fault-inducing inputs, often verifying 90-100% of detected crashes in controlled environments.[63] In terms of coverage, coverage-guided methods significantly outperform random testing; AFL, for example, discovers up to 10 times more unique edges (e.g., 2,597 vs. 265 in GNU patch utilities) by focusing mutations on unexplored paths.[62]
Evolutions in the 2010s shifted toward grey-box techniques, with AFL popularizing coverage feedback to boost vulnerability discovery in real-world binaries.[61] By 2025, AI enhancements, particularly large language models (LLMs), have advanced input synthesis; LLMs generate semantically valid yet adversarial inputs for complex protocols, addressing limitations in traditional mutation for structured data formats.[64]
A representative example involves fuzzing a file parser, such as an XML or JSON handler, by mutating seed documents with random strings, bit flips, or oversized payloads to trigger buffer overflows when inputs exceed allocated memory bounds.[65] This process can expose stack or heap overflows in parsing routines, as seen in vulnerabilities discovered in libraries like libxml2 through tools like AFL.[66]
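A minimal harness in the libFuzzer style illustrates how such a parser is exposed to coverage-guided mutation; parse_document is a hypothetical stand-in for the parser under test, while LLVMFuzzerTestOneInput is the entry point libFuzzer expects:

/* Build (illustrative): clang -g -fsanitize=fuzzer,address harness.c parser.c */
#include <stddef.h>
#include <stdint.h>

/* Hypothetical parser under test, declared here for the sketch. */
int parse_document(const uint8_t *data, size_t size);

/* The fuzzer repeatedly calls this entry point with mutated inputs,
   using coverage feedback to decide which mutants to keep as seeds. */
int LLVMFuzzerTestOneInput(const uint8_t *data, size_t size) {
    if (size == 0)
        return 0;                 /* ignore empty inputs */
    parse_document(data, size);   /* crashes and sanitizer reports are the signal */
    return 0;
}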
Dynamic Symbolic Execution
Dynamic symbolic execution (DSE) is a hybrid technique that combines concrete program execution with symbolic reasoning to systematically explore execution paths and generate test inputs. In DSE, inputs are treated as symbolic variables represented by formulas, allowing the program to execute along concrete paths while simultaneously building symbolic path constraints that capture the conditions leading to those paths. These constraints are solved using satisfiability modulo theories (SMT) solvers to produce concrete values that guide exploration toward uncovered branches.[67][68]
The process begins with an initial concrete execution using arbitrary input values, during which symbolic constraints are collected for each branch taken—for instance, if a conditional statement if (x > 0) is evaluated to true, the constraint x > 0 is added to the path condition. To explore alternative paths, the most recent branch constraint is negated (e.g., x \leq 0), and the modified path condition is passed to an SMT solver to find satisfying concrete inputs if feasible. This concolic (concrete + symbolic) approach iteratively generates new inputs to flip branches, enabling deeper path coverage without exhaustive enumeration. The path condition after k branches is formally \text{PC} = c_1 \wedge c_2 \wedge \dots \wedge c_k, where each c_i is the constraint of the i-th branch taken; to reach an unexecuted path, the solver is asked for inputs satisfying c_1 \wedge \dots \wedge c_{k-1} \wedge \neg c_k, i.e., the prefix of the path condition with the final constraint negated.[68][69]
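The sketch below illustrates one concolic iteration on a small function; the comments trace the constraints collected on a concrete run and the negated query used to reach the other branch (illustrative, not tied to a particular tool):

#include <stdio.h>

/* One concolic iteration on classify() with concrete inputs x = 5, y = 3. */
int classify(int x, int y) {
    int z = 2 * y;        /* symbolic value: z = 2*y                            */
    if (x > z) {          /* concretely false (5 > 6), so PC gains x <= 2*y     */
        return 1;
    }
    if (x == y) {         /* concretely false (5 != 3), so PC gains x != y      */
        return 2;
    }
    return 0;             /* observed path; PC = (x <= 2*y) AND (x != y)        */
}

/* To flip the last branch, the engine asks the SMT solver for inputs
   satisfying (x <= 2*y) AND (x == y); e.g. x = 1, y = 1 drives the next
   run into the "return 2" path.                                           */
int main(void) {
    printf("classify(5,3)=%d classify(1,1)=%d\n", classify(5, 3), classify(1, 1));
    return 0;
}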
Prominent tools implementing DSE include KLEE, introduced in 2008, which applies the technique to LLVM bitcode for automatic test generation in C programs, achieving high coverage on core utilities like those in the GNU Coreutils suite. Earlier, the CUTE tool from 2005 pioneered concolic execution for unit testing C code with pointer inputs, using constraint solving to automate test case generation for functions handling complex data structures.[69][70]
A primary challenge in DSE is path explosion, where the number of feasible execution paths grows exponentially with branches, leading to scalability issues for large programs. To mitigate this, techniques such as state merging, abstraction of symbolic values, and search heuristics prioritize promising paths, though full coverage remains computationally intensive even with modern SMT solvers.[68][71]
Dynamic Data-Flow Analysis
Dynamic data-flow analysis is a runtime technique that tracks the propagation of data values through a program by monitoring def-use chains during execution, identifying dependencies from sources such as user inputs to sinks like outputs or sensitive operations. This approach captures concrete execution traces to reveal how data influences program behavior, enabling the detection of unintended information flows or anomalies without relying on static approximations. Unlike broader execution monitoring, it specifically focuses on data lineages, marking and following values to ensure precise dependency revelation.[72]
A primary technique in dynamic data-flow analysis is taint tracking, which associates labels or "taints" with potentially sensitive data at sources and propagates these labels through explicit data dependencies (e.g., assignments, arithmetic operations) and implicit control dependencies (e.g., branches influenced by tainted conditions). Taint propagation typically employs shadow memory structures, where each data byte is mirrored with a taint tag, often using pointer-based mechanisms to handle memory accesses efficiently. For instance, in binary instrumentation, tainted pointers trigger propagation during loads and stores, ensuring that derived values inherit taints from their operands. This method allows analysts to monitor flows in real-time, flagging violations when tainted data reaches unauthorized sinks.[73]
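A minimal sketch of byte-level taint tracking with a shadow array; the source, propagation, and sink-check routines stand in for the hooks that an instrumentation framework would insert automatically:

#include <stdio.h>
#include <string.h>

#define MEM_SIZE 256

static unsigned char memory[MEM_SIZE];   /* application data                 */
static unsigned char taint[MEM_SIZE];    /* one shadow taint flag per byte   */

/* Source: data arriving from the outside is marked tainted. */
static void read_input(size_t dst, const char *data, size_t len) {
    memcpy(&memory[dst], data, len);
    memset(&taint[dst], 1, len);
}

/* Propagation: a copy transfers both the value and its taint label. */
static void copy_bytes(size_t dst, size_t src, size_t len) {
    memcpy(&memory[dst], &memory[src], len);
    memcpy(&taint[dst], &taint[src], len);
}

/* Sink check: flag any tainted byte reaching a sensitive operation. */
static void check_sink(size_t off, size_t len) {
    for (size_t i = 0; i < len; i++)
        if (taint[off + i])
            printf("ALERT: tainted byte at offset %zu reaches sink\n", off + i);
}

int main(void) {
    read_input(0, "user' OR 1=1", 12);   /* untrusted input at offset 0    */
    copy_bytes(100, 0, 12);              /* flows through the program      */
    check_sink(100, 12);                 /* e.g. a query-building routine  */
    return 0;
}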
Another key technique is dynamic slicing, which extracts a subset of the execution trace relevant to a specific variable or criterion, reducing the program's observed behavior to only those statements that contribute to the value at a point of interest. Introduced by Korel and Laski, dynamic slicing operates on a particular input trace, producing an executable subset that replicates the original program's computations for selected variables. Algorithms for dynamic slicing commonly use backward traversal from the slicing criterion through the dynamic dependence graph to identify influencing statements, or forward traversal to propagate effects from sources. These graphs represent runtime def-use relations, enabling precise isolation of execution paths.[74]
Applications of dynamic data-flow analysis include security vulnerability detection, such as identifying SQL injection attacks by tracking tainted inputs into database queries; if user-supplied data propagates unfiltered to a query sink, an alert is raised to prevent exploitation. In debugging, it localizes faults by slicing traces to reveal data dependencies leading to errors, while in testing, it assesses coverage of data flows across inputs. These uses leverage taint tracking to enforce policies, like blocking tainted data in web applications.[75]
Despite its utility, dynamic data-flow analysis incurs significant runtime overhead due to continuous tracking, with pointer-based taint propagation often slowing execution by factors of 1.5x to 40x depending on workload and instrumentation depth. Optimizations such as lazy propagation address this by deferring taint updates until necessary (e.g., only when data reaches a sink or branch), avoiding eager computation on irrelevant paths and reducing memory usage in shadow structures. Such techniques, combined with targeted control-flow propagation, make the analysis feasible for production-like environments.[73][76]
Invariant Inference and Program Slicing
Invariant inference in dynamic program analysis involves automatically discovering properties that hold true across multiple program executions by mining patterns from execution traces. These invariants, such as "array index is always less than array length," are extracted by observing variable values at specific program points during runs with diverse inputs. The process typically instruments the program to log runtime data, then applies statistical or machine learning techniques to identify likely invariants that hold over the observed traces. For instance, decision trees can be trained on execution data to classify and generalize patterns, enabling the detection of linear relations or bounds without manual specification.[77]
A seminal tool for this is Daikon, introduced in 2001, which detects likely invariants by partitioning execution traces and checking candidate properties against observed values, reporting those that hold with high probability. Daikon supports languages like Java and C, and has been applied to infer loop bounds from multiple runs, such as determining that a loop variable i satisfies 0 ≤ i < n across test cases. By running the program on varied inputs, it mines invariants like "sum of array elements remains constant post-sorting," aiding in program understanding and verification. This automated discovery reduces reliance on developer-written assertions, though invariants are probabilistic and may require validation on additional traces.[77][78]
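For example, running an instrumented summation routine on varied inputs might yield reports of the kind Daikon produces; the properties listed in the comment are illustrative of likely invariants at the function's exit:

#include <stdio.h>

/* At the exit of sum_array(), a Daikon-style detector observing many runs
   might report likely invariants such as:
     i == n
     0 <= i
     total == sum(a[0..n-1])                                              */
int sum_array(const int *a, int n) {
    int total = 0;
    int i;
    for (i = 0; i < n; i++)
        total += a[i];
    return total;
}

int main(void) {
    int data[] = {3, 1, 4};
    printf("%d\n", sum_array(data, 3));
    return 0;
}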
Program slicing complements invariant inference by reducing execution traces to relevant subsets, focusing analysis on dependencies that influence specific behaviors. Dynamic backward slicing, starting from a variable or criterion at a program point, identifies statements that actually affect its value in a particular execution, thereby isolating potential invariant violations or bugs. This technique relies on dynamic data-flow analysis to compute dependencies during runtime. For example, in debugging a faulty array access, slicing traces back to statements setting the index, excluding unrelated code paths and shrinking trace size by up to 90% in large programs.[79][80]
The slice is formally defined for a criterion ⟨l, v⟩, where l is a statement and v a variable, as the set of executed statements that influence the value of v at l:
\text{slice}(P, \sigma, \langle l, v \rangle) = \{ s \mid s \text{ executes in trace } \sigma \text{ and influences } v \text{ at } l \}
Here, P is the program, σ the execution trace, and influence is determined via dynamic control and data dependencies. Spectrum-based slicing enhances this by incorporating execution spectra—counts of passed/failed tests per statement—to prioritize slices likely containing faults, improving efficiency in fault localization. Benefits include automated property discovery for verification and minimized traces for debugging, with applications in inferring invariants over sliced executions to focus on critical paths.[79][81]
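The sketch below marks, in comments, which statements belong to the dynamic backward slice for the criterion consisting of the final print statement and the variable y on the given run; statements that do not influence y are excluded (illustrative):

#include <stdio.h>

int main(void) {
    int a = 3;                 /* in slice: a flows into x and then y            */
    int b = 10;                /* NOT in slice: b only influences z              */
    int x = a * 2;             /* in slice: defines x, used by y                 */
    int z = b + 1;             /* NOT in slice: z never affects y                */
    int y;
    if (x > 5)                 /* in slice: controls which definition of y runs  */
        y = x - 5;             /* in slice: the definition of y that executes    */
    else
        y = 0;                 /* NOT in slice on this run: never executes       */
    printf("%d %d\n", y, z);   /* slicing criterion: the value of y here         */
    return 0;
}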
Security and Concurrency Analysis
Dynamic program analysis plays a crucial role in detecting security vulnerabilities and concurrency bugs by monitoring program execution at runtime, enabling the identification of exploits and synchronization issues that static methods may overlook. In security contexts, techniques such as runtime monitoring enforce policies to prevent control hijacking, while in concurrency, they reveal race conditions and deadlocks through trace analysis. These approaches prioritize real-time detection to ensure software reliability in multi-threaded and networked environments.[82]
For security analysis, runtime monitoring implements control-flow integrity (CFI) checks to verify that indirect branches follow a precomputed control-flow graph, thwarting exploits like return-oriented programming attacks. The seminal CFI framework, introduced in 2005, enforces this by inserting validation code at runtime, limiting attacker control over execution paths to valid ones, with subsequent dynamic variants adapting to binary code without source access.[83] Dynamic taint analysis complements CFI by tracking untrusted inputs—such as network data—through the program, flagging potential privilege escalations when tainted data influences sensitive operations like system calls. This method marks external inputs as tainted at entry points and propagates labels during execution, preventing escalations in systems like Android where apps collude to leak private data.[84][85] Anomaly detection extends this by analyzing network inputs for deviations from expected patterns, using dynamic traces to identify intrusions like buffer overflows in real-time traffic processing.[86]
In concurrency analysis, tools like ThreadSanitizer (TSan) detect data races by instrumenting code to track memory accesses and synchronization events, reporting conflicts where concurrent threads access shared data without proper locking. TSan employs a hybrid approach combining happens-before relations—derived from program order, synchronization, and thread creation—with lockset approximations to model execution traces accurately, achieving a virtually zero false-positive rate when all code is instrumented.[87][88] Happens-before analysis on traces further refines this by constructing partial orders from observed executions, predicting races in non-deterministic schedules without exhaustive testing. For deadlocks, dynamic lockset analysis monitors lock acquisition patterns during runs, predicting cycles in lock graphs that could lead to permanent blocking, as seen in generalized deadlock detectors that handle both lock-only and wait-notify scenarios.[89][90]
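A minimal data race of the kind ThreadSanitizer reports: built with the -fsanitize=thread compiler flag, the unsynchronized increments below are flagged as a race on the shared counter (the build command in the comment is illustrative):

/* Build (illustrative): clang -g -fsanitize=thread race.c -lpthread */
#include <pthread.h>
#include <stdio.h>

static long counter = 0;          /* shared, unprotected state */

static void *worker(void *arg) {
    (void)arg;
    for (int i = 0; i < 100000; i++)
        counter++;                /* racy read-modify-write */
    return NULL;
}

int main(void) {
    pthread_t t1, t2;
    pthread_create(&t1, NULL, worker, NULL);
    pthread_create(&t2, NULL, worker, NULL);
    pthread_join(t1, NULL);
    pthread_join(t2, NULL);
    printf("counter = %ld\n", counter);   /* result varies run to run */
    return 0;
}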
A representative case involves detecting use-after-free (UAF) vulnerabilities in multi-threaded code, where freed memory is accessed post-deallocation, often exacerbated by races. Dynamic tools like AddressSanitizer, integrated with concurrency checkers, shadow memory allocations and detect invalid accesses at runtime, identifying UAF in Linux kernel drivers by correlating frees with subsequent thread-interleaved uses. In one study, such analysis uncovered concurrency UAF bugs in production systems, confirming them through replayed traces with minimal overhead.[91] Fuzzing can generate security inputs to trigger these issues during dynamic monitoring, enhancing coverage for exploit-prone paths. Recent advances, as of 2024, integrate machine learning into trace-based anomaly detection for faster concurrency bug localization in cloud-native applications.[92][93]
Performance Profiling
Performance profiling, a key aspect of dynamic program analysis, involves instrumenting or sampling a program's execution to measure and analyze resource consumption, thereby identifying bottlenecks and guiding optimizations. By executing the program with representative inputs, profilers capture runtime behavior, revealing inefficiencies such as excessive computation in specific routines or suboptimal resource allocation. This approach contrasts with static analysis by providing concrete, workload-specific insights into actual performance characteristics.[94]
Core metrics in performance profiling encompass CPU time, which quantifies the processor cycles dedicated to computation; memory bandwidth, measuring the rate of data movement between CPU and memory; and I/O latency, assessing delays in disk or network operations. These metrics help pinpoint resource contention or underutilization during execution. Hot-spot detection targets code regions, such as functions, that account for more than 5% of total execution time, as these often yield the highest returns from optimization efforts.[95][96]
Prominent techniques include call-stack sampling, where the profiler periodically interrupts execution—typically every few milliseconds—to record the active call stack, yielding a statistical profile of time distribution with low overhead. This method relies on hardware performance counters or timers to approximate hotspots without altering program semantics significantly. For visualization, flame graphs represent sampled call stacks as stacked, width-proportional bars resembling flames, with wider segments indicating greater time expenditure; introduced by Brendan Gregg in 2011, they facilitate intuitive navigation of complex execution hierarchies.[97][98]
Key tools for performance profiling include Perf, integrated into the Linux kernel starting with version 2.6.31 in 2009, which supports event-based counting of hardware metrics like cache misses and instructions retired for precise runtime analysis. Complementing this, GNU gprof generates flat profiles that aggregate time spent per function, independent of call graphs, enabling quick identification of self-time contributors via compiler-inserted instrumentation.[99]
Dynamic feedback from profiling enhances just-in-time (JIT) compilers, as seen in the Java Virtual Machine (JVM), where runtime data on method invocation frequencies and branch probabilities informs aggressive recompilation of hot code paths to native instructions. This adaptive process, leveraging profiles collected during initial interpretation, can yield substantial speedups by tailoring optimizations to observed workloads.[100]
In parallel computing contexts, performance profiling supports adaptations of Amdahl's law, where the parallelizable fraction p is measured dynamically at runtime through execution traces or sampling. The potential speedup is given by
\text{speedup} = \frac{1}{(1 - p) + \frac{p}{s}},
with s denoting the speedup achievable on the parallel portion; this runtime estimation guides scalability assessments and parallelization strategies.[101]
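As a worked example, if profiling shows that p = 0.9 of the measured execution time is parallelizable and the parallel portion runs on s = 8 cores, the bound evaluates to
\text{speedup} = \frac{1}{(1 - 0.9) + \frac{0.9}{8}} \approx 4.7,
well below the eight-fold ideal, which is why runtime measurement of p is central to deciding whether further parallelization is worthwhile.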
Binary Instrumentation Frameworks
Binary instrumentation frameworks enable the modification of machine code in executables without access to the source code, typically by inserting probes or additional instructions to monitor or alter program behavior at runtime.[102] This dynamic approach allows for platform-agnostic analysis across different programming languages and compilers, facilitating the insertion of analysis code directly into the binary's execution flow.[103]
Key frameworks include Intel's PIN, introduced in 2005, which targets x86 architectures and provides a robust platform for building custom analysis tools through its API.[103] PIN supports just-in-time (JIT) binary rewriting, enabling developers to instrument instructions, basic blocks, or traces dynamically.[16] Similarly, DynamoRIO, originating from a 2002 research project on dynamic optimization, offers a process virtualization system using software code caches to support instrumentation on multiple platforms, including Windows, Linux, and macOS. Its extensible API allows for hooks into system calls, memory allocations like malloc, and control flow events, making it suitable for diverse dynamic analyses.[104]
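The canonical use of such an API is instruction counting; the sketch below follows the structure of Pin's published example tools (Pin tools are compiled as C++ against pin.H, and the calls shown should be read as an approximation of that API rather than a verbatim listing):

/* Sketch of a Pin-style instruction counter (compiled as a C++ Pin tool). */
#include "pin.H"
#include <stdio.h>

static UINT64 icount = 0;

/* Analysis routine: executed before every instrumented instruction. */
VOID docount() { icount++; }

/* Instrumentation routine: called the first time Pin encounters each
   instruction; it requests that docount() run before the instruction. */
VOID Instruction(INS ins, VOID *v) {
    INS_InsertCall(ins, IPOINT_BEFORE, (AFUNPTR)docount, IARG_END);
}

/* Called when the instrumented program exits. */
VOID Fini(INT32 code, VOID *v) {
    fprintf(stderr, "executed %llu instructions\n", (unsigned long long)icount);
}

int main(int argc, char *argv[]) {
    PIN_Init(argc, argv);
    INS_AddInstrumentFunction(Instruction, 0);
    PIN_AddFiniFunction(Fini, 0);
    PIN_StartProgram();   /* never returns */
    return 0;
}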
Another foundational framework is Valgrind, whose development began around 2000 and which saw its first stable release in 2002; it emphasizes heavyweight instrumentation through its core simulator and tool-specific modules, such as Memcheck for memory error detection.[105] Valgrind's design supports shadow value tracking to detect undefined behaviors, and its modular architecture allows extension via custom tools.[106] These frameworks share common features like APIs for defining instrumentation routines and mechanisms for handling dynamic code generation, but differ in optimization strategies—PIN and DynamoRIO focus on low-overhead JIT compilation, while Valgrind prioritizes accuracy over speed.[16]
In practice, these frameworks are widely used for malware analysis, where they enable runtime monitoring of suspicious binaries without requiring source code or recompilation.[107] For instance, PIN has been employed to trace API calls and memory accesses in evasive malware samples, revealing hidden behaviors that static analysis might miss.[108] However, instrumentation introduces performance overhead; benchmarks show PIN typically causes 5-20x slowdowns depending on the analysis intensity, with lighter tools achieving closer to 3x and heavier ones exceeding 10x on SPEC CPU benchmarks.[16] DynamoRIO exhibits similar overheads, around 4-15x, while Valgrind's emulation-based approach can reach 20-50x for detailed checks like Memcheck.[109]
Recent evolutions in these frameworks address emerging architectures and environments. As of 2025, DynamoRIO provides mature support for ARM64 (AArch64), enabling efficient instrumentation on mobile and server processors with features like stolen register management for low-latency analysis.[110] Valgrind has also extended to ARM64, with optimizations for its tools on this architecture.[111] For WebAssembly, new frameworks like Wastrumentation, introduced in 2025, offer portable dynamic instrumentation via binary rewriting, supporting intercession across Wasm runtimes for web-based analysis.[112] These advancements ensure binary instrumentation remains relevant for cross-platform dynamic program analysis in diverse computing ecosystems.
Language-Specific Tools
Language-specific tools for dynamic program analysis are designed to integrate directly with the runtime environments of particular programming languages, such as virtual machines or interpreters, enabling efficient instrumentation for tasks like profiling, memory monitoring, and error detection. These tools often achieve lower overhead by leveraging language-specific features, such as just-in-time compilation or built-in debugging interfaces, compared to generic approaches that operate on compiled binaries.
In Java, the Java Virtual Machine Tool Interface (JVMTI) serves as a native programming interface that allows agents to inspect JVM state, control application execution, and perform dynamic instrumentation for debugging and monitoring. JVMTI supports event-based notifications and method redefinition, facilitating real-time analysis without requiring source code modifications. JProfiler, a widely used commercial profiler, builds on JVMTI to provide advanced heap analysis, including leak detection through object retention tracking and allocation telemetry, with configurable sampling modes that minimize runtime impact to under 5% CPU in low-overhead scenarios.
For Python, the built-in cProfile module implements deterministic profiling to capture function call statistics, including execution time and invocation counts, as a C extension to ensure suitable performance for long-running programs. This tool helps identify computational bottlenecks by generating sortable reports of hotspots. The third-party memory_profiler package extends this capability to memory analysis, offering line-by-line tracking of resident set size (RSS) changes to pinpoint leaks and high-allocation code paths, though it introduces moderate overhead due to frequent system calls for memory queries.
In C and C++, the AddressSanitizer (ASan), developed by Google and integrated into the LLVM compiler infrastructure since 2012, instruments code to detect memory errors such as buffer overflows, use-after-free, and stack overflows, achieving an average runtime slowdown of approximately 2x and a 2x memory increase across benchmarks. Complementing ASan, ThreadSanitizer (TSan) focuses on concurrency analysis by detecting data races through shadow memory tracking and happens-before ordering, with typical overheads ranging from 5x to 15x depending on thread intensity.
For JavaScript in Node.js, the inspector module exposes an API to the underlying V8 engine, allowing sessions for protocol-based debugging, tracing, and performance profiling without halting execution. This enables capture of V8-specific traces, such as garbage collection events and script execution timelines, supporting remote attachment for live analysis in server environments.
These tools generally exhibit reduced overhead in virtual machine-based languages like Java and Python due to native runtime support for instrumentation; for instance, JVMTI-based profiling often incurs 2x to 5x slowdowns, compared to up to 10x for equivalent concurrency checks in native C++ via TSan. In contrast, non-VM languages like C++ may rely on binary frameworks as alternatives for broader compatibility, though at potentially higher costs.
Integrated Development Environments
Integrated development environments (IDEs) embed dynamic program analysis through plugins and native features that facilitate real-time monitoring and diagnostics, allowing developers to inspect runtime behavior seamlessly within their coding workflow. In Visual Studio, the Performance Profiler integrates dynamic instrumentation to capture CPU usage, memory allocations, and other metrics during application execution, enabling on-the-fly analysis of hot paths and bottlenecks in release builds.[113] Similarly, JetBrains Rider employs Dynamic Program Analysis (DPA) as a background service that automatically detects memory issues, such as hidden allocations from LINQ queries or async operations, with minimal overhead to support proactive optimization.[114]
Specific examples illustrate this integration's depth. Eclipse's Test and Performance Tools Platform (TPTP) provides extensible frameworks for dynamic tracing and profiling, allowing developers to monitor execution traces, log data, and performance metrics directly from the IDE across embedded and enterprise systems.[115] In IntelliJ IDEA, built-in code coverage serves as a dynamic analysis tool that instruments tests to track executed code lines, branches, and conditions, highlighting gaps in test suites to guide refinement efforts.[116]
Dynamic analysis extends beyond individual IDE sessions into broader workflows, particularly through continuous integration and continuous delivery (CI/CD) pipelines. GitHub Actions, for instance, automates fuzzing—a form of dynamic input generation—to test code changes for vulnerabilities by running randomized inputs against applications, integrating results back into pull requests for immediate feedback.[117] This setup ensures dynamic checks occur at scale during builds, leveraging language-specific fuzzers like Go's native tools to uncover runtime errors early in the development cycle.
The benefits of such integrations include intuitive visual dashboards and automated reporting that enhance developer productivity. Visual Studio's tools feature timeline graphs and call-tree visualizations to display real-time CPU and memory trends, while snapshot diffs automate identification of leaks or inefficiencies, such as growing object counts in specific classes.[113] IntelliJ's coverage reports generate color-coded views of executed versus untested code, producing exportable summaries that quantify test effectiveness and prioritize fixes, thereby reducing debugging time and improving overall software reliability.[116] In Rider, DPA's automated alerts and allocation viewers provide concise reports on memory pressure, enabling quick interventions like struct conversions to cut garbage collection overhead.[114]
As of 2025, trends emphasize cloud-based dynamic analysis for greater scalability and collaboration, with tools like Dynatrace and AppDynamics integrating runtime profiling into cloud IDEs such as GitHub Codespaces, allowing distributed teams to perform resource-intensive analyses without local hardware limitations.[118] These platforms support serverless execution of dynamic tests, aligning with multi-cloud strategies to handle large-scale applications more efficiently.
However, challenges persist in applying dynamic analysis within IDEs, particularly scalability for large codebases where extensive instrumentation can impose runtime overhead, limiting its use in production-like environments.[119] Plugin compatibility also poses issues, as evolving IDE architectures may disrupt integrations with third-party dynamic tools, requiring frequent updates to maintain seamless operation across diverse development setups.[120]