Fact-checked by Grok 2 weeks ago

Daala

Daala is the codename for a royalty-free video codec project developed by the in collaboration with the and other contributors, aimed at creating an open-source alternative to patented standards like H.265 with superior efficiency and perceptual quality. Announced in June 2013, Daala sought to advance beyond traditional block-based coding by incorporating novel techniques to avoid patent encumbrances and improve performance for streaming. Key innovations included lapped transforms for reducing blocking artifacts without deblocking s, perceptual vector quantization (PVQ) for better handling of visual noise sensitivity, and overlapped block motion compensation (OBMC) to enhance motion smoothness. Additional features encompassed a multi-symbol coder for efficient , chroma-from-luma to leverage correlations between color channels, and a directional deringing to mitigate artifacts around edges. By 2016, Daala had demonstrated compression performance surpassing H.264 but trailing behind more mature codecs like HEVC, with ongoing refinements targeting real-time encoding and low-latency applications. Following the formation of the in 2015, development efforts shifted focus thereafter, with several Daala technologies—such as its deringing filter—integrated into the codec by the consortium, including Xiph.Org and . Although active work on Daala as a standalone codec concluded without a final release, its research legacy influenced royalty-free video standards, emphasizing perceptual optimization and patent avoidance in modern encoding.

Project Overview

Development and Sponsorship

Daala emerged as a collaborative effort between the and the , with development led by Timothy B. Terriberry, who initiated experimental code for the project in 2010. This partnership aimed to develop a next-generation free from patent encumbrances, building on the open-source traditions of prior Xiph projects like and . Mozilla provided primary funding and resources for the initiative, while the Xiph.Org Foundation contributed expertise in multimedia codecs and open-source development. Additional involvement came from the Internet Engineering Task Force (IETF) through its NETVC working group, which explored Daala's technologies for potential standardization, alongside contributions from broader open-source communities focused on royalty-free media formats. The project formally initiated on June 20, 2013, following a that highlighted its goals for superior efficiency. An alpha prototype was released earlier on May 30, 2013, demonstrating initial encoding and decoding capabilities in a pre-pre-alpha stage. Daala was licensed under BSD-like terms, ensuring it was , with contributors providing explicit royalty-free patent grants to prevent encumbrances similar to those in codecs. This licensing model facilitated widespread adoption and experimentation within the open-source ecosystem.

Design Goals and Objectives

Daala was developed with the primary goal of creating a , openly documented suitable for streaming and applications, deliberately avoiding patent-encumbered tools to ensure broad without licensing fees. This approach addressed the limitations of existing royalty-free formats like and , which lagged behind proprietary alternatives, by prioritizing an open development process under strong policies to foster and . Sponsored collaboratively by and the , the project aimed to deliver a format that empowers users with full control over implementation and deployment. Performance targets centered on surpassing the compression efficiency of HEVC (H.265) and , while emphasizing low , high degrees of parallelization, and designs optimized for hardware implementation. By focusing on perceptual quality rather than traditional metrics like PSNR, Daala sought to achieve superior visual results at equivalent bitrates, particularly in scenarios demanding efficient bandwidth use for web-based video. Innovation was driven by the exploration of unconventional techniques to minimize artifacts, enhance subjective image quality, and simplify adoption without financial or legal barriers. Specific objectives included the structural elimination of blocking artifacts common in block-based and the optimization of low-latency encoding to support interactive web applications and real-time communication. These priorities positioned Daala as a forward-looking alternative, emphasizing ease of integration into diverse hardware and software ecosystems.

Technical Innovations

Transform and Motion Compensation

Daala utilizes an overlapping block (DCT) as its core spatial transformation technique, applying it to blocks ranging from 4×4 to 64×64 pixels organized within 64×64 superblocks via recursive quad-tree subdivision. This approach decorrelates the signal in the while minimizing blocking artifacts inherent in non-overlapping transforms by allowing adjacent blocks to share boundary information. The variable block sizes enable adaptation to local content characteristics, such as fine details in textures requiring smaller blocks or smoother areas benefiting from larger ones. The lapped transform is realized through biorthogonal pre- and post-filters applied around the standard DCT, with the pre-filter smoothing correlations across block edges before transformation and the post-filter reconstructing the signal by merging overlapped regions. Windowing functions, based on sine-derived shapes akin to those in the modified DCT, ensure basis functions taper smoothly to zero at block peripheries, thereby eliminating sharp discontinuities and providing a more continuous frequency representation compared to traditional block DCTs. A fixed 4-point overlap is employed across all block sizes to maintain computational efficiency and simplify during encoding, avoiding the complexity of variable overlap adjustments. To accommodate directional content variations, Daala incorporates adaptive directional transforms and frequency-domain intra , where AC coefficients are predicted by copying oriented rows or columns from neighboring blocks in the transform domain. This method aligns the transform basis with local edge directions, enhancing energy compaction for textured regions and improving overall compression performance without relying on pixel-domain spatial . In the temporal domain, Daala's motion compensation employs overlapped block motion compensation (OBMC) with an adaptive grid of blocks from 8×8 to 64×64, ensuring neighboring blocks differ in size by at most a factor of two via a 4-8 mesh constraint. Predictions from multiple overlapping blocks are blended using tapering window functions to create seamless inter-frame references, reducing artifacts at motion boundaries. This frequency-compatible design integrates with the lapped transform by operating on coefficient phases through implicit shifts, bypassing explicit pixel-domain block matching and enabling efficient temporal redundancy removal.

Quantization and Entropy Coding

Daala employs Perceptual Vector Quantization (PVQ) as its primary quantization method for transform coefficients, diverging from traditional scalar quantization to enhance perceptual quality and compression efficiency. PVQ operates in the on the (AC) coefficients produced by Daala's lapped (DCT), approximating the transform outputs through a gain-shape approach without relying on per-coefficient scalar operations. This technique, adapted from the CELP mode of the , separates each vector of AC coefficients into a scalar gain representing the overall magnitude and a vector indicating the directional of energy across frequencies, thereby preserving textural details and reducing artifacts like blocking or ringing. In PVQ, the gain is perceptually adjusted using a companding factor \alpha = 1/3, computed as \gamma = g^{1-\alpha} where g is the raw gain, to allocate finer quantization resolution to lower-contrast regions while allowing coarser steps in high-contrast areas, mimicking the human visual system's (HVS) reduced sensitivity to noise in textured regions—a process known as activity masking. The shape is quantized using a normalized pyramid vector quantizer, where codewords are integer vectors of length N-1 (for N coefficients) summing to an integer K pulses, distributed uniformly on an (N-1)-dimensional spherical surface to ensure energy conservation and avoid low-pass blurring effects common in scalar methods. This spherical vector coding enables efficient approximation of the lapped transform's overlapping blocks by encoding residuals after frequency-domain prediction, such as via Householder reflections that align the predictor along one axis, leaving the residual with N-1 degrees of freedom. For practical implementation, Daala uses precomputed lookup tables to map these pyramid codes to indices without exhaustive searches, facilitating fast encoding and decoding while maintaining perceptual fidelity. Frequency-domain quantization in Daala further incorporates HVS models through adaptive weighting of bands, dividing AC coefficients into perceptual bands (e.g., one band for 4x4 blocks, up to seven for 16x16 blocks) to prioritize low-frequency components where the eye is more sensitive, while applying the companded gain per band to exploit contrast masking without additional signaling overhead. This results in bitrate savings of approximately 13-25% over scalar quantization equivalents, as demonstrated in evaluations on still images and video sequences. Entropy coding in Daala integrates seamlessly with PVQ via a multi-symbol range coder, an adaptation of optimized for video data streams. The range coder encodes symbols with up to 16 possible values per operation, reducing the total number of symbols and associated overhead compared to binary arithmetic coders like CABAC, while using piecewise integer arithmetic to avoid costly multiplications. Probability models are maintained adaptively for PVQ coefficients (via known pulse counts K for magnitude or ), motion parameters from overlapped block compensation, and prediction modes, with frequencies updated via simple SIMD instructions and a slight bias toward the most probable symbol (zero) to minimize bitrate impact—adding only about 1% overhead. This low-overhead design supports efficient handling of the variable-length codes from PVQ shapes and gains, contributing to Daala's overall compression performance.

Post-Processing and Parallelization Features

Daala incorporates post-processing techniques to enhance the quality of reconstructed frames after the transform, primarily through its directional deringing filter, which addresses introduced by quantization while preserving image details. This operates on 8×8 blocks, first estimating the dominant edge using a minimum sum of squared differences to identify one of eight possible directions, then applying a 7-tap conditional replacement along that . The replaces pixels—those differing from the center by more than an adaptive threshold proportional to the quantization step size—with the center value, effectively acting as a constrained that reduces ringing without introducing new artifacts or excessive blurring across edges. A secondary 5-tap is then applied orthogonally to smooth flat areas, resulting in an effective 35-tap that preserves directional details. In addition to deringing, Daala employs low-complexity deblocking strategies tailored for edge preservation and real-time internet applications. Traditional deblocking is largely obviated by the codec's lapped transforms, which apply fixed biorthogonal filters overlapping block boundaries to mitigate blocking artifacts during reconstruction; however, supplementary bilinear smoothing on superblock edges further reduces visible discontinuities with minimal computational overhead. Experimental adaptive loop filtering achieved up to 2.6% bitrate savings on video. These post-processing steps collectively refine the output for perceptual quality, focusing on artifact reduction in constrained bandwidth scenarios. For parallelization, Daala's design emphasizes scalability through tile-based encoding and decoding, dividing frames into independent rectangular tiles that enable multi-threaded processing on multi-core CPUs and GPU acceleration. Superblocks (up to 64×64 pixels) within tiles are processed autonomously, with lapped transform filtering ordered recursively—interiors first in parallel, followed by edges—to eliminate interdependencies and support wavefront-style parallelism. This approach avoids serial chains in and entropy decoding, allowing up to four symbols to be coded simultaneously via a multi-symbol coder, which reduces latency and facilitates SIMD instruction utilization for vectorized operations like deringing. pipelines benefit from these choices, as the absence of cross-tile dependencies permits efficient parallel execution in real-time encoding for delivery.

Development History

Origins and Key Milestones

Daala emerged as the successor to , Xiph.Org Foundation's open-source released in 2004, with the goal of advancing royalty-free video compression technology. Initial research and planning for a next-generation began in the early , focusing on perceptual coding techniques to address limitations in efficiency and quality observed in Theora and contemporary formats like VP8. By January 2012, Xiph.Org developer Timothy Terriberry presented foundational concepts on video coding at the Linux.conf.au conference in , laying the groundwork for Daala's innovative approach. A major milestone occurred on May 30, 2013, when the alpha prototype successfully encoded and decoded its first video streams, enabling the initial live streaming of Daala-encoded video over the just two hours later, demonstrated by Mozilla engineer David Richards. This pre-pre-alpha release marked the project's transition from theoretical research to practical implementation, with hosted on Xiph.Org's repositories for public access and contribution. Weekly progress meetings, held Tuesdays at 9 AM Pacific Time via Mumble on mf4.xiph.org, facilitated collaborative refinement, with agendas and minutes shared publicly to encourage community involvement. Early technical prototypes highlighted Daala's novel features, including implementations of perceptual (PVQ) for efficient frequency-domain coding and lapped transforms to minimize blocking artifacts without traditional discrete cosine transforms. Public demos of these prototypes, such as the PVQ demonstration showcasing adaptive quantization control, were released to illustrate potential quality gains. These advancements were presented at IETF NetVC meetings, including sessions in 2015 on time-domain lapped transforms. From 2013 to 2015, the project emphasized patent avoidance through unconventional designs like overlapped-block and lapped transforms, diverging from patented block-based methods in standards like H.264. Efforts also targeted complexity reduction, optimizing for lower computational demands in encoding and decoding to suit real-time applications, while maintaining high visual quality via perceptual optimizations. Sponsorship from the supported these phases, enabling focused experimentation.

Involvement in Standards and Transition to AV1

Daala played a significant role in the Internet Engineering Task Force's (IETF) and Endpoint Video Codec (NETVC) initiative, which aimed to develop an open-source standard for applications. On March 24, 2015, during the NETVC Birds-of-a-Feather (BoF) session at IETF 92 in , , Daala was presented as a , highlighting its innovative techniques such as Perceptual Vector Quantization (PVQ) for efficient coefficient encoding and lapped transforms to reduce blocking artifacts. This presentation underscored Daala's potential to advance beyond existing standards like H.264. Following positive feedback from the BoF, the IETF chartered the NETVC on May 18, 2015, with Daala's developers actively contributing proposals and participating in subsequent sessions to refine requirements and evaluation criteria. In parallel with NETVC efforts, Daala's development intersected with broader industry initiatives to unify open video codec projects. On September 1, 2015, the formation of the (AOM) was announced, integrating Daala as a key contributor alongside Google's and Cisco's Thor to develop the codec. Xiph.org and the , primary sponsors of Daala, joined AOM as promoter members, committing to collaborate on a single, high-performance, royalty-free alternative to patented codecs like HEVC. This merger was driven by the need to consolidate fragmented open-source efforts, avoiding duplication and accelerating progress toward a competitive standard that could serve streaming, conferencing, and web video without licensing fees. As development progressed under AOM, Daala's codebase and core innovations were systematically folded into the new , with Xiph.org developers porting elements like PVQ and frequency-domain deringing directly into the . By 2016, Daala's standalone development shifted focus toward contributions, and the project effectively wound down between 2016 and 2017, with the last major commits to its occurring around 2017 as resources redirected to AOMedia's unified effort. This transition marked the culmination of Daala's independent phase, ensuring its perceptual coding advancements influenced the next generation of open video standards.

Legacy and Impact

Contributions to AV1

Daala's perceptual (PVQ), a gain-shape technique originally developed for efficient coding of transform coefficients while prioritizing perceptual quality, was proposed for and experimentally integrated into prototypes. This replaced conventional scalar quantization methods in tests, enabling better allocation of bits to perceptually significant components through separate quantization of () and () parameters. Experimental into demonstrated BD-rate s of approximately 4-5% using perceptual metrics like MS-SSIM, confirming PVQ's superiority over scalar approaches in subjective quality without excessive complexity increases. Daala's innovations in lapped transforms and directional processing, designed to minimize blocking artifacts and enhance edge preservation, influenced the evolution of AV1's intra-prediction modes and frequency-domain tools. Although full lapped transforms were not adopted due to compatibility with AV1's block-based structure, Daala's frequency-domain intra prediction research directly shaped AV1's prediction, which adapts luma-derived chroma signals in the transform domain for improved color fidelity. These influences contributed to AV1's more flexible directional intra modes, supporting up to 56 angles for better adaptation to content edges compared to prior codecs. The Constrained Directional Enhancement Filter (CDEF) in directly incorporates Daala's directional deringing filter, originally derived from its Intra Paint tool for edge-directed artifact reduction. Daala's filter used conditional replacement with 1D directional taps (7-tap along edges, 5-tap across) to suppress ringing while preserving details, but refined it by merging with Cisco's Constrained (CLPF) to add low-pass smoothing and stricter edge constraints, ensuring broader hardware compatibility and reduced blurring across high-contrast boundaries. This hybrid approach provides with adaptive filtering strengths signaled per 64x64 , enhancing deringing without the full complexity of Daala's spatial-domain original. Collectively, these Daala-derived tools—CfL and CDEF—formed a core part of 's novel features, enabling the to achieve 20-30% better compression efficiency over in typical video sequences through improved perceptual coding and artifact mitigation.

Performance Comparisons and Evaluations

Daala prototypes demonstrated competitive compression efficiency in evaluations conducted between 2014 and 2016, often approaching or matching established codecs like and HEVC on standard test sequences, though results varied by content type and metric. For instance, early benchmarks indicated Daala provided similar bitrate requirements to for equivalent quality levels in sequences with complex motion, such as sports footage, while perceptual metrics highlighted advantages in artifact reduction. Objective assessments, including those in the 2016 paper by Valin et al., utilized metrics like PSNR, SSIM, and perceptual variants (PSNR-HVS-M, FastSSIM) on the ntt-short-1 test set via the Are We Compressed Yet? framework. Daala showed slightly lower performance than HEVC in average PSNR and SSIM but excelled in perceptual quality, particularly in preserving low-contrast textures and reducing , leading to subjective gains over block-based codecs in demos. These evaluations underscored Daala's focus on human visual system modeling, contributing to its influence on , which achieved 25-30% better efficiency than HEVC at comparable quality through integrated Daala innovations. Early Daala implementations faced limitations in , requiring more processing resources than or HEVC due to techniques like lapped transforms and pyramid , which increased encoding times in prototypes. These issues were mitigated in adaptations by hybridizing with elements, balancing efficiency and speed. Notably, pure Daala saw no widespread production deployment, as development shifted to the collaborative standard. As of 2025, Daala's legacy endures through AV1's broad adoption in streaming, where platforms like and leverage its perceptual enhancements for efficient delivery, reducing bandwidth by up to 30% over legacy codecs while maintaining high quality.

References

  1. [1]
    Daala video compression - Xiph.org
    Daala is the code-name for a new video compression technology. The effort is a collaboration between Mozilla Foundation, Xiph.Org Foundation and other ...
  2. [2]
    [PDF] Daala: Building A Next-Generation Video Codec From ... - arXiv
    Aug 5, 2016 · We show that Daala is approaching the performance level of more mature, state-of-the art video codecs and can contribute to improving AV1. I.
  3. [3]
    next generation video: Introducing Daala - Xiph.org
    Jun 20, 2013 · Daala tries for a larger leap forward— by first leaping sideways— to a new codec design and numerous novel coding techniques. In addition to the ...
  4. [4]
    next generation video: Introducing AV1 - Xiph.org
    Jun 26, 2018 · This is an earlier deringing filter developed during research for the Daala codec that contributed to the CDEF used in AV1. Daala: Painting ...
  5. [5]
    Revisiting Daala Technology Demos
    Jun 6, 2016 · With pieces of Daala being contributed to the Alliance for Open Media's AV1 video codec, now seems like a good time to go back over the demos ...
  6. [6]
    next generation video: Introducing AV1 - Xiph.org
    Org's Daala codec as starting point. AV1 leapfrogs the performance of VP9 and HEVC, making it a next-next-generation codec. The AV1 format is and will always be ...
  7. [7]
    Aiming for a standardized, high-quality, royalty-free video codec to ...
    Sep 1, 2015 · Last month, Cisco contributed its Thor video codec to the NETVC effort, which joins Mozilla's Daala codec as input to our work. I'm excited ...Missing: involvement | Show results with:involvement
  8. [8]
    Daala - XiphWiki - Xiph.org
    May 25, 2021 · Daala is the codename for a new video compression technology. The effort is a collaboration between the Mozilla Foundation, the Xiph.Org Foundation and any ...
  9. [9]
    [PDF] Perceptually-Driven Video Coding with the Daala Video Codec
    The Daala project is a royalty-free video codec that attempts to compete with the best patent-encumbered codecs. Part of our strategy is to replace core tools ...
  10. [10]
    [PDF] Daala: Building A Next-Generation Video Codec From ...
    Daala [1] is a video codec designed to explore a set of atypical techniques, outlined in Section II, to avoid the patent thickets built around most current ...Missing: goals | Show results with:goals
  11. [11]
    [PDF] The Daala Video Codec: Research Update - Xiph.org
    Jun 6, 2014 · What About Video Codecs? ○ Some existing royalty-free formats. – Theora is circa 1999/2000 technology. – VP8 solidly better than ...Missing: goals | Show results with:goals
  12. [12]
    Daala Progress in 2014 - Mozilla Research
    Dec 23, 2014 · Additionally, Daala's unique design makes still image coding one of the hard problems to solve. It's an obvious early goal to outperform other ...Missing: objectives | Show results with:objectives
  13. [13]
    Daala: Perceptual Vector Quantization (PVQ) - Jean-Marc Valin
    Nov 18, 2014 · Valin, Pyramid Vector Quantization for Video Coding, IETF draft. ... Jean-Marc's Daala documentation work is sponsored by the Mozilla Corporation.<|separator|>
  14. [14]
    [PDF] Perceptual Vector Quantization for Video Coding - arXiv
    Feb 16, 2016 · ABSTRACT. This paper applies energy conservation principles to the Daala video codec using gain-shape vector quantization.
  15. [15]
    [PDF] Daala: A Perceptually-Driven Next Generation Video Codec - arXiv
    [3] T. D. Tran, “Lapped transform via time-domain pre- and post-filtering ... Swamy, “New windowing techniques for variable- size block motion compensation,” IEE ...
  16. [16]
    [PDF] The Daala Directional Deringing Filter - Jean-Marc Valin
    This paper describes a deringing filter that takes into account the ... Valin, “A deringing filter for daala... and beyond,” https://people. xiph ...
  17. [17]
    DaalaMeeting20150407 - XiphWiki
    Apr 7, 2015 · - jm: You could have two packets per tiles. - d: Packets per tile depends on how complex the tile is. You might get 3 packets for residual ...
  18. [18]
  19. [19]
    Daala Weekly Meetings - Xiph wiki
    Nov 17, 2015 · You are encouraged to attend the weekly progress meetings by installing and using Mumble. The address is mf4.xiph.org and the port is 64738.Missing: development GitHub repositories
  20. [20]
  21. [21]
  22. [22]
    [PDF] Time Domain Lapped Transforms for Video Coding - IETF
    Jul 22, 2015 · [1] Malvar, H. and D. Staelin, "The LOT: Transform Coding Without Blocking Effects", IEEE. Transactions on Acoustics, Speech, ...Missing: Daala PVQ public demos
  23. [23]
    Perceptually-Driven Video Coding with the Daala Video Codec - arXiv
    Oct 8, 2016 · Abstract:The Daala project is a royalty-free video codec that attempts to compete with the best patent-encumbered codecs.Missing: origins pre- 2013 evolution Theora initial 2010
  24. [24]
    [PDF] NETVC BoF - IETF
    NETVC BoF. Dallas, TX, USA. Tuesday, March 24th, 2015. 0900 - 1130. Page 2. Note Well. • Any submission to the IETF intended by the Contributor ...
  25. [25]
    Internet Video Codec (netvc) - IETF Datatracker
    Changed milestone "Requirements and evaluation criteria to IESG, if the WG so chooses (Informational)", set due date to April 2018 from July 2016, added draft- ...
  26. [26]
    Commits · xiph/daala
    - **Most Recent Commit Date**: Insufficient relevant content to determine the exact date.
  27. [27]
    [PDF] Applying Perceptual Vector Quantization Outside Daala - Xiph.org
    The Daala video coding project includes many less-conven- tional techniques aimed at improving the perceptual quality of video [1]. Some are easily integrated ...Missing: goals | Show results with:goals<|control11|><|separator|>
  28. [28]
    AV1: next generation video - The Constrained Directional ...
    Jun 28, 2018 · Here we see a sharp edge (left) and its DCT transform coefficients (right). The energy of the original edge is spread through the DCT output ...Filtering In Av1 · Modern Cdef · Direction Search<|control11|><|separator|>
  29. [29]
    The AV1 Constrained Directional Enhancement Filter (CDEF) - arXiv
    Feb 18, 2016 · The filter works by identifying the direction of each block and then adaptively filtering with a high degree of control over the filter strength ...Missing: Daala | Show results with:Daala
  30. [30]
    (PDF) Contemporary video compression standards: H.265/HEVC ...
    Experimental results show that H. 265/HEVC on average provides 37–40% bitrate savings, VP9 provides 9– 11%, VP10 – 10–12% bitrate savings, while Daala provides ...
  31. [31]
    Contemporary video compression standards: H.265/HEVC, VP9 ...
    10–12% bitrate savings, while Daala provides 8–9% bitrate overhead on average.Missing: benchmarks | Show results with:benchmarks
  32. [32]
    AV1 vs HEVC: Which Codec is Best for You? - Gumlet
    May 3, 2023 · AV1 has better video quality, higher compression, and is royalty-free, but slower encoding. HEVC has better hardware support, but is licensed ...<|separator|>
  33. [33]
    AV1 could improve streaming, so why isn't everyone using it?
    Apr 3, 2025 · Google began testing AV1 on YouTube in 2018, while Netflix added support for AV1 in 2021. Amazon Prime Video also adopted AV1 in 2021, and ...
  34. [34]
    The State of the Video Codec Market 2025 - Streaming Media
    Mar 28, 2025 · AV1, much like VVC, has seen its share of ecosystem development, but other than AOMedia stalwarts like YouTube, Netflix, and Meta, deployments ...