
Dotted circle

The dotted circle (◌) is a Unicode character with the code point U+25CC, classified as an other symbol (So) within the Geometric Shapes block (U+25A0–U+25FF) of the Basic Multilingual Plane, serving primarily as a typographic placeholder to visualize the positioning of combining marks such as diacritics relative to a base character. Introduced in Unicode version 1.1 in June 1993, it belongs to the Common script category (Zyyy) and has a bidirectional class of Other Neutral (ON), with no mirroring property. In practice, the dotted circle functions as a neutral element in rendering systems and documentation, often paired with combining characters, for instance a circumflex accent (U+0302), to demonstrate glyph attachment without implying any particular base letter. Its reference glyph in the Unicode Standard is intentionally larger than the dotted circle glyph that the code charts use to indicate combining marks. Encoded in UTF-8 as E2 97 8C, in UTF-16 as 25CC, and in HTML as the numeric character reference &#9676; or &#x25CC;, it remains a foundational tool in font design, text processing, and conformance testing.

Unicode Specification

Code Point and Category

The dotted circle is assigned the Unicode code point U+25CC and bears the official name "DOTTED CIRCLE". This assignment places it within the Geometric Shapes block, which spans code points U+25A0 through U+25FF and was established as part of Unicode version 1.1, released in 1993. The block encompasses various geometric symbols intended for use in diagrams, charts, and typographic illustrations.

In the Unicode character classification system, the character falls under the general category Symbol, Other (So), which includes non-letter, non-number symbols that do not belong to more specific symbol subclasses. Its bidirectional class is Other Neutral (ON), meaning it does not inherently initiate or terminate directional runs and is treated neutrally in rendering algorithms for scripts like Arabic or Hebrew. Additionally, it has a canonical combining class of 0, signifying that it is a base character and not subject to reordering with diacritics or other combining marks during normalization.

The character supports standard encodings, including UTF-8 as the byte sequence E2 97 8C and UTF-16 as the single 16-bit code unit 25CC (bytes 25 CC in big-endian order). For HTML representation, it is written as the decimal reference &#9676; or the hexadecimal reference &#x25CC;. Unlike many Unicode characters, the dotted circle has no decomposition mapping, neither canonical nor compatibility, and thus is not equivalent to any sequence of other characters under normalization forms like NFC or NFD. It also lacks case mappings, being neither an uppercase, lowercase, nor titlecase variant of any other character.
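The properties listed above can be checked against the Unicode Character Database bundled with a programming language. The following is a minimal sketch using Python's standard unicodedata and html modules; the helper name describe is illustrative only, and the values reported depend on the Unicode version shipped with the interpreter.

```python
import html
import unicodedata

DOTTED_CIRCLE = "\u25cc"


def describe(ch: str) -> dict:
    """Collect the properties discussed above for a single character."""
    return {
        "name": unicodedata.name(ch),
        "general_category": unicodedata.category(ch),
        "bidi_class": unicodedata.bidirectional(ch),
        "combining_class": unicodedata.combining(ch),
        "mirrored": bool(unicodedata.mirrored(ch)),
        "decomposition": unicodedata.decomposition(ch),  # empty string if none
        "utf8": ch.encode("utf-8").hex(" "),
        "utf16_be": ch.encode("utf-16-be").hex(" "),
    }


props = describe(DOTTED_CIRCLE)
for key, value in props.items():
    print(f"{key:17} {value!r}")

# With no decomposition mapping, every normalization form leaves it unchanged.
unchanged = all(
    unicodedata.normalize(form, DOTTED_CIRCLE) == DOTTED_CIRCLE
    for form in ("NFC", "NFD", "NFKC", "NFKD")
)

# Both numeric character references decode to the same character.
refs_match = html.unescape("&#9676;") == html.unescape("&#x25CC;") == DOTTED_CIRCLE

# No case mappings: upper- and lowercasing are no-ops.
caseless = DOTTED_CIRCLE.upper() == DOTTED_CIRCLE.lower() == DOTTED_CIRCLE
```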

Glyph Design and Rendering Notes

The reference glyph for the dotted circle (U+25CC) in the Unicode Standard is intentionally designed to be larger than the dotted circle glyph used in the code charts to visualize combining characters, enhancing its visibility in documentation and charts. In font implementations, the dotted circle is commonly rendered as a circle whose outline is drawn with dots or short dashes rather than a solid line, with variations in size, thickness, and style across typefaces to suit different design contexts. However, it must be designed such that it does not behave as a standard base glyph for unintended combinations, preserving its role as a neutral symbol.

When rendered standalone, the dotted circle occupies horizontal space like other symbols and may influence line breaks according to its Unicode line breaking class. In complex text layouts, it adheres to Unicode bidirectional rules as an Other Neutral (ON) character, taking on the direction of the surrounding text without initiating or altering directional runs.

Font designers are recommended to include the dotted circle in fonts supporting combining marks, positioning diacritics relative to it as they would to base glyphs; this ensures compatibility for testing mark placement without impacting the font's core metrics or tables.

Primary Functions

Placeholder for Combining Characters

The dotted circle (U+25CC) functions primarily as a typographic placeholder to visualize the attachment and positioning of combining characters, such as diacritics, when no base glyph is present in a sequence. This allows for clear demonstration of how marks like the combining acute accent (U+0301) would align relative to a hypothetical base character, without relying on a specific letter or symbol. For instance, the sequence U+25CC followed by U+0301 renders as a dotted circle with the acute accent positioned above it (◌́), illustrating the mark's typical placement above the base.

In Unicode code charts, the dotted circle is conventionally paired with representative glyphs for combining characters to depict their intended appearance and anchoring points, ensuring that the placeholder itself is not part of the actual mark's design. This practice extends to font development, where font designers include a glyph for U+25CC to test and proof the rendering of isolated combining marks, verifying alignment and spacing in tools like shaping engines. Educational materials and documentation also employ this technique to teach the behavior of combining sequences, providing visual clarity on how diacritics interact with bases in various scripts.

Certain rendering systems handle defective sequences, such as orphaned combining marks without a preceding base, by automatically inserting a dotted circle to signal the error and display the mark's position. This diagnostic feature aids developers and users in identifying malformed text, though it is not a universal requirement and varies by implementation.
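The diagnostic insertion described above can be approximated in a few lines. The following is a simplified sketch, not the algorithm of any particular rendering system: it treats letters, numbers, punctuation, symbols, and spaces as possible bases and inserts U+25CC before any run of combining marks that has none. The function names are hypothetical.

```python
import unicodedata

DOTTED_CIRCLE = "\u25cc"


def is_mark(ch: str) -> bool:
    """True for combining marks (general categories Mn, Mc, Me)."""
    return unicodedata.category(ch).startswith("M")


def can_carry_marks(ch: str) -> bool:
    """Rough approximation of a base: any graphic character that is not a mark."""
    category = unicodedata.category(ch)
    return category[0] in "LNPS" or category == "Zs"


def show_orphaned_marks(text: str) -> str:
    """Insert a dotted circle before each run of marks that lacks a base."""
    out = []
    has_base = False
    for ch in text:
        if is_mark(ch):
            if not has_base:
                out.append(DOTTED_CIRCLE)
                has_base = True  # later marks in the run share the placeholder
        else:
            has_base = can_carry_marks(ch)
        out.append(ch)
    return "".join(out)


print(show_orphaned_marks("\u0301abc"))   # mark at start of text gets a placeholder
print(show_orphaned_marks("e\u0301"))     # well-formed sequence is left untouched
```

Real shaping engines apply script-specific cluster rules on top of this basic idea, so the sketch is best read as an illustration of the principle only.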

Indication of Positioning in Complex Scripts

In complex script rendering, the dotted circle (U+25CC) serves as a placeholder to visualize the intended positioning of combining marks, such as vowel signs or matras, during glyph reordering, splitting, or attachment processes governed by layout rules. In scripts like Devanagari or other Brahmic systems, where logical sequences do not match visual order, shaping engines reorder elements, such as placing a pre-base vowel sign before its consonant base, and use the dotted circle to indicate attachment sites for marks that would otherwise lack a visible base, ensuring accurate representation of provisional compositions. This mechanism is particularly integral to the GSUB (Glyph Substitution) and GPOS (Glyph Positioning) tables in fonts, which handle substitutions for split glyphs (e.g., matras dividing across a base consonant) and positioning adjustments for stacked or cursive forms in complex clusters.

For instance, in an invalid or incomplete sequence like a standalone vowel sign (e.g., U+093F DEVANAGARI VOWEL SIGN I), the shaping engine inserts a dotted circle to form a cluster like ◌ि, revealing the mark's attachment position and highlighting reordering needs. Similarly, for multiple marks requiring explicit positioning, the dotted circle can be inserted in the text stream followed by the marks in canonical order, allowing GPOS lookups to apply base-to-mark and mark-to-mark adjustments.

The dotted circle also aids in debugging font shaping engines by appearing in incomplete compositions, such as defective clusters starting with a combining mark, where it denotes broken syllables and facilitates validation against script-specific rules without altering the underlying text. This supports accurate rendering of reordered elements, like subjoined consonants or virama-mediated stacks, by providing a generic base for mark attachment in GSUB/GPOS-driven layouts.
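Placing marks "in canonical order" after an explicitly inserted dotted circle can be illustrated with Unicode normalization, which sorts adjacent combining marks by their canonical combining class. The sketch below assumes that NFD ordering is what is wanted for the test string; the helper name on_dotted_circle is hypothetical, and no font shaping is performed.

```python
import unicodedata

DOTTED_CIRCLE = "\u25cc"


def on_dotted_circle(marks: str) -> str:
    """Return U+25CC followed by the given marks in canonical order."""
    for ch in marks:
        if not unicodedata.category(ch).startswith("M"):
            raise ValueError(f"U+{ord(ch):04X} is not a combining mark")
    # NFD leaves the dotted circle alone (it has no decomposition) and
    # reorders the marks after it by canonical combining class.
    return unicodedata.normalize("NFD", DOTTED_CIRCLE + marks)


# Acute (class 230) typed before dot below (class 220): the two are swapped.
sample = on_dotted_circle("\u0301\u0323")
print([f"U+{ord(ch):04X}" for ch in sample])

# Devanagari virama (class 9) typed before nukta (class 7): also swapped.
indic = on_dotted_circle("\u094d\u093c")
print([f"U+{ord(ch):04X}" for ch in indic])
```

The resulting string is what a font proof or test page would hand to the shaping engine, which then applies the font's own mark positioning.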

Applications in Writing Systems

In Abugida Scripts

In abugida scripts of South Asia, such as Devanagari and Tamil, the dotted circle (U+25CC) serves as a placeholder to visualize the positioning of matras, the dependent vowel signs that attach to consonants on the left, right, above, or below, when no base consonant is present. This rendering aids in demonstrating how combining marks like the Devanagari vowel sign i (U+093F ◌ि) or the Tamil vowel sign aa (U+0BBE ◌ா) align relative to an implicit base glyph.

In scripts like Bengali and Malayalam, the dotted circle is particularly useful for illustrating complex forms involving the virama (halant; U+094D in Devanagari, with a counterpart in each related script), such as reph or yaphalaa, where a consonant cluster followed by a vowel sign requires explicit positioning. For instance, the sequence dotted circle + virama + vowel sign in Malayalam highlights the attachment of post-base matras in isolated testing or error scenarios.

Southeast Asian abugidas, including Thai, Lao, and Khmer, employ the dotted circle to show the stacking or surrounding placement of tone marks and vowel signs around base letters when invalid combinations occur. In Thai, for example, above-base marks without a valid base (e.g., U+0E31 THAI CHARACTER MAI HAN-AKAT) or below-base vowel signs (e.g., U+0E38 THAI CHARACTER SARA U) are rendered on the dotted circle to indicate positioning errors. Similar fallback rendering applies in Lao for diacritics without a base and in Khmer for excess vowel signs or subscript stacks exceeding syllable limits.

The dotted circle plays a key role in educational materials and documentation for abugidas, helping learners and developers visualize non-linear arrangements of combining marks and debug rendering issues in fonts or text rendering software. It is recommended for inclusion in fonts supporting these scripts to ensure consistent fallback display per font development guidelines.
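A small specimen table of the kind used in script documentation can be generated directly from character data. The sketch below is an illustration under stated assumptions: the list SAMPLE_MARKS is a hand-picked selection (Devanagari vowel sign i and virama, Tamil vowel sign aa, Thai mai han-akat and sara u), the function name placeholder_rows is hypothetical, and how each sample is drawn depends entirely on the font and shaping engine used to display it.

```python
import unicodedata

DOTTED_CIRCLE = "\u25cc"

# Hand-picked marks from the scripts discussed above (an illustrative selection).
SAMPLE_MARKS = ["\u093f", "\u094d", "\u0bbe", "\u0e31", "\u0e38"]


def placeholder_rows(marks):
    """Build (code point, name, general category, sample) rows for each mark."""
    rows = []
    for mark in marks:
        rows.append(
            (
                f"U+{ord(mark):04X}",
                unicodedata.name(mark),
                unicodedata.category(mark),  # Mn = nonspacing, Mc = spacing mark
                DOTTED_CIRCLE + mark,        # the mark shown on the placeholder
            )
        )
    return rows


rows = placeholder_rows(SAMPLE_MARKS)
for code_point, name, category, sample in rows:
    print(f"{code_point}  {category}  {sample}  {name}")
```

The category column distinguishes spacing marks (Mc), such as the Devanagari and Tamil vowel signs, from nonspacing marks (Mn), such as the virama and the Thai signs, which matters when judging how much horizontal room a sample needs.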

In Semitic and Other Scripts

In Semitic scripts, including Arabic and Hebrew, the dotted circle (U+25CC) functions as a base glyph to visualize the positioning of combining diacritics when isolated from consonants, ensuring accurate rendering of vowel signs and dot distinctions. In Arabic, it pairs with harakat, the short vowel markers such as fatḥah (U+064E ARABIC FATHA, indicating /a/), kasrah (U+0650 ARABIC KASRA, for /i/), and ḍammah (U+064F ARABIC DAMMA, for /u/), to demonstrate their attachment above or below an imaginary letter stem, which is particularly useful in educational materials and font testing for right-to-left cursive joining behaviors. Similarly, for i'jam (consonant-pointing dots that differentiate letters like ب bāʾ from ت tāʾ), the dotted circle highlights their placement without base glyph interference, preventing misrendering in complex sequences. In Hebrew, it supports niqqud (vowel points like sheva U+05B0 HEBREW POINT SHEVA or qamats U+05B8 HEBREW POINT QAMATS) and cantillation marks, with font specifications recommending a dedicated glyph for U+25CC to enable fallback display of isolated combining marks.

This placeholder role extends to Latin-based systems with diacritics, where the dotted circle aids proofreading and typographic verification for accent alignment in languages like French and Vietnamese. In French, it isolates acute (U+0301 COMBINING ACUTE ACCENT), grave (U+0300 COMBINING GRAVE ACCENT), and circumflex (U+0302 COMBINING CIRCUMFLEX ACCENT) marks to check their height and spacing relative to vowels such as é or â, ensuring legibility in editorial workflows. For Vietnamese, which features stacked diacritics including tones (e.g., U+0309 COMBINING HOOK ABOVE for the hỏi tone) and horns (U+031B COMBINING HORN), the dotted circle reveals vertical stacking order and helps avoid collisions in forms like ơ or ư, which is critical for font design in this Latin-based orthography.
In other contexts, such as legacy and constructed writing systems, the dotted circle provides a neutral anchor for diacritic experimentation and documentation. For polytonic Greek, it displays breathings (U+0314 COMBINING REVERSED COMMA ABOVE for rough breathing) and iota subscript (U+0345 COMBINING GREEK YPOGEGRAMMENI) in isolation, clarifying their positions above and below the line in ancient texts. Likewise, in Church Slavonic typography, fonts must include a U+25CC glyph to properly render combining accents and superscript letters shown in isolation, supporting the script's complex orthographic traditions. In constructed scripts, designers employ it analogously as a provisional base for testing novel diacritic attachments, mirroring its utility across diverse typographic ecosystems.
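For proofing work of the kind described in this section, the marks carried by a precomposed letter can be lifted off their base and shown on the dotted circle instead. The following is a minimal sketch using canonical decomposition (NFD) from Python's standard library; the function name marks_on_placeholder is hypothetical, and the approach only covers marks that exist as separate combining characters.

```python
import unicodedata

DOTTED_CIRCLE = "\u25cc"


def marks_on_placeholder(letter: str) -> str:
    """Decompose a letter and re-attach its combining marks to U+25CC."""
    decomposed = unicodedata.normalize("NFD", letter)
    marks = "".join(
        ch for ch in decomposed if unicodedata.category(ch).startswith("M")
    )
    return DOTTED_CIRCLE + marks


examples = {
    "French e with acute": "\u00e9",
    "French a with circumflex": "\u00e2",
    "Vietnamese o with horn and hook above": "\u1edf",
    "Greek alpha with rough breathing and iota subscript": "\u1f81",
}

for label, letter in examples.items():
    sample = marks_on_placeholder(letter)
    codes = " ".join(f"U+{ord(ch):04X}" for ch in sample)
    print(f"{letter}  ->  {sample}   {codes}   ({label})")
```

Because NFD already emits marks in canonical order, the Vietnamese horn (class 216) precedes the hook above (class 230), and the Greek rough breathing (class 230) precedes the iota subscript (class 240).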

Development and Standards

Introduction in Unicode

The dotted circle (U+25CC) was added to the Unicode Standard in version 1.1, released in June 1993, as part of the Geometric Shapes block (U+25A0–U+25FF) to support essential typographic utilities in multilingual text encoding. This block encompasses various geometric symbols designed for compatibility with existing standards and practical rendering needs. The rationale for including the dotted circle centered on the necessity for a neutral, non-letter base to demonstrate the attachment and positioning of combining characters within the emerging Unicode framework. As noted in the standard's character annotations, it serves to illustrate combining marks, such as diacritics (for example, when paired with U+0300 COMBINING GRAVE ACCENT), enabling developers and typographers to visualize rendering without script-specific biases; the reference glyph is deliberately larger than the dotted circle used in the code charts to indicate combining marks.

Role in Documentation and Testing

The dotted circle (U+25CC) serves as a standard placeholder in Unicode code charts to illustrate the positioning and rendering of combining marks relative to a base character. In these charts, combining characters, such as diacritics, are depicted by applying them to the dotted circle, which acts as a neutral stand-in for an absent base character, enabling clear visualization of mark attachment above, below, or around it. This convention has been consistent across Unicode versions, including in charts for blocks like Combining Diacritical Marks for Symbols (U+20D0–U+20FF), where examples show marks like the combining enclosing circle (U+20DD) positioned on the dotted circle.

In font development and testing, the dotted circle is integral to proofing tools for verifying metrics, attachment points, and shaping behavior in complex scripts. Tools like HarfBuzz, an open-source text shaping engine, automatically insert a dotted circle during rendering when a combining mark lacks a suitable base, allowing developers to assess positioning in complex scripts; for instance, HarfBuzz's glyph classification functions reference U+25CC explicitly for this purpose. Similarly, FontForge, a font editor, supports editing and previewing the dotted circle to ensure proper alignment of combining marks, with its metrics view facilitating checks on spacing and features for diacritics.

The dotted circle plays a key educational role in resources teaching complex script rendering and character composition. Platforms like ScriptSource utilize it to demonstrate how combining marks interact with bases in abugidas and other systems, such as indicating reordering of vowel signs or the effects of isolated diacritics without a base character. This visual aid helps learners and typographers understand glyph attachment and script-specific behaviors, as seen in documentation for Indic and Southeast Asian scripts.
Under Unicode's stability policy, the core properties of the dotted circle, such as its general category Symbol, Other (So) and its non-combining nature, are expected to remain unchanged to ensure consistency across versions, preventing disruptions in fonts and rendering tools. It is also used in the handling of defective combining character sequences, where isolated combining marks may trigger display of the dotted circle as a diagnostic aid; this is consistent with conformance guidelines that prohibit interpretation of ill-formed code unit sequences while allowing safe visualization.
