Fact-checked by Grok 2 weeks ago

Latin Extended Additional

Latin Extended Additional is a block of the Unicode Standard that provides 256 additional characters for the Latin script, spanning the code point range U+1E00 to U+1EFF. These characters consist primarily of precomposed combinations of Latin letters with one or more diacritical marks, supporting orthographic needs in various languages and scripts. Introduced in Version 1.1 of the Unicode Standard, the block addresses gaps in earlier Latin extensions by including characters for specific linguistic and historical applications. Notable subsets include diacritic variations for Irish Gaelic (such as letters with dots above, like ḃ and ṗ), Vietnamese quốc ngữ (precomposed vowels with tone marks and dots below, in the range U+1EA0–U+1EF9), and Indic transliteration (letters with underdots, like ḍ and ṇ). It also encompasses specialized forms for medievalist studies (such as long s with a dot above, ẛ) and German typography (the capital sharp S, ẞ at U+1E9E, used in uppercased contexts like titles and signage). Many characters in the block are canonically decomposable into a base letter and combining diacritics from the Combining Diacritical Marks block (U+0300–U+036F), facilitating compatibility with normalization processes. The block's design emphasizes practical extensions for phonetic accuracy, regional orthographies, and scholarly transcription, ensuring broad support for Latin-based writing systems without relying solely on dynamic composition. Non-decomposable characters, such as the small letter a with right half ring (U+1E9A ẚ), further cater to unique typographic requirements in historical and linguistic contexts. As part of the Multilingual Plane, Latin Extended Additional remains integral to text for European, African, and Asian languages using Latin scripts.

Overview

Purpose and Design

The Latin Extended Additional block, designated as U+1E00–U+1EFF in the Unicode Standard, encompasses 256 assigned code points dedicated to Latin letters, providing an extension to the Basic Latin and blocks for representing accented and diacritic-bearing characters in various orthographies. This block focuses on encoding uppercase and lowercase forms of letters modified by diacritical marks such as dots above or below, macrons, acutes, graves, cedillas, and circumflexes, enabling direct support for phonetic and orthographic needs in languages like , , and others without duplication of simpler forms already covered in prior blocks. A core design principle of this block is the use of precomposed characters, which integrate base Latin letters with one or more diacritical marks into single code points, thereby avoiding the need for complex sequences of combining marks that could complicate text processing, rendering, and in systems handling languages with stacked or multiple diacritics. This approach enhances efficiency in digital typography and software implementations, particularly for systems or environments where to decomposed forms might introduce errors or issues. Most characters in the block—approximately 246—are general Latin letters with such diacritics, supplemented by specialized subsets for applications like medievalist transcriptions and certain phonetic notations. The block's allocation underscores a commitment to phonetic accuracy in both native orthographies and scholarly transcriptions, allowing precise representation of sounds that cannot be adequately conveyed using only the unadorned letters from Basic Latin (U+0000–U+007F) or (U+0100–U+017F) alone. By prioritizing these precomposed forms, the design facilitates seamless integration into broader ecosystems, supporting diverse linguistic traditions while maintaining compatibility with Unicode's normalization algorithms for interconversion between composed and decomposed representations where needed.

Unicode Allocation

The occupies the range U+1E00 to U+1EFF, encompassing exactly 256 code points dedicated to extended Latin characters with diacritics and other modifications. As of 17.0, released in September 2025, all 256 positions within this block are fully assigned, with no unassigned or reserved code points remaining. This block is positioned in the Unicode repertoire after the Latin Extended-A (U+0100–U+017F) and Latin Extended-B (U+0180–U+024F) blocks, continuing the sequential expansion of Latin script support within the Basic Multilingual Plane (BMP). The placement in the BMP (plane 0, U+0000 to U+FFFF) ensures compatibility with systems limited to 16-bit encoding, distinguishing it from later Latin extensions in the Supplementary Multilingual Plane (plane 1).

Linguistic Applications

Support for European and Insular Languages

The Latin Extended Additional (U+1E00–U+1EFF) encodes a range of precomposed characters essential for representing the orthographies of various languages, with particular emphasis on Insular Celtic traditions such as . These characters facilitate accurate digital rendering of diacritic-modified letters used in historical and contexts across . The block's design prioritizes compatibility with while providing standalone forms for efficient . A key application is in Irish Gaelic, where the block supplies characters for the traditional orthography's lenition markers—dots above consonants indicating or softening. Representative examples include Ḃ (U+1E02, LATIN CAPITAL LETTER B WITH DOT ABOVE) and ḃ (U+1E03, LATIN SMALL LETTER B WITH DOT ABOVE) for lenited /b/, Ḋ (U+1E0A) and ḋ (U+1E0B) for lenited /d/, Ḟ (U+1E1E) and ḟ (U+1E1F) for lenited /f/, Ṗ (U+1E56) and ṗ (U+1E57) for lenited /p/, Ṡ (U+1E60) and ṡ (U+1E61) for lenited /s/, and Ṫ (U+1E6A) and ṫ (U+1E6B) for lenited /t/. These dot-above forms, proposed for encoding to support Irish Gaelic texts, differ from International Phonetic Alphabet symbols by serving purely orthographic roles in rather than broad phonetic transcription. Although modern Irish primarily uses an adjacent "h" for lenition (e.g., bh for /v/), the dotted forms remain vital for historical manuscripts, scholarly editions, and certain typographic styles. Beyond , the block supports other orthographies, including pre-1921 Latvian usage with characters like Ḑ (U+1E10, LATIN CAPITAL LETTER D WITH ) and ḑ (U+1E11, LATIN SMALL LETTER D WITH ) for palatalized /d/, as well as ẜ (U+1E9C, LATIN SMALL LETTER WITH DIAGONAL STROKE) in Blackletter-influenced texts. It also aids minority languages through diacritics like rings below (e.g., Ḁ U+1E00 for A) and other marks such as acutes and circumflexes. In modern , over 200 of the block's 256 characters are precomposed accented forms for capitals and lowercase letters, incorporating dots (52 characters), rings (2), and additional marks like acutes and circumflexes to ensure scalable support in digital fonts for these languages.

Support for Vietnamese and Southeast Asian Scripts

The Latin Extended Additional Unicode block plays a crucial role in supporting the Vietnamese orthography, known as quốc ngữ, by providing 90 precomposed characters that integrate Latin letters with essential to the language's phonology. These characters enable the representation of complex vowel modifications, including the diacritic on letters like ơ and ư, combined with indicators, which are vital for distinguishing the six tonal contours in . For instance, forms such as Ớ (U+1EDB, Latin capital letter O with and acute) and ự (U+1EE5, Latin small letter U with dot below) exemplify how the block accommodates stacked without relying solely on combining marks. Integration with tone marks is a key feature, offering precomposed variants for acute, , , , and dot below on modified vowels, such as ầ (U+1EAB, Latin small letter A with circumflex and ) and ẩ (U+1EAD, Latin small letter A with circumflex and ). This design supports the full range of tonal distinctions—unmarked level, rising acute, falling , rising glottalized hook, even , and falling glottalized dot below—directly within the . By prioritizing these precomposed sequences, the block ensures compatibility with legacy systems and input methods that may struggle with multiple combining diacritics. In regional usage, these characters are indispensable for digital text in , facilitating accurate rendering of quốc ngữ in documents, websites, and software without challenges that could arise from separate combining marks. This avoids inconsistencies in positioning or forms, promoting seamless adoption in computing environments across where is predominant. Compared to the block, which provides base forms like Ơ (U+01A1, Latin capital letter O with ) and Ư (U+01AF, Latin capital letter U with ) but lacks tone combinations, uniquely adds the and hook diacritics in conjunction with tones, completing the set needed for modern .

Historical, Medieval, and Scholarly Uses

The Latin Extended Additional block provides essential support for reconstructing and transcribing historical and medieval Latin-based texts, enabling scholars to preserve the original graphemic forms without alteration. These characters facilitate paleographic analysis of manuscripts from medieval , where Latin was adapted for various vernaculars and abbreviations. In particular, Unicode 5.1 (2008) introduced 10 dedicated characters for medievalist applications, proposed by experts including Michael Everson to address gaps in encoding medieval Welsh and other Insular traditions, as well as abbreviation systems common in paleography. Key medievalist characters include U+1E9C (ẜ, LATIN SMALL LETTER LONG S WITH DIAGONAL STROKE) and U+1E9D (ẝ, LATIN SMALL LETTER LONG S WITH HIGH STROKE), which represent variant forms of the long s used in medieval abbreviations for words like "sanctus" or "sit." These diacritic-modified longs allow precise reproduction of scribal conventions in 13th- to 15th-century manuscripts, aiding philologists in studying textual evolution. Similarly, U+1E9E (ẞ, LATIN CAPITAL LETTER SHARP S) encodes the uppercase sharp s, derived from medieval ligatures of long s and z, historically employed in German Blackletter printing and now standard for uppercase ß in scholarly editions of early modern texts. For paleographic support in Insular languages, characters such as U+1EFA (Ỻ, LATIN CAPITAL LETTER MIDDLE-WELSH LL) and U+1EFB (ỻ, LATIN SMALL LETTER MIDDLE-WELSH LL) denote the voiceless lateral fricative (/ɬ/) in medieval , appearing in manuscripts like the . These extend to related Insular uses, including notations for in contexts where similar sounds occur. Additional variants like U+1EFE (Ỿ, LATIN CAPITAL LETTER Y WITH LOOP) and U+1EFF (ỿ, LATIN SMALL LETTER Y WITH LOOP) mark the sound (/ə/) in , essential for accurate transcription of poetic and legal documents from the period. In scholarly applications, particularly , characters like U+1E9F (ẟ, LATIN SMALL LETTER DELTA) serve as phonetic symbols for dental or alveolar in reconstructions of ancient and medieval languages. Likewise, U+1EFC (Ṽ, LATIN CAPITAL LETTER MIDDLE-WELSH V) and U+1EFD (ṽ, LATIN SMALL LETTER MIDDLE-WELSH V) represent velar fricatives or labial variants in Insular , supporting detailed phonetic analyses of sound changes over time. These encodings, drawn from the Medieval Unicode Font Initiative (MUFI), ensure that academic transcriptions maintain fidelity to source materials, promoting interdisciplinary research in paleography and .

Development and Encoding

Historical Evolution

The Latin Extended Additional block was first introduced in Unicode 1.1 in June 1993, establishing an initial repertoire of 245 characters primarily to support precomposed forms with diacritics for various languages, including Gaelic orthography (such as dotted letters for , e.g., U+1E02 Ḃ LATIN CAPITAL LETTER B WITH DOT ABOVE) and basic tone marks (e.g., the range U+1EA0–U+1EF9 for letters like Ạ and ả). This allocation aligned closely with the inaugural edition of ISO/IEC 10646:1993, incorporating amendments from international efforts to extend the beyond earlier blocks like and . In 2.0, released in 1996, the block received a single addition: U+1E9B ẛ LATIN SMALL LETTER LONG S WITH DOT ABOVE, which provided a specialized form for historical and phonetic transcriptions in scripts, serving as a variant of the modern dotted s (U+1E61 ṡ) used in standardization. This refinement addressed needs from linguistic communities focused on Insular scripts, drawing on input from scholars standardizing representations of medieval and early modern texts. The block reached its current total of 256 characters with the final expansions in Unicode 5.1 in April 2008, incorporating 10 new characters tailored for medievalist and scholarly applications. These included variants of the (U+1E9C ẜ LATIN SMALL LETTER LONG S WITH DIAGONAL STROKE and U+1E9D ẝ LATIN SMALL LETTER LONG S WITH HIGH STROKE), the sharp s uppercase (U+1E9E ẞ LATIN CAPITAL LETTER SHARP S for ), a phonetic (U+1E9F ẟ LATIN SMALL LETTER DELTA), and digraphs (U+1EFA Ỻ LATIN CAPITAL LETTER MIDDLE-WELSH LL, U+1EFB ỻ LATIN SMALL LETTER MIDDLE-WELSH LL, U+1EFC Ỽ LATIN CAPITAL LETTER MIDDLE-WELSH V, U+1EFD ỽ LATIN SMALL LETTER MIDDLE-WELSH V) along with a looped y (U+1EFE Ỿ LATIN CAPITAL LETTER Y WITH LOOP, U+1EFF ỿ LATIN SMALL LETTER Y WITH LOOP). These additions stemmed from proposals by the Medieval Unicode Font Initiative (MUFI) and other experts, emphasizing compatibility with historical manuscripts and paleographic needs while integrating feedback from ISO/IEC 10646 Amendment 5.

Technical Implementation

The Latin Extended Additional block, spanning code points U+1E00 to U+1EFF, necessitates robust font implementations to ensure accurate rendering of its precomposed characters, many of which incorporate such as dots, strokes, and rings. Fonts supporting this block typically include features like 'ccmp' (contextual alternates) to handle and of glyphs, particularly for scenarios involving diacritic attachment or in legacy or decomposed forms. Additionally, the '' feature is essential for proper positioning and stacking of multiple diacritics when text is normalized to decomposed forms, preventing visual overlaps or misalignments in complex ligatures. Comprehensive support is evident in open-source fonts such as Noto Sans, which covers 100% of the block's 256 characters across its weights and styles, enabling seamless display in applications handling European and Southeast Asian scripts. Compatibility with existing Unicode infrastructure is facilitated through canonical decomposition mappings, where most characters in the block break down into base letters from the Basic Latin block (U+0000–U+007F) combined with diacritical marks from the block (U+0300–U+036F). For instance, under Normalization Form D (NFD), precomposed forms like U+1E02 (Ḃ) decompose to U+0042 (B) followed by U+0307 (combining dot above), allowing interoperability with systems that prefer separate components for processing. In contrast, Normalization Form C () recomposes these into the original precomposed characters, preserving the block's intended single-code-point representations and ensuring round-trip fidelity in storage and transmission. This dual compatibility supports migration from older encodings while maintaining semantic equivalence in modern -aware environments. Input methods for the block vary by language but generally leverage keyboard layouts with dead keys or mnemonic conventions to generate code points efficiently. For Irish Gaelic, standard Windows and macOS layouts incorporate dead keys—such as the dot above key—to produce characters like U+1E02 (Ḃ) by sequencing the modifier before the base letter B, facilitating direct Unicode input without requiring custom software. Similarly, dedicated Gaelic keyboards extend this mechanism to cover the full range of lenited and dotted forms used in traditional orthography. For Vietnamese, input method editors (IMEs) like those supporting Telex and VIQR schemes map alphabetic sequences to precomposed tones and diacritics in the block, such as typing "ax" in Telex to yield U+1EA3 (ả). These methods, integrated into operating systems via libraries like IBUS or Windows IME, convert user input on-the-fly to Unicode, with Telex emphasizing letter-based shortcuts (e.g., 'w' for horns) and VIQR using ASCII approximations for broader compatibility. Implementation challenges arise primarily from legacy systems, where partial overlaps with code pages like Windows-1258 limit support to a subset of characters in the block, excluding rarer historical or extended forms. This can result in or substitution errors during conversion, as Windows-1258 prioritizes single-byte mappings for common tones but omits full handling for the block's 256 entries. Modern environments, however, provide complete coverage, with libraries in languages like and offering built-in normalization and rendering to mitigate these issues, ensuring the block's characters display correctly across platforms without data loss.

Reference Materials

Full Character Chart

The Latin Extended Additional Unicode block encompasses 256 code points in the range U+1E00 to U+1EFF, of which 224 are assigned characters, designed to support precomposed Latin letters with diacritical marks for various orthographies and transliterations. All characters in this block are assigned the bidirectional class L (Left-to-Right), ensuring consistent rendering in left-to-right text flows. The characters are primarily uppercase and lowercase letters modified with diacritics such as dots, rings, macrons, acutes, graves, and cedillas, facilitating accurate representation in languages like , Latvian, Lithuanian, , and scholarly transliterations of non-Latin scripts. The following tables present the full chart, grouped into subranges for clarity, with columns for the hexadecimal code point, glyph (rendered in a monospaced font for precision), official character name, and a brief category denoting the primary linguistic or typographic function (e.g., "Accented Uppercase" for diacritic-modified capital letters used in specific orthographies). Data verified as of Unicode 17.0.

U+1E00–U+1E3F: Accented Letters (A–G Variants)

Code PointGlyphNameCategory
U+1E00LATIN CAPITAL LETTER A WITH RING BELOWAccented Uppercase
U+1E01LATIN SMALL LETTER A WITH RING BELOWAccented Lowercase
U+1E02LATIN CAPITAL LETTER B WITH DOT ABOVEAccented Uppercase
U+1E03LATIN SMALL LETTER B WITH DOT ABOVEAccented Lowercase
U+1E04LATIN CAPITAL LETTER B WITH DOT BELOWAccented Uppercase
U+1E05LATIN SMALL LETTER B WITH DOT BELOWAccented Lowercase
U+1E06LATIN CAPITAL LETTER B WITH LINE BELOWAccented Uppercase
U+1E07LATIN SMALL LETTER B WITH LINE BELOWAccented Lowercase
U+1E08LATIN CAPITAL LETTER C WITH CEDILLA AND ACUTEAccented Uppercase
U+1E09LATIN SMALL LETTER C WITH CEDILLA AND ACUTEAccented Lowercase
U+1E0ALATIN CAPITAL LETTER D WITH DOT ABOVEAccented Uppercase
U+1E0BLATIN SMALL LETTER D WITH DOT ABOVEAccented Lowercase
U+1E0CLATIN CAPITAL LETTER D WITH DOT BELOWAccented Uppercase
U+1E0DLATIN SMALL LETTER D WITH DOT BELOWAccented Lowercase
U+1E0ELATIN CAPITAL LETTER D WITH LINE BELOWAccented Uppercase
U+1E0FLATIN SMALL LETTER D WITH LINE BELOWAccented Lowercase
U+1E10LATIN CAPITAL LETTER D WITH CEDILLAAccented Uppercase
U+1E11LATIN SMALL LETTER D WITH CEDILLAAccented Lowercase
U+1E12LATIN CAPITAL LETTER D WITH CEDILLA AND ACUTEAccented Uppercase
U+1E13LATIN SMALL LETTER D WITH CEDILLA AND ACUTEAccented Lowercase
U+1E14LATIN CAPITAL LETTER E WITH MACRON AND GRAVEAccented Uppercase
U+1E15LATIN SMALL LETTER E WITH MACRON AND GRAVEAccented Lowercase
U+1E16LATIN CAPITAL LETTER E WITH MACRON AND ACUTEAccented Uppercase
U+1E17LATIN SMALL LETTER E WITH MACRON AND ACUTEAccented Lowercase
U+1E18LATIN CAPITAL LETTER E WITH CIRCUMFLEX AND DOT BELOWAccented Uppercase
U+1E19LATIN SMALL LETTER E WITH CIRCUMFLEX AND DOT BELOWAccented Lowercase
U+1E1ALATIN CAPITAL LETTER E WITH MACRON AND DOT ABOVEAccented Uppercase
U+1E1BLATIN SMALL LETTER E WITH MACRON AND DOT ABOVEAccented Lowercase
U+1E1CLATIN CAPITAL LETTER E WITH MACRON AND DOT BELOWAccented Uppercase
U+1E1DLATIN SMALL LETTER E WITH MACRON AND DOT BELOWAccented Lowercase
U+1E1ELATIN CAPITAL LETTER F WITH DOT ABOVEAccented Uppercase
U+1E1FLATIN SMALL LETTER F WITH DOT ABOVEAccented Lowercase
U+1E20ǴLATIN CAPITAL LETTER G WITH MACRONAccented Uppercase
U+1E21ǵLATIN SMALL LETTER G WITH MACRONAccented Lowercase
U+1E22LATIN CAPITAL LETTER H WITH DOT ABOVEAccented Uppercase
U+1E23LATIN SMALL LETTER H WITH DOT ABOVEAccented Lowercase
U+1E24LATIN CAPITAL LETTER H WITH DOT BELOWAccented Uppercase
U+1E25LATIN SMALL LETTER H WITH DOT BELOWAccented Lowercase
U+1E26LATIN CAPITAL LETTER H WITH DIAERESISAccented Uppercase
U+1E27LATIN SMALL LETTER H WITH DIAERESISAccented Lowercase
U+1E28LATIN CAPITAL LETTER H WITH CEDILLAAccented Uppercase
U+1E29LATIN SMALL LETTER H WITH CEDILLAAccented Lowercase
U+1E2ALATIN CAPITAL LETTER K WITH CEDILLAAccented Uppercase
U+1E2BLATIN SMALL LETTER K WITH CEDILLAAccented Lowercase
U+1E2CLATIN CAPITAL LETTER K WITH LINE BELOWAccented Uppercase
U+1E2DLATIN SMALL LETTER K WITH LINE BELOWAccented Lowercase
U+1E2ELATIN CAPITAL LETTER L WITH DOT BELOWAccented Uppercase
U+1E2FLATIN SMALL LETTER L WITH DOT BELOWAccented Lowercase
U+1E30LATIN CAPITAL LETTER K WITH ACUTEAccented Uppercase
U+1E31LATIN SMALL LETTER K WITH ACUTEAccented Lowercase
U+1E32LATIN CAPITAL LETTER L WITH MIDDLE DOTAccented Uppercase
U+1E33LATIN SMALL LETTER L WITH MIDDLE DOTAccented Lowercase
U+1E34LATIN CAPITAL LETTER L WITH LINE BELOWAccented Uppercase
U+1E35LATIN SMALL LETTER L WITH LINE BELOWAccented Lowercase
U+1E36LATIN CAPITAL LETTER L WITH CIRCUMFLEXAccented Uppercase
U+1E37LATIN SMALL LETTER L WITH CIRCUMFLEXAccented Lowercase
U+1E38LATIN CAPITAL LETTER L WITH CEDILLAAccented Uppercase
U+1E39LATIN SMALL LETTER L WITH CEDILLAAccented Lowercase
U+1E3ALATIN CAPITAL LETTER L WITH LINE BELOWAccented Uppercase
U+1E3BLATIN SMALL LETTER L WITH LINE BELOWAccented Lowercase
U+1E3CLATIN CAPITAL LETTER L WITH LINE BELOWAccented Uppercase
U+1E3DLATIN SMALL LETTER L WITH LINE BELOWAccented Lowercase
U+1E3ELATIN CAPITAL LETTER M WITH ACUTEAccented Uppercase
U+1E3FḿLATIN SMALL LETTER M WITH ACUTEAccented Lowercase

U+1E40–U+1E7F: Accented Letters (M–Z and Additional Variants)

Code PointGlyphNameCategory
U+1E40LATIN CAPITAL LETTER M WITH DOT ABOVEAccented Uppercase
U+1E41LATIN SMALL LETTER M WITH DOT ABOVEAccented Lowercase
U+1E42LATIN CAPITAL LETTER M WITH DOT BELOWAccented Uppercase
U+1E43LATIN SMALL LETTER M WITH DOT BELOWAccented Lowercase
U+1E44LATIN CAPITAL LETTER N WITH DOT ABOVEAccented Uppercase
U+1E45LATIN SMALL LETTER N WITH DOT ABOVEAccented Lowercase
U+1E46LATIN CAPITAL LETTER N WITH DOT BELOWAccented Uppercase
U+1E47LATIN SMALL LETTER N WITH DOT BELOWAccented Lowercase
U+1E48LATIN CAPITAL LETTER N WITH LINE BELOWAccented Uppercase
U+1E49LATIN SMALL LETTER N WITH LINE BELOWAccented Lowercase
U+1E4ALATIN CAPITAL LETTER N WITH CIRCUMFLEXAccented Uppercase
U+1E4BLATIN SMALL LETTER N WITH CIRCUMFLEXAccented Lowercase
U+1E4CLATIN CAPITAL LETTER O WITH TILDE AND ACUTEAccented Uppercase
U+1E4DLATIN SMALL LETTER O WITH TILDE AND ACUTEAccented Lowercase
U+1E4ELATIN CAPITAL LETTER O WITH TILDE AND DIAERESISAccented Uppercase
U+1E4FLATIN SMALL LETTER O WITH TILDE AND DIAERESISAccented Lowercase
U+1E50LATIN CAPITAL LETTER O WITH MACRON AND GRAVEAccented Uppercase
U+1E51LATIN SMALL LETTER O WITH MACRON AND GRAVEAccented Lowercase
U+1E52LATIN CAPITAL LETTER O WITH MACRON AND ACUTEAccented Uppercase
U+1E53LATIN SMALL LETTER O WITH MACRON AND ACUTEAccented Lowercase
U+1E54LATIN CAPITAL LETTER P WITH ACUTEAccented Uppercase
U+1E55LATIN SMALL LETTER P WITH ACUTEAccented Lowercase
U+1E56LATIN CAPITAL LETTER P WITH DOT ABOVEAccented Uppercase
U+1E57LATIN SMALL LETTER P WITH DOT ABOVEAccented Lowercase
U+1E58LATIN CAPITAL LETTER R WITH DOT ABOVEAccented Uppercase
U+1E59LATIN SMALL LETTER R WITH DOT ABOVEAccented Lowercase
U+1E5ALATIN CAPITAL LETTER R WITH DOT BELOWAccented Uppercase
U+1E5BLATIN SMALL LETTER R WITH DOT BELOWAccented Lowercase
U+1E5CLATIN CAPITAL LETTER R WITH DOT BELOW AND MACRONAccented Uppercase
U+1E5DLATIN SMALL LETTER R WITH DOT BELOW AND MACRONAccented Lowercase
U+1E5ELATIN CAPITAL LETTER R WITH LINE BELOWAccented Uppercase
U+1E5FLATIN SMALL LETTER R WITH LINE BELOWAccented Lowercase
U+1E60LATIN CAPITAL LETTER S WITH DOT ABOVEAccented Uppercase
U+1E61LATIN SMALL LETTER S WITH DOT ABOVEAccented Lowercase
U+1E62LATIN CAPITAL LETTER S WITH DOT BELOWAccented Uppercase
U+1E63LATIN SMALL LETTER S WITH DOT BELOWAccented Lowercase
U+1E64LATIN CAPITAL LETTER S WITH ACUTE AND DOT ABOVEAccented Uppercase
U+1E65LATIN SMALL LETTER S WITH ACUTE AND DOT ABOVEAccented Lowercase
U+1E66LATIN CAPITAL LETTER S WITH CARON AND DOT ABOVEAccented Uppercase
U+1E67LATIN SMALL LETTER S WITH CARON AND DOT ABOVEAccented Lowercase
U+1E68LATIN CAPITAL LETTER S WITH DOT BELOW AND DOT ABOVEAccented Uppercase
U+1E69LATIN SMALL LETTER S WITH DOT BELOW AND DOT ABOVEAccented Lowercase
U+1E6ALATIN CAPITAL LETTER T WITH DOT ABOVEAccented Uppercase
U+1E6BLATIN SMALL LETTER T WITH DOT ABOVEAccented Lowercase
U+1E6CLATIN CAPITAL LETTER T WITH DOT BELOWAccented Uppercase
U+1E6DLATIN SMALL LETTER T WITH DOT BELOWAccented Lowercase
U+1E6ELATIN CAPITAL LETTER T WITH LINE BELOWAccented Uppercase
U+1E6FLATIN SMALL LETTER T WITH LINE BELOWAccented Lowercase
U+1E70LATIN CAPITAL LETTER T WITH CIRCUMFLEXAccented Uppercase
U+1E71LATIN SMALL LETTER T WITH CIRCUMFLEXAccented Lowercase
U+1E72LATIN CAPITAL LETTER U WITH DIAERESIS BELOWAccented Uppercase
U+1E73LATIN SMALL LETTER U WITH DIAERESIS BELOWAccented Lowercase
U+1E74LATIN CAPITAL LETTER U WITH TILDE BELOWAccented Uppercase
U+1E75LATIN SMALL LETTER U WITH TILDE BELOWAccented Lowercase
U+1E76LATIN CAPITAL LETTER U WITH CIRCUMFLEX AND DOT BELOWAccented Uppercase
U+1E77LATIN SMALL LETTER U WITH CIRCUMFLEX AND DOT BELOWAccented Lowercase
U+1E78LATIN CAPITAL LETTER U WITH MACRON AND DIAERESISAccented Uppercase
U+1E79LATIN SMALL LETTER U WITH MACRON AND DIAERESISAccented Lowercase
U+1E7ALATIN CAPITAL LETTER U WITH MACRON AND DIAERESIS AND ACUTEAccented Uppercase
U+1E7BLATIN SMALL LETTER U WITH MACRON AND DIAERESIS AND ACUTEAccented Lowercase
U+1E7CLATIN CAPITAL LETTER V WITH TILDEAccented Uppercase
U+1E7DLATIN SMALL LETTER V WITH TILDEAccented Lowercase
U+1E7ELATIN CAPITAL LETTER V WITH DOT BELOWAccented Uppercase
U+1E7FṿLATIN SMALL LETTER V WITH DOT BELOWAccented Lowercase

U+1E80–U+1EBF: Additional Accented Letters and Vietnamese-Specific

Code PointGlyphNameCategory
U+1E80LATIN CAPITAL LETTER W WITH GRAVEAccented Uppercase
U+1E81LATIN SMALL LETTER W WITH GRAVEAccented Lowercase
U+1E82LATIN CAPITAL LETTER W WITH ACUTEAccented Uppercase
U+1E83LATIN SMALL LETTER W WITH ACUTEAccented Lowercase
U+1E84LATIN CAPITAL LETTER W WITH DIAERESISAccented Uppercase
U+1E85LATIN SMALL LETTER W WITH DIAERESISAccented Lowercase
U+1E86LATIN CAPITAL LETTER W WITH DOT ABOVEAccented Uppercase
U+1E87LATIN SMALL LETTER W WITH DOT ABOVEAccented Lowercase
U+1E88LATIN CAPITAL LETTER W WITH DOT BELOWAccented Uppercase
U+1E89LATIN SMALL LETTER W WITH DOT BELOWAccented Lowercase
U+1E8ALATIN CAPITAL LETTER X WITH DOT ABOVEAccented Uppercase
U+1E8BLATIN SMALL LETTER X WITH DOT ABOVEAccented Lowercase
U+1E8CLATIN CAPITAL LETTER X WITH DIAERESISAccented Uppercase
U+1E8DLATIN SMALL LETTER X WITH DIAERESISAccented Lowercase
U+1E8ELATIN CAPITAL LETTER Y WITH DOT ABOVEAccented Uppercase
U+1E8FLATIN SMALL LETTER Y WITH DOT ABOVEAccented Lowercase
U+1E90LATIN CAPITAL LETTER Z WITH CIRCUMFLEXAccented Uppercase
U+1E91LATIN SMALL LETTER Z WITH CIRCUMFLEXAccented Lowercase
U+1E92LATIN CAPITAL LETTER Z WITH DOT BELOWAccented Uppercase
U+1E93LATIN SMALL LETTER Z WITH DOT BELOWAccented Lowercase
U+1E94LATIN CAPITAL LETTER Z WITH LINE BELOWAccented Uppercase
U+1E95LATIN SMALL LETTER Z WITH LINE BELOWAccented Lowercase
U+1E96LATIN SMALL LETTER H WITH LINE BELOWLegacy Variant
U+1E97LATIN SMALL LETTER T WITH DIAERESISLegacy Variant
U+1E98LATIN SMALL LETTER W WITH RING ABOVELegacy Variant
U+1E99LATIN SMALL LETTER Y WITH RING ABOVELegacy Variant
U+1E9ALATIN SMALL LETTER A WITH RIGHT HALF RINGLegacy Variant
U+1E9BLATIN SMALL LETTER LONG S WITH DOT ABOVELegacy Variant
U+1E9C<reserved>Unassigned
U+1E9D<reserved>Unassigned
U+1E9ELATIN CAPITAL LETTER SHARP SGerman Uppercase
U+1E9F<reserved>Unassigned
U+1EA0LATIN CAPITAL LETTER A WITH DOT BELOWVietnamese Uppercase
U+1EA1LATIN SMALL LETTER A WITH DOT BELOWVietnamese Lowercase
U+1EA2LATIN CAPITAL LETTER A WITH HOOK ABOVEVietnamese Uppercase
U+1EA3LATIN SMALL LETTER A WITH HOOK ABOVEVietnamese Lowercase
U+1EA4LATIN CAPITAL LETTER A WITH CIRCUMFLEX AND ACUTEVietnamese Uppercase
U+1EA5LATIN SMALL LETTER A WITH CIRCUMFLEX AND ACUTEVietnamese Lowercase
U+1EA6LATIN CAPITAL LETTER A WITH CIRCUMFLEX AND GRAVEVietnamese Uppercase
U+1EA7LATIN SMALL LETTER A WITH CIRCUMFLEX AND GRAVEVietnamese Lowercase
U+1EA8LATIN CAPITAL LETTER A WITH CIRCUMFLEX AND HOOK ABOVEVietnamese Uppercase
U+1EA9LATIN SMALL LETTER A WITH CIRCUMFLEX AND HOOK ABOVEVietnamese Lowercase
U+1EAALATIN CAPITAL LETTER A WITH CIRCUMFLEX AND TILDEVietnamese Uppercase
U+1EABLATIN SMALL LETTER A WITH CIRCUMFLEX AND TILDEVietnamese Lowercase
U+1EACLATIN CAPITAL LETTER A WITH CIRCUMFLEX AND DOT BELOWVietnamese Uppercase
U+1EADLATIN SMALL LETTER A WITH CIRCUMFLEX AND DOT BELOWVietnamese Lowercase
U+1EAELATIN CAPITAL LETTER A WITH BREVE AND ACUTEVietnamese Uppercase
U+1EAFLATIN SMALL LETTER A WITH BREVE AND ACUTEVietnamese Lowercase
U+1EB0LATIN CAPITAL LETTER A WITH BREVE AND GRAVEVietnamese Uppercase
U+1EB1LATIN SMALL LETTER A WITH BREVE AND GRAVEVietnamese Lowercase
U+1EB2LATIN CAPITAL LETTER A WITH BREVE AND HOOK ABOVEVietnamese Uppercase
U+1EB3LATIN SMALL LETTER A WITH BREVE AND HOOK ABOVEVietnamese Lowercase
U+1EB4LATIN CAPITAL LETTER A WITH BREVE AND TILDEVietnamese Uppercase
U+1EB5LATIN SMALL LETTER A WITH BREVE AND TILDEVietnamese Lowercase
U+1EB6LATIN CAPITAL LETTER A WITH BREVE AND DOT BELOWVietnamese Uppercase
U+1EB7LATIN SMALL LETTER A WITH BREVE AND DOT BELOWVietnamese Lowercase
U+1EB8LATIN CAPITAL LETTER E WITH DOT BELOWVietnamese Uppercase
U+1EB9LATIN SMALL LETTER E WITH DOT BELOWVietnamese Lowercase
U+1EBALATIN CAPITAL LETTER E WITH HOOK ABOVEVietnamese Uppercase
U+1EBBLATIN SMALL LETTER E WITH HOOK ABOVEVietnamese Lowercase
U+1EBCLATIN CAPITAL LETTER E WITH TILDEVietnamese Uppercase
U+1EBDLATIN SMALL LETTER E WITH TILDEVietnamese Lowercase
U+1EBELATIN CAPITAL LETTER E WITH CIRCUMFLEX AND ACUTEVietnamese Uppercase
U+1EBFếLATIN SMALL LETTER E WITH CIRCUMFLEX AND ACUTEVietnamese Lowercase

U+1EC0–U+1EFF: Vietnamese Tones and Legacy Forms

Code PointGlyphNameCategory
U+1EC0LATIN CAPITAL LETTER E WITH DOT BELOWVietnamese Uppercase
U+1EC1LATIN SMALL LETTER E WITH DOT BELOWVietnamese Lowercase
U+1EC2LATIN CAPITAL LETTER E WITH HOOK ABOVEVietnamese Uppercase
U+1EC3LATIN SMALL LETTER E WITH HOOK ABOVEVietnamese Lowercase
U+1EC4LATIN CAPITAL LETTER E WITH CIRCUMFLEX AND ACUTEVietnamese Uppercase
U+1EC5ếLATIN SMALL LETTER E WITH CIRCUMFLEX AND ACUTEVietnamese Lowercase
U+1EC6LATIN CAPITAL LETTER E WITH CIRCUMFLEX AND GRAVEVietnamese Uppercase
U+1EC7LATIN SMALL LETTER E WITH CIRCUMFLEX AND GRAVEVietnamese Lowercase
U+1EC8LATIN CAPITAL LETTER E WITH CIRCUMFLEX AND TILDEVietnamese Uppercase
U+1EC9LATIN SMALL LETTER E WITH CIRCUMFLEX AND TILDEVietnamese Lowercase
U+1ECALATIN CAPITAL LETTER E WITH CIRCUMFLEX AND DOT BELOWVietnamese Uppercase
U+1ECBLATIN SMALL LETTER E WITH CIRCUMFLEX AND DOT BELOWVietnamese Lowercase
U+1ECCLATIN CAPITAL LETTER I WITH HOOK ABOVEVietnamese Uppercase
U+1ECDLATIN SMALL LETTER I WITH HOOK ABOVEVietnamese Lowercase
U+1ECELATIN CAPITAL LETTER O WITH DOT BELOWVietnamese Uppercase
U+1ECFLATIN SMALL LETTER O WITH DOT BELOWVietnamese Lowercase
U+1ED0LATIN CAPITAL LETTER O WITH HOOK ABOVEVietnamese Uppercase
U+1ED1LATIN SMALL LETTER O WITH HOOK ABOVEVietnamese Lowercase
U+1ED2LATIN CAPITAL LETTER O WITH CIRCUMFLEX AND ACUTEVietnamese Uppercase
U+1ED3LATIN SMALL LETTER O WITH CIRCUMFLEX AND ACUTEVietnamese Lowercase
U+1ED4LATIN CAPITAL LETTER O WITH CIRCUMFLEX AND GRAVEVietnamese Uppercase
U+1ED5LATIN SMALL LETTER O WITH CIRCUMFLEX AND GRAVEVietnamese Lowercase
U+1ED6LATIN CAPITAL LETTER O WITH CIRCUMFLEX AND HOOK ABOVEVietnamese Uppercase
U+1ED7LATIN SMALL LETTER O WITH CIRCUMFLEX AND HOOK ABOVEVietnamese Lowercase
U+1ED8LATIN CAPITAL LETTER O WITH CIRCUMFLEX AND TILDEVietnamese Uppercase
U+1ED9LATIN SMALL LETTER O WITH CIRCUMFLEX AND TILDEVietnamese Lowercase
U+1EDALATIN CAPITAL LETTER O WITH CIRCUMFLEX AND DOT BELOWVietnamese Uppercase
U+1EDBLATIN SMALL LETTER O WITH CIRCUMFLEX AND DOT BELOWVietnamese Lowercase
U+1EDCLATIN CAPITAL LETTER O WITH HORN AND ACUTEVietnamese Uppercase
U+1EDDLATIN SMALL LETTER O WITH HORN AND ACUTEVietnamese Lowercase
U+1EDELATIN CAPITAL LETTER O WITH HORN AND GRAVEVietnamese Uppercase
U+1EDFLATIN SMALL LETTER O WITH HORN AND GRAVEVietnamese Lowercase
U+1EE0LATIN CAPITAL LETTER O WITH HORN AND HOOK ABOVEVietnamese Uppercase
U+1EE1LATIN SMALL LETTER O WITH HORN AND HOOK ABOVEVietnamese Lowercase
U+1EE2LATIN CAPITAL LETTER O WITH HORN AND TILDEVietnamese Uppercase
U+1EE3LATIN SMALL LETTER O WITH HORN AND TILDEVietnamese Lowercase
U+1EE4LATIN CAPITAL LETTER O WITH HORN AND DOT BELOWVietnamese Uppercase
U+1EE5LATIN SMALL LETTER O WITH HORN AND DOT BELOWVietnamese Lowercase
U+1EE6LATIN CAPITAL LETTER U WITH HORN AND ACUTEVietnamese Uppercase
U+1EE7LATIN SMALL LETTER U WITH HORN AND ACUTEVietnamese Lowercase
U+1EE8LATIN CAPITAL LETTER U WITH HORN AND GRAVEVietnamese Uppercase
U+1EE9LATIN SMALL LETTER U WITH HORN AND GRAVEVietnamese Lowercase
U+1EEALATIN CAPITAL LETTER U WITH HORN AND HOOK ABOVEVietnamese Uppercase
U+1EEBLATIN SMALL LETTER U WITH HORN AND HOOK ABOVEVietnamese Lowercase
U+1EECLATIN CAPITAL LETTER U WITH HORN AND TILDEVietnamese Uppercase
U+1EEDLATIN SMALL LETTER U WITH HORN AND TILDEVietnamese Lowercase
U+1EEELATIN CAPITAL LETTER U WITH HORN AND DOT BELOWVietnamese Uppercase
U+1EEFLATIN SMALL LETTER U WITH HORN AND DOT BELOWVietnamese Lowercase
U+1EF0LATIN CAPITAL LETTER Y WITH GRAVEVietnamese Uppercase
U+1EF1LATIN SMALL LETTER Y WITH GRAVEVietnamese Lowercase
U+1EF2LATIN CAPITAL LETTER Y WITH HOOK ABOVEVietnamese Uppercase
U+1EF3LATIN SMALL LETTER Y WITH HOOK ABOVEVietnamese Lowercase
U+1EF4LATIN CAPITAL LETTER Y WITH DOT BELOWVietnamese Uppercase
U+1EF5LATIN SMALL LETTER Y WITH DOT BELOWVietnamese Lowercase
U+1EF6LATIN CAPITAL LETTER Y WITH TILDEVietnamese Uppercase
U+1EF7LATIN SMALL LETTER Y WITH TILDEVietnamese Lowercase
U+1EF8LATIN CAPITAL LETTER Y WITH ACUTEVietnamese Uppercase
U+1EF9LATIN SMALL LETTER Y WITH ACUTEVietnamese Lowercase
U+1EFA<reserved>Unassigned
U+1EFB<reserved>Unassigned
U+1EFC<reserved>Unassigned
U+1EFD<reserved>Unassigned
U+1EFE<reserved>Unassigned
U+1EFF<reserved>Unassigned
Note: Unassigned code points are marked as <reserved> and have no glyph or category. All data is derived directly from the Unicode Standard, Version 17.0 (released September 2025). For subranges with multiple similar forms, refer to the official chart for visual precision.

Compact Listing

The Latin Extended Additional block (U+1E00–U+1EFF) provides 256 code points for extended Latin characters, primarily precomposed forms with diacritics for linguistic and typographic needs, many of which decompose canonically to a base letter plus a combining mark (e.g., U+1E02 Ḃ decomposes to B + COMBINING DOT ABOVE). This compact listing organizes characters alphabetically by base letter within diacritic categories for quick reference, including assigned code points only (224 total, excluding 32 unassigned), with counts per category to highlight distribution. It serves developers and typographers for font implementation, normalization, and collation, where aliases like NFKC compatibility decompositions may apply for legacy systems. Characters are grouped by primary diacritic type, then by base letter (A–Z, a–z pairs where applicable), showing , , and abbreviated name derived from official nomenclature.

Dot Above (52 characters: 26 uppercase, 26 lowercase)

These include letters like B, D, F, etc., with a single dot above, used in and other orthographies.
Code PointGlyphShort Name
1E02B DOT ABOVE
1E03b DOT ABOVE
1E0AD DOT ABOVE
1E0Bd DOT ABOVE
1E1EF DOT ABOVE
1E1Ff DOT ABOVE
......(20 more pairs)
Decompositions: e.g., Ḃ = B + U+0307 COMBINING ABOVE.

Dot Below (48 characters: 24 uppercase, 24 lowercase)

Common in and languages, applied to vowels and consonants like A, B, D.
Code PointGlyphShort Name
1E0CD BELOW
1E0Dd BELOW
1EA0A BELOW
1EA1a BELOW
1EACA BELOW
1EADa BELOW
......(18 more pairs)
Decompositions: e.g., Ḍ = D + U+0323 COMBINING BELOW.

Macron (20 characters: 10 uppercase, 10 lowercase)

For length marking in and other scripts, on letters like G, O, U.
Code PointGlyphShort Name
1E20ǴG
1E21ǵg
1E14E
1E15e
1E44N ABOVE
1E45n ABOVE
......(7 more pairs)
Decompositions: e.g., Ǵ = G + U+0304 COMBINING MACRON.

Acute Accent (16 characters: 8 uppercase, 8 lowercase, often combined)

Seen in Irish and Vietnamese, e.g., on K, P, R.
Code PointGlyphShort Name
1E30K ACUTE
1E31k ACUTE
1E54P ACUTE
1E55p ACUTE
1E58R DOT ABOVE
1E59r DOT ABOVE
......(3 more pairs)
Decompositions: e.g., Ḱ = K + U+0301 COMBINING ACUTE ACCENT.

Tilde (28 characters: 14 uppercase, 14 lowercase)

For nasalization in Portuguese and Vietnamese, on A, E, O, etc.
Code PointGlyphShort Name
1E74U TILDE BELOW
1E75u TILDE BELOW
1E4CO TILDE ACUTE
1E4Do TILDE ACUTE
1E7CV TILDE
1E7Dv TILDE
......(9 more pairs)
Decompositions: e.g., Ṽ = V + U+0303 COMBINING .

Other Diacritics (e.g., , , Ring Below; 60 characters total)

Includes diverse forms like ring below (4), stroke (8), double acute (4), and hooks/horns (32+). Examples:
Code PointGlyphShort Name
1E00A BELOW
1E01a BELOW
1E24H DOT BELOW
1E25h DOT BELOW
1EA6A
1EA7a
1ED2O ACUTE
1ED3o ACUTE
......(52 more)
Decompositions: e.g., Ḁ = A + U+0325 COMBINING RING BELOW; many forms combine multiple marks (e.g., Ầ = A + U+0302 + U+0300). Unassigned code points (e.g., 1E9C–1E9D, 1E9F, 1EFA–1EFF) reserve space for future extensions. For full visual charts and exact decompositions, refer to the official data files.

References

  1. [1]
    [PDF] Latin Extended Additional - The Unicode Standard, Version 17.0
    204. 1EFF. Latin Extended Additional. 1E00. 1E0 1E1 1E2 1E3 1E4 1E5 1E6 1E7 1E8 1E9 1EA 1EB 1EC 1ED 1EE 1EF. Ḁ ḁ. Ḃ ḃ. Ḅ ḅ. Ḇ ḇ. Ḉ ḉ. Ḋ ḋ. Ḍ ḍ. Ḏ ḏ. Ḑ ḑ. Ḓ ḓ. Ḕ.
  2. [2]
    Chapter 7 – Unicode 17.0.0
    7 Latin Extended Additional: U+1E00–U+1EFF. The characters in this block are mostly precomposed combinations of Latin letters with one or more general ...
  3. [3]
    [PDF] Block Names - Unicode
    The Unicode Standard, Version 1.1. Appendix E Block Names. Start Stop ... LATIN EXTENDED ADDITIONAL. 1E00 1EFF. 1F00 1FFF. GREEK EXTENDED. 2000 206F. 2070 ...
  4. [4]
    Unicode 17.0 Character Code Charts
    Latin Extended Additional · Latin Ligatures · Fullwidth Latin Letters · IPA Extensions · Phonetic Extensions · Phonetic Extensions Supplement · Linear A. Linear ...Help and Links · Name Index · Unihan Database Lookup
  5. [5]
    Latin Extended Additional - Unicode
    Latin Extended Additional ; Latin general use extensions ; 1E00, Ḁ, Latin Capital Letter A With Ring Below ; ≡ ; ↓ ; 1E01, ḁ, Latin Small Letter A With Ring Below.
  6. [6]
    Unicode 17.0.0
    Sep 9, 2025 · This page summarizes the important changes for the Unicode Standard, Version 17.0.0. This version supersedes all previous versions of the Unicode Standard.
  7. [7]
    Unicode Blocks - Compart
    List of Blocks ; U+0000 - U+007F. Basic Latin ; U+0080 - U+00FF. Latin-1 Supplement ; U+0100 - U+017F. Latin Extended-A ; U+0180 - U+024F. Latin Extended-B ; U+0250 ...U+024F Latin Extended-B 208 · U+017F Latin Extended-A 128 · Basic Latin
  8. [8]
    Unicode & Existing Vietnamese Character Encodings
    The Vietnamese alphabets are listed in several noncontiguous Unicode ranges: Basic Latin {U+0000..U+007F}, Latin-1 Supplement {U+0080..U+00FF}, Latin Extended-A ...Missing: support | Show results with:support
  9. [9]
    Vietnamese Writing System - CJVLang
    Modern Vietnamese is written with the Latin alphabet, known as quoc ngu (quốc ngữ) in Vietnamese. Quoc ngu consists of 29 letters.
  10. [10]
  11. [11]
    [PDF] ISO/IEC JTC1/SC2/WG2 N3027 L2/06-027 - Unicode
    Jan 30, 2006 · Accurate transcriptions of medieval texts allow scholars to quote medieval texts without distorting their graphemic content, and allow the ...
  12. [12]
  13. [13]
  14. [14]
    None
    Summary of each segment:
  15. [15]
  16. [16]
    ẛ U+1E9B LATIN SMALL LETTER LONG S ... - Unicode Explorer
    ẛ U+1E9B LATIN SMALL LETTER LONG S WITH DOT ABOVE, copy and paste, unicode character symbol info, in current use in Gaelic types (as glyph variant of 1E61)
  17. [17]
    [PDF] ISO/IEC JTC1/SC2/WG2 N2957 L2/05-183 - Unicode
    Aug 2, 2005 · Accurate transcriptions of medieval texts allow scholars to quote medieval texts without distorting their graphemic content, and allow the ...
  18. [18]
    [PDF] L2/07-157 - Unicode
    we are in favour of supporting the assignment of a codepoint for the letter “Capital Sharp-S”. (Latin Capital Letter Sharp S) to the character standard ISO ...<|control11|><|separator|>
  19. [19]
    Registered features, a-e (OpenType 1.9.1) - Typography
    Jul 6, 2024 · Tag: 'ccmp'​​ This feature permits such composition/decomposition. The feature should be processed as the first feature processed, and should be ...Tag: 'aalt' · Tag: 'apkn' · Tag: 'ccmp'
  20. [20]
    Noto Sans - Font Families - openSUSE
    ὕαλον ϕαγεῖν δύναμαι· τοῦτο οὔ με βλάπτει. ὕαλον ϕαγεῖν δύναμαι· τοῦτο οὔ με βλάπτει. ὕαλον ϕαγεῖν δύναμαι· τοῦτο οὔ με βλάπτει. ὕαλον ϕαγεῖν δύναμαι· τοῦτο ...
  21. [21]
    Font Support for Unicode Block 'Latin Extended Additional'
    This is a list of fonts that support characters in the Latin Extended Additional Unicode block. Detail. Font, Support. 100%, Arial · 100% (256 of 256).Missing: OpenType ccmp stacking Noto
  22. [22]
  23. [23]
    Gaelic Keyboards for MS-Windows
    A number of free layouts exist for keyboarding Gaelic. They are summarised in the following table, and individually described afterwards.
  24. [24]
    Vietnamese Unicode FAQs
    They typically support the three most common Vietnamese input methods: Telex, VNI, and VIQR. For them to work, the applications that they are to be used ...
  25. [25]
    Code Page Identifiers - Win32 apps - Microsoft Learn
    Jan 7, 2021 · For the most consistent results, applications should use Unicode, such as UTF-8 or UTF-16, instead of a specific code page. Expand table ...
  26. [26]
    Supported Encodings
    String classes, and classes in the java.nio.charset package can convert between Unicode and a number of other character encodings. The supported encodings vary ...
  27. [27]
    Latin Extended Additional - Unicode
    Latin Extended Additional, 1EFF. Ḁ 1E00, ḁ. 1E01, Ḃ 1E02, ḃ. 1E03, Ḅ 1E04, ḅ. 1E05, Ḇ 1E06, ḇ. 1E07, Ḉ 1E08, ḉ. 1E09, Ḋ 1E0A, ḋ. 1E0B, Ḍ 1E0C, ḍ. 1E0D, Ḏ 1E0E ...