Fact-checked by Grok 2 weeks ago

Arabic script

The Arabic script is an comprising 28 letters, primarily used to write the language and adapted for numerous others such as , , , and , characterized by its right-to-left flow and positional variations in letter forms. Originating from the Nabatean script around the , with additional influences from , it evolved through transitional "Nabateo-Arabic" phases and was standardized between the 5th and 6th centuries, as evidenced by early inscriptions like the Namāra epitaph from 328 . This script's development accelerated with the rise of in the , particularly through its role in transcribing the , transitioning from rudimentary Hijazi styles—marked by slanted, undotted letters on —to more angular forms in the and fluid, rounded Naskh scripts by the for improved readability. Key structural features include letters that connect in cursive chains, with most adopting 2 to 4 allographic shapes depending on their position (isolated, , medial, or final) in a word, and diacritical dots to differentiate similar forms, while short vowels are optionally indicated by diacritics and long vowels by specific letters. Lacking uppercase or lowercase distinctions, the script's flexibility allowed it to accommodate non-Semitic languages by adding letters or modifications, such as the four extra letters in . Historically, it served not only linguistic but also artistic and mystical purposes, with letters assigned numerical values linked to lunar stations and talismanic significance in Islamic tradition. In modern usage, the Arabic script remains the third-most widely employed globally, supporting over 400 million native speakers and additional users in regions from the to and , though contemporary texts often omit vowel diacritics for brevity. Its calligraphic styles, including the everyday Ruqʿa and ornate , continue to influence , digital fonts, and cultural expressions, while adaptations persist in languages like () and , reflecting its enduring adaptability across diverse linguistic and technological contexts.

History

Origins and early development

The Arabic script originated as a derivative of the Nabataean variant of the late Aramaic alphabet, which itself evolved from the earlier Phoenician script around the 4th century CE in the Arabian Peninsula and surrounding regions. This development occurred among Arab tribes in northern Arabia and the Levant, where the Nabataean kingdom (c. 320 BCE–106 CE) facilitated cultural and linguistic exchanges through trade routes connecting Petra, Syria, and the Hijaz. The script's emergence reflects a gradual adaptation of Aramaic's 22-letter consonantal system to accommodate Arabic phonetics, expanding to 28 letters by incorporating additional sounds unique to Arabic. Key evidence of proto-Arabic forms appears in early inscriptions, such as the from 328 , discovered near in modern-day . This funerary for the Lakhmid king Imru' al-Qays is written in the but employs language, marking it as one of the oldest dated attestations of written and demonstrating the script's transitional use for Arabic texts. Similarly, the Zabad inscription from 512 , found near , is a trilingual dedication in , , and on a church , with the Arabic portion showcasing emerging cursive tendencies and terms like "al-ilah" (the ), highlighting the script's role in pre-Islamic religious and communal contexts. These artifacts, primarily from funerary and dedicatory purposes, illustrate the script's initial application in northern Arabian and Syrian territories influenced by nomadic and settled Arab communities. The transition from angular, monumental forms to more fluid, cursive styles was driven by practical needs in trade, administration, and everyday writing on perishable materials like during the pre-Islamic era. Nabataean inscriptions, often chiseled in stone for durability, featured rigid lines suited to work, but as Arabic speakers adopted the script for broader commercial exchanges along caravan routes, ligatures and connections between letters began to appear, smoothing the forms for quicker inscription. This evolution is evident in the semi-cursive Nabataean examples from the 2nd century BCE onward, which prefigure the interconnected nature of mature Arabic writing. A representative example of letter evolution is the Arabic alif (ا), which developed from the Phoenician aleph (𐤀), an ox-head symbol simplified in Aramaic to a vertical stroke and further adapted in Nabataean to a slanted or hooked form before straightening in proto-Arabic by the 4th century CE. Such changes preserved the consonantal value while aligning with Arabic's phonetic requirements, laying the groundwork for the script's later refinements.

Spread and evolution in the Islamic era

The rise of in the 7th century facilitated the rapid dissemination of the Arabic script across vast territories through military conquests, transforming it from a regional into a vehicle for religious, administrative, and cultural expression. Following the death of Prophet Muhammad in 632 CE, Arab Muslim armies expanded into Persia by 651 CE under the and later the Umayyads, conquering the and introducing Arabic script for Quranic dissemination and governance. Similarly, conquests reached by the late 7th century, with Umayyad forces capturing in 642 CE and advancing westward to establish script use in official documents and Islamic texts, laying the groundwork for regional linguistic integrations. These expansions, extending to the and by the 8th century, prompted initial adaptations, such as the incorporation of Persian sounds into the script for administrative purposes in conquered regions. Under the (661–750 ), efforts to standardize the Arabic script intensified to support the empire's administrative needs and the accurate transmission of the . Caliph ʿAbd al-Malik (r. 685–705 ) played a pivotal role, commissioning the codification of the script between 684 and 692 , an angular, geometric style suited for monumental inscriptions like those on the in . This script, derived from earlier Hijazi forms, became the primary medium for Quranic manuscripts, emphasizing clarity and aesthetic rigidity without vowels or extensive diacritics in early versions. The (750–1258 ) further refined this standardization in the 8th and 9th centuries, with emerging as a center for script evolution; continued for religious texts while transitioning toward more styles to enhance legibility in expanding literary and bureaucratic contexts. To address ambiguities in consonant differentiation, particularly for non-Arabic speakers reciting the , diacritical marks known as i'jam were introduced around 684 CE by the grammarian Abu al-Aswad al-Du'ali (d. 688 CE), a companion of ibn Abi Talib. Commissioned by the Umayyad governor of , al-Du'ali devised a system of dots placed above, below, or beside letters to distinguish similar forms, such as ب (bāʾ), ت (tāʾ), ث (thāʾ), ن (nūn), ي (yāʾ), and ى (alif maqṣūrah), thereby preventing misreadings in sacred texts. This innovation, initially applied to Quranic codices, marked a crucial step in script legibility during the early Islamic expansions. Building on this foundation, vowel pointing or tashkil was innovated in the by the Basran scholar Khalil ibn Ahmad al-Farahidi (d. 786 CE) to indicate short vowels and phonetic nuances more precisely. Al-Farahidi replaced earlier colored dots with a refined set of symbols—such as fatḥah (a horizontal line for /a/), kasrah (a diagonal line for /i/), and ḍammah (a curl for /u/)—derived from letter shapes, enabling accurate recitation for diverse linguistic communities. His system, integrated into Quranic and grammatical works like , became standard by the 11th century, supporting the script's adaptability in and North African contexts where vowel systems differed from Arabic.

Modern standardization and reforms

The reforms in the from 1839 to 1876 marked a pivotal era for the Arabic script's adaptation to modern printing technologies, as the expansion of printing presses enabled the widespread dissemination of official edicts and educational materials in Arabic script, overcoming earlier restrictions on Muslim use of . This period also saw early proposals for orthographic simplification to address the script's complexities in representing Turkish phonetics, notably through Iranian intellectual Mirza Malkum Khan's 1860s reform plan, which advocated adding diacritics and new letters to enhance readability and literacy rates. These efforts laid groundwork for later script reforms by highlighting the need for amid growing . In post-colonial contexts, Egypt's 1920s intellectual and journalistic movements pushed for Arabic language consistency to support and print media expansion, culminating in the 1932 founding of the Academy of the Arabic Language (Majmaʿ al-Lughah al-ʿArabīyah) by King Fuad I, which focused on unifying , coining modern terms, and resolving ambiguities in usage. Similar initiatives occurred across Arab states, aiming to bridge with contemporary needs in and . Reforms in non-Arab countries diverged more radically; in 1928, under enacted Law No. 1353, mandating a switch from the Arabic script to a Latin-based to boost from around 10% to over 20% within a decade and align with Western modernization, effectively severing ties to Ottoman-Islamic traditions. In , Pahlavi's 1930s cultural policies included orthographic tweaks to the Perso-Arabic , such as standardizing short vowel markings and reducing optional ligatures through the 1935 Academy of Iran, to better suit without a full script change. Contemporary efforts have emphasized digital adaptation and cultural preservation; in the 1980s, advanced Arabic script standards for , with initiatives at institutions like developing early font encoding systems to accommodate the script's forms in word processors and databases, paving the way for global digital typography. Since the 2000s, has supported projects to safeguard endangered Arabic script variants, including of Ajami manuscripts—modified Arabic scripts for African languages like Wolof and —through partnerships like the British Library's Endangered Archives Programme, such as the pilot project EAP915, which identified and cataloged 807 endangered Arabic manuscripts in regions including . More recently, as of 2023, the EAP has digitized over 100,000 pages of Ajami materials across , including projects for and Wolof languages.

Core Features

Alphabet and basic letters

The Arabic script is fundamentally an , a that primarily represents consonants while vowels are either omitted in standard orthography or indicated through optional diacritics, setting it apart from full like the that include inherent vowel markers. This consonantal focus facilitates concise writing but requires contextual knowledge for accurate , particularly among native speakers. The standard comprises 28 letters, each with a distinct name and phonemic value, derived from the ancient Nabataean and scripts but refined over centuries. Unlike scripts with case distinctions, Arabic employs a unicase design, meaning there are no uppercase or lowercase forms; instead, letters adapt through four positional variants—initial (at the start of a word), medial (within a word), final (at the end of a word), and isolated (standalone or after a non-connecting letter)—to suit the cursive flow of connected writing. This positional flexibility is a core feature enabling the script's elegant, flowing appearance without altering the 's essential identity. The following table lists the 28 core letters in their isolated forms, along with their conventional names and approximate phonemic values, ordered from right to left as per Arabic reading direction. These phonemes represent standard pronunciations, though regional dialects may vary slightly.
Isolated FormNameIPA Phoneme
ا/ʔ/ or /aː/
بbāʾ/b/
تtāʾ/t/
ثthāʾ/θ/
جjīm/dʒ/
حḥāʾ/ħ/
خkhāʾ/x/
دdāl/d/
ذdhāl/ð/
رrāʾ/r/
زzāy/z/
سsīn/s/
شshīn/ʃ/
صṣād/sˤ/
ضḍād/dˤ/
طṭāʾ/tˤ/
ظẓāʾ/ðˤ/
عʿayn/ʕ/
غ/ɣ/
فfāʾ/f/
قqāf/q/
كkāf/k/
لlām/l/
مmīm/m/
نnūn/n/
هhāʾ/h/
وwāw/w/ or /uː/
يyāʾ/j/ or /iː/

Contextual forms and cursive nature

The Arabic script is fundamentally , with letters designed to connect to one another within words, creating a continuous flow even in printed text. This connectivity is a core feature that distinguishes it from many other writing systems, facilitating efficient writing and aesthetic harmony in . The script is written and read from right to left, with letters aligning along a that serves as the primary anchor for placement, ensuring uniform visual structure across words and lines. Letters in the Arabic script assume one of four contextual forms depending on their position within a word: isolated (when standing alone or not connecting), (at the beginning of a word, connecting only to the following letter), medial (in the middle, connecting to both preceding and following letters), and final (at the end of a word, connecting only to the preceding letter). These forms allow each of the 28 letters to adapt its dynamically, often resulting in significant visual differences from their isolated counterparts—for instance, the letter beh (ب) appears as ب in isolated form, ﺑ in , ﺒ in medial, and ﺐ in final. This positional variation is governed by standards for rendering, where font systems select the appropriate based on the letter's joining behavior and neighbors. The cursive nature is further defined by specific joining rules: 22 letters are dual-joining, capable of connecting to both the preceding (right) and following (left) letters, while 6 letters are right-joining only, connecting solely to the preceding letter and leaving a gap to the left. Examples of right-joining letters include dal (د), which maintains its isolated or final form (د or ﺩ) without linking leftward, and reh (ر), similarly non-connective to the left (ر or ﺭ). Non-joining elements, such as the hamza (ء), do not connect at all. These rules ensure readability by preventing ambiguous merges, particularly for letters with similar shapes. In , these principles combine to create fluid sequences; for example, the word كتاب (kitāb, meaning "") is rendered from right to left with the initial kāf (ك) connecting to the medial tāʾ (ﺘ), which connects to the final (ا); the beh (ب) appears isolated to the right of the alif since the right-joining alif terminates the connection without linking leftward. This example illustrates how dual-joining letters like kāf and tāʾ adapt across positions, while the right-joining alif prevents connection to the beh, maintaining the script's alignment and integrity.

Diacritics, vowels, and orthographic conventions

The Arabic script employs a system of diacritical marks known as harakat (حَرَكَاتْ) to indicate short vowels and other phonetic nuances, which are optional in most modern writing but essential for precise pronunciation. These marks are placed above or below consonants and include the fatha (ـَ), a short diagonal line above the letter representing the vowel sound /a/ as in "father"; the damma (ـُ), a small curl above the letter for /u/ as in "put"; and the kasra (ـِ), a short diagonal line below the letter for /i/ as in "bit". For example, the consonant ب (bāʾ) becomes بَ (ba), بُ (bu), and بِ (bi) respectively. Absence of a vowel is denoted by the sukun (ـْ), a small circle placed above the , indicating a consonant closure without a following short vowel, as in بْ (b, pronounced with a brief pause). The shadda (ـّ), resembling a small "w" above the letter, signifies or consonant doubling, where the is held longer and often stressed, effectively combining a sukun on the first instance and a vowel on the second; for instance, بّ (bb) in words like شَدَّة (shadda itself). Tanwin, or , marks indefinite nouns with a doubled short vowel sound at the end, using fathatan (ـً) for /an/, dammatan (ـٌ) for /un/, and kasratan (ـٍ) for /in/, as seen in كِتَابٌ (kitābun, "a "). These diacritics are encoded in as combining characters, such as U+064E for fatha and U+0651 for shadda, ensuring consistent digital representation. Orthographic conventions in Arabic script govern the placement and form of elements like the hamza (ء), a glottal stop consonant that requires specific seating rules based on its position and surrounding vowels to maintain visual and phonetic clarity. In initial position, hamza is always seated on an alif (ا), forming أ (with fatha or damma) or إ (with kasra), as in أَب (ab, "father"). Medially, it seats on the nearest compatible letter: on a dotless yā (ي) for kasra (ئ), on wāw (و) for damma (ؤ), or on alif otherwise, as in سُؤَال (suʾāl, "question"). At the end of a word, it appears on the line (ء) after a long vowel or sukun, or seated on the appropriate carrier after a short vowel, such as نَشَأَ (nashaʾa, "he arose"). These rules prevent ambiguity and align with the script's cursive flow, though pronunciation of hamza may vary by dialect. Tajweed rules such as assimilation (idgham) and clear pronunciation (izhar) apply to the recitation of tanwin or nun sakinah before certain letters—for example, idgham merges the sound before ي, ر, م, ل, و, ن, while izhar pronounces it clearly before throat letters like ء, ه, ع, ح, غ, خ—but these affect pronunciation only and do not alter the written orthography or diacritic placement. In modern usage, diacritics are fully employed in religious texts like the for accurate tajweed recitation and in educational materials for learners, where they aid comprehension and reduce ambiguity in a script that otherwise relies on for vowels. However, in everyday print , newspapers, and , they are largely omitted to save space and reflect native reader proficiency, appearing sporadically only for disambiguation in or proper names. Digital trends since the have seen increased optional use in and apps for clarity among non-native speakers, though full diacritization remains rare outside formal contexts; scholarly analyses note significant variation across genres, with religious and pedagogical texts showing near-complete marking rates compared to under 5% in general .

Styles and Variants

Calligraphic and typographic styles

The Arabic script has evolved through a rich tradition of calligraphic styles, each reflecting cultural, religious, and artistic influences across centuries. These styles originated in the early Islamic period and adapted to various media, from manuscripts to , emphasizing the script's inherent and contextual forms. Major styles like , Naskh, and emerged as foundational, balancing aesthetic elegance with readability, and later influenced typographic developments in and . Kufic, one of the earliest formal styles, developed in the late 7th century in Kufa, Iraq, characterized by its angular, geometric letterforms with thick strokes and minimal curves. It flourished from the 7th to 10th centuries, primarily used for Qur'anic manuscripts and architectural inscriptions due to its bold, monumental appearance suitable for stone, metal, and coinage. Examples include early Qur'ans on vellum and decorations on mosques like the Dome of the Rock in Jerusalem. Naskh emerged in the 10th century as a more fluid, cursive alternative to angular scripts, designed for legibility in everyday and scholarly writing. It became the predominant bookhand by the 11th century, serving as the basis for copying Qur'ans, administrative documents, and literature across the Islamic world, with regional variations in Egypt, Iraq, and Syria. Its rounded, proportional letters facilitated widespread adoption and laid the groundwork for modern printed Arabic. Nastaliq, a highly stylized and flowing script, originated in 14th-century Persia, blending elements of Naskh and Ta'liq for poetic expression. It gained dominance in the 15th century for Persian and Urdu literature, poetry, and official documents, prized for its diagonal slant, elongated horizontals, and rhythmic curves that evoke motion. Prominent in regions like Iran, Pakistan, and India, it remains a staple in South Asian manuscript traditions. The transition to typography began with the introduction of movable type for Arabic script in the early 19th century, notably at Egypt's Bulaq Press established in 1815 under Ottoman rule, where naskh and nasta'liq typefaces were cast for books like dictionaries and Qur'ans. In the Ottoman Empire proper, widespread adoption occurred in the 1860s with innovations by typefounder Ohanis Mühendisoğlu, who adapted calligraphic proportions to metal type for naskh-based printing. This evolution addressed the script's cursive joining and contextual variants, enabling mass production of texts. By the 20th century, refined typefaces like that designed by Mohamed Bek Ja‘far in 1906 for Bulaq Press set standards for clarity in printed Qur'ans. Digital typography advanced in the 2010s with open-source fonts such as Amiri, a 2011 revival by Khaled Hosny of early 20th-century Bulaq naskh, optimized for book typesetting and Qur'anic text in software like LaTeX and web browsers.
StylePeriodKey CharacteristicsPrimary RegionsNotable Uses and Examples
Kufic7th–10th centuriesAngular, geometric, bold strokes, , ArabiaQur'anic manuscripts; architectural inscriptions (e.g., )
Naskh10th century onwardCursive, rounded, legible proportions, , broader Books, documents; basis for modern print (e.g., medieval Qur'ans)
Nastaliq14th–15th centuries onwardFlowing, slanted, rhythmic curves (), ()Poetry, literature (e.g., Persian divans, Urdu manuscripts)

Regional and language-specific variants

The Arabic script exhibits notable regional variations in letter forms and orthographic conventions, primarily distinguishing between the Western (Maghribi) and Eastern (Mashriqi) traditions. The Maghribi script, prevalent in including , , and , features rounded letterforms with curved vertical strokes for letters such as , , and , along with exaggerated horizontal extensions and open final curves that descend below the . These characteristics, which evolved from early influences softened by sweeping curves, facilitate a fluid, appearance adapted to local traditions and still used in texts today. In contrast, the Eastern or Mashriqi script, dominant in the such as the and , employs sharper angles and more proportions in letters like and dhal, with less pronounced recurves and a straighter posture that aligns with styles like Naskh. This distinction, attested in medieval sources, reflects geographical and cultural divergences in scribal practices, where Mashriqi forms prioritize in angular connections. Language-specific adaptations further diversify the script, particularly in . The , used for in regions like and , modifies the standard by adding diacritics to six letters to accommodate phonemes, such as extra dots for sounds absent in . This , introduced by Muslim traders and refined over centuries, incorporates all 31 letters plus six constructed ones, enabling representation of 's vowel and inventory while maintaining right-to-left flow. Similarly, the adapts for Javanese in , employing 28 graphemes to denote 23 consonants through minimal modifications, such as added diacritics for Javanese-specific sounds like the ng phoneme, and requiring harakat for s to suit the language's syllabic structure. These tweaks, developed in Islamic scholarly contexts, preserve the script's core while aligning with local phonetics, though Pegon usage has waned with the rise of Latin-based orthographies. Historical variants in the illustrate further evolution and decline. The , a regional style from and surrounding areas, emerged in the 13th century as a blend of angular and cursive elements, featuring elongated horizontals and distinctive loops in letters like and sad. Primarily used for Qur'anic manuscripts, it persisted into the 19th century with over 137 known examples, many dated to the 15th century, but gradually declined following the promotion of , which standardized more fluid Persian-influenced forms across northern . This shift marked the end of Bihari's prominence, confining it to a niche in pre-modern Islamic textual production.

Usage and Distribution

Current geographical and linguistic use

The Arabic script serves as the official in the 22 member states of the , spanning and the , where it is used for government, education, media, and daily communication. Collectively, these regions are home to approximately 420 million speakers of dialects, making the script essential for and cultural expression among the world's fifth-most . Beyond the , the Arabic script is adapted for several major languages in non-Arab regions, supporting diverse linguistic communities. In and , the Perso-Arabic variant is used for (Farsi) and , with around 80 million speakers, primarily in official documents, literature, and signage. In and , the Nastaʿlīq style of the script writes and for approximately 230 million and 40 million speakers respectively, serving as the national language in administration, education, and print media. In and , a modified Arabic script is used for , supporting about 6-7 million speakers in official and cultural contexts. In , the Jawi variant persists in and , particularly for religious texts, Islamic education, and regional signage in areas like province, where it coexists with . As of 2025, the Arabic script's digital adoption continues to expand in , notably for in China's region, where the remains the official standard for government publications, , and signage, reflecting efforts to integrate it into modern technology platforms. This growth underscores the script's resilience in bilingual digital environments despite competing Latin and Cyrillic systems. UNESCO and World Bank data highlight the script's role in literacy across Arabic-script regions, with adult literacy rates in Arab states averaging around 75% as of recent assessments, though variations exist—such as 98% in and 79% in —tied to educational access and script-based instruction. In non-Arab contexts, literacy tied to the script, like in Persian- and Urdu-speaking areas, exceeds 80% in urban centers, supported by widespread schooling in the adapted forms.

Adaptations for non-Arabic languages

The Arabic script has been adapted for various non-Arabic languages by incorporating additional characters or diacritics to represent phonemes absent in , particularly to accommodate Indo-Iranian and linguistic features. In (Farsi), the script adds four letters—چ (che), پ (pe), ژ (zhe), and گ ()—to denote the sounds /tʃ/, /p/, /ʒ/, and /g/, which are not present in the standard . These modifications emerged during the Islamicization of Persia in the 7th–9th centuries, enabling the script to fully represent while retaining the cursive, right-to-left structure. Urdu, an Indo-Aryan language, extends the Perso-Arabic script with four additional letters—ٹ (ṭe), ڈ (ḍāl), ڑ (ṛe), and ڻ (ṇūn)—specifically for retroflex consonants /ʈ/, /ɖ/, /ɽ/, and /ɳ/, which are characteristic of its phonology influenced by Prakrit and Sanskrit substrates. These letters, marked by a nukta (dot) diacritic below the base forms, were standardized in the 19th century during the development of modern Urdu literature in British India. Swahili (also known as Kiswahili), a , historically employed the Arabic script (known as Ajami) from the onward, adapting it with extra diacritics and notations to capture its five- system and syllable-timed structure, which differ from Arabic's consonant-heavy . This adaptation facilitated religious and trade-related writing along the East African coast until the 1930s, when colonial authorities mandated a shift to the for standardization and education. The following table illustrates key phoneme-to-grapheme mappings in these adaptations:
LanguagePhonemeGraphemeDescription
/p/پModified bāʾ with three dots above for labial stop.
/tʃ/چModified jīm with three dots above for .
/ʒ/ژModified rāʾ with three dots above for voiced .
/g/گModified kāf with two dots above for voiced velar stop.
/ʈ/ٹṬe: Dental tāʾ with nukta below for retroflex stop.
/ɖ/ڈḌāl: Dental dāl with nukta below for retroflex stop.
/ɽ/ڑṚe: Dental with nukta below for retroflex flap.
/ɳ/ڻṆūn: Dental nūn with nukta below for retroflex nasal.
Swahili (Ajami)/ɪ/, /ʊ/ِ, ُ (with extensions)Short vowels marked by kasra and damma, often with additional dots or lines for contrasts.

Historical and discontinued uses

In Central Asia, the Sogdian language and its Aramaic-derived script, widely used from the 4th to 8th centuries CE, began transitioning to the Arabic script following the Islamic conquests of the region in the 8th century, as Sogdian speakers adopted Islam and incorporated Arabic linguistic elements into their writings. This shift marked the integration of Sogdian into the broader Islamic scholarly tradition, with Arabic script facilitating religious and administrative texts among Sogdian communities in areas like Transoxiana. By the early 20th century, under Soviet influence, Arabic-based scripts for Turkic languages in Central Asia, including remnants of Sogdian-influenced systems, were phased out; a Latin alphabet was introduced in the late 1920s, followed by a full replacement with Cyrillic scripts in the late 1930s to promote Russification and literacy in Russian. This discontinuation effectively ended the use of Arabic script for local languages across Soviet Central Asia by the 1940s. In , the Arabic script served as the basis for , which was written in a Perso-Arabic variant from the 14th century until the 1928 alphabet reform under , when it was replaced by a Latin-based script to modernize education and reduce Islamic cultural ties. This reform discontinued the Ottoman script's use in official and literary contexts, leading to widespread literacy campaigns that rendered Arabic-script Turkish obsolete within a decade. Additionally, in the , —a practice of writing like and Aragonese in Arabic script—emerged among Muslim communities (Mudejars and Moriscos) from the 15th to the 18th centuries, producing literature on religious, poetic, and moral themes to preserve Islamic identity under Christian rule. Following the in 1609–1614 and subsequent cultural suppression, Aljamiado ceased as a living tradition by the early . In Africa, the Ajami script, an adaptation of Arabic for local languages, was employed for Hausa in West Africa and Swahili in East Africa from the 16th century onward, enabling poetry, religious texts, and correspondence among Muslim populations. For Hausa speakers, who numbered in the tens of millions by the 19th century, Ajami facilitated widespread literacy outside formal education, with up to 80% proficiency in some communities. Similarly, Swahili Ajami supported literary epics and Islamic scholarship across coastal and inland regions. European colonial powers in the 19th and 20th centuries imposed Latin-based orthographies—such as "Boko" for Hausa under British rule in Nigeria—viewing Ajami as primitive and a barrier to administrative control, leading to its decline through school curricula, book burnings, and redefinition of literacy to exclude non-Latin systems. By the mid-20th century, Ajami had largely discontinued in favor of colonial scripts for these languages. In , the Perso-Arabic script () for , used primarily by Muslim communities, experienced a sharp decline in following the 1947 , as mass migrations shifted the Muslim-majority population to , leaving as the dominant script in Indian Punjab. Prior to , coexisted with in the region, but post-1947 demographic changes and promotion of for official use in marginalized Arabic-script and education. This discontinuation aligned with broader linguistic standardization efforts, reducing 's role in Sikh and Hindu contexts to near obsolescence.

Technical Implementation

Unicode encoding and digital representation

The Arabic script is encoded in the Unicode Standard primarily within the (BMP), with characters distributed across several dedicated blocks to accommodate core letters, diacritics, variants, and historical forms. The primary Arabic block, designated U+0600–U+06FF, encompasses 256 code points and includes the 28 basic Arabic letters, common diacritical marks such as the kasra (U+064E) and shadda (U+0651), Qur'anic annotation signs, and Arabic-Indic digits from ٠ (U+0660) to ٩ (U+0669). This block forms the foundation for standard orthography and supports right-to-left () text directionality inherent to the script. For extended letter variants used in specific languages or historical contexts, the Arabic Supplement block (U+0750–U+077F) provides 48 additional code points, introduced in Unicode 4.1 (2005). Examples include forms like Arabic Letter Beh with Three Dots Horizontally Below (U+0750, ݐ) for certain African languages and Arabic Letter Kaf with Two Dots Above (U+077F, ݿ) for historical notations. Further expansions address historical and regional adaptations; notably, Unicode 14.0 (2021) introduced the Arabic Extended-A block (U+08A0–U+08FF) with additional letters and diacritics for languages including African and Caucasian scripts, and the Arabic Extended-B block (U+0870–U+089F) with 48 code points for obsolete or variant forms, such as Arabic Letter Alef with Attached Fatha (U+0870) used in certain non-Arabic orthographies including African languages. These additions up to Unicode 17.0 (2025) ensure comprehensive coverage of historical Arabic-derived scripts without altering core encoding principles. Core Arabic letters are assigned specific code points within the primary , independent of their contextual forms; for instance, the letter is encoded as U+0627 (ا), which represents its isolated form but is rendered variably based on position in a word. Similarly, ba' is U+0628 (ب), and they join according to rules during display. Unicode's design separates logical encoding from visual presentation, relying on font systems and rendering engines to handle joining behaviors. The right-to-left nature of text requires the Bidirectional (UAX #9), which determines display order for mixed directional scripts in documents. This processes text by resolving embedding levels for RTL segments, such as words interspersed with LTR elements like numbers or Latin text, ensuring proper reordering in environments like and CSS via the direction: rtl property and unicode-bidi controls. Brief rendering challenges arise in complex layouts, but these are addressed through standardized font features rather than encoding changes. Unicode's encoding for aligns with the ISO/IEC 10646, which defines the Universal Coded Character Set (UCS) and incorporates all characters, including Arabic blocks, to facilitate global . This harmonization ensures that Arabic script data can be exchanged across systems without loss, with ISO/IEC 10646 specifying the same code points and properties for Arabic as in version 17.0.
Unicode BlockRangeKey ContentsVersion Introduced
U+0600–U+06FFBasic letters (e.g., U+0627 ا ), diacritics, digits1.0 (1991)
Arabic SupplementU+0750–U+077FLetter variants (e.g., U+0750 ݐ Beh variant)4.1 (2005)
Arabic Extended-BU+0870–U+089FHistorical forms (e.g., U+0870 Alef variant)14.0 (2021)

Challenges in digital typography and rendering

In the early days of digital document creation, particularly with PDF formats before the 2000s, Arabic script faced significant challenges in rendering cursive joining behaviors, often resulting in disjointed letters that disrupted the script's natural connectivity and aesthetic integrity. These failures stemmed from limited support in early typesetting systems, which treated Arabic characters as isolated forms rather than contextually linked glyphs, leading to poor legibility in printed and digital outputs. Solutions emerged through the adoption of OpenType font technology, specifically the Glyph Substitution (GSUB) table, which enables contextual substitutions for initial, medial, final, and isolated forms of Arabic letters to ensure proper cursive connections. By the mid-2000s, GSUB implementations in fonts like those developed for Microsoft Windows allowed for more accurate rendering across applications, marking a pivotal advancement in Arabic digital typography. Despite these improvements, font availability remains a persistent issue, especially for specialized variants like , which is widely used in and contexts but suffers from shortages in high-quality digital implementations due to its complex, slanted structure. This scarcity has historically limited and in regions where Nastaliq is preferred, often forcing designers to rely on suboptimal Naskh-based alternatives that alter visual authenticity. Projects like Google's , launched in the as part of the broader Noto font family initiative, addressed this gap by providing open-source fonts supporting multiple styles, including Nastaliq variants, to ensure consistent rendering across over 800 languages and eliminate "tofu" placeholders for unsupported characters. Noto's development, in collaboration with Monotype, emphasized comprehensive coverage for scripts, significantly boosting digital adoption in and environments by the late . Input methods for Arabic script continue to pose practical hurdles, particularly with keyboard layouts such as the standard Arabic 101-key configuration, which maps 28 letters plus diacritics to a base but often leads to inefficient typing due to frequent shifts for common characters and inconsistent support across operating systems. On devices, autocorrect systems exacerbate these issues by poorly handling diacritics (harakat), frequently misplacing or omitting vowel marks during , which is critical for precise Quranic or poetic rendering. This results in error-prone input, especially for learners or non-native users, as keyboards like those on and struggle with the script's right-to-left directionality and contextual shaping. As of 2025, emerging challenges include the need for AI-driven font generation to revive endangered Arabic styles, such as regional calligraphic variants at risk of , where models are being explored to automate design while preserving cultural nuances. for screen readers remains a key concern, with tools often failing to properly interpret Arabic's cursive joins and diacritics, leading to fragmented audio output that hinders navigation for visually impaired users in and content. Ongoing W3C efforts highlight gaps in layout requirements, prioritizing solutions like enhanced text-to-speech engines tailored for Arabic script to improve inclusivity in digital interfaces.

Extensions

Additional letters and characters

The Arabic script has been extended beyond its core 28 letters to accommodate phonetic needs in numerous non-Arabic languages, particularly through modifications like added diacritics, new letter forms, and contextual variants encoded in blocks such as Arabic (U+0600–U+06FF), Arabic Extended-A (U+08A0–U+08FF), and Arabic Extended-B (U+0750–U+077F). These extensions enable representation of sounds absent in , such as implosives, retroflexes, and specific vowels, supporting languages from to and . Common additions include letters for , , , and , while rarer forms appear in African Ajami scripts for languages like and Fulfulde. Among the most widely used extensions are those for Indo-Iranian languages. For instance, the Urdu nasalized noon (ں, U+06BA) represents a syllabic nasal /n̩/ or /̃/, essential for words like "kitābẽ" (books). In Kurdish, the open o (ۆ, U+06C6) denotes the vowel /o/, distinguishing it from the standard waw /u/ or /w/, as in Sorani Kurdish orthography. Persian adds peh (پ, U+067E, /p/), cheh (چ, U+0686, /tʃ/), zhe (ژ, U+0698, /ʒ/), and gaf (گ, U+06AF, /ɡ/), which are crucial for native phonemes not present in Arabic. Pashto employs further variants like xwe (ښ, U+069A, /ʂ/) and noong (ږ, U+0696, /ŋ/) to capture retroflex and velar nasal sounds. Rarer extensions are prominent in Ajami scripts, where the Arabic script was adapted for languages during Islamic expansion. In , additional forms like keh with three dots above (ݣ, U+0763) represent labialized /kʷ/ or palatalized /kʲ/, while ghain with three dots above (ࣃ, U+08C3) denotes /ɡʷ/ or /ɡʲ/ in emphatic contexts. For implosives in and related languages, characters such as beh with above (ࢡ, U+08A1) indicate the implosive bilabial stop /ɓ/, and yeh with two dots below and above (ࢨ, U+08A8) for the glottalized palatal approximant /ʝ/. These Ajami innovations, often using stacked diacritics or modified bases, allow expression of tonal and ejective sounds unique to West and East phonologies, as seen in Fulfulde and Wolof orthographies. Standardization of these extensions has been advanced by international bodies since the 2010s, particularly through the Unicode Consortium's encoding proposals and the W3C Arabic Layout Task Force, which addresses rendering challenges for diverse variants in digital environments. The , established in 2015, collaborates with linguists to ensure consistent support for over 50 extended characters across browsers and fonts, drawing on input from language communities in and . Efforts like the 2018 Unicode proposal for Hausa-specific letters highlight ongoing work to encode underrepresented Ajami forms without disrupting existing Arabic typography. More recently, (September 2025) added the Arabic Extended-C block (U+10EC0–U+10EFF), introducing 64 characters for additional Qur'anic annotations used in and , as well as letters for the in Indonesian languages. The following table catalogs over 50 representative additional letters and characters, selected from Unicode encodings for key languages. It includes the character glyph (isolated form where possible), Unicode code point, formal name, approximate phoneme(s), and primary language(s) of use. This is not exhaustive but illustrates the diversity of extensions.
CharacterUnicodeNamePhoneme(s)Language(s)
ٱU+0671Arabic Letter Alef Wasla/a/ (elided)Quranic Arabic
ٲU+0672Arabic Letter Alef with Wavy Hamza Above/ʔa/Baluchi, Kashmiri
ٴU+0674Arabic Letter High Hamza/ʔ/ (high)Kazakh, Jawi
ٹU+0679Arabic Letter Tteh/ʈ/Urdu, Sindhi
ٺU+067AArabic Letter Tteh with Small Tah Above/ʈʰ/Sindhi
ٻU+067BArabic Letter Beeh/ɓ/Sindhi
ټU+067CArabic Letter Teh with Ring/ʈ/Pashto
ٽU+067DArabic Letter Teh with Small Tah Above/t̪/Sindhi
پU+067EArabic Letter Peh/p/Persian, Urdu
ٿU+067FArabic Letter Peh with Small Tah Above/pʰ/Sindhi
ݐU+0750Arabic Letter Beh with Three Dots Horizontally Below/ɓ/African languages (e.g., Hausa)
ݑU+0751Arabic Letter Beh with Dot Below and Three Dots Above/bʷ/Hausa
ݒU+0752Arabic Letter Beh with Three Dots Pointing Upwards Below/ɓ/African Ajami
ݓU+0753Arabic Letter Beh with Three Dots Pointing Upwards Below and Two Dots Above/bʲ/African languages
ݔU+0754Arabic Letter Beh with Two Dots Below and Dot Above/ɗ/Saraiki
ݕU+0755Arabic Letter Beh with Inverted Small V Below/ɓ/African Ajami
ݖU+0756Arabic Letter Beh with Small V/v/Shina
ݗU+0757Arabic Letter Hah with Two Dots Above/ħ/African languages
ݘU+0758Arabic Letter Hah with Three Dots Pointing Upwards Below/ɣ/African Ajami
ݙU+0759Arabic Letter Dal with Two Dots Vertically Below and Small Tah/d̪/Saraiki
ݚU+075AArabic Letter Dal with Inverted Small V Below/ɖ/African languages
ݛU+075BArabic Letter Reh with Stroke/ɽ/African Ajami
ݜU+075CArabic Letter Seen with Four Dots Above/s/ (emphatic)Shina
ݝU+075DArabic Letter Ain with Two Dots Above/ʕ/African languages
ݞU+075EArabic Letter Ain with Three Dots Pointing Downwards Above/ʕʷ/African Ajami
ݟU+075FArabic Letter Ain with Two Dots Vertically Above/ʕʲ/African languages
ݠU+0760Arabic Letter Feh with Two Dots Below/v/African Ajami
ݡU+0761Arabic Letter Feh with Three Dots Pointing Upwards Below/ɸ/African languages
ݢU+0762Arabic Letter Keheh with Dot Above/k/Jawi
ݣU+0763Arabic Letter Keheh with Three Dots Above/kʷ/, /kʲ/Hausa, Amazigh
ݤU+0764Arabic Letter Keheh with Three Dots Pointing Upwards Below/q/African Ajami
ݥU+0765Arabic Letter Meem with Dot Above/mʲ/African languages
ݦU+0766Arabic Letter Meem with Dot Below/ɱ/Maba
ݧU+0767Arabic Letter Noon with Two Dots Below/ɲ/Arwi
ݨU+0768Arabic Letter Noon with Small Tah/ɳ/Saraiki
ݩU+0769Arabic Letter Noon with Small V/ɲ/Gojri
ݪU+076AArabic Letter Lam with Bar/ɭ/African languages
ݫU+076BArabic Letter Reh with Two Dots Vertically Above/ɽʒ/Torwali
ݬU+076CArabic Letter Reh with Hamza Above/ʑ/Ormuri
ݭU+076DArabic Letter Seen with Two Dots Vertically Above/ʃ/Kalami
ݮU+076EArabic Letter Hah with Small Arabic Letter Tah Below/χ/Khowar
ݯU+076FArabic Letter Hah with Small Arabic Letter Tah and Two Dots/ʁ/Khowar
ݰU+0770Arabic Letter Seen with Small Arabic Letter Tah and Two Dots/sˤ/Khowar
ݱU+0771Arabic Letter Reh with Small Arabic Letter Tah and Two Dots/ɹˤ/Khowar
ݲU+0772Arabic Letter Hah with Small Arabic Letter Tah Above/ħʷ/Torwali
U+08A0Arabic Letter Beh with Small V Below/bʷ/African languages
U+08A1Arabic Letter Beh with Hamza Above/ɓ/Adamawa Fulfulde
U+08A2Arabic Letter Jeem with Two Dots Above/d͡ʒ/African Ajami
U+08A3Arabic Letter Tah with Two Dots Above/tʰ/African languages
U+08A4Arabic Letter Feh with Dot Below and Three Dots Above/ɸ/African Ajami
U+08A5Arabic Letter Qaf with Dot Below/ɢ/African languages
U+08A6Arabic Letter Lam with Double Bar/ʎ/African Ajami
U+08A8Arabic Letter Yeh with Two Dots Below and Hamza Above/ʝ/Adamawa Fulfulde

Numerals and their evolution

The , also known as the Hindu-Arabic numeral system, originated from the developed in ancient , where a decimal place-value system with nine symbols and a zero was established by the BCE. These numerals reached the through and scholarly exchanges, becoming known in regions under Arab influence as early as 662 , as recorded by the scholar Severus Sebokht. In the , the Persian mathematician Muhammad ibn Musa played a pivotal role in their adoption and dissemination by authoring a treatise on Indian calculation methods, titled On the Calculation with Hindu Numerals (c. 825 CE), which explained the system's arithmetic operations and . Although the original Arabic text is lost, a 12th-century Latin translation, Algoritmi de numero Indorum, preserved its content and facilitated the system's spread to . By the , the numeral forms diverged into Eastern Arabic variants (٠١٢٣٤٥٦٧٨٩), used in the eastern including the , , and Persia, and Western Arabic variants (closer to modern 0-9), which emerged in the and through local scribal adaptations. This split arose from independent evolutions in handwriting and regional mathematical texts, with the Western forms, often called Gubar numerals after the Arabic word for "dust" (referring to dust-board calculations), gaining prominence in . Regional variants further diversified the system. In the , Western evolved distinctly by the 10th century, featuring rounded shapes like ⓪①②③④⑤⑥⑦⑧⑨ in early manuscripts, and were transmitted to via , influencing the global standard. numerals, a variant of the Eastern Arabic set, differ in digits 4 (۴), 5 (۵), and 6 (۶), reflecting calligraphic influences in and , while maintaining the same positional values. In modern computing, the are encoded in as U+0660–U+0669 (Arabic-Indic digits: ٠ through ٩), supporting right-to-left rendering in Arabic-script languages, with extended variants U+06F0–U+06F9 for and similar forms. This standardization draws from ISO/IEC 8859-6 (), ensuring compatibility in digital and international software, where both Eastern and Western forms coexist for multilingual applications.

Components

Basic graphemes and radicals

The Arabic script is built upon 18 basic graphemes known as basic shapes or letter-shapes, which form the core structures from which the 28 letters of the alphabet are derived by adding diacritical marks such as dots. These fundamental elements originated in early forms of the script and include a variety of strokes that provide the skeletal framework for letter construction. The basic shapes encompass diverse stroke types, including verticals, horizontals, diagonals, and curves, each contributing to the distinctive flow of the script. Vertical strokes, for example, appear as tall, straight lines in letters like (ا), serving as standalone elements or stems. Horizontal strokes form baseline extensions, as seen in the final form of nun (ن), where they create a flat, connecting bar. Curves add fluidity, evident in sin (س), which employs rounded arcs to form its serpentine shape. These strokes are executed with consistent pen angles in traditional to maintain proportional harmony. A key basic shape is the (nuqṭah), a small circular mark placed above or below base shapes to differentiate homographic letters. Representative examples include the single nuqṭah positioned below the bā’ (ب) to distinguish it from tā’ (ت) and thā’ (ث), and above dāl (د) to set it apart from dhāl (ذ). Other basic shapes include loops, tails, and notches, which combine to form complex shapes while adhering to alignment.
Basic Shape/Stroke TypeDescriptionExample(s)
Vertical strokeStraight downward line, often tall and isolatedAlif (ا)
Horizontal strokeRight-to-left line along the baselineFinal nun (ن)
Curve (open/closed)Arced or looped path for fluiditySin (س), mīm (م)
Diagonal/notchSlanted line or angled cutJīm (ج) initial
Dot (nuqṭah)Small circle for differentiationBā’ (ب) below; dāl (د) above
Tail (returning)Extending curve folding under baselineFinal yā’ (ى)
Bowl/loopRounded enclosure below or above baselineFinal nūn (ن), ṣād (ص)
Non-basic shape elements, such as tashkīl (diacritical vowel marks), are superimposed on these graphemes to guide and are positioned relative to the script's for . The fatḥah (َ, "a" sound) and ḍammah (ُ, "u" sound) sit above the baseline, while the kasrah (ِ, "i" sound) appears below it; the sukūn (ْ, no vowel) is a small circle above. (ّ, ) combines with these above the letter, and tanwīn (indefinite article markers) doubles the marks in their respective positions. These elements do not alter the basic shapes but enhance phonetic clarity, particularly in educational or religious texts.

Letter construction and ligatures

The Arabic script's letters are assembled from a limited set of basic components, often referred to as basic shapes or graphemes, which include fundamental strokes like vertical lines, curves, loops, and diacritical marks such as dots. These elements typically combine in groups of 2 to 5 to form the 28 distinct letters, enabling efficient representation of phonetic variations through minimal additions rather than entirely new shapes. For instance, the letter غ (ghayn) is constructed from the base shape of ع (ʿayn)—a looped curve opening to the right—augmented by three dots stacked above the loop to distinguish its emphatic velar fricative sound. This modular approach stems from the script's historical evolution, where early forms were simplified and differentiated using iʿjam (diacritical dots) to avoid ambiguity in undotted precursors. Similarly, more complex letters incorporate additional basic shapes for tails or extensions. The letter ي (yāʾ) exemplifies this, built from a primary curved stem shape akin to the base of ب (bāʾ), extended by a descending tail curve that sweeps leftward, and finalized with two dots positioned below the stem to indicate its semi-vowel quality. Such constructions ensure that letters maintain cursive compatibility while allowing visual and phonetic distinction; for example, the tail in ي not only aids joining but also evokes the script's flowing aesthetic derived from pen strokes. Overall, the 18 core basic shapes underpin all 28 letters, with dots and marks applied systematically to create the full inventory without requiring unique forms for each. Ligatures in the Arabic script involve the fusion of adjacent letters into unified glyphs, enhancing readability and stylistic harmony in writing. While the script's inherent connectivity makes most joins contextual, true ligatures—where letters transform into a single, indivisible form—are generally optional in modern printed but mandatory in specific cases or traditional styles like naskh and ruqʿah. The most prominent mandatory ligature is لا (), which merges ل (lām) and ا () into a distinct upright form to represent the common "la," preventing awkward spacing and improving flow; this is universally applied across fonts and . Beyond this, over 20 common optional ligatures exist, employed for aesthetic or space-saving purposes in particular contexts, such as religious phrases or decorative text. Examples include لله (lām-lām-hā, part of "Allāh") where the two lāms and hā blend into a compact, elevated form, and sequences like شمس (shīn-mīm-sīn) that may fuse in calligraphic styles for smoother transitions. These optional forms vary by script style—more prevalent in traditions than —and are not required for , allowing writers flexibility while preserving the script's interconnected . A key aspect of letter assembly involves stacking diacritics and modifiers, particularly for the hamza (ء), a glottal stop that requires precise positioning on carrier letters to align with phonetic rules and visual balance. The hamza is seated on one of three primary carriers: alif (أ or إ), wāw (ؤ), or yāʾ (ئ), with placement determined by the short vowel associated with the hamza—above alif for fatha (a), below wāw for ḍamma (u), and on the line of yāʾ for kasra (i)—to ensure the carrier's inherent vowel supports the hamza's pronunciation. For initial or standalone positions, hamza appears independently (ء), but in medial or final spots, it stacks above or below the carrier without altering the base letter's joinability. This rule-based stacking prevents overlap and maintains the script's baseline integrity, as seen in words like أَكْل (akl, "eat") where hamza crowns the alif.

References

  1. [1]
    Arabic Script - an overview | ScienceDirect Topics
    Arabic script is defined as a writing system consisting of 28 letters, used extensively around the world, that is written in a right-to-left cursive style.
  2. [2]
    Arabic Scripts - Basics
    Arabic script is an abjad, ie a writing system based mostly on the indication of consonants only. It developed from the cursive form of Nabatean Aramaic script.
  3. [3]
    [PDF] On the Origin of Arabic Script - Fluxus Editions
    It is most reasonable then to conclude that the traits of Arabic script have at least two sources, Nabatean and Syriac. References. AlBaladhuri, Ahmad (1969) ...<|control11|><|separator|>
  4. [4]
    The Qur'an and the development of Arabic scripts between the 7th ...
    As seen on a page from an 8th century manuscript, the script is characterized by the accentuation of the vertical strokes of the letters. The letters slant to ...Missing: sources | Show results with:sources
  5. [5]
    An Introduction to Arabic Script - Education - Asian Art Museum
    Another characteristic of Arabic letters is that they change form ever so slightly depending on their position in a word: initial, beginning, middle, or final.Missing: scholarly sources
  6. [6]
    The Arabs in the First Communication Revolution: The Development ...
    Both Latin and Arabic scripts were derived from the Phoenician, which represents a shared cultural heritage between the Middle East and the West. However, the ...
  7. [7]
    [PDF] Roots of Modern Arabic Script: From Musnad to Jazm
    The first one was found in Zabad of northern Syria, It was dated to 512 CE. It is a multilingual inscription including Greek, Syriac and Arabic. The second ...
  8. [8]
    (PDF) On the Origin of Arabic Script - Academia.edu
    Arabic script exhibits influences from both Nabatean and Syriac scripts, challenging the notion of a single ancestor. Historical examples like Ugaritic ...
  9. [9]
    Namarah Inscription: The Second Oldest Dated Pre-Islamic Arabic ...
    May 5, 2005 · 328 CE. Script. It is the one of the two Arabic inscriptions written in Nabataean alphabet. The other one is the ʿEn ʿAvdat inscription of ...
  10. [10]
    A Pre-Islamic Trilingual Inscription In Greek, Syriac & Arabic From ...
    Nov 13, 2017 · Zebed Inscription: A Pre-Islamic Trilingual Inscription In Greek, Syriac & Arabic From 512 CE. ... Note the Arabic text is in the bottom.
  11. [11]
    The Arab Conquest | Silk Roads Programme - UNESCO
    From the mid-seventh century, Muslim Arab armies from Saudi Arabia began to travel north into Central Asia and west across Africa, invading the countries ...<|control11|><|separator|>
  12. [12]
    The Spread of Islam in Ancient Africa - World History Encyclopedia
    May 10, 2019 · Islam spread from the Middle East to take hold across North Africa during the second half of the 7th century CE when the Umayyad Caliphate (661- ...Missing: 9th | Show results with:9th
  13. [13]
    Arab-Muslim Conquests (A.D. 632-700) - Middle East And North Africa
    In 10th century, a new style of high literature appeared written in Persian with Arabic script and enhanced y Arabic words. The Persian provinces produced some ...
  14. [14]
    [PDF] Arabic Scripts and External Factors Kufic and Naskhī Scripts
    Abstract: The evolution of Islamic formal script is relatively well understood in chronological and geographical terms. Scholars of paleography have ...
  15. [15]
    Islamic Calligraphy
    ### Summary of Umayyad and Abbasid Caliphates' Role in Arabic Script Transformation and Dissemination
  16. [16]
    [PDF] I'jam and the Development of Islamic Khatatti
    Consequently, Abu al-Aswad al-. Duʾali asks for a scribe and invents a system to differentiate the consonant and vowels according to mouth positions (ibid ...
  17. [17]
    Α Linguistic View of the Development - of the Arabic Writing System
    ... al'Aswad alDu'ali had a scribe from the tribe of 'Abd Qals Sit before a large codex of the. Qur'än and follow as he recited it5®. "Whenever", he Said to the ...
  18. [18]
    [PDF] writing, print, and technologies of the Arabic alphabet - UC San Diego
    century, Khalil Ibn Ahmad al-Farahidi replaced colored diacritics with a new system of diacritical tashkil. al-Farahidi's new vocalization marks were placed.Missing: 8th | Show results with:8th
  19. [19]
    Art of Arabic Calligraphy - Sakkal Design
    Abul Aswad al Du'ali (d. 688) was the legendary founder of Arabic grammar, and is credited with inventing the system of placing large colored dots to indicate ...Missing: 684 CE
  20. [20]
    [PDF] Ottoman Reforms Before and During the Tanzimat - DergiPark
    Jul 26, 2024 · Particularly during the Tanzimat period, the printing press became an effective tool for announcing and implementing reforms to the public.
  21. [21]
    An Iranian Intellectual in Istanbul: Mirza Malkum Khan and Reform ...
    Malkum Khan's reform aimed to modernize the Arabic alphabet for improved literacy in the Ottoman Empire. The Ottoman bureaucratic response to alphabet reform ...
  22. [22]
    The Formation of National Culture in Egypt in the Interwar Period
    Jan 29, 2009 · In Egypt the transition from an indigenous manuscript culture to 'print capitalism' involved a negotiation over language standardization that ...
  23. [23]
    Turkey switches from Arabic script to the Latin alphabet - The Guardian
    Oct 25, 2023 · On 1 November 1928, a new Turkish alphabet law was passed making making the use of Latin letters compulsory in all public communications and the education ...
  24. [24]
    [PDF] Persian Orthography - Modification or Changeover? (1850–2000)
    The present study provides a description of the standardization process of Persian orthography since the introduction of the Arabic alphabet, ...
  25. [25]
    Arabic and the Art of Printing: A Special Section - Saudi Aramco World
    By 1980, the Computer Revolution - or Data Processing Revolution - had begun to transform world communications almost as drastically as the invention of movable ...
  26. [26]
    Pilot project for endangered Arabic manuscripts in Ivory Coast
    The project will digitally preserve 50,000 pages of endangered Arabic and Ajami manuscripts (texts written with the modified Arabic script) produced by ...
  27. [27]
    Arabic & Persian Layout Requirements - W3C
    Oct 2, 2025 · This document describes requirements for the layout and presentation of text in languages that use the Arabic script when they are used by Web standards and ...
  28. [28]
    [PDF] Arabic - The Unicode Standard, Version 17.0
    These charts are provided as the online reference to the character contents of the Unicode Standard, Version 17.0 but do not provide all the information needed ...Missing: types | Show results with:types
  29. [29]
  30. [30]
    Writing and Pronouncing the Hamza (ء): A Guide for the Perplexed
    Hamza (or hamzaŧ, as written in our transcription system) is always written on the line (ء) when it is used in the abstract.
  31. [31]
    A Comprehensive Guide to Quran Tajweed Rules - Madinah Arabic
    Idgham occurs when a Noon Sakinah or Tanween is followed by any letter of (Noon, Meem, Yaa, Waaw, Raa, or Lam) called also Idgham letters. The sound of the Noon ...
  32. [32]
    Principles of variation in the use of diacritics (taškīl) in Arabic books
    This study is the first empirical investigation into the variation in how Arabic diacritics are used. It employs quantitative corpus linguistic methods to ...
  33. [33]
    The origin and development of Maghribī round scripts. Arabic ...
    Feb 6, 2018 · This thesis aims to study the origin and development of Maghribī round scripts, i.e. the highly distinctive writing styles employed in the ...
  34. [34]
    A By-Letter Comparison of the Andalusian and Maghrebi ...
    length towards the latter continues in the study of the Maghrebi script, the feature of pronounced. Alef​ recurves in this style leads to a noticeable shift ...
  35. [35]
    Jawi, an endangered orthography in the Malaysian linguistic ...
    Introduced by Muslim traders, it was adapted to Malay phonology using diacritics that modified six letters. It was used until the Roman script (Rumi) brought in ...
  36. [36]
    [PDF] METHODS OF LEARNING AND WRITING JAWI SCRIPTS WITHIN ...
    According to al-Attas (1990), Jawi script is a result of the renovation and addition of Arabic letters to adapt to the Malay language to facilitate the ...
  37. [37]
    Arabic-Java Writing System: How Javanese Language Adopts ...
    Apr 30, 2021 · Finally, this research found that Arabic-Java orthography has 28 graphemes which are used to represent 23 consonants. Modification letters and ...
  38. [38]
    (PDF) PEGON LETTERS: HISTORY AND FORM OF LETTERS
    However, Pegon letters did not change many forms of. Arabic letters, only a few letters were modified to adjust pronunciation in Javanese. To find out the ...Missing: scholarly | Show results with:scholarly
  39. [39]
    (PDF) Bihari calligraphy - Academia.edu
    Bihr calligraphy, originating in pre-Mughal India around the seventh/thirteenth century, is characterized by its unique blend of angular and cursive styles ...
  40. [40]
    Manuscripts in Bihari Calligraphy: Preliminary Remarks on a Little ...
    Bihari Korans use Bihari calligraphy, specific to the Indian subcontinent, with common ornamentation, a particular page layout, and a complex hermeneutical ...
  41. [41]
    Manuscripts in Bihari Calligraphy: Preliminary Remarks on a Little ...
    Aug 9, 2025 · The oldest specimens are among the earliest known manuscripts for India, even though their production extended into the nineteenth century, ...
  42. [42]
    How Many Countries Speak Arabic? (Full List of Arabic Countries)
    Jun 3, 2024 · Arabic is the main official national language in 22 countries in the Middle East and North Africa: Bahrain, Iraq, Jordan, Kuwait, Lebanon, Oman, Palestine, ...
  43. [43]
  44. [44]
    27 most spoken languages in the world in 2025
    Jun 2, 2025 · Persian, also known as Farsi, is spoken by approximately 80 million people in total, with around 70 million native speakers. It is primarily ...
  45. [45]
    Jawi script - Wikipedia
    Jawi is based on the Arabic script, consisting of all 31 original Arabic letters, six letters constructed to fit phonemes native to Malay, and one additional ...Maranao language · Maguindanao language · Ternate language
  46. [46]
    Uyghur: The Living Language of the Silk Road, Thriving in Modern ...
    Oct 2, 2025 · Current Standard: In China today, the standard written form is the Uyghur Arabic alphabet, a script adapted to perfectly capture the sounds of ...
  47. [47]
    Uyghur alphabets - Wikipedia
    UEY: the Uyghur Arabic alphabet, the only official alphabet in the Xinjiang province of China and is widely used in government, social media and in everyday ...
  48. [48]
    Literacy rate, adult total (% of people ages 15 and above) - Arab World
    Arab World ; Algeria · 75 ; Bahrain · 98 ; Comoros · 76 ; Egypt, Arab Rep. · 79 ; Iraq · 84.
  49. [49]
    Middle East: Arab world turns page on literacy | The Jerusalem Post
    May 15, 2019 · The adult literacy rate in Arab states among people 15 and older is around 75%, 11 percentage points below the world average of 86%.
  50. [50]
    Persian Online – Grammar & Resources » The Writing System
    In Arabic this writing system has 28 characters; in Persian, 32 (as there are no p, ch, zh, and g, in Arabic). With the addition of the hamze (the glottal stop, ...
  51. [51]
    Chapter 15 – Unicode 16.0.0
    The nukta diacritic also marks letters representing some sounds in Urdu or sounds not native to Hindi. No precomposed characters are encoded in those cases ...
  52. [52]
    [PDF] parh19-tug-evolution-persian-arabic-scripts.pdf - ece.ucsb.ed
    Jul 16, 2019 · Throughout this paper, my use of the term “Persian script” is a shorthand for scripts of a variety of Persian forms (Farsi/Parsi, Dari, Pashto, ...
  53. [53]
    [PDF] Urdu (in Arabic script) romanization table 2013
    For the use of ا (alif) to support ء (hamzah) and (maddah), see rules 1 and 2, respectively. For the romanization of ء by (alif), see rule 12. For other ...Missing: additional | Show results with:additional
  54. [54]
    [PDF] L2/02- - Unicode
    Jul 16, 2002 · Arabic-script letter to indicate retroflexion is derived from Urdu usage, but Urdu itself does not have a retroflex. /n/. Sindhi has this ...
  55. [55]
    [PDF] THE STORY OF SWAHILI - OHIO Open Library
    Arabic Script and Bantu Speech. The adoption of Arabic script presented a problem to Swahili writers in that the phonetic symbols, the letters, made ...
  56. [56]
    [PDF] OLD SWAHILI-ARABIC SCRIPT AND THE DEVELOPMENT OF ...
    Kiswahili. In 1930s and 1940s 'Standard Swahili' became a reality. It was used for all sorts of publishing, the greater part of which was literature ...
  57. [57]
    [PDF] CENTRAL ASIA FROM THE 9TH TO THE 13TH CENTURY
    After they embraced Islam in the 8th and 9th centuries they also ... Turkic language and adopted Arabic script for their language. Therefore, the ...
  58. [58]
  59. [59]
    [PDF] SOVIET AND POST-SOVIET LANGUAGE POLICIES IN THE ...
    Before 1917, the peoples of Central Asia used. Page 2. 466 the Arabic alphabet to write their languages. This script was a familiar system of writing as well as ...
  60. [60]
    Ottoman Turkish | Middle Eastern Studies - The University of Chicago
    Mustafa Kemal (Atatürk)'s alphabet reform in 1928 which adopted the Latin alphabet marks the end of "Ottoman Turkish." During the language reform of the 1930s a ...
  61. [61]
    Uses and written practices in Aljamiado manuscripts - Academia.edu
    In the 15th-17th centuries, the Spanish Islamic communities, mudejares (until 1502) and moriscos (from 1502) developed an interesting written production: ...
  62. [62]
    [PDF] Uses and Written Practices in Aljamiado Manuscripts
    Aljamiado manuscripts were copied within the Muslim communities in Spain: Mudejars (before the forced baptism in 1502) and Moriscos (after this date), from the ...
  63. [63]
    From Africa, in Ajami - Saudi Aramco World
    By this time, however, Ajami began running headlong into the Latin-based scripts of European languages imposed by colonial administrators who viewed Ajami as ...
  64. [64]
    The Status of Punjabi after 1947 - BAGICHA
    Apr 27, 2019 · After 1947, Punjabi gained official status in India, but in Pakistan, it remains informal. The 1966 Punjab Reorganisation Act created a new ...
  65. [65]
    How to Preserve the Punjabi Language
    Jan 31, 2022 · Punjabi is currently written using two scripts: Shahmukhi (Perso-Arabic based) is utilized by Punjabi Muslims due to Arabic's liturgical status ...
  66. [66]
    [PDF] Arabic Supplement - The Unicode Standard, Version 17.0
    Arabic Supplement. Range: 0750–077F. This file contains an excerpt from the character code tables and list of character names for. The Unicode Standard ...
  67. [67]
  68. [68]
    Digital Printing of Arabic: explaining the problem
    Aug 21, 2017 · Representing Arabic script on a computer has been problematic from the very beginning. In this series, we shall explore some of the solutions offered.
  69. [69]
    Modern Arabic Typography in Digital Design
    Jul 12, 2023 · Explore how Arabic typography has evolved in the digital era, from traditional calligraphy to contemporary font design and web typography.
  70. [70]
    GSUB — Glyph Substitution Table (OpenType 1.9.1) - Microsoft Learn
    May 29, 2024 · The Glyph Substitution (GSUB) table provides data for substitution of glyphs for appropriate rendering of scripts, such as cursively-connecting forms in Arabic ...Missing: solutions | Show results with:solutions
  71. [71]
    Developing OpenType Fonts for Arabic Script - Microsoft Learn
    Jun 9, 2022 · Arabic Typesetting contains three OpenType Layout tables: GSUB (glyph substitution), GPOS (glyph positioning), and GDEF (glyph definition, ...Missing: solutions | Show results with:solutions
  72. [72]
    Why It's So Hard to Design Arabic Typefaces - WIRED
    Oct 23, 2015 · Typeface design has a western-normativity problem: for years, most Arabic typefaces have been designed by Latin-language typographers.
  73. [73]
    The Fight to Preserve the Urdu Script in the Digital World | TIME
    Sep 27, 2023 · Urdu's commonly written form, Nastaliq, is often seen as too complex for use online. Developers and designers are trying to change that.
  74. [74]
    An open source font system for everyone - Google for Developers Blog
    Oct 6, 2016 · The Noto project started as a necessity for Google's Android and Chrome OS operating systems. When we began, we did not realize the enormity of ...
  75. [75]
    than 800 languages in a single typeface: creating Noto for Google
    Oct 5, 2016 · A typeface five years in the making, Google Noto spans more than 100 writing systems, 800 languages, and hundreds of thousands of characters.
  76. [76]
    android - Diacritic rendering incorrectly - Stack Overflow
    Jun 19, 2018 · I finally was able to find the culprit it was the text it self which was rendering incorrectly in the android app, I changed the text and it ...Arabic fonts are not displayed well in android - Stack OverflowArabic Diacritics (Harakat) Misaligned in UILabel with Justified Text ...More results from stackoverflow.com
  77. [77]
    Arabic Predictive text - Apple Communities
    Oct 5, 2020 · Thanks for using the Apple Support Communities! We understand you are having an issue using Arabic predictive text on your iPhone. We are happy ...
  78. [78]
    Preserving endangered languages with Noto fonts - The Keyword
    Nov 23, 2020 · Noto offers fonts for 146 scripts (writing systems) and over 800 languages, and is opening the door to reading and writing for minority language speakers.<|separator|>
  79. [79]
    Arabic Script Gap Analysis - W3C
    Oct 24, 2025 · This document describes and prioritises gaps for the support of the Arabic script on the Web and in eBooks. In particular, it is concerned with text layout.<|separator|>
  80. [80]
    [PDF] 18094-arabic-hausa.pdf - Unicode
    Apr 26, 2018 · Title: Proposal to encode additional Arabic script characters for Hausa to the UCS. From: Lorna Priest Evans (SIL International), Andy Warren ...
  81. [81]
    Hausa (ajami) orthography notes - r12a.io
    Two major orthographies are Warsh (a north African variant), and Hafs (much closer to the Arabic used in Egypt and the Gulf). Additionally, the Qur'an and other ...
  82. [82]
    [PDF] Arabic Extended-A - The Unicode Standard, Version 17.0
    Arabic Extended-A. Range: 08A0–08FF. This file contains an excerpt from the character code tables and list of character names for. The Unicode Standard ...
  83. [83]
  84. [84]
    Arabic Layout Task Force Home Page - W3C on GitHub
    This task force does gap analysis and documents requirements, related to the layout and presentation of text in languages that use the Arabic script.
  85. [85]
    Arabic Layout Task Force Charter - W3C
    The mission of this task force is to support the use of the Arabic script by Web standards and technologies, such as HTML, CSS, Mobile Web, Digital ...
  86. [86]
    Arabic numerals - MacTutor History of Mathematics
    It is often claimed that the first Arabic text written to explain the Indian number system was written by al-Khwarizmi. ... Around the middle of the tenth century ...
  87. [87]
    Al-Khwarizmi (790 - 850) - Biography - MacTutor
    Al-Khwarizmi also wrote a treatise on Hindu-Arabic numerals. The Arabic text is lost but a Latin translation, Algoritmi de numero Indorum in English Al- ...
  88. [88]
    Persian Numbers (Farsi Numbers): How to Count 1-10 - Iranopedia
    Persian and Eastern Arabic numerals are both derived from the Hindu-Arabic system, but the Persian set used in Iran has different symbols for digits four, five ...
  89. [89]
    [PDF] Arabic-Manuscripts-a-Vademecum-for-Readers-Handbook-of-Orient ...
    The alphabet in its original form consisted of 17 basic letter-shapes (graphemes), which included a number of identical forms (homographs). In order to ...
  90. [90]
    Script and Art - Brill Reference Works
    The 28 (or 29, if we include the lām ʾalif) letters of the Arabic alphabet, as we know them today, developed from a primitive set of 17 basic letter shapes ...
  91. [91]
    Arabic Alphabet Chart - Sakkal Design
    Dec 26, 2016 · Use consistent pen angle, strokes shape, and proportions throughout. Similar strokes in different letters should look the same. 9. Jeem and ...
  92. [92]
    Arabic Alphabet [with Interactive Chart] - Madinah Arabic
    This article deals with how to learn the Arabic alphabet. It's an important topic for reading Arabic which is one of the four language skills.
  93. [93]
    Origins and Characteristics of the Arabic Alphabet
    The Arabic alphabet consists of eighteen shapes that express twenty-eight phonetic sounds with the help of diacritical marks.
  94. [94]
  95. [95]
    [PDF] Multilingual Fonts for Arabic Script - Longdom Publishing
    Sep 10, 2015 · Only required ligatures (such as lam-alef لا and lam-lam-ha لله (lillah)) are added in Arabic fonts. Inpage does not have proper fonts for ...
  96. [96]
    (PDF) Arabic ligatures: Analysis and application in text recognition
    Feb 7, 2019 · ... Arabic ligatures are optional. I.e. a writer may decide. to either use them or to use the corresponding non-ligature. forms on a case-by-case ...Missing: mandatory scholarly