
Sinitic languages

The Sinitic languages, commonly referred to as the Chinese languages, constitute the primary branch of the Sino-Tibetan language family and are spoken natively by over 1.3 billion people, forming the world's largest cohesive language group. These languages are concentrated primarily in China and neighboring regions, with significant communities globally, and they encompass a continuum of varieties that range from closely related dialects to mutually unintelligible languages.

Sinitic languages are typologically analytic and tonal, employing fixed word order (typically subject-verb-object), aspectual particles, and contextual cues to express grammatical relationships without relying on inflectional morphology. All varieties feature phonemic tone, where pitch variations distinguish lexical items, with tonal systems varying from 2–4 tones in northern varieties to 6–9 or more in southern ones due to historical tonogenesis and regional innovations. Phonologically, they are characterized by monosyllabic or disyllabic morphemes, simple syllable structures (often consonant-vowel or consonant-vowel-nasal), and a lack of complex consonant clusters, though some southern varieties retain more archaic features like initial consonant clusters.

Internally, the Sinitic languages exhibit high diversity, with estimates of 300–400 mutually unintelligible varieties classified into 7–11 major groups based on phonological, lexical, and geographical criteria. The largest group is Mandarin (also known as Northern Chinese), spoken by about 70% of Sinitic speakers and serving as the basis for Standard Chinese (Putonghua); other prominent branches include Wu, Yue (e.g., Cantonese), Min (e.g., Hokkien), Xiang, Gan, and Hakka, along with smaller groups such as Hui. This classification reflects a dialect continuum in which mutual intelligibility decreases with geographic distance, though sharp boundaries exist due to historical migrations and isolation.

Historically, Sinitic languages trace their origins to Proto-Sino-Tibetan, evolving through stages like Old Chinese (ca. 1250–220 BCE), Middle Chinese (ca. 600–1000 CE), and Early Modern Chinese, with innovations driven by internal sound changes and contact with non-Sinitic languages in the Sino-Tibetan family. Reconstruction of earlier forms relies on rhyme dictionaries (e.g., the Qieyun, 601 CE) and inscriptions, revealing a shift from non-tonal to tonal systems via tonogenesis processes shared with other Mainland Southeast Asian languages.

Despite spoken divergence, a shared logographic script using Chinese characters (hanzi) facilitates partial written comprehension across varieties, though vernacular literature and modern standardization favor Mandarin-based forms. Many non-Mandarin varieties lack widespread standardized orthographies beyond characters, contributing to their endangerment amid Mandarin promotion in education and media.

Overview

Definition and scope

The Sinitic languages constitute the primary branch of the Sino-Tibetan language family, comprising all varieties descended from a common ancestor known as Proto-Sinitic. These languages, often collectively referred to as Chinese varieties, are spoken natively by over 1.3 billion people worldwide, making them the largest sub-branch within the family. The scope of Sinitic encompasses major groups such as Mandarin (Northern Chinese), Wu, Yue (including Cantonese), Min (including Hokkien), Xiang, Gan, and Hakka, among others, but explicitly excludes non-Sinitic Sino-Tibetan languages like those in the Tibeto-Burman branch, such as Tibetan and Burmese. As part of the broader Sino-Tibetan family, Sinitic languages share distant genetic ties with Tibeto-Burman varieties but form a distinct branch defined by their unique phonological and syntactic developments from Proto-Sinitic.

The term "Sinitic" emerged in 20th-century Western linguistics to describe this group as a family of languages rather than treating "Chinese" as a monolithic entity, highlighting the mutual unintelligibility among its varieties and their status as separate languages. This usage arose amid growing recognition of linguistic diversity in China, influenced by comparative studies that emphasized genetic relationships and typological distinctions over political or cultural unity. Prior to this, scholars often lumped all varieties under "Chinese," but the adoption of "Sinitic" allowed for precise classification within Indo-European and Sino-Tibetan frameworks.

Sinitic languages are characteristically tonal, with pitch contours on syllables distinguishing lexical meanings, and analytic in structure, relying on word order and particles rather than inflection for grammatical marking. They predominantly follow a subject-verb-object (SVO) word order, which sets them apart from the more common SOV patterns in many Tibeto-Burman languages. These features, inherited and elaborated from Proto-Sinitic, underscore the branch's typological profile as isolating languages with rich tonal systems.

Terminology and nomenclature

The term "Sinitic" derives from the Latin Sinae, referring to the Chinese or China, combined with the suffix -itic to form an adjectival descriptor for languages associated with this region. This etymology traces back through Greek Sῖnai and possibly to the name Qin, reflecting early Western nomenclature for East Asian linguistic entities.

In linguistic scholarship, "Sinitic languages" serves as a neutral academic designation for the branch of the Sino-Tibetan language family encompassing various forms historically linked to China, distinct from the politically motivated framing of a single "Chinese" language promoted by the state to emphasize national unity. The latter framing underscores a single overarching language with regional dialects, aligning with state policies on cultural cohesion, whereas "Sinitic" highlights the independent linguistic status of its members based on structural criteria.

A central controversy surrounds the classification of these varieties as "dialects" versus distinct "languages," a choice driven largely by political considerations rather than by purely linguistic criteria such as mutual intelligibility. For instance, Mandarin and Cantonese exhibit near-zero mutual intelligibility between monolingual speakers, supporting their recognition as separate languages. This linguistic perspective is reflected in international standards: ISO 639-3 assigns separate identifiers to major varieties, such as cmn for Mandarin and yue for Yue (including Cantonese), under the macrolanguage code zho for Chinese, thereby affirming their status as individual languages.
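
The code assignment described above can be pictured as a small lookup table. The following is a minimal sketch rather than an official registry extract: the codes zho, cmn, and yue come from the text, the other entries are further ISO 639-3 codes added for illustration, and the function name `describe` is hypothetical.

```python
# Macrolanguage code named in the text.
MACROLANGUAGE = "zho"

# Individual-language codes. "cmn" and "yue" are named in the text;
# the remaining entries are added here for illustration only.
ISO_639_3 = {
    "cmn": "Mandarin Chinese",
    "yue": "Yue Chinese",
    "wuu": "Wu Chinese",
    "nan": "Min Nan Chinese",
    "hak": "Hakka Chinese",
    "gan": "Gan Chinese",
    "hsn": "Xiang Chinese",
}


def describe(code: str) -> str:
    """Return a short label for a code, or flag it as missing from this table."""
    if code == MACROLANGUAGE:
        return "zho: Chinese (macrolanguage)"
    name = ISO_639_3.get(code)
    if name is None:
        return f"{code}: not in this illustrative table"
    return f"{code}: {name} (individual language under {MACROLANGUAGE})"


if __name__ == "__main__":
    for code in ("zho", "cmn", "yue"):
        print(describe(code))
```

The table is deliberately partial; it only shows that each variety carries its own identifier while remaining grouped under the single macrolanguage code.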

Historical development

Origins in Proto-Sinitic

Proto-Sinitic, the reconstructed ancestral language of the Sinitic branch of Sino-Tibetan, is estimated to have been spoken around 1250 BCE in the context of early developments along the Yellow River. Its reconstruction relies on the comparative method, drawing primarily from the phonological data in Middle Chinese rime tables and dictionaries, such as the Qieyun (601 CE), combined with evidence from modern Sinitic dialects and inscriptions. Scholars like William H. Baxter and Laurent Sagart have advanced this work by proposing a systematic reconstruction that accounts for rhyme categories and initial consonants, allowing backward projection to the proto-stage before significant sound changes occurred.

Key phonological features of Proto-Sinitic include the absence of lexical tones, which developed later during the transition to Middle Chinese due to the loss of final consonants. The language consisted of monosyllabic roots, each typically carrying a single morpheme, a trait that persists in descendant Sinitic languages and distinguishes them from more polysyllabic Sino-Tibetan relatives. Syllable structure was relatively simple, generally following a consonant-vowel (CV) or consonant-vowel-consonant (CVC) pattern, with possible prefixal elements but no complex clusters in the onset or coda.

The earliest direct evidence for Sinitic languages emerges from the Shang dynasty oracle bone inscriptions, dated to circa 1200 BCE, which represent an early form of logographic writing used for divination on animal bones and turtle shells. These inscriptions, numbering over 150,000 fragments, include vocabulary related to rituals, astronomy, and administration, providing glimpses of a language already distinct from contemporaneous non-Sinitic tongues in the region. Proto-Sinitic likely expanded in the Yellow River valley amid interactions with non-Sinitic substrates, such as pre-existing populations speaking possibly Austroasiatic or Hmong-Mien languages, which may have contributed loanwords and influenced early lexical development.

Evolution from Old to Middle Chinese

Old Chinese, spanning approximately 1250 to 200 BCE, represents the earliest well-attested stage of the Sinitic languages, primarily evidenced through the rhyming patterns in the Shijing (Classic of Poetry), a collection of ancient poems compiled around the 6th century BCE. Reconstructions of Old Chinese phonology, notably by Baxter and Sagart (2014), posit a system with post-glottalized initials such as *p', *t', and *k', alongside a rich inventory of initial consonant clusters and no lexical tones; instead, pitch variations were likely prosodic rather than phonemic. These features distinguished Old Chinese from later stages, with the glottalization contributing to subsequent patterns without altering core structure.

The evolution to Middle Chinese, roughly 200 to 900 CE, marked a pivotal transformation driven by the simplification of syllable codas and the emergence of tonality. Some Old Chinese final consonants, such as the fricative *-s, were progressively lost, leaving compensatory pitch contours, while the stops *-p, *-t, and *-k survived as the mark of the entering tone; together these developments produced the four tones of Middle Chinese: level (píngshēng), rising (shǎngshēng), departing (qùshēng), and entering (rùshēng). In the widely cited account of this tonogenesis, syllables ending in a vowel or nasal yielded the level tone, a final glottal stop yielded the rising tone, final *-s yielded the departing tone, and the entering tone preserved the brevity of stop-final syllables. The Qieyun rhyme dictionary, compiled in 601 CE under Lu Fayan, provides the primary documentation of this system, categorizing syllables into 193 rhymes and using the fanqie method to indicate pronunciations.

Notable sound shifts underscore this transition, such as the regular correspondence of Old Chinese aspirated stops like *kʰ to Middle Chinese kh, maintaining aspiration while other initials simplified; for instance, *kʰaŋ (high) evolves to kʰaŋ without further change in the initial. A later split of the four tones, conditioned by the voicing of the initial consonant, further diversified the system.
Buddhist scriptures, translated from Indic languages starting in the 2nd century CE, significantly aided documentation by necessitating precise phonetic glosses; these translations employed early phonetic notations and introduced loanwords that highlighted tonal contrasts, informing the phonological analyses in works like the Qieyun.

Regional divergences in Sinitic speech emerged during the Han dynasty (206 BCE–220 CE), as migrations southward and westward, prompted by imperial expansion, warfare, and economic pressures, exposed northern varieties to substrate influences from non-Sinitic languages in the river basins of the south and west. These population movements, involving millions of settlers, initiated subtle phonetic variations, such as differential treatment of initials and tones across emerging regional norms, setting the stage for later dialectal branching without yet fully fragmenting the literary standard.
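
The coda-to-tone correspondences discussed in this section can be sketched as a simple classifier. This is a simplified illustration, not a reconstruction tool: it assumes the commonly cited account in which a final glottal stop gives the rising tone, final *-s the departing tone, a final stop the entering tone, and everything else the level tone. The function name and the schematic syllables such as `*kak` are hypothetical and are not attested forms.

```python
# Stop codas that mark the entering tone in this simplified account.
STOP_CODAS = ("p", "t", "k")


def middle_chinese_tone(syllable: str) -> str:
    """Map a schematic Old Chinese syllable to a Middle Chinese tone category.

    Only the final segment is inspected; a leading '*' is ignored.
    """
    s = syllable.lstrip("*")
    if s.endswith("s"):
        # Final *-s (including clusters such as *-ks) -> departing tone.
        return "departing (qùshēng)"
    if s.endswith("ʔ"):
        # Final glottal stop -> rising tone.
        return "rising (shǎngshēng)"
    if s.endswith(STOP_CODAS):
        # Surviving stop coda -> entering tone.
        return "entering (rùshēng)"
    # Vowel or nasal ending -> level tone.
    return "level (píngshēng)"


if __name__ == "__main__":
    for form in ("*ka", "*kaŋ", "*kaʔ", "*kas", "*kak"):
        print(form, "->", middle_chinese_tone(form))
```

Checking for *-s before the stop codas means a schematic cluster like `*kaks` is treated as departing rather than entering, which is the design choice this sketch makes for such cases.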

Modern divergence and standardization

Divergence among Sinitic varieties intensified during the 19th and 20th centuries, especially around the fall of the Qing dynasty in 1911, driven by rapid urbanization and extensive population migrations triggered by wars, economic shifts, and colonial influences. These movements, including those set off by the Taiping Rebellion (1850–1864) and subsequent labor migrations to urban centers and overseas, resulted in increased contact in some areas but also fostered the emergence of regionally distinct urban speech forms, as speakers adapted local varieties to new social contexts without a unifying standard. For instance, in southern cities where Yue and Wu were spoken, urbanization reinforced local features amid influxes of northern migrants, exacerbating phonological and lexical differences from northern varieties.

In the Republican era (1912–1949), efforts to address this growing linguistic fragmentation culminated in the Baihua movement, launched as part of the New Culture Movement, which advocated replacing classical wenyan with vernacular baihua to modernize literature and communication. This initiative standardized the written language primarily on the basis of northern Mandarin, promoting it as guoyu (national language) through school curricula, publications, and radio broadcasts, though implementation varied regionally due to political instability. The movement marked a pivotal shift toward phonological norms based on northern speech, influencing literary and administrative language across China.

After the establishment of the People's Republic of China in 1949, state policies further centralized standardization to counter dialectal divergence and support national unity. The first national conference on script reform in October 1955 designated Putonghua, based on Beijing phonology, northern grammar, and modern vernacular vocabulary, as the official national standard, with mandatory use in education, media, and government by 1956. Complementing this, the State Council promulgated the Scheme for Simplifying Chinese Characters in January 1956, reducing stroke counts for over 2,000 characters to boost literacy rates among speakers of diverse varieties.
These measures built on Republican-era foundations by prioritizing Mandarin as a lingua franca, though regional varieties persisted in informal domains.

Divergence metrics underscore the deep historical separation among major Sinitic branches; for example, lexical similarity between two of the major branches has been put at approximately 24%, reflecting a split estimated around 2,000 years ago. This ancient bifurcation, combined with 20th-century sociopolitical factors, highlights ongoing challenges in standardization efforts.

Demographics and distribution

Global speaker population

The Sinitic languages collectively have approximately 1.3 billion native speakers worldwide as of 2025, representing the largest language group by native speaker population. This figure encompasses all major varieties spoken in China and in diaspora communities. Among these, Mandarin varieties account for the largest share, with about 990 million native speakers. The speaker base has grown steadily due to historically high birth rates in China, which have sustained a population where over 90% are native Sinitic speakers, combined with expansion in diaspora communities estimated at around 50 million individuals maintaining these languages.

Breakdowns by major varieties highlight their relative scales: Yue (including Cantonese) has roughly 85 million native speakers as of 2025, Wu about 83 million, and Min (including Hokkien) around 75 million. These groups, alongside smaller varieties like Hakka, contribute to the family's dominance. Additionally, non-native speakers bolster the total reach, adding an estimated 200 million individuals who use Mandarin as a second language, largely driven by China's national education policies promoting it as the standard tongue across diverse linguistic regions.
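
The relative scales quoted above can be checked with a few lines of arithmetic. The sketch below uses only the figures in this section (in millions); the labels Yue, Wu, and Min are matched to the 85, 83, and 75 million figures on the basis of the per-branch sections of this article, and the helper name `shares` is hypothetical.

```python
# Native-speaker estimates in millions, as quoted in this section.
TOTAL_MILLIONS = 1300
SPEAKERS_MILLIONS = {"Mandarin": 990, "Yue": 85, "Wu": 83, "Min": 75}


def shares(counts: dict, total: int) -> dict:
    """Return each group's share of the total as a percentage (one decimal)."""
    return {name: round(100 * n / total, 1) for name, n in counts.items()}


# Speakers covered by the four listed groups, and what is left over
# for Hakka, Gan, Xiang, and the other smaller groups.
listed = sum(SPEAKERS_MILLIONS.values())
remainder = TOTAL_MILLIONS - listed

if __name__ == "__main__":
    for name, pct in shares(SPEAKERS_MILLIONS, TOTAL_MILLIONS).items():
        print(f"{name}: {pct}%")
    print(f"listed groups: {listed} million, remainder: {remainder} million")
```

On these figures the four listed groups account for 1,233 million speakers, leaving about 67 million for all remaining groups, which is lower than the per-branch estimates given elsewhere in the article and shows that the quoted totals are approximations from different sources.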

Regional variations and migration

The Sinitic languages display distinct regional distributions across China, shaped by historical settlement patterns and geographical barriers. Mandarin varieties dominate in the north and extend into the southwest, where they serve as the primary language for over 950 million speakers. In contrast, Yue varieties, including Cantonese, are concentrated in the southern provinces of Guangdong and Guangxi, as well as in Hong Kong and Macau, with around 70 million speakers maintaining these forms in daily communication. Min varieties prevail in southeastern coastal areas, particularly Fujian province and Taiwan, where subgroups like Hokkien support approximately 75 million speakers in local contexts.

Historical migrations have extended these regional varieties into diaspora communities, profoundly influencing global linguistic landscapes. From the mid-19th century onward, labor migrations driven by economic opportunities and domestic upheavals propelled millions of Chinese overseas, primarily from southern China. In Southeast Asia, migrants from Fujian established vibrant Hokkien-speaking communities, where Hokkien remains a key home language (spoken in about 11% of ethnic Chinese households as of a 2020 census). Similarly, 19th-century laborers from Guangdong carried Yue varieties to North America, shaping Chinatowns with Taishanese dialects that persist in older generations. These movements also reached other regions on a smaller scale, fostering pockets of Sinitic language use in port cities through subsequent waves tied to colonial trade.

Within China, rapid urbanization since the late 20th century has accelerated a shift toward Mandarin dominance in cities, diminishing the everyday role of traditional varieties. As rural populations migrate to urban centers for employment, Mandarin facilitates integration into diverse workforces and education systems, leading to intergenerational language changes.
Data from the 2020 national census indicate that Mandarin is spoken by 80.72% of the population overall, with usage exceeding this rate in urban areas (home to over 900 million residents) due to its role as the standardized medium for professional and social interactions. By 2023, the urbanization rate had risen to 66.16%, with over 940 million urban residents, further reinforcing Mandarin as the primary language among city dwellers.

Varieties and classification

Bai varieties

The Bai languages form a small group spoken primarily by the Bai ethnic group in northwest Yunnan, China, with an estimated 1–2 million speakers as of 2023. These languages are concentrated around the Erhai Lake region, including areas in Dali and surrounding counties. The group encompasses several closely related varieties, with the main dialects being Jianchuan (also known as Central Bai), Dali (Southern Bai), and Bijiang (Northern Bai, sometimes including the Lemo subdialect). Jianchuan and Dali dialects are the most prominent, serving as prestige forms within their respective subregions, while Bijiang shows greater divergence due to geographic isolation in the Nujiang Valley.

Bai varieties exhibit distinctive phonological and lexical features that set them apart from core Sinitic languages. Their tonal systems typically feature 6 to 8 tones, varying by dialect; for instance, Jianchuan Bai has seven tones, including level, rising, falling, and checked contours, often with modal and non-modal voice qualities distinguishing them. Lexically, Bai retains conservative elements traceable to Old Chinese, such as archaic pronunciations and vocabulary items like the word for "red" (preserved as *t-qʰrAk > chì in Old Chinese parallels) and terms for natural phenomena like "sky" and "wind," which align closely with northwestern forms. These retentions reflect historical layers of contact and substrate influence in the region.

As of 2024, the affiliation of Bai within the Sino-Tibetan family remains highly debated among linguists. Proponents of its Sinitic classification point to extensive lexical and phonological correspondences with Old Western Chinese, suggesting it diverged early from a northwestern Sinitic branch. However, others argue it constitutes a separate Sino-Tibetan branch, heavily influenced by Tibeto-Burman languages, particularly Qiangic and Loloish subgroups, due to substrate effects and the presence of non-Sinitic morphological traits in its core vocabulary (e.g., up to 40% non-Chinese roots in basic lists).
This view is supported by analyses showing stratified Chinese borrowings overlaying a Tibeto-Burman base, complicating straightforward Sinitic assignment.

Mandarin varieties

Mandarin varieties form the largest branch of the Sinitic languages, numerically and geographically, spoken by approximately 920 million native speakers and covering about 70% of China's territory. These varieties are mutually intelligible to a significant degree and serve as the foundation for Putonghua, the modern standard form of Chinese. This dominance stems from historical migrations and administrative policies that promoted northern speech forms during the Ming and Qing dynasties.

The classification of Mandarin varieties typically divides them into several subgroups based on phonological and lexical differences. The Beijing and Northeastern subgroup, centered in Beijing and the northeast (including Liaoning, Jilin, and Heilongjiang provinces), forms the core of standard Putonghua, with its pronunciation serving as the norm for the standard language. The Southwestern subgroup, spoken across the southwest, exhibits variations in tone realization and vocabulary influenced by local geography. The Jin subgroup, primarily in Shanxi province, is sometimes classified separately but shares key Mandarin traits, including innovative tone patterns from ancient entering tones. Other notable subgroups include Jilu (in Hebei and Shandong) and Jiaoliao (in Shandong and Liaoning), which feature transitional phonologies between northeastern and central forms. These subgroups, while mutually intelligible, show regional accents and lexical items that reflect historical settlement patterns.

Phonologically, Mandarin varieties are distinguished by a four-tone system (high level, rising, low dipping, and falling) resulting from mergers among the historical tone categories of Middle Chinese, with the ancient entering tone distributed among the other categories. A hallmark feature is the lack of word-final stop consonants (-p, -t, -k), which were lost in northern varieties between the 12th and 16th centuries, leaving syllables that end only in vowels or nasals.
Additionally, erhua (儿化), the retroflex suffix -r derived from the diminutive particle ér (儿), is prevalent in northern subgroups, adding r-coloring to finals for expressive or diminutive effect, as in huār (花儿) for "flower." These features contribute to the relative phonological uniformity of Mandarin varieties, facilitating comprehension despite regional variations.

Mandarin's central role in language unification was formalized in 1955, when the government of the People's Republic of China designated Putonghua as the standard language, based primarily on the Beijing dialect's phonology and on northern Mandarin grammar and vocabulary. This policy, aimed at promoting national cohesion, has since elevated Mandarin varieties as the medium of education, media, and government, reinforcing their influence over other Sinitic branches.

Wu varieties

The Wu varieties, also known as Wu Chinese, form a major branch of the Sinitic languages spoken primarily in the lower Yangtze River region, including Shanghai, southern Jiangsu, northern Zhejiang, and parts of neighboring provinces. With approximately 82 million native speakers as of 2023, Wu constitutes one of the largest Sinitic groups, concentrated in urban centers such as Shanghai and in rural areas of the region. These varieties are characterized by their retention of archaic phonological elements from Middle Chinese, distinguishing them from more innovative branches like Mandarin.

Wu is broadly divided into Northern and Southern subgroups, with Northern Wu encompassing the dialects spoken around the Taihu Lake basin, while Southern Wu includes the Oujiang subgroup, prominently represented by the Wenzhou dialect in southern Zhejiang. Northern varieties tend to show greater mutual intelligibility among themselves, whereas Southern ones, like Wenzhounese, exhibit significant divergence even within Wu.

A hallmark of Wu phonology is its rich tonal system, typically featuring 5 to 8 tones depending on the variety, with complex tone sandhi rules that alter contours in connected speech. Unlike many northern Sinitic languages, Wu preserves the entering tone as a distinct category, often realized as syllables ending in a glottal stop or short vowels, and maintains voiced initials (e.g., /b/, /d/, /ɡ/) that have devoiced in other branches. These features contribute to a phonemic inventory that is conservative yet diverse, with voiced obstruents influencing register splits into yin (higher) and yang (lower) categories.

The Wenzhou dialect within Southern Wu is particularly noted for its extreme phonological complexity and low intelligibility with other Sinitic varieties, earning it a reputation as a "cryptologic" language historically used by merchants for secretive dealings to exclude outsiders. Culturally, Wu varieties underpin traditional arts such as Pingtan, a performance art combining storytelling and ballad-singing in the Suzhou dialect, which preserves folklore and literary traditions.
They also form the basis for a body of vernacular literature, including collections like the Sanyan by Feng Menglong, which reflect the social and moral themes of the Wu-speaking regions during the Ming dynasty.

Yue varieties

The Yue varieties, also known as Yue Chinese, constitute a major branch of the Sinitic language family, primarily spoken in the southern Chinese provinces of Guangdong and Guangxi and in neighboring areas, as well as in diaspora communities worldwide. This group encompasses several subgroups, with the most prominent being Cantonese (or Yuehai, centered in Guangzhou and Hong Kong), Taishanese (or Hoisan, from the Siyi region), and Gaoyang (a transitional variety in western Guangdong). Collectively, Yue varieties are spoken by approximately 86 million native speakers as of 2023, making them one of the largest Sinitic subgroups by native speaker count. These languages are characterized by mutual intelligibility within core subgroups but significant divergence across broader dialects, influenced by regional geography and historical migration.

Linguistically, Yue varieties are distinguished by their rich tonal systems, typically featuring 6 to 9 tones depending on the specific dialect, which evolved from Middle Chinese through a process of tone splitting and merger distinct from northern Sinitic branches. They retain three stop finals (-p, -t, -k) in codas, a conservative trait lost in many other modern Sinitic languages, allowing for closed syllables that contribute to their rhythmic complexity. Additionally, Yue employs elaborate suffixes, such as -zai, -lo, and -zi, which add nuanced expressiveness to nouns and verbs, reflecting a high degree of morphological innovation compared to more analytic Sinitic varieties. These features underscore Yue's role as a conservative southern branch that preserves archaic phonetic elements while developing unique lexical and syntactic patterns.

Yue's global prominence stems from 19th-century emigration waves, particularly during the Gold Rush and British colonial expansions, which carried speakers to many parts of the world and established it as the most widely spoken Sinitic variety in the diaspora. Today, vibrant diaspora communities maintain Yue through family and cultural networks, often blending it with local languages.
Furthermore, the influence of Hong Kong cinema and popular music since the mid-20th century has amplified Cantonese's reach, popularizing its idioms and songs in films and television that circulate internationally and fostering a sense of shared identity among global speakers.

Min varieties

The Min varieties constitute one of the most diverse and internally fragmented branches of the Sinitic languages, encompassing several major subgroups that exhibit significant phonological and lexical differences. The primary subgroups include Southern Min (also known as Min Nan or Hokkien/Taiwanese), Eastern Min (Min Dong), and Northern Min (Min Bei), along with smaller divisions such as Central Min and Puxian Min. These varieties are spoken by approximately 76 million native speakers as of 2023, primarily in southeastern China. The dialects within these subgroups are highly mutually unintelligible, with speakers of one variety often unable to comprehend others without prior exposure, reflecting the branch's deep internal fragmentation. For instance, speakers of a Southern Min variety may struggle to understand Eastern or Northern Min varieties.

Phonologically, Min varieties are characterized by complex tone systems typically ranging from 5 to 7 tones, which arose from the early loss of stop codas in finals, a feature distinguishing them from many other Sinitic branches that retain such closures. Instead, Min languages preserve nasal codas like -m, -n, and -ŋ, contributing to their distinctive sound profiles and further complicating intelligibility across subgroups. This tonal and syllable structure underscores the varieties' retention of pre-Middle Chinese elements, setting them apart in the broader Sinitic family.

The Min branch represents the earliest divergence among Sinitic languages, with Proto-Min splitting from the rest of Sinitic around 2,500 years ago, well before the Middle Chinese period. This ancient separation is evidenced by substrate influences from pre-Han languages spoken in southern China, including lexical borrowings that persist in Min vocabularies. As a result, Min varieties maintain conservative traits not found in northern Sinitic branches.

Hokkien, the most prominent Min Nan variety, has played a significant role in overseas communities due to historical maritime trade.
During the Ming dynasty (1368–1644), merchants from Fujian established trading networks that facilitated migration to Southeast Asia, including the Philippines, where they bartered goods with indigenous groups via southern routes as early as the 13th century, with intensified activity after the 1550s. Similar trade links extended elsewhere in the region, contributing to enduring Hokkien-speaking populations there. Min varieties are concentrated in Fujian province and Taiwan, with migrations shaping their global distribution.

Hakka varieties

Hakka varieties, spoken by approximately 44 million native speakers worldwide as of 2023, are primarily concentrated in southern provinces of China such as Guangdong, Fujian, and Jiangxi, as well as in Taiwan. These varieties form a distinct branch of the Sinitic languages, characterized by their relative homogeneity compared to other Sinitic groups, owing to the shared migratory history of Hakka speakers.

The major subgroups of Hakka include the Meixian (also known as Jiaying) dialect, which serves as the prestige or standard form and is centered in Meizhou, Guangdong, and the Dabu dialect, prominent in northeastern Guangdong and among migrant communities in Taiwan. Other notable subgroups encompass Hailu, Sixian, and Raoping, with Meixian and Dabu together representing a significant portion of speakers, particularly in Taiwan, where Dabu influences local varieties. These subgroups exhibit minor phonological and lexical differences but remain mutually intelligible, facilitating communication across Hakka-speaking regions.

Linguistically, Hakka varieties are distinguished by their six-tone system, a feature shared by the majority of dialects, which contrasts with the more varied tonal profiles in neighboring Sinitic languages. They retain conservative initial consonants, including the preservation of the velar nasal *ŋ- (ng-) in words like ngai "I," reflecting archaic phonology more closely than many southern varieties. The vocabulary of Hakka also bears traces of northern origins, incorporating terms and expressions from earlier migrations, such as agricultural lexicon that differs from southern Sinitic norms.

Hakka migrations, particularly those in the 19th century driven by economic opportunities and social unrest in southern China, established vibrant communities overseas, where Hakka speakers were often designated as "guest people" (Hakka) by local populations.
These migrations contributed to intergroup tensions, exemplified by the Punti-Hakka Clan Wars (1855–1867) in Guangdong, which arose from land disputes and cultural differences between Hakka newcomers and established Punti residents.

The resilience of Hakka varieties is closely tied to the community's strong clan structures, which have historically fostered communal living in fortified dwellings and cultural transmission practices that prioritize language maintenance across generations. These social organizations continue to support Hakka linguistic vitality, even in diaspora settings, by reinforcing identity through festivals and family-based language use.

Gan varieties

Gan varieties, collectively known as Gan Chinese, constitute a major branch of the Sinitic languages spoken primarily in Jiangxi Province and adjacent regions of southeastern China, including parts of neighboring provinces. With an estimated 22 million native speakers as of 2023, Gan is the dominant linguistic group in Jiangxi, where around 29 million individuals use it when second-language speakers are included. The varieties are geographically concentrated in central and northern Jiangxi, reflecting historical migrations and settlements that have shaped their distribution.

Key subgroups within Gan include the Nanchang variety, centered in the provincial capital, and the Yichun variety, spoken in the western part of Jiangxi. These subgroups exhibit internal diversity, with Nanchang Gan representing a more standardized form influenced by urban development, while Yichun Gan preserves more conservative rural traits.

Phonologically, Gan varieties are characterized by 5 to 7 tones in most cases, though some dialects display up to 10 distinct tones due to historical tone splits from Middle Chinese. They feature a hybrid profile, with Mandarin-like initials such as the retroflex series (/ʈ/, /ʈʂ/, /ʂ/) and a relatively full set of stops, combined with southern finals that retain Middle Chinese codas like -p, -t, and -k in certain environments.

In Sinitic classification, Gan occupies a transitional position between northern and southern branches, often bridging Mandarin to the north with Xiang and other southern groups through shared innovations and retentions. This intermediary role is evident in its moderate mutual intelligibility with neighboring groups, driven by substantial lexical overlap estimated at around 60–70% for core vocabulary. Culturally, Gan varieties underpin traditional Gan opera, a performative form originating in Jiangxi that integrates music, dialogue, and dance, recognized as part of China's national intangible cultural heritage since 2008.

Xiang varieties

The Xiang varieties, spoken primarily in Hunan province in south-central China, form one of the major subgroups of the Sinitic languages and are estimated to have around 37 million native speakers as of 2023. These varieties exhibit significant internal diversity, reflecting layers of historical development influenced by neighboring Sinitic groups, while maintaining distinct phonological characteristics. The primary areas of concentration are the central and western parts of Hunan, with extensions into adjacent provinces. Xiang varieties are conventionally divided into two main subgroups: New Xiang and Old Xiang. New Xiang, centered around Changsha (the provincial capital), represents the more innovative branch, with dialects showing substantial convergence with neighboring Mandarin varieties. Old Xiang, spoken in parts of Hunan such as Loudi and Xiangxiang, preserves more conservative features and is less affected by external influences. This subgrouping, originally proposed by linguist Yuan Jiahua, is based on differences in initial consonant voicing and other phonological traits, with New Xiang having largely lost the voiced initials typical of earlier Sinitic stages. Phonologically, Xiang varieties are characterized by 5 to 6 tones, often split into upper (yin) and lower (yang) registers that trace back to the voicing distinction in syllable initials: voiceless initials yield higher-pitched tones and voiced initials lower-pitched ones. A key conservative feature is the retention of checked (entering) tones, which appear as short, abrupt syllables typically ending in a glottal stop or unreleased stop, distinguishing them from the tone mergers seen in Mandarin. For example, in the Changsha dialect of New Xiang, the tone system includes six tones, with the entering tone realized as a mid-rising but clipped contour.
Old Xiang dialects, such as Xiangxiang, may merge some tones but still uphold the register split and checked syllables more robustly than their New Xiang counterparts. Compared to New Xiang, Old Xiang conserves more ancient phonological elements, including partial retention of voiced stops and nasals in initials, which contribute to its relative resistance to Mandarinization; in contrast, New Xiang displays devoicing and aspiration patterns akin to Mandarin, reflecting prolonged contact in the northern Hunan basin. This layered conservatism highlights Xiang's position as a transitional group between northern and southern Sinitic varieties, with Old Xiang embodying deeper historical strata from pre-Ming migrations. A prominent historical figure associated with Xiang is , who was a native speaker of the Changsha dialect in New Xiang.

Hui varieties

The Hui varieties, also known as Huizhou, form a distinct group of Sinitic languages spoken primarily in southern Anhui Province in eastern China, with some extension into adjacent areas of Zhejiang and Jiangxi provinces. These varieties are estimated to have around 5 million native speakers as of 2023, concentrated in the historical Huizhou region. The Language Atlas of China classifies the Hui group as an independent branch of Sinitic languages, divided into five main subgroups: Ji–She, Tun–Xi, Yi–Jing, Dong–Qian, and Jing–De. Prominent among these are the Shexian and Tunxi subgroups, which represent key dialectal centers. Phonologically, Hui varieties are characterized by 6 to 8 tones, typically including a glottalized entering tone that is often weakened in modern speech. They exhibit mergers of several Middle Chinese distinctions, particularly in consonants and rhyme categories, alongside retention of Wu-like voiced initials. While frequently affiliated with Wu varieties due to shared phonological traits such as initial voicing, Hui also shows outside influence in areas like tone splits and consonant developments, which gives it a transitional character. Mutual intelligibility with Standard Mandarin remains low, reflecting significant lexical and phonological divergence. The Hui varieties are closely tied to the region's Huizhou merchant culture, a historically influential network of traders from the Ming and Qing dynasties that shaped the local economy, architecture, and social structures through commerce and Confucian values.

Pinghua and other minor varieties

Pinghua varieties are spoken primarily in the Guangxi Zhuang Autonomous Region of southern China, where they function as trade languages in multi-ethnic areas alongside Zhuang and other local tongues. These varieties are divided into Northern Pinghua (Guibei) and Southern Pinghua (Guinan), which are not mutually intelligible and exhibit distinct phonological and lexical profiles influenced by contact with surrounding non-Sinitic languages, such as borrowing from Northern Zhuang vocabulary while retaining relatively conservative Sinitic grammatical structures. Spoken by approximately 4 million people, Pinghua represents a fringe branch within Sinitic classifications, sometimes grouped with Yue varieties due to geographic proximity but often treated as a separate entity owing to its unique areal features and uncertain phylogenetic position. Jin varieties, centered in Shanxi Province and extending into adjacent regions, form another major but debatably independent group within the Sinitic family, spoken by approximately 47 million native speakers as of 2023. As of 2024, linguists remain divided on whether Jin constitutes a primary branch separate from Mandarin: some classifications include it within Mandarin, while others treat it independently based on phonological distinctions. Unlike Standard Mandarin, Jin is distinguished by the retention of entering tones (short syllables ending in glottal stops) and by patterns of vowel raising in palatalized contexts, where high front vowels trigger front-raising while non-palatalized ones lead to back-raising. These features highlight Jin's transitional role between northern and southern Sinitic branches, with some dialects showing limited palatalization of velars compared to Mandarin, contributing to the ongoing debate over its inclusion within the Mandarin continuum. Other minor Sinitic varieties include Waxiang, a conservative isolate spoken by around 320,000 people in the remote northwestern mountainous areas of Hunan Province.
Waxiang maintains archaic syntactic and lexical elements, such as polyfunctional comitative markers derived from verbs like 'to follow', setting it apart as an unclassified member of the family amid heavy contact with neighboring Xiang and other varieties. These fringe varieties collectively account for roughly 5–10% of Sinitic linguistic diversity and often face pressure to assimilate into dominant regional forms such as Mandarin.

Internal relationships

Major classification proposals

The traditional classification of Sinitic languages recognizes seven major dialect groups, a framework established by Yuan Jiahua in his 1960 work Hanyu fangyan gaiyao. These groups, namely Mandarin (guānhuà 官话), Wu (Wú 吴), Min (Mǐn 闽), Xiang (Xiāng 湘), Gan (Gàn 赣), Hakka (Kèjiā 客家), and Yue (Yuè 粤), are based primarily on phonological criteria derived from Middle Chinese, such as the treatment of initial and final consonants and tones. This schema has served as the foundation for much subsequent work in Chinese dialectology, emphasizing regional coherence within each group while acknowledging internal diversity. Modern proposals have refined and expanded this classification, incorporating additional subgroups to account for varieties that do not fit neatly into the original seven, resulting in schemes with 10 to 14 primary branches. For instance, the Language Atlas of China (Wurm et al., 1987) delineates 10 main branches, comprising the original seven plus Jin (Jìn 晋), Hui (Huī 徽), and Pinghua (Pínghuà 平话), with further subdivisions based on isoglosses for phonological and lexical features. Jerry Norman, in his 1988 monograph Chinese, organizes these into broader zones, Northern (Mandarin and Jin), Central (Wu, Gan, Xiang, and Hui), and Southern (Min, Hakka, and Yue), while advocating for the recognition of Jin and other transitional varieties as distinct due to their retention of archaic traits like preserved entering tones. Ethnologue similarly lists over a dozen coordinate subgroups under the Chinese macrolanguage, encompassing these expansions and treating varieties like Dungan and Taiwanese as separate entries to reflect global distribution and mutual unintelligibility. Despite these advancements, there remains no consensus on the precise number of primary branches, with scholars proposing anywhere from 9 to 13 based on varying criteria such as shared innovations and borrowing patterns. This variability stems from the continuum-like nature of Sinitic, where boundaries are often gradual rather than discrete.
A particular point of contention involves Macro-Bai, a cluster of languages spoken in Yunnan Province, whose inclusion within Sinitic is debated. Proponents of Sinitic affiliation, such as those examining cognates and syntactic parallels with Sinitic, argue that it represents a conservative branch influenced by local Tibeto-Burman elements. Conversely, others classify it as a distinct Tibeto-Burman offshoot with heavy Sinitic borrowing, citing divergences such as non-Sinitic tone splits and morphology. This debate underscores the challenges in delineating Sinitic boundaries amid historical contact.

Debates on northern vs. southern branches

The classification of Sinitic languages into northern and southern branches has been a central debate in Chinese dialectology, reflecting deep typological and historical divergences within the family. Northern varieties, primarily centered around the Yellow River basin, are characterized by fewer tones (typically four or five) and innovative syntactic features, such as a stronger preference for subject-verb-object order and reduced use of classifiers, which align them more closely with neighboring languages to the north. Southern varieties, spoken along the Yangtze River and further south, exhibit more complex tonal systems (often six or more tones) and more conservative phonological structures, preserving ancient initial consonants and final stops that have been lost in the north; these include groups such as Yue and Min. A key controversy surrounds the origins of these north-south differences: whether southern varieties reflect substrate influences from pre-existing non-Sinitic languages, such as Tai-Kadai (also known as Kra-Dai), or result from parallel internal evolution within Sinitic. Proponents of the substrate hypothesis argue that features like elaborate tone splits and head-initial tendencies in southern varieties stem from contact with indigenous languages during Han Chinese migrations southward, evidenced by shared lexical items and phonological patterns, such as the retention of certain syllable finals in southern varieties that mirror Tai structures. In contrast, advocates of parallel evolution contend that these traits arose independently through divergence from a common proto-Sinitic ancestor, driven by geographic isolation and areal pressures rather than direct borrowing, and point to the limited direct evidence of widespread lexical borrowing from substrates. This debate is complicated by transitional central varieties, which blend northern and southern traits and so challenge a strict division.
Lexical similarity often serves as a practical metric in this discussion, with a threshold of around 70% commonly invoked to distinguish northern from southern branches, below which mutual intelligibility diminishes significantly; for instance, comparisons between representative northern (e.g., Beijing Mandarin) and southern varieties yield similarities of only 20-30%, underscoring their separation. Historical migrations of Sinitic speakers southward during periods of dynastic upheaval likely exacerbated these divides, incorporating local substrates without fully erasing proto-Sinitic foundations. Overall, while the northern-southern division provides a useful lens for understanding Sinitic diversity, ongoing typological analyses continue to refine its boundaries, emphasizing gradual transitions over rigid genetic splits.

Quantitative and phylogenetic analyses

Quantitative analyses of Sinitic languages have employed lexicostatistical methods, such as Swadesh lists, to measure lexical similarity and estimate divergence times among varieties. For instance, comparisons using a 200-item Swadesh list report approximately 31% shared vocabulary between some pairs of major varieties, indicating substantial divergence while still reflecting a shared Sinitic origin. These methods, though controversial because they assume uniform vocabulary retention rates, suggest that major Sinitic branches began diverging around 1,500 years ago from a common ancestor, aligning with historical records of dialectal fragmentation. Recent phylogenetic studies have advanced these estimates through Bayesian models that incorporate lexical, phonological, and syntactic data to reconstruct family trees and divergence timelines. A 2019 Bayesian analysis of 50 Sino-Tibetan languages, including multiple Sinitic varieties, dated the family's origin to approximately 7,200 years ago, with Sinitic emerging as a primary branch around 5,900 years ago; within Sinitic, Min varieties are positioned as the earliest diverging group, preserving archaic features. A subsequent 2020 study using expanded datasets estimated the initial divergence between Sinitic and Tibeto-Burman at approximately 8,000 years ago, confirming Sinitic's position near the root of the Sino-Tibetan tree; internal Sinitic divergences, such as those separating Min as a basal branch from other groups, are estimated at around 2,000–3,000 years ago based on linguistic and historical evidence. These models highlight reticulate evolution due to areal contacts, challenging strict tree-based assumptions. Recent interdisciplinary correlations between ancient DNA and linguistics further support a northern origin for Sinitic varieties, linking them to ancient millet farmers in the Yellow River basin. Genomic analyses of ancient remains show that populations associated with millet agriculture (ca. 5,000–3,000 BCE) contributed significantly to the ancestry of present-day Chinese speakers, and this admixture pattern correlates with the southward spread of Sinitic linguistic features. Post-2020 studies, including 2023–2024 research, have refined these findings with evidence of multiple agriculture-driven migrations from northern China, integrating linguistic phylogenies with evidence from other disciplines to explain Sino-Tibetan dispersal, including that of the Sinitic branches. Computational studies using AI-driven methods such as knowledge graphs and embedding models have further refined phonological reconstructions, particularly rhyme correspondences across dialects, yielding divergence estimates of 2,000–3,000 years ago for internal splits among major branches while accounting for influences from non-Sinitic languages.
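The lexicostatistical reasoning described above can be illustrated with the classic glottochronological formula, t = ln(c) / (2 ln(r)), where c is the proportion of shared cognates and r is the assumed retention rate per millennium. The following Python sketch is illustrative only: the function name is hypothetical, and the default rate of 0.805 is the conventional constant for a 200-item list, an assumption here because the studies summarized above do not state which rate they used.

```python
import math


def divergence_millennia(shared_cognates: float, retention_rate: float = 0.805) -> float:
    """Estimate time since divergence, in millennia, from a cognate share.

    Classic glottochronology: t = ln(c) / (2 * ln(r)).
    retention_rate=0.805 is the conventional per-millennium value for a
    200-item list (an assumption, not a figure taken from the article).
    """
    if not 0.0 < shared_cognates <= 1.0:
        raise ValueError("shared_cognates must be in (0, 1]")
    if not 0.0 < retention_rate < 1.0:
        raise ValueError("retention_rate must be in (0, 1)")
    return math.log(shared_cognates) / (2.0 * math.log(retention_rate))


if __name__ == "__main__":
    # A 31% cognate share, the figure quoted in the text above.
    print(f"{divergence_millennia(0.31):.2f} millennia")
```

Because the result depends heavily on the assumed retention rate, and because borrowing between neighboring varieties inflates cognate shares, such figures are best read as rough orders of magnitude, which is one reason the Bayesian approaches described above are generally preferred.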

Linguistic features

Phonological systems

Sinitic languages share a core phonological profile characterized by monosyllabic morphemes and a relatively simple syllable structure inherited from Middle Chinese, typically following the template (C)V(N), where the optional initial consonant (C) is followed by a nucleus (V) and an optional coda (N) limited to nasals or, in certain varieties, stops. Stress and intonation play a minor role compared to tone in distinguishing lexical items. All varieties derive their tonal systems from the four Middle Chinese tones, level (píng), rising (shǎng), departing (qù), and entering (rù), through mergers and splits that yield 4 to 9 tones today, with the entering tone often preserved as a short, checked syllable in southern varieties. A defining phonological evolution across Sinitic languages is the widespread loss of earlier codas, including liquids and fricatives, which simplified syllable endings and contributed to tone development through compensatory mechanisms. However, southern varieties like Yue, Min, and Hakka retain the stop codas -p, -t, and -k from the entering tone, distinguishing them from northern Mandarin, which reduced codas to only the nasals -n and -ŋ. Initials also show variation: northern varieties devoiced voiced obstruents, resulting in aspirated or unaspirated voiceless stops, while southern groups such as Wu preserve voiced initials (e.g., /b/, /d/, /g/), enhancing inventory diversity. Tone inventories differ markedly by branch: Mandarin features a canonical four-tone system (high level, high rising, low dipping, high falling), whereas Min varieties often have seven tones, incorporating splits from the departing tone and preserved entering distinctions. Wu dialects typically exhibit seven or eight tones with complex sandhi rules, and Yue maintains six tones plus entering syllables closed by stops, as in Cantonese, where sap6 (ten) ends in -p and cat1 (seven) ends in -t.
These variations underscore regional conservatism in the south versus simplification in the north, with tone sandhi (contextual tone changes in connected speech) being ubiquitous but varying in scope, from Mandarin's third-tone reduction to more extensive right-dominant patterns in Min. Recent acoustic research in the 2020s has illuminated tone sandhi dynamics in understudied varieties. In one southern Wu variety, a study measured fundamental frequency (F0) trajectories, revealing that sandhi applies progressively across trisyllabic sequences, with rising tones triggering mid-level realizations in preceding syllables, confirming phonological rules through precise durational and height analyses. Similarly, a 2022 acoustic investigation of another variety demonstrated right-dominant sandhi, where the final tone spreads leftward, altering F0 contours in up to 80% of disyllables, with checked tones resisting full sandhi. These findings highlight how acoustic properties reinforce lexical contrasts amid variability.
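The (C)V(N) template and the north-south difference in permitted codas can be expressed as a small pattern check. The Python sketch below is a deliberately simplified illustration: the vowel set, the coda inventories, the one-character initial, and the function names are all assumptions made for the example, and tone is ignored.

```python
import re

# Simplified, illustrative inventories (assumptions, not a full phonology).
VOWELS = "aeiouy"
NORTHERN_CODAS = ("n", "ŋ")                      # Mandarin-type: nasals only
SOUTHERN_CODAS = ("m", "n", "ŋ", "p", "t", "k")  # Yue-type: nasals plus stops


def build_template(codas: tuple[str, ...]) -> re.Pattern[str]:
    """Compile a (C)V(N) pattern: optional one-character initial,
    a nucleus of one or more vowels, and an optional coda."""
    coda_alternatives = "|".join(re.escape(c) for c in codas)
    return re.compile(rf"^[^{VOWELS}]?[{VOWELS}]+(?:{coda_alternatives})?$")


def fits_template(syllable: str, codas: tuple[str, ...]) -> bool:
    """Return True if the toneless syllable matches the template."""
    return build_template(codas).match(syllable) is not None


if __name__ == "__main__":
    for syllable in ("man", "sap", "stra"):
        print(syllable,
              fits_template(syllable, NORTHERN_CODAS),
              fits_template(syllable, SOUTHERN_CODAS))
```

Under these assumptions a stop-final syllable such as Cantonese sap (ten, tone number omitted) is accepted only with the southern coda set, while a syllable with an initial cluster is rejected by both, mirroring the absence of complex onsets noted above.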

Grammatical structures

Sinitic languages are predominantly analytic in their grammatical structure, lacking inflectional morphology for tense, number, case, or gender, and relying instead on word order, particles, and context to convey grammatical relations. They typically follow subject-verb-object (SVO) order in basic clauses, which distinguishes them from the more common SOV patterns in related Sino-Tibetan branches. A hallmark of their syntax is the topic-comment structure, where the topic, often a noun phrase, is fronted to set the frame, followed by a comment providing new information about it, as seen in constructions like "Zhè běn shū, wǒ kàn guò" (This book, I have read) in Mandarin. This organization prioritizes pragmatic prominence over strict subject-predicate alignment, allowing flexible word order across varieties. Numeral classifiers are mandatory in Sinitic languages when nouns are quantified or modified by demonstratives, serving to categorize and individuate referents based on shape, function, or other semantic properties. In Mandarin, the general classifier gè is used for humans or abstract items (e.g., yī gè rén, one person), while běn is used for bound volumes such as books (yī běn shū, one book); these classifiers follow numerals or demonstratives in the noun phrase. Variations exist across varieties, such as in Yue (Cantonese), where go3 functions similarly to Mandarin gè but with distinct phonological and syntactic behaviors, and where classifiers can also appear in possessive constructions like ngo5 bun2 syu1 (my book). Classifiers also play a role in definiteness marking in some contexts, a function that evolved from individuation. Aspect in Sinitic languages is marked through postverbal particles rather than verbal inflection, with shared markers across varieties indicating completion or ongoing states. In Mandarin, le signals perfective aspect for completed actions (e.g., tā chī-le fàn, he ate the meal), while zhe denotes continuous or durative aspect (e.g., tā zuò-zhe, he is sitting).
Serialization, or chaining multiple verbs in a single clause without conjunctions, is common for expressing complex events, as in Mandarin qù mǎi shū (go buy book), where the verbs share a subject and aspect. This construction is areal, extending to southern varieties and facilitating compact expression of manner, direction, or purpose. Indirect objects are typically introduced by prepositions, such as gěi for benefactive or dative roles (e.g., wǒ gěi tā shū, I give him a book). In some southern varieties, forms like bei appear in dative contexts or passive constructions, reflecting regional divergence. Demonstratives distinguish proximal (zhè in Mandarin, this) from distal (nà, that), often requiring classifiers for specificity (e.g., zhè běn shū, this book). Southern varieties tend to employ more postpositions for locative and relational functions, such as hai6 (in/at) in post-nominal position, contributing to mixed word-order patterns compared to the preposition-dominant north.

Writing systems and orthographies

The Sinitic languages share the Hanzi writing system, a logographic script that represents morphemes rather than sounds, enabling mutual intelligibility in written form despite spoken differences. Comprehensive dictionaries like the Zhonghua Zihai catalog over 85,000 distinct characters, though literacy typically requires mastery of only 3,000 to 5,000 for reading newspapers and modern texts. Approximately 81% of frequently used characters are semantic-phonetic compounds, featuring a radical that conveys semantic information (e.g., indicating the category of meaning) paired with a phonetic component that hints at pronunciation, a structure that originated in ancient inscriptions and evolved over millennia. This compound design facilitates the script's adaptability across Sinitic varieties, as the characters maintain consistent visual forms while allowing diverse readings. Non-Mandarin Sinitic varieties, while relying on the same Hanzi script, often incorporate romanization systems to capture their unique phonologies for pedagogical, digital, or literary purposes. For Cantonese (Yue), Jyutping, a standardized scheme developed by the Linguistic Society of Hong Kong in 1993, uses Latin letters with tone numbers, representing the six tones that distinguish Cantonese from Mandarin's four. Similarly, Pe̍h-ōe-jī (POJ), a 19th-century romanization pioneered by Protestant missionaries for Hokkien (Southern Min), employs modified Latin letters to transcribe the language's nasalized vowels and tonal contours and was historically used in religious texts. These adaptations highlight the script's flexibility but also underscore the need for supplementary tools, as Hanzi alone does not encode dialect-specific sounds. A notable variation within the Hanzi system arises from the 1956 introduction of simplified characters by the People's Republic of China, which reduced stroke counts in over 2,000 characters to boost literacy rates, contrasting with the traditional forms retained in Taiwan, Hong Kong, and overseas communities.
For instance, the character for "person," 人, is read as rén (second tone) in Mandarin but jan4 in Cantonese, illustrating how the shared script belies profound phonological divergence. This variability in readings preserves written unity but poses challenges for spoken-to-written transcription. The logographic nature of Hanzi mitigates some ambiguities from spoken homophones, which are common in tonal Sinitic varieties, where a single sound may correspond to dozens of characters, but digital input methods for dialects amplify these issues. Early pinyin-based systems for Mandarin required manual selection from homophone lists, and analogous tools for other varieties face similar disambiguation hurdles, often relying on context prediction. Recent advances, such as corpus-based adaptive algorithms, improve accuracy by analyzing surrounding text to resolve homophones in real time, facilitating broader digital expression of non-Mandarin varieties.
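The context-based disambiguation described above can be sketched with a toy bigram model: given the character just typed and the homophone candidates for the next syllable, the candidate that most often followed that character in a corpus is chosen. This Python example is a minimal illustration rather than a description of any real input method; the three-sentence corpus, the candidate list for the syllable shi, and the function names are all hypothetical.

```python
from collections import Counter


def train_bigrams(corpus: list[str]) -> Counter:
    """Count adjacent character pairs in each sentence of the corpus."""
    counts: Counter = Counter()
    for sentence in corpus:
        for left, right in zip(sentence, sentence[1:]):
            counts[(left, right)] += 1
    return counts


def pick_candidate(previous: str, candidates: list[str], counts: Counter) -> str:
    """Choose the candidate seen most often after `previous`.

    On a tie (including no evidence at all), max() keeps the first
    candidate, which acts as a default ordering.
    """
    return max(candidates, key=lambda char: counts[(previous, char)])


# Hypothetical toy data: three sentences and three homophones of "shi".
TOY_CORPUS = ["我是学生", "他有十本书", "这是好事"]
SHI_CANDIDATES = ["是", "十", "事"]

if __name__ == "__main__":
    bigrams = train_bigrams(TOY_CORPUS)
    for previous in ("我", "有", "好"):
        print(previous, "->", pick_candidate(previous, SHI_CANDIDATES, bigrams))
```

Real systems use far larger corpora, longer contexts, and user adaptation, and tools for varieties other than Mandarin additionally need corpora written in those varieties, which are much scarcer.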

Cultural and sociolinguistic aspects

Language policy and endangerment

In the People's Republic of China (PRC), language policies have prioritized Putonghua, the standard form of Mandarin, over other Sinitic varieties since the 1980s through bilingual education initiatives that emphasize Mandarin proficiency in schools while providing limited support for regional languages. The 1982 Constitution explicitly promotes the nationwide use of Putonghua to foster national unity and communication. This approach was reinforced by the Law on the National Common Language and Writing System, adopted in 2000 and in force since 2001, which mandates Putonghua as the primary medium of instruction in educational institutions and requires its use in government, media, and public services to standardize communication across diverse linguistic regions. These policies contribute to the endangerment of many Sinitic varieties, with smaller ones facing significant decline, driven in part by the dominance of Putonghua in formal domains. Many documented Sinitic varieties are considered moribund, spoken primarily by older generations with few or no younger speakers; notable examples include certain peripheral Min dialects in northern Fujian Province, where transmission has nearly ceased. The UNESCO Atlas of the World's Languages in Danger classifies several Sinitic varieties as vulnerable, including some Jin and Min forms, highlighting risks from assimilation into dominant Mandarin norms. Preservation efforts in the diaspora complement domestic ones, with community schools in the United States and other countries offering classes in heritage Sinitic varieties, including Hakka, to maintain cultural ties among immigrant families. In the 2020s, digital archiving projects have advanced documentation, such as the Shanghai Dialect Conversational Speech Corpus for Wu varieties, which provides transcribed audio resources for linguistic analysis, and Taiwan's Hakka Cultural Assets Digital Archives, launched in 2022 to digitize oral traditions and historical materials.
Socially, these dynamics have driven a generational shift, as younger speakers increasingly favor Putonghua for educational and other opportunities, leading to reduced fluency in ancestral varieties among urban youth. This trend exacerbates language shift, particularly in migrant-heavy areas where internal relocation disrupts local language use.

Influence on other languages

Sinitic languages have profoundly influenced the lexical systems of neighboring languages through extensive borrowing of vocabulary, particularly via the adoption of Chinese characters and their associated readings. In Japanese, the kanji script incorporates thousands of Sino-Japanese terms derived from Middle Chinese, forming the basis for much of the formal and technical lexicon, including numbers (e.g., ichi for "one" from 一) and kinship terms (e.g., bo for "mother" from 母). Similarly, Korean borrowings from Chinese contribute significantly to the Sino-Korean vocabulary, which comprises about 60% of the lexicon in historical texts, with examples like numbers (il for "one" from 一) and family relations (mo for "mother" from 母). In Vietnamese, chữ Hán loans form a substantial layer of Sino-Vietnamese words, estimated at around 60-70% of the vocabulary in classical literature, including numerals (native một alongside Sino-Vietnamese nhất from 一) and kinship designations (e.g., mẫu for "mother" from 母). These borrowings, known collectively as Sino-Xenic vocabularies, reflect systematic adaptations of Chinese morphemes across phonological systems while preserving semantic content. Beyond the lexicon, Sinitic languages have exerted structural influence on syntax in contact situations, notably promoting topic-prominent word order in Japanese. Japanese exhibits topic-comment structures marked by particles like wa, a feature shared with Sinitic languages such as Mandarin, where topics are fronted for discourse focus (e.g., "The book, I read it" paralleling Mandarin shū, wǒ kàn-le). This areal convergence likely arose from prolonged literary and cultural contact during the adoption of kanji, enhancing Japanese's predisposition toward topic prominence over strict subject-predicate alignment. In Southeast Asia, southern Sinitic varieties have contributed to the development of numeral classifiers in languages like Thai and Burmese through historical migration and trade.
Thai classifiers (e.g., lǝəw for round objects) parallel those in southern Chinese dialects like Cantonese (go3), suggesting diffusion via contact in the Mekong region; similarly, Burmese employs classifiers (e.g., ta for humans) that align semantically with Sinitic systems, potentially originating from a single proto-classifier innovation in early Sinitic that spread to Tai-Kadai and Tibeto-Burman languages. Sinitic elements also appear in pidgin and creole languages worldwide, blending with European tongues in colonial trade contexts. In Singapore Colloquial English (Singlish), Hokkien, a variety of Sinitic, provides substratal influence, contributing particles like lah for emphasis and lexical items such as kiasu ("fear of losing"), which are integrated into the creole's grammar and vocabulary. Likewise, pidgin English incorporated Sinitic loanwords from Chinese laborers during 19th-century railroad construction, including terms like chop-chop (from jōp-jōp, meaning "quickly") and basic numerals adapted for trade. These pidgins illustrate Sinitic's role in facilitating communication across diverse substrates. Recent linguistic analyses highlight substantial Sinitic effects in Hmong-Mien languages, with studies estimating that 20% or more of their vocabulary derives from loans accumulated over millennia of coexistence in southern China. For instance, Hmong dialects borrow everyday terms (e.g., White Hmong neeg for "person" from rén), reflecting successive layers of borrowing. This lexical integration underscores the asymmetric influence of dominant Sinitic varieties on minority languages in the region. On a global scale, Sinitic terms have permeated international lexicons through modern exchanges. Words like tai chi (from Mandarin tàijíquán, referring to the martial art) and dim sum (from Cantonese dim2 sam1, Mandarin diǎnxīn, denoting small dishes) entered English via 20th-century migration and trade and are now standard in culinary and wellness contexts.
In diplomatic spheres, Mandarin phrases such as hépíng juéqǐ ("peaceful rise") appear in international discourse on China's foreign policy. These adoptions exemplify Sinitic's ongoing expansion beyond its home region.

References

  1. [1]
    The Sino-Tibetan Language Family | STEDT
    ### Summary of Sinitic Languages within Sino-Tibetan Family
  2. [2]
    Chinese Speaking Countries 2025 - World Population Review
    A total of about 1.4 billion people worldwide speak Chinese as their mother tongue, including more than 1.3 billion on the Chinese mainland.
  3. [3]
    Chinese 'dialect' - Sino-Platonic Papers
    Mandarin is not a dialect but a full-fledged language or, ultimately perhaps more accurately, branch of languages within the Sinitic group. Modern Standard ...
  4. [4]
    [PDF] Towards a typology of aspect in Sinitic languages - HAL
    Feb 6, 2023 · could all be described as analytic tonal languages using means other than inflectional morphology to signal grammatical relations. All ...
  5. [5]
  6. [6]
    How Many Languages Are There in China?
    Nov 17, 2022 · Sinitic includes all the languages of China that fall under the designation Hànyǔ 漢語, not just Mandarin, and especially not just Modern ...
  7. [7]
    Language Contact and Language Change in the History of the ...
    This paper looks at the history of the development of the Sinitic branch of the Sino-Tibetan language family from the point of view of population movements and ...
  8. [8]
    Linguistic diversity in Greater Tibet - Language Log
    May 3, 2014 · ... speakers amount to roughly 1.3 billion, yet politically minded partisans insist that there is only a single Sinitic language and that all of ...
  9. [9]
    Dated language phylogenies shed light on the ancestry of Sino ...
    May 6, 2019 · The Sino-Tibetan language family is one of the world's largest and most prominent families, spoken by nearly 1.4 billion people. Despite the ...
  10. [10]
    [PDF] mandarin and other sinitic languages | hal
    Oct 17, 2022 · A second major feature of Sinitic language is their use of in situ interrogatives. This means that the interrogative pronouns for who, what, ...
  11. [11]
    Sinitic languages: A historical and typological overview
    This chapter provides a historical and typological overview of Sinitic languages. It first discusses the terminology related to Chinese, clarifying the meaning ...
  12. [12]
    [PDF] The Classification of Sinitic Languages : What Is “ Chinese ”
    This pathbreaking dictionary places primary emphasis on the sounds and meanings of Sinitic roots and places Old Chinese squarely within the Sino-Tibetan ...
  13. [13]
    Linguistic areas in China for differential object marking, passive, and ...
    Broadly speaking, these languages are tonal and analytic in make-up. While SVO is a common word order in Sinitic, (S)OV is equally common in the Wu and Min ...
  14. [14]
  15. [15]
    (PDF) MANDARIN AND OTHER SINITIC LANGUAGES
    Sinitic languages are tonal, and syllable structure typically follows (C1)V T (C2) format with tones affecting meaning. The classification of dialects includes ...
  16. [16]
  17. [17]
    Sinitic - Wiktionary, the free dictionary
    Etymology. From Latin Sīna (“China”) +‎ -itic, as in Semitic. Pronunciation. IPA: /saɪˈnɪtɪk/, /sɪˈnɪtɪk/. Adjective. Sinitic (comparative more Sinitic ...
  18. [18]
    Sino- - Etymology & Meaning of the Prefix
    Originating from Late Latin Sinæ and Arabic Sin, "Sin-" is a word-forming element meaning "Chinese," derived from the Chinese Qin dynasty name.
  19. [19]
    Sinitic and Tibetic - Language Log
    Mar 30, 2010 · Whether or not Sinitic and Tibetic are genetically related, how can it be that there is only a single Sinitic "language" with 1.2 billion ...
  20. [20]
    Mutual Intelligibility of Sinitic Languages
    Mar 6, 2009 · A monolingual speaker of Cantonese cannot understand a monolingual speaker of Mandarin and vice versa. There is zero intelligibility between the two languages.
  21. [21]
    Languagehood of Cantonese: A Renewed Front in an Old Debate
    So far, mutual intelligibility is the strongest “linguistic” evidence to prove the language status of Cantonese. It is clearly an abstand language (in the ...
  22. [22]
    [PDF] PCC Guidelines for the Use of ISO 639-3 Language Codes in MARC ...
    Jan 12, 2023 · The macrolanguage for Chinese (“zho”) is related to 16 individual languages, including Mandarin Chinese (“cmn”) and Yue Chinese/Cantonese (“yue”) ...
  23. [23]
    Chinese Language (ZHO) - Ethnologue
    Chinese is classified as a “macrolanguage” in the ISO 639 standard and is assigned to [zho] as its three-letter code.
  24. [24]
    RADIOCARBON DATING OF ORACLE BONES OF LATE SHANG ...
    Oct 27, 2020 · Oracle bone script developed into a sophisticated writing system in Shang Dynasty of China more than 3000 years ago. The systematic ...
  25. [25]
    Middle Chinese: The Qièyùn 切韻 (Chapter 3)
    Middle Chinese phonology is mainly based on the information from the phonological system of the Qièyùn 切韻, a rhyme dictionary compiled in 601.
  26. [26]
    [PDF] 1. INTRODUCTION - HAL-SHS
    ... Chinese script, while having the look of a natural language, and evolving into Early Middle Chinese according to natural sound changes. Additional bodies of ...
  27. [27]
    (PDF) Buddhism and Chinese Linguistics - ResearchGate
    Feb 16, 2017 · In this paper, different aspects of the relation between Buddhism and the Buddhist Chinese literature and the linguistic study of Chinese will be at issue.
  28. [28]
    (PDF) Language Contact and Language Change in the History of ...
    This paper looks at the history of the development of the Sinitic branch of the Sino-Tibetan language family from the point of view of population movements and ...
  29. [29]
    [PDF] Vernacular Language Movement - Chinese Studies - Jeffrey Weng
    Oct 28, 2020 · A masterful study of the national language movement, primarily in the republican era. The author makes such skillful use of primary sources ...
  30. [30]
    [PDF] language-planning-in-china.pdf - Center for Applied Linguistics
    At the beginning of 1956, Mandarin Chinese was officially adopted as the common language, Putonghua, for the PRC (Guo, 2004). On September 14, 1998, the ...
  31. [31]
    Reforms in Language and Script in the 1950s - Chinaknowledge
    To come back to the language issue, the official standard language was called Putonghua 普通话. It became compulsory for teaching in schools, state services, ...
  32. [32]
    China's Long Struggle for Linguistic Unification - Global Asia
    The Ministry of Education also decreed in November 1955 that Putonghua would become the medium of instruction for Chinese language and literature courses ...
  33. [33]
    [PDF] Mutual intelligibility of Chinese dialects An experimental approach
    example, French has lexical similarity of about 75% to several other Romance languages. In comparison, Mandarin has 31% lexical similarity with Wu.
  34. [34]
    (PDF) Mutual Intelligibility and Similarity of Chinese Dialects
    Aug 10, 2025 · When scholars discuss mutual intelligibility among Sinitic languages, they typically refer to the spoken form (Tang and van Heuven, 2007; ...
  35. [35]
    Ethnologue | Languages of the world
    Find, read about, and research all 7159 living languages. Ethnologue is the ultimate source of information on the world's languages.
  36. [36]
    Chinese, Mandarin Language (CMN) - Ethnologue
    Mandarin Chinese is an international language whose original homeland was in China. It belongs to the Sino-Tibetan language family and is part of the Chinese ...
  37. [37]
    Chinese Guide: Welcome - LibGuides at Mount Holyoke College
    Jul 29, 2025 · Welcome to the Chinese Language Guide. File:Map of sinitic languages ... The Chinese language and its variations have approximately 1.3 billion ...
  38. [38]
    25 Most Spoken Languages in the World in 2025 | Berlitz
    Mar 27, 2025 · Looking at total speakers, Mandarin is the second most widely spoken language in the world. However, it's the most spoken language in the world ...
  39. [39]
    Chinese, Yue Language (YUE) - Ethnologue
    Yue Chinese is a language of wider communication that originated in China, Hong Kong, and Macao. It belongs to the Sino-Tibetan language family.
  40. [40]
    Chinese, Wu Language (WUU) - Ethnologue
    Wu Chinese is a stable indigenous language of China. It belongs to the Sino-Tibetan language family and is part of the Chinese macrolanguage.
  41. [41]
    Chinese, Min Nan Language (NAN) - Ethnologue
    Min Nan Chinese is a stable indigenous language of China and Taiwan. It belongs to the Sino-Tibetan language family and is part of the Chinese macrolanguage ...
  42. [42]
    What is the most spoken language? | Ethnologue Free
    Mandarin Chinese is the largest language in the world, if you count only native speakers. If you count both native and non-native speakers, English is the ...
  43. [43]
  44. [44]
    [PDF] The Chinese Diaspora: Historical Legacies and Contemporary Trends
    This report examines key demographic elements of the global Chinese diaspora. Although the focus here is on recent levels and trends, the history of Chinese ...
  45. [45]
    Hokkien as a Heritage Language of Citizenry in Singapore
    Aug 7, 2025 · The Singapore Census of Population statistics indicate that 11% of ethnic local Chinese households speak Hokkien. Singaporean television ...
  46. [46]
    Taishan and Chinatown - Language Log
    Feb 5, 2022 · In the late 19th century and early 20th century, a significant amount of Chinese emigration to North America originated from Siyi (Seiyap) ...
  47. [47]
  48. [48]
    China sees rising number of Mandarin speakers
    Jun 2, 2021 · About 80.72 percent of China's population spoke Putonghua, or Mandarin Chinese, in 2020, up 27.66 percentage points from 2000.
  49. [49]
    Communiqué of the Seventh National Population Census (No. 7)
    May 11, 2021 · Among the national population [3], there were 901991162 persons living in urban areas, accounting for 63.89 percent (in 2020, the urbanization ...
  50. [50]
    Jianchuan Bai | Journal of the International Phonetic Association
    Apr 27, 2020 · The Bai language ( ) is spoken by approximately 1.6 million people in northwest Yunnan Province, China. Of the 25 minority languages spoken ...
  51. [51]
    Bai language, alphabet and pronunciation - Omniglot
    Feb 10, 2024 · In 2003 there were 1.3 million Bai speakers. There are three main dialects of Bai: Jianchuan (Central), Dali (Southern), and Bijiang (Northern).
  52. [52]
    [PDF] Bai dialect survey - SIL.org
    Abstract. The People's Republic of China is a nation of fifty-six ethnic groups. The Bai are the fourteenth largest minority group with a total population ...
  53. [53]
    A Musical Language with Typological Ablative Cases - Scirp.org.
    Dec 3, 2015 · Bai is spoken by ethnic people with a population of around two million, who live in the areas around the Erhai Lake in Dali prefecture of ...
  54. [54]
    (PDF) Bai and Old Western Chinese - ResearchGate
    Hence, this paper puts the claim that Bai is akin to Old Western Chinese on a stronger footing.
  55. [55]
    Bai and Old Western Chinese | Bulletin of SOAS | Cambridge Core
    Sep 11, 2024 · The classification of Bai is disputed; it has been claimed to be a Sinitic language (Greenberg Reference Greenberg and Kroeber1953; Benedict ...
  56. [56]
    Ancient Connections of Sinitic - MDPI
    Jul 24, 2023 · The generally-agreed location for the origins of Sinitic is the upper Yellow River valley. In the early Neolithic period corresponding to Proto- ...
  57. [57]
    [PDF] Bai - STEDT
    His general conclusion was that "Bai is a TB language (prob. Loloish) having borrowed heavily from Chinese". By his reckoning, of the 100-word Swadesh list, ...
  58. [58]
    On the Genetic Position of the Bai Language - ResearchGate
    Aug 7, 2025 · In light of some new data from Bai dialects and new hypotheses on the genetic classification of Sino-Tibetan languages, this paper calls ...
  59. [59]
    [PDF] NO LIMITS TO BORROWING: THE CASE OF BAI AND CHINESE
    In this paper we show that the Chinese vocabulary in Bai is stratified, forming successive layers of borrowings.
  60. [60]
    Chinese Language Day and the Diversity of Chinese - Asian Absolute
    Apr 14, 2025 · Explore the linguistic diversity of China with a detailed look at its many spoken languages and dialects, including Mandarin and Cantonese.
  61. [61]
    What are the top 200 most spoken languages? | Ethnologue Free
    The top 3 most spoken languages are English (1.5B), Mandarin Chinese (1.2B), and Hindi (609.1M). The list accounts for over 88% of the world's population.
  62. [62]
    The 10 Most Spoken Languages In The World In 2025 - Babbel
    1.3 Billion Native Speakers. Map of China. Numbers vary widely — Ethnologue puts the number of native speakers at 1.3 billion ...
  63. [63]
    Mandarin Phonological Structure - jstor
    INTRODUCTION. This paper attempts to present the Mandarin phonological system after the generative fashion.1 We find it convenient to treat this part of.
  64. [64]
    What R Mandarin Chinese /ɹ/s? – acoustic and articulatory features ...
    This paper extends the investigation of rhotic sounds by examining the articulatory and acoustic features of Mandarin Chinese rhotics.
  65. [65]
    What Is Mandarin? The Social Project of Language Standardization ...
    In fact, at the start of the republican period, the pronun- ciation of the national language underwent considerable debate, with many factions opposing the use ...
  66. [66]
    Dialect Groups of the Chinese Language - Oxford Bibliographies
    Jan 30, 2014 · The classification widely adopted divides Chinese into seven regional groups: Mandarin (northern half of China and the southwest), Wu (Jiangsu and Zhejiang)
  67. [67]
  68. [68]
    Lili Wu Chinese | Journal of the International Phonetic Association
    Sep 29, 2020 · In summary, the lexical tonal system of Lili Wu Chinese includes two level tones (high–level T1 and low–level T4), one falling tone (high– ...
  69. [69]
    Research on Wu Dialect Recognition and Regional Variations ...
    Wu dialect possesses unique phonological characteristics: its initials exhibit a clear voiced ... features such as voicing contrasts and entering tone ...
  70. [70]
    Lóngyóu tones and tone sandhi | Journal of East Asian Linguistics
    Apr 27, 2024 · More generally, sandhi rules and constraints in Sinitic languages are typically expressed with tonal categories that abstract away from syllable ...
  71. [71]
    The sounds of Chinese - Oxford Academic - Oxford University Press
    voiceless contrast, in other (especially, Northern Wu) varieties Middle Chinese voiced obstruents correspond to murmured (rather than voiced) initials ...
  72. [72]
    [PDF] PHONETIC EXPLANATION FOR INITIAL AND TONAL EVOLUTION ...
    Aug 21, 2011 · This study tries to explore phonetic mechanism of the evolution both for initials and tones taken place in Wu Dialects of Chinese, so that to ...
  73. [73]
    Devilishly difficult "dialect" - Language Log
    Aug 20, 2015 · One major obstacle to the use of Wenzhounese for espionage is that it is not a written language. As they did for many topolects during the late ...
  74. [74]
    Suzhou Pingtan - Culture and Art - Wujiang
    Suzhou Pingtan is a term denoting storytelling and ballad singing in the Suzhou dialect. It also goes by “Shuoshu” or “Nanci”.
  75. [75]
  76. [76]
    [PDF] Southern Min - HAL
    Feb 7, 2023 · The Min group is a very diverse group of dialects whose subdivisions are generally not mutually intelligible. For example, speakers of the ...
  77. [77]
    Old Chinese Medials and Their Sino-Tibetan Origins - Academia.edu
    The aim of this study is to carry out a new comparison between OC and TB, taking into account these advances, in order to refine our understanding of OC, and to ...
  78. [78]
    Phonetic evidence for the nasal coda shift in mandarin - ResearchGate
    Aug 6, 2025 · This paper presents phonetic evidence to resolve the transcription disagreement concerning the syllable-final nasal shift in the variety of ...
  79. [79]
    Old Chinese and Min上古漢語和閩語 - ResearchGate
    Aug 4, 2025 · The Min dialects are known to have split off of from mainstream Chinese before the Middle Chinese period.
  80. [80]
    [PDF] THE AUSTROASIATICS IN ANCIENT SOUTH CHINA
    THE AUSTROASIATICS IN ANCIENT SOUTH CHINA: SOME LEXICAL EVIDENCE. Author(s): Jerry Norman and Tsu-lin Mei. Source: Monumenta Serica , 1976, Vol. 32 (1976), pp.
  81. [81]
    Hokkien Merchants in Early Maritime Asia - ResearchGate
    Aug 7, 2025 · The early Hokkien traders mostly conducted private trade and were part of a regional business network that spanned Korea, Kyushu, Taiwan, and ...
  82. [82]
    Most spoken language | Guinness World Records
    According to Ethnologue.com, the most spoken language in the world in 2019 is Chinese, with 1,311,000,000 first-language speakers, which is listed as an ...
  83. [83]
    How Many Dialects Are There in Chinese? The Ultimate Breakdown
    Nov 12, 2020 · Formed during the Qing Dynasty from 2-140AD as a result of Han Chinese migrating to Jiangxi. Around 43.5 million speakers.
  84. [84]
    THE HAKKA LANGUAGE - Learning SA Chinese - WordPress.com
    Mar 5, 2013 · The Hakka language has as many regional dialects as there are counties with Hakka speakers in the majority. Some of these Hakka dialects are ...
  85. [85]
    Hakka's relation to Cantonese and Mandarin
    Mar 8, 1998 · ng- initials appear in Hakka, but less so in Cantonese, so we can say that Hakka is the more conservative of the three.
  86. [86]
    [PDF] Gan, Hakka and the formation of Chinese dialects1 - HAL-SHS
    The idea that Hakka is a northern dialect transplanted into the south has its roots in Lo Hsiang-lin's 1933 book Kejia Yanjiu Daolun. Lo's intention was to ...
  87. [87]
    [PDF] Guest People: Hakka Identity in China and Abroad - OAPEN Home
    Guest people: Hakka identity in China and abroad / edited by Nicole Constable. p. cm.-(Studies on ethnic groups in China). ISBN 0-295-97469-9 (alk. paper).
  88. [88]
    [PDF] The cultural assimilation of the Hakka Communities in Southeast Asia
    During the middle of the 19th century, the Punti-Hakka Clan War was ignited ... In Guest People: Hakka Identity in China and Abroad (pp. 3 - 35). University ...
  89. [89]
    a case study of Hakka traditional architecture in southeastern China
    Mar 17, 2025 · The Hakka people, renowned for their resilience and bravery, have successfully preserved their unique language, customs, dietary traditions, and ...
  90. [90]
    Guest People - Project MUSE
    The term Hakka, which literally means “guest people” or “strangers,” is the ... Cohen demonstrates how nineteenth-century conflicts and collective action aligned ...
  91. [91]
  92. [92]
    [PDF] VARIATION IN NANCHANG GAN by Jie Cui - D-Scholarship@Pitt
    Mar 23, 2010 · ... Sinitic languages (both Western and Chinese) have used 'dialect' as the translational equivalent of Chinese Fangyan. In English language ...
  93. [93]
    [PDF] TONE AS A PREDICTOR OF MUTUAL INTELLIGIBILITY OF ...
    Aug 21, 2011 · All Chinese dialects have a lexical tone system but the complexity of the tones differs. Mandarin dialects typically have four lexical tones ...
  94. [94]
    The Sound Quality Characteristics of the Gan Opera Ancestral ...
    Gan opera, listed in the third batch of China's National Intangible Cultural Heritage, is characterized by various vocal techniques and performance forms that ...
  95. [95]
    The 10 most spoken dialects of the Chinese language | Sprachcaffe
    Oct 31, 2022 · 8. Gan 贛 語 With 48 million speakers, the Gan dialect is a Sinitic language spoken by many residents of Jiangxi province in China, as well as ...
  96. [96]
    The Xiangxiang dialect of Chinese | Journal of the International ...
    Aug 8, 2019 · Xiang has two main subgroups, New Xiang and Old Xiang (Zhou & You Reference Zhou and Rujie1985, Yuan 2001), and Xiangxiang is often cited as a ...
  97. [97]
    Changsha Xiang Chinese | Journal of the International Phonetic ...
    Jul 11, 2023 · Tone6 is a rising tone and is transcribed as 24. Note that Tone6 is conventionally named Rusheng (also known as 'entering tone' or 'checked tone ...
  98. [98]
    Plastic Mandarin - Language Log
    Dec 15, 2022 · The hometowns of several founding leaders of the Chinese Communist Party, including Chairman Mao Zedong ... Xiang vernacular speakers and learnt ...
  99. [99]
    Polyfunctionality of 'Give' in Hui Varieties of Chinese: A Typological ...
    Sep 15, 2023 · 1987) that Hui Chinese was categorized as an independent group of Sinitic with five subgroups. In the second edition of the Language Atlas of ...
  100. [100]
    Huizhou Merchants | Academy of Chinese Studies
    Oct 16, 2019 · The phrase Huizhou merchants refers to the merchant associations that developed in six Huizhou counties (Shexian, Xiuning, Wuyuan, Jixi, ...
  101. [101]
    (PDF) Diversity in Sinitic Languages - Academia.edu
    While borrowing freely from Northern Zhuang vocabulary, Pinghua shows much less influence from Zhuang in terms of grammatical features than do the two varieties ...
  102. [102]
    Pinghua, Northern Language (CNP) - Ethnologue
    Northern Pinghua is a stable indigenous language of China. It belongs to the Sino-Tibetan language family and is part of the Chinese macrolanguage.
  103. [103]
    Pinghua, Southern Language (CSP) - Ethnologue
    Southern Pinghua is a stable indigenous language of China. It belongs to the Sino-Tibetan language family and is part of the Chinese macrolanguage.
  104. [104]
    [PDF] The Classifications of Vowel-raising in Jin Dialects
    The paper, based on the analysis of vowel-raising of Jin Dialects belonging to the isolating mode, holds that two types of vowel-raising exist in Jin ...
  105. [105]
  106. [106]
    (PDF) A comitative source for object markers in Sinitic languages: 跟 ...
    Waxiang is an as yet unclassified Sinitic language spoken in a remote mountainous area of western Hunan in China. It has developed in an unusual contact ...
  107. [107]
    Han Chinese, Waxiang in China people group profile - Joshua Project
    Approximately 320,000 speakers of the Waxiang language live scattered throughout an area of more than 6,000 square kilometers (2,340 sq. mi.) in western ...
  108. [108]
    [PDF] Language Policy, Dialect Writing and Linguistic Diversity
    The total number of users in the Min Nan and Min Dong virtual linguistic community ranks after Cantonese and Wu, but it is quite close to Wu. The Gan and Hakka ...
  109. [109]
  110. [110]
    [PDF] Mandarin and other Sinitic languages - HAL
    1.2 Classification of Sinitic languages. On the basis of their sound ... Norman, Jerry (1988) Chinese. Cambridge: Cambridge University Press. Norman ...
  111. [111]
    Chinese - Jerry Norman - Google Books
    Jan 21, 1988 · This general introduction to the study of Chinese traces the language's history from its beginnings in the second millennium BC to the present.
  112. [112]
    [PDF] Identifying the Tai Substratum in Cantonese
    This paper attempts to identify the origins of 29 Cantonese words lacking etymological relationship with standard Chinese characters, focusing on characterless ...
  113. [113]
    (PDF) Sinitic as a typological sandwich: revisiting the notions of ...
    Feb 24, 2021 · This paper focuses on typological variation across Sinitic varieties. Through comparing the typological profiles of various Sinitic languages ...
  114. [114]
    Sinitic is a group of languages, not a single language
    Oct 12, 2017 · Lexical similarity: Tang et. al (2008) estimated Beijing-Cantonese at 24%. They also measured Wu-Mandarin at ~30%, Min-Mandarin at ~20%, ...
  115. [115]
    Typology of Chinese Languages: An Introduction to the Special Issue
    Sinitic can be conceptualized as a “typological sandwich” (Szeto & Yurayong, 2021), situated between the Altaic languages to its north and the Mainland ...
  116. [116]
    [PDF] Swadesh's glottochronology
    Refutation: the rate of change may differ from Swadesh's value of 0,14 (Bergsland and Vogt 1962, O'Neil 1964, Fodor 1961) and depends on divergence time. ( ...
  117. [117]
    Dated language phylogenies shed light on the ancestry of Sino ...
    May 6, 2019 · Based on a dataset of 50 Sino-Tibetan languages, we infer phylogenies that date the origin of the language family to around 7200 BP.
  118. [118]
    Dated phylogeny suggests early Neolithic origin of Sino-Tibetan ...
    Nov 27, 2020 · An accurate reconstruction of Sino-Tibetan language evolution would greatly advance our understanding of East Asian population history.
  119. [119]
    Ancient genomes from northern China suggest links between ...
    Jun 1, 2020 · Our results suggest a link between changes in subsistence strategy and human migration, and fuel the debate about archaeolinguistic signatures of past human ...
  120. [120]
    [PDF] arXiv:2307.01209v1 [cs.CL] 30 Jun 2023
    Jun 30, 2023 · This paper uses machine learning to create multi-dialectal representations of Sinitic syllables using a knowledge graph and BoxE technique.
  121. [121]
    Toward Modern Mandarin (Part VI) - A Phonological History of ...
    Jun 9, 2020 · This is the ideal system established by the Ming government (e.g. all finals with nasal coda -m, -n, or -ŋ have corresponding finals with a stop ...
  122. [122]
    Middle Chinese (Part III) - A Phonological History of Chinese
    Jun 9, 2020 · Middle Chinese phonology is mainly based on the information from the phonological system of the Qièyùn 切韻, a rhyme dictionary compiled in 601 ...
  123. [123]
    [PDF] AN ACOUSTIC STUDY OF TONE SANDHI IN LISHUI WU
    ABSTRACT. This study investigated the tone sandhi in the Lishui Wu dialect, an understudied variety of Southern Wu dialects in Zhejiang Province, China.
  124. [124]
    [PDF] Right-Dominant Tones in Zhangzhou - ACL Anthology
    Nov 21, 2022 · This study conducts a systematic acoustic exploration into the phonetic nature of rightmost tones in a right-dominant tone sandhi system ...
  125. [125]
    (PDF) The Origins of Sinitic - Academia.edu
    Sinitic emerged from a complex interplay of Sino-Tibetan and Southeast Asian linguistic features. The Shang language likely served as a creole lingua franca ...
  126. [126]
    Aspect of Chaozhou Grammar - jstor
    ... topic-comment languages where the topic is what the sentence is about and the comment is the speaker's comment on the topic which presents the subject matter.
  127. [127]
    Classifiers in Sinitic languages: From individuation to definiteness ...
    The present study will focus on the bare classifier construction [Cl+N] and the role of the classifier for (in)definiteness-marking within that construction in ...
  128. [128]
    How Do You Use the Possessive Particle ge3? - CantoneseClass101
    May 17, 2019 · 嘅 (ge3) is a possessive particle, and, when combined with a pronoun, is similar to " 's " in English, as well as the possessive pronouns “my,” ...
  129. [129]
    (PDF) TOWARDS A TYPOLOGY OF ASPECT IN SINITIC LANGUAGES
    ... analytic tonal languages using means other than inflectional morphology to signal grammatical relations. ... Sinitic languages are immigrant languages (Li ...
  130. [130]
    Serial Verbs - Brill Reference Works
    The phenomenon of serial verbs, verb serialization or serial verb constructions (SVCs) is not restricted to Sinitic languages, it is an areal phenomenon of ...
  131. [131]
    [PDF] Postpositions vs. prepositions in Mandarin Chinese
    Sinitic languages contain well- known examples of both types. Mixed orders are exemplified by prepositions, postpositions and circumpositions occurring in ...
  132. [132]
    STANDARD SPOKEN AND WRITTEN CHINESE LANGUAGE LAW
    Article 10 Putonghua and the standardized Chinese characters shall be used as the basic language in education and teaching in schools and other institutions ...
  133. [133]
  134. [134]
    Teachers' language use in United Kingdom Chinese community ...
    Aug 11, 2022 · This study deals with teachers' language use as it is manifested in community-based heritage-language classes.
  135. [135]
    ASR-CShhiDiaCSC: A Chinese Shanghai Dialect Conversational ...
    This open-source dataset consists of 4.19 hours of transcribed Shanghai dialect conversational speech on certain topics, where ten conversations between ten ...
  136. [136]
    The online digital archives plan of Hakka cultural assets is officially ...
    Feb 16, 2022 · The online digital archives plan of Hakka cultural assets is officially launched! To allow more people to learn more about the content of ...
  137. [137]
    Language Modernization in the Chinese Character Cultural Sphere
    Oct 14, 2022 · This chapter delineates lexical modernization in four states (Japan, China, Korea, Vietnam) where the writing system allowed for modernization and ...
  138. [138]
    Chinese Loanwords in Vietnamese - Brill Reference Works
    All of the words in Table 3 are seen in not only Chinese and Vietnamese but also Japanese and Korean, shown with Romanized pronunciation. Mandarin readings ...
  139. [139]
    [PDF] Chinese Loanwords in Vietnamese Pronouns and Terms of Address ...
    Chinese loanwords have played a significant role in the Vietnamese system of pronouns and terms of address and reference. Semantic and pragmatic features of.
  140. [140]
    (PDF) Topic Prominence - ResearchGate
    Nov 2, 2020 · East Asian languages grouped together as topic‐prominent turn out to display important differences. In Japanese and Korean, the A‐domain extends higher than in ...
  141. [141]
    Chapter 5. A single origin of numeral classifiers in Asia and the Pacific
    One existing hypothesis suggests that numeral classifiers in Asia emerged first in the Sinitic language group and spread to nearby languages, e.g., Tai-Kadai, ...
  142. [142]
    [PDF] Classifiers in Southeast Asian languages - HAL
    Dec 8, 2020 · Abstract. Classifiers are one of the types of nominal classifications systems that help speakers to identify dis- course referents.
  143. [143]
    Sociolinguistic variation in Colloquial Singapore English sia
    Jul 21, 2024 · Thus, some scholars also characterise CSE as an English-lexifier creole with substratal influences from Sinitic varieties such as Hokkien, ...
  144. [144]
    [PDF] Chinuk Wawa (Chinook Jargon) etymologies Henry Zenk, Tony ...
    Chinuk Wawa (Chinook Jargon) is a language with lexical items from various sources, including Chinookan, Salishan, and English. 'Wawa' means 'to talk, say, ...
  145. [145]
    (PDF) Sinitic loanwords in two Hmong dialects of Southeast Asia
    Jun 10, 2016 · It is estimated that more than 20% of the Hmong lexicon may be of Sinitic origin, borrowed over a period of time that may have started around ...
  146. [146]
  147. [147]
    Taiwanese Identity and Culinary Diplomacy: Moving from Dim Sum ...
    Nov 29, 2023 · Gastrodiplomacy has the potential to encourage foreign tourism, teach foreign audiences about the complexities of Taiwanese identity, and gain global support ...
  148. [148]
    Decoding the Vocabulary of Chinese Foreign Policy
    Apr 11, 2023 · The dictionary's second edition, which includes 26 terms. It is an invaluable resource for understanding Chinese foreign and domestic policies.