Manner of articulation is a fundamental concept in articulatory phonetics that describes how the airstream from the lungs is obstructed, constricted, or otherwise modified by the articulators in the vocal tract to produce consonant sounds.[1] It focuses on the degree of closure or narrowing between the active articulator (such as the tongue or lips) and the passive articulator (such as the teeth, alveolar ridge, or velum), which determines the acoustic and auditory qualities of the sound.[1]Consonants are classified into several main manners based on this airflow modification, including stops (also called plosives), where the airstream is completely blocked in the oral cavity before being released, as in , , and ; fricatives, where the articulators are close enough to cause turbulent friction, as in , , and [ʃ]; and affricates, which combine a stop closure followed by fricative release, as in [tʃ] and [dʒ].[1] Other key manners encompass nasals, in which the velum is lowered to allow airflow through the nasal cavity while the mouth is closed, such as , , and [ŋ]; approximants (including glides and liquids), featuring minimal obstruction without significant friction, as in , , , and [ɹ]; and flaps or trills, involving brief or vibrating interruptions of airflow, like the alveolar flap [ɾ] in American English.[1]This classification, often combined with place of articulation (the location of constriction) and voicing (vibration of the vocal folds), provides a systematic framework for describing and comparing consonant inventories across languages.[1] For instance, English has a relatively limited set of manners compared to languages like !Xóõ, which includes clicks as an additional manner involving ingressive oral airflow.[2] Understanding manner of articulation is essential for phonetic transcription, language teaching, and phonological analysis.
Core Concepts
Definition and Scope
Manner of articulation refers to the specific way in which the airflow from the lungs—or, in some cases, other airstream mechanisms—is obstructed or modified by the articulators in the vocal tract to produce speech sounds, particularly consonants.[3] This parameter focuses on the degree and type of constriction or closure formed by the active and passive articulators, such as the tongue, lips, and palate, which shapes the acoustic properties of the sound without regard to the precise location of the constriction.[1] For instance, complete closure of the vocal tract temporarily blocks airflow to create a stop, while a narrow constriction allows turbulent airflow to produce a fricative.[3]The concept of manner of articulation emerged in phonetic theory during the 19th century, building on the pioneering work of linguists such as Henry Sweet, who advanced systematic classifications of speech sounds in works like his Handbook of Phonetics (1877).[4] Sweet's contributions laid foundational principles for describing how articulatory configurations affect sound production, influencing subsequent scholars including Daniel Jones, who further refined phonetic notation in the early 20th century.[4] Modern standardization of manner descriptions occurred through the International Phonetic Alphabet (IPA), established by the International Phonetic Association in 1886, with Sweet as an early member; the IPA provides a universal framework for transcribing manners consistently across languages.[4]Manner of articulation plays a crucial role in phonological contrasts, enabling languages to distinguish meaning through variations in airflow modification, which directly impacts speech intelligibility and the composition of sound inventories. For example, in English, the voiceless bilabial stop /p/ (with full closure) contrasts with the voiceless labiodental fricative /f/ (with narrowing), as in "pat" versus "fat," where differences in manner create minimal pairs that alter word meaning.[5] This feature allows languages to build diverse consonant systems, with some prioritizing certain manners to encode phonological oppositions essential for communication clarity.
Relation to Place of Articulation and Airstream
Manner of articulation specifies the configuration of the vocal tract articulators that modifies the airstream to produce a speech sound, while place of articulation identifies the precise location of that modification along the vocal tract. Together, these parameters define the essential characteristics of consonants, with manner addressing the "how" of airflow obstruction and place the "where." For example, in the sound /p/, the manner is plosive, involving complete closure and sudden release of airflow, combined with a bilabial place where the lips meet to form the obstruction.[6][7]Airstream mechanisms provide the foundational airflow that manner and place act upon, determining the direction and source of air movement through the vocal tract. The predominant mechanism in most languages is pulmonic egressive, in which air is expelled outward from the lungs via contraction of the respiratory muscles, serving as the basis for standard manners like stops and fricatives. Non-pulmonic mechanisms, such as those used in ejectives or clicks, alter the airflow direction and intensity, thereby influencing compatible manners.[8] Conceptual models of airstream distinguish pulmonic flow as steady and outward, contrasting with the localized pressure bursts in other types, which require coordinated articulator movements to avoid disrupting the primary manner.[9]The coordination of manner, place, and airstream is central to phonetic classification systems, such as the International Phonetic Alphabet (IPA) consonant chart, where places of articulation are arrayed horizontally (from bilabial to glottal) and manners vertically (from plosives to approximants), with pulmonic egressive as the default airstream modified by symbols for non-pulmonic variants. This integration allows for systematic representation; the alveolar fricative /s/, for example, results from a narrow groove formed by the tongue tip against the alveolar ridge (place: alveolar), creating frictional turbulence in the pulmonic egressive airstream (manner: fricative).[10] Such combinations highlight how airstream provides the dynamic energy, place sets the anatomical site of interaction, and manner dictates the degree and type of stricture, enabling diverse sound inventories across languages.Understanding these relations presupposes knowledge of vocal tract anatomy, where articulators like the tongue (for coronal and dorsal places), lips (for labial places), and velum (for routing airstream to the nasal cavity in nasal manners) enable the modifications. The tongue's flexibility, for instance, allows precise adjustments at multiple places to achieve varying manners, while the velum's elevation or lowering directs pulmonic airflow orally or nasally, foundational to manner distinctions without altering the primary airstream.[7] This anatomical interplay ensures that manner and place operate in concert with airstream to produce perceptually distinct sounds, forming the prerequisites for analyzing consonant systems in phonetics.[6]
Classification Frameworks
Obstruents and Sonorants
Obstruents are speech sounds characterized by significant obstruction of the airflow in the vocal tract, resulting in turbulence or complete blockage that produces noisy or silent acoustic effects.[3] This class includes sounds subdivided by the degree of stricture, such as stops (with full closure) and fricatives (with partial narrowing causing friction).[11] Articulatorily, obstruents involve a sufficiently narrow vocal tract configuration that generates turbulence even with modal voiced airflow, classifying them as [-sonorant] in phonological feature systems.[11]In contrast, sonorants are produced with minimal obstruction of the airstream, permitting free resonance in the oral or nasal cavities and yielding periodic, vowel-like acoustic qualities.[3] Examples encompass nasals, approximants, liquids, glides, and vowels, all marked as [+sonorant] due to their open vocal tract allowing unimpeded airflow without turbulence.[11] Acoustically and articulatorily, obstruents exhibit low sonority from pressure buildup and aperiodic noise, whereas sonorants display high sonority through sustained periodic voicing and resonant formants.[12] The basic sonority hierarchy ranks obstruents as the least sonorous, ascending through nasals and liquids to glides and vowels at the peak.[12]Phonologically, obstruents typically occupy syllable margins, such as onsets, where they create perceptual contrast through their abrupt, non-resonant profiles; for instance, the English voiceless stop /t/ in "top" functions as an obstruent onset.[12] Sonorants, by virtue of their higher sonority, often form syllable nuclei or act as glides, supporting the core resonance of syllables; the English nasal /m/ in "man" exemplifies a sonorant in a marginal position but with resonant properties akin to a nucleus.[12] This binary distinction underlies cross-linguistic patterns in syllable structure and phonotactics, with obstruents favoring positions for demarcation and sonorants enabling sustained sonority peaks.[12]
Degrees of Stricture
The degree of stricture refers to the extent to which the airflow through the vocal tract is obstructed by the articulators during the production of speech sounds, forming a gradient scale that serves as the foundational framework for classifying manners of articulation from maximal obstruction to minimal interference.[13] This scale ranges from complete closure, where airflow is fully blocked, to open approximation, where the tract remains relatively unobstructed, allowing smooth passage of air.[14] The stricture hierarchy organizes manners accordingly: complete closure characterizes plosives, involving total occlusion that builds up intraoral pressure until release; near-closure with turbulence defines fricatives, where a narrow channel generates frictional noise; momentary closure followed by fricative release distinguishes affricates; nasal diversion occurs with complete oral closure but lowered velum, as in nasals; and open passage typifies approximants, with sufficient space to avoid turbulence.[13] This hierarchy underpins the obstruent-sonorant binary, where obstruents exhibit greater stricture (complete or near-complete) than sonorants (nasal or open).[15]Articulatorily, stricture arises from the proximity of active and passive articulators, creating pressure differentials that determine acoustic outcomes: full closure produces silence due to blocked airflow and pressure buildup behind the occlusion, while partial strictures accelerate air velocity through constrictions, leading to turbulence in fricatives.[14] In frication, the narrowing invokes Bernoulli's principle conceptually, where increased airflow speed reduces static pressure, destabilizing the jet and promoting turbulent eddies that generate noise without requiring equations for description.[16] For nasals and approximants, the mechanics involve either a secondary nasal pathway or wider oral spacing, maintaining resonant airflow without significant pressure imbalance.[13]Phonetic measurement of stricture relies on qualitative scales, such as those in the International Phonetic Alphabet (IPA), which categorize manners based on articulatory constriction degrees without numerical precision, emphasizing perceptual and production distinctions. Acoustically, correlates distinguish stricture levels: fricatives exhibit broadband noise spectra from turbulent energy, often in high-frequency ranges, whereas approximants show clear formant structures resembling vowels due to periodic voicing and minimal obstruction.[17] Plosives, meanwhile, feature silence during closure followed by burst transients, and nasals display lowered formant intensities with nasal murmurs.[14]Variations in stricture include central versus lateral configurations: central stricture obstructs the vocal tract midline, forcing air through peripheral channels, while lateral stricture permits airflow along the sides despite central approximation, as seen in certain approximants or fricatives.[18] In modern articulatory phonology, stricture is further nuanced by gestural overlap, where overlapping articulatory gestures for adjacent sounds can modulate effective constriction degrees, influencing manner transitions through spatiotemporal coordination rather than isolated closures.[19]
Primary Manners of Articulation
Plosives
Plosives, also known as stops, are consonants characterized by a complete closure of the vocal tract that blocks oral airflow, followed by the buildup and sudden release of intraoral pressure. This manner of articulation involves three primary stages: the closurephase, in which the articulators form a tight seal at the place of articulation to prevent air escape; the hold phase, during which subglottal air pressure increases behind the closure, creating a pressure differential; and the release phase, marked by an abrupt opening that results in a brief burst of sound as the compressed air is expelled. Plosives represent the stricture degree of complete closure, distinguishing them from manners with partial obstruction.[20]Common types of plosives include voiceless variants such as /p/, /t/, and /k/, produced without vocal fold vibration during the hold and release, and voiced counterparts like /b/, /d/, and /g/, where the vocal folds vibrate throughout or primarily during the hold phase. Additional variants occur cross-linguistically, including aspirated plosives (e.g., /pʰ/, /tʰ/, /kʰ/), in which a puff of glottal airflow follows the release, as in English word-initial positions or languages like Hindi; and pre-nasalized forms (e.g., /ᵐb/, ⁿd/, ᵑɡ/), featuring a preceding homorganic nasal element with lowered velum during the initial closure, common in Bantu languages such as Swahili. These variants often function phonologically as unitary segments despite their complex articulation.[21][22]Acoustically, plosives exhibit a period of silence or low-amplitude noise during the closure phase due to the absence of airflow, followed by a transient burst—a short, noise-like release spectrum that varies by place of articulation—and subsequent formant transitions into adjacent vowels that provide cues for identification. For instance, bilabial bursts are diffuse with low-frequency energy, while alveolar bursts show mid-frequency concentration. Plosives are universally present in the world's languages, with voiceless /p/, /t/, and /k/ appearing in nearly all, and labial plosives like /p/ occurring in approximately 93% of sampled languages, underscoring their fundamental role in human speech inventories.[23][20][24][25]In phonology, plosives frequently participate in processes like gemination, where they are lengthened or doubled to signal morphological or lexical contrasts, as in Italian (e.g., /p/ in appa vs. apa) or Finnish, enhancing duration in the hold phase for perceptual distinction. Prenasalized plosives similarly serve contrastive functions, behaving as single phonemes that trigger nasal harmony or tone depression in languages like Zulu, while also stabilizing voicing in obstruent systems. These roles highlight plosives' versatility in structuring syllables and words across diverse linguistic families.[22]
Fricatives
Fricatives are consonant sounds produced by directing airflow through a narrow constriction in the vocal tract, generating turbulent airflow and characteristic frication noise. This partial stricture divides the vocal tract into anterior and posterior cavities, with the length of the anterior cavity influencing the resonant frequencies of the noise spectrum.[26] Unlike complete closures, this sustained turbulence allows continuous sound production without interruption.[27]Fricatives are classified as sibilant or non-sibilant based on the intensity and frequency of their noise. Sibilants, such as /s/ and /ʃ/, exhibit high-amplitude, high-frequency hissing due to articulatory features like a grooved tongueblade directing airflow against the alveolar ridge or palate, enhancing turbulence in the 4-8 kHz range. Non-sibilant fricatives, like /f/ and /θ/, produce lower-intensitynoise with broader spectral energy, often lacking the sharp sibilance because of less focused airflow, as in labiodental or dental articulations.[26][27]Voicing in fricatives introduces periodic vocal fold vibration superimposed on the aperiodic frication noise. Voiceless fricatives, exemplified by /s/, feature aspiration-like turbulent noise without vocal cord involvement, resulting in higher amplitude and longer duration compared to their voiced counterparts. Voiced fricatives, such as /z/, combine the frication with low-frequency energy from voicing, but aerodynamic constraints often lead to partial devoicing or shorter durations to maintain subglottal pressure for vibration.[26][27]Acoustically, fricatives are analyzed through spectral properties, including peaks and moments like center of gravity (spectral mean), which decrease posteriorly (e.g., /s/ at 5-6 kHz versus /ʃ/ at 4-4.5 kHz). Sibilants show concentrated high-frequency energy, while non-sibilants have more diffuse spectra; formant transitions from adjacent vowels further aid place distinctions, such as between /f/ and /θ/. Articulatory adjustments, like tongue grooving for sibilance, sharpen these spectral peaks, as confirmed by methods like multitaper spectral estimation for robust measurement.[26][27]Cross-linguistically, fricatives occur in about 91% of the world's languages, but they are rarer in some isolates, where they often appear primarily in loanwords rather than core vocabulary. In tone languages like Mandarin, the broadband noise of fricatives can interact with lexical toneperception, influencing manner identification through masking of pitch cues or altered spectral contrasts.[26]
Affricates
Affricates are consonants characterized by a manner of articulation that involves a complete oral closure, akin to a plosive, followed by a slow release producing fricative-like turbulence at the same articulatory location. This sequential process creates a complex sound unit where the airstream is first obstructed entirely before transitioning to partial obstruction, generating noise through friction. For instance, the English voiceless postalveolar affricate /tʃ/, as in "church," begins with an alveolar ridge closure released into post-alveolar frication. Affricates occur in approximately 60% of the world's languages, less universally than plosives or fricatives.[18][1][28]A key distinction exists between true affricates and pseudo-affricates. True affricates function as single phonemes with a monophthongal, unitary transition from closure to frication, maintaining consistent voicing throughout—such as the voiced /dʒ/ in English "judge," where the stop and fricative phases share voicing. In contrast, pseudo-affricates are sequences of distinct segments (a stop followed by a fricative), often allowing independent voicing or prosodic behavior, as seen in Polish minimal pairs like "czy" (true affricate /tɕ/) versus "trzy" (pseudo-affricate /tʂ/). This differentiation can be phonemically contrastive in languages like Polish.[18]Phonetically, affricates exhibit variable duration ratios between the stop closure and fricative release phases, influenced by language, position, and speaking rate. The fricative phase often dominates the overall duration, contributing to the perceptual unity of the segment and aiding distinction from fricative-only sounds. Affricates combine plosive and fricative strictures but are treated as a hybrid manner.Historically, affricates frequently evolve from palatalized stops through sound changes that introduce frication during release, as documented in languages like Laomian where bilabial or velar plosives develop affrication post-palatalization. In African languages, affricate clusters are notable, such as lateral affricates in Khoisan varieties, where they form part of expanded consonant inventories including series of fricatives and affricates.[29][30]
Sonorant Manners
Nasals
Nasal consonants are produced by creating a complete closure in the oral cavity at some point of articulation while simultaneously lowering the velum to allow airflow through the nasal cavity.[31] This results in the sound being channeled exclusively through the nose, with the oral tract serving as a side branch that contributes to the acoustic properties.[32] Common places of articulation for nasals include bilabial (/m/), alveolar (/n/), and velar (/ŋ/), aligning with typical oral stop positions in many languages.[33]Nasals are typically voiced due to the steady airflow required for their production, which sustains vocal fold vibration, although voiceless nasals occur in some languages (e.g., Burmese, certain Tibeto-Burman languages).[1][34] Their resonance is characterized by low-frequency formants arising primarily from the pharyngeal-nasal airway, creating a distinct nasal timbre.[35] Additionally, the closed oral cavity introduces anti-resonances, or zeros in the spectrum, that further shape the sound by attenuating certain frequencies.[35]Phonologically, nasals participate in processes such as prenasalization, where a nasal precedes a stop in clusters like /ᵐb/ or /ⁿd/, often treated as single units in languages with such sequences.[36] Post-nasal contexts frequently trigger voicing assimilation, where following obstruents become voiced, as seen in patterns across many languages.[37] While nasal consonants are nearly universal, their inventories vary, typically limited to places matching oral stops, with bilabial, alveolar, and velar nasals being the most common and labiodental nasals rare.[38]Acoustically, nasals feature a low-amplitude nasal murmur during the closure, distinguished by broad spectral peaks and formant transitions that reflect the place of articulation.[39] These transitions provide cues for perception, with coarticulatory effects extending nasalization to adjacent vowels. Recent research highlights how children acquire nasal coarticulation gradually, achieving adult-like coordination of velum lowering and oral gestures by around age five, aiding phonological development.[40]
Approximants
Approximants are consonants characterized by a degree of stricture in which the articulators approach each other but remain sufficiently distant to allow smooth, laminar airflow through the vocal tract without generating turbulent friction.[18] This open stricture, narrower than that of vowels yet wider than fricatives, results in central approximation where airflow passes primarily through the midline of the oral cavity.[18] Representative examples include the labiovelar approximant /w/, formed by rounding the lips and raising the back of the tongue toward the velum, and the palatal approximant /j/, produced by raising the front of the tongue toward the hard palate.[41]A key subtype of approximants consists of glides, also termed semivowels, which are vowel-like in articulation but function as consonants due to their non-syllabic role and shorter duration.[42] For instance, /j/ in English "yes" acts as a glide preceding a vowel, exhibiting a rapid articulatory transition from a high front vowel position, while in diphthongs like that of "boy," it serves as a semivowel offglide following the nucleus.[42] Their realization is highly context-dependent, with duration typically brief (often under 100 ms) and influenced by surrounding vowels, leading to allophonic variation where they may approach fricative-like quality in rapid speech.[42]Acoustically, approximants display formant patterns closely resembling those of adjacent vowels, characterized by steady-state formants and gradual transitions rather than abrupt changes or noise.[43] The palatal /j/ features a high second formant (F2 around 1,850–2,100 Hz), mirroring the high front vowel /i/, while the labiovelar /w/ shows a lowered F2 (600–850 Hz) due to back tongue constriction and liprounding, which also reduces overall intensity.[43] These properties enable approximants to contribute to diphthong formation, providing smooth offglide transitions that enhance vowel quality contrasts without introducing spectral turbulence.[43]Cross-linguistically, approximants frequently occur as offglides in diphthongs, with glides like /j/ and /w/ present in the inventories of over 90% of sampled languages according to phonological database analyses. Their distribution shows variation, as they often hold marginal phonological status in languages emphasizing obstruent contrasts, and in click languages of southern Africa, approximants typically manifest as secondary articulations accompanying ingressive clicks rather than as standalone primary manners.[44]
Laterals and Rhotics
Lateral approximants are consonantal sounds produced with a partial central blockage in the vocal tract, where the airflow is directed around the sides of the tongue.[45] This manner involves the tongue making contact at a central place of articulation, such as the alveolar ridge, while the sides of the tongue are lowered to allow unobstructed lateral airflow without turbulence.[45] The alveolar lateral approximant /l/, found in languages like English and Spanish, exemplifies this, with the tongue tip raised to the alveolar ridge and air escaping bilaterally or unilaterally.[45] Retroflex lateral approximants like /ɭ/, occurring in Dravidian languages such as Tamil, feature the tongue tip curled backward toward the hard palate, maintaining the lateral airflow pathway.A key variation in alveolar laterals is the distinction between clear and dark realizations, determined by secondary articulation. Clear /l/ maintains a neutral tongue body position, promoting a fronted vowel-like quality, as in syllable-initial positions in English (e.g., "leaf").[46] In contrast, dark /l/ involves velarization, where the tongue body raises toward the velum, creating a back, rounded co-articulation similar to a velar approximant, typically in syllable-coda positions (e.g., "full").[46]Rhotics constitute a diverse class of sonorants characterized by vibratory or bunched tongue configurations that produce a distinctive "r"-like quality, often involving trill, flap, or approximant manners.[47] Common types include the alveolar trill /r/, produced by rapid vibration of the tongue tip against the alveolar ridge as in Italian or Spanish; the alveolar flap /ɾ/, a brief single-contact tap as in American English intervocalic "butter"; and the postalveolar approximant /ɹ/, a smooth non-vibratory approximation in English.[47] These realizations vary widely across languages, with over 75% of the world's languages featuring at least one rhotic, often contrasting multiple types (e.g., flap vs. trill in Spanish).[47] As sonorants, rhotics exhibit minimal stricture, allowing periodic voicing with low airflow obstruction.[45]Articulatory diversity extends to obstruent variants of laterals and specific rhotic configurations. Lateral fricatives, such as the voiceless alveolar /ɬ/ in Welsh or Tlingit, function as obstruents by narrowing the lateral channels to generate turbulent airflow and frication, contrasting with approximant laterals in manner while sharing lateral directionality.[48] In American English, rhotics frequently employ a bunched gesture, where the tongue body and sides elevate centrally without retroflexion, creating a medial constriction near the palate.[49]Rhotics present notable phonetic challenges, particularly their instability during acquisition, often resulting in late mastery and frequent misarticulations across languages due to the need for coordinated dual tongue gestures (e.g., front raising and root retraction).[50]Ultrasound studies post-2020 have illuminated these complexities, revealing consistent pharyngeal involvement in rhotic production; for instance, Australian English rhotics show variable bunching and retroflexion patterns influenced by vowel context, with tongue root advancement contributing to articulatory stability.[51] Similarly, investigations of uvular rhotics in languages like Upper Sorbian confirm tongue root gestures lowering the second formant, underscoring the multi-gestural nature of rhotics.[52]
Non-Pulmonic and Secondary Manners
Implosives and Ejectives
Implosives and ejectives are non-pulmonic consonants produced using a glottalic airstream mechanism, where airflow is initiated by movements of the larynx rather than the lungs. These sounds modify the basic plosive manner of articulation by incorporating glottal closure alongside oral closure, but with distinct directional airflow patterns.[53]Implosives are produced with an ingressive glottalic airstream, involving a closed glottis and a downward movement of the larynx that creates rarefaction, or negative pressure, in the supraglottal cavity, drawing air inward upon release of the oral closure. The velum is raised to seal the nasal cavity, preventing air escape, and the vocal folds typically vibrate during this process, resulting in voiced implosives such as the bilabial /ɓ/ or alveolar /ɗ/. This inward pull contrasts with pulmonic plosives by generating a sucking effect rather than explosive outflow. In languages like Sindhi, implosives form a phonemic contrast with voiced plosives, appearing in words like /ɓəɾi/ ('child') and contributing to the language's four implosive series at bilabial, dental, palatal, and velar places of articulation.[53][54][55][56][54]Ejectives, in contrast, employ an egressive glottalic airstream, where the closed glottis is raised upward, compressing air in the sealed supraglottal cavity to build positive pressure for an outward burst upon oral release. This mechanism ensures voicelessness, as the vocal folds remain approximated without vibration, producing sounds like the bilabial /pʔ/ or alveolar /tʔ/ (often transcribed as /p'/, /t'/). The supraglottal pressure buildup creates a sharper, more abrupt release than in pulmonic voiceless plosives. Ejectives are prevalent in Caucasian languages such as Georgian, where they contrast with plain stops in series like /p, p', pʰ/, and in Native American languages like Navajo, featuring ejective stops and affricates across multiple places of articulation.[53][56][57][58][59]The core production difference lies in larynx displacement: lowering for implosives to induce rarefaction and inward airflow, versus raising for ejectives to generate supraglottal pressure and outward ejection, with implosives typically voiced and ejectives voiceless due to phonatory constraints. Both lack the pulmonic airflow that enables voicing in standard stops, often integrating phonologically as obstruents despite variable behavior in processes like nasalization or coda positioning. In typological surveys, glottalized consonants like these occur in about 27% of sampled languages, with ejectives in 16% (primarily in the Americas and Caucasus) and implosives in 13% (concentrated in Africa and Southeast Asia), though recent documentation reveals their presence in select Papuan languages of New Guinea.[56][55][60][59]
Clicks and Other Lingual Egressive Sounds
Clicks are speech sounds produced using a lingual ingressive airstream mechanism, in which air is drawn into the mouth by lowering the tongue body after creating a sealed pocket of air between two oral closures.[61] The posterior closure is formed by raising the back of the tongue against the velum, while the anterior closure occurs at a forward point such as the teeth, alveolar ridge, or lips; upon release of the anterior closure, the rarefied air produces a characteristic suction pop.[62] This velaric mechanism contrasts with pulmonic airstreams by relying solely on tonguemovement for airflow initiation, without involvement of the lungs.[61]The primary types of clicks are distinguished by the location of the anterior closure and are represented in the International Phonetic Alphabet (IPA) with symbols such as dental /ǀ/ (as in the "tsk" sound), alveolar /ǃ/, palatal or palato-alveolar /ǂ/, lateral /ǁ/ (side-released from the alveolar region), and bilabial /ʘ/ (using lip closure).[61] These occur prominently in Khoisan languages of southern Africa, such as Nama (with dental, alveolar, palatal, and lateral clicks) and !Xóõ (with up to 20 click distinctions), where they function as phonemic consonants in word-initial positions.[63][64]Clicks are typically accompanied by a secondary pulmonic egressive airstream at the posterior closure, which determines their manner: tenuis clicks are voiceless and unaspirated (e.g., /kǀ/ in Nama, with velar release); aspirated clicks involve breathy release (/kǀʰ/); nasal clicks feature velum lowering for nasal airflow (/ŋǀ/ voiced or /ŋ̊ǀ/ voiceless); and glottalized clicks include a glottal stop (/ʔǀ/).[63] Voiced nasal accompaniments, common in Khoisan languages, combine the ingressive click influx with simultaneous voicing and nasal resonance from the lungs.[64] Phonologically, clicks are not standalone consonants but ingressive influxes modified by these accompaniments, functioning as complex segments in the syllable onset.[64]Other lingual sounds include rare egressive variants, where the velaric mechanism expels air outward through reversed tongue movement, though such sounds are unattested in natural languages and limited to theoretical or imitative contexts; suction-based ingressives dominate, as in percussive or interactive clicks.[65] Clicks have spread historically from Khoisan substrates into Bantu languages through contact, with borrowing evident in Nguni languages like Xhosa and Zulu, where dental, alveolar, and lateral clicks were incorporated as early as Proto-Nguni around the 15th century.[66]Phonetically, the velum's role in sealing the oral cavity is crucial for maintaining the pressure differential during click formation, preventing air escape until release.[62] Acoustically, clicks exhibit a sharp transient pop at release (with higher intensity in percussive types, around 60-80 dB), followed by low-frequency noise and formant structure influenced by cavity size; for instance, dental clicks show a lower center of gravity (spectral centroid) than lateral ones due to front cavity resonance.[65] Documentation has also included Australian languages, notably the Damin ritual register of the Lardil language (now extinct), which features five click types (dental, lateral, etc.) outside Africa, highlighting convergent evolution in isolated linguistic traditions.[67]