Fact-checked by Grok 2 weeks ago

Articulatory phonetics

Articulatory phonetics is the branch of that investigates the physiological mechanisms involved in producing , focusing on how the articulators—such as the , , , and vocal folds—shape from the lungs to create audible phones. This subfield emphasizes the coordination of the vocal tract to generate consonants and vowels, distinguishing it from , which examines sound waves, and auditory phonetics, which studies . typically begins with pulmonic egressive , where air is expelled from the lungs through the and supralaryngeal vocal tract, modified by voicing (vibration of the vocal folds for sounds like or ) or (no vibration, as in or ). The vocal tract, extending from the glottis to the lips and including nasal passages, comprises key structures like the , oral cavity, alveolar ridge, hard and soft , teeth, and , all of which act as movable to constrict or open the airway. For instance, the tongue's position relative to the determines places of articulation, while the velum controls nasal versus oral airflow. These configurations allow for precise control, enabling speakers to produce a wide range of sounds across languages, though individual variations in anatomy influence realization. Consonants are classified primarily by place of articulation (e.g., bilabial for [p, b] involving both lips, alveolar for [t, d] at the ridge behind the teeth) and manner of articulation (e.g., stops with complete closure like , fricatives with turbulent airflow like , nasals with velum lowering like ). Voicing further differentiates pairs, such as voiced versus voiceless . Vowels, in contrast, involve open vocal tract configurations defined by tongue height (high like , low like ), frontness/backness (front , back ), and lip rounding (rounded , unrounded [ɪ]), forming the resonant quality of speech. Articulatory phonetics underpins phonological analysis by revealing how abstract sound units (phonemes) are physically realized as allophones, informing fields like speech therapy, , and . Advances in imaging techniques, such as MRI and , have enhanced empirical study of these processes, providing dynamic views of in .

Vocal Tract Anatomy

Articulators

The articulators are the movable and fixed structures within the vocal tract that interact to modify airflow and produce . These include both active components, which can move to create constrictions or openings, and passive ones that serve as points of . The primary articulators extend from the to the , shaping the acoustic properties of sounds by altering the vocal tract's configuration. The jaw (mandible) is a primary active articulator, consisting of the movable lower jawbone that hinges at the temporomandibular joint. It elevates and depresses to control the vertical dimension of the oral cavity, facilitating mouth opening for vowels and closures for stops, while also influencing tongue positioning through its support of the tongue base. Lips are the outermost active articulators, consisting of soft, muscular tissue capable of protrusion, , and spreading. They modify by closing completely to create bilabial closures or approximating to produce turbulence, and their shape influences resonance through rounding or spreading. The tongue is the most versatile active articulator, a muscular hydrostat divided into parts: the tip () for precise contact at forward points; the blade (underside near the tip) for broader approximations; the front for raising toward the ; the back for contacting rear structures; and the root anchoring near the . The tongue raises, lowers, advances, or retracts to adjust cavity sizes, enabling high vowels via elevation or constrictions via approximation to fixed structures. Teeth, particularly the upper incisors, act as passive articulators, providing a fixed edge for the or lower to contact, generating sounds through narrow channels or stops via complete blockage. The alveolar ridge, a bony prominence just behind the upper teeth, serves as a passive target for the or , facilitating alveolar constrictions that shape sounds by narrowing the front oral cavity. The , the bony roof of the mouth extending rearward from the alveolar ridge, functions as a passive articulator for the tongue front or blade, creating palatal contacts that modify airflow for higher-frequency resonances. The soft palate (velum) is a movable muscular flap at the rear of the hard palate, actively raising to seal the for oral sounds or lowering to allow nasal airflow, thus directing resonance paths and producing nasal versus oral distinctions. The uvula, a small projection from the velum's trailing edge, can vibrate or contact the tongue back in some languages, adding pharyngeal modulation. The pharyngeal wall, the flexible rear wall of the throat above the , acts passively as a target for the tongue root, constricting the to lower frequencies in certain vowels or consonants. The epiglottis, a leaf-shaped at the laryngopharynx entrance, contributes minimally to sound shaping but helps protect the airway during , indirectly supporting . The glottis, the space between the vocal folds within the , controls vibration for voicing; it narrows for voiced sounds via fold adduction or widens for voiceless ones, fundamentally altering . Anatomical variations in articulators, such as differences in length, palate concavity, or pharyngeal curvature, occur across individuals and populations, influencing precision and acoustic output; for instance, reduced palatal concavity can shift formants, requiring compensatory adjustments in positioning. These variations, explaining up to 46% of palatal shape differences and 78.5% of pharyngeal curvature variance, arise from genetic and developmental factors and affect sound production across languages. A sagittal view of the vocal tract illustrates these structures in a midline cross-section, showing the at the front, tongue body curving upward, alveolar ridge and forming the oral roof, velum and at the rear, pharyngeal wall descending to the , and at the base above the trachea; this perspective highlights how movements reshape the tract from to for sound filtering.

Larynx and Pharynx

The , often referred to as the voice box, is a cartilaginous structure located in the anterior at the level of the to vertebrae, serving as the primary for airflow initiation and modification in . It consists of several key cartilages that provide structural support: the , which forms the anterior and lateral walls and is prominent as the laryngeal prominence (); the , a ring-shaped structure inferior to the thyroid that anchors the larynx and connects to the trachea; and the paired arytenoid cartilages, which sit atop the cricoid and facilitate movement of the vocal folds through rotation and rocking motions. The vocal folds, central to the larynx, are paired structures comprising the vocal ligaments (elastic bands of fibrous tissue) covered by and supported by the vocalis muscle, allowing for tension adjustments during sound production. Intrinsic laryngeal muscles, such as the , tilt the forward relative to the cricoid to stretch and tense the vocal folds, thereby influencing pitch control in speech. The , a muscular tube extending from the to the level of the , lies immediately superior to the and forms the lower portion of the vocal tract, contributing to shaping and . Its walls are composed of a fibromuscular layer, including the posterior pharyngeal wall lined with mucosa, which can constrict to modify the acoustic properties of sound. The pharyngeal constrictor muscles—superior, middle, and inferior—encircle the and enable narrowing of its lumen; the superior constrictor originates from the pterygoid plate and , inserting into the pharyngeal , while the middle and inferior constrictors arise from the hyoid and laryngeal cartilages, respectively, allowing sequential contraction to adjust pharyngeal width for articulatory purposes. In addition to supporting airflow, the larynx performs critical protective functions during speech and swallowing, with the —a leaf-shaped —folding over the laryngeal inlet to close the airway and prevent of food or liquid. The and together provide a foundational chamber for , where the acts as an acoustic that amplifies and filters sounds generated below the oral cavity. Cross-linguistically, the plays a prominent role in producing pharyngeal sounds, as seen in , where pharyngeal fricatives like /ħ/ (voiceless) and /ʕ/ (voiced) involve retraction of the and toward the posterior pharyngeal wall, narrowing the lower pharynx to create turbulent airflow. Emphatic consonants in , such as /tˤ/ and /sˤ/, feature secondary pharyngeal constriction via body retraction and larynx raising, enhancing and distinguishing them from non-emphatic counterparts. These articulations highlight the pharynx's capacity for precise narrowing, which is less utilized in languages without such phonemes.

Airstream Mechanisms

Pulmonic Initiation

Pulmonic initiation refers to the in which for is generated by the lungs through the coordinated action of respiratory muscles. In the predominant pulmonic egressive variant, air is expelled outward from the lungs, driven by the contraction of the and the expansion of the , which decreases the thoracic volume and builds positive pressure below the . This process relies on the of the lungs and muscular effort to initiate and sustain through the vocal tract, forming the basis for the production of consonants and vowels in . While pulmonic egressive is the default mechanism, pulmonic ingressive airstream—where air is drawn inward by expanding the and relaxing the —is far rarer and typically serves auxiliary roles rather than primary . Examples include paralinguistic affirmatives like "yes" or "no" in Scandinavian languages such as and , or occasional accompaniments in click consonants within certain African languages, where it supplements the main velaric ingressive flow. These ingressive instances are not contrastive in most languages and are constrained by physiological limits on sustained inward . The efficiency of pulmonic initiation depends on subglottal pressure differentials, with typical values ranging from 5 to 10 cm H₂O during modal in conversational speech. This , created between the lungs and the atmosphere, directly influences : higher subglottal pressures increase airflow velocity and amplitude, resulting in louder , while lower pressures produce softer sounds. Pulmonic mechanisms power approximately 99% of consonants and vowels across the world's languages, underscoring their universal role in human .

Non-Pulmonic Initiation

Non-pulmonic initiation refers to the production of using mechanisms that do not rely on the lungs, instead employing enclosed cavities within the oral or laryngeal regions to generate through expansion or compression. These mechanisms are less common than pulmonic ones and are typically associated with in a minority of the world's languages, enabling unique phonetic contrasts. Glottalic mechanisms involve the movement of the to create pressure differences, with the closed to form a seal. Ejectives are produced via glottalic egressive , where the is raised to compress air behind an oral , followed by simultaneous release of the oral and glottal stops, resulting in a voiceless with no pulmonic involvement; for example, the velar ejective [kʼ] occurs in . Implosives, in contrast, use glottalic ingressive , where the is lowered to rarefy air behind the oral , producing a voiced upon release; an example is the bilabial implosive [ɓ̤] in . Both ejectives and implosives are represented in the with diacritics indicating , and they often feature acoustic correlates such as abrupt pressure releases in ejectives (manifesting as high-intensity bursts) and lowered in implosives due to laryngeal lowering. Velaric mechanisms, also known as lingual ingressive airstream, utilize the tongue to enclose and manipulate air in the oral cavity, independent of the larynx or lungs. Clicks are the primary sounds produced this way, involving two lingual closures: an anterior one (e.g., at the lips, teeth, or alveolar ridge) and a posterior one (typically velar), with the tongue body lowered to create suction before the anterior release produces an ingressive "pop." In Khoisan languages like !Xóõ, the alveolar click is transcribed as [ǃ] in the IPA, often accompanied by efflux notations for accompanying airstreams (e.g., tenuis [ǃ], voiced [ɡǃ], or nasal [ŋǃ]). Acoustically, clicks exhibit rarefaction bursts characterized by a sharp negative pressure spike followed by a transient noise, distinguishing them from pulmonic stops. Rarely attested epiglottal or pharyngeal initiations involve constriction or movement in the lower or region to generate localized , often in conjunction with glottalic features. plosives, such as [ʡʔ], occur in languages like Agul (a Lezgic language), where the forms a posterior with glottal sealing for egressive or ingressive effects. These sounds produce acoustic profiles with intense frication and lowering due to pharyngeal narrowing, but they remain marginal in global inventories.

Consonant Articulation

Places of Articulation

Places of articulation refer to the locations in the vocal tract where a consonant's primary constriction or closure occurs, typically along the midline from the lips to the . This classification is central to the , which organizes consonants horizontally by place in its chart, distinguishing the point of articulatory contact or narrowing that shapes the sound's acoustic properties. The active articulator is the movable organ, such as the tongue tip or lower lip, that approaches or contacts the passive articulator, the stationary structure like the upper teeth or alveolar ridge. Sagittal diagrams, which provide midsagittal cross-sections of the vocal tract, illustrate these positions and degrees of constriction, from complete closure in stops to narrow channels in fricatives. The major places of articulation, from front to back, are as follows:
  • Bilabial: The lips come together, with the lower lip (active) contacting the upper lip (passive); examples include /p/ and /b/ as in English "" and "."
  • Labiodental: The lower lip (active) approaches the upper teeth (passive); examples are /f/ and /v/ as in "" and "."
  • Dental: The tip or (active) contacts or nears the upper teeth (passive); examples include /θ/ and /ð/ as in "think" and "this."
  • Alveolar: The tip or (active) touches the behind the upper teeth (passive); common in English with /t/, /d/, /n/, /s/, /z/, /l/, and /ɹ/ as in "top," "dog," "no," "see," "zoo," "let," and "red."
  • Postalveolar: The (active) contacts the area just behind the (passive); examples are /ʃ/ and /ʒ/ as in "ship" and "measure."
  • Retroflex: The tip (active) curls back toward the postalveolar or palatal region (passive); examples include /ʈ/ and /ɖ/ found in languages like .
  • Palatal: The front or body (active) raises to the (passive); examples are /c/, /ɟ/, and /j/ as in some or English "yes" (/j/).
  • Velar: The back (active) contacts the or velum (passive); examples include /k/, /g/, and /ŋ/ as in "cat," "go," and "sing."
  • Uvular: The root or back (active) approaches the (passive); examples are /q/ and /ɢ/ in languages like or /ʁ/.
  • Pharyngeal: The root (active) constricts against the pharyngeal wall (passive); examples include /ħ/ and /ʕ/ in .
  • Glottal: Constriction occurs at the between the vocal folds (both active and passive); examples are /ʔ/ () as in English "uh-oh" and /h/ as in "."
Language inventories vary by place; for instance, English primarily utilizes bilabial, labiodental, dental, alveolar, postalveolar, palatal (limited to /j/), velar, and glottal places, but lacks uvular, pharyngeal, and retroflex in its standard . These places interact with manners of articulation to produce distinct sounds, while presupposing pulmonic mechanisms for most languages.

Manners of Articulation

Manners of articulation describe the ways in which the airstream is obstructed or modified by the articulators during the production of , focusing on the degree and type of in the vocal tract. These manners are distinguished by the extent of impedance, ranging from complete to minimal approximation, which influences the acoustic properties and perceptual distinctiveness of sounds. The primary classification divides into obstruents, characterized by significant obstruction leading to or pressure buildup, and sonorants, which allow relatively free with resonant qualities. Degrees of stricture refer to the varying levels of , from full (as in stops) to close approximation (as in fricatives) and open approximation (as in ), determining whether is blocked, turbulent, or smooth. Full halts the entirely, creating pressure differences that release as a burst; partial narrowing generates through ; and wider openings permit continuous, non-turbulent flow. This gradation underlies the articulatory and aerodynamic mechanisms that differentiate types across languages. Obstruents are consonants produced with a substantial obstruction to pulmonary airflow, resulting in lower sonority and often turbulent or explosive sounds. Stops, or plosives, involve complete of the vocal tract, building up oral pressure before a sudden release that produces a burst of sound, as in the bilabial /p/, alveolar /t/, and velar /k/. Fricatives feature a narrow that forces air through a small , creating audible , exemplified by the alveolar /s/, interdental /θ/, and labiodental /f/. Affricates combine a stop's with a fricative's prolonged release, forming a single gesture where the stop transitions into , such as in the postalveolar /tʃ/ and /dʒ/. Some may exhibit mild if the approximation is close enough, though they generally avoid . Sonorants, in contrast, are produced with a relatively open vocal tract configuration that allows continuous, resonant airflow, yielding higher sonority suitable for syllabic roles. Nasals achieve this by lowering the velum to divert airflow through the nasal cavity while maintaining oral closure, as in the bilabial /m/, alveolar /n/, and velar /ŋ/. Laterals permit airflow along the sides of the tongue past a central obstruction, producing non-turbulent sounds like the alveolar /l/. Taps or flaps involve a brief, single contact of the tongue against the alveolar ridge, creating momentary interruption without full closure, as in the alveolar /ɾ/. Trills feature rapid vibration of an articulator against another surface due to Bernoulli forces, such as the alveolar /r/ or uvular /ʀ/. Certain co-occurrence patterns in manners arise from articulatory and aerodynamic constraints; for instance, voiced obstruents require precise control of subglottal pressure and glottal adduction to sustain voicing during , a feature prominent in languages with robust voicing distinctions.

Vowel Articulation

Tongue and Jaw Positions

In articulatory phonetics, the primary dimensions of vowel quality are determined by the and frontness/backness of the , along with the degree of opening. refers to the vertical position of the relative to the of the , categorized as high, mid, or low. High vowels, such as as in "beat," involve the raised close to the with minimal opening, while low vowels like [ɑ] in "cot" feature the lowered toward the floor of the and a widely opened . Mid vowels, exemplified by in "bait," occupy an intermediate position with moderate depression. The horizontal positioning of the tongue—front, central, or back—further shapes vowel articulation. Front vowels, such as and [æ], are produced with the front of the advanced toward the , whereas back vowels like in "boot" involve the body retracted toward the . , including [ʌ] as in "up," position the in a neutral, midway location. These configurations form the basis of the vowel triangle (or trapezoid), a schematic representation in the (IPA) that maps vowels by tongue height and backness, with at the high-front corner and [ɑ] at the low-back. Tense and lax distinctions refine these positions, particularly in languages like English. Tense vowels, such as , are articulated with greater muscular tension and tongue bunching relative to the jaw, resulting in a more peripheral position in the vowel space and often longer duration. In contrast, lax vowels like [ɪ] in "bit" exhibit reduced tension, a less raised or advanced tongue, and a more centralized quality without extreme bunching. This difference highlights how subtle variations in tongue configuration influence vowel perception and production. The IPA's cardinal vowels provide standardized reference points for these articulations, defined by extreme tongue positions without influence from specific languages. For instance, cardinal vowel 1 is a high front unrounded vowel with the tongue maximally raised and fronted, while cardinal vowel 5 [ɑ] is low back unrounded with the tongue low and retracted. These eight primary cardinals (plus central approximations) anchor the vowel chart, allowing linguists to describe other s relative to them using diacritics for fine adjustments. Acoustically, these positions correlate with formant frequencies: the first formant (F1) inversely relates to tongue height (higher F1 for lower vowels), and the second formant () inversely relates to tongue backness (higher for fronter vowels). Cross-linguistically, tongue positions underpin patterns like , as in Turkish, where vowels within a word must agree in frontness/backness to maintain consistent tongue advancement. Front vowels (, , [ø], ) require the tongue positioned anteriorly, while back vowels (, [ɯ], , ) demand posterior placement, ensuring harmonic sequences without shifts in tongue locus. This system exemplifies how articulatory constraints on tongue movement promote phonological uniformity.

Lip and Velum Configurations

In vowel articulation, lip configurations play a crucial role in modifying the acoustic quality and of sounds through secondary adjustments. Lip rounding, or , involves the protrusion and pursing of the , which narrows the oral cavity's front opening and lowers frequencies, particularly the second (F2). This feature distinguishes rounded vowels from their unrounded counterparts; for instance, the high front unrounded vowel /i/ (as in English "see") features , where the corners of the mouth are drawn back without protrusion, maintaining a wider oral . In contrast, the high back rounded vowel /u/ (as in English "boot") exhibits protruded , forming a more constricted front channel that enhances resonance. Neutral lip positions, without significant spreading or rounding, occur in mid-central vowels like /ə/, providing a for minimal labial influence. Labialization effects vary by vowel height and backness, often coarticulating with position to create perceptual contrasts. For front s, shifts the spectral envelope toward qualities, as seen in languages with rounded front vowels; in , the high front rounded /yː/ (as in "") involves protrusion combined with a front position, distinguishing it from the unrounded /iː/ (as in "ihre") and contributing to a lower around 1800-2200 Hz compared to 2000-2500 Hz for /iː/. This configuration exemplifies how on front s, absent in English, alters vowel inventories by compressing the front-back dimension acoustically. Back vowels typically pair with , but excessive protrusion can influence adjacent segments through anticipatory coarticulation. The velum, or , further modulates articulation by controlling airflow partitioning between oral and nasal cavities. In oral s, the velum remains raised, sealing the nasal to direct airflow exclusively through the mouth, preserving pure oral resonance. For nasalized s, the velum lowers partially or fully, allowing nasal airflow and introducing anti-formants that reduce oral formant amplitudes, particularly affecting lower frequencies. While velar lowering is prototypical for nasal consonants, it extends to phonemic nasal s, where the velum's position creates a coupled oral-nasal ; in , nasal s like /ã/ (as in "mão") involve velum lowering during the vowel nucleus, distinct from oral /a/. European Portuguese nasal s similarly show velar gestures timed to vowel onset, enhancing nasal without full consonant-like closure. Jaw position synergizes with lip configurations to facilitate smooth transitions in dynamic vowel sequences, such as diphthongs and rhotics. In diphthongs like English /aɪ/ ("buy"), the jaw lowers progressively while lips adjust from neutral to slightly spread, coordinating with tongue gliding to maintain articulatory fluidity and prevent abrupt formant shifts. For rhotics, such as /ɹ/, lip rounding often co-occurs with jaw retraction, combining with lowered jaw height to bunch the tongue root, producing a retroflexed quality with lowered around 1500 Hz. This lip-jaw coupling ensures gestural overlap, minimizing perceptual discontinuities in .

Phonation and Resonance

Voicing Mechanisms

Voicing in articulatory phonetics refers to the of the vocal folds during the production of sounds, primarily supported by pulmonic mechanisms that provide subglottal pressure. The myoelastic aerodynamic explains this as an interaction between the elastic properties of the laryngeal muscles and s and the aerodynamic forces generated by from the lungs. According to this , begins with the adduction of the vocal folds, mediated by muscles such as the lateral cricoarytenoid and thyroarytenoid, which approximate the folds to close the partially or fully. As subglottal pressure builds and forces the folds apart, through the narrowed accelerates, creating a of via the Bernoulli effect that draws the folds back together, initiating cyclic . This is sustained by the propagation of a mucosal wave along the superior surface of the vocal folds, where the body-cover structure allows the inferior portions to close first, followed by a traveling wave that facilitates efficient energy transfer from to . In modal phonation, the typical voiced mode used in everyday speech, the vocal folds vibrate regularly at frequencies of approximately 100-200 Hz, varying by speaker sex, age, and other factors, producing a clear, periodic sound source. This mode features balanced adduction and tension, with the mucosal wave propagating smoothly to achieve efficient closure and minimal air escape. Breathy phonation, in contrast, involves looser vocal fold approximation, resulting in incomplete closure and significant airflow leakage alongside vibration, which adds turbulent noise to the sound. For example, the Hindi glottal fricative /ɦ/ exemplifies breathy phonation, where the vocal folds vibrate irregularly with a breathy quality due to this lax closure. Creaky phonation features stiffer vocal fold vibration with tighter closure and irregular, low-frequency pulses, often described as a "vocal fry" quality due to the thickened and tensed folds vibrating aperiodically. A notable example is the Danish stød, a prosodic feature realized as creaky phonation in specific syllables, involving reduced airflow, prolonged closure, and decreased fundamental frequency compared to modal voicing. Voiceless sounds are produced when the vocal folds are abducted by the posterior cricoarytenoid muscles, widening the to allow unimpeded without , often resulting in turbulent noise if the passage is sufficiently constricted elsewhere in the vocal tract. This contrasts with voiced production by eliminating the periodic pulsation, yielding a continuous stream. Intrinsic pitch control during voicing is primarily achieved through the , which tilts the forward relative to the cricoid, elongating and tensing the vocal folds to raise the . This mechanism allows fine adjustments in pitch independent of overall or , essential for intonation in speech.

Airflow and Resonance Properties

In articulatory phonetics, the glottal serves as the primary sound source that interacts with the vocal tract to produce . For voiced segments, the vocal folds vibrate to create a quasi-periodic pulse train, characterized by regular pulses at the (typically 100-200 Hz for adult males), which generates harmonics that are subsequently filtered by the vocal tract. In contrast, voiceless sounds arise from turbulent without vocal fold vibration, producing an aperiodic noise source with a broad, flat that lacks distinct periodicity. This distinction in source characteristics fundamentally shapes the acoustic output before resonance effects are applied. The vocal tract acts as an acoustic filter that modifies the glottal source signal through resonance, amplifying specific frequencies known as while attenuating others. represent the resonant peaks of the vocal tract's , determined by its shape and length (approximately 17 cm in adults), and they define the perceptual quality of vowels and . For instance, the first formant (F1) typically ranges from 500-800 Hz in open vowels like /a/, correlating with greater vocal tract that lowers the resonant frequency. Higher formants (, ) further distinguish vowel identities based on position, with the tract's filtering emphasizing harmonics near these resonances to create audible speech . Supraglottal coupling refers to the acoustic interactions between the laryngeal source and the pharyngeal/oral cavities above the , where adjustments in tract configuration influence sound quality and intensity. Pharyngeal widening or oral cavity shaping can couple additional resonances, enhancing or shifting positions to alter , such as in emphatic consonants where expanded pharyngeal space lowers frequencies for a darker . These couplings ensure efficient energy transfer from the glottal source to the radiated sound, with nonlinear interactions occasionally introducing subharmonics or spectral tilts that contribute to expressive variations in speech. Nasal resonance differs markedly from oral resonance due to the involvement of the , accessed via velum lowering, which introduces additional but also anti-formants—spectral zeros that attenuate energy at specific frequencies. In nasal sounds like /m/ or nasalized vowels, the oral tract's side branch creates anti-resonances around 200-500 Hz and 1000-2000 Hz, resulting in a muffled with reduced oral prominence compared to purely oral sounds. This contrast highlights how airflow routing through nasal versus oral paths shapes distinct acoustic profiles, with nasals exhibiting broader, lower-frequency resonances due to the longer nasal tract length.

Dynamic Articulation

Coarticulation Effects

Coarticulation refers to the influence of one speech sound on the of adjacent sounds, resulting in overlapping gestures that modify the production of individual segments. This dynamic interaction occurs because speech is produced as a continuous stream rather than isolated units, leading to articulatory adjustments that anticipate or carry over features from neighboring sounds. Anticipatory coarticulation, also known as right-to-left coarticulation, involves a sound being influenced by an upcoming , where the articulators prepare in advance for sound. A classic example is the production of the velar stop /k/ in the English word "," where lip rounding—typically a feature of the following high /uː/—begins during the , advancing and labializing the /k/ gesture. Perseveratory coarticulation, or left-to-right coarticulation, occurs when the features of a preceding sound persist into the following , such as the carryover of from a into a subsequent . These effects demonstrate how articulatory extends beyond segmental boundaries, creating fluid transitions in . One acoustic manifestation of coarticulation is captured through locus equations, which quantify formant transitions between consonants and vowels in consonant-vowel () contexts. Specifically, locus equations model the relationship between the second formant frequency () at the onset of a vowel and its steady-state value, with steeper slopes indicating greater anticipatory coarticulation for places of articulation that overlap more with vowels, such as alveolar stops. These equations provide a for the degree of articulatory , where higher coarticulation correlates with more centralized transitions. The extent of coarticulation varies with speech rate and language-specific phonologies; it intensifies in faster speech due to increased gestural overlap, reducing the temporal separation between segments. Cross-linguistically, English exhibits more extensive anticipatory coarticulation compared to , where vowel-to-consonant effects are weaker, reflecting differences in phonological structure and prosodic timing. For instance, Japanese speakers show reduced C-to-V coarticulatory influence on formants relative to English speakers. These coarticulatory effects contribute to phoneme variability by generating context-dependent realizations that can lead to allophonic variation, where the same phoneme appears in multiple forms without altering meaning. In English, for example, the velar /k/ varies from fronted in "" to backed and rounded in "" due to vowel anticipation, illustrating how coarticulation underlies allophonic diversity and challenges strict segmental isolation in phonetic descriptions. Such variability enhances speech efficiency but also informs models of phonological contrast maintenance.

Timing and Gestures

In articulatory phonetics, the temporal organization of speech production is captured through gestural models that emphasize the dynamic coordination of articulatory movements. The task-dynamic model, developed at Haskins Laboratories, treats speech as a system of coupled oscillators where gestures represent functional units of action aimed at achieving specific vocal tract configurations. These gestures include constrictor gestures, which narrow the vocal tract for consonants (e.g., tongue body constriction for /k/), and opener gestures, which widen it for vowels (e.g., jaw lowering coupled with tongue body retraction for /ɑ/). In this framework, each gesture is governed by dynamical equations that specify activation intervals, stiffness parameters, and coupling relations, allowing for flexible yet stable movement patterns across contexts. Recent computational implementations, such as the DYNARTmo model developed in 2025, extend task-dynamic approaches by generating continuous articulator movements from gesture scores, bridging phonological planning to biomechanical realization. Gestural overlap and phasing are central to understanding asynchrony in speech timing, visualized through gestural scores that depict s as overlapping temporal intervals rather than sequential segments. For instance, in nasal stops like /m/, the velum-lowering (an opener for nasal ) initiates before the lip-closing constrictor , ensuring nasal precedes full oral and avoids unwanted oral . This phasing, often in-phase (onsets aligned) for consonants within onsets or anti-phase (staggered by 180 degrees) for consonant-vowel sequences, arises from nonlinear in the task-dynamic system, promoting robustness in rapid articulation. Such overlaps exemplify how s are not rigidly timed but dynamically adjusted to maintain recoverability of phonetic contrasts. Speech rate variations significantly influence gestural timing, with faster rates leading to temporal through increased overlap and reduced gesture durations without proportional of all intervals. In rapid speech, constrictor gestures exhibit shortened activation intervals and heightened overlap between adjacent units, such as greater intrusion of openers into flanking consonants, which preserves perceptual cues while minimizing articulatory effort. This is modeled as adjustments to gestural and oscillator frequencies in task-dynamic simulations, where higher rates amplify phasing asynchronies but rarely disrupt core coordination modes. Prosodic structure modulates gestural timing, particularly through lengthening effects in stressed syllables that extend articulatory gestures beyond segmental defaults. Stressed syllables feature prolonged constrictor and opener gestures, with increased movement amplitude and velocity for tongue and jaw, enhancing acoustic prominence while maintaining gestural phasing. In task-dynamic terms, this is achieved via prosodic gestures—higher-level modulatory activations that slow local clock speeds, thereby stretching activation intervals for affiliated phonetic gestures without altering their intrinsic dynamics.

Experimental Methods

Contact and Pressure Techniques

Contact and pressure techniques in articulatory phonetics involve methods that directly measure physical interactions between articulators, such as -palate contacts and air pressure variations within the vocal tract, to understand mechanisms. These approaches originated in the with the invention of palatography, a technique that visualizes patterns on the through the application of powders or tracings, allowing researchers to map articulation points for consonants like alveolars and palatals. Early variants included static palatography, where speakers articulated sounds after coating the with a dark substance, revealing imprints upon examination, and indirect methods using external tracings to infer movements. By the early , these evolved into dynamic forms to capture timing, culminating in digital electropalatography (EPG) systems from the onward, which provided and marked a shift from qualitative tracings to . Electropalatography represents a key advancement in contact detection, employing custom-fitted artificial embedded with s—typically 62 in the influential Reading EPG system developed in the —to the precise location and duration of -hard palate interactions during speech. When the tongue touches an , it completes an electrical , generating spatio-temporal maps of contact patterns that reveal articulatory gestures, such as the central groove in or lateral contacts in laterals. This technique has been instrumental in studying coarticulatory effects and atypical articulations, offering higher resolution than traditional palatography by capturing multiple cycles of speech in without relying on post-articulation imprints. Aerodynamic measures complement contact techniques by quantifying air and dynamics essential for production, particularly stops and fricatives. Intraoral , recorded via transducers inserted into the , rises sharply during closures for plosives due to subglottal air buildup, reaching plateaus of several cmH₂O before release, whereas implosives exhibit negative pressures from velum lowering and pulmonic , distinguishing ingressive from egressive . is simultaneously assessed using a pneumotachograph, a or tube connected to a that measures oral or nasal rates in liters per second, enabling calculations of and distinguishing nasal from oral during velar movements. These metrics provide insights into the physiological demands of , such as the differentials driving voicing contrasts. Nasal endoscopy, or nasoendoscopy, offers direct observation of velum elevation and with the pharyngeal walls to control nasal during oral . A flexible inserted through the transmits video images of the velopharyngeal port, revealing closure patterns where the rises to meet the posterior pharyngeal wall, preventing nasal emission in non-nasals. This technique captures dynamic velar gestures in real time, such as the rapid lowering for nasals like /m/ or /n/, and has been used to differentiate adequate from insufficient velopharyngeal function in clinical . By focusing on pressure-mediated sealing at anatomical targets like the velum, it bridges with outcomes without invasive sensors.

Imaging and Modeling Approaches

Real-time (RT-MRI) has emerged as a pivotal non-invasive technique for visualizing the dynamic movements of the vocal tract during , enabling the capture of midsagittal views at frame rates typically ranging from 20 to 80 frames per second for dynamic sequences. This method provides comprehensive anatomical details, including , , and velum positions, without requiring physical contact, and has been instrumental in studying coarticulatory effects in running speech. Early applications in the focused on static imaging, but advancements in spiral sampling have facilitated real-time acquisition suitable for phonetic analysis. Ultrasound tongue imaging offers an affordable, portable alternative for tracking tongue surface contours, particularly in coronal and sagittal planes, using a submental probe positioned beneath the chin to reflect off the . This technique excels in capturing rapid movements for lingual consonants and vowels, with resolutions sufficient to distinguish subtle shape variations, and has been widely adopted in clinical for its non-ionizing nature and ease of synchronization with audio recordings. Limitations include partial obscuration by the shadow, but it remains a standard for longitudinal studies of articulatory patterns in diverse populations. Electromagnetic articulography () measures three-dimensional trajectories of articulators by embedding small receiver coils on the , , and , which track positions relative to a fixed transmitter array at high sampling rates, often up to 400 Hz. This allows precise quantification of spatiotemporal coordination in gestures, such as for alveolar stops, and supports validation of biomechanical models. systems, refined since the 1990s, provide millimeter accuracy in Cartesian coordinates, making them ideal for investigating inter-articulator dependencies in phonetic research. Articulatory synthesis models simulate vocal tract configurations from gestural inputs to generate speech acoustics, bridging empirical data with theoretical predictions. The GEST framework, based on gestural , models overlapping articulatory gestures as constrictors with temporal dynamics, enabling synthesis of naturalistic transitions between sounds. Complementarily, the ASY model from Haskins Laboratories represents the midsagittal vocal tract as a area function driven by positions, facilitating controlled experiments on trajectories. These tools have advanced understanding of inverse mapping from acoustics to gestures, though they require calibration against imaging data for realism. Since the 2000s, AI-enhanced analysis has transformed articulatory phonetics by automating feature extraction from imaging modalities, such as networks for segmenting contours in videos with over 90% accuracy. In RT-MRI, diffusion models guided by speech audio reconstruct high-fidelity articulator motions, improving beyond traditional limits. integration with data has also enabled predictive modeling of unseen trajectories, enhancing cross-linguistic comparisons and clinical applications.

References

  1. [1]
    [PDF] Phonetics - Stanford University
    25.2 Articulatory Phonetics. Articulatory phonetics is the study of how these phones are produced as the various articulatory phonetics organs in the mouth ...
  2. [2]
    [PDF] Week 1: Articulatory Phonetics (Ladefoged 2001) - BMCC OpenLab
    Aug 16, 2016 · Three main subfields of phonetics: 1 articulatory phonetics: how speech sounds are made . (eg., articulator placement, vocal fold configuration ...
  3. [3]
    [PDF] Phonetics: The Sounds of Language
    – Auditory phonetics: focuses on how listeners perceive the sounds of language. – Articulatory phonetics: focuses on how the vocal tract produces the sounds ...Missing: sources | Show results with:sources
  4. [4]
    [PDF] 1 Articulatory System - UCI Open
    All of the body parts that we use to produce speech sounds are called the articulatory system. Teachers need to understand how the articulatory system works so ...
  5. [5]
    2.3: Speech articulators
    ### Sagittal View of Vocal Tract and Articulators
  6. [6]
    [PDF] Morphological Variation in the Adult Vocal Tract: A Modeling Study ...
    In order to fully understand inter-speaker variability in the acoustical and articulatory domains, morphological variability must be considered, as well.
  7. [7]
    Larynx (Voice Box): Anatomy & Function - Cleveland Clinic
    Your larynx is a hollow tube in the middle of your neck, just above your trachea (windpipe) and esophagus. It makes it possible for you to make sounds.Vocal cords · Glottis · Trachea
  8. [8]
    Anatomy, Head and Neck, Pharynx - StatPearls - NCBI Bookshelf
    The pharynx is a conductive structure located in the midline of the neck. It is the main structure, in addition to the oral cavity, shared by two organ systems.
  9. [9]
    [PDF] The Phonetics and Phonology of Semitic Pharyngeals1
    To provide a secure basis for the subsequent discussion, I will begin by reviewing the abundant phonetic evidence for the properties of gutturals in Arabic, a ...
  10. [10]
    [PDF] English Phonetics and Phonology - Glossary - Peter Roach
    The most common way of moving air is by compression of the lungs so that the air is expelled through the vocal tract. This is called a pulmonic airstream ( ...
  11. [11]
    (PDF) AIRSTREAM MECHANISMS - Academia.edu
    The most common - and most effective – way is that in which lung air is pushed through the vocal tract in the respiratory mechanism, or breathing process.
  12. [12]
    Linguistic aspects of respiratory phenomena - ResearchGate
    Aug 6, 2025 · The dynamics of the pulmonic initiatory movement is determined by elastic recoil force and respiratory muscular effort (Ladefoged, 1968) . The ...
  13. [13]
    [PDF] Pulmonic ingressive speech: a neglected universal? - Robert Eklund's
    Jun 1, 2007 · While most ingressive speech occurs on single, paralinguistic feedback words, entire phrases or sentences are reported in Finnish, Faroese and.<|control11|><|separator|>
  14. [14]
    Pulmonic ingressive phonation: Diachronic and synchronic ...
    Nearly all of the world's languages are made with the pulmonic airstream mechanism, in which lung air is pushed through the vocal tract by the action of the ...
  15. [15]
    Evaluation of Visual Feedback for fo and SPL in Subglottal Pressure ...
    Sep 4, 2025 · According to this theoretical model, the PTP ranges between 0.1 and 1.0 kPa (approximately 1-10 cmH2O). The reference values for healthy voices ...
  16. [16]
    Loudness, Sound Pressure, and Subglottal Pressure in Speech
    It follows that (rate of work done on the air) ∝ (subglottal pressure)2 ∝ (effective sound pressure)1.2. The exponent 1.2 compares well with the exponent 1.1 ...
  17. [17]
    Consonants (Chapter 3) - The Cambridge Handbook of Phonetics
    This chapter provides an overview of the phonetic properties of consonants in the world's languages and key issues in the phonetic description of consonants.<|control11|><|separator|>
  18. [18]
    88 Journal ottheInternational Phonetic Association J.C.CATFORD ...
    One relates to initiation, one to phonation and one to coarticulation. Regarding the account ofthe ingressive glottalic airstream mechanism, Katrina Hayward.
  19. [19]
  20. [20]
    Clicks, concurrency and Khoisan* | Phonology | Cambridge Core
    May 20, 2014 · Traditionally, clicks are assigned a velaric airstream mechanism, and placed in a separate section of the International Phonetic Alphabet chart ...
  21. [21]
    [PDF] THE INTERNATIONAL PHONETIC ALPHABET (revised to 2015)
    Shaded areas denote articulations judged impossible. CONSONANTS (NON-PULMONIC). Clicks. Voiced implosives. Ejectives. Bilabial. Bilabial. Examples: Dental.
  22. [22]
    [PDF] Articulatory Phonetics and the International Phonetic Alphabet
    “Place of articulation” refers to the location where the oral tract is constricted in order to make the phone. ▷ In English, there are 7 places of articulation:.Missing: articulators anatomy<|control11|><|separator|>
  23. [23]
    IPA Chart
    Areas shaded grey indicate articulations judged impossible. Bilabial, Labiodental, Dental, Alveolar, Post alveolar, Retroflex, Palatal, Velar, Uvular ...
  24. [24]
    3.4 Describing consonants: Manner – ENG 200
    Oral stops, fricatives, and affricates together form the class of obstruents, which are defined by having an overall significant obstruction to free airflow in ...
  25. [25]
    4.4 Natural Classes – Essential of Linguistics
    Natural classes group sounds sharing features, like obstruents, sonorants, glides, and vowels. These classes behave similarly in language grammar.
  26. [26]
    [PDF] 4 Phonetics and Phonology
    An articulatory description of a vowel must include all of these features. Tongue height. Pronounce the words eat and at. Now pronounce just the vowels of these.Missing: scholarly | Show results with:scholarly
  27. [27]
    [PDF] Articulatory Phonetics and the International Phonetic Alphabet
    describing the articulatory phonetics of a vowel: Tongue Height. (how close to roof of the mouth). Tongue Backness. (how far back in mouth). Lip Rounding.
  28. [28]
    [PDF] Vowels - Columbia CS
    • Cardinal vowels are positioned at extreme edges of the vowel ... Tongue position for some back vowels. From Ladefoged, 2001 who'd [u] hood ...
  29. [29]
    Vowel Theories
    Other parameters along which vowels can contrast: high vs. low jaw position; tense vs. lax tongue shape (bunching of tongue with respect to jaw); round vs ...
  30. [30]
    Cardinal Vowels
    These consist of four vowels produced at each extremity of the vowel producing area: [i], [a], [ɑ] and [u], plus four in intermediate positions which sound ...
  31. [31]
    A comparison of acoustic and articulatory methods for analyzing ...
    Jul 31, 2017 · One common assumption is that the first formant (F1) of a vowel is inversely correlated with tongue height; another is that the second formant ( ...
  32. [32]
    [PDF] Vowel Harmony in Turkish Language and Palestinian Arabic - mecsj
    May 12, 2015 · Open and close vowels describe the position of the tongue when it is in the aperture of the jaw. Vowel backness also refers to the position of ...
  33. [33]
    2.7 Classifying Vowels – Essential of Linguistics
    We also classify vowels according to whether the lips are rounded or unrounded. In Canadian English, there are only four vowels that have lip rounding, and ...
  34. [34]
    Physiology of the lips and modelingof lip gestures - ScienceDirect.com
    The lips are one of the articulators for producing speech sounds, and also contribute to the generation of phonetic information to be visually perceived by ...
  35. [35]
    [PDF] Articulatory Analysis of the German Vowel System
    The abscissa arranges the vowels in three groups (from left to right: front unrounded, front rounded, back). Within each group, tense-lax pairs are adjacent to ...
  36. [36]
    [PDF] The phonetics of German vowels
    A front round vowel has the same constriction(s) as the corresponding front unrounded vowel, plus an additional constriction at the lips. • German has two pairs ...
  37. [37]
    Oral configurations during vowel nasalization in English - PMC
    Feb 28, 2021 · We conclude that speakers of American English do indeed use different oral configurations for vowels that are in nasal and oral contexts.
  38. [38]
    Managing the distinctiveness of phonemic nasal vowels: Articulatory ...
    Jan 13, 2012 · This paper uses simultaneously collected articulatory, acoustic, and nasal airflow data during the production of all oral and phonemically nasal vowels in ...
  39. [39]
    [PDF] European Portuguese Nasal Vowels: An EMMA Study - ISCA Archive
    The use of velum and other articulators variation in time contributes to an improved naturalness of the articulatory synthesizer nasal vowels [2].
  40. [40]
    Lip-jaw and tongue-jaw coordination during rate-controlled syllable ...
    Aug 6, 2025 · The present study investigated the relationship between functionally relevant compound gestures and single-articulator component movements ...
  41. [41]
    Articulatory–acoustic kinematics: The production of American ...
    This study documents in detail how jaw, tongue front, tongue back, lips, and the first spectral moment covary during the production of ∕‐∕, to establish how ...
  42. [42]
    Mechanics of human voice production and control - PMC
    According to his myoelastic-aerodynamic theory of voice production, phonation starts from complete adduction of the vocal folds to close the glottis, which ...
  43. [43]
  44. [44]
    Some observations on aspiration and glottal fricatives in languages ...
    Builds on a typology of languages with voiceless vs. voiceless and breathy voiced aspiration. •. Introduces the processes of transposition of h/ɦ, transposition ...
  45. [45]
    Phonetic analysis of the stød in standard Danish - PubMed
    This paper presents an acoustic and physiological analysis of the Danish 'stød' with an introduction on its phonological role and distribution.
  46. [46]
    Phonation - an overview | ScienceDirect Topics
    Breath, producing voiceless sounds: the glottis is open (vocal folds abducted), so that the airstream passes through with minimal obstruction. This is the ...
  47. [47]
    Anatomy, Head and Neck: Laryngeal Muscles - StatPearls - NCBI
    This muscle displaces the thyroid cartilage downward and tenses the vocal cords when contracted. The cricothyroid is the only intrinsic laryngeal muscle that ...
  48. [48]
    [PDF] Source-Filter Model of Speech Production - MIT OpenCourseWare
    The amplitude in the output wave of each frequency component of the source depends on the extent to which the air in the vocal tract resonates at that frequency ...
  49. [49]
    [PDF] Fundamental Frequency Characteristics of Modal and Vocal Fry ...
    Dec 12, 2016 · Voiceless sounds are created by an aperiodic noise source generated by airflow through a contraction in the vocal tract, without vocal fold ...<|separator|>
  50. [50]
    Vowel Sounds - HyperPhysics
    The vocal resonances are altered by the articulators to form distinguishable vowel sounds. The peaks in the vowel spectra are called vocal formants.
  51. [51]
    [PDF] Application of nonlinear dynamics theory to understanding normal ...
    Apr 3, 2025 · The formants of the supraglottal tract not only play a major role in determining the acoustic quality of the produced speech sound, ...
  52. [52]
    [PDF] Source-Vocal Tract Interaction in Female Operatic Singing and ...
    The supraglottal formants are located where the iner- tance suddenly collapses to zero and the subglottal for- mant frequencies are where the compliance begins ...
  53. [53]
    Resonance Effects and the Vocalization of Speech - PMC
    Acoustical reflections from the oral and nasal cavities can interact destructively within the vocal tract and create anti-resonances (spectral zeroes) at ...
  54. [54]
    [PDF] HCS 7367 Speech Perception
    antiformants and appear as additional peaks and valleys in the spectrum of a nasal vowel. Oral vs. nasal vowels. Oral vowel /ɑ/. Nasal vowel /ɑ/˜.
  55. [55]
    [PDF] Coarticulation and Phonology - UC Berkeley Linguistics
    Another example of a phonological process commonly attributed to coarticulatory factors is the case of heterorganic consonant clusters in which there is ...
  56. [56]
    (PDF) Coarticulation - ResearchGate
    Aug 27, 2017 · In this chapter we provide a general introduction to coarticulation, to the techniques used to measure it, and to the major theoretical and experimental ...
  57. [57]
    Coarticulation (Chapter 4) - The Cambridge Handbook of Phonetics
    Nov 11, 2021 · This process of intermingling of information about several speech sounds in the articulatory and acoustic signals is called coarticulation.
  58. [58]
    Locus equations are an acoustic expression of articulator synergy
    One hypothesis is that locus equations indicate degree of coarticulation between a consonant and a following vowel (Krull, 1987, 1989; Fowler, 1994).
  59. [59]
    [PDF] COMPARISON OF CROSS-LANGUAGE COARTICULATION
    Three talker groups were considered: native talkers of Australian. English (AE), L2 English talkers (L1 Japanese, hence, JE) and native Japanese talkers (J) ...
  60. [60]
    [PDF] the task dynamic model in speech production - Haskins Laboratories
    The task-dynamic model of gestural control provides for such specificity and flexibility of articulatory behavior. This work has been the result of.
  61. [61]
    [PDF] An introduction to task dynamics - SARAH HAWKINS
    The task-dynamic model governs how the action of the articula- tors is coordinated within a single gesture, making use of the articulator weightings.Missing: constrictor opener
  62. [62]
    [PDF] Tutorial on task dynamics and articulatory phonology Colleagues
    Oct 2, 2010 · □ A given gesture's constriction-dynamics influence vocal tract activity for a discrete interval of time. – Activation interval.Missing: constrictor opener
  63. [63]
    [PDF] Gestural Syllable Position Effects in American English
    That is, for the nasal [m], the velum-lowering gesture has a wider constriction than the labial closing gesture, and this wider gesture precedes the ...
  64. [64]
    Articulatory gestures are individually selected in production
    ▻ Selection of articulatory gestures is independent of selection of phonetic segments. ... The velum opening gesture begins before the onset of the oral gesture ...Missing: opener | Show results with:opener
  65. [65]
    [PDF] Speaking Rate Effects on Normal Aspects of Articulation
    Jul 1, 2011 · The purpose of this paper is to review the literature pertaining to the articulatory effects of speaking rate change and present a discussion of ...
  66. [66]
    Gestural overlap in consonant clusters: effects on the fluent speech ...
    These results suggest that the production of two consonants with the same place of articulation across a syllable boundary puts higher demands on motor planning ...
  67. [67]
    Effects of syllable stress on articulatory planning observed in a stop ...
    Articulatory gestures in stressed syllables, compared to those in unstressed ones, generally exhibit greater movement range, increased duration, and greater ...
  68. [68]
    [PDF] Exploring the effect of stress on gestural coordination
    These findings suggest that stress itself can be analyzed as a prosodic gesture–since a prosodic gesture slows down the internal clock used for articulatory ...
  69. [69]
    On the history of palatography in Hungarian phonetics
    Nov 3, 2021 · This paper outlines the evolution of this revolutionary experimental technique, discusses its two types, direct (static vs. natural) and indirect palatography.
  70. [70]
    [PDF] Why do palatographic data have to be taken seriously? - ISCA Archive
    May 17, 2024 · Techniques of palatography​​ Since the invention of palatography in the 19th century, several variants of the technique have been used for ...
  71. [71]
    A history of EPG - Articulate Instruments
    EPG evolved from dynamic palatography, with early static palatometry. Electropalatography began in 1938, with modern systems developed in the 1960s and 70s.Missing: articulatory device
  72. [72]
    The Use of Electropalatography in Phonetic Research | Phonetica
    Nov 12, 2009 · The instrumental technique of electropalatography (EPG) provides real-time spatio-temporal information on tongue-palate contacts during ...Missing: device | Show results with:device
  73. [73]
    Full article: EPG research and therapy: further developments
    Previous EPG systems included the Dynamic Palatograph (DP; Rion Ltd., Tokyo, Japan) that was used mainly in Japan in the 1970–80s, the WinEPGTM (Articulate ...
  74. [74]
    The technology of tongue and hard palate contact detection: a review
    Feb 6, 2021 · However, a technique was introduced in the year 1970 to identify tongue and hard palate location known as electropalatography (EPG). EPG ...
  75. [75]
    [PDF] Intraoral air-pressure in speech
    Intraoral air pressure curves show how articulations are made. For /p/, a sharp rise indicates lip closure, and a plateau relates to subglottal pressure. ...
  76. [76]
    Air Pressure Responses to Sudden Vocal Tract Pressure Bleeds ...
    The pressure transducer and pneumotachograph were calibrated with a manometer and rotometer at 6 cm H2O and 250 ml/s, respectively. The pressure transducer was ...
  77. [77]
    Stop consonant voicing and intraoral pressure contours in women ...
    This study explored the relationship between stop consonant voicing and intraoral pressure contours in women, 5 year olds, and 10 year olds.
  78. [78]
    Comparison of videonasoendoscopy and auditory-perceptual ... - NIH
    Videonasoendoscopy is one of the most widely used methods for studying velopharyngeal function, allowing visualization of the nasal, pharyngeal, and laryngeal ...Missing: articulatory | Show results with:articulatory
  79. [79]
    Velopharyngeal Function - an overview | ScienceDirect Topics
    Nasendoscopy provides a two-dimensional dynamic view of the velopharyngeal port during speech by passing a flexible endoscope through the nasal cavity, with the ...
  80. [80]
    The Palatal Closure Efficiency (PaCE) Index: A New Speaker ... - NIH
    The nasal airflow tube is coupled to a heated pneumotachograph and flow transducer. A microphone is used to record the speech signal.
  81. [81]
    Advances in real-time magnetic resonance imaging of the vocal tract ...
    Real-time magnetic resonance imaging (rtMRI) of the moving vocal tract during running speech production is an important emerging tool for speech production ...
  82. [82]
    Analysis of speech production real-time MRI - ScienceDirect.com
    We review techniques used in analysis of articulatory data acquired using RT-MRI, and assess the utility of different approaches for different types of data ...
  83. [83]
    Ultrasound Images of the Tongue: A Tutorial for Assessment ... - NIH
    Jan 3, 2017 · Ultrasound imaging of speech has greatly expanded our understanding of how sounds articulated with the tongue (lingual sounds) are produced.
  84. [84]
    Ultrasound Tongue Imaging in Research and Practice with People ...
    Abstract. Ultrasound tongue imaging is becoming popular as a tool for both phonetic research and biofeedback for treating speech sound disorders.Missing: key | Show results with:key
  85. [85]
    A review of data collection practices using electromagnetic ...
    Mar 1, 2021 · Electromagnetic articulography (EMA) is a point tracking method, whereby sensors placed on target articulators (including tongue, lips, and jaw) ...
  86. [86]
    Electromagnetic Articulography (EMA) | Michael T. Johnson, Ph.D.
    Electro-Magnetic Articulography (EMA) has been a rapidly growing technology for accurate measurement of articulatory kinematics. This technology is based on ...
  87. [87]
    A modular architecture for articulatory synthesis from gestural ... - NIH
    This paper proposes a modular architecture for articulatory synthesis from a gestural specification comprising relatively simple models for the vocal tract, ...Missing: GEST | Show results with:GEST
  88. [88]
    Legacy: ASY Vocal Tract Model - Haskins Laboratories
    The Articulatory Synthesis program (ASY) is a software speech synthesis system. At the heart of this system is a model of the vocal tract in the midsaggital ...
  89. [89]
    [PDF] ISCA Archive THE HISTORY OF ARTICULATORY SYNTHESIS AT ...
    The Haskins articulatory synthesis model (ASY) was designed to allow for simple control of vocal tract shape by direct manipulation of articulators.
  90. [90]
    Deep learning in ultrasound tongue imaging: a systematic review ...
    Sep 23, 2025 · We identified and synthesised eight key studies that collectively illustrate the current advancements, some developed direct classifiers for ...
  91. [91]
    Speech-Guided Diffusion Model for Real-time MRI Video of ... - arXiv
    Sep 23, 2024 · We introduce a data-driven method to visually represent articulator motion in Magnetic Resonance Imaging (MRI) videos of the human vocal tract during speech.Missing: seminal | Show results with:seminal
  92. [92]
    Computer-Implemented Articulatory Models for Speech Production
    Mar 7, 2022 · This paper lists many computer-implemented articulatory models and provides criteria for dividing articulatory models in different categories.Missing: GEST | Show results with:GEST