Interval ratio

In music theory, an interval ratio is the ratio of the frequencies of two pitches that defines the musical interval between them.^[1] For example, the octave corresponds to a 2:1 ratio, where the higher pitch has twice the frequency of the lower pitch, while the perfect fifth has a 3:2 ratio.^[2] These ratios form the basis for understanding consonance and dissonance, as simpler ratios (with small integers) tend to sound more harmonious due to overlapping harmonics in the overtone series.^[3] Interval ratios are central to tuning systems, such as just intonation, where intervals approximate these simple ratios, and equal temperament, where they are adjusted for practicality across the octave.^[2]

Fundamentals

Definition

In music theory, an interval ratio is defined as the ratio of the frequencies of two pitches that form a musical interval, typically simplified to its lowest terms using small integers.^[1] For instance, a perfect fifth, such as from C to G, has a frequency ratio of 3:2.^[4] These ratios emerge from the harmonic overtones inherent in the physical production of sound, where a pitched note consists of a fundamental frequency and its integer multiples (harmonics), creating natural proportional relationships between simultaneous or successive tones.^[5] When two notes share harmonics that align closely, their frequency ratio approximates simple integers, contributing to the perception of consonance.^[1] Interval ratios differ from measurements in semitones or scale steps, which are additive approximations of logarithmic frequency differences used in tempered systems; ratios instead represent precise, multiplicative proportions that preserve the exact acoustic relationships, as emphasized in just intonation.^[4] A foundational example is the octave, with a ratio of 2:1, where the higher pitch's frequency is exactly double that of the lower, serving as the basis for scaling all other intervals.^[5]

Frequency Basis

Musical intervals arise acoustically from the harmonic series produced by vibrating sources, such as strings or air columns in musical instruments. When a fundamental frequency f_1 is sounded, it generates a series of overtones at integer multiples of f_1, namely $2f_1, $3f_1, $4f_1, and so on, forming simple frequency ratios between partials.^[6] These ratios, such as 2:1 between the fundamental and second harmonic, provide the acoustic foundation for intervals, as pairs of notes whose fundamentals relate through these overtones exhibit aligned partials.^[6] Mathematically, an interval is represented by the ratio r = f_2 / f_1, where f_1 and f_2 (f_2 > f_1) are the fundamental frequencies of two tones.^[7] Human perception of pitch intervals scales logarithmically, so the size of an interval in octaves is given by \log_2(r) = \log_2(f_2 / f_1).^[7] For example, the octave corresponds to r = 2, yielding \log_2(2) = 1 octave. This logarithmic measure derives from the physics of wave interference and beat frequencies. When two tones with close frequencies interfere, the resulting amplitude modulates at the beat frequency |f_2 - f_1|, creating audible pulsations; simple ratios minimize such beats across corresponding partials, while the logarithmic scale reflects how frequency doublings (octaves) are perceptually equivalent regardless of absolute pitch.^[8] In instruments like strings, the frequency is inversely proportional to the vibrating length L, so f \propto 1/L; thus, an interval ratio f_2 / f_1 corresponds to the inverse length ratio L_1 / L_2.^[9] For pipes in wind instruments, similar inverse relationships hold between effective length and frequency for the fundamental mode.^[6]

Historical Context

Ancient Origins

The foundations of interval ratios in music theory trace back to ancient Greece around the 6th century BCE, where Pythagoras and his school conducted experiments with the monochord—a single-string instrument—to demonstrate that consonant intervals correspond to simple numerical ratios of string lengths. For instance, halving the string length produces an octave (ratio 2:1), while a two-thirds length yields a perfect fifth (ratio 3:2). These discoveries, often attributed to empirical observations such as the sounds of blacksmith hammers or vibrating strings, established a mathematical basis for musical harmony, emphasizing the harmony between numbers and sound.^[10]^[11]^[12] Subsequent Greek theorists expanded this framework through the tetrachord system, a four-note structure spanning a perfect fourth that served as the building block of scales. Aristoxenus, in the 4th century BCE, classified intervals within tetrachords geometrically, focusing on perceptual distances rather than strict ratios, dividing the fourth into combinations like two whole tones and a semitone for the diatonic genus. Ptolemy, in the 2nd century CE, refined this by integrating mathematical ratios derived from monochord measurements, assigning 4:3 to the fourth and critiquing earlier approaches for inconsistencies, such as the sum of six tones exceeding the octave. These classifications influenced harmonic science by balancing auditory experience with quantitative precision.^[13]^[14] Parallel developments occurred in ancient India during the Vedic period (c. 1500–500 BCE), where musical theory emerged in texts like the Samaveda, emphasizing chanted melodies with foundational intervals. The shruti system, conceptualizing microtonal divisions of the octave into 22 parts, drew from Vedic swaras (notes) and incorporated ratios such as 4:3 for the fourth between Shadja and Madhyama, reflecting early intuitions of just intonation in ritual music. This approach paralleled Greek ratio-based tuning by prioritizing sonic purity in sacred contexts.^[15] A key limitation of Pythagorean tuning arose from stacking twelve perfect fifths (each 3:2) to form a circle of fifths, which, upon completing the cycle, yields a slight discrepancy of 531441/524288 relative to seven octaves (2:1), known as the Pythagorean comma. This interval, first quantified in ancient Greek sources like Euclid's Sectio Canonis (c. 300 BCE), highlighted the tension between pure fifths and octave closure, influencing later theoretical debates on scale construction.^[16]

Evolution in Western Music

In the medieval period, the development of interval ratios in Western music was advanced through practical pedagogical innovations and early tuning adjustments. Guido d'Arezzo, in his treatise Micrologus (c. 1026), influenced ratio-based solmization by codifying the hexachord system, which organized pitches into six-note segments defined by specific interval patterns such as whole tones (9/8 ratio) and semitones (16/15 or 256/243 ratios), facilitating the teaching of chant through interval recognition rather than rote memorization.^[17] This system built on Boethian principles of consonance derived from simple numerical ratios, emphasizing intervals like the diatessaron (4:3) and diapente (3:2) as foundational to modal structures.^[18] By the late medieval and early Renaissance transition, mean-tone tuning emerged as an approximation to pure ratios, particularly targeting the major third at 5:4 for sweeter harmonic thirds in polyphony; Pietro Aaron described this in Toscanello de la musica (1523), where fifths were flattened by a quarter syntonic comma to achieve equal major thirds across the scale.^[19] During the Renaissance, theorists explicitly championed just intonation based on small integer ratios to enhance consonance in choral and instrumental music. Gioseffo Zarlino, in Le Istitutioni harmoniche (1558), advocated for 5-limit just intonation, deriving the diatonic scale from the senario (integers 1 through 6) to produce pure intervals such as the major third (5:4) and emphasizing the minor third (6:5) as essential for the "natural" harmonic series reflected in vocal polyphony.^[20] Zarlino argued that these ratios, rooted in acoustic proportions, formed the basis of sensory pleasure in music, influencing composers like Palestrina to prioritize smooth voice leading within just intervals over Pythagorean approximations.^[21] From the Baroque to the Classical era, the pursuit of versatile tuning systems began to challenge the rigidity of pure ratios in favor of broader key modulation. Andreas Werckmeister, in works like Musicalische Temperatur (1687) and Erfundene und verfertigte musicalische Temperatur (1691), proposed well-tempered tunings that tempered select fifths away from the pure 3:2 ratio—such as widening some by a comma while leaving others intact—to enable usable harmonies in remote keys, critiquing the limitations of meantone's pure thirds for lacking "universal" applicability in organ and keyboard music.^[22] This shift implied a pragmatic awareness of ratio deviations, as seen in Johann Sebastian Bach's The Well-Tempered Clavier (1722), where fugues exploit contrapuntal entries that align with tempered intervals approximating just ratios (e.g., near-5:4 thirds in close-voiced subjects), demonstrating compositional sensitivity to ratio-based consonance even in unequal temperaments.^[23] In the 19th and 20th centuries, interval ratios experienced a revival through microtonal explorations and systematic composition theories, extending beyond equal temperament's dominance. Harry Partch, in Genesis of a Music (1949), delved into microtonal just intonation with his 43-tone scale, prominently featuring the 81/80 syntonic comma as a melodic interval to resolve discrepancies between ratios like 9/8 and 10/9, enabling expressive nuances in works like Revelation in the Courthouse Park (1960) that equal temperament could not capture.^[24] Similarly, Paul Hindemith, in The Craft of Musical Composition (1937–1953), developed a ratio-based theory grounding harmony in the overtone series, assigning degrees of tension to intervals by their simplest numerical ratios (e.g., octave 2:1 as zero tension, major third 5:4 as low), which informed his compositions like Symphonic Metamorphosis (1943) by prioritizing acoustically "natural" progressions over chromatic equality.^[25]

Key Examples

Consonant Intervals

Consonant intervals are musical intervals perceived as stable and harmonious due to their simple frequency ratios, typically involving small integers that align closely with the natural harmonic series. In just intonation, these ratios are derived from low-prime-limit tunings, primarily using the primes 2, 3, and 5, which minimize dissonance by maximizing overtone coincidence between the two tones.^[26] Such intervals form the foundation of many tonal harmonies, with perfect intervals being the most stable, followed by major and minor variants.^[27] The perfect intervals include the unison at a ratio of 1:1, the octave at 2:1, the perfect fifth at 3:2, and the perfect fourth at 4:3, where the fourth is the inversion of the fifth (its complementary interval within the octave).^[27] These ratios produce the purest consonance because their overtones overlap extensively; for instance, the second overtone of the lower note in a perfect fifth coincides with the fundamental of the higher note.^[26] Major intervals encompass the major third at 5:4 and the major sixth at 5:3, which introduce a brighter, more expansive quality while remaining relatively stable due to their derivation from the fifth and sixth harmonics.^[27] Minor intervals consist of the minor third at 6:5 and the minor sixth at 8:5, offering a slightly narrower, more introspective consonance that complements major counterparts in triadic structures.^[28] The following table summarizes these consonant interval ratios along with illustrative frequency examples based on a reference pitch of C at 264 Hz:

Interval	Ratio	Example Frequencies (Lower to Higher, Hz)
Unison	1:1	264 : 264
Octave	2:1	264 : 528
Perfect Fifth	3:2	264 : 396
Perfect Fourth	4:3	330 : 440
Major Third	5:4	264 : 330
Minor Third	6:5	264 : 317
Major Sixth	5:3	264 : 440
Minor Sixth	8:5	264 : 422

These examples approximate just intonation using standard pitch references, where the higher frequency is the product of the lower frequency and the ratio.^[29] These consonant ratios are predominantly superparticular—taking the form (n+1):n, such as 3:2 or 4:3—or otherwise limited to low primes, which ensures harmonic simplicity and reduces beating between partials for a smoother auditory experience.^[30] This structure arises from the initial harmonics in the series generated by vibrating strings or air columns.^[27]

Dissonant Intervals

Dissonant intervals in just intonation are characterized by frequency ratios involving larger prime factors or higher denominators, leading to greater perceptual tension through the misalignment of harmonic partials. The minor second, with a ratio of 16:15, exemplifies this dissonance, as its complex numerator and denominator result in rapid beating between overtones when sounded simultaneously.^[31] Similarly, the major second at 9:8, while simpler than the minor second, still produces noticeable roughness due to the introduction of the factor 3 in the numerator, distinguishing it from more stable perfect intervals like the fifth.^[31] The tritone, often realized as 45:32 in ascending form or 64:45 in descending, amplifies dissonance through its even more intricate ratio, incorporating primes up to 5 and creating a stark instability historically dubbed the diabolus in musica or "devil in music" for its avoidance in early polyphony.^[32]^[33] Augmented and diminished intervals further embody this tension; for instance, the augmented fourth aligns with the 45:32 tritone, evoking a sense of unresolved conflict in harmonic progressions.^[32] Higher-limit ratios extend dissonance into septimal territory, where primes like 7 introduce additional complexity. The harmonic seventh, at 7:4, serves as a prime example, lying approximately 27 cents flat relative to the just minor seventh (16:9) and generating pronounced beating among its partials due to the odd prime 7's misalignment with lower harmonics.^[34] These septimal intervals enhance harmonic color but heighten perceptual roughness, contributing to tension in compositions that resolve to simpler ratios.

Dissonant Interval	Just Ratio	Beat Frequency Implication
Minor Second	16:15	High beat rate from near-overlapping partials, perceived as clashing roughness.^[35]
Major Second	9:8	Moderate beating in mid-range partials, creating subtle tension without full resolution.^[35]
Tritone	45:32	Intense, irregular beats from widely spaced harmonics, amplifying instability.^[35]
Harmonic Seventh	7:4	Rapid, dissonant pulsations from septimal partials, evoking unresolved strain.^[35]

Applications

In Just Intonation

Just intonation is a tuning system in which musical intervals are defined by exact simple integer ratios between frequencies, typically limited to prime factors of 2, 3, 5, or 7, known respectively as 3-limit, 5-limit, or 7-limit tunings.^[36]^[37] These ratios derive from the natural harmonic series, producing intervals that align harmonically without approximation.^[38] Scales in just intonation are constructed by stacking pure perfect fifths with a ratio of [3:2](/page/3-2) and major thirds with a ratio of [5:4](/page/5-4), often within a 5-limit framework. For example, the major scale in just intonation yields the following frequency ratios relative to the tonic: 1:1 (tonic), 9:8 (major second), 5:4 (major third), 4:3 (perfect fourth), 3:2 (perfect fifth), 5:3 (major sixth), 15:8 (major seventh), and 2:1 (octave).^[37] This method generates a diatonic scale where each step reflects simple rational proportions, emphasizing harmonic purity over equal division.^[39] The primary advantages of just intonation lie in its production of pure, beat-free intervals, where overtones from different notes align precisely, resulting in smooth, consonant harmonies without the interference beats caused by slight frequency mismatches.^[38] This clarity enhances the resonance and blend in contexts such as a cappella choral music and string ensembles, where performers can dynamically adjust tuning to achieve these ideal ratios.^[40]^[41] However, just intonation faces limitations when constructing closed temperaments or modulating keys extensively, as discrepancies like the syntonic comma—a small interval of $81:80 (approximately 21.5 cents)—arise from the mismatch between stacked fifths and thirds.^[42] This comma leads to wolf intervals, such as a dissonant fifth or third in certain positions within a 12-note scale, producing audible roughness that disrupts harmonic continuity.^[37]^[43]

Relation to Tempered Systems

In equal temperament, the octave is divided into twelve equal semitones, each with a frequency ratio of $2^{1/12} \approx 1.05946. For an interval spanning n semitones, the tempered ratio is thus given by $2^{n/12}. A representative example is the perfect fifth, which spans 7 semitones and yields a tempered ratio of $2^{7/12} \approx 1.49831, corresponding to 700 cents; this approximates the pure 3:2 ratio of approximately 701.96 cents, resulting in a deviation of about -2 cents (flat). The deviation in cents for any interval can be calculated as $1200 \times \log_2 \left( \frac{\text{ratio}_\text{pure}}{\text{ratio}_\text{tempered}} \right), quantifying the slight impurity introduced to enable consistent tuning across all keys.^[44]^[45]^[46] Other tempered systems adjust interval ratios differently to prioritize certain consonances. In meantone temperament, such as quarter-comma meantone, the fifth is flattened more significantly—to around 696.58 cents—to achieve purer major thirds closer to the just 5:4 ratio of 386.31 cents, rather than the tempered 400 cents. This compromises the fifth's purity relative to 3:2 but enhances harmonic sweetness in common chords, though it introduces a "wolf" interval (a dissonant fifth or diminished sixth) when closing the octave circle. Well-tempered systems, by contrast, employ unequal tempering of intervals, distributing deviations variably across keys to provide a range of interval purities; for instance, some fifths may be closer to 701.96 cents while others are tempered differently, allowing greater variety in tonal colors without the uniformity of equal temperament.^[45]^[47] These systems reflect practical trade-offs in approximating pure interval ratios. Equal temperament's even distribution eliminates wolf intervals and facilitates unrestricted modulation between keys, a key advantage for complex Western music, but at the cost of all intervals being slightly impure. Meantone prioritizes third purity for Renaissance and early Baroque styles, while well-tempered variants balance usability across keys with expressive interval variations, influencing composers like Bach.^[48]^[49]

Perception and Measurement

Consonance Principles

The theory of consonance in musical intervals attributes perceived stability to simple frequency ratios composed of low integers, which allow the overtones of the constituent tones to align closely with one another, thereby minimizing auditory beats and interference.^[50] Beats occur when partials (overtones) from different tones have frequencies that are close but not identical, causing periodic amplitude fluctuations; in simple ratios like 3:2 for the perfect fifth, such misalignments are limited, resulting in fewer beats due to the sparse interactions among higher harmonics.^[51] This overtone coincidence enhances harmonic fusion, producing a sense of resolution and smoothness in perception.^[52] Hermann von Helmholtz, in his seminal 1863 treatise On the Sensations of Tone, formalized this by linking dissonance to "roughness," a sensory irritation arising from amplitude modulations in the overlapping spectra of complex tones with nearby partials.^[53] He argued that consonance emerges when these modulations are absent or slow enough to avoid perceptual harshness, as in intervals where partials either coincide or are sufficiently separated, drawing on physiological responses in the cochlea to explain the auditory discomfort of dissonant combinations.^[50] Contemporary psychoacoustics refines Helmholtz's model by incorporating the critical bandwidth—a frequency range of about 1/6 to 1/4 of the center frequency within which partials interact strongly—and the mechanism of virtual pitch, where the brain extracts a global pitch from harmonic patterns.^[54] For ratios like 5:4 (major third), consonance arises when the virtual pitches of the tones match closely and their partials fall outside each other's critical bands, reducing roughness while promoting perceptual coherence; this dual process underscores how consonance integrates both sensory and cognitive elements.^[55] Perceptions of consonance also exhibit cultural variations, with Western traditions emphasizing 3-limit ratios (limited to primes 2 and 3) for maximal purity, whereas Javanese gamelan music accommodates 7-limit intervals, tolerating greater harmonic complexity due to the inharmonic spectra of its metallophones and culturally shaped aesthetic preferences.^[56]

Logarithmic Scales

Interval ratios in music are often quantified using logarithmic scales to facilitate precise measurement and comparison of pitches, as human perception of pitch is logarithmic rather than linear with respect to frequency. The cent is the most widely adopted unit in this context, defined as one twelve-hundredth of an octave, providing a standardized way to express deviations from equal temperament or just intonation ratios.^[57] This unit allows intervals to be treated additively, mirroring the multiplicative nature of frequency ratios through the logarithm.^[7] The cent is calculated using the formula:

\text{cents} = 1200 \times \log_2 \left( \frac{f_2}{f_1} \right)

where f_2 and f_1 are the frequencies of the higher and lower pitches, respectively.^[58] This derivation stems from the equal-tempered scale, where the octave (frequency ratio 2:1) is divided into 12 equal semitones, each spanning 100 cents, for a total of 1200 cents per octave. The logarithmic base-2 ensures that interval sizes add directly: for instance, stacking two equal-tempered perfect fifths, each 700 cents, yields 1400 cents, equivalent to one octave (1200 cents) plus 200 cents, with the just perfect fifth (ratio 3:2) measuring approximately 701.96 cents—slightly larger than the equal-tempered approximation by about 2 cents.^[59] This additivity simplifies analysis of tuning systems, such as calculating cumulative deviations in meantone or Pythagorean scales. A practical example is the just perfect fifth with a frequency ratio of 3:2, which measures approximately 701.96 cents—slightly larger than the 700 cents of equal temperament, highlighting the tool's utility in assessing tuning purity.^[60] While other logarithmic units exist, such as the savart (approximately 3.986 cents, as there are about 301.03 savarts in an octave, given by $1000 \times \log_{10}(2)), cents have become the standard for measuring interval ratio deviations due to their alignment with the 12-tone equal-tempered framework prevalent in Western music.^[61] The savart offers finer granularity but sees limited use outside historical or theoretical acoustics.^[61]