Fact-checked by Grok 2 weeks ago

Point-biserial correlation coefficient

The point-biserial correlation coefficient (r_{pb}) is a statistical measure that assesses the strength and direction of the linear association between a dichotomous (binary) variable and a continuous variable, serving as a special case of the Pearson product-moment correlation coefficient when one variable takes only two distinct values, such as 0 and 1.^[1]^[2] It is commonly applied in fields like psychology, education, and social sciences to evaluate relationships, for example, between gender (binary) and test performance (continuous), assuming the binary variable represents true categories rather than an artificial split of continuous data.^[3]^[4] The coefficient is calculated using the formula
r_{pb} = \frac{\bar{Y}_1 - \bar{Y}_0}{s_Y} \sqrt{p(1 - p)},
where \bar{Y}_1 is the mean of the continuous variable for the group coded as 1, \bar{Y}_0 is the mean for the group coded as 0, s_Y is the standard deviation of the continuous variable across all observations, and p is the proportion of observations in the group coded as 1 (with $1 - p for the other group).^[3]^[2] This formula yields values ranging from -1 to 1, where positive values indicate that higher values of the continuous variable are associated with the category coded as 1, negative values suggest the opposite, and a value near 0 implies no linear relationship.^[1] Interpretation mirrors that of Pearson's r, with magnitudes indicating weak (< 0.3), moderate (0.3–0.5), or strong (> 0.5) associations, though context-specific benchmarks apply.^[5]^[3] Key assumptions include the continuous variable being approximately normally distributed within each binary group, equal variances across groups (homoscedasticity), and linearity in the relationship; violations, such as non-normal data or artificial dichotomization, can lead to biased estimates, in which case alternatives like logistic regression may be preferable.^[2]^[4] The point-biserial correlation is robust for large samples but requires random sampling and independence of observations for valid inference, with significance testing often performed via t-tests analogous to those for Pearson's r.^[2] In practice, software like SPSS or R computes it directly by treating the binary variable as numeric (0/1), ensuring it reflects true categorical distinctions to avoid misleading results.^[4]

Definition and Background

Definition

The point-biserial correlation coefficient is a statistical measure that assesses the strength and direction of the association between a binary (dichotomous) variable and a continuous variable, serving as a special case of the Pearson product-moment correlation coefficient.^[6]^[7] It is particularly useful in scenarios where one variable represents two mutually exclusive categories, such as success/failure or presence/absence, and the other is measured on a numerical scale without discrete categories.^[8] In this context, the binary variable is conventionally coded as 0 for one category and 1 for the other, ensuring it functions as a numerical indicator in the correlation calculation, while the continuous variable is typically interval or ratio scaled, such as income levels or exam scores.^[7]^[9] The coefficient captures how variations in the continuous variable differ across the two groups defined by the binary variable, with positive values indicating that the continuous variable tends to be higher in the group coded as 1, and negative values showing the reverse pattern.^[10] The term "point-biserial" reflects the discrete, two-point structure of the binary variable on a measurement scale, distinguishing it from other correlation types like the biserial correlation, which assumes the binary variable arises from an underlying continuous distribution.^[11] This naming emphasizes the point-like nature of the dichotomy in contrast to fully continuous associations.^[12]

Historical Development

The point-biserial correlation coefficient originated as an extension of Karl Pearson's pioneering work on correlation measures in the early 1900s. Pearson, who introduced the product-moment correlation coefficient in 1895, extended his methods to handle variables of mixed types, including dichotomous ones, in biometric and statistical analyses. The specific term "point-biserial" was introduced by M. W. Richardson and J. M. Stalnaker in 1933, in their paper distinguishing it from the biserial correlation as a measure for truly dichotomous variables without assuming an underlying continuous distribution.^[13] A key publication contributing to its foundations is Pearson's 1909 paper in Biometrika, where he derived the biserial correlation for estimating relationships assuming an underlying continuous distribution for the dichotomous variable; the point-biserial can be viewed as a simplified variant without that assumption.^[14] The coefficient's development advanced through further examinations of its properties, such as Joseph Lev's 1949 note in the Annals of Mathematical Statistics, which explored its sampling distribution.^[14] It gained traction in psychometrics and social sciences in the mid-20th century, with statisticians like Chester W. Harris emphasizing its practical utility in educational measurement and test construction, including for analyzing item discrimination in binary-scored assessments.^[15] During the mid-1900s, refinements focused on robust estimation for dichotomous variables in empirical studies. By the 1980s, it was routinely incorporated into major statistical software packages, such as SAS and SPSS, enabling efficient computation in large-scale analyses. In modern statistical practice, it continues to inform developments in item response theory, where it serves as a benchmark for validating latent trait models against observed binary responses.^[2]

Mathematical Formulation

Formula Derivation

The point-biserial correlation coefficient arises as a special case of the Pearson product-moment correlation coefficient when one variable is dichotomous (coded as 0 or 1) and the other is continuous.^[2] To derive its formula, begin with the general Pearson correlation:

r = \frac{\mathrm{Cov}(X, Y)}{s_X s_Y},

where X is the dichotomous variable, Y is the continuous variable, \mathrm{Cov}(X, Y) is the covariance between X and Y, and s_X and s_Y are the standard deviations of X and Y, respectively.^[16] Assume X takes the value 1 with proportion P (and 0 with proportion $1 - P). The mean of X is \mu_X = P, and its variance is \sigma_X^2 = P(1 - P), so the standard deviation is s_X = \sqrt{P(1 - P)}. The covariance \mathrm{Cov}(X, Y) can be expressed as P \cdot M_1 - \mu_Y \cdot P, where M_1 is the mean of Y conditional on X = 1 and \mu_Y = P \cdot M_1 + (1 - P) \cdot M_0 is the overall mean of Y, with M_0 the mean of Y conditional on X = 0. Substituting yields \mathrm{Cov}(X, Y) = P(1 - P)(M_1 - M_0).^[16]^[2] Plugging these into the Pearson formula gives:

r_{pb} = \frac{P(1 - P)(M_1 - M_0)}{\sqrt{P(1 - P)} \cdot s_Y} = \frac{(M_1 - M_0) \sqrt{P(1 - P)}}{s_Y},

where s_Y is the standard deviation of Y. This simplification shows that the point-biserial correlation r_{pb} directly measures the standardized difference in group means scaled by the binary variance factor.^[16]^[2] In the derived formula, M_1 - M_0 captures the mean difference between the two groups defined by the dichotomous variable, reflecting the strength of the association. The term \sqrt{P(1 - P)} normalizes for the variance inherent in the binary distribution, which reaches its maximum at P = 0.5 and approaches zero as P nears 0 or 1, ensuring the coefficient accounts for the proportion of each category. Finally, division by s_Y standardizes the difference relative to the variability in the continuous variable.^[2] The derivation implicitly assumes linearity in the relationship between X and Y, as inherited from the Pearson correlation.

Relation to Pearson Correlation

The point-biserial correlation coefficient r_{pb} is mathematically equivalent to the Pearson product-moment correlation coefficient r when the dichotomous variable is coded numerically as 0 and 1.^[17] This equivalence arises because the point-biserial formula is a direct substitution of the Pearson formula for the case where one variable is binary, specifically incorporating the variance of the dichotomous variable, which is p(1-p) where p is the proportion of cases in the "1" category.^[17] To demonstrate this, recall the Pearson correlation formula:

r = \frac{\sum_{i=1}^N (X_i - \bar{X})(Y_i - \bar{Y})}{\sqrt{\sum_{i=1}^N (X_i - \bar{X})^2} \cdot \sqrt{\sum_{i=1}^N (Y_i - \bar{Y})^2}},

where X is the continuous variable, Y is the dichotomous variable coded as 0 or 1, \bar{X} and \bar{Y} are their means, and N is the sample size. With Y binary, \bar{Y} = p and the denominator's second square root simplifies to \sqrt{N p (1-p)}. The numerator, representing the covariance, becomes N p (1-p) ( \bar{X}_1 - \bar{X}_0 ), where \bar{X}_1 and \bar{X}_0 are the means of X for Y=1 and Y=0, respectively. Substituting these terms yields:

r = \frac{ ( \bar{X}_1 - \bar{X}_0 ) \sqrt{p (1-p)} }{ s_X },

which is identical to the standard point-biserial formula, confirming the direct substitution and equivalence.^[17] Both the point-biserial and Pearson coefficients range from -1 to +1, with the bounds interpreted similarly: r_{pb} = 1 occurs when the binary grouping perfectly predicts the continuous variable (e.g., all higher X values align with one category and lower with the other), r_{pb} = -1 for the opposite perfect separation, and r_{pb} = 0 for no linear association.^[17] Although computationally identical under the 0/1 coding, the point-biserial coefficient is preferred over the general Pearson correlation for interpretability when one variable is explicitly dichotomous, as its formula highlights the role of group proportions and mean differences in a manner tailored to binary-continuous associations.^[17]

Computation and Estimation

Step-by-Step Calculation

The point-biserial correlation coefficient is computed through a series of straightforward steps using the provided formula, which relates the difference in group means to the variability in the continuous variable, adjusted by the binary variable's distribution.

Code the binary variable (X) as 0 for one category and 1 for the other, ensuring consistent assignment across the dataset.^[3]
Calculate the proportion P of observations where X = 1, given by P = (number of 1s) / n, where n is the total number of observations.^[3]
Compute the mean M_1 of the continuous variable (Y) for all observations where X = 1, and the mean M_0 for all observations where X = 0.^[3]
Determine the sample standard deviation S_y of the continuous variable Y across all n observations, using the formula $S_y = \sqrt{\frac{\sum_{i=1}^n (Y_i - \bar{Y})^2}{n-1}},$ where \bar{Y} is the overall mean of Y.^[3]
Substitute the values into the point-biserial correlation formula: $r_{pb} = \frac{M_1 - M_0}{S_y} \sqrt{P(1 - P)}.$

^[3] To demonstrate these steps, consider a hypothetical dataset with n = 20 observations from students' exam outcomes. The binary variable X codes failure as 0 and passing as 1, while the continuous variable Y represents test scores out of 100. There are 10 failures (P = 10/20 = 0.5) and 10 passes. The test scores for the failure group (X = 0) are 35, 40, 45, 50, 55, 60, 65, 70, 75, 80 (M_0 = 57.5). The scores for the passing group (X = 1) are 65, 70, 75, 80, 85, 90, 95, 100, 105, 110 (M_1 = 87.5). The overall mean \bar{Y} = 72.5, and the computed S_y ≈ 33.33. Substituting into the formula yields

r_{pb} = \frac{87.5 - 57.5}{33.33} \sqrt{0.5(1 - 0.5)} \approx 0.90 \times 0.50 = 0.45.

This value indicates a moderate positive association.^[3] The sign of r_{pb} reflects the direction of the relationship: a positive coefficient occurs when higher values of the continuous variable tend to align with X = 1, while a negative coefficient indicates the opposite association.^[3] If P = 0 or P = 1, the point-biserial correlation is undefined, as the binary variable lacks variation (√[P(1 - P)] = 0), and one group mean cannot be computed due to zero observations in that category.^[3]

Software Implementation

The point-biserial correlation coefficient can be computed in R using the base function cor.test() with the method="pearson" argument after coding the binary variable as numeric values 0 and 1, as the point-biserial is equivalent to the Pearson correlation under this coding.^[18] A sample code snippet is as follows:

r
# Example data: binary_var (0/1) and continuous_var
binary_var <- c(0, 1, 0, 1, 1)  # Coded as numeric
continuous_var <- c(2.1, 3.5, 1.8, 4.2, 3.9)

# Using base R
result_base <- cor.test(binary_var, continuous_var, method = "pearson")
print(result_base$estimate)  # Point-biserial coefficient
# Example data: binary_var (0/1) and continuous_var
binary_var <- c(0, 1, 0, 1, 1)  # Coded as numeric
continuous_var <- c(2.1, 3.5, 1.8, 4.2, 3.9)

# Using base R
result_base <- cor.test(binary_var, continuous_var, method = "pearson")
print(result_base$estimate)  # Point-biserial coefficient

In Python, the scipy.stats module includes the pointbiserialr() function to directly compute the point-biserial correlation between a binary variable (coded as 0 and 1) and a continuous variable, returning the coefficient and p-value.^[19] The pingouin library offers pointbiserial_corr() for a similar computation, often with additional statistical outputs like confidence intervals. An executable code example is:

python
import numpy as np
from scipy.stats import pointbiserialr
import pingouin as pg
import pandas as pd

# Example data
binary_var = np.array([0, 1, 0, 1, 1])  # Coded as 0 and 1
continuous_var = np.array([2.1, 3.5, 1.8, 4.2, 3.9])

# Using scipy
corr_scipy, p_value = pointbiserialr(binary_var, continuous_var)
print(f"Point-biserial coefficient: {corr_scipy}")

# Using pingouin
df = pd.DataFrame({'binary': binary_var, 'continuous': continuous_var})
result_pg = pg.pointbiserial_corr(df, x='binary', y='continuous')
print(result_pg['r'].iloc[0])  # Point-biserial coefficient
import numpy as np
from scipy.stats import pointbiserialr
import pingouin as pg
import pandas as pd

# Example data
binary_var = np.array([0, 1, 0, 1, 1])  # Coded as 0 and 1
continuous_var = np.array([2.1, 3.5, 1.8, 4.2, 3.9])

# Using scipy
corr_scipy, p_value = pointbiserialr(binary_var, continuous_var)
print(f"Point-biserial coefficient: {corr_scipy}")

# Using pingouin
df = pd.DataFrame({'binary': binary_var, 'continuous': continuous_var})
result_pg = pg.pointbiserial_corr(df, x='binary', y='continuous')
print(result_pg['r'].iloc[0])  # Point-biserial coefficient

For SPSS, the point-biserial correlation is obtained via the Bivariate Correlations procedure (Analyze > Correlate > Bivariate), selecting the Pearson option; the binary variable must be coded as 0 and 1.^[4] The syntax example is:

CORRELATIONS
  /VARIABLES=binary_var continuous_var
  /MISSING=PAIRWISE.
CORRELATIONS
  /VARIABLES=binary_var continuous_var
  /MISSING=PAIRWISE.

In SAS, PROC CORR computes the point-biserial correlation as a Pearson correlation by specifying the binary variable (coded as 0 and 1) and the continuous variable in the VAR statement, with options for handling missing data such as NOMISS for listwise deletion. A syntax example is:

PROC CORR DATA=dataset;
  VAR binary_var continuous_var;
  RUN;
PROC CORR DATA=dataset;
  VAR binary_var continuous_var;
  RUN;

In Microsoft Excel, the point-biserial correlation can be calculated manually using the =CORREL() function on ranges containing the continuous variable and the binary variable coded as 0 and 1, or via the Analysis ToolPak add-in by selecting the Correlation tool under Data Analysis to generate a correlation matrix including the point-biserial as a Pearson value. For example, if binary data is in A2:A6 and continuous in B2:B6, enter =CORREL(A2:A6, B2:B6). When implementing the point-biserial correlation in software, ensure the binary variable is properly coded as numeric 0 and 1 to treat it as dichotomous, as incorrect coding (e.g., as factors or text) may lead to errors or inappropriate computations.^[18] Most tools, including cor.test() in R, pointbiserialr() in SciPy, and PROC CORR in SAS, handle missing data via pairwise deletion by default, excluding only pairs with missing values for each correlation; specify listwise deletion if needed to use complete cases only, though this may reduce sample size.^[19]

Interpretation and Applications

Interpreting the Coefficient

The point-biserial correlation coefficient, denoted as r_{pb}, quantifies the strength and direction of the linear relationship between a dichotomous variable and a continuous variable, serving as a special case of the Pearson product-moment correlation. The coefficient ranges from -1 to +1, where a value of -1 indicates a perfect negative association (higher values of the continuous variable correspond to the reference category of the dichotomous variable), +1 indicates a perfect positive association (higher values correspond to the other category), and 0 suggests no linear relationship between the variables. The sign of r_{pb} depends on which category of the dichotomous variable is coded as 1, making the direction interpretable only in the context of the coding scheme used.^[2] To evaluate the magnitude of r_{pb}, researchers often apply Cohen's (1988) conventions for Pearson correlations, classifying |r_pb| = 0.10 as small, 0.30 as medium, and 0.50 as large. These thresholds provide a conventional framework for assessing practical significance, though the actual importance depends on the research context and sample size.^[16] Statistical significance of r_{pb} is typically assessed using a t-test, where the test statistic is calculated as

t = r_{pb} \sqrt{\frac{n-2}{1 - r_{pb}^2}},

with degrees of freedom df = n - 2 and n as the sample size; under the null hypothesis of no correlation, this follows a t-distribution.^[2] The resulting p-value indicates the probability of observing such a coefficient (or more extreme) assuming no true association; if p < α (e.g., 0.05), the correlation is deemed statistically significant, suggesting evidence against the null hypothesis.^[2] Confidence intervals for r_{pb} are commonly constructed using Fisher's z-transformation to normalize the sampling distribution of the correlation coefficient. The transformation is given by

z' = \frac{1}{2} \ln \left( \frac{1 + r_{pb}}{1 - r_{pb}} \right) = \artanh(r_{pb}),

with standard error SE_{z'} = 1 / \sqrt{n - 3}; a 95% interval for z' is z' \pm 1.96 \times SE_{z'}, which is then back-transformed using r = \tanh(z') to obtain the interval for r_{pb}.^[16] This interval provides a range of plausible values for the population correlation, with narrower intervals indicating greater precision; if the interval excludes 0, it supports a significant association.^[16] As an effect size measure, r_{pb} reflects the degree to which the dichotomous variable discriminates between groups on the continuous variable, approximately equal to the standardized mean difference (Cohen's d) in a two-group design multiplied by the square root of the product of the group proportions \sqrt{p(1-p)}. Higher absolute values indicate better discrimination, aiding in the evaluation of substantive importance beyond mere statistical significance.

Practical Examples

In educational research, the point-biserial correlation coefficient is commonly applied to assess the relationship between a binary outcome, such as whether students graduated (yes/no), and a continuous measure like grade point average (GPA). This finding implies that educational interventions targeting GPA improvement could enhance graduation likelihood, with the positive direction suggesting that higher academic achievement aligns with successful program completion. In psychological studies, the point-biserial correlation evaluates associations between binary diagnostic categories, such as whether a patient is diagnosed with a mental health condition (yes/no), and continuous symptom severity scores derived from standardized scales. For example, research on psychopathology has examined point-biserial correlations between specific diagnoses (e.g., major depressive disorder) and symptom severity measures, reflecting moderate strengths where diagnosed individuals exhibit higher severity levels, aiding in clinical decision-making for targeted therapies. The positive direction here underscores the diagnostic tool's utility in identifying at-risk cases, while the moderate magnitude highlights the need for complementary assessments to account for variability in symptom expression. Biological experiments often employ the point-biserial correlation to examine the impact of binary treatment assignments, such as whether plants received a specific intervention (yes/no), on continuous outcomes like growth height or biomass. In agronomic trials, for example, a treatment versus control design has demonstrated a moderate positive effect where treated plants showed greater growth, informing agricultural practices on intervention efficacy. This association's direction and strength suggest practical implications for scaling treatments in field applications, emphasizing the coefficient's role in quantifying biological responses without assuming equal group variances.

Assumptions and Limitations

Key Assumptions

The point-biserial correlation coefficient, as a special case of the Pearson product-moment correlation where one variable is dichotomous (coded as 0 and 1) and the other is continuous, inherits several core statistical assumptions to ensure unbiased estimation and valid inference.^[20] These assumptions underpin the coefficient's reliability in measuring the strength and direction of the linear association between the variables. A fundamental assumption is the independence of observations, meaning that the values of the continuous variable for each case are not influenced by those of other cases, with no clustering, repeated measures, or other dependencies in the data. This ensures that the correlation captures genuine pairwise relationships rather than spurious effects from correlated errors. The relationship between the binary and continuous variables must be linear, such that when the data are plotted (e.g., as boxplots or means of the continuous variable by binary group), the association appears as a straight-line trend rather than curvilinear or nonlinear.^[21] With only two binary categories, this linearity is often inherently satisfied, as the group means can always be connected by a line, but deviations (e.g., if the continuous variable shows nonlinear patterns within groups) could undermine the interpretation.^[2] Homoscedasticity is another key requirement, stipulating that the variance of the continuous variable is equal across the two binary groups.^[2] This equal spread of scores prevents bias in the correlation estimate and supports the validity of associated statistical tests, such as those for significance. The continuous variable should also be approximately normally distributed within each binary group.^[2] Although the point-biserial coefficient is somewhat robust to mild violations of this assumption, substantial non-normality can distort p-values and confidence intervals derived from the correlation. For optimal interpretation, the binary variable should represent a true natural dichotomy (e.g., gender or pass/fail) rather than an artificial cutoff from an underlying continuous trait; in the latter case, the related biserial correlation coefficient may provide a more accurate estimate, though the point-biserial remains computable without this condition.^[3]

Potential Limitations

One major limitation of the point-biserial correlation coefficient arises from its sensitivity to artificial dichotomization of continuous variables, such as median splits, which can lead to biased estimates and loss of statistical power by discarding information from the original scale.^[22] For instance, applying the point-biserial to a dichotomized continuous predictor underestimates the true underlying correlation by 20-44%, depending on the split proportion and true correlation strength, with the bias persisting even in large samples.^[22] As a special case of the Pearson product-moment correlation, the point-biserial coefficient assumes a linear relationship between the variables and fails to detect non-linear associations, potentially underestimating or misrepresenting the strength of the relationship in such cases.^[2] When non-linearity is suspected, alternatives like logistic regression are recommended to model the binary outcome more appropriately without assuming linearity. The coefficient's reliability diminishes with small sample sizes, where estimates become unstable and the sampling distribution deviates from normality, requiring large n for valid inference.^[2] Additionally, imbalanced group proportions (P near 0 or 1) attenuate the correlation toward zero and inflate the standard error, reducing precision and power due to the smaller effective sample in the minority group.^[23] Like all correlation measures, the point-biserial quantifies association but does not imply causation, leaving room for confounding variables to explain the observed relationship.^[24] When key assumptions such as normality within groups or equal variances fail, or if both variables are truly dichotomous, the phi coefficient provides a more suitable measure than the point-biserial; for non-normal continuous variables, Spearman's rank correlation (rho) is preferable to avoid violation of parametric assumptions.^[2]

References

[1]
Point-Biserial Correlation - Statistics By Jim
The point-biserial correlation coefficient measures the strength and direction of the relationship between a continuous variable and a dichotomous variable.<|control11|><|separator|>
[2]
[PDF] Point-Biserial and Biserial Correlations - NCSS
The point-biserial correlation is a special case of the product-moment correlation in which one variable is continuous and the other variable is binary ...
[3]
Point-Biserial Correlation & Biserial Correlation: Definition, Examples
Formula. The formula for the point biserial correlation coefficient is: M1 = mean (for the entire test) of the group that received the positive binary variable ...
[4]
Point-Biserial Correlation using SPSS Statistics
A point-biserial correlation is used to measure the strength and direction of the association that exists between one continuous variable and one dichotomous ...
[5]
Point Biserial Correlation | Semantic Scholar
Oct 15, 2005 · The point biserial correlation is the value of Pearson's product moment correlation when one of the variables is dichotomous and the other ...
[6]
[PDF] Point Biserial Correlation Tests - NCSS
The point biserial correlation coefficient (ρ in this chapter) is the product-moment correlation calculated between a continuous random variable (Y) and a ...
[7]
[PDF] Preliminary Item Statistics Using Point-Biserial Correlation and P ...
This computation results in the correlation of the item score and the total score minus that item score. For example, the Item 1 correlation is ...Missing: journal | Show results with:journal
[8]
Nonoverlap proportion and the representation of point-biserial ... - NIH
The Pearson product moment correlation coefficient (r) calculated from these numeric data is known as the point-biserial correlation coefficient (rpb) [3].
[9]
(PDF) Point‐biserial correlation: Interval estimation, hypothesis ...
Oct 29, 2025 · Point‐biserial correlations are defined for designs with either fixed or random group sample sizes and can accommodate unequal variances.Missing: source | Show results with:source
[10]
Point Biserial Correlation Coefficient and its Generalization
Jan 1, 2025 · Point Biserial Correlation Coefficient and its Generalization. Published online by Cambridge University Press: 01 January 2025. S. Das Gupta.
[11]
More Correlation Coefficients - Andrews University
The formula is very similar to the point-biserial but yet different: rb = (Y1 - Y0) • (pq/Y) / [sigma] Y,. where Y0 and Y1 are the Y score means for data ...
[12]
The Point Biserial Coefficient of Correlation - Project Euclid
The Point Biserial Coefficient of Correlation. Joseph Lev. DOWNLOAD PDF + SAVE TO MY LIBRARY. Ann. Math. Statist. 20(1): 125-126 (March, 1949).Missing: invented | Show results with:invented
[13]
[PDF] Tests. - ERIC
Apr 4, 1972 · Harris, Chester W. TITLE. An Index of Efficiency for Fixed-Length ... principle that the point biserial cannot take values of t 1. The ...Missing: origin | Show results with:origin
[14]
[PDF] Interpretation of the point-biserial correlation coefficient in the ...
In this article, we presented the point-biserial correlation, which is to be used instead of the Pearson correlation when one variable is naturally ...
[15]
Correlation Between a Discrete and a Continuous Variable. Point ...
September, 1954 Correlation Between a Discrete and a Continuous Variable. Point-Biserial Correlation. Robert F. Tate · DOWNLOAD PDF + SAVE TO MY LIBRARY.Missing: original | Show results with:original
[16]
How to Calculate Point-Biserial Correlation in R
If you're using R, for example, you might need to use a package like `psych` or `ltm` to calculate point-biserial correlation coefficients and conduct related ...
[17]
[PDF] CRAN - Package 'psych'
Jun 23, 2025 · Functions for analyzing data at multi- ple levels include within and between group statistics, including correlations and factor analy- sis.
[18]
pointbiserialr — SciPy v1.16.2 Manual
The point biserial correlation is used to measure the relationship between a binary variable, x, and a continuous variable, y. Like other correlation ...Missing: pingouin | Show results with:pingouin
[19]
Correlation in Relationship to t-test - Real Statistics Using Excel
The point-biserial correlation coefficient is simply Pearson's product-moment correlation coefficient where one or both of the variables are dichotomous.
[20]
Reliability and validity of severity dimensions of psychopathology ...
Concurrent validity for lifetime and current psychopathology was assessed using correlations between each diagnosis or symptom severity scale (point‐biserial ...
[21]
The Other Half of the Story: Effect Size Analysis in Quantitative ... - NIH
... examples comparing the effects of a treatment on plant growth or student performance. ... The point-biserial correlation coefficient is similar in nature ...
[22]
Point Biserial - Statistics Resources - LibGuides at National University
Oct 27, 2025 · The Point-Biserial Correlation is a special case of the Pearson Correlation and is used when you want to measure the relationship between a continuous variable ...Missing: journal | Show results with:journal
[23]
Point Biserial Correlation - Kornbrot - Wiley Online Library
Apr 22, 2014 · The point biserial correlation is a useful measure of effect size, that is, statistical magnitude, of the difference in means between two groups.
[24]
[PDF] Scatterplots and Correlation
Pearson r: Assumptions Assumptions: ▪ Correlation requires that both variables be quantitative. Correlation describes linear relationships. Correlation does ...
[25]
https://doi.org/10.1002/jrsm.1218
[26]
Correcting the Point-Biserial Correlation for Attenuation Owing - jstor
The same reasoning underlies the suggestion by meta-analysts that the point-biserial, which has a metric different from that of the "regular" product-moment ...
[27]
Correlation and contingency
Correlation = Knowing A helps you guess B · Correlation does not imply causation. · A is correlated with B implies B is correlated with A. · Contingency = Changing ...