Fact-checked by Grok 2 weeks ago

G*Power

G*Power is a free statistical software package designed for performing power analyses, sample size calculations, and estimations across a variety of common statistical tests, including t-tests, F-tests, χ²-tests, z-tests, and some exact tests. Developed primarily for researchers in the , behavioral, and , it provides an intuitive that enables users to visualize power curves, conduct sensitivity analyses, and explore different analysis options without requiring advanced programming knowledge. The software is available for both Windows and macOS operating systems, with the latest versions being 3.1.9.7 for Windows and 3.1.9.6 for macOS 10.7 through 15, and it has been widely adopted due to its flexibility and ease of use in experimental design and testing. Originally introduced in 1996 as a standalone power analysis tool by Edgar Erdfelder, Franz Faul, and Axel Buchner, evolved significantly with the release of version 3 in 2007, led by Franz Faul, Edgar Erdfelder, Albert-Georg Lang, and Axel Buchner at in . This version expanded its capabilities to handle more complex analyses, such as those involving and , while incorporating improvements for accuracy and user-friendliness based on user feedback and methodological advancements. A subsequent update in 2009 further refined the program for specific test families, including and analyses, ensuring it remains a robust tool for addressing underpowered studies—a common issue in psychological and biomedical research. Hosted and maintained by the Department of at , G*Power is distributed under terms that prohibit commercial resale but encourage free academic and personal use. Key features of include built-in calculators from various conventions (e.g., Cohen's d, f, or r), support for both a priori and post-hoc power calculations, and the generation of detailed plots to illustrate how power varies with factors like sample size and effect magnitude. It also integrates tutorial resources and a comprehensive to guide users through its functions, making it accessible to students, faculty, and professionals alike. The software's enduring popularity stems from its role in promoting rigorous statistical planning, as evidenced by its frequent citation in methodological guidelines and its integration into curricula for courses.

Development and History

Origins and Creators

was initially developed as a standalone statistical program by German psychologists Edgar Erdfelder, Franz Faul, and Axel Buchner in the early 1990s, with its foundational version originating in 1992 as a tool for Macintosh computers focused on a priori, , and compromise power analyses. The program's creation stemmed from the need to offer researchers in the social and behavioral sciences an accessible, interactive alternative to traditional power tables and charts, which were cumbersome and limited in flexibility for computing sample sizes, power values, and ratios across common statistical tests. At the time, existing software options lacked comprehensive support for compromise analyses and precise handling of noncentral distributions, prompting the developers to design as a menu-driven application capable of high-precision calculations for t tests, F tests, and χ² tests on IBM-compatible PCs and Apple Macintosh systems. The 1996 publication of marked a significant milestone, formalizing the software as a general-purpose tool released as to promote better statistical in behavioral . Erdfelder was affiliated with the , Faul with the University of Kiel, and Buchner with the during this period, reflecting a collaborative effort across academic institutions to address the underutilization of due to the absence of user-friendly tools. This initial iteration emphasized graphical outputs to visualize relationships between power, sample size, and effect sizes, making complex computations more intuitive for non-experts while maintaining algorithmic accuracy through approximations of noncentral distributions. Over time, the project evolved under ongoing contributions from the original creators, with the software later hosted by the Department of Psychology at , where Axel Buchner held a position. This institutional base facilitated subsequent enhancements, but the core origins trace back to the 1990s collaborations aimed at democratizing in empirical sciences.

Evolution of Versions

GPower originated as a DOS-based and Macintosh-compatible program in the mid-1990s, with its initial major release documented in 1996 by developers Edgar Erdfelder, Franz Faul, and Axel Buchner, marking a shift toward a general stand-alone tool for common statistical tests in behavioral research. This early iteration, referred to as GPower 2.0 in some contexts, supported basic platforms including and OS 7-9, and introduced three types of power analyses: a priori, , and compromise, while providing limited graphical output capabilities. A significant evolution occurred with the release of 3.0 in 2007, representing a major overhaul that transitioned the software from its DOS-based roots to a fully () architecture, enhancing accessibility and usability. This version improved algorithmic precision for power calculations, expanded cross-platform compatibility to include / and Mac OS X 10.4, and added support for five types of analyses, including new sensitivity and criterion options, as detailed in its seminal publication. The overhaul addressed limitations in earlier versions by incorporating more robust numerical methods and graphical features for visualizing power curves. In 2009, 3.1 extended the software's capabilities with dedicated modules for correlation and regression analyses, including multiple linear regression, logistic regression, and Poisson regression, building on the foundational improvements of version 3.0 to support a broader range of advanced statistical procedures. Subsequent updates focused on refinement rather than wholesale redesign; for instance, version 3.1.9.2 in 2014 fixed a bug in the χ² goodness-of-fit test where the power was incorrectly set to 1-β instead of β. Further enhancements came in version 3.1.9.7, released on March 17, 2020, which improved plotting functionalities to handle ranges of input values and noncentrality parameters more dynamically. As of November 2025, no major version releases have occurred since 2020, though the software receives ongoing minor patches for compatibility with evolving operating systems, maintaining its availability and functionality through the website. This steady maintenance underscores G*Power's enduring role as a reliable, free tool, with the GUI-driven architecture established in version 3.0 continuing to define its modern operation across Windows and macOS platforms.

Core Functionality

Supported Statistical Tests

G*Power supports a wide array of statistical tests for power analysis, primarily focusing on common and non-parametric procedures used in behavioral, social, and biomedical research. The software categorizes these into t-tests, F-tests, χ²-tests, z-tests, and tests, with each category offering variants tailored to specific research designs. All tests accommodate a priori, post-hoc, and compromise power analyses, allowing users to estimate sample sizes, detect power levels, or balance Type I and Type II error rates based on specified effect sizes, alpha levels, and other parameters. For t-tests, G*Power provides comprehensive support for comparing means, including one-sample tests to assess differences from a constant (H₀: μ = μ₀), independent samples tests for two groups (H₀: μ₁ = μ₂) with options to assume equal or unequal variances through effect size adjustments like Cohen's d, and paired samples tests for dependent means (H₀: μ_d = 0) using effect size d_z. Additional t-test variants include point-biserial correlations (testing ρ = 0 between a dichotomous and continuous variable) and linear regression slope tests (one group or two groups for slope equality). These implementations draw from Cohen's frameworks for effect size standardization, enabling precise power calculations via noncentral t-distributions. The F-tests category encompasses analyses of variance and , supporting fixed-effects one-way ANOVA for k independent groups (H₀: all μ_i equal), factorial ANOVA for main effects and interactions in multi-factor designs, and repeated measures ANOVA variants for within-subjects, between-subjects, or mixed effects (including between-within interactions). ANCOVA is handled through specialized F-tests adjusting for covariates, while multiple options include tests for R² deviation from zero (fixed model) and incremental tests for R² increases with added predictors. Multivariate extensions cover MANOVA global effects, , and Hotelling's T² for mean vector comparisons (one or two groups), with effect sizes like f or f². Variance equality tests between two groups are also included. These tests utilize noncentral F-distributions for computation. χ²-tests in G*Power facilitate goodness-of-fit assessments for single or multiple proportions (e.g., contingency tables) and independence tests in categorical data, with support for as a variant for paired nominal data (testing marginal homogeneity). Effect sizes are based on standardized differences, and power is derived from noncentral χ²-distributions. Z-tests cover proportion comparisons, such as differences between two independent proportions (with pooled or unpooled variance options and continuity corrections), and tests including tetrachoric models for dichotomous variables, and differences between two Pearson r's (independent or dependent, with or without common variables). Logistic and z-tests (Wald or likelihood ratio) assess single or multiple predictors' effects on binary or count outcomes, using ratios or rate ratios as effect sizes. These rely on normal approximations for large samples. Exact tests address small-sample scenarios without asymptotic approximations, including the for single proportions (difference from constant), (special binomial case for p=0.5), for two independent proportions (unconditional), and McNemar's exact test for dependent proportions. Power for these is computed using exact distributions to avoid inflation in low-n situations. Later versions, such as 3.1, expanded support to include advanced correlation models like tetrachoric correlations and generalized linear models (), enhancing applicability to non-normal data without introducing equivalence tests or capabilities.

Power Analysis Methods

Statistical power in G*Power is defined as the probability (1 - β) of correctly rejecting the when it is false, representing the likelihood of detecting a true effect of a specified size. This calculation relies on noncentral distributions under the (H1), such as the noncentral t, F, or distributions, which shift from the central distributions assumed under the (H0) based on a noncentrality that quantifies the effect's magnitude. Key parameters in these power analyses include the effect size (e.g., Cohen's d for t-tests or f for F-tests), the significance level α (typically 0.05), the sample size n, and specific to the test. Effect sizes follow Cohen's conventions, such as small (d = 0.2, f = 0.10), medium (d = 0.5, f = 0.25), and large (d = 0.8, f = 0.40), providing standardized measures of practical significance. G*Power supports several analysis types for power computation: a priori analyses determine required sample size given a target power, , and α; post-hoc analyses estimate achieved power given an observed , α, and sample size; compromise analyses balance Type I (α) and Type II (β) error probabilities; criterion analyses determine the significance level α given power, , and sample size; and sensitivity analyses identify the minimum detectable given power, α, and sample size. The software employs algorithms based on for calculations, utilizing libraries like DCDFLIB to evaluate cumulative functions of noncentral distributions and generate curves. For more complex tests, such as repeated measures ANOVA, methods are applied, including large-sample approximations or corrections where solutions are computationally intensive. Power is formally expressed as: \text{Power} = 1 - \beta = P(T \geq t_{\alpha} \mid H_1) where T is the test statistic under H1, and t_{\alpha} is the critical value from the central distribution at level α; β denotes the Type II error rate. The noncentrality parameter drives this computation; for F-tests, it is given by \lambda = f^2 N, where f is the effect size and N is the total sample size in balanced designs, reflecting the squared effect scaled by sample size to capture deviation from H0 (derived from the expected value of the noncentral F distribution as E(F) = \frac{df_2}{df_2 - 2} \left(1 + \frac{\lambda}{df_1}\right) for df_2 > 2, linking directly to power via the distribution's tail probability). For handling multiple comparisons, G*Power incorporates planned contrasts in ANOVA and MANOVA via contrast matrices, allowing users to specify adjusted α levels or error probabilities to control family-wise error rates without inflating Type II errors. Cluster randomization is addressed indirectly by adjusting the effective sample size for (ICC); users compute the design effect as $1 + (m-1)\rho (where m is average cluster size and ρ is ICC) and divide the nominal sample size by this factor before inputting into standard tests, ensuring power accounts for clustering-induced variance inflation.

User Interface and Operation

Graphical Interface Features

G*Power's graphical user interface centers on a main window that streamlines setup through intuitive selection mechanisms. Users begin by accessing a "Test family" dropdown menu to choose from categories such as F tests, t tests, or χ² tests, which then populates a secondary "Statistical test" menu with relevant options like "Means: Difference between two independent means (two groups)." A central "Type of power analysis" menu or set of radio buttons allows selection among five modes: a priori, , , post-hoc, and , dynamically adjusting the input and output fields accordingly. In the lower left section, dedicated input fields capture essential parameters, including (e.g., Cohen's d or f²), α error probability, desired (1-β), sample size, and test-specific details like . Output visualization is integrated directly into the interface for immediate feedback, with the upper main window displaying overlaid probability distributions of the test statistic under the null hypothesis (H0) and alternative hypothesis (H1), including the decision criterion line and shaded regions for Type I and Type II error probabilities. For exploratory analysis, a dedicated "X-Y plot for a range of values" button opens a separate window generating real-time line graphs, such as power as a function of sample size or effect size across specified ranges, complete with right-click options to copy the image, save it as a file, print, or export underlying data to a table. The menu structure supports advanced customization and workflow efficiency, with an "Options" button revealing dialogs for test-specific criteria, such as alpha balancing in two-sided tests or accrual intervals in sequential designs. A "Protocol of power analyses" tab maintains a log of all performed calculations, enabling users to clear, save to file, print, or copy the for documentation. Batch processing is accommodated via the built-in calculator's scripting support for evaluating multiple expressions sequentially. Cross-platform consistency ensures a uniform experience across operating systems, as the Windows and macOS versions of share an identical layout and toolbar for quick-access functions like calculate and plot generation, with no reported variances. Accessibility features promote for non-experts, including tooltips that appear on hover over input fields to explain parameters and display standard conventions, such as Cohen's guidelines for effect sizes (e.g., small d = 0.2). Built-in checking validates entries in real time, issuing alerts for invalid inputs like negative sample sizes or implausible correlations (e.g., |ρ - ρ0| ≤ ε), and the allows aborting lengthy computations.

Input and Output Options

G*Power allows users to specify inputs through two primary modes: distribution-based and design-based. In the distribution-based mode, users select a test family (such as F, t, χ², z, or exact tests) and enter parameters directly, including (e.g., Cohen's d or f), level (alpha, typically 0.05), and desired (e.g., 0.80). The design-based mode enables specification of study characteristics, such as the number of groups or type of means comparison, facilitating more tailored analyses. Manual entry is supported for all parameters, with options to fix certain values (e.g., alpha and ) while varying others (e.g., sample size) for sensitivity analyses. For advanced inputs, accommodates multilevel and complex designs, including factorial ANOVA with multiple groups (e.g., 2×3×4 designs specifying up to 24 groups) and repeated measures setups that account for the number of measurements per subject. Nonsphericity corrections, such as Greenhouse-Geisser or Huynh-Feldt adjustments, are available for repeated measures ANOVA to handle violations of assumptions. These features allow precise modeling of hierarchical or within-subjects data structures. Outputs in are presented in numerical tables displaying key results, such as achieved power values, required sample sizes (n), and critical values, alongside graphical plots (e.g., power curves versus sample size or ). Plots and tables can be exported in formats including , PDF, and enhanced metafiles for integration into reports. Additionally, text-based protocols detail all input parameters, assumptions, and computation steps. The feature automatically records the full analysis setup upon each calculation, enabling by saving inputs, options, and outputs to a ; for exact tests, it includes the to ensure identical results upon recalculation. This documentation supports auditing and sharing of analyses in research workflows. Error handling in includes warnings for scenarios such as low achieved power (e.g., below typical thresholds like 0.80) or infeasible solutions (e.g., negative sample sizes or invalid parameter combinations like non-positive definite correlation matrices), prompting users to adjust inputs for viable designs. These alerts help prevent underpowered studies during planning.

Applications and Impact

Use in Research Fields

GPower is primarily utilized for a priori sample size planning in psychological experiments, enabling researchers to determine the necessary number of participants to detect effects of interest with adequate statistical . For instance, in designing a two-group independent t-test to detect a medium (Cohen's d = 0.5) at an alpha level of 0.05 and 80% , GPower calculates a required sample size of 128 participants (64 per group), ensuring the study is sufficiently powered to identify meaningful differences without excessive Type II errors. This application is central to , where it supports planning for t-tests, ANOVA, and correlation analyses commonly employed in cognitive and behavioral studies. Beyond , finds extensive use in clinical trials for calculations involving tests to evaluate treatment outcomes, such as comparing categorical response rates between and groups. In the sciences, it facilitates ANOVA-based analyses for survey , helping researchers assess group differences in attitudes or behaviors across multiple conditions. Similarly, in education research, aids in regression analyses to identify predictors of academic performance, allowing for based on expected effect sizes from multiple variables. The software's integration into research workflows is evident in its frequent citation within methods sections of published papers, reflecting its in justifying study designs. The seminal GPower 3 paper by Faul et al. (2007) is highly cited, underscoring its widespread adoption across disciplines. Additionally, with millions of downloads worldwide, GPower serves an educational purpose, commonly taught in statistics courses for the behavioral sciences to instruct students on principles and practical application.

Limitations and Alternatives

While GPower provides robust support for classical frequentist power analyses, it has notable limitations in handling advanced statistical paradigms. The software does not support Bayesian power analysis, which is increasingly used in fields requiring probabilistic under , necessitating separate tools for such computations. Similarly, it lacks integration with models for power calculations in contexts. GPower primarily assumes in its tests without built-in options for robust or non-parametric alternatives, potentially leading to inaccurate estimates in skewed data distributions. Furthermore, it offers no native capabilities for complex experimental designs, such as those involving interactions beyond basic structures, requiring users to resort to programming environments for simulations. Platform-specific constraints also affect . The version (3.1.9.6) trails the Windows version (3.1.9.7) in updates and has historically encountered bugs, such as crashes and glitches, though many have been addressed in patches. As of 2025, no or web-based versions exist, limiting accessibility for users without environments. has seen no major feature updates since around 2020, with minor bug fixes continuing as of August 2025, leaving G*Power without extensions for emerging tests like advanced multilevel modeling, which accounts for hierarchical data structures common in social sciences and education research. This omission can hinder power planning for clustered or longitudinal designs. Alternatives address many of these gaps, often at the cost of GPower's hallmark simplicity. The commercial software supports over 1,200 scenarios, including more advanced tests and analyses, making it suitable for comprehensive planning. For scriptable, open-source options, the pwr enables basic power calculations for t-tests, correlations, and ANOVA via flexible functions, ideal for integration into reproducible workflows. The Superpower specializes in simulation-based for designs, allowing empirical evaluation of complex interactions and non-sphericity corrections that GPower cannot handle natively. While G*Power remains preferred for quick, user-friendly computations in standard scenarios, these alternatives provide greater flexibility for sophisticated or customizable analyses.

References

  1. [1]
    G*Power - Düsseldorf - Psychologie - HHU
    G*Power is a tool to compute statistical power analyses for many different t tests, F tests, χ2 tests, z tests and some exact tests.
  2. [2]
    [PDF] G*Power 3: A flexible statistical power analysis program for the ...
    G*Power (Erdfelder, Faul, & Buchner, Behavior Research Methods, Instruments, & Computers,. 1996) was designed as a general stand-alone power analysis ...Missing: origins creators
  3. [3]
    GPOWER: General Power Analysis Program
    Buchner, A., Faul, F., &Erdfelder, E. (1992).GPOWER: A priori-, post hoc-, and compromise power analyses for the Macintosh [Computer program]. Bonn, Germany ...Missing: original | Show results with:original
  4. [4]
    [PDF] G*Power 3.1 manual - Psychologie
    G*Power covers statistical power analyses for many tests, including F, t, and chi-square tests, and provides effect size calculators and graphics.<|control11|><|separator|>
  5. [5]
    Tools of the Trade: Intra-cluster correlations - World Bank Blogs
    Dec 2, 2012 · As a result, when doing power calculations for cluster randomized ... G*Power is also an nice source for power calculations, as are the ...
  6. [6]
    G*Power 3: A flexible statistical power analysis program for the ...
    G*Power (Erdfelder, Faul, & Buchner, 1996) was designed as a general stand-alone power analysis program for statistical tests commonly used in soci.
  7. [7]
    Sample size determination and power analysis using the G*Power ...
    Jul 30, 2021 · The G*Power software is easy to use for calculating sample size and power for various statistical methods (F, t, χ2, Z, and exact tests), and ...
  8. [8]
    One-way ANOVA Power Analysis | G*Power Data Analysis Examples
    In this unit we will try to illustrate the power analysis process using a simple four group design.
  9. [9]
    Multiple Regression Power Analysis - OARC Stats - UCLA
    In this unit we will try to illustrate how to do a power analysis for multiple regression model that has two control variables, one continuous research ...
  10. [10]
    PSYCH 252: Statistical Methods
    This course offers an introduction to advanced topics in statistics with the focus of understanding data in the behavioral and social sciences. ... G*Power: Power ...