Fact-checked by Grok 2 weeks ago

Operational taxonomic unit

An operational taxonomic unit (OTU) is a cluster of biological sequences or organisms grouped together based on a predefined level of similarity, serving as a practical proxy for taxonomic categories such as in and microbial community analyses. This approach allows researchers to handle large datasets from high-throughput sequencing without relying on complete taxonomic classifications, which can be challenging for microbes lacking clear morphological traits. The concept of OTUs originated in the field of during the 1960s, introduced by Robert R. Sokal and Peter H. A. Sneath as a way to define taxonomic units operationally through quantitative similarity measures rather than evolutionary or morphological criteria. In contemporary , OTUs are most commonly applied to (rRNA) gene sequences for bacterial identification, where sequences sharing at least 97% similarity (or 3% dissimilarity) are clustered into the same unit, approximating species-level diversity. This threshold, while widely used, is an arbitrary and does not always align perfectly with true phylogenetic boundaries, as some species exhibit greater intraspecific variation and others show less interspecific divergence. OTU clustering is a core step in metagenomic pipelines such as QIIME and Mothur, involving dereplication of sequences, error correction, and algorithmic grouping using methods like average neighbor clustering or approaches to reduce from millions of reads to thousands of units. These units enable downstream analyses of microbial composition, alpha and , and ecological patterns in environments like the human gut or microbiomes. However, traditional OTU methods are susceptible to artifacts from amplification biases and sequencing errors, which can inflate diversity estimates or merge distinct taxa. To address these limitations, recent advancements favor amplicon sequence variants (ASVs) over OTUs, as ASVs infer exact sequence variants while accounting for errors, improving , , and taxonomic accuracy without fixed similarity thresholds. Despite this shift, OTUs remain influential in legacy datasets and studies where computational resources limit ASV implementation, underscoring their role in standardizing microbial amid ongoing methodological evolution.

Definition and History

Definition

An operational taxonomic unit (OTU) serves as a practical for taxonomic units in biological , grouping based on measurable similarities in phenotypic or genotypic traits. In contemporary applications, particularly within microbial , OTUs are primarily defined by clustering sequences of marker such as the 16S rRNA for prokaryotes or the 18S rRNA for eukaryotes, where sequences sharing a specified level of are considered representative of the same unit. The primary purpose of OTUs is to approximate or higher taxonomic levels in assessments, especially in environments where traditional cultivation-based methods are infeasible, such as studies of unculturable microorganisms comprising the vast majority of microbial . By enabling the analysis of high-throughput sequencing data from environmental samples, OTUs facilitate the quantification and comparison of community structures without requiring full taxonomic identification. At their core, OTUs represent clusters rather than genuine biological taxa, delineated by arbitrary similarity thresholds that prioritize analytical consistency over strict phylogenetic or evolutionary boundaries; for instance, a 97% sequence identity cutoff is commonly employed as a for species-level in prokaryotic 16S rRNA analyses. This approach underscores that OTUs are operational constructs, not equivalent to formally described taxa. In contrast to traditional taxonomy, which integrates morphological, physiological, and ecological data for , OTUs emphasize reproducible, data-driven grouping to handle the scale and complexity of molecular datasets, often at the expense of finer phylogenetic accuracy.

Historical Development

The concept of the operational taxonomic unit (OTU) originated in , a phenetic approach to classification based on overall similarity rather than evolutionary relationships, as introduced by Robert R. Sokal and Peter H. A. Sneath in their 1963 book Principles of Numerical Taxonomy. In this framework, OTUs were defined as clusters of organisms derived from similarity matrices computed from multiple phenotypic characters, enabling objective grouping without reliance on subjective morphological judgments. This method aimed to promote reproducibility in taxonomy by treating taxa as operational entities based on quantifiable resemblances, initially applied to macroscopic organisms but laying the groundwork for broader applications. The adaptation of OTUs to molecular data marked a significant , particularly in microbial during the and , driven by the advent of (PCR) for amplifying conserved marker genes such as 16S rRNA from environmental samples. This shift enabled the delineation of molecular OTUs (mOTUs) by clustering sequences with high similarity, often using a 97% identity threshold proposed by Stackebrandt and Goebel in to approximate species-level distinctions based on 16S rRNA sequence divergence correlating with DNA-DNA hybridization values. These mOTUs facilitated the study of unculturable microbes, transforming microbial diversity assessments from culture-dependent methods to sequence-based analyses. A key milestone in extending OTUs beyond prokaryotes occurred in 2005, when Blaxter et al. proposed their use for estimating metazoan diversity through of the 18S rRNA gene, emphasizing OTUs as practical proxies for in complex environmental samples like meiofauna. This approach highlighted the versatility of OTUs in handling barcode data without strict alignment to traditional , promoting their application in eukaryotic metabarcoding. With the rise of high-throughput sequencing technologies after 2010, OTUs became integral to metagenomic pipelines, where they aggregated vast numbers of short reads into metrics for . This era saw standardized tools like QIIME and Mothur incorporating OTU clustering to process amplicon data efficiently, though challenges in threshold selection persisted. Schmidt et al. (2014) analyzed global datasets and found that OTUs exhibit high ecological consistency, with habitat preferences akin to true taxa, though this varies with clustering methods and thresholds.

OTU Delineation Methods

Clustering Approaches

Clustering approaches for operational taxonomic units (OTUs) in microbial primarily involve strategies that group sequences based on similarity, either independently or with reference to established databases. These methods balance the discovery of novel taxa with computational efficiency and comparability across studies. The three principal paradigms are clustering, closed-reference clustering, and open-reference clustering, each suited to different research contexts in microbial ecology and . De novo clustering groups sequences solely based on their internal similarity thresholds, without relying on any external reference database. This approach is particularly suitable for exploring novel environments where microbial diversity may include taxa absent from current databases, allowing for the identification of previously undescribed OTUs. By treating the input sequences as the complete for clustering, de novo methods maximize the potential for discovery but can be sensitive to sequencing errors and require careful parameter selection to avoid over- or under-clustering. In contrast, closed-reference clustering assigns input sequences to pre-existing OTUs defined in a reference database, such as Greengenes or , by matching them at a specified similarity ; any sequences that fail to match are discarded from the analysis. This method ensures consistency with prior studies using the same reference, facilitating direct comparisons of microbial communities across datasets, and provides reliable taxonomic assignments for known taxa. However, it inherently limits analysis to documented diversity, potentially underrepresenting novel or rare microbes in underrepresented environments. Open-reference clustering represents a hybrid strategy that first applies closed-reference assignment to match sequences against a database and then performs clustering on the remaining unmatched sequences to form additional OTUs. This approach combines the strengths of both paradigms, retaining all input sequences while leveraging reference-based consistency for the majority of taxa, and is especially useful for datasets expected to contain both known and novel elements. It mitigates some limitations of pure closed-reference methods by incorporating novel OTUs, though it may still inherit biases from the reference database. Comparisons among these approaches highlight trade-offs in discovery, accuracy, and efficiency. clustering excels in capturing in understudied systems but carries higher risks of erroneous groupings due to in sequencing data and demands substantial computational resources. Closed-reference clustering promotes and taxonomic precision across studies but sacrifices novel discoveries, while open-reference offers a balanced alternative with comprehensive coverage. Notably, closed-reference methods can reduce computational load by up to 80% compared to approaches in large-scale datasets, such as those involving billions of sequences, due to their parallelizable nature and exclusion of unmatched reads.

Algorithms and Thresholds

Common algorithms for OTU delineation include UPARSE, which employs a ing approach to simultaneously filter chimeras and sequences, reducing computational demands while maintaining accuracy. UCLUST, a centroid-based method integrated into the USEARCH toolkit, rapidly sequences by aligning them to existing centroids and assigning new only when similarity falls below the threshold, enabling efficient handling of large datasets. CD-HIT performs length-dependent ing, prioritizing longer representative sequences and using a shortest-word-first strategy to accelerate pairwise comparisons, particularly suited for sequences in metagenomic analysis. Similarity thresholds in OTU clustering are typically set at 97% sequence for -level delineation, derived from observations that approximately 3% divergence in 16S rRNA genes correlates with distinct bacterial based on DNA-DNA hybridization . For strain-level , a 100% is used to identify single variants without grouping, capturing intraspecies diversity more precisely. These thresholds stem from early studies on 16S rRNA variability, where intra-species differences rarely exceed 3%, providing a practical proxy for taxonomic boundaries in uncultured microbes. The standard for OTU clustering begins with denoising to correct sequencing errors, followed by detection to remove artifactual , and concludes with clustering using hierarchical or distance-based methods such as average to group remaining . similarity is calculated as the percentage of matching bases over the total length, often via pairwise or multiple alignments: \text{Sequence Identity} = \left( \frac{\text{Number of Matching Bases}}{\text{Total Alignment Length}} \right) \times 100 This metric ensures consistent grouping by quantifying evolutionary distance. The UPARSE algorithm specifically mitigates OTU inflation—where erroneous sequences spawn spurious clusters—by merging near-identical reads early in the process, reporting OTU sequences with ≤1% incorrect bases compared to >3% with other methods in artificial community tests, resulting in fewer OTUs.

Applications

In Microbial Ecology

In microbial ecology, operational taxonomic units (OTUs) derived from 16S rRNA gene sequencing serve as fundamental units for quantifying community structure through , which measures within-sample richness and evenness, and , which assesses compositional turnover between samples, often using OTU abundance tables as input for metrics like Shannon's index or Bray-Curtis dissimilarity. These OTU-based approaches enable researchers to compare microbial diversity across environmental gradients, such as varying or host conditions, revealing patterns like higher in soils compared to . For instance, OTU tables facilitate the calculation of to track community shifts in response to disturbances, providing insights into . OTUs have been widely applied to study diverse microbial ecosystems, including microbiomes where they help delineate bacterial consortia influencing nutrient cycling, and gut ecosystems where they illuminate host-microbe interactions in and . In systems, OTU analyses have tracked microbial during organic matter , showing phased shifts from copiotrophic to oligotrophic taxa over time. Similarly, in gut microbiomes, OTUs have been used to monitor community recovery following antibiotic perturbations, documenting the reassembly of Firmicutes and Bacteroidetes-dominated assemblages post-treatment. A notable example of OTU-based involves identifying taxa in communities, where network approaches using OTU abundances have highlighted genera like as central hubs sustaining plant growth promotion through nutrient solubilization and suppression. Such OTUs, often comprising less than 1% of total abundance, disproportionately influence community stability by mediating resource flows in the plant-soil interface. OTUs integrate into ecological models by serving as nodes in co-occurrence networks, allowing inference of potential symbiosis or competition among microbes; for example, positive correlations between OTUs may indicate mutualistic interactions in carbon degradation pathways, while negative links suggest antagonism in nitrogen-limited soils. These networks, constructed from OTU co-abundance data, reveal modular structures that reflect ecological guilds, enhancing predictions of community responses to environmental changes. Additionally, OTUs enable rarefaction techniques to standardize sampling effort across datasets, ensuring unbiased estimates of diversity indices like Shannon entropy, which accounts for both richness and evenness.

In Metagenomics

In , operational taxonomic units (OTUs) are integral to workflows for analyzing high-throughput sequencing , particularly in marker-gene surveys targeting the 16S rRNA . The typical begins with quality filtering of raw reads to remove low-quality sequences, adapters, and chimeric artifacts, ensuring reliable downstream clustering. This is followed by OTU picking, where sequences are clustered based on similarity thresholds, often using software like QIIME or Mothur; for instance, QIIME employs or reference-based clustering to group sequences into OTUs at 97% identity, reducing while approximating species-level resolution. Taxonomic assignment then occurs by aligning representative OTU sequences against curated databases, such as the Ribosomal Database Project (RDP) using Bayesian classification or for homology-based identification, enabling the profiling of microbial composition in complex samples. OTUs are particularly valuable in marker-gene surveys, where millions of 16S rRNA reads from environmental samples—such as , , or host-associated microbiomes—are processed to characterize uncultured bacterial diversity. These surveys amplify hypervariable regions of the 16S gene, generating amplicon sequence data that OTUs cluster into manageable units, facilitating the detection of rare taxa and community structure without the need for full genome assembly. This approach has been widely applied to uncover microbial profiles in diverse ecosystems, revealing previously unknown lineages that dominate uncultured environments. A prominent example is the Human Microbiome Project (2007–2013), which utilized 97% identity OTU clustering on 16S rRNA sequences from approximately 5,000 samples across up to 18 body sites from 242 healthy adults to catalog the human gut flora and other microbial communities. This effort identified core OTUs shared across individuals, such as those from and Firmicutes, providing a foundational reference for variability and health associations. The scalability of OTUs supports efficient downstream analyses, including principal coordinate analysis (PCoA) for ordination of , where OTU abundance tables are transformed into distance matrices (e.g., Bray-Curtis) to visualize sample similarities in low-dimensional space, aiding in the interpretation of large-scale datasets. In shotgun metagenomics, which sequences total DNA without targeting specific genes, OTUs can be applied to cluster assembled contigs at similarity thresholds to delineate putative microbial genomes or functional elements, enhancing taxonomic binning and annotation of metabolic pathways. For example, contigs from assembly are grouped into OTU-like units to infer community functions, such as production, by mapping to databases like .

Limitations and Alternatives

Challenges of OTU-Based Analysis

One major challenge in OTU-based analysis stems from sequencing and artifacts, which introduce errors that inflate the number of spurious OTUs and lead to overestimated microbial diversity. PCR chimeras, formed during amplification, and read errors from platforms like Illumina can create artificial sequence variants that cluster into distinct OTUs, particularly affecting low-abundance taxa. For instance, spurious sequences resulting from these errors can significantly increase estimates if not adequately filtered, as they mimic rare species without corresponding biological reality. The reliance on arbitrary similarity thresholds, such as the commonly used 97% sequence identity cutoff, further complicates OTU delineation, as this value lacks universal applicability across microbial taxa and often fails to align with ecological boundaries. This threshold, originally derived from early 16S rRNA studies, does not consistently partition communities into ecologically meaningful groups, leading to inconsistent preferences among OTUs. A global-scale demonstrated that OTUs at 97% similarity exhibit variable ecological consistency, with many failing to show predictable environmental associations comparable to true taxa. Database dependency introduces additional biases depending on the OTU picking strategy employed. Closed-reference methods, which align sequences to a predefined database, discard unmatched reads and thus miss novel lineages not represented in the reference, potentially underestimating diversity in underexplored environments. In contrast, clustering avoids database limitations but tends to over-split low-abundance sequences due to error-induced variability, resulting in fragmented OTUs that dilute signal from rare but real taxa. Comparative evaluations show approaches outperforming closed-reference in capturing sequence distances but at the cost of higher computational demands and instability for sparse data. Reproducibility remains a critical issue, as variations in analysis pipelines—such as those implemented in QIIME versus USEARCH—can produce divergent OTU tables from identical input data, undermining comparative studies. Differences arise from algorithmic choices in clustering (e.g., hierarchical versus methods) and parameter sensitivity, leading to inconsistent OTU assignments. Studies report up to 20% discordance in OTU membership between algorithms on the same dataset, with some methods yielding substantially more OTUs due to poorer handling of sequence order or error tolerance. These challenges have prompted the development of alternatives like amplicon sequence variants (ASVs), which aim to mitigate error-driven variability through denoising rather than clustering.

Amplicon Sequence Variants (ASVs)

Amplicon Sequence Variants (ASVs) represent a contemporary approach in microbial community analysis, identifying exact amplicon sequences at single-nucleotide resolution through error-aware denoising methods that distinguish true biological variants from sequencing artifacts. Unlike traditional clustering, ASVs are generated without relying on arbitrary similarity thresholds, enabling precise resolution of closely related taxa. Prominent algorithms for ASV inference include DADA2, which employs a parametric model to estimate and correct sequencing errors based on observed transitions between , and UNOISE3, which uses an entropy-based approach to denoise sequences and construct zero-radius operational taxonomic units (ZOTUs) equivalent to ASVs. These methods reduce the generation of spurious variants caused by (PCR) or sequencing errors, thereby enhancing the accuracy of microbial profiles. Key advantages of ASVs over OTUs include improved across independent studies due to standardized sequence outputs and superior resolution of intra-species diversity, allowing detection of subtle ecological differences. Recent studies from 2024 and 2025 highlight ASVs' superior performance in estimation; for instance, a analysis of metabarcoding datasets demonstrated that ASVs produced higher estimates of , and compared to OTUs, which systematically underestimated due to clustering artifacts. Similarly, a 2025 study in and Evolution examining 5S rRNA intergenic spacers found that DADA2-derived ASVs effectively pruned over-split clusters observed in 100% identity OTUs, yielding more biologically coherent phylogenetic groupings with greater computational efficiency. ASVs also better discriminate ecological patterns in microbial communities, as evidenced by seminal work showing their enhanced relative to OTU methods. A 2025 review of 16S rRNA amplicon sequencing noted DADA2's consistent outputs in metabarcoding applications, contrasting with OTU-based underestimation of , though it acknowledged occasional ASV over-splitting in samples.

References

  1. [1]
    Assessing and Improving Methods Used in Operational Taxonomic ...
    Assessing and Improving Methods Used in Operational Taxonomic Unit-Based Approaches for 16S rRNA Gene Sequence Analysis · Abstract · INTRODUCTION · MATERIALS AND ...
  2. [2]
    Operational Taxonomic Units – Molecular Ecology & Evolution
    An OTU is a cluster of organisms grouped together based on a set level of similarity, often using DNA sequences. OTUs serve as proxies for species or higher- ...
  3. [3]
    Operational Taxonomic Unit - an overview | ScienceDirect Topics
    ... Sokal and Peter Sneath. In Sokal and Sneath's original 1963 manifestation of the Principles oj Numerical Taxonomy, any evolutionary approach is avoided in ...
  4. [4]
    A perspective on 16S rRNA operational taxonomic unit clustering ...
    Apr 20, 2016 · A perspective on 16S rRNA operational taxonomic unit clustering using sequence similarity · Introduction · Results and discussion · Materials and ...
  5. [5]
    Microbiome Analysis via OTU and ASV-Based Pipelines—A ... - NIH
    The resulting consensus sequences are referred to as operational taxonomic units (OTUs). Said threshold is typically set at 97% identity, with the OTU ...3. Results · 3.2. Sequencing Pipeline... · 4. Discussion
  6. [6]
    Exact sequence variants should replace operational taxonomic units ...
    Jul 21, 2017 · We argue that the improvements in reusability, reproducibility and comprehensiveness are sufficiently great that ASVs should replace OTUs as the standard unit ...Abstract · Introduction · Discussion
  7. [7]
    Ecological Consistency of SSU rRNA-Based Operational Taxonomic ...
    Operational Taxonomic Units (OTUs), usually defined as clusters of similar 16S/18S rRNA sequences, are the most widely used basic diversity units in large- ...Missing: scholarly | Show results with:scholarly
  8. [8]
    Reconciliation between operational taxonomic units and species ...
    Assessing and improving methods used in operational taxonomic unit-based approaches for 16S rRNA gene sequence analysis. Appl Environ Microb 2011;77:3219–26 ...
  9. [9]
    Defining DNA-Based Operational Taxonomic Units for Microbial ...
    We describe a computer-based tool that assigns DNA sequences from environmental databases to operational taxonomic units at approximately species-level ...
  10. [10]
    A Place for DNA-DNA Reassociation and 16S rRNA Sequence ...
    Jan 10, 1994 · A place for DNA-DNA reassociation and 16S rRNA sequence analysis in the present species definition in bacteriology.Missing: OTU | Show results with:OTU
  11. [11]
    Updating the 97% identity threshold for 16S ribosomal RNA OTUs
    The 97% threshold was proposed in 1994 (Stackebrandt and Goebel, 1994) when few 16S rRNA sequences were available, raising the question of whether this ...
  12. [12]
    Principles of Numerical Taxonomy - Google Books
    Sokal, Peter Henry Andrews Sneath. Publisher, W. H. Freeman, 1963. Original from, Rutgers University. Digitized, Aug 26, 2022. Length, 359 pages. Export ...<|separator|>
  13. [13]
    Use of 16S rRNA and rpoB Genes as Molecular Markers for ...
    Molecular microbial ecology commenced in 1990 with the direct amplification and sequencing of 16S rRNA genes from the environment (17). This procedure ...
  14. [14]
    Defining operational taxonomic units using DNA barcode data
    (d) Molecular operational taxonomic unit definition. The sequencing was ... phenotypic variation in deep-sea limpets, Biology Letters, 15:10, Online ...<|control11|><|separator|>
  15. [15]
    High-Throughput Metagenomic Technologies for Complex Microbial ...
    Jan 27, 2015 · The shotgun metagenomic sequencing approach provides community-level information in complex environments with thousands to millions of different ...
  16. [16]
    Assessment of Common and Emerging Bioinformatics Pipelines for ...
    Metagenomics based on high-throughput sequencing (HTS) helps biologists unveil a large part of the constitutive microorganisms of a microbiota. This culture ...Missing: post- | Show results with:post-
  17. [17]
    Subsampled open-reference clustering creates consistent ... - NIH
    We also compare these methods to de novo and closed-reference OTU picking, and explore the effect of dataset and algorithm parameters on runtime and analysis ...Missing: seminal | Show results with:seminal
  18. [18]
  19. [19]
    LotuS: an efficient and user-friendly OTU processing pipeline
    Sep 30, 2014 · A fast and user-friendly open-source tool to calculate denoised, chimera-checked, operational taxonomic units (OTUs).
  20. [20]
    On the use of sequence-quality information in OTU clustering - PMC
    Aug 16, 2021 · In this paper, we present quality-aware clustering methods inspired by quality-weighted alignments and model-based denoising, and explore their applicability ...
  21. [21]
    Key features and guidelines for the application of microbial alpha ...
    Jan 3, 2025 · Alpha diversity is a general term for metrics that describe the species richness, evenness, or diversity within a sample. In contrast, beta- ...
  22. [22]
    Alpha-, beta-, and gamma-diversity of bacteria varies across habitats
    Sep 23, 2020 · We found that soils contained the highest bacterial richness within a single sample (alpha-diversity), but sediment assemblages displayed the highest gamma- ...
  23. [23]
    Measuring the diversity of the human microbiota with targeted next ...
    Dec 26, 2016 · (B) Alpha and beta diversity can be qualitatively described considering the species accounted by each sample or the species shared by pairs of ...
  24. [24]
    Forest succession improves the complexity of soil microbial ...
    Collectively, our findings suggest that forest succession improves the complexity of soil microbial interactions and the ecological stochasticity of community ...
  25. [25]
    Functional and Structural Succession of Soil Microbial Communities ...
    Our results revealed the pattern of functional and compositional succession in soil microbial communities during decomposition of human-derived organic matter.
  26. [26]
    Trait-based community assembly and succession of the infant gut ...
    Feb 1, 2019 · Functional redundancy among gut microbiome taxa may benefit the host by improving community resilience in response to disturbance53.
  27. [27]
    Specialized metabolic functions of keystone taxa sustain soil ...
    Jan 31, 2021 · This study provides new insights into our understanding of the relationships between soil microbiome biodiversity and ecosystem stability
  28. [28]
    Changes in the structure and function of rhizosphere soil microbial ...
    Mar 27, 2023 · The top 5 node OTUs were considered as keystone taxa. ... Our findings highlighted the importance of keystone taxa in multiple communities ...
  29. [29]
    Using network analysis to explore co-occurrence patterns in soil ...
    Overall, the soil microbial network was comprised of highly connected OTUs (∼5 edges per node) structured among densely connected groups of nodes (that is, ...
  30. [30]
    Structure and co-occurrence patterns in microbial communities ...
    Apr 12, 2018 · Using network analysis, we examined changes in putative ecological interactions among microbial Operational Taxonomic Units (OTUs) or 'nodes', ...Results · Methods · Microbial Diversity And...<|separator|>
  31. [31]
    Estimating and comparing microbial diversity in the presence ... - NIH
    Feb 1, 2016 · Traditional sample-size-based rarefaction for species richness has been widely applied in ecology as a standardization method and also suggested ...
  32. [32]
    Using QIIME to analyze 16S rRNA gene sequences from Microbial ...
    By default, QIIME uses the Ribosomal Database Project (RDP) classifier to assign taxonomic data to each representative sequence from stage (b). In the directory ...
  33. [33]
    A framework for assessing 16S rRNA marker-gene survey data ... - NIH
    Mar 13, 2020 · Background. Studies often use 16S rRNA marker-gene surveys (targeted sequencing of the 16S rRNA gene) to characterize microbial communities.
  34. [34]
    Improved OTU-picking using long-read 16S rRNA gene amplicon ...
    Oct 5, 2015 · Table 3 summarizes the number of OTUs formed at similarity thresholds 1–6 % (1 % increments). Overall, MiSeq generated 1.86- to 3.06-fold more ...Missing: rationale seminal
  35. [35]
    A Core Human Microbiome as Viewed through 16S rRNA Sequence ...
    Jun 13, 2012 · Using the OTU approach, we analyze the HMP 16S tag-sequencing data to look for organisms that occur in most or all healthy subjects. The depth ...
  36. [36]
    Distribution-Based Clustering: Using Ecology To Refine the ...
    De novo clustering has fewer correct OTUs because some sequences are merged into the same OTU. Closed-reference clustering retains fewer correct OTUs because ...Missing: seminal | Show results with:seminal
  37. [37]
    The effect of low-abundance OTU filtering methods on the reliability ...
    Jun 11, 2023 · PCR amplicon sequencing may lead to detection of spurious operational taxonomic units (OTUs), inflating estimates of gut microbial diversity.<|control11|><|separator|>
  38. [38]
    De novo clustering methods outperform reference-based ... - NIH
    This study demonstrates that de novo methods are the optimal method of assigning sequences into OTUs and that the quality of these assignments needs to be ...Missing: seminal | Show results with:seminal
  39. [39]
    Stability of operational taxonomic units: an important but neglected ...
    May 20, 2015 · The operational taxonomic unit (OTU) is widely used in microbial ecology. Reproducibility in microbial ecology research depends on the ...
  40. [40]
    A Comparison of Methods for Clustering 16S rRNA Sequences ... - NIH
    Aug 13, 2013 · ... OTU clustering and the appropriate abundance thresholds depend on individual clustering algorithms. Our results may provide general ...
  41. [41]
    To denoise or to cluster, that is not the question - BMC Bioinformatics
    Apr 5, 2021 · The results of the former are called Amplicon Sequence Variants (ASVs) and those of the latter ZOTUs (zero-radius OTUs).
  42. [42]
    Qualitative comparison of DADA2, Deblur, and UNOISE3
    Subsequently, DADA2 (Callahan et al., 2016) was applied to correct sequencing errors, merge pair-ended reads, and eliminate chimeric sequences and sequences ...
  43. [43]
    Denoising Amplicon Sequence Variants Using DADA2, DeBlur, and ...
    Nov 30, 2020 · UNOISE3 can be applied across sequencing runs and is very fast (mapping reads back to zOTUs often longest step). However, 64-bit version ...
  44. [44]
  45. [45]
    ASV vs OTUs clustering: Effects on alpha, beta, and gamma ...
    Oct 3, 2024 · The results showed that the OTUs clustering proportionally led to a marked underestimation of the ecological indicators values for species diversity.Missing: flora | Show results with:flora
  46. [46]
    Comparing OTUs and ASVs in High‐Throughput Sequencing of 5S ...
    Oct 7, 2025 · In this work, we compared operational taxonomic units (OTUs, 100% identity) and amplicon sequence variants (ASVs) from 5S intergenic spacer (5S- ...Missing: paper | Show results with:paper
  47. [47]
    The unresolved struggle of 16S rRNA amplicon sequencing - NIH
    May 13, 2025 · ASV algorithms—led by DADA2— resulted in having a consistent output, yet suffered from over-splitting, while OTU algorithms—led by UPARSE— ...