medRxiv is a free online preprint server dedicated to the rapid dissemination of unpublished manuscripts in the health sciences, including clinical, public health, and medical research.[1] Launched in June 2019 by Cold Spring Harbor Laboratory in collaboration with Yale University and BMJ, it provides a platform for researchers to share complete but unpeer-reviewed studies, explicitly cautioning that such preprints should not inform clinical decisions or policy due to the absence of formal validation.[2][3]The server has facilitated accelerated knowledge sharing, particularly during the COVID-19 pandemic, where it hosted over 20,000 submissions in its first year alone, enabling early visibility of emerging data on epidemiology, treatments, and vaccines.[4] However, this speed has drawn criticism for amplifying unverified claims, with some preprints later retracted or contradicted upon peer review, contributing to public confusion and misguided applications in media reporting and decision-making.[5] medRxiv employs moderation to screen out submissions posing risks of harm, such as unsubstantiated alarms about carcinogens, yet instances of removed content critical of public health measures have raised questions about selective oversight in a field prone to institutional biases favoring consensus narratives.[6][7] As of 2025, management has transitioned to the independent non-profit openRxiv, reflecting an evolution toward community-driven governance while maintaining its core mission of open access to preliminary research.[8]
History
Founding and Early Development
medRxiv was founded in June 2019 through a collaboration between Cold Spring Harbor Laboratory (CSHL), Yale University, and BMJ to establish a dedicated preprint server for health sciences research.[2][9] The initiative addressed longstanding barriers in medical publishing, where traditional peer review often delayed dissemination of findings by months or years, limiting timely access for researchers and clinicians.[2] The server was announced on June 5, 2019, with submissions opening the following day, June 6, emphasizing a publisher-neutral platform to foster openness and collaboration in clinical and public health studies.[2]Key figures in the founding included John Inglis and Richard Sever from CSHL, which had successfully operated the biology-focused bioRxiv since 2013; Joseph S. Ross and Harlan Krumholz from Yale School of Medicine, who advocated for greater transparency in clinical research; and Theodora Bloom and Claire Rawlinson from BMJ, bringing expertise in medical publishing standards.[2][9] This partnership leveraged CSHL's technical infrastructure for preprint management, Yale's clinical research insights, and BMJ's global reach in health knowledge dissemination to create a non-profit service independent of commercial interests.[2]In its early phase, medRxiv implemented features such as digital object identifiers (DOIs) for each preprint to ensure citability and discoverability, alongside formation of an international advisory board to guide policies on content suitability for health sciences.[2] The platform prioritized rapid posting while acknowledging sensitivities in medical research, such as potential risks from unvetted clinical data, though initial operations focused on building submission workflows modeled after bioRxiv's proven system.[2] By mid-2019, it had begun attracting submissions in areas like epidemiology and therapeutics, setting the stage for broader adoption amid emerging global health challenges.[9]
Launch and Initial Safeguards
MedRxiv began accepting manuscript submissions on June 6, 2019, following an announcement on June 5, 2019, by its founding partners: Cold Spring Harbor Laboratory, Yale University School of Medicine, and BMJ Publishing Group.[2][10] The platform was established to provide a dedicated preprint server for health sciences research, addressing the absence of such a resource in fields like clinical trials and public health, where rapid dissemination was hindered by traditional peer review delays.[11]Unlike bioRxiv, which operates with minimal screening to prioritize speed in basic biology, medRxiv implemented initial safeguards tailored to the higher stakes of medical research, including potential risks to patient safety and public health from unvetted clinical data.[11] Submissions underwent a multi-step screening process: initial in-house review by staff with scientific and editorial expertise checked for completeness, plagiarism, and basic scientific formatting, followed by evaluation by external clinical experts for ethical compliance, such as appropriate oversight of human subjects research and clinical trial registration.[12][13] Content posing health risks, lacking ethical standards, or containing offensive or non-scientific material was rejected, though scientific validity or methodological rigor was not assessed, preserving the preprint model's emphasis on timeliness over endorsement.[1]These safeguards were developed in response to skepticism in the medical community about preprints' reliability, particularly after incidents of flawed COVID-19-related preprints on other platforms highlighting risks of misinformation in policy-influencing research.[11] The process aimed to balance accessibility with responsibility, screening roughly 4-5 days per submission without impeding the core goal of free, open distribution of unpublished manuscripts.[14] Early adoption was cautious, with endorsements from organizations like the World Health Organization underscoring the platform's commitment to mitigating harms while enabling breakthroughs in fields wary of bioRxiv's lighter-touch approach.[15]
Expansion During COVID-19
During the COVID-19 pandemic, medRxiv experienced a marked surge in submissions, driven largely by the global urgency to share preliminary research on SARS-CoV-2, clinical interventions, and public health responses. From its launch in June 2019 through September 2020, the server received 27,413 submissions, of which 21,862 (79.8%) were posted following screening; while monthly submissions grew steadily from 100-200 pre-pandemic, COVID-19-related papers accelerated this to peaks exceeding 3,000 per month by mid-2020.[4] This represented a roughly 10-fold increase in overall submissions between January and May 2020 alone, with COVID-19 topics comprising up to 80% of posted preprints at the height of the crisis.[16]The influx necessitated adaptations in operational capacity, including enhanced screening protocols to handle the volume while upholding safeguards against unsubstantiated claims, though the server prioritized rapid posting for non-controversial COVID-19 submissions to facilitate timely scientific discourse. By December 2020, annual postings reached approximately 14,290, with 8,858 (62%) focused on COVID-19, reflecting the platform's role in disseminating over 16,000 such preprints cumulatively through 2021 (59.5% of total output).[17] Usage metrics mirrored this expansion, with abstract views escalating from tens of thousands monthly pre-2020 to millions during peak pandemic periods, underscoring medRxiv's emergence as a key venue for health sciences preprints amid journal backlogs.[4]Post-acute phase, COVID-19 preprints declined to about 7% of submissions by 2023, yet the pandemic-induced growth persisted in non-COVID domains, solidifying medRxiv's infrastructure for sustained higher volumes and broader adoption in clinical research dissemination.[18] This expansion highlighted both the platform's utility in crisis response and challenges in maintaining quality amid rapid scaling, with only 0.17% of preprints withdrawn overall, including some COVID-related retractions due to errors or ethical concerns.[19]
Transition to Independence
On March 11, 2025, medRxiv transitioned from operation under Cold Spring Harbor Laboratory (CSHL) to independent governance as part of the newly established non-profit organization openRxiv, which also oversees bioRxiv.[8][20] This shift marked the maturation of the preprint servers, originally launched as experiments at CSHL, into a self-sustaining entity focused on long-term viability in open science.[21] CSHL co-founders John Inglis and Richard Sever described the change as a "natural evolution," emphasizing that while the laboratory supported the platforms' mission, their growth necessitated separation to ensure continued rapid dissemination of research without institutional dependencies.[8]The openRxiv structure establishes a researcher-led framework with a board of directors drawn from the scientific community, aimed at enhancing operational independence and broadening preprint sharing in life and health sciences.[22][23] This transition preserves core features like non-peer-reviewed posting with screening for policy compliance, while enabling expanded funding models, such as grants and partnerships, to support scalability amid rising submission volumes post-COVID-19.[24] CSHL retained no ownership or control, viewing the independence as aligned with the platforms' original goal of advancing scientific discovery through accessible preprints.[8]
Purpose and Scope
Core Objectives
medRxiv's core objectives center on accelerating the dissemination of research in the health sciences by providing a platform for sharing complete but unpublished manuscripts, known as preprints, prior to formal peer review.[1] This enables researchers to rapidly communicate findings on human health studies, including clinical trials and epidemiological analyses, where delays in traditional publishing can hinder timely application to public health challenges.[25] By prioritizing speed, medRxiv addresses the limitations of journal-based systems, which often impose months-long review periods, thereby fostering earlier feedback and iteration in sensitive fields like medicine.[26]A key aim is to enhance the openness and accessibility of scientific findings, allowing global researchers to access preliminary data without paywalls or subscription barriers.[1] This promotes broader collaboration, as authors can solicit input from peers worldwide, potentially refining studies before submission to journals.[26] medRxiv also seeks to document the provenance of ideas, establishing timestamps for priority and reducing disputes over originality in fast-moving research areas.[1]Unlike peer-reviewed outlets, medRxiv does not endorse content through validation but focuses on enabling discourse while implementing safeguards against misinformation, such as mandatory screening for ethical compliance, plagiarism, and basic scientific plausibility.[25] These objectives position medRxiv as a complementary tool to formal publishing, emphasizing provisional sharing to advance knowledge without supplanting rigorous evaluation.[26] The server specifically targets health-related research excluded from general biology preprint platforms, ensuring domain-appropriate handling of clinical and policy-sensitive topics.[1]
Differences from bioRxiv and Other Servers
medRxiv specializes in preprints describing human health research, including clinical trials, epidemiological studies, and public health analyses, whereas bioRxiv focuses on broader biological sciences such as molecular biology, genetics, and ecology.[1] Clinical trials and studies with direct implications for patient care or public health policy are directed to medRxiv, with bioRxiv explicitly recommending submission of such papers to its medical counterpart since June 2019.[27] Manuscripts cannot be posted to both servers, and dual submissions result in withdrawal to maintain distinct scopes.[25]Unlike bioRxiv, which employs basic screening for plagiarism, ethical compliance, and scientific content validity, medRxiv implements more rigorous pre-screening protocols to mitigate risks associated with unvetted clinical research, such as potential misinterpretation leading to harm in medical practice.[11][28] This includes mandatory disclaimers on all medRxiv preprints warning readers that findings are preliminary and not peer-reviewed, with heightened scrutiny for studies involving human subjects or therapeutic claims.[29]bioRxiv's lighter touch reflects its origins in basic biological research, where immediate clinical risks are lower.[27]Compared to general preprint servers like arXiv (physics and mathematics) or SSRN (social sciences), medRxiv's health-specific focus incorporates domain-expert review during screening to ensure alignment with medical research standards, a feature absent in non-specialized platforms that prioritize rapid dissemination without field-specific safeguards.[30] Other health-oriented servers, such as Research Square, may allow broader interdisciplinary uploads but lack medRxiv's integration with biomedical indexing like PubMed Central or its emphasis on excluding non-human health topics.[1][30]
Operational Features
Submission and Screening Procedures
Authors submit preprints to medRxiv through the online Manuscript Processing System at submit.medrxiv.org, where they create an account or log in via ORCID if available.[31] The preferred method involves uploading a single PDF file containing the full manuscript, including title, author list with affiliations and corresponding author contact, abstract, keywords, main text, references, and any figures or tables.[31] Manuscripts must report original research or clinical protocols in the health sciences, excluding reviews, commentaries, or case reports, and authors are required to declare ethical approvals, funding sources, conflicts of interest, and—for clinical trials—registration details from a WHO-compliant registry such as ClinicalTrials.gov.[13] No submission fees apply, and authors retain copyright, licensing content under a Creative Commons Attribution-NonCommercial-NoDerivs (CC BY-NC-ND) license by default, though alternatives like CC0 or CC BY are permitted.[25]Upon submission, preprints undergo a multi-step screening process rather than peer review, designed to exclude offensive, non-scientific, or potentially harmful content without evaluating scientific validity.[1] The first step, conducted by in-house staff with scientific and editorial expertise, verifies that submission metadata matches the manuscript, confirms scope alignment with health sciences research, checks for plagiarism via automated tools, ensures absence of patient identifiers, and confirms declarations of ethical oversight and clinical trial registration where required.[13] If discrepancies or violations are identified—such as unregistered trials, missing ethics statements, or out-of-scope material—the submission may be rejected or returned for correction; escalated cases involving potential public health risks are discussed by the content team, founders, or external advisors.[13]The second screening step involves medRxiv affiliates reviewing for broader compliance, including whether the content pertains to health-related research and poses no risk of public harm, such as unsubstantiated claims that could influence clinical practice adversely.[13] Screening typically completes in 2–4 days but may extend over weekends, holidays, or for complex reviews; successful preprints are posted online without editing, typesetting, or formatting changes, assigned a DOI, and made publicly accessible immediately unless authors request a brief hold.[13] Authors can submit revisions to existing preprints via the author dashboard, updating the version while preserving the original and prior iterations.[25] This process prioritizes rapid dissemination while mitigating risks, as emphasized in medRxiv's disclaimers that preprints are preliminary and not suitable for guiding clinical decisions.[1]
Content Policies and Restrictions
MedRxiv maintains content policies centered on screening submissions for relevance to health sciences, ethical compliance, and absence of harm, without formal peer review. Manuscripts must pertain to clinical research in health or medicine, excluding narrative reviews, commentaries, case reports, or non-research materials.[13] Submissions require complete author details, ethical oversight declarations (such as institutional review board approval or exemption), and clinical trial identifiers where applicable; failures in these areas lead to rejection.[13]The screening process, conducted by in-house staff and medRxiv affiliates, occurs in stages and typically spans 2–4 days. Initial checks verify submission completeness, appropriate article type, and scope alignment, while scanning for patient identifiers or plagiarism via automated tools. Further evaluation assesses potential public harm, such as content that could discourage vaccination or public health measures, or promote unsafe practices; offensive, non-scientific, or previously published material is also flagged for rejection.[31][13] Escalated cases involve consultation with a content team, founders, or advisors, but no embargoes or scheduled posting times are permitted.[13]Once posted, preprints receive a DOI and cannot be removed, preserving the archival record; authors may withdraw manuscripts if they retract findings, marking them as withdrawn while retaining public access to versions.[25] Revised versions can be submitted at any time before journal acceptance, superseding prior iterations without altering the original posting.[1] These policies aim to balance rapid dissemination with safeguards against misinformation or ethical lapses, though screening does not certify accuracy or novelty.[25][13]
Technical Infrastructure
medRxiv operates as a web-based platform managed by Cold Spring Harbor Laboratory, with submissions processed through an online portal that accepts PDF manuscripts along with associated metadata such as titles, authors, abstracts, and keywords.[31] Manuscripts are required to be in English and formatted according to specific guidelines, including a maximum file size and exclusion of proprietary formats to ensure accessibility.[31]Following submission, preprints undergo a non-peer-reviewed screening process by an editorial team to check for compliance with content policies, after which approved manuscripts are posted in the order of approval, assigned a unique DOI prefix of 10.1101, and made freely accessible via the web interface without subscription fees.[25][31] The platform supports versioning, allowing authors to upload revised manuscripts that supersede prior versions while preserving access to historical iterations, and facilitates withdrawals or corrections as needed.[25]Storage and dissemination rely on a centralized server infrastructure that hosts preprints primarily as PDF files, with full-text HTML versions automatically generated and added 48-72 hours post-initial PDF posting to enhance readability and machine readability since October 19, 2020.[32] Search functionality is provided through integrated indexing of metadata and content, enabling retrieval by keywords, authors, or subjects, while programmatic access is supported via RSS/Atom feeds in XML format for recent posts and third-party APIs for querying archives, as evidenced by open-source tools like the medrxivr R package.[33][34]The shared operational framework with bioRxiv, both under Cold Spring Harbor Laboratory's oversight until their transition to the independent nonprofit OpenRxiv in March 2025, implies a unified backend for handling high-volume traffic, with annual operational costs around $3 million supporting scalability for over 100,000 combined preprints across the servers.[18] No public details disclose the specific software stack or cloud versus on-premise hosting, prioritizing reliability for global access over open-source transparency.[8]
Usage Statistics and Growth
Preprint Volumes and Trends
medRxiv posted 913 preprints in 2019 following its launch in June of that year.[19] Volumes surged in 2020 to 14,070 preprints, driven primarily by the COVID-19 pandemic, which accounted for a substantial portion of submissions during this period.[19] By the end of 2021, cumulative posted preprints totaled 27,674, with approximately 12,691 added that year.[19]Post-2021, annual volumes stabilized at elevated levels compared to pre-pandemic baselines but below the 2020 peak. medRxiv's official reporting data indicate roughly 10,575 new preprints in 2022 (bringing the cumulative total to about 38,249 by year-end), followed by 11,046 in 2023 (cumulative 49,295).[35] In 2024, postings approached 12,863, reflecting sustained growth in non-COVID topics amid a decline in pandemic-related research.[18][35]The following table summarizes annual posted preprints:
Year
Posted Preprints
2019
913
2020
14,070
2021
12,691
2022
~10,575
2023
~11,046
2024
12,863
Early in the pandemic, COVID-19 preprints comprised up to 80% of medRxiv's output, but this proportion fell to 7% by 2024 as focus shifted to broader health sciences research.[18] Monthly postings fluctuated between 700 and 1,400 during peak periods, with higher volumes in late 2020 (e.g., 1,399 in September).[35] Overall, the platform's growth underscores preprints' role in rapid dissemination, though volumes reflect screening rates of about 83% of submissions.[19]
Author and Download Metrics
As of early 2025, medRxiv had posted nearly 64,000 preprints from more than 380,000 authors.[20] This reflects substantial growth since its launch in June 2019, with early contributions by June 2020 already encompassing 57,096 unique authors across 124 countries.[4] Author participation surged during the COVID-19 pandemic, driven by urgent dissemination needs in health sciences, before stabilizing; in 2024, the server posted 12,863 preprints, approaching its 2020 peak volume.[18]Download and view metrics underscore medRxiv's reach, with cumulative PDF downloads totaling 51,943,342 and abstract views reaching 132,900,392 as of December 2021.[19] These figures capture exponential early uptake: in 2019, downloads numbered 107,772 alongside 241,528 abstract views; 2020 saw 27,963,915 downloads and 61,613,928 views, reflecting pandemic-related acceleration.[19] Per-preprint metrics, including article views, PDF downloads, and attention scores (aggregating altmetric data), are provided to authors via the platform, though these are noted to underestimate full usage due to limited tracking of article-to-article shares.[25] Post-2021 aggregates remain less publicly detailed, but sustained posting rates indicate ongoing engagement in clinical and public health research dissemination.[18]
Impact on Scientific Communication
Acceleration of Research Dissemination
MedRxiv facilitates the acceleration of research dissemination in the health sciences by allowing authors to post preprints shortly after submission, following a screening process that typically takes 2-4 days.[13] This contrasts sharply with traditional journal peer review, where the interval from submission to publication often exceeds six months, as evidenced by a median time of 6.0 months (interquartile range, 3.0-8.0 months) observed between medRxiv preprint posting and subsequent peer-reviewed publication for clinical studies.[36] By bypassing these delays, medRxiv enables immediate access to unpublished findings, promoting rapid feedback, collaboration, and iteration among researchers.[1]The platform's emphasis on speed proved critical during public health emergencies, such as the COVID-19 pandemic starting in early 2020, when medRxiv became a leading source for sharing breaking research on epidemiology, treatments, and vaccines.[10] For example, randomized clinical trials addressing COVID-19 interventions were posted expeditiously, allowing global scientific communities to engage with results in real time and refine ongoing studies ahead of formal validation.[37] This timely availability supported faster knowledge integration into clinical decision-making and policy formulation, with preprints enabling clinicians to access a broader range of trial outcomes promptly.[38]Overall, medRxiv's model prioritizes the swift establishment of scientific priority and openness, documenting idea provenance while enhancing accessibility for diverse stakeholders in health research.[26] Launched on June 5, 2019, by Cold Spring Harbor Laboratory in partnership with Yale University and BMJ, it addresses the unique needs of clinical and public health fields where delays in dissemination can hinder timely responses to evolving challenges.[2] Such acceleration has been recognized for driving quicker breakthroughs by allowing critical engagement with preliminary data before peer-reviewed certification.[39]
Influence on Policy and Practice
MedRxiv preprints gained significant traction in shaping public health responses during the COVID-19 pandemic, where rapid evidence needs outpaced traditional peer-review timelines. The RECOVERY collaborative group's preprint, posted on June 22, 2020, reported that dexamethasone reduced 28-day mortality by one-third among hospitalized patients receiving invasive mechanical ventilation (from 41.4% to 29.3%) and by one-fifth among those receiving oxygen without ventilation.[40] This finding prompted immediate policy shifts, including endorsements by the UK's National Health Service and the World Health Organization's updated guidelines in July 2020 recommending corticosteroids for severe cases, contributing to dexamethasone's global adoption and an estimated prevention of over 3 million deaths by mid-2021 according to modeling analyses.[41][42]Vaccine development similarly benefited from medRxiv's speed. Pfizer and BioNTech's phase 3 efficacy preprint, released November 9, 2020, detailed 95% efficacy against symptomatic COVID-19 after two doses in over 43,000 participants, with adverse events comparable to prior trials. This data informed emergency use authorizations by the FDA on December 11, 2020, and equivalent bodies worldwide, accelerating rollout amid surging cases and enabling policies for prioritized vaccination of high-risk groups. Such instances illustrate how preprints facilitated causal evidence for interventions, bypassing delays that could exacerbate outbreaks, though final peer-reviewed confirmations largely aligned with preliminary results in these high-profile cases.[43]Despite these successes, medRxiv's influence on practice has been tempered by inherent limitations, as the platform explicitly advises against relying on unverified reports for clinical decisions.[3] In non-emergency contexts, adoption remains cautious; for instance, while 29.3% of COVID-19 articles in major journals like The Lancet and NEJM in 2020 cited medRxiv preprints, subsequent analyses revealed discrepancies in 13.6% of matched publications, including changes in results or interpretations that could mislead if acted upon prematurely.[44] Policy-makers and practitioners have thus increasingly integrated preprints with triangulation from multiple sources, mitigating risks of flawed data propagation observed in retracted works.[17] Overall, medRxiv's role underscores a shift toward provisional evidence in crises, enhancing causal responsiveness while demanding rigorous post hoc validation.
Reception and Criticisms
Positive Assessments
MedRxiv has been praised by proponents in the scientific community for facilitating the rapid dissemination of health sciences research, particularly during the COVID-19 pandemic, where it emerged as a key platform for sharing preliminary findings ahead of formal peer review.[18][10] The Chan Zuckerberg Initiative recognized medRxiv's role by awarding it $2 million in June 2020, citing its status as a top source for breaking COVID-19 research that improved openness, accessibility, and collaboration among researchers.[10] This funding underscored its value in documenting the provenance of ideas and enabling early feedback, which supporters argue enhances scientific progress without supplanting peer review.[1]Empirical analyses have bolstered positive views by demonstrating substantial concordance between medRxiv preprints and their subsequently peer-reviewed publications. A December 2022 cross-sectional study in JAMA Network Open examined 1,006 medRxiv clinical studies from 2019–2021 that later appeared in journals, finding that 82.5% had identical or similar study characteristics, 85.9% matching results, and 87.4% aligned interpretations, suggesting preprints often provide reliable previews of validated work.[43] Such findings counter concerns over prematurity, with researchers like those involved in medRxiv's development, including Harlan Krumholz, advocating for its embrace as a tool to accelerate knowledge sharing and teach discernment in evaluating emerging science.[45]Institutional endorsements highlight medRxiv's growing importance in biomedical communication. A March 2025 Nature article noted that the platform has "become so important" for preprint distribution that it warranted independent nonprofit infrastructure to support expansion, reflecting researcher demand for swift, open access to findings.[20] Similarly, reviews from library evaluators, such as Doody's in July 2021, described it as a "valuable resource" for open science and rapid research avenues, emphasizing its clarity in distinguishing preprints from peer-reviewed content.[46] These assessments align with broader arguments that medRxiv promotes collaboration, establishes priority, and fills gaps in traditional publishing timelines, particularly in urgent fields like public health.[47][48]
Concerns Over Quality and Reliability
MedRxiv preprints undergo no formal peer review prior to posting, which can result in the dissemination of manuscripts containing errors, incomplete analyses, or unsubstantiated claims.[25] This absence of pre-publication scrutiny has prompted concerns that such content may mislead researchers, policymakers, and the public, particularly in high-stakes fields like clinical medicine where premature conclusions could influence decisions.[49]Empirical analyses have identified discrepancies between medRxiv preprints and their eventual peer-reviewed publications. For instance, a review of COVID-19-related studies found differences in titles, data, or conclusions in nearly half of the preprint-published version pairs that were cited in leading medical journals.[44] Similarly, among 67 COVID-19 studies from medRxiv that reached journal publication, only 34% showed no discrepancies in results reporting, while 22% exhibited at least one major discrepancy, such as changes in statistical significance or effect sizes.[50] These alterations often stem from methodological refinements or error corrections during peer review, underscoring the provisional nature of preprint findings.Retractions of medRxiv preprints highlight additional reliability issues, with faster retraction timelines compared to peer-reviewed articles—median of 29 days versus 139 days—but frequently tied to ethical or procedural misconduct in 70% of downstream cases.[51][52] Moreover, retracted preprints are commonly cited without critical acknowledgment; in one analysis of COVID-19 papers, 86% of citations to retracted works treated them as valid, potentially perpetuating flawed evidence in subsequent research.[53]Critics argue that medRxiv's rapid posting model, while accelerating dissemination, amplifies risks in areas prone to hype or bias, such as pandemic response research, where unvetted preprints garnered outsized media and policy attention before validation.[54] Although many preprints align with final publications—e.g., concordant results in most clinical study pairs—the potential for influential errors persists, as not all submissions proceed to peer review, leaving low-quality work in circulation.[43] MedRxiv mitigates some risks through author attestations and post-publication withdrawal options, but these measures do not substitute for rigorous vetting.[25]
Specific Controversies in Citation and Interpretation
A 2022 analysis of 205 COVID-19 research articles published in 2020 by The Lancet, NEJM, JAMA, and BMJ found that 60 (29.3%) cited at least one medRxiv preprint, but 25.6% of these citations were deemed inappropriate due to mismatches between the cited preprint and its eventual peer-reviewed version, such as alterations in titles, dataanalyses, or conclusions.[17] Nearly half of the cited preprints exhibited substantive differences upon publication, underscoring the hazards of relying on unverified preliminary findings.[55] Moreover, medRxiv's server did not indicate links to published versions for 42.7% of these preprints at the time of analysis, complicating verification efforts by citing authors.[44]Critics have highlighted systemic flaws in preprint citation practices, including the frequent omission of disclaimers noting the absence of peer review, which can foster undue authority for provisional results.[56] In COVID-19 discourse, this contributed to interpretive disputes, as media and policymakers sometimes amplified preprint claims without caveats, leading to public confusion when revisions emerged.[57] For instance, abstracts of COVID-19 randomized trials on medRxiv often displayed "spin"—exaggerated interpretations of results—more frequently than their published counterparts, potentially biasing early citations toward overstated efficacy or safety signals.[58]A prominent case involved hydroxychloroquine (HCQ) for COVID-19 treatment. An April 2020 medRxiv preprint analyzing U.S. Department of Veterans Affairs data (n=1,868 patients) reported no reduction in mortality or ventilation risk with HCQ, alone or combined with azithromycin, prompting the World Health Organization to suspend related trials on May 6, 2020.[59] This observational study, cited over 1,000 times by mid-2020, faced scrutiny for unadjusted confounders like disease severity and comorbidities, with subsequent analyses questioning its causal inferences due to reliance on electronic health records without randomization.[60] Proponents of HCQ argued the preprint's rapid uptake exemplified premature dismissal of a potentially repurposable drug, while opponents viewed it as evidence against off-label promotion amid evolving evidence; the debate persisted despite peer-reviewed confirmations of null effects in randomized trials.[61]Ivermectin preprints on medRxiv similarly sparked interpretive conflicts. Multiple early submissions, including systematic reviews aggregating small trials, suggested prophylactic or therapeutic benefits against COVID-19, fueling advocacy for widespread use despite methodological flaws like high risk of bias and inconsistent dosing.[62] A 2021 medRxiv meta-analysis identified bias as a primary driver of positive ivermectin results across trials, with low-quality studies overrepresented in favorable citations; later large randomized trials, such as one posted on medRxiv in June 2022 (n=1,432), found no meaningful symptom reduction or hospitalization prevention at standard doses.[63][64] These discrepancies led to polarized interpretations, with some jurisdictions authorizing ivermectin based on preprint aggregates while regulatory bodies like the FDA cautioned against extralabel use, citing insufficient causal evidence from robust designs.[65] Such cases illustrate how preprint citations, absent rigorous scrutiny, can amplify selective interpretations aligned with preconceived views rather than empirical consensus.