Fact-checked by Grok 2 weeks ago

Digital curation

Digital curation is the active management and preservation of digital data over its entire lifecycle, encompassing activities from planning its creation and initial selection to ongoing maintenance, appraisal, and dissemination to ensure long-term accessibility, authenticity, and usability for current and future needs. This process addresses the challenges posed by rapidly evolving technologies, formats, and storage media that threaten the obsolescence or loss of digital assets, applying principles of sustainability, interoperability through common standards, and protection against degradation. In essence, it transforms static digital information into dynamic, reusable resources that support research, cultural heritage, and decision-making across sectors. The concept of digital curation emerged in the early 2000s, primarily from the scientific and e-science communities grappling with the explosion of and the need for its reliable long-term . A pivotal development was the establishment of the Digital Curation Centre (DCC) in the in 2002, funded by the Joint Information Systems Committee () to promote best practices in and preservation. This was followed by the launch of the peer-reviewed of Digital Curation in 2006, which solidified digital curation as a distinct and professional discipline focused on holistic lifecycle approaches rather than mere technical archiving. Early influences included international standards like the Open Archival Information System (OAIS) , adapted to emphasize proactive curation over reactive preservation. Central to digital curation is the DCC's Curation Lifecycle Model, a graphical framework that outlines sequential and iterative actions for effective , divided into key stages such as ingest (acquisition and validation), preservation planning, and access/dissemination. This model supports roles across functional units, including appraisal for selection, creation for discoverability, migration to prevent format obsolescence, and to maintain . It underscores curation's iterative nature, where activities like re-appraisal and transformation occur cyclically to adapt to changing user needs and technological landscapes. Digital curation's importance spans , cultural institutions, , and , where it enables for , ensures with legal and ethical standards, and safeguards investments in amid growing volumes of materials. For instance, in environments, it facilitates and by enhancing from creation onward, while in sectors, it preserves irreplaceable records like photographs, manuscripts, and for . Challenges include resource constraints and skill gaps, but ongoing advancements in tools and policies continue to evolve the field toward more robust, automated solutions.

Overview and History

Definition and Scope

Digital curation is defined as the active management and preservation of over its entire lifecycle to ensure long-term , , and value. This process encompasses maintaining and enhancing digital materials, mitigating risks such as technological obsolescence, and facilitating their reuse for research and other purposes. Unlike simple , digital curation requires ongoing human decision-making and intervention to adapt to evolving technological and contextual needs. Key objectives of digital curation include selection and appraisal to determine which materials warrant preservation, ingestion into secure systems, preservation planning to address potential threats, to ensure integrity, and to enable and reuse. These activities are guided by models such as the DCC Curation Lifecycle Model, which outlines sequential and cyclical steps from creation through disposal. The scope of digital curation extends to both born-digital content—originally created in digital form, such as emails, , and files—and digitized materials converted from analog sources, like scanned documents or photographs. It applies across diverse sectors, including libraries, archives, and museums for institutional collections; research environments for scholarly data; and even personal collections for individual digital memories. The term digital curation first appeared in amid growing initiatives in digital libraries and preservation efforts to address the challenges of managing rapidly proliferating digital information. This development reflected broader shifts toward proactive strategies for handling digital assets beyond mere archival storage.

Etymology and Evolution

The term "curation" originates from the Latin verb , meaning "to take care of," a historically applied to the and care of physical collections in museums and archives. In the digital realm, this notion adapted during the late to address the challenges of managing ephemeral electronic data, spurred by the rapid proliferation of after the Internet's mainstream adoption in the . The specific phrase "digital curation" first appeared in 2001 at a titled "Digital Curation: Digital Archives, Libraries and e-Science," organized by the UK's Digital Preservation Coalition, marking the formal recognition of the discipline as an extension of traditional curation to born-digital materials. Key milestones shaped the field's emergence, beginning with the 1994 Digital Libraries Initiative, a joint program by the U.S. (NSF), (DARPA), and (NASA), which allocated $24 million to six projects exploring digital collection building, metadata standards, and user access—laying groundwork for curation practices. In 2004, the (DCC) was founded in the UK through funding from the (JISC) and the e-Science Core Programme, aiming to foster research and tools for long-term digital stewardship across disciplines. The DCC further solidified the terminology in 2005 by defining digital curation as "maintaining and adding value to a trusted body of digital information for current and future use; specifically, the active management and appraisal of data over its entire ," emphasizing proactive intervention beyond mere storage. Digital curation evolved from foundational principles in and , which traditionally focused on physical records, but adapted to the volatile, volume-intensive nature of digital assets post-Internet, incorporating lifecycle approaches to ensure usability and integrity. Early adopters included the , which launched its National Digital Information Infrastructure and Preservation Program in 2000 to curate web archives and multimedia, and JISC, which sponsored pilot projects in the early 2000s to integrate curation into UK research workflows. By the 2010s, the discipline shifted emphasis from static preservation to enabling active reuse and , driven by mandates for in scientific funding and the rise of initiatives that prioritized discoverability and . The field continues to evolve, as evidenced by the International Digital Curation Conference reaching its 19th edition in 2025.

Core Principles

Fundamental Principles

Digital curation is grounded in a set of fundamental principles that ensure the long-term integrity, usability, and value of digital assets throughout their lifecycle. These principles guide the active management of data from creation to reuse, emphasizing technical and operational strategies to mitigate risks associated with technological change and data degradation. The principle of provenance requires maintaining comprehensive metadata that documents the origin, custody, and any transformations applied to digital objects. This includes recording the sources of data creation, subsequent handling by custodians, and all preservation actions performed, such as migrations or format conversions. In the OAIS reference model (as updated in the 2024 version, ISO 14721:2024), provenance information forms a core component of Preservation Description Information (PDI), enabling users to verify the historical context and trustworthiness of archived materials. By systematically tracking these details, curators can reconstruct the chain of custody, which is essential for scholarly validation and legal accountability in research data management. Authenticity in digital curation focuses on preserving the and reliability of digital objects to demonstrate that they remain true to their original form and intent. This is achieved through mechanisms such as checksums for detecting alterations, digital signatures for verifying creator identity, and trails that log all access and modification events. The PREMIS supports this by defining elements for fixity information, which confirms that content has not been corrupted or intentionally changed without authorization. Similarly, the OAIS model incorporates fixity as part of PDI to ensure that the semantic meaning and factual accuracy of assets are maintained over time. These tools collectively build user confidence in the unaltered state of preserved data, particularly in fields like scientific research where depends on verifiable sources. The principle of addresses the need for strategies that guarantee the long-term viability of objects despite evolving , software, and formats. Curators employ techniques like , to new formats, and redundant storage to counteract and . In frameworks, persistence is realized through ongoing monitoring and intervention to reproduce objects accurately in future environments, as outlined in foundational theories that quantify communication across technological shifts. This principle underscores the shift from mere storage to , ensuring that assets remain interpretable and functional for designated communities indefinitely. Accessibility balances the preservation of digital objects with their discoverability and usability for intended users, requiring metadata schemas that facilitate search, retrieval, and interpretation. This involves embedding descriptive, technical, and structural to support user interfaces and tools, while adhering to standards that prevent lock-in. The OAIS model (as updated in the 2024 version, ISO 14721:2024) defines access rights within PDI to manage permissions and enable controlled , ensuring equitable without compromising . In practice, this promotes the of systems where users can interact with preserved in contextually appropriate ways, such as through web-based viewers or . Interoperability serves as a guiding by mandating adherence to open standards and protocols that allow seamless exchange and integration of digital objects across diverse systems. The OAIS (as updated in the 2024 version, ISO 14721:2024) exemplifies this through its functional entities for ingestion, storage, and dissemination, which are designed to be implementable in varied archival environments using non-proprietary formats. By prioritizing standards like those in the PREMIS dictionary, curators ensure that and content can migrate between repositories without loss of functionality, fostering in global data ecosystems. The sustainability principle emphasizes cost-effective planning to support ongoing curation activities, including resource allocation for storage, staffing, and technological updates. The Digital Curation Sustainability Model (DCSM) provides a for assessing and balancing economic, organizational, and factors to maintain viable preservation operations over decades. This involves strategic decisions on scalable and models to avoid silos, ensuring that curation efforts remain feasible amid growing data volumes. These principles collectively inform data lifecycle management by providing a foundation for proactive . Digital curation is guided by ethical imperatives that prioritize the protection of rights, ensuring creators retain control over their digital works while allowing for preservation and access under defined limits. These rights extend to digital artifacts, where curators must obtain permissions or rely on exceptions to avoid infringement, balancing innovation with fair compensation for original contributions. Ethical concerns also encompass , particularly in curating or sensitive materials, where digitization risks perpetuating colonial narratives or violating community protocols without consultation. For instance, tools like Mukurtu software enable Indigenous-led curation to safeguard sacred knowledge from unauthorized dissemination. in access further underscores these ethics, demanding that digital collections mitigate barriers for underrepresented groups through inclusive and open platforms, preventing the reinforcement of historical exclusions. Privacy considerations form a of ethical digital curation, especially when handling embedded in archives or datasets. Under the European Union's (GDPR) of 2018, curators must secure explicit consent for processing identifiable information and implement safeguards like anonymization to prevent re-identification risks. Similarly, the (CCPA) of 2018 grants residents rights to access, delete, and opt out of data sales, compelling institutions to audit collections for compliance and restrict access to sensitive elements. These regulations highlight the need for curators to embed privacy-by-design principles, ensuring long-term stewardship does not compromise individual rights. Legal foundations underpin these practices through frameworks like copyright laws, which govern reproduction and distribution in digital environments. In the United States, the of 1998 provides safe harbors for curators acting as intermediaries while permitting for non-commercial preservation, such as copying for archival stability without market harm. evaluations consider factors like purpose, nature, amount, and effect on the original work's value, enabling limited access for research or education. Internationally, the of 1996 establishes protections for digital works, requiring adequate safeguards against circumvention while promoting exceptions for libraries and archives. Data protection acts, including GDPR and CCPA, complement these by enforcing accountability in handling personal information across borders. Open access ethics in digital curation emphasize promoting the FAIR principles—Findable, Accessible, Interoperable, and —to enhance without compromising . These principles advocate for persistent identifiers, rich , standardized formats, and clear licensing to facilitate ethical , as articulated in the 2016 foundational guidelines. By aligning curation with , institutions foster and , though requires vigilance against restrictions. Curators face dilemmas in deaccessioning or selective curation, where decisions to remove or prioritize materials risk introducing or eroding collection . Ethical deaccessioning demands transparent criteria to avoid favoritism toward dominant narratives, ensuring removals serve public trust rather than resource constraints. Selective practices must counteract inherent biases in source materials, such as underrepresentation of marginalized voices, through diverse appraisal methods. These challenges relate briefly to broader principles of , requiring curators to verify amid ethical trade-offs.

Key Activities and Processes

Data Lifecycle Management

Data lifecycle management in curation encompasses the systematic processes to ensure digital assets remain accessible, authentic, and usable over time, from initial through long-term preservation. This involves sequential stages that address the evolving needs of data throughout its existence, guided by established models to mitigate risks of loss or degradation. The pre-curation stage focuses on appraisal and selection, where curators evaluate digital materials based on their informational , uniqueness, and to organizational goals to determine suitability for long-term retention. This appraisal considers factors such as legal requirements, potential, and resource implications to prioritize assets for . Following selection, the stage involves transferring approved data into a curation system, including the assignment of descriptive, technical, and administrative to facilitate , , and future use. Metadata standards like or PREMIS are commonly applied during this process to embed essential context about the data's origin, structure, and . Repositories such as and support these workflows by providing open-source platforms for automated ingestion, validation, and initial storage, ensuring compliance with archival standards. Active preservation occurs post-ingestion, emphasizing ongoing to counteract threats to , such as format obsolescence, through actions like format migration and . This stage ensures remains interpretable by converting files to sustainable formats while preserving their intellectual and fixity. and represent the final active phase, where curated is made available to users via interfaces that support search, retrieval, and reuse, often with access controls to protect sensitive . This includes generating dissemination information packages tailored to user needs, balancing with . Throughout the lifecycle, monitoring for degradation is essential, involving regular integrity checks—such as checksum verification—and risk assessments to detect , failures, or environmental threats. These proactive measures, often automated within systems, help maintain and inform preservation actions. Disposition concludes the lifecycle by applying criteria for retention or deletion, weighing the data's enduring value against storage costs, legal obligations, and institutional policies. Assets deemed no longer relevant may be securely deleted, while others are retained indefinitely. Digital curation lifecycle management integrates closely with the Open Archival Information System (OAIS) , an ISO standard that outlines functional entities including submission (for ingest), archival storage (for preservation), and (for monitoring and disposition). This alignment ensures robust, interoperable workflows across institutions.

Curation Methodologies

Digital curation methodologies provide structured frameworks and standards to guide the systematic management, preservation, and of digital assets throughout their lifecycle. These approaches emphasize reliability, , and adaptability, drawing from established models to address the complexities of long-term digital . Key methodologies include lifecycle models, audit criteria, metadata standards, cost estimation tools, and iterative processes adapted from other disciplines. The Curation Lifecycle Model, developed in 2008 by the Digital Curation Centre, offers a cyclical for planning and implementing curation activities. This model organizes curation into sequential yet iterative stages, encompassing actions such as ingest, appraisal, representation (including creation and description), preservation planning, preservation actions (like and validation), , and use, , and . Its cyclical nature ensures ongoing management, allowing organizations to identify risks, assign roles, and optimize workflows for data creators, curators, and users, thereby supporting holistic research data management infrastructure. The model is adaptable to various granularities and has been widely used for training and resource allocation in efforts. Complementing lifecycle approaches, the Trustworthy Repositories Audit & Certification (TRAC) framework, published in 2007 by the Center for Research Libraries (CRL) in collaboration with the National Archives and Records Administration (NARA), establishes criteria for evaluating the reliability of digital repositories. TRAC focuses on three core areas: organizational infrastructure (governance, financial sustainability, and policies), digital object management (ingest, metadata handling, and preservation strategies), and infrastructure and security risk management (technical systems, disaster recovery, and security measures). It provides a checklist of 84 criteria, including qualitative and quantitative metrics, to assess a repository's capacity for long-term preservation, enabling audits that build trust among stakeholders such as funders and users. This methodology has influenced subsequent standards like ISO 16363 and is applied in repository self-assessments and third-party certifications. PREMIS (Preservation Metadata: Implementation Strategies), maintained by the since 2008 following its initial development by and RLG in 2005, serves as a standard for capturing and recording essential to preservation actions in digital curation. It defines a core set of implementable elements across four entities—intellectual entities, objects, agents, and —covering descriptive, technical, , and to document actions like format migrations, fixity checks, and access restrictions. The PREMIS Data Dictionary provides detailed semantics and implementation guidelines, supported by an for interoperability, ensuring that preservation decisions and processes are traceable and verifiable over time. Widely adopted in repository systems such as and , PREMIS facilitates automated preservation workflows and compliance with audit requirements like . Cost estimation models, such as (TCO), are critical for planning and budgeting digital curation efforts, accounting for the full spectrum of expenses beyond initial acquisition. In digital curation contexts, TCO encompasses costs for data capture, (hardware and media), maintenance (software updates and staff), and access/dissemination, often calculated over multi-year periods to reflect lifecycle realities. For instance, the Digital Curation Centre's analyses highlight that staff costs typically dominate (around 66% in early examples), while costs have declined dramatically due to technological advances, enabling projections like £1,000 per terabyte per year for sustainable operations. These models support decision-making on , shared , and , helping institutions mitigate financial risks and ensure long-term viability. Agile curation methods, adapted from principles, promote iterative and flexible approaches to digital curation tasks, emphasizing and responsiveness to evolving needs. Drawing from the Agile Manifesto, these methods prioritize individuals and interactions, usable over exhaustive documentation, user , and to change, applied through sprints for tasks like enhancement or preservation planning. Initial agile curation principles, proposed in community discussions around 2017, include maximizing impact via accelerated access, fostering automated interactions through well-documented services, ensuring close creator-curator , and pursuing incremental improvements with metrics focused on usage and . This enhances efficiency in dynamic environments, such as research repositories, by enabling self-organizing teams to deliver sustainable enhancements without rigid upfront planning. Digital Preservation refers to the series of managed activities necessary to ensure the long-term and of materials for as long as needed, regardless of changes in or environments. This process focuses primarily on technical strategies such as format migration, , and integrity checks to prevent and , distinguishing it from digital curation's broader emphasis on active management, selection, and value addition throughout the data lifecycle. Key activities include creating preservation and establishing policies to maintain authenticity and reliability over time. Digital Stewardship encompasses the comprehensive oversight of digital resources, integrating preservation, curation, and to ensure their and across their entire lifecycle. It involves not only technical maintenance but also policy development, funding allocation, and among stakeholders to address organizational and societal needs for digital assets. Unlike narrower preservation efforts, stewardship adopts a holistic approach that includes ethical considerations and long-term to support ongoing use and reuse. Data Archiving involves the long-term storage and preservation of in a to ensure its retention and future , often including activities such as , format migration, and management to combat technological . This process emphasizes secure retention and basic retrieval mechanisms, but typically focuses on preservation rather than the dynamic appraisal, selection, or contextual enrichment emphasized in digital curation. While it requires proactive interventions to maintain integrity, it lacks the full lifecycle management and value-adding aspects of curation. Digital Asset Management (DAM) is the systematic process of ingesting, storing, organizing, retrieving, and distributing digital files—particularly media such as images, videos, and documents—within a centralized system to support business operations and content reuse. DAM systems typically incorporate tagging, , and tools tailored for commercial environments, enabling efficient asset sharing across teams while maintaining and rights management. In contrast to curation's research-oriented focus, DAM prioritizes practical, enterprise-level handling of assets for immediate productivity and purposes. Born-Digital Materials are digital objects created natively in electronic form without an original analog counterpart, such as emails, web pages, or generated directly by software. These materials often require curation strategies that address volatile formats, embedded dependencies on proprietary systems, and rapid obsolescence risks inherent to their digital-native origins. Digitized Materials, conversely, are analog items— like photographs or manuscripts—converted into digital formats through scanning or other reformatting processes, resulting in that aim to replicate physical characteristics. Curation of digitized content involves ensuring to the source while managing conversion artifacts and hybrid workflows, differing from born-digital needs by emphasizing migration from physical to digital states.

Distinctions from Adjacent Fields

Digital curation distinguishes itself from traditional archiving primarily through its proactive, ongoing management of digital materials throughout their lifecycle, in contrast to the more static preservation of physical records in conventional archival practices. Traditional archiving typically involves the appraisal, arrangement, and description of inert paper-based or analog documents once they are transferred to an archival repository, with limited intervention after initial processing. In digital curation, however, curators actively intervene across the entire records continuum—from creation and selection to long-term maintenance—to address the dynamic nature of digital formats, which can degrade, become obsolete, or require format migration to ensure accessibility. This approach is necessitated by the inherent instability of digital objects, unlike the relative durability of static physical records. Compared to digital libraries, digital curation prioritizes the long-term integrity and sustainability of digital assets over immediate user access and retrieval functionalities. Digital libraries focus on organizing and providing current discoverability through search interfaces, standards, and user-oriented platforms to support present-day and needs. In contrast, curation extends beyond access to encompass strategies for mitigating technological risks, such as software obsolescence and , ensuring that materials remain authentic and usable for , potentially spanning decades or centuries. This preservation-centric orientation in curation often involves technical interventions like integrity checks and , which are secondary in digital library operations. Digital curation also diverges from by emphasizing the cultural, historical, and research value of digital materials rather than and operational efficiency. deals with the creation, control, and of records during their active use phase, guided by retention schedules and legal requirements to meet organizational or governmental mandates. Curation, however, appraises and sustains materials for their enduring scholarly significance, incorporating appraisal decisions that extend beyond compliance to foster and in broader contexts. For instance, while might dispose of documents post-retention, curation seeks to add value through enhancement and contextual for perpetual access. In relation to data science, digital curation underscores the long-term stewardship and integrity of datasets, whereas centers on analytical processing and insight generation from data. Data science practices often involve cleaning, transforming, and mining data for immediate applications like modeling or , with a focus on short-term utility and algorithmic outputs. Digital curation, by comparison, maintains data authenticity, , and quality over extended periods to support future reuse, including archiving and measures that prevent loss of during analysis-heavy workflows. This distinction highlights curation's role in bridging data creation to archival sustainability, rather than prioritizing interpretive or . Digital curation exhibits interdisciplinary overlaps with , drawing on shared principles of management, , and , yet it maintains a unique emphasis on the of digital resources amid evolving technological landscapes. contributes frameworks for data and user-centered systems, which curation adapts to ensure long-term viability through practices like persistent identifiers and format normalization. However, curation's distinct focus on environmental and institutional —addressing challenges such as for perpetual access—sets it apart, integrating archival and computational expertise to preserve in digital forms.

Major Challenges

Data Growth and Volume

The exponential growth of digital data presents one of the most pressing challenges in digital curation, as the sheer volume overwhelms traditional management practices and infrastructure. According to recent forecasts as of May , the global datasphere—the total amount of data created, captured, or replicated—reached 149 zettabytes in 2024 and is projected to expand to 181 zettabytes by the end of , driven primarily by increases in video content, devices, and cloud-based services. This surge implies escalating demands, with curators needing to allocate resources for petabyte-scale repositories that require not only physical but also robust capacity and computational power to handle and . For instance, institutions like research libraries and must plan for data that doubles roughly every two years, straining budgets and operational workflows. A key issue arising from this data proliferation is selection overload, where curators face difficulties in appraising and prioritizing vast sets for long-term preservation amid limited resources. In environments like archives, the volume, variety, and velocity of content—such as billions of posts, images, and videos generated daily—complicate traditional appraisal methods that rely on assessing evidential or informational value. datasets often lack clear boundaries, unlike physical , making it challenging to determine what merits curation without risking the loss of potentially significant material or incurring unnecessary storage costs. This overload exacerbates appraisal difficulties, as curators must navigate ephemeral, that may hold cultural or historical importance but is buried in terabytes of redundant or low-value . The impacts of such growth are profound, including rising operational costs and an urgent need for automated triage tools to filter and prioritize content efficiently. As data volumes increase exponentially, storage expenses—despite declining unit costs—escalate due to the need for redundant, secure systems to mitigate risks like hardware failure, with one analysis noting that storage can represent the largest risk factor in repository operations even as prices drop. This has led to calls for automated selection techniques to manage , as manual processes become untenable for datasets exceeding petabytes. A prominent example is the curation of web archives, where the , holding over 99 petabytes of content as of October 2025 with total exceeding 200 petabytes, grapples with scaling issues such as slow retrieval times and resource bottlenecks from high traffic and continuous crawling of the expanding . These challenges highlight how unchecked data growth can hinder accessibility and preservation efforts without strategic interventions, though efforts may help in coordinating multi-institutional responses.

Technological Obsolescence

Technological obsolescence poses a significant threat to digital curation by rendering stored inaccessible due to the rapid evolution of , software, and file formats, potentially leading to permanent loss without proactive intervention. In , this risk arises as technologies that were once standard become unsupported, making it impossible to read or interpret data without or strategies. For instance, the lifecycle of often spans decades, during which shifts in technology can outpace preservation efforts, emphasizing the need for ongoing and to ensure long-term accessibility. Format evolution exemplifies this challenge, as storage media and file structures have transitioned from physical formats like floppy disks to modern cloud-based systems, necessitating repeated migrations to maintain access. Early floppy disks, such as 8-inch, 5.25-inch, and 3.5-inch variants, became obsolete within years of their introduction due to discontinuation by manufacturers, forcing curators to transfer data to newer media to avoid degradation and incompatibility. Similarly, analog video formats like tapes, popular in the late , are now at risk of as playback equipment becomes scarce and tapes degrade through magnetic decay, prompting migrations to formats such as MP4 to preserve content integrity. These shifts highlight how format changes can compound risks, particularly when combined with increasing data volumes that amplify the scale of required interventions. Software dependencies further complicate curation, as proprietary applications essential for viewing or interacting with digital objects may cease support, leading to widespread inaccessibility. A prominent example is , whose end-of-life on December 31, 2020, affected numerous archived web-based media, including interactive animations and videos, as browsers and operating systems discontinued compatibility, requiring curators to convert Flash content (SWF files) to open standards like or . The Archives, for instance, faced challenges in preserving Flash-dependent websites and games, underscoring the vulnerability of software-reliant artifacts to vendor-driven obsolescence. Hardware risks manifest in the need for specialized equipment to access legacy systems, often addressed through to simulate obsolete environments on contemporary hardware. Early personal computers, such as those from the like the IBM PC, relied on specific processors and peripherals that are no longer manufactured, making direct access to their data or software impractical without emulators that replicate the original system's behavior. strategies, as outlined by the on Library and Information Resources, enable curators to run vintage applications on modern machines by mimicking hardware dependencies, thereby preserving the authentic without physical restoration of rare components. To mitigate these risks, tools like format registries provide essential identification and profiling capabilities for proactive planning. The PRONOM registry, maintained by The National Archives of the United Kingdom, catalogs over 1,300 file formats with details on their technical specifications, risks of , and migration pathways, allowing curators to assess and prioritize actions for at-risk holdings. By integrating PRONOM data into workflows, institutions can automate format identification and monitor emerging threats, facilitating timely interventions to sustain access. A notable case illustrating these issues is NASA's curation of Apollo mission data from the 1970s, where original tapes stored on 1-inch magnetic reels faced severe decay and obsolescence due to deteriorating media and incompatible playback hardware. The tapes, intended as backups for mission recordings, were largely erased and reused in the early 1980s amid tape shortages, while surviving reels suffered from binder —a chemical degradation process—rendering much of the raw lunar surface data irrecoverable without extensive restoration efforts. This incident, detailed in NASA's official report, demonstrates the perils of unaddressed technological obsolescence in high-stakes scientific archives, prompting ongoing initiatives to safeguard remaining mission artifacts.

Resource and Labor Constraints

Digital curation efforts are frequently hampered by resource and labor constraints, as the human-intensive nature of tasks such as creation and often exceeds initial projections. Labor represents the largest component of overall curation costs, far surpassing expenditures on , software, or . For instance, staff costs in data centers typically account for 69% to 82% of total budgets, highlighting the hidden efforts involved in maintaining over time. These activities, including manual generation and validation, demand substantial time and expertise, with still limited in addressing the full scope of needs. Skill gaps further exacerbate these challenges, requiring interdisciplinary expertise that combines , archival practices, and domain-specific knowledge. Digital curation professionals must navigate technical skills like systems alongside curatorial judgment in areas such as scientific or cultural domains, yet the field suffers from a of trained individuals. According to the U.S. , employment of librarians is projected to grow 2% from to 2034, slower than the average for all occupations, while archivists, curators, and workers are projected to grow 6%, faster than average; however, inadequate training programs and non-standardized job titles hinder workforce development, leading to reliance on generalists ill-equipped for complex curation demands. The field continues to face challenges in attracting library and information science graduates to digital curation roles, underscoring the need for targeted to bridge these gaps. Funding structures pose additional barriers, with short-term grants ill-suited to the perpetual requirements of long-term preservation. Many projects, including EU initiatives like , operate on biennial tenders from EU funds and national ministries, providing methodological support for but no direct financial aid for ongoing or . This mismatch—where funding cycles last 3-5 years while curation demands indefinite —results in under-resourced operations, as only 21% of researchers receive financial support for archiving despite widespread obligations. 's model, while promoting through partnerships and cost reductions in enhancement, leaves cultural institutions bearing the brunt of high expenses, particularly for smaller entities. In understaffed institutions, the reliance on manual processes amplifies risks of strain and high turnover, as labor shortages compound the demands of increasing volumes. Economic models, such as lifecycle costing frameworks developed by the LIFE project, offer tools to predict these expenses by modeling costs across ingestion, storage, access, and preservation phases over 5-10 year horizons. These frameworks enable institutions to forecast total costs, emphasizing the need for proactive budgeting to mitigate perpetual shortfalls.

Standardization and Interoperability

Digital curation faces significant challenges due to the absence of universally adopted global standards, which leads to inconsistencies in how digital objects are described and managed across institutions. Variations in metadata schemas, such as () and Metadata Object Description Schema (MODS), exemplify this issue; provides a simple set of 15 basic elements suitable for broad resource discovery, while MODS offers a more structured and richer framework with 20 elements and subelements that inherit semantics for detailed bibliographic control, making direct difficult without mapping or transformation. These differences arise because lacks substructure and precise instructions for elements like "" or "," resulting in inconsistent local implementations, whereas MODS addresses complex digital objects but requires greater expertise, hindering seamless adoption in diverse curation environments. Interoperability barriers further complicate digital curation, as siloed systems—often using incompatible formats, protocols, or —prevent effective data sharing and reuse across institutional and national boundaries. In , these silos manifest in fragmented workflows where repository software like or fails to integrate smoothly with institutional systems or distributed services, leading to incomplete preservation chains and reliance on manual interventions. Such barriers are exacerbated by heterogeneous data structures, including unstructured formats like spreadsheets, which demand advanced semantic mapping to enable cross-system access. The fluid nature of digital information across temporal and disciplinary lines amplifies these issues, as evolving hardware and software dependencies create ongoing risks that vary by context. Coordination challenges among libraries, governments, and private sectors add to the complexity, with fragmented digital strategies resulting in disjointed efforts and misalignments. For instance, libraries often develop isolated digital curation services due to varying institutional priorities, while initiatives may overlook private-sector innovations, leading to duplicated investments without shared protocols. Human-centered factors, such as resistance to data sharing stemming from concerns over control or insufficient training, compound these coordination hurdles, necessitating negotiated agreements across distributed networks. Progress toward standardization has been advanced through key initiatives like the ISO 14721 Open Archival Information System (OAIS) reference model, which provides a for digital archiving that standardizes terminology, processes, and data models to facilitate comparison and integration among systems. Similarly, the International Internet Preservation Consortium (IIPC) promotes in web archiving by coordinating global member institutions—spanning over 35 countries—to develop shared tools, training, and collaborative projects that ensure consistent capture and access of born-digital web content. These interoperability issues contribute to substantial risks, including data silos that foster duplicated curation efforts and the potential loss of collective knowledge through inaccessible or redundant archives. Siloed environments lead to inefficient , as organizations replicate similar preservation activities without leveraging shared repositories, ultimately fragmenting the global digital heritage.

Digitization of Analog Content

Digitization of analog content involves converting physical materials, such as books, manuscripts, photographs, audio tapes, and three-dimensional artifacts, into digital formats to enable long-term curation and access. This process is essential for preserving deteriorating analog items but presents significant technical hurdles, particularly in achieving high-resolution captures without causing physical damage. For instance, scanning rare books and manuscripts requires specialized equipment like overhead or planetary scanners to capture images at resolutions of 400-600 pixels per inch (ppi) or higher, ensuring fine details are preserved while minimizing handling that could exacerbate fragility. Fragile items, such as brittle paper or tightly bound volumes, demand non-destructive techniques, including the use of cradles or v-support systems to avoid stress on bindings and spines during imaging. Conservators often assess materials beforehand to identify risks, such as restricted openings in early printed books that could lead to tearing if forced. Quality control remains a core challenge, encompassing issues like color fidelity and the accuracy of (OCR) for textual content. Maintaining accurate color reproduction requires calibrated scanners with consistent lighting and color targets to match the original hues, as variations in spectral response can distort tonal ranges in digitized images. For textual materials, OCR performs reliably on printed documents but struggles with handwritten ones, where error rates can exceed 10-15% due to variations in script, degradation, and paper texture, necessitating manual verification or advanced tools to achieve usable accuracy levels above 85%. These quality concerns are amplified in mass digitization efforts, where inconsistencies can compromise the integrity of the digital surrogate for curation purposes. Scaling to large collections introduces legal and logistical obstacles, exemplified by the project launched in 2004, which aimed to scan millions of volumes but encountered substantial hurdles from authors and publishers. The initiative faced lawsuits alleging infringement through unauthorized scanning of in-copyright works, leading to a decade-long legal battle resolved in 2015 when a U.S. court ruled the project constituted for search and snippet views, though full access remained restricted. Beyond legal issues, the sheer volume demands efficient workflows, yet clearances slow progress and limit public availability. Additionally, the process is labor-intensive and costly for complex formats; digitizing audio tapes involves cleaning, winding, and real-time playback to capture analog signals, often taking hours per reel due to degradation risks like . Similarly, objects require multiple scans from various angles using structured light or laser technologies, with costs ranging from hundreds to thousands of dollars per item depending on complexity, plus extensive post-processing to align and refine models. To address these challenges, established best practices emphasize structured workflows, as outlined in the International Federation of Library Associations and Institutions (IFLA) Guidelines for Projects. These recommend phased planning, including selection criteria, standards, and protocols to ensure reproducibility and minimal intervention. Such guidelines promote the use of open standards for file formats like for images and for audio, facilitating while prioritizing preservation over access in initial captures. Post-digitization, the resulting files must be monitored for format obsolescence to sustain curatorial value.

Material and Format-Specific Issues

Digital curation encounters distinct challenges when addressing the preservation of diverse content types, where the inherent properties of materials and formats demand tailored strategies, handling protocols, and technological interventions to mitigate loss of and over time. For instance, artifacts often embed proprietary encodings or interactive elements that complicate long-term viability, requiring curators to balance fidelity to original intent with adaptive preservation techniques. These issues are amplified in emerging formats, where rapid innovation outpaces standardization efforts. In curating and digitized texts, a primary concern involves developing robust schemas to capture textual , such as editorial revisions or manuscript differences, ensuring scholarly access to historical evolutions without conflating versions. Handling redactions poses additional hurdles, as curators must systematically identify and mask sensitive information in digital surrogates while preserving the document's structural integrity and contextual annotations for future analysis. standards like those from the Getty emphasize descriptive and administrative elements to index such , facilitating discoverability amid evolving textual scholarship. For materials, preserving 3D models of artifacts derived from techniques introduces risks of data obsolescence, as for rendering point clouds and meshes becomes unsupported, potentially rendering models inaccessible without or . Curators must address format-specific , such as loss of texture fidelity in high-resolution scans, by adopting open standards like to enhance across platforms. These challenges underscore the need for proactive validation workflows to verify model accuracy against physical artifacts over decades-long preservation horizons. Curating , such as performances and oral traditions, relies heavily on video and audio recordings to capture ephemeral expressions, yet preserving the surrounding context—like performer intent, cultural rituals, or linguistic nuances—remains elusive without enriched that documents performative variables. Institutions employ approaches, integrating transcripts and ethnographic notes with files, to mitigate the flattening of dynamic elements inherent in live traditions. guidelines highlight the importance of community involvement in metadata creation to ensure recordings reflect authentic intangible values rather than isolated artifacts. Emerging formats exacerbate curation difficulties, particularly for VR/AR content, where immersive environments depend on device-specific runtimes that risk rapid obsolescence, necessitating strategies like scene graph exports to sustain spatial interactions. Social media ephemera, such as transient posts or stories, challenges preservation due to platform dependencies and privacy restrictions, often requiring API-based harvesting before content vanishes, though legal barriers limit comprehensive capture. AI-generated media introduces authenticity issues, as algorithmic outputs lack clear provenance, demanding metadata for tracing generative models and input parameters to verify integrity against future deepfake proliferation. Specific examples illustrate these tensions: preserving interactive web art requires emulating environments to maintain user-driven behaviors, as original scripts may fail on modern , with curators employing tools like Webrecorder to archive dynamic states. Similarly, blockchain-based NFTs face curation obstacles in linking on-chain metadata to off-chain media files, where server downtime or can sever artwork from ownership proofs, prompting hybrid strategies that embed assets directly on distributed ledgers. These cases highlight the imperative for format-agnostic preservation frameworks to accommodate evolving expressions.

Strategies and Approaches

Technological Solutions

Technological solutions in digital curation encompass a range of software tools, platforms, and systems designed to mitigate challenges such as technological and data volume by enabling long-term preservation, automated processing, and secure storage of digital objects. These solutions often align with established standards like the Open Archival Information System (OAIS) model, facilitating the ingest, management, and access of digital content while ensuring its authenticity and usability over time. Key approaches include and for rendering obsolete formats, AI-driven automation for enhancement, robust storage infrastructures, monitoring systems for compliance, and innovative tracking mechanisms like . Emulation recreates the original hardware and software environment to run legacy digital objects, avoiding the need to alter the content itself, while converts files to contemporary formats to maintain accessibility. , an open-source , supports by simulating historical operating systems and hardware, allowing curators to execute and view preserved software artifacts without proprietary dependencies. For instance, has been integrated into preservation workflows to emulate environments for early and documents, demonstrating its reliability in maintaining functional integrity. Complementing , JHOVE (JSTOR/Harvard Object Validation Environment) serves as a Java-based tool for identifying, validating, and characterizing file formats during processes, ensuring that converted objects conform to expected standards before archival storage. These tools collectively address by providing verifiable pathways to sustain access, with JHOVE's extensibility allowing integration into broader curation pipelines. Automated workflows leverage and to streamline generation, reducing manual labor and enhancing discoverability in large-scale digital collections. In image curation, models apply techniques to automatically tag visual content with descriptive keywords, such as objects, scenes, or emotions, thereby enriching schemas like without exhaustive human annotation. Scholarly analyses highlight how these AI methods improve curation efficiency in cultural repositories, where automated tagging achieves up to 90% accuracy in controlled datasets, though human oversight remains essential for contextual nuances. Such workflows integrate with existing systems to process heterogeneous data types, supporting scalable curation by generating records and semantic links that facilitate long-term retrieval. Storage solutions prioritize durability, scalability, and cost-effectiveness for archival needs, with cloud-based options like Glacier offering tiered retrieval classes optimized for infrequently accessed data. S3 Glacier Deep Archive, for example, provides 99.999999999% (11 nines) durability and retrieval times of within 12 hours for standard or up to 48 hours for bulk, making it suitable for preserving petabyte-scale cultural and scientific datasets. Storage costs $0.00099 per GB-month, with retrieval costs as low as $0.0025 per GB for bulk operations (as of November 2025). Distributed systems like the (IPFS) enable decentralized, content-addressed storage, where files are pinned across peer networks to prevent single points of failure and ensure persistent availability for preservation purposes. IPFS has been adopted in archival projects to maintain verifiable, tamper-resistant copies of digital heritage materials, complementing traditional storage by distributing redundancy globally. Monitoring tools such as Archivematica automate OAIS-compliant processing from ingest to dissemination, incorporating validation, normalization, and packaging into microservices-based workflows. Developed as an open-source system, Archivematica generates preservation (e.g., PREMIS events) and supports format migration, ensuring with standards like ISO 14721 while tracking the lifecycle of digital packages. Institutions use it to monitor integrity through checksum verification and automated reporting, with its modular design allowing customization for specific curation needs. Recent advances in technology have introduced pilots for tracking, creating immutable ledgers to record the custody and modifications of digital objects post-2020. For instance, projects integrating with IPFS have demonstrated enhanced traceability in library archives, where hashed records on distributed ledgers verify and without centralized vulnerabilities. A 2023 IEEE study on long-term preservation services piloted for digital signatures, achieving robust audit trails for archival files by combining it with decentralized storage, thus bolstering trust in curated collections. These initiatives, often tested in contexts, underscore 's potential to automate while addressing concerns through hybrid models.

Institutional and Collaborative Strategies

Institutional and collaborative strategies in digital curation emphasize organizational frameworks, partnerships, and capacity-building to ensure long-term sustainability of digital assets. These approaches address resource constraints by fostering coordinated efforts among institutions, governments, and stakeholders to develop policies, share responsibilities, and build expertise. Policy development forms a cornerstone of these strategies, with national initiatives providing guidelines for systematic curation practices. In the UK, the Joint Information Systems Committee (JISC) funded studies in 2008 to outline models for institutional digital preservation policies, aiming to integrate curation into broader research data management frameworks. These efforts, including the Digital Preservation Policies Study, recommended structured approaches to governance, risk assessment, and compliance to support ongoing digital asset viability across higher education institutions. Collaborative consortia exemplify shared infrastructure to distribute curation burdens and enhance scalability. The Data Conservancy, initiated by in partnership with entities like the National Snow and Ice Data Center, offers a discipline-agnostic platform for data collection, preservation, and access, utilizing open-source tools like Fedora Commons to enable interoperable services across communities. Similarly, , a of over 60 libraries, maintains a shared preservation with more than 19 million digitized volumes (as of 2025), allowing members to contribute content without individual hosting costs while ensuring redundant storage and access for print-disabled users. These models promote collective development of standards and workflows, reducing and costs through pooled resources. Training programs build essential skills for effective curation, often delivered through specialized modules and workshops. The provides targeted training on research data management principles, including data management plans via tools like DMPonline, preservation strategies, and principles, offered in formats such as one- to two-day workshops for librarians, researchers, and IT professionals. These sessions, conducted globally since the DCC's inception, emphasize practical application without formal certifications but equip participants to implement institutional policies. Funding models sustain these initiatives through diversified sources that align with long-term goals. Endowments offer stable support, as seen in philanthropic commitments from foundations like the Bill & Melinda Gates Foundation, which fund preservation aligned with societal benefits. Public-private partnerships facilitate innovation, such as industry collaborations providing cloud credits for , exemplified by the NIH's BD2K program that supported reuse platforms before transitioning to member-based models like , where national institutes contribute to maintenance for free global access. These hybrid approaches mitigate financial risks by blending government grants with private investments. Audit frameworks enable institutional to verify trustworthiness and compliance. The Trustworthy Repositories Audit & Certification () criteria, developed in 2007 by the Center for Research Libraries and the , provide a of over 100 metrics across organizational , object , and risk mitigation, allowing repositories to evaluate , preservation , and technical safeguards internally. This self-audit tool, aligned with the Open Archival Information System (OAIS) reference model, helps institutions identify gaps and prepare for external certifications like ISO 16363, promoting transparency and accountability in curation practices.

Emerging Innovations

Artificial intelligence and machine learning are transforming digital curation by enabling predictive analytics for risk assessment and automated appraisal processes. Predictive models analyze historical data on file degradation, format obsolescence, and storage failures to forecast potential threats to digital assets, allowing curators to prioritize interventions proactively. For instance, AI-driven tools can evaluate the long-term viability of datasets by simulating environmental and technological risks, reducing the likelihood of data loss in large-scale repositories. Automated appraisal leverages machine learning algorithms to classify and select content for preservation, such as the Automated Video Appraisal (AVA) system, which generates preliminary decisions on video recordings that archivists can refine, streamlining workflows in resource-constrained environments. These innovations enhance efficiency while maintaining curatorial oversight, particularly for born-digital materials where manual review is impractical. Blockchain and distributed ledger technologies provide immutable audit trails that bolster the authenticity and provenance of curated digital objects. By recording and access events on a decentralized ledger, ensures tamper-proof documentation of an asset's lifecycle, from creation to reuse, countering risks of alteration or falsification in distributed archives. In settings, these systems verify the of digital records through cryptographic hashing, enabling verifiable chains of custody without relying on central authorities. This approach is particularly valuable for preserving , where authenticity underpins scholarly trust and legal validity. Integrations with open science frameworks, especially the FAIR (Findable, Accessible, Interoperable, Reusable) data principles introduced in 2016, have reshaped curation practices by embedding reusability into ecosystems. Post-2016 implementations emphasize machine-actionable and persistent identifiers to facilitate data discovery and integration across platforms, aligning curation with collaborative research needs. Collaborative networks like the Data Curation Network apply FAIR guidelines to standardize processing, ensuring datasets from diverse sources remain interoperable for long-term analysis. This shift supports by promoting ethical sharing while addressing challenges like metadata inconsistency in heterogeneous collections. Sustainability innovations in digital curation focus on to minimize the of storage and processing. Practices such as optimizing infrastructures for and employing low-carbon data centers reduce the environmental impact of long-term preservation, where petabyte-scale archives contribute significantly to global emissions. Scalable, storage models allow curators to deactivate idle resources, cutting energy use by up to 30% in some deployments without compromising accessibility. These strategies integrate with broader green IT principles, prioritizing renewable energy sources and hardware longevity to align curation with climate goals. Post-2020 trends highlight curation challenges and solutions for pandemic-related data and emerging content. During the crisis, digital curators facilitated open repositories like the COVID-19 Open Research Dataset (CORD-19), applying principles to aggregate and preserve heterogeneous health data for rapid scientific reuse while mitigating risks. This effort underscored the need for agile curation in crisis contexts, with practices evolving to include real-time metadata standards for global collaboration. For content, preservation involves capturing immersive assets, such as 3D cultural reconstructions, using for authenticity and VR-compatible formats for replayability. Platforms like metaverse heritage simulations enable the curation of intangible cultural elements, ensuring their endurance beyond platform lifecycles. These developments address the ephemerality of virtual environments, fostering sustainable access to interactive digital narratives.

Applications Across Domains

Cultural Heritage Preservation

Digital curation plays a pivotal role in preserving cultural heritage by ensuring the long-term accessibility and integrity of both tangible artifacts and intangible expressions, mitigating risks from physical decay, geopolitical conflicts, and cultural erasure. Strategies for tangible artifacts often involve advanced digitization techniques such as 3D scanning and virtual reconstructions, which create high-fidelity digital surrogates that can be stored, analyzed, and shared without compromising the originals. For instance, the British Museum employs 3D scanning to digitize its collections, enabling virtual exhibitions and scholarly access to items like ancient sculptures and manuscripts, thereby extending their lifespan beyond physical constraints. These methods not only safeguard against material degradation but also facilitate immersive experiences through technologies like augmented reality. Intangible cultural elements, such as oral histories, traditional performances, and rituals, are preserved through archiving that captures audio, video, and interactive formats. Digital curation here emphasizes standards and contextual documentation to maintain authenticity and cultural significance, allowing communities to revisit and transmit knowledge across generations. Projects like the Recordings archive oral traditions from groups using preservation, ensuring these ephemeral elements endure despite the absence of physical forms. This approach addresses the of living , integrating community involvement to avoid . A prominent is UNESCO's , launched in 1992, which focuses on and nominating documentary heritage of global importance to prevent loss and promote universal access. The program has supported the of diverse collections, including ancient manuscripts from the and African oral archives, resulting in 570 registered items as of 2025. Challenges in this domain include ethical considerations around , where digital surrogates raise questions of ownership and cultural ; for example, debates over virtual access to repatriated artifacts underscore the need for collaborative agreements between institutions and source communities to respect . Outcomes of these curation efforts have significantly enhanced global access to cultural heritage, democratizing knowledge that was once limited to physical sites. The Europeana portal, aggregating digitized content from European institutions, provides open access to over 58 million items as of 2025, including artworks, books, and audiovisual materials, fostering education and cross-cultural dialogue while adhering to preservation standards. By addressing repatriation ethics through transparent digital policies, such initiatives not only preserve heritage but also promote equity in its representation.

Scientific and Research Data Management

Digital curation plays a critical role in scientific and research data management by ensuring the long-term accessibility, integrity, and usability of data to support reproducibility, which is essential for validating scientific claims and advancing knowledge. Reproducibility requires curating digital artifacts such as raw data, processed datasets, code, and documentation to allow independent verification of results, addressing challenges like data loss or obsolescence over time. This process involves active management throughout the data lifecycle, including appraisal, metadata creation, and preservation strategies that maintain the context and provenance of research outputs. Metadata standards are fundamental to this curation effort, enabling discoverability and interoperability of datasets. For instance, the DataCite Metadata Schema provides a core set of properties, such as identifiers, titles, creators, and resource types, that facilitate accurate citation and persistent linking of datasets to publications. This schema supports minting for datasets, ensuring they can be reliably referenced and retrieved, which is vital for reproducible research across disciplines. In , the CERN Data Preservation in High Energy Physics (DPHEP) collaboration exemplifies these practices by archiving vast volumes of experimental data from accelerators like the , totaling over one exabyte (more than 1,000 petabytes) as of 2025, to enable future analyses and reinterpretations. Similarly, the (NIH) has implemented data sharing policies since the 2010s, including the 2014 Genomic Data Sharing Policy and the 2023 Data Management and Sharing Policy, which mandate plans for making scientific data findable, accessible, and reusable to foster collaboration and verification in biomedical research. Big data in fields like and modeling presents unique curation challenges due to , velocity, and , requiring scalable strategies to handle petabyte-scale datasets while preserving fidelity. In , curating sequences from next-generation sequencing involves managing heterogeneous formats, ensuring against errors, and addressing storage demands that can exceed exabytes, all while complying with regulations. For models, challenges include integrating observational with simulations, correcting biases in historical records, and maintaining for evolving model outputs to support accurate projections. Tools like Jupyter notebooks enhance curation by serving as artifacts that bundle , , visualizations, and narrative explanations into interactive documents, promoting and ease of . These notebooks allow researchers to document workflows reproducibly, with cells that can be executed in sequence to regenerate results, though curation must address dependencies on evolving software environments to prevent . The impacts of such curation extend to enabling meta-analyses and advancing , where shared, well-curated data amplify research efficiency and innovation. Repositories like facilitate this by providing a platform for depositing datasets linked to publications, supporting secondary analyses that pool evidence across studies to draw robust conclusions, as seen in ecological and biomedical meta-analyses that leverage for greater statistical power.

References

  1. [1]
    What is Digital Curation? | DCC
    Apr 2, 2008 · "What is Digital Curation?". DCC Briefing Papers: Introduction to Curation. Edinburgh: Digital Curation Centre. Handle: 1842/3362. Available ...Missing: definition | Show results with:definition
  2. [2]
    Digital Curation | Smithsonian Institution Archives
    Across the cultural heritage sector, digital curation encompasses the selection, acquisition, preservation, maintenance, and delivery of digital data.
  3. [3]
    Digital Preservation, Digital Curation, Digital Stewardship: What's in ...
    Aug 23, 2011 · “Curation” takes a “whole life” approach to digital materials to address the selection, maintenance, collection and archiving of digital assets ...Missing: scholarly | Show results with:scholarly
  4. [4]
    The Current State of Digital Curation - NCBI
    Corporations utilize digital information to devise business strategies, inform decision making, manage inventories and production, and extend marketing. In the ...
  5. [5]
    Curation Lifecycle Model | DCC
    ### Summary of Digital Curation Lifecycle Model
  6. [6]
    What is digital curation? | DCC
    Digital curation is the management and preservation of digital data/information over the long-term. Digital curation involves maintaining, preserving and ...
  7. [7]
    [PDF] Digital Curation - The National Academies Press
    WHAT IS DIGITAL CURATION? Digital curation is the active management and enhancement of digital information assets for current and future use. In some ways ...<|control11|><|separator|>
  8. [8]
    New Imperatives in Digital Curation and Its Workforce - NCBI - NIH
    Jun 19, 2025 · Libraries, archives, and museums are transitioning from physical to digital collections and from manual to automated processes for collections ...
  9. [9]
    Personal Digital Archiving | Digital Preservation - Library of Congress
    Personal digital archiving involves preserving digital memories, including photos, audio, video, email, records, and websites. Digital materials can be more ...
  10. [10]
    Digital curation - ResearchGate
    Aug 9, 2025 · The term "digital curation" can be traced back to the early 1990s and has caught the interest of the research community (Tibbo, 2012, p.187).
  11. [11]
    [PDF] Digital Curation: A Life-cycle Approach to Managing and Preserving ...
    ... and future use: in other words, it is the active management and appraisal of digital information over its entire life cycle. Digital Curation Centre, UKOLN.
  12. [12]
    Everything's curated now - Language Log
    Mar 6, 2020 · The word “curate” comes from the Latin “curatus,” the past participle of “curare,” which means “to take care of.”
  13. [13]
    View of Digital Curation for Science, Digital Libraries, and Individuals
    The term “digital curation” was first used at the "Digital Curation: digital archives, libraries and e-science seminar" sponsored by the Digital Preservation ...
  14. [14]
    (PDF) Digital Curation for Science, Digital Libraries, and Individuals
    The term "digital curation" was first used in a seminar on digital libraries, archives and science in 2001. Participants from a variety of professions gathered ...
  15. [15]
    [PDF] The 1990s: The Formative Years of Digital Libraries
    During the past thirty years digital libraries have gone from a curiosity to mainstream. The 1990s were a particularly formative decade.
  16. [16]
    Digital Libraries Initiative (DLI) Projects 1994‐1999 - Fox
    Jan 31, 2005 · DLI Funding​​ $24 million was awarded in 1994 by NSF, DARPA and NASA, split evenly among six "DLI‐1 teams." Three were in California: two went to ...Missing: history | Show results with:history
  17. [17]
    (PDF) The Digital Curation Centre: a vision for digital curation
    Oct 14, 2025 · We describe the aims and aspirations for the Digital Curation Centre (DCC), the UK response to the realisation that digital information is both essential and ...
  18. [18]
    Digital Curation - an overview | ScienceDirect Topics
    The Digital Curation Centre (DCC), a UK-based consortium that was launched in 2004, has contributed to promote the concept (Higgins, 2011). Digital curation, as ...
  19. [19]
    Digital Preservation Pioneers
    The Library's Digital Preservation Program has only existed since 2000. The relatively new field of digital-information management is thus reliant on ...
  20. [20]
    [PDF] UNDERSTANDING PREMIS - The Library of Congress
    Jul 1, 2013 · The PREMIS Data Dictionary defines preservation metadata as "the information a repository uses to support the digital preservation process." ...
  21. [21]
    PREMIS: Preservation Metadata Maintenance Activity (Library of Congress)
    ### Summary of PREMIS Support for Provenance, Authenticity, and Persistence in Digital Curation
  22. [22]
    [PDF] PREMIS Final Report - OCLC
    PREMIS defines “preservation metadata” as the information a repository uses to support the digital preservation process. Specifically, the group looked at ...
  23. [23]
    [PDF] Towards a Theory of Digital Preservation - iRODS
    Examples include the name spaces that are used by the preservation environment to track provenance (descriptive metadata), integrity (rules, preservation ...
  24. [24]
    Overview of Technological Approaches to Digital Preservation and ...
    So digital preservation is not a simple process of preserving physical objects but one of preserving the ability to reproduce the objects. The process of ...Missing: curation | Show results with:curation
  25. [25]
    The Digital Curation Sustainability Model - 4C Project
    The purpose of the Digital Curation Sustainability Model (DCSM) is to highlight the key concepts, relationships and decision points for planning how to sustain ...Missing: principle | Show results with:principle
  26. [26]
    Copyright - WIPO
    Copyright is a legal term describing creator's rights over literary and artistic works, including books, music, films, and computer programs.Intellectual Property and Music · Film · How to Make a Living from Music
  27. [27]
    [PDF] Digital Heritage and the Ethics of Sharing Indigenous Knowledge ...
    Indigenous cultural production in archiving and curation provides a counterbalance to these workflows and gives us different models for sharing, arranging, and ...
  28. [28]
    Designed with Care: Mukurtu Provides Ethical Tools for Archiving ...
    Dec 10, 2024 · Mukurtu is designed to support an ethically based approach to preservation with the specific needs of Indigenous and Native collections in mind.
  29. [29]
    [PDF] Exploring ethical considerations for providing access to digital ...
    Sep 27, 2021 · The recognition of ICIP is a key practice to support ethical access, and cultural safety. Indigenous Cultural and Intellectual Property refers ...Missing: curation | Show results with:curation
  30. [30]
    Data Protection | DCC - Digital Curation Centre
    Jul 30, 2006 · Edinburgh: Digital Curation Centre. Handle: 1842/3359. Available ... Considerations. Anonymised data doesn't come under the DPA ...
  31. [31]
    [PDF] The Digital Millennium Copyright Act of 1998
    The DMCA, signed in 1998, implements WIPO treaties, addresses online copyright issues, and has five titles, including one on online liability.
  32. [32]
    U.S. Copyright Office Fair Use Index
    The Fair Use Index is a searchable database of court opinions tracking judicial decisions to help understand fair use, with summaries of facts and court ...Missing: curation | Show results with:curation
  33. [33]
    The FAIR Guiding Principles for scientific data management ... - Nature
    Mar 15, 2016 · This article describes four foundational principles—Findability, Accessibility, Interoperability, and Reusability—that serve to guide data ...
  34. [34]
    FAIR Principles: Interpretations and Implementation Considerations
    Jan 1, 2020 · The FAIR principles are intended as a guide to enable digital resources to become more Findable, Accessible, Interoperable and Reusable for ...3.1 Principle F · 3.2 Principle A · 3.3 Principle I
  35. [35]
    Deaccesioning in Museums - Elizabeth Varner, 2013 - Sage Journals
    Jun 1, 2013 · One of the paradoxes of deaccessioning is the need to dispose of tangible cultural heritage without violating the donors' or public's trust.Missing: selective | Show results with:selective
  36. [36]
    Deaccesioning in Museums: Evaluating Legal, Ethical and Practical ...
    Feb 19, 2020 · This article highlights the legal, ethical and practical considerations of deaccessioning including a list of relevant regulations and best practices.Missing: curation | Show results with:curation
  37. [37]
    Data Is Never Raw: Ethics and biases in Digital Cultural Heritage ...
    Jun 20, 2023 · I will focus on the area of pre-existing bias in data found in cultural heritage institutions, in particular in museums.Missing: deaccessioning | Show results with:deaccessioning<|control11|><|separator|>
  38. [38]
    [PDF] Ethical Issues In Digitization Of Cultural Heritage - EliScholar
    Abundant case studies and research on particular ethical issues prove that digitization has had a significant impact on managing online engagement with heritage.
  39. [39]
    (PDF) The DCC curation lifecycle model - ResearchGate
    Jun 19, 2025 · The DCC Curation Lifecycle Model provides a graphical high-level overview of the ; stages required for successful curation and preservation of ...
  40. [40]
    Preservation issues - Digital Preservation Handbook
    The application of data integrity techniques and the maintenance of audit trails can provide confidence that a digital object has remained unchanged (except ...Missing: degradation | Show results with:degradation
  41. [41]
    The DCC Curation Lifecycle Model | International Journal of Digital ...
    Dec 2, 2008 · The DCC will use the model: as a training tool for data creators, data curators and data users; to organise and plan their resources; and to ...Missing: principles | Show results with:principles
  42. [42]
  43. [43]
    Audit and certification - Digital Preservation Handbook
    CRL, 2007. Trustworthy Repositories Audit & Certification: Criteria and Checklist. Available: http://www.crl.edu/sites/default/files/d6/attachments/pages ...
  44. [44]
    None
    ### Summary of Total Cost of Ownership (TCO) in Digital Curation from DCC Experience
  45. [45]
    [PDF] Community Engagement for Developing the Principles and ...
    Below are some initial agile data curation values and principles that the authors have developed as a point of departure for a community discussion. Mapping of ...
  46. [46]
    [PDF] DCC | Digital Curation Reference Manual
    Apr 15, 2011 · A standard definition of digital preservation is: 'The series of managed activities necessary to ensure continued access to digital materials ...
  47. [47]
    [PDF] DC 101 Preservation Planning - Digital Curation Centre
    Digital preservation refers to activities aimed at ensuring that we can access data (in the form of digital objects and databases) in the future – for longer ...
  48. [48]
    [PDF] Digital Preservation: Continued access to authentic digital assets
    It defines digital preservation and its relationship with digital curation, illustrates some of the advantages in preserving resources created or stored ...Missing: definition | Show results with:definition
  49. [49]
    Digital Stewardship: The one with all the definitions
    Apr 2, 2014 · Digital stewardship encompasses all activities related to the care and management of digital objects over time.
  50. [50]
    Preserve, Curate, or Steward? Changing Definitions in Digital ...
    May 20, 2019 · Digital stewardship marries preservation and curation, unifying the lifecycle approach of curation along with research in digital libraries and ...
  51. [51]
    What is digital archiving? - Archaeology Data Service
    Digital archiving seeks to preserve the information regardless of the media on which that information is stored. Computer disks and other magnetic and optical ...
  52. [52]
    Data Curation and Architecture | Roper Center for Public Opinion ...
    “Data archiving” describes a broad range of processes undertaken to ensure that data can be used and understood by researchers now and into the future. Data ...Missing: definition | Show results with:definition
  53. [53]
    Semantics: Digital Preservation vs. Digital Curation - Google Groups
    Digital curation is larger than digital preservation; it includes preservation but also creation or selection, appraisal, ongoing maintenance and enhancement.Missing: passive | Show results with:passive
  54. [54]
    What Is Digital Asset Management? - IBM
    Digital asset management (DAM) is a process for storing, organizing, managing, retrieving and distributing digital files.What is digital asset... · How DAM software works
  55. [55]
    Digital asset management (DAM) systems: Basics and benefits.
    Oct 17, 2025 · Digital asset management (DAM) is the process of organizing, storing, and distributing digital files using specialized software.
  56. [56]
    What is digital asset management?
    Digital asset management (DAM) is a software solution that's used to efficiently and securely store, organize, manage, and share digital files and assets.
  57. [57]
    Three Types of Digital Material: Digitized, Born-digital, and Reborn ...
    Digitized material is the output of digitizing non-digital, or analog, material. In other words, the digitized material was not originally digital.
  58. [58]
    All Digital Objects are Born Digital Objects | The Signal
    May 15, 2012 · A digitized object exists to record and present characteristics of some physical object. In contrast, born digital objects began their existence as digital.
  59. [59]
    Born-digital records and metadata - The National Archives
    Born-digital records are records that have been natively created in digital format (rather than digitised from paper records). Examples of born-digital ...Missing: digitized | Show results with:digitized
  60. [60]
    Creating digital materials - Digital Preservation Handbook
    A good portion of what is now being digitized began life as born digital content. It was converted into an analogue format such as print on paper before the ...
  61. [61]
    Digital Curation/Digital Archiving: A View from the National Archives ...
    Nov 12, 2008 · This paper considers similarities and differences among the concepts of digital curation, digital archives, and digital libraries.
  62. [62]
    Digital Collections and Digital Preservation: One and the Same, Right?
    Sep 25, 2023 · In digital collections (or digital libraries), “access” focuses only on the ability to make digital objects accessible to the current community ...
  63. [63]
    | The Signal
    No readable text found in the HTML.<|separator|>
  64. [64]
    Clearing up the confusion between Document Management…
    Sep 16, 2022 · Conversely, a digital preservation system maintains records after they are no longer actively used but instead, need to be retained long-term ...Missing: curation | Show results with:curation
  65. [65]
    [PDF] Intersection of Digital library and Data Science - CEUR-WS
    Feb 23, 2023 · In Data science, data curation may indicate the process of research data management and extraction of important information from scientific ...Missing: comparison | Show results with:comparison
  66. [66]
    Data Curation - an overview | ScienceDirect Topics
    Digital curation is an interdisciplinary field, embracing archival, information, library, and computer science . Digital/data curation process and issues ...
  67. [67]
    [PDF] The Digitization of the World from Edge to Core - Seagate Technology
    IDC predicts that the. Global Datasphere will grow from 33. Zettabytes (ZB) in 2018 to 175 ZB by 2025. To keep up with the storage demands stemming from all ...
  68. [68]
    [PDF] Digital Research Data Curation: Overview of Issues, Current ...
    The problem encompasses storage and network capacity, integration with cyberinfrastructure development, best practices related to data archival and metadata, ...
  69. [69]
    Web-archiving and social media: an exploratory analysis
    Jun 22, 2021 · Archiving and mastering the volume, variety and velocity of data on social media platforms demands high-quality metadata to, among others, allow ...
  70. [70]
    Data Curation Implications of Qualitative Data Reuse and Big Social ...
    Nov 10, 2021 · This paper explores six key challenges related to data use and reuse for qualitative data and big social research and discusses their implications for data ...
  71. [71]
    [PDF] The Significance of Storage in the “Cost of Risk” of Digital Preservation
    In estimating Total Cost of Ownership (TCO), the complexity of the models just mentioned is devoted to digital library processes, not storage devices (or.
  72. [72]
    How does Archive.org work in terms of storage? Inter... - Hacker News
    Sep 28, 2018 · The Archive currently has about 46 Petabytes of content ("bytes archived"), and over 120 PB of raw disk capacity.Missing: issues volume
  73. [73]
    Data Overload – AHA - American Historical Association
    May 7, 2019 · The project's goal is to lower barriers to working with large-scale web archives by creating accessible tools and a web-based interface with which to use them.
  74. [74]
    [PDF] e-Science Curation Report - Digital Preservation
    This task force should work closely with and inform the work of the new UK Digital Curation Centre. A3. The mismatch of short-term funding against the long-term ...
  75. [75]
    Current and Future Demand for a Digital Curation Workforce - NCBI
    Apr 22, 2015 · Because of the volume of metadata needed, the costs of its manual creation, and both the feasibility and appeal of automated metadata ...
  76. [76]
    Financial Sustainability of Digitizing Cultural Heritage - MDPI
    The aim of our research was to investigate the Europeana project to understand how it eases the financial costs of digitization for cultural organizations.
  77. [77]
    [PDF] MODS: The Metadata Object Description Schema
    Her article, while giving an overview of MODS, also details the reasons why the Dublin Core is inadequate for describing complex digital objects, at least from ...Missing: curation variations challenges
  78. [78]
    Using Metadata Standards | DCC - Digital Curation Centre
    Metadata standards formalize element structure, ensuring consistent metadata for long-term discovery, use, and integrity of digital resources.2. Use Of Metadata Standards · Implementation... · Creating Profiles
  79. [79]
    The Effectiveness and Durability of Digital Preservation ... - Ithaka S+R
    Jul 19, 2022 · The main goal of the project is to examine and assess how digital preservation and curation systems are developed, deployed, and sustained.Missing: history | Show results with:history
  80. [80]
    Interoperability | DCC - Digital Curation Centre
    Feb 4, 2009 · Data interoperability is only one aspect of the overall interoperability problem. · Design issues need to address both human-centred aspects ( ...Missing: challenges | Show results with:challenges
  81. [81]
    Chapter 3 Current and Future Demand for a Digital Curation Workforce
    Private-sector sources can supplement incomplete or inadequate government data. One very useful source of information is data on job openings. The data used ...
  82. [82]
    Open archival information system (OAIS) - ISO 14721:2012
    ISO 14721:2012 defines the reference model for an open archival information system (OAIS). An OAIS is an archive, consisting of an organization, which may be ...Missing: interoperability | Show results with:interoperability
  83. [83]
    INTERNATIONAL INTERNET PRESERVATION CONSORTIUM - IIPC
    IIPC has members from over 35 countries, including libraries and archives, who join working groups to advance web archiving.About iipc · Playback · IIPC members · JOIN THE IIPC
  84. [84]
    [PDF] Guidelines for Planning the Digitization of Rare Book and ... - IFLA
    These guidelines are intended for anyone who is involved in planning a digitization project for rare and unique materials including library leaders who are ...
  85. [85]
    Preservation Guidelines for Digitizing Library Materials - Collections ...
    Scanning equipment must be properly sized to accommodate collection materials without causing damage. Oversized materials, including books that have foldouts, ...<|control11|><|separator|>
  86. [86]
    [PDF] Technical Guidelines for Digitizing Archival Materials for Electronic ...
    These guidelines define approaches for creating digital surrogates for access, covering image capture, metadata, file formats, and quality control.
  87. [87]
    Handwriting Recognition Benchmark: LLMs vs OCRs
    Oct 14, 2025 · Today, OCR technology provides higher than 99% accuracy with typed characters in high-quality images. However, the diversity in human ...Methodology · What are the challenges of... · Preparation of converting...
  88. [88]
    Fair Use Week 2023: Looking Back at Google Books Eight Years Later
    Feb 24, 2023 · Eight years after it was decided, the legacy of Google Books endures in policy debates and copyright lawsuits that capture the public's attention.Missing: hurdles | Show results with:hurdles
  89. [89]
    [PDF] Capturing Analog Sound for Digital Preservation:
    Mar 10, 2006 · Analog discs and tapes continue to require attention and pose particu- lar challenges. For historical audio recordings to be accessible to ...
  90. [90]
    [PDF] 3d scanning and replication for museum and cultural heritage ...
    The cost and complexity of 3D imaging technologies have made. 3D scanning impractical for many heritage institu- tions in the past, but this is changing, as an ...
  91. [91]
    [PDF] Guidelines for Digitization Projects For Collections and Holdings in ...
    Mar 1, 2002 · These Guidelines have been produced by a working group representing IFLA and the ICA that was commissioned by UNESCO to establish guidelines ...
  92. [92]
    [PDF] Appraisal and Selection for Digital Curation
    The quality of metadata and documentation affects the values of resources, and the costs and feasibility of preservation. Thus many appraisal policies include ...Missing: challenges variants
  93. [93]
    Center for Digital Antiquity - Cloned
    Nov 27, 2015 · Curators also reviewed the texts and illustrations to identify confidential information that might need to be redacted or designated as “ ...Missing: variants | Show results with:variants
  94. [94]
    Introduction to Metadata: Setting the Stage - Getty Museum
    Metadata provides a means of indexing, accessing, preserving, and discovering digital resources. The volume of digital information available over electronic ...Missing: curation variations
  95. [95]
  96. [96]
    [PDF] Challenges and Directions in 3D and VR Data Curation: Findings ...
    Abstract. This study identifies challenges and promising directions in the curation of 3D data. 3D visualization shows great promise for a range of ...
  97. [97]
    Research Challenges for Digital Archives of 3D Cultural Heritage ...
    The increasing creation of 3D cultural heritage models has resulted in a need for the establishment of centralized digital archives.Missing: obsolescence | Show results with:obsolescence
  98. [98]
  99. [99]
    [PDF] the role of contemporary archives in safeguarding and preserving ...
    Jun 26, 2014 · The dissertation offers new ways of thinking about the points where archives and intangible cultural heritage intersect; as such, the project ...
  100. [100]
    Digital heritage in digital museums
    All tangible heritage embodies intangible components such as spiritual values, symbols, meanings, knowledge, or the know-how of craftsmanship and construction.<|separator|>
  101. [101]
    Surmounting the Barriers to Social Media Archiving - DigiPres.org
    Sep 19, 2024 · This "Birds of a Feather" session aims to bring together digital archivists, researchers, and allied professionals to discuss these barriers and explore ...
  102. [102]
    When AI Blurs Reality: The Rise of Hyperreal Digital Culture
    Aug 28, 2025 · From Bigfoot vlogs to algorithmically created personas, hyperrealistic AI content is redefining the boundaries of digital creators.
  103. [103]
    [PDF] Preserving Interactive Multimedia Art
    The challenge of preserving born-digital multimedia art, which is inherently interactive, virtual, and temporary, has been an actively discussed topic in the.Missing: curation | Show results with:curation
  104. [104]
    [PDF] Through the Looking Glass: Non-Fungible Tokens & Libraries
    Aug 11, 2021 · Non-Fungible Tokens (NFTs) are a rapidly emerging technology that sit at the intersection of cryptocurrency, digital ownership, and preservation ...Missing: curation | Show results with:curation
  105. [105]
    Teleconferences - UMaine Digital Curation
    00:00 Potential and challenges for blockchain as a preservation tool. What this panel is not about. Assumptions (right and wrong) about NFTs. 03:08 The three ...
  106. [106]
    Digital Preservation Policies Study Part 1 - Emerald Publishing
    Jun 12, 2009 · This report into digital preservation policies in third level universities and colleges in the UK was commissioned and funded by the JISC (Joint ...<|separator|>
  107. [107]
    [PDF] JISC Programme Synthesis Study: Supporting Digital Preservation
    The toolkit discusses some of the main issues to be considered in development of a digital asset management strategy and poses questions for consideration based.
  108. [108]
    Infrastructure and Organizational Services for Research Data Curation
    The Data Conservancy community is driven by a common theme: the need for institutional solutions to digital research data collection, curation and preservation ...
  109. [109]
    Services & Programs - HathiTrust Digital Library
    Through our shared infrastructure, your library can contribute digitized content and avoid the costs of hosting your own preservation environment. There is ...Missing: curation | Show results with:curation
  110. [110]
    Training | DCC - Digital Curation Centre
    We provide dedicated training through workshops. We have conducted trainings in locations across the world as well as in the UK.
  111. [111]
    History of the DCC - Digital Curation Centre
    The DCC was launched in March 2004 to solve digital curation challenges, initially funded by Jisc, and became increasingly global. It is no longer supported by ...
  112. [112]
    Financing Models for Sustainable Data Reuse Infrastructure
    We explore financing models that support data resources and associated services in a long-term data infrastructure. Data infrastructure components often ...
  113. [113]
    application of artificial intelligence in digital preservation: emerging ...
    Jul 4, 2025 · This paper explores how AI can help preserve digital preservation through automated metadata creation, risk management analysis, intelligent file format ...
  114. [114]
    AI for Digital Preservation – A DPC Webinar
    May 4, 2023 · AVA generates preliminary appraisal decisions which are then accepted or rejected by a (human) archivist, and the video recordings are ...
  115. [115]
    Archives and AI: An Overview of Current Debates and Future ...
    Looking at the use of AI as part of archival processes, we observe a rich and growing activity focused on automating them. AI is experimented with to automate, ...
  116. [116]
    [PDF] Blockchain Technology for Digital Preservation in Libraries
    Blockchain has the potential to transform digital preservation in libraries by ensuring the authenticity, integrity, and provenance of digital records. However ...Missing: curation | Show results with:curation
  117. [117]
    Underscoring archival authenticity with blockchain technology
    Jun 26, 2019 · This data is permanently preserved through peer-to-peer distribution and consensus checking without the need for a trusted third party, thereby ...
  118. [118]
    Collaborative Curation for FAIR data - University Digital Conservancy
    Through the Data Curation Network (DCN), members enable findable, accessible, interoperable, and reusable (FAIR) data through a shared curation model, ...
  119. [119]
    Towards Environmentally Sustainable Long-term Digital Preservation
    Oct 31, 2022 · Moving toward environmentally sustainable digital preservation requires critically examining the motivations and assumptions that shape current practice.Missing: principle | Show results with:principle
  120. [120]
    Environmental Impact and Digital Preservation
    We must work together as a community to evolve digital preservation good practice to minimise the environmental impact of our actions.
  121. [121]
    The role of digital curation in COVID‐19 Open Science
    Mar 14, 2025 · In this paper, we examine the role digital curation practices and practitioners played in facilitating open science (OS) initiatives amid the COVID‐19 pandemic.
  122. [122]
    Digitalizing cultural heritage through metaverse applications - Nature
    Aug 13, 2024 · The current study aims to explore the potential of metaverse applications in digitalizing cultural heritage, encompassing virtual reconstructions, educational ...Missing: curation | Show results with:curation
  123. [123]
    A Metaverse Platform for Preserving and Promoting Intangible ...
    Apr 18, 2024 · This article proposes a metaverse application, inspired by the events of the Italian Resistance, promoting interactions between multiple users in an immersive ...
  124. [124]
    [PDF] 10 Things for Curating Reproducible and FAIR Research
    The object of curation in the context of reproducibility is the scientific claim: We are curating the digital artifacts that underlie the claim. Curation for ...
  125. [125]
    [PDF] Computational Reproducibility: A Practical Framework for Data ...
    This article proposes an approach of incremental curation according to abilities of the curator, arguing that partial reproducibility is better than nothing at ...<|separator|>
  126. [126]
    DataCite Metadata Schema
    The DataCite Metadata Schema is a list of core metadata properties chosen for an accurate and consistent identification of a resource for citation and ...Schema 4.6 · Release History · Contribute
  127. [127]
    [PDF] DataCite Metadata Kernel
    Primarily these are: • recommending a standard citation format for datasets, based on a small number of properties required for identifier registration;. • ...
  128. [128]
    Welcome | Data Preservation in High Energy Physics - CERN
    DPHEP is "Data Preservation in HEP" and is an International Collaboration of Institutes, Experiments, Funding agencies and other interested parties.
  129. [129]
    Data preservation | CERN
    CERN has created large volumes of data of many different types. This involves not only scientific data – about 420 petabytes (420 million gigabytes) of data.
  130. [130]
    Data Management and Sharing Policy - NIH Grants & Funding
    Aug 5, 2025 · This page provides information about NIH's scientific data management and sharing policies and repositories, previously available on the NIH ...Data Sharing Approaches · Data · Writing a Data Management · Budgeting for DataMissing: 2010 | Show results with:2010
  131. [131]
    Big data challenges in genome informatics - PMC - NIH
    In this article, we have discussed several big data challenges in genome informatics. Especially, we envision that those challenges will become intense in the ...
  132. [132]
    [PDF] Data Curation in Climate and Weather
    Erroneous data can be detected and fixed, new sources of observed data are discovered and prepared, observing system biases are systematically corrected, and ...
  133. [133]
    Open Data Challenges in Climate Science
    Dec 16, 2020 · The purpose of this paper is to explore challenges in open climate data experienced by data scientists at the Centre for Environmental Data ...
  134. [134]
    Computational reproducibility of Jupyter notebooks from biomedical ...
    Jupyter notebooks facilitate the bundling of executable code with its documentation and output in one interactive environment, and they represent a popular ...
  135. [135]
    [PDF] Jupyter Notebooks: A Primer for Data Curators
    Feb 1, 2019 · Introduction. Jupyter Notebooks are composite digital objects used to develop, share, view, and execute interspersed, interlinked, and.<|separator|>
  136. [136]
    The impact of open data: Six ways data availability benefits research ...
    2025年7月24日 · How does open data benefit research, researchers, and stakeholders? Here we explore just six of the many benefits of open data.
  137. [137]
    Nine quick tips for open meta-analyses - PMC - PubMed Central
    2024年7月25日 · Here, we outline 9 quick tips for open meta-analyses, aimed at guiding researchers to maximize the reach and utility of their findings.