Fact-checked by Grok 2 weeks ago

Crystallographic Information File

The Crystallographic Information is a standard text-based designed for the archiving, exchange, and publication of and related structural science data, developed and promoted by the International Union of Crystallography (IUCr). It employs a self-defining, extensible structure based on the (Scientific and Technical Data) file format, allowing for the representation of atomic coordinates, parameters, information, and experimental details in a machine-readable and human-interpretable manner. CIF has become the de facto standard in crystallography, required for submissions to major journals and databases such as the Cambridge Structural Database and the Inorganic Crystal Structure Database. The format originated in the early as a response to the need for a flexible, error-resistant alternative to fixed-format punch cards and early digital exchanges in . The initial version, CIF 1.0, was specified in 1991 by Hall, Allen, and Brown, with amendments formalized by the IUCr's for the Maintenance of the CIF Standard (COMCIFS) in 1997. This version uses a restricted syntax of the format, featuring data blocks, loops for tabular data, and dictionaries that define standardized data names and relationships to ensure interoperability across software tools. CIF's extensibility is achieved through public dictionaries maintained by COMCIFS, which categorize data items into core, macromolecular, and domain-specific sets, enabling adaptation for fields like and . In 2016, CIF 2.0 was introduced as an enhanced alternative to the original syntax, incorporating support for international characters, complex data types such as matrices and arrays, and more flexible string delimiters like triple quotes for multiline text. Approved by COMCIFS in 2014 and detailed in the specification by Bernstein et al., this version maintains with CIF 1 while addressing limitations in handling diverse global datasets and advanced computational needs. Both versions are actively supported, with CIF files identified by headers like #\#CIF_1.1 or #\#CIF_2.0, and software parsers such as those from the IUCr's CIFtbx library facilitating validation and conversion. Today, CIF underpins global data sharing in , ensuring and integration with modern databases and applications in .

History and Development

Origins and Initial Publication

In the late 1980s, crystallographic data exchange faced significant challenges due to the prevalence of program-specific and proprietary formats, such as those used by , which were constrained by fixed-field lengths (often limited to 80 characters per line) and Fortran-era conventions like punch-card compatibility. These limitations hindered interoperability between refinement programs, databases like the and , and journal submission systems, making it difficult to share complete structural data efficiently and standardize validation processes. To address these issues, the International Union of Crystallography (IUCr) established the Working Party on Crystallographic Information in 1987, chaired by E. N. Maslen, with sponsorship from the IUCr Commission on Crystallographic Data. The working party aimed to develop a universal standard for archiving and exchanging crystallographic information, emphasizing flexibility for evolving computational needs in small-molecule and materials crystallography. The resulting Crystallographic Information File (CIF) was designed as a self-defining, extensible format based on the (Self-defining Text Archiving and Retrieval) syntax, which uses key-value pairs to allow human-readable, machine-parsable files without rigid structures. Key contributors Sydney R. Hall, Frank H. Allen, and I. David Brown led the effort, focusing on core data items for crystal structures, experimental details, and refinement results. CIF's initial specification and the first core dictionary—containing definitions for essential items like parameters, atomic coordinates, and thermal displacement parameters—were published in 1991.

Evolution of Versions

The format, initially published in 1991, has evolved through targeted revisions to address advancements in crystallographic handling and software . Version 1.1 was approved by the Committee on the Maintenance of the CIF Standard (COMCIFS) on 10 December 2002, with the specification posted on the International Union of Crystallography (IUCr) website in February 2003. This update enhanced extensibility by permitting syntactically valid extensions beyond the data items defined in public dictionaries, allowing the format to adapt to specialized applications without breaking core compatibility. It also refined syntax rules, including improved line-folding mechanisms and conformance requirements, to facilitate better error detection and handling in software, reducing common implementation issues like malformed blocks. These changes were elaborated in the seminal review by Brown and McMahon (2002), which described CIF's tag-value structure and its role as a extensible for crystallographic . CIF 2.0 represented a major syntactic overhaul, formally approved by COMCIFS in August 2014 and detailed in a specification. Among its key innovations was full support for encoding, enabling the inclusion of characters for international symbols and non-Latin text in data descriptions. Enhanced looping constructs, such as triple-quoted delimiters ('''), allowed for more robust representation of multi-line strings and complex nested data, while new data types for lists (in square brackets) and tables (in braces) improved the management of large datasets, including matrices from computational simulations. These features expanded CIF's capacity for handling diverse, voluminous structural data in contemporary workflows. Parallel to syntax evolution, the core CIF dictionary has undergone regular updates to accommodate emerging experimental techniques, ensuring comprehensive coverage of crystallographic parameters. Notable expansions include the 1996 addition of categories for area detectors (e.g., _diffrn_detector_area_type) and reflection shells (e.g., reflns_shell), integrating insights from macromolecular experiments. In 2003, reciprocal cell parameters (e.g., _cell_reciprocal_volume) and crystal recrystallization methods were incorporated to support advanced refinement protocols. A 2010 update introduced _diffrn_radiation_wavelength_determination for precise and specifications, reflecting the integration of high-intensity data. These version iterations have been driven by the imperatives of archiving, where reliable, future-proof formats are essential for preserving vast repositories of structural against technological obsolescence. The IUCr's of perpetual support for CIF 1.0 in major databases—such as the Worldwide (wwPDB), Structural Database (), and Open Database ()—ensures seamless access to historical records while newer versions handle expanded modern requirements. This has solidified CIF's status as a for long-term stewardship in .

Key Contributors and Milestones

Sydney R. Hall, a crystallographer from the , served as the lead developer of the Crystallographic Information File (CIF), authoring the foundational specification published in 1991 that established its self-defining text archive structure based on the format. Hall's contributions extended to editing Acta Crystallographica Section C and chairing the IUCr's efforts in standardizing crystallographic data exchange, earning him the 2014 CODATA Prize for outstanding achievement in scientific data management and preservation. Frank H. Allen, as Executive Director of the Cambridge Crystallographic Data Centre (CCDC) from 2002 to 2008, played a pivotal role in integrating into small-molecule databases; under his leadership and earlier involvement, the Cambridge Structural Database (CSD) began accepting CIF deposits in 1994, enabling electronic data submission and building an archive that contained 335,000 entries as of January 2005. Ian D. Brown, an IUCr representative and co-author of the original specification, contributed to its early definition and syntax rules, emphasizing extensible data naming for crystallographic applications. Key milestones include the formation of an IUCr working group in 1990 to develop and manage dictionaries, which evolved into the Committee for the Maintenance of the Standard (COMCIFS) established in 1993 to oversee software development and . The CSD's adoption of in 1994 marked a major step in practical implementation for small-molecule structures. In the 1990s, the (PDB) integrated the macromolecular extension mmCIF, developed by an IUCr committee to address limitations in the legacy PDB format and support complex biomolecular data. A significant advancement occurred in 2019 when the Worldwide mandated mmCIF as the default deposition format for crystallographic structures, effective July 1, to enhance data richness and .

Format and Syntax

Core Structure and Syntax Rules

The is based on the Self-defining Text Archive and Retrieval () syntax, which organizes information into self-describing blocks containing key-value pairs and tabular structures for extensible exchange. This foundation allows CIF to represent complex crystallographic in a human-readable, machine-parsable format, with files composed of ASCII text that can be edited using standard tools. At its core, CIF structure revolves around data blocks, which encapsulate related information and begin with a data_ header followed by an optional , such as data_example_block. Each block contains items expressed as key-value pairs, where keys are tags starting with an (e.g., _tag_name), followed by whitespace and the corresponding value. For repeating , loops provide a tabular format initiated by the loop_ keyword, followed by a list of item tags and then rows of values aligned in columns, separated by whitespace; this enables efficient representation of arrays without explicit indexing. Save frames, optional nested structures starting and ending with save_ keywords, allow grouping of related items within blocks, though nesting is not permitted. Syntax rules emphasize flexibility while enforcing parsability. Whitespace—spaces, tabs, or end-of-line characters—serves solely to separate tokens and is otherwise insignificant, except in multi-line text fields. Values can be unquoted (simple strings without special characters), single-quoted ('value'), or double-quoted ("value"), with quoted forms limited to single lines and not permitting embedded quotes of the same type; multi-line text uses semicolon delimiters, beginning with a semicolon on a new line after the tag and ending similarly, preserving internal whitespace. Special characters like hash (#) initiate comments to end-of-line, while underscores, dollars ($), and brackets are reserved and cannot start unquoted values to avoid syntax conflicts. CIF 1.1 restricts files to the ASCII character set (positions 9, 10, 13, and 32–126), with a maximum line length of 2048 characters excluding end-of-line markers. In contrast, CIF 2.0 extends support to encoding, allowing non-ASCII characters (Unicode code points U+007F and above) while requiring identifiable encodings like a byte-order mark for , and normalizes newlines to U+000A. applies variably: data names, block codes, and reserved keywords (e.g., data_, loop_) are case-insensitive, but data values are case-sensitive to preserve exact content. These rules, while permissive, can lead to errors if reserved characters appear in unquoted fields or if line lengths exceed limits, necessitating careful validation against dictionaries that define permissible item names. For instance, a basic key-value pair might appear as:
_tag_name  value
While a loop structure could be:
loop_
_tag1
_tag2
value1a  value2a
value1b  value2b
Such constructs ensure CIF's robustness for archival purposes.

Data Items and Dictionaries

The Crystallographic Information File (CIF) structures crystallographic data using named data items, which are unique, hierarchical identifiers representing specific attributes of a or experiment. These names typically begin with an and follow a dotted notation to indicate categories and subcomponents, such as _atom_site_fract_x for the fractional x-coordinate of an atom's position within the . Data items are grouped into categories that function like relational tables, enabling the storage of related values in ; for instance, the atom_site category includes items like _atom_site_fract_x, _atom_site_fract_y, and _atom_site_fract_z to describe atomic coordinates collectively. This organization ensures and facilitates mapping to databases, with each item having defined attributes such as (e.g., numerical or character), units, and permissible value ranges. CIF dictionaries provide the semantic foundation by defining these data items, their types, categories, and interrelationships in a machine-readable format. The core CIF dictionary, maintained by the International Union of Crystallography (IUCr) Committee for the Maintenance of the CIF Standard (COMCIFS), covers essential terms for small-molecule crystallography and powder diffraction, specifying constraints like enumeration lists or parent-child links between items (e.g., linking atomic sites to occupancy factors). For macromolecules, the mmCIF dictionary—formally mmcif_pdbx.dic and curated by the Worldwide Protein Data Bank (wwPDB)—extends the core with specialized items for biological structures, such as those describing polymer sequences or non-crystallographic symmetry. Dictionaries themselves are written as STAR files, using data names like _definition.text to describe item meanings and _category.key to establish relational keys. To support modular , CIF uses save frames, which delineate subsections within a file for logically grouping related information, such as experimental conditions or multiple related structures, without altering the overall file syntax. This allows complex files to contain encapsulated blocks, improving organization for archival and exchange purposes. CIF's extensibility for user-defined items is achieved through the (DDL), which governs dictionary construction and allows customization while maintaining compatibility. DDL method 1 (DDL1), established in , provided the initial framework for core CIF dictionaries, focusing on basic attribute definitions. DDL method 2 (DDL2), introduced in 1998, enhanced this with stricter rules for expressing data relationships and was adopted for mmCIF extensions. DDL method 3 (DDLm), developed from 2012 onward, harmonizes DDL1 and DDL2, incorporating support for executable validation methods in the dREL language and enabling private dictionaries with registered prefixes for domain-specific additions. These methods ensure that new items can be defined without conflicting with standard vocabularies, promoting . Dictionary evolution aligns with CIF versions; for example, CIF 1.1 introduced new data items and categories for , including profile fitting parameters and instrumental details, to accommodate broader experimental techniques. Such updates are managed by COMCIFS to reflect advances in while preserving .

File Organization and Examples

The (CIF) follows a structured layout to ensure and . Files typically begin with a header specifying the version, such as #\#CIF_2.0, which indicates compliance with the updated syntax supporting and advanced types. This is followed by one or more data blocks, each initiated by a data_ keyword appended with a unique, case-insensitive identifier (up to 75 characters), such as data_example_crystal. Data blocks encapsulate related information through data items—pairs of underscores-prefixed tags (e.g., _cell_length_a) and their values—or structures for tabular . Save frames, delimited by save_ keywords, organize nested or referential , though they are primarily used in dictionary files rather than standard structural CIFs. Whitespace separates elements, with lines limited to 2048 characters, and comments prefixed by # for annotations. Metadata is handled via audit items in the data block, capturing and details. For instance, _citation_journal_abbrev records the abbreviated journal name (e.g., "Acta Cryst. C"), while related tags like _citation_year and _citation_id provide comprehensive referencing. These items ensure traceability and are often placed in an initial data_global block for overarching file information. Values may be unquoted for simple numerics, single- or double-quoted for strings with special characters, or semicolon-delimited for multi-line text fields. Best practices recommend using the .cif file extension for uncompressed files to facilitate direct parsing by software. For transmission or storage of large CIFs, especially those including structure factors, compression via (yielding .cif.gz) is advised, though submissions to journals may require uncompressed formats. Data block codes should be concise (≤32 characters) and meaningful, avoiding tabs in favor of spaces for consistent formatting. A basic example of a small-molecule CIF snippet illustrates this organization, showing the header, a data block with unit cell parameters, space group, and a partial atom sites loop (full coordinates omitted for brevity):

## CIF_2.0

CIF 2.0 is an alternative syntax to the original CIF format (versions 1.0 and 1.1), approved by the IUCr's COMCIFS in August 2014 and formally specified in 2016.[](https://www.iucr.org/resources/cif/cif2)[](https://journals.iucr.org/j/issues/2016/01/00/aj5269/) It supplements CIF 1.x by addressing limitations in character sets, data types, and string handling, while maintaining compatibility through shared dictionaries and data item definitions. CIF 2.0 files are identified by the header `##CIF_2.0` and must use [UTF-8](/page/UTF-8) encoding.[](https://journals.iucr.org/j/issues/2016/01/00/aj5269/)

Key enhancements include full [Unicode](/page/Unicode) support for international characters, introduction of complex data types such as lists (e.g., `[1 0 0]`) and tables (e.g., `{"symm": "P 4n 2 3 -1n"}`), and more flexible multiline string delimiters using triple quotes (`'''` or `"""`) in addition to semicolons.[](https://journals.iucr.org/j/issues/2016/01/00/aj5269/) These features enable better representation of diverse global datasets and advanced structures like matrices and arrays, simplifying parsing by disallowing embedded delimiters in quoted strings. The core file organization—data blocks, save frames, loops for tabular data, and key-value pairs—remains similar to [CIF](/page/Cif) 1.x, ensuring interoperability with existing software via converters.[](https://www.iucr.org/resources/cif/cif2)

A simple example of a CIF 2.0 file demonstrating basic structure and a list [data type](/page/Data_type):
##CIF_2.0 data_example _cell_length_a 10.3 _symmetry_space_group_name_H-M "P 4n 2 3 -1n" _cell_vectors [10.3 0.0 0.0] [0.0 10.3 0.0] [0.0 0.0 10.3]

Common pitfalls in CIF 2.0 include failing to include the version header, which may cause parsers to default to CIF 1.x rules; misuse of new [data types](/page/Data_type) without proper [dictionary](/page/Dictionary) support; or incorrect line folding in triple-quoted strings, leading to validation errors. These can be avoided by using tools like the IUCr's CIFtbx library for creation and validation against the core CIF [dictionary](/page/Dictionary).[](https://journals.iucr.org/j/issues/2016/01/00/aj5269/)

## Variants

### Core CIF for Small Molecules

The core Crystallographic Information File (CIF) dictionary is specifically designed for the archiving and exchange of data from single-crystal studies of small-molecule and inorganic crystals, encompassing essential structural parameters without the complexity required for biomolecular assemblies.[](https://www.iucr.org/resources/cif/dictionaries/cif_core) Its scope includes definitions for unit cell dimensions, such as `_cell_length_a`, `_cell_length_b`, `_cell_length_c`, `_cell_angle_alpha`, `_cell_angle_beta`, and `_cell_angle_gamma`, which describe the fundamental geometry of the crystal lattice. Symmetry information is captured through items like `_symmetry_space_group_name_H-M` for the Hermann-Mauguin notation of the space group and `_symmetry_equiv_pos_as_xyz` for equivalent positions, enabling the reconstruction of the asymmetric unit into the full unit cell. Atomic positions are detailed via `_atom_site_label`, `_atom_site_type_symbol`, `_atom_site_fract_x`, `_atom_site_fract_y`, and `_atom_site_fract_z`, often accompanied by `_atom_site_occupancy` to account for partial site occupations in disordered structures. Displacement parameters, critical for modeling thermal motion and disorder, are represented by isotropic values like `_atom_site_B_iso_or_equiv` or anisotropic tensors such as `_atom_site_aniso_U_11` through `_atom_site_aniso_U_33`. Refinement details, including statistical outcomes, are provided through items like `_reflns_number_observed` for the count of measured reflections, `_refine_ls_R_factor_R` for the conventional R-factor, and `_refine_ls_goodness_of_fit` for the overall fit quality.[](https://www.iucr.org/resources/cif/dictionaries/cif_core)[](https://www.iucr.org/cif/cifdic_html/1/cif_core.dic/index.html)

In practice, core CIF files organize these data into data blocks and loops, facilitating the storage of multiple related datasets, such as experimental conditions alongside structural results, in a single extensible file. This [format](/page/The_Format) supports the integration of [metadata](/page/Metadata) like [chemical composition](/page/Chemical_composition) via `_chemical_formula_analytical` and publication details, ensuring comprehensive [documentation](/page/Documentation) of the refinement process. For instance, a typical small-molecule [CIF](/page/Cif) might loop over atomic sites to list coordinates and occupancies, directly linking to [symmetry](/page/Symmetry) operations for validation. The dictionary's machine-readable nature allows automated parsing and error checking, promoting [data integrity](/page/Data_integrity) during submission and archival.[](https://www.iucr.org/resources/cif/dictionaries/cif_core)

Core CIF has become integral to major databases for small-molecule structures, including the [Cambridge Structural Database (CSD)](/page/CSD), where it serves as the standard format for depositing and retrieving over 1.36 million (as of January 2025) [organic](/page/Organic) and metal-[organic](/page/Organic) crystal structures, enabling advanced querying and analysis.[](https://www.ccdc.cam.ac.uk/media/CSD-Entries-Summary-Statistics-2025.pdf)[](https://www.ccdc.cam.ac.uk/community/access-deposit-structures/deposit-a-structure/guide-to-cifs/) Similarly, the Crystallography Open Database (COD) maintains an open-access repository of over 529,000 (as of November 2025) inorganic, [organic](/page/Organic), and mineral structures in core CIF format, with automated processing to extract and index data items for global searchability.[](https://www.crystallography.net/)[](https://www.crystallography.net/) These databases leverage the format's relational structure to connect atomic coordinates with derived properties, supporting research in [materials science](/page/Materials_science) and [chemistry](/page/2H).

Compared to legacy formats like the fixed-column punch-card style or early binary files, core CIF offers superior machine [readability](/page/Readability) through its self-describing, tag-value [syntax](/page/Hungarian_noun_phrase), which eliminates [parsing](/page/Parsing) ambiguities and supports relational linkages between [data](/page/Data) tables.[](https://pmc.ncbi.nlm.nih.gov/articles/PMC4894604/) Its extensibility allows seamless incorporation of new [data](/page/Data) items via dictionary updates without invalidating existing files, making it adaptable to evolving crystallographic techniques while preserving [backward compatibility](/page/Backward_compatibility). This has streamlined [data](/page/Data) exchange, reducing transcription errors that plagued older methods. Historically, the International Union of Crystallography (IUCr) mandated CIF submission for its journals starting in 1992, accelerating its adoption as the [de facto standard](/page/De_facto_standard) for small-molecule publications and fostering widespread [software development](/page/Software_development).[](https://pmc.ncbi.nlm.nih.gov/articles/PMC4894604/)

### mmCIF for Macromolecules

The macromolecular Crystallographic Information File (mmCIF), also known as PDBx/mmCIF, extends the core Crystallographic Information File (CIF) standard to accommodate the complex hierarchical data inherent in protein, nucleic acid, and other biomolecular structures. Developed as a successor to the legacy Protein Data Bank (PDB) format, mmCIF addresses limitations in the fixed-record PDB format by offering a self-documenting, extensible syntax suitable for large-scale structural archiving. The initial mmCIF dictionary (version 1.0), containing over 1,700 data definitions, was released in 1997 by the IUCr's COMCIFS working group.[](https://www.sciencedirect.com/science/article/pii/S0076687997770320)

Key extensions in the mmCIF dictionary focus on polymeric entities and their connectivity. The _entity_poly_seq category defines the monomer sequence for each [polymer](/page/Polymer) entity, enabling representation of sequence heterogeneity such as [mutations](/page/The_Mutations) or post-translational modifications across multiple chains.[](https://mmcif.wwpdb.org/dictionaries/mmcif_pdbx_v50.dic/Categories/entity_poly_seq.html) The _pdbx_poly_seq_scheme category provides a residue-level mapping of the observed structure to the entity sequence, supporting alignments, numbering schemes, and handling of gaps or insertions observed in [electron density](/page/Electron_density). Complementing these, the _struct_conn category records inter- and intra-molecular connections, including covalent bonds, disulfide bridges, hydrogen bonds, and metal coordination, with explicit details on bond types and distances.[](https://mmcif.wwpdb.org/dictionaries/mmcif_pdbx_v50.dic/Categories/struct_conn.html)

In 2019, the Worldwide Protein Data Bank (wwPDB) consortium mandated that all new crystallographic depositions use the PDBx/mmCIF format exclusively, effective July 1, phasing out the legacy PDB format to enhance [data validation](/page/Data_validation), [interoperability](/page/Interoperability), and long-term preservation.[](https://pmc.ncbi.nlm.nih.gov/articles/PMC6465986/) This transition underscores mmCIF's role in standardizing macromolecular data exchange. mmCIF's architecture is particularly adept at managing voluminous biomolecular datasets, supporting multiple conformational models (e.g., NMR ensembles via the _pdbx_nmr_ensemble category), non-polymeric ligands through _entity (type: non-polymer) and _chem_comp descriptors for [chemical composition](/page/Chemical_composition), and integration with [electron density](/page/Electron_density) maps via _reflns and _refine categories for [structure factor](/page/Structure_factor) data.[](https://mmcif.wwpdb.org/docs/user-guide/guide.html)[](https://www.rcsb.org/docs/general-help/electron-density-maps-and-coefficient-files)

### Specialized Extensions

The Crystallographic Information File (CIF) framework supports specialized extensions tailored to niche crystallographic subfields, enabling precise documentation of experimental data beyond standard small-molecule and macromolecular applications. These extensions maintain compatibility with the core CIF syntax while introducing domain-specific data items to address unique challenges, such as handling [powder](/page/Powder) patterns, modulated structures, [electron](/page/Electron) microscopy datasets, [image](/page/Image) arrays, and interdisciplinary integrations. Developed under the oversight of the International Union of Crystallography (IUCr), these variants promote data archiving and exchange in specialized research environments.[](https://www.iucr.org/resources/cif/dictionaries)

The [powder diffraction](/page/Powder_diffraction) CIF (pdCIF) dictionary supplements the core CIF to accommodate the requirements of powder diffraction experiments, including documentation of raw data, processing steps, and refinement results from techniques like Rietveld analysis. It supports instruments such as conventional [X-ray](/page/X-ray), [synchrotron](/page/Synchrotron), and [neutron](/page/Neutron) sources, facilitating the exchange of multi-phase datasets and [provenance](/page/Provenance) [information](/page/Information). A key data item, `_pd_proc_ls_profile`, describes parameters for peak profile fitting in least-squares refinement, allowing representation of instrumental resolution functions and peak shape models essential for quantitative phase analysis.[](https://www.iucr.org/resources/cif/dictionaries/cif_pd)

msCIF (modulated structures CIF dictionary) extends the core dictionary to describe incommensurately modulated and composite crystal structures, aligning with guidelines from the IUCr Commission on Aperiodic Crystals. It enables the modeling of atomic displacements, occupancies, and other parameters that vary periodically but incommensurately with the basic lattice, common in materials exhibiting quasi-periodic order. Central to this extension is the `_atom_site_modulation` category, which records modulation functions for atom sites, including Fourier coefficients for displacive and occupational variations, supporting the reconstruction of superspace models. Examples include one-dimensional modulations in compounds like K₂SeO₄ and misfit layers in (LaS)₁.₁₄NbS₂.[](https://www.iucr.org/resources/cif/dictionaries/cif_ms)

Recent updates to CIF dictionaries post-2020 have incorporated extensions for electron crystallography, particularly to handle two-dimensional ([2D](/page/2D)) and three-dimensional ([3D](/page/3D)) electron microscopy (EM) data from techniques like serial electron diffraction and MicroED. These additions, integrated into the core and image-related dictionaries, support metadata for low-dose imaging, beam-induced motion correction, and tilt series [reconstruction](/page/Reconstruction), addressing the growing use of cryo-EM in structural determination of beam-sensitive materials. The IUCr's initiation of a dedicated electron crystallography section in IUCrJ in 2021 underscores the framework's adaptation to this field, with data items for specifying EM-specific parameters like acceleration voltage and detector geometry.

The image CIF (imgCIF) dictionary provides a [mechanism](/page/Mechanism) for archiving raw [diffraction](/page/Diffraction) images and associated metadata within the CIF ecosystem, using binary encoding compatible with the Crystallographic [Binary File](/page/Binary_file) (CBF) format. It organizes data from area detectors into array structures, supporting one-, two-, and three-dimensional datasets from [X-ray](/page/X-ray), [neutron](/page/Neutron), or [electron](/page/Electron) sources. The `_img_data` item specifically encodes [pixel](/page/Pixel) intensities for raw [diffraction](/page/Diffraction) images, enabling the storage of unprocessed frames with details on [exposure](/page/Exposure), goniometry, and [calibration](/page/Calibration), which is vital for reprocessing and validation in high-throughput experiments.[](https://www.iucr.org/cif/cifdic_html/2/cif_img.dic/index.html)

CIF integrates with the [NeXus](/page/NEXUS) format—a hierarchical standard for [neutron](/page/Neutron), [X-ray](/page/X-ray), and [muon](/page/Muon) [scattering](/page/Scattering) data based on HDF5—to enhance [interoperability](/page/Interoperability) in [neutron](/page/Neutron) [scattering](/page/Scattering) applications. This linkage allows [CIF](/page/Cif) metadata, such as structural models, to be embedded within NeXus files for comprehensive [beamline](/page/Beamline) data management, including event streams and [instrument](/page/Instrument) geometries. Efforts by the IUCr and NeXus International Advisory Committee have mapped [CIF](/page/Cif) dictionaries to NeXus classes, facilitating hybrid workflows where [CIF](/page/Cif) handles crystallographic specifics and NeXus manages large-scale raw data volumes.[](https://www.nexusformat.org/pdfs/CIFNexus.pdf)

## Applications and Usage

### In Small-Molecule Crystallography

In small-molecule crystallography, the Crystallographic Information File ([CIF](/page/Cif)) is generated as part of the standard refinement [workflow](/page/Workflow) following structure solution from [X-ray](/page/X-ray) [diffraction](/page/Diffraction) data. Refinement programs such as SHELXL process the intensity data (typically in .hkl format) through least-squares optimization to derive atomic coordinates, thermal parameters, and other structural details, after which commands like LIST or ACTA produce a complete CIF output containing the refined model, experimental conditions, and [metadata](/page/Metadata) in the core CIF syntax.[](https://pmc.ncbi.nlm.nih.gov/articles/PMC4294323/)[](https://www.psi.ch/sites/default/files/import/lns-diffraction/LinuxEN/shelx.pdf) This automated export ensures the file captures essential data items defined in the core CIF dictionary, facilitating seamless transition from computation to documentation.[](https://publish.uwo.ca/~chemxray/Links/shelxl_cards_quick_reference_manual.pdf)

CIF serves a critical archival role by enabling the deposition of small-molecule structures into centralized databases such as the [Cambridge Structural Database (CSD)](/page/CSD), Inorganic Crystal Structure Database (ICSD), and [Crystallography Open Database (COD)](/page/Cod), where it acts as the primary format for submission. Researchers submit CIF files—often alongside supporting files like .fcf or .hkl—through web-based portals provided by these repositories, ensuring structures are curated, validated, and made publicly accessible upon publication.[](https://www.ccdc.cam.ac.uk/community/access-deposit-structures/deposit-a-structure/guide-to-cifs/)[](https://support.ccdc.cam.ac.uk/support/solutions/articles/103000306356-what-are-the-criteria-for-deposition-to-the-ccdc-)[](http://icsd-depot.products.fiz-karlsruhe.de/en/icsddepotdepositstructures/deposit-structures) Major crystallographic journals, including those from the International Union of Crystallography (IUCr), mandate CIF deposition as a condition for manuscript acceptance, promoting data integrity and reuse across the community.[](https://www.ccdc.cam.ac.uk/community/access-deposit-structures/deposit-a-structure/cif-deposition-guidelines/)[](https://datascience.codata.org/articles/635/files/submission/proof/635-1-3210-1-10-20170807.pdf)

The adoption of CIF in this field yields significant benefits, particularly through automated validation and advanced [data mining](/page/Data_mining) capabilities. Tools like checkCIF, developed by the IUCr, parse the [CIF](/page/Cif) to assess [internal consistency](/page/Internal_consistency), completeness, and adherence to chemical reasonableness, flagging issues such as anomalous [bond](/page/Bond) lengths or missing [symmetry](/page/Symmetry) details via leveled alerts that [guide](/page/Guide) corrections before deposition.[](https://pmc.ncbi.nlm.nih.gov/articles/PMC6944088/)[](https://journals.iucr.org/paper?yr5065) Furthermore, the standardized format supports [data mining](/page/Data_mining) across repositories; for instance, queries on geometric parameters or intermolecular interactions in the [CSD](/page/CSD) enable statistical analyses that inform molecular design and predict properties without re-refining raw data.[](https://www.iucr.org/resources/data/meeting-reports/asca2018)[](http://www.platonsoft.nl/platon/pl000601.html)

A notable application of CIF arises in pharmaceutical polymorph screening, where mined structures from databases like the [CSD](/page/CSD) accelerate the identification and characterization of crystal forms critical for drug stability and [bioavailability](/page/Bioavailability). In a [case study](/page/Case_study) on the [antibiotic](/page/Antibiotic) [ciprofloxacin](/page/Ciprofloxacin), researchers utilized CSD-deposited CIF data to benchmark computational predictions against experimentally determined polymorphs, revealing previously undetected forms under high-pressure conditions and informing scalable synthesis routes to avoid patent disputes over solid-state variants.[](https://pmc.ncbi.nlm.nih.gov/articles/PMC4525153/)[](https://www.nature.com/articles/s41467-025-57479-1) This integration of archival CIFs with predictive modeling exemplifies how the format underpins efficient screening workflows in [drug development](/page/Drug_development).[](https://www.researchgate.net/publication/315971550_CRYSTAL_STRUCTURE_PREDICTION_IN_THE_CONTEXT_OF_PHARMACEUTICAL_POLYMORPH_SCREENING_AND_PUTATIVE_POLYMORPHS_OF_CIPROFLOXACIN)

As of November 2025, the CSD holds over 1.4 million small-molecule structures, with 1,407,820 total curated entries, underscoring the file's dominance in capturing and disseminating crystallographic knowledge.[](https://www.ccdc.cam.ac.uk/solutions/about-the-csd/csd-statistics/)

### In Macromolecular and Structural Biology

In macromolecular and [structural biology](/page/Structural_biology), the macromolecular Crystallographic Information File (mmCIF) serves as the primary data standard for depositing and archiving atomic coordinates of proteins, nucleic acids, and complexes determined via [X-ray crystallography](/page/X-ray_crystallography) and cryo-electron microscopy (cryo-EM) to the [Protein Data Bank](/page/Protein_Data_Bank) (PDB). Announced in February 2019 and mandatory from July 1, 2019, the Worldwide Protein Data Bank (wwPDB) requires the submission of mmCIF files for all crystallographic structures, enabling standardized coordinate deposition, validation against experimental data, and integration of metadata such as diffraction intensities or [electron density](/page/Electron_density) maps.[](https://pmc.ncbi.nlm.nih.gov/articles/PMC6465986/) For cryo-EM, mmCIF supports the deposition of both atomic models and associated volume maps, facilitating the validation of structures against raw micrographs and half-maps to ensure consistency with observed densities.[](https://www.rcsb.org/docs/general-help/structures-without-legacy-pdb-format-files)

Within the PDB, mmCIF enables detailed [annotations](/page/annotation) that capture the biological context and archival status of structures, including the data item `_pdbx_database_status.status_code`, which specifies release conditions such as "REL" for publicly released entries or "HPUB" for those held until publication.[](https://mmcif.wwpdb.org/dictionaries/mmcif_pdbx_v40.dic/Items/_pdbx_database_status.status_code.html) This [annotation](/page/Annotation), required in all PDB entries, supports automated processing and ensures compliance with wwPDB policies for data dissemination.[](https://cdn.rcsb.org/wwpdb/docs/documentation/annotation/wwPDB-B-2024Jan-V4.4.pdf) mmCIF's extensible [dictionary](/page/Dictionary) also accommodates entity descriptions, polymer sequences, and functional classifications, allowing researchers to link structures to genomic data and biochemical assays.

The adoption of mmCIF has significantly impacted research by integrating experimental structures with computational predictions, as seen in tools like [AlphaFold](/page/AlphaFold), which outputs predicted protein structures directly in mmCIF format, including per-residue confidence scores (pLDDT) encoded in B-factor fields for seamless deposition or comparison with PDB entries.[](https://alphafold.ebi.ac.uk/faq) This compatibility has accelerated workflows in [structural biology](/page/Structural_biology), enabling hybrid modeling where AI predictions refine cryo-EM or [X-ray](/page/X-ray) data.[](https://academic.oup.com/nar/article/52/D1/D368/7337620) However, challenges persist in representing conformational and compositional heterogeneity inherent to cryo-EM datasets, where current mmCIF relies on limited mechanisms like alternative location labels (`altloc`) and B-factors, which conflate dynamic states (e.g., loop flexibility) with variable occupancy (e.g., [ligand](/page/Ligand) binding).[](https://pmc.ncbi.nlm.nih.gov/articles/PMC11220883/) Proposed enhancements include new hierarchical data categories to separately encode conformational ensembles (e.g., layered states for side chains) and compositional variants (e.g., bound/unbound ligands), improving interpretability for ensemble-based function predictions.[](https://doi.org/10.1107/S2052252524005098)

As of 2025, mmCIF is the mandatory submission and primary archival format for new entries, with ongoing transition to exclusive use alongside extended 12-character PDB IDs anticipated after 2028, underpinning an archive of nearly 250,000 structures.[](https://www.rcsb.org/stats/growth/growth-released-structures)[](https://www.rcsb.org/news/630fee4cebdf34532a949c34)[](https://www.sdsc.edu/news/2025/PR20251104-PDB.html) This shift ensures long-term interoperability while addressing the scale of macromolecular data in [structural biology](/page/Structural_biology).

### Broader Scientific and Archival Uses

Beyond traditional crystallographic applications, the [Crystallographic Information File (CIF)](/page/Cif) format facilitates interdisciplinary integration in [materials science](/page/Materials_science), particularly through its compatibility with [density functional theory](/page/Density_functional_theory) (DFT) simulations for [crystal structure](/page/Crystal_structure) modeling. In DFT workflows, CIF files serve as input for generating atomic positions and lattice parameters required by software packages such as [VASP](/page/VASP) and [Quantum ESPRESSO](/page/Quantum_ESPRESSO). For instance, tools like CIF2Cell convert CIF data into POSCAR format for VASP, enabling efficient setup of [periodic boundary conditions](/page/Periodic_boundary_conditions) and electronic structure calculations for crystalline materials. Similarly, CIF2Cell supports Quantum ESPRESSO input generation, allowing researchers to model properties like band gaps and elastic moduli directly from experimental [crystal](/page/Crystal) data. This [interoperability](/page/Interoperability) streamlines the transition from experimental structures to computational predictions, enhancing accuracy in materials design for applications such as semiconductors and catalysts.

CIF's role in open-access archiving has democratized access to crystallographic data, exemplified by the Crystallography Open Database ([COD](/page/Cod)), which distributes over 529,000 (specifically 529,712) small-molecule crystal structures in [CIF](/page/Cif) format as of November 2025. The [COD](/page/Cod) provides free, public-domain [CIF](/page/Cif) files for download, enabling global researchers to retrieve and reuse structures without restrictions, thereby accelerating discoveries in chemistry and [materials science](/page/Materials_science). This open distribution model, licensed under CC0, has fostered collaborative validation and extension of datasets, with structures sourced from [literature](/page/Literature) and direct submissions.[](https://www.crystallography.net/)

For enhanced interoperability, CIF data can be converted to JSON or XML formats, supporting integration with web [APIs](/page/Apis) and modern data pipelines. The CIF-JSON [schema](/page/Schema), developed by COMCIFS, maps CIF's self-defining structure to JSON objects, preserving data blocks, items, and tables while using standard JSON types like arrays and [null](/page/Null) values for [missing data](/page/Missing_data). This conversion facilitates seamless data exchange in web-based applications, such as querying [crystal](/page/Crystal) databases via RESTful [APIs](/page/Apis). In [machine learning](/page/Machine_learning) contexts, CIF-derived features are leveraged for predicting [crystal](/page/Crystal) properties, with frameworks like [Crystal](/page/Crystal) Twins processing CIF files into graph representations for [self-supervised learning](/page/Self-supervised_learning) on large datasets, achieving improved accuracy in forecasting formation energies and band gaps.

Educational applications of CIF extend its utility to [pedagogy](/page/Pedagogy), particularly in teaching [crystal](/page/Crystal) [symmetry](/page/Symmetry) and structural concepts through accessible parsers and visualization tools. In classroom settings, CIF files from databases like the Cambridge Structural Database are parsed to generate 3D-printable models or [virtual reality](/page/Virtual_reality) visualizations, allowing students to explore [symmetry](/page/Symmetry) elements, bond angles, and [chirality](/page/Chirality) interactively. For example, software such as Mercury converts CIF to STL files for [3D printing](/page/3D_printing), while Nanome enables VR immersion, with studies showing 75% of students reporting deeper understanding of coordination [chemistry](/page/2H). These methods make abstract [symmetry](/page/Symmetry) operations tangible, supporting curricula in [chemistry](/page/2H) and [materials science](/page/Materials_science) without requiring advanced lab equipment.

As a global standard, CIF has been widely adopted in neutron and synchrotron radiation facilities for data management and archiving. Facilities like the National Synchrotron Light Source II (NSLS-II) and [Diamond Light Source](/page/Diamond_Light_Source) integrate CIF with formats such as [NeXus](/page/NEXUS) and HDF5 to handle high-volume diffraction data, using imgCIF for [metadata](/page/Metadata) and [compression](/page/Compression) to manage terabytes of output per experiment. This adoption ensures standardized exchange between instruments, automated processing pipelines, and international databases, supporting [beamline](/page/Beamline) operations at data rates exceeding 1 GB/second.

## Tools and Implementation

### Software for Reading and Writing

Several software tools and libraries facilitate the reading and writing of [Crystallographic Information Files](/page/Crystallographic_Information_File) ([CIFs](/page/Cif)), enabling crystallographers to generate, parse, and manipulate these files during structure determination and data exchange. These tools range from standalone programs for structure refinement and export to programmatic libraries for integration into larger workflows, supporting both core [CIF](/page/Cif) and its variants like mmCIF.[](https://www.iucr.org/resources/cif/software)

Olex2 is a comprehensive crystallography platform that supports writing CIF files as part of its structure refinement and reporting pipeline. It allows users to save refined models directly in CIF format, incorporating crystallographic parameters, atomic coordinates, and reflection data for archival and publication purposes. This export functionality is integrated with Olex2's task system, where users can finalize structures by generating standard CIFs compliant with IUCr guidelines.[](https://www.olexsys.org/olex2/docs/reference/commands/file/)[](https://www.olexsys.org/olex2/docs/tasks/finalising-a-structure/)

PLATON serves as a multipurpose crystallographic tool that enables the export of refined structures to [CIF](/page/Cif) format, particularly after geometry analysis and data reduction steps. It processes input from various sources, such as SHELX files, and outputs [CIFs](/page/Cif) with detailed [molecular geometry](/page/Molecular_geometry), including derived parameters and validation checks, making it suitable for small-molecule [crystallography](/page/Crystallography) workflows.[](https://www.platonsoft.nl/platon/)[](https://www.iucr.org/resources/cif/software)

For programmatic reading and writing, [PyCIFRW](/page/Cif) is a [Python](/page/Python) library designed specifically for handling [CIF](/page/Cif) files, providing methods to parse dictionary-based data, validate syntax, and generate output files. Developed by the Australian National University, it supports both reading of CIF strings into Python objects and writing structured data back to CIF, with conformance to IUCr standards for core and mmCIF variants.[](https://github.com/jamesrhester/pycifrw)[](https://mmcif.wwpdb.org/docs/software-resources.html)

CIFLIB offers a C-language application programming interface ([API](/page/API)) for reading and writing macromolecular [CIF](/page/Cif) data, originally developed by the [Nucleic Acid](/page/Nucleic_acid) Database Project. It provides high-level functions to process CIF dictionaries, extract entity relationships, and check [data integrity](/page/Data_integrity), allowing developers to build custom applications that [interface](/page/Interface) with CIF files without manual [parsing](/page/Parsing).[](https://www.iucr.org/resources/cif/software/ciflib)

In Java environments, CIFTools (formerly associated with [Java](/page/Java)CIF implementations) is a library for reading, writing, and manipulating both text-based CIF and BinaryCIF formats. Maintained by the RCSB PDB, it enables efficient [parsing](/page/Parsing) of mmCIF files for [structural biology](/page/Structural_biology) applications, including conversion between formats and data extraction for database integration.[](https://github.com/rcsb/ciftools-java)

The Computational Crystallography Toolbox (cctbx), an open-source suite for crystallographic computations, includes the iotbx.cif module for robust CIF input/output operations. This module reads CIF files to construct [crystal structures](/page/Crystal_structure), handles [symmetry](/page/Symmetry) and [reflection data](/page/Reflection), and supports writing modified structures back to CIF, making it integral for automated refinement pipelines in both small-molecule and macromolecular contexts.[](https://cctbx.github.io/iotbx/iotbx.cif.html)

Commercially, [Mercury](/page/Mercury) from the [Cambridge Crystallographic Data Centre](/page/Cambridge) ([CCDC](/page/CCDC)) provides capabilities for reading and writing CIF files within its [crystal structure visualization](/page/Visualization) and [analysis environment](/page/Analysis). It imports CIFs to [display](/page/Display) [atomic models](/page/Atomic) and exports edited or analyzed structures in CIF format, supporting interoperability with the [Cambridge Structural Database](/page/Cambridge).[](https://www.ccdc.cam.ac.uk/solutions/software/mercury/)[](https://www.ccdc.cam.ac.uk/solutions/software/free-mercury/)

For API integrations, the LAMMPS [molecular dynamics](/page/Molecular_dynamics) simulation package supports [CIF](/page/Cif) through preprocessing tools like CIF2Cell, which reads [CIF](/page/Cif) files to generate input geometries and [lattice](/page/Lattice) parameters for simulations of crystalline materials. This allows seamless incorporation of experimental [CIF](/page/Cif) data into computational workflows without direct in-core parsing.[](https://www.lammps.org/prepost.html)[](https://github.com/andeplane/cif2cell-lammps)

### Validation and Editing Tools

The primary tool for validating Crystallographic Information Files ([CIFs](/page/Cif)) is checkCIF, an online service provided by the International Union of [Crystallography](/page/Crystallography) (IUCr). It performs comprehensive checks on core [CIF](/page/Cif) and macromolecular [CIF](/page/Cif) (mm[CIF](/page/Cif)) files, verifying syntax compliance, dictionary adherence, [cell](/page/Cell) parameters, space-group [symmetry](/page/Symmetry), anisotropic [displacement](/page/Displacement) parameters, and publication-related items such as structure factors.[](https://checkcif.iucr.org/)[](https://journals.iucr.org/services/cif/checkcif.html) Alerts generated by checkCIF are categorized by severity (A for critical, B for significant, G for general), flagging issues like missing mandatory data items, [symmetry](/page/Symmetry) inconsistencies, or unusual geometrical features.[](https://journals.iucr.org/e/issues/2020/01/00/su5533/)

Common error types detected include dictionary mismatches, where data names or structures deviate from the official DDL1 or DDLm [dictionaries](/page/dictionary), and numerical inconsistencies such as improbable bond lengths or [angles](/page/angles) that fall outside expected ranges based on statistical norms.[](https://journals.iucr.org/services/cif/checking/autolist.html) For batch validation, users can employ the standalone [PLATON](/page/Platon) software, which powers much of checkCIF's functionality and allows local processing of multiple [CIF](/page/cif) files to identify similar issues without online submission.

Editing tools facilitate manual corrections while preserving CIF integrity. enCIFer, developed by the Cambridge Crystallographic Data Centre (CCDC), is a free graphical application for viewing, editing, and validating single- or multi-block [CIFs](/page/Cif), with features to add or modify data safely and visualize molecular structures to aid tweaks.[](https://www.ccdc.cam.ac.uk/solutions/software/encifer/) For command-line editing, CIFEDIT from the IUCr's ciftools package enables viewing and modification of CIF contents, particularly useful for multi-block files.[](https://www.iucr.org/resources/cif/software) Avogadro, an open-source molecular editor, supports importing [CIF](/page/Cif) files for graphical adjustments to [atomic](/page/Atomic) coordinates or [unit](/page/Unit) cells, followed by export back to CIF format, making it suitable for structural refinements in small-molecule [crystallography](/page/Crystallography).[](https://avogadro.cc/)[](https://avogadro.cc/docs/building-materials/building-a-crystal-slab/)

In 2025, the [CIF](/page/Cif) extension for [Visual Studio Code](/page/Visual_Studio_Code) was released, offering crystallographers an advanced text editor for CIF and dictionary files. Developed by researchers at the [University of Jyväskylä](/page/University_of_Jyväskylä), it provides [syntax highlighting](/page/Syntax_highlighting), auto-completion based on IUCr dictionaries, real-time error checking, and hover information for data names, available as open-source on [GitHub](/page/GitHub).[](https://journals.iucr.org/j/issues/2025/04/00/gj5319/index.html)

Recent enhancements to validation tools include improved support for CIF 2.0, with scripts and libraries like those in the IUCr's CIFtbx updated as of 2024 to handle DDLm-based dictionaries and extended syntax for better [interoperability](/page/Interoperability).[](https://www.iucr.org/resources/cif/software)[](https://www.iucr.org/resources/cif/documentation)

### Visualization and Analysis Software

[Jmol](/page/Jmol) is an open-source Java-based viewer that enables interactive [3D rendering](/page/3D_rendering) of [crystal](/page/Crystal) structures directly from [CIF](/page/Cif) files, supporting both CIF 1.1 and 2.0 formats for small-molecule and macromolecular data.[](https://jmol.sourceforge.net/)[](https://chemapps.stolaf.edu/jmol/docs/) [VESTA](/page/Vesta), a cross-platform 3D visualization program, similarly imports [CIF](/page/Cif) files to display structural models, volumetric data like [electron](/page/Electron) densities, and [crystal](/page/Crystal) morphologies, facilitating detailed inspection of atomic arrangements and bonding.[](https://jp-minerals.org/vesta/en/)[](https://www.researchgate.net/publication/239252583_VESTA_A_Three-Dimensional_Visualization_System_for_Electronic_and_Structural_Analysis)

For thermal displacement analysis, ORTEP-3 generates high-quality illustrations of crystal structures, including thermal ellipsoids derived from atomic displacement parameters in [CIF](/page/Cif) files, with support for input formats like SHELX-derived [CIF](/page/Cif).[](http://www.cristal.org/DU-SDPD/nexus/farrugia/software/ortep3/index.htm) The [Bilbao](/page/Bilbao) Crystallographic Server provides web-based [symmetry](/page/Symmetry) tools that process [CIF](/page/Cif) inputs to analyze [space](/page/Space) groups, subgroups, distortion modes, and magnetic structures, outputting results in [CIF](/page/Cif)-compatible formats for further refinement.[](https://www.cryst.ehu.es/)

In macromolecular contexts, the Mol* Viewer serves as a modern web-based toolkit for 3D visualization and analysis of mmCIF files from the [Protein Data Bank](/page/Protein_Data_Bank), allowing interactive exploration of large biomolecular structures, density maps, and annotations without local installation.[](https://molstar.org/viewer/)[](https://pmc.ncbi.nlm.nih.gov/articles/PMC8262734/)

Advanced applications include the Materials Project database, which utilizes CIF inputs for [density functional theory](/page/Density_functional_theory) calculations of material properties such as band gaps, elastic constants, and formation energies, enabling predictive analysis of novel compounds.[](https://docs.materialsproject.org/methodology/materials-methodology/calculation-details) For scripted integration, the Atomic Simulation Environment (ASE) Python library reads and writes CIF files via its io module, supporting atomistic simulations, structure manipulation, and interfacing with computational engines for property evaluation.

## Standards and Future Directions

### Governance by IUCr

The International Union of Crystallography (IUCr) oversees the development and maintenance of the [Crystallographic Information File](/page/Cif) (CIF) standard through its Committee for the Maintenance of the CIF Standard (COMCIFS).[](https://www.iucr.org/resources/cif) Established to ensure the integrity and evolution of the CIF framework, COMCIFS operates under the auspices of the IUCr's Commission on Crystallographic Data and Commission on Journals.[](https://www.iucr.org/resources/cif) This committee plays a central role in governing CIF by reviewing and approving updates to its core components, thereby safeguarding its reliability as a data exchange format in [crystallography](/page/Crystallography).[](https://journals.iucr.org/a/issues/2024/02/00/es5053/)

COMCIFS's primary responsibilities include the maintenance of CIF dictionaries, which define the data names, relationships, and validation rules essential to the standard.[](https://www.iucr.org/resources/cif/documentation) The committee examines proposed extensions to CIF, such as new data items or dictionary revisions, and approves versions only after rigorous evaluation to maintain compatibility and scientific accuracy.[](https://www.iucr.org/__data/iucr/lists/comcifs-l/msg00285.html) For instance, suggestions for new data names are submitted via public repositories like [GitHub](/page/GitHub), allowing community input while ensuring oversight by COMCIFS.[](https://www.iucr.org/resources/cif) This process supports the ongoing refinement of CIF without disrupting existing implementations.[](https://github.com/COMCIFS)

IUCr policies emphasize open access to CIF dictionaries and resources, promoting widespread adoption and collaboration in the crystallographic community.[](https://www.iucr.org/resources/cif/comcifs/policy) Since the 1990s, IUCr journals, including *Acta Crystallographica*, have required CIF submission for structural reports, establishing it as a mandatory format for publication and archival purposes.[](https://www.iucr.org/resources/cif) These policies protect the CIF standard by encouraging conformance checks and prohibiting proprietary alterations that could fragment its use.[](https://www.iucr.org/resources/cif/comcifs/policy)

In terms of collaboration, COMCIFS works with organizations like the [Protein Data Bank](/page/Protein_Data_Bank) (PDB) and the Cambridge Crystallographic Data Centre (CCDC) to harmonize CIF variants across domains, facilitating seamless data exchange for small-molecule and macromolecular structures. This includes joint efforts on deposition protocols and validation tools to ensure [interoperability](/page/Interoperability).[](https://journals.iucr.org/paper?S0108767305094626)

The IUCr provides centralized resources for CIF governance through its [website](/page/Website) (www.iucr.org/resources/cif), serving as the primary hub for dictionaries, documentation, and software tools.[](https://www.iucr.org/resources/cif) Historically, the IUCr has sponsored working parties on CIF since 1987, leading to its formal adoption in 1990 and continued evolution as a foundational standard in [crystallography](/page/Crystallography).[](https://www.iucr.org/resources/cif)

### Adoption and Interoperability

The [Crystallographic Information File](/page/Crystallographic_Information_File) (CIF) has achieved widespread adoption as the de facto standard for data exchange and archiving in [crystallography](/page/Crystallography), with essentially every major journal requiring its use for structure depositions.[](https://www.sciencedirect.com/science/article/abs/pii/S0039602810000725) This format underpins key public databases, including the [Cambridge Structural Database](/page/CSD) (CSD), which stores over 1.36 million small-molecule structures in CIF format as of January 2025, and the [Protein Data Bank](/page/Approved_backlinks) (PDB), which utilizes the macromolecular extension mmCIF for more than 230,000 entries of protein and [nucleic acid](/page/Nucleic_acid) structures.[](https://www.ccdc.cam.ac.uk/media/CSD-Entries-Summary-Statistics-2025.pdf)[](https://pubs.aip.org/aca/sdy/article/12/2/021101/3344540/A-new-chapter-for-RCSB-Protein-Data-Bank-Molecule) Additionally, the Crystallography Open Database (COD) hosts approximately 530,000 open-access entries in CIF format, contributing to a collective repository of millions of CIF files available globally for research and validation purposes.[](https://www.crystallography.net/)

CIF's interoperability with other data formats enhances its utility across scientific workflows, enabling seamless integration with diverse tools and databases. Converters such as cif2pdb facilitate transformation of mmCIF files into the legacy PDB format for compatibility with visualization software like PyMOL or VMD.[](https://www.iucr.org/resources/cif/software/cif2pdb) Similarly, specifications like [CIF-JSON](/page/CIF-JSON) support conversion to [JSON](/page/JSON) for web-based applications and machine-readable processing, while general-purpose tools like Open Babel allow export to Chemical Markup Language (CML) for broader chemical informatics exchanges.[](https://github.com/COMCIFS/CIF_JSON)[](https://www.cheminfo.org/Chemistry/Cheminformatics/FormatConverter/index.html)

Despite its dominance, challenges persist in full [interoperability](/page/Interoperability) due to [legacy](/page/Legacy) software primarily supporting [CIF](/page/Cif) version 1.1, which limits access to advanced features introduced in CIF 2.0, such as improved [dictionary](/page/Dictionary) handling via DDLm.[](https://pmc.ncbi.nlm.nih.gov/articles/PMC8056762/) This version disparity can complicate validation and processing in older crystallographic programs, though modern libraries like those in the Crystallographic [Information](/page/Information) [Framework](/page/Framework) address both versions to bridge the gap.[](https://www.iucr.org/resources/cif/cif2)

The International Union of Crystallography (IUCr) fosters adoption through community efforts, including workshops at its triennial congresses, online tutorials, and educational pamphlets that guide users on [CIF](/page/Cif) best practices and validation using tools like checkCIF.[](https://www.iucr2017.org/program/onsite-events/iucr-workshops/)[](https://www.iucr.org/education/resources) These initiatives, often in collaboration with database curators, ensure ongoing training and standardization enforcement.[](https://www.iucr.org/publications/teaching-pamphlets)

### Ongoing Developments and Challenges

Recent developments in the [Crystallographic Information File](/page/Crystallographic_Information_File) (CIF) framework have focused on expanding dictionaries to accommodate emerging techniques such as serial [crystallography](/page/Crystallography) and AI-driven structure prediction. In 2023, the IUCr published a standard descriptor dictionary specifically for fixed-target serial [crystallography](/page/Crystallography), enabling [beamline](/page/Beamline) software to handle data collection parameters and [metadata](/page/Metadata) for microcrystal experiments at synchrotrons and [X-ray](/page/X-ray) free-electron lasers. This enhancement supports the integration of time-resolved and room-temperature structural [data](/page/Data), which are increasingly vital for dynamic studies in [structural biology](/page/Structural_biology).[](https://journals.iucr.org/d/issues/2023/08/00/gm5097/) Similarly, the ModelCIF extension to the PDBx/mmCIF dictionary, introduced in 2023 and updated in 2025, provides a standardized representation for computed structure models derived from AI methods like [AlphaFold](/page/AlphaFold), including [metadata](/page/Metadata) on prediction confidence scores and ensemble variations to facilitate machine-readable archiving and validation.[](https://pmc.ncbi.nlm.nih.gov/articles/PMC10293049/)[](https://papers.ssrn.com/sol3/papers.cfm?abstract_id=5688643)

Despite these advances, CIF faces ongoing challenges, particularly in scalability for handling [big data](/page/Big_data) from techniques like [cryo-electron microscopy](/page/Microscopy) (cryo-EM), where mmCIF is commonly used. The sheer volume of raw and processed data in cryo-EM workflows often exceeds traditional file-based storage limits, leading to [interoperability](/page/Interoperability) issues between [legacy](/page/Legacy) formats and modern databases, as well as difficulties in efficient querying and reuse of large-scale structural datasets.[](https://pmc.ncbi.nlm.nih.gov/articles/PMC10492738/) [Backward compatibility](/page/Backward_compatibility) with CIF 1.x remains a hurdle, as the CIF 2.0 syntax introduces features like [Unicode](/page/Unicode) support and complex data structures that are not fully parsable by older software, necessitating dual-format support or migration tools to avoid data silos in [legacy](/page/Legacy) systems.[](https://pmc.ncbi.nlm.nih.gov/articles/PMC4762566/)

Future directions emphasize enhancing CIF's alignment with [FAIR data](/page/FAIR_data) principles to improve [findability](/page/Findability), [accessibility](/page/Accessibility), [interoperability](/page/Interoperability), and reusability in crystallographic research. The format's dictionary-driven structure inherently supports machine-actionable [metadata](/page/Metadata), such as persistent identifiers in repositories like the [Cambridge](/page/Cambridge) Structural Database, promoting FAIR compliance for deposited structures.[](https://knowledgebase.nfdi4chem.de/knowledge_base/docs/fair/) Efforts are underway to bolster web integration through initiatives like the COMCIFS-maintained CIF_JSON schema, which maps CIF data to [JSON](/page/JSON) for easier API-based access and integration with web services, addressing the need for [dynamic data exchange](/page/Dynamic_Data_Exchange) in collaborative platforms.[](https://github.com/COMCIFS/CIF_JSON) The community drives these evolutions via COMCIFS, with calls for contributions encouraged through annual dictionary writing workshops—such as the 2023 event focused on creating new definitions and extending existing ones—and open discussion lists for developers.[](https://www.iucr.org/__data/assets/pdf_file/0015/157011/handout.pdf)[](https://www.iucr.org/resources/lists/cif-developers)

References

  1. [1]
    (IUCr) CIF 1.1 syntax specification
    The Crystallographic Information File (CIF) is an archiving and information interchange standard for crystallographic and related structural science data.
  2. [2]
    CIF (Crystallographic Information File): A Standard for ...
    The Crystallographic Information File (CIF) uses the self-defining STAR file structure. This requires the creation of a dictionary of data names and definitions ...
  3. [3]
    A Guide to CIF for Authors - International Union of Crystallography
    The Crystallographic Information File (CIF) is the standard for crystallographic data exchange prescribed by the International. Union of Crystallography. It ...
  4. [4]
    (IUCr) Crystallographic Information Framework
    CIF was developed by the IUCr Working Party on Crystallographic Information in an effort sponsored by the IUCr Commission on Crystallographic Data and the IUCr ...Software for CIF and STAR · CIF dictionaries · Cif2 · Comcifs
  5. [5]
    (IUCr) CIF version 2
    CIF2 is a new, alternative file format for use with the Crystallographic Information Framework, supplementing the original CIF format ('CIF1') developed in the ...
  6. [6]
    (IUCr) 30 years of CIF
    Oct 25, 2021 · Syd, Frank Allen of the CCDC and David Brown together developed CIF as a standard format in 1991, an outcome of the Working Party on ...
  7. [7]
    The Implementation and Evolution of STAR/CIF Ontologies
    Jan 12, 2016 · Timeline for CIF implementation by IUCr journals. 1987, IUCr Working Party on Crystallographic Information (WPCI). 1988, WPCI recommends STAR ...
  8. [8]
    (IUCr) CIF dictionaries
    CIF dictionaries provide a formal taxonomy of crystallographic terms and ideas. Dictionary entries are constructed in a structured machine-readable manner.
  9. [9]
    (IUCr) CIF dictionary cif_core.dic revision history
    ### Timeline of Major Updates to CIF Core Dictionary (Post-1991)
  10. [10]
    The crystallographic information file (CIF) - IUCr Journals
    Jun 28, 1991 · The specification of a new standard Crystallographic. Information File (CIF) is described. Its development is based on the Self-Defining Text ...
  11. [11]
    (IUCr) CODATA Prize 2014 awarded to Sydney Hall
    Nov 10, 2014 · The CODATA Prize is a major biennial award which acknowledges outstanding achievement in the world of scientific and technical data. The work of ...
  12. [12]
    [PDF] Frank Allen IUCr_aug08 - ePrints Soton
    Aug 23, 2008 · allen@ccdc.cam.ac.uk ... 2000. Data encoded from journal 'supplementary deposition documents' ! ▫ 1994 onwards. Electronic data deposition via CIF.Missing: adoption | Show results with:adoption
  13. [13]
    Petition for Open Data in Crystallography
    May 19, 2005 · (www.ccdc.cam.ac.uk). * The CCDC CIF archive currently contains more than 170,000 CIFs dating back to 1994, relating to organic and metal- ...Missing: adoption | Show results with:adoption
  14. [14]
    Committee for the Maintenance of the CIF Standard (COMCIFS)
    COMCIFS was established at the 16th IUCr Congress in Beijing in 1993. It supervises the orderly development of software systems to use CIF.Missing: formation | Show results with:formation
  15. [15]
    mmCIF Early History
    CIF was developed by the IUCr Working Party on Crystallographic Information in an effort sponsored by the IUCr Commission on Crystallographic Data and the IUCr ...Missing: 1987 | Show results with:1987
  16. [16]
    Protein Data Bank (PDB): The Single Global Macromolecular ...
    In the 1990s, the International Union of Crystallography (IUCr) charged a committee with creating a more informative and extensible data model for the PDB ...
  17. [17]
    Announcing mandatory submission of PDBx/mmCIF format files for ...
    This letter announces that PDBx/mmCIF format files will become mandatory for crystallographic depositions to the Protein Data Bank (PDB).
  18. [18]
    (IUCr) File syntax
    1. This document describes the full syntax of the Crystallographic Information File (CIF). Definition of terms. 2. The following terms are used in the ...Definition of terms · General features · Character set · Implementation restrictionsMissing: core | Show results with:core
  19. [19]
    CIF - Changes to the specification, 29 September 2010 DRAFT
    Aug 10, 2011 · This document specifies changes to the syntax and binary form of CIF. We refer to the current syntax specification of CIF as CIF1, ...
  20. [20]
    The CIF dictionaries: how they work
    Aug 31, 2019 · Data names that can be tabulated together belong to the same category. A category name is like a name for a table (“loop” in CIF-speak) loop_.
  21. [21]
    (IUCr) Dictionary definition languages
    The CIF data dictionaries are machine-readable compilations of tags identifying different data items and the attributes of the associated data.Missing: methods | Show results with:methods
  22. [22]
    (IUCr) Core CIF dictionary
    The rules given below apply within each moiety but different requirements apply to the way that moieties are connected (see _chemical_formula_moiety). (1) Only ...
  23. [23]
    (IUCr) Core CIF dictionary
    The core dictionary is a set of data names designed to cover the requirements of archiving and exchanging raw and processed data and derived structural results.Missing: rules | Show results with:rules
  24. [24]
    (IUCr) Core dictionary (coreCIF) definitions
    An essential guide and reference to CIF for programmers, data managers handling crystal-structure information and practising crystallographers. IUCrData.Missing: molecules | Show results with:molecules
  25. [25]
    Short Guide to CIFs - CCDC
    A CIF or Crystallographic Information File is the standard format for storing crystallographic structural data. CIF information has a specific structure or ...
  26. [26]
    Crystallography Open Database
    Open-access collection of crystal structures of organic, inorganic, metal-organic compounds and minerals, excluding biopolymers.Browse the COD · What's New? · How to query the COD database · Search
  27. [27]
    [30] Macromolecular crystallographic information file - ScienceDirect
    mmCIF is an extension of the Crystallographic Information File (CIF) for describing macromolecular structures, developed by the International Union of ...
  28. [28]
    Data Category entity_poly_seq - PDBx/mmCIF
    Data items in the ENTITY_POLY_SEQ category specify the sequence of monomers in a polymer. Allowance is made for the possibility of microheterogeneity in a ...
  29. [29]
    Data Category struct_conn - PDBx/mmCIF
    Data items in the STRUCT_CONN category record details about the connections between portions of the structure. These can be hydrogen bonds, salt bridges, ...
  30. [30]
    MMCIF USER GUIDE
    Jun 7, 2024 · Each polymer entity is expected to have a corresponding record in the _entity_poly, _entity_poly_seq, and _pdbx_poly_seq_scheme categories.
  31. [31]
    Electron Density Maps and Coefficient Files - RCSB PDB
    Electron density maps show how well a model fits data, created from coordinate and structure factor files. Common types are 2Fo-Fc and Fo-Fc maps. Coefficient ...
  32. [32]
    (IUCr) Powder CIF dictionary
    The powder CIF dictionary (pdCIF) is a supplement to the Core dictionary addressing the needs of powder diffractionists.
  33. [33]
    (IUCr) Modulated structures CIF dictionary
    The modulated structures CIF dictionary (msCIF) is a supplement to the Core dictionary designed to permit the description of incommensurately modulated crystal ...
  34. [34]
    (IUCr) Image dictionary (imgCIF) definitions
    An essential guide and reference to CIF for programmers, data managers handling crystal-structure information and practising crystallographers. IUCrData.
  35. [35]
    [PDF] NeXus: A DataFormat for x-ray, n- and muon Scattering
    • NeXus used NXDL, CIF uses many versions of DDL. • Herbert Bernstein has demonstrated that it is possible to map. CIF into NeXus. There are issues but no ...
  36. [36]
    Crystal structure refinement with SHELXL - PMC - NIH
    For small-molecule refinements with SHELXL, the input .hkl file should contain the unmerged data. This enables the program to produce a more complete output CIF ...
  37. [37]
    [PDF] The SHELX-97 Manual
    For the deposition of reflection data, the CIF format .fcf file written by SHELXL may be used directly. 9.3 Creation of map (and pdb) files for various graphics ...<|control11|><|separator|>
  38. [38]
    [PDF] SHELXL-97 Quick Reference Manual
    A 'Crystallographic Information File' file name.cif is created in self-defining STAR format. This. ASCII file is suitable for data archiving, ...
  39. [39]
    CCDC Deposition Criteria for CSD and ICSD Inclusion - CCDC Home
    May 22, 2025 · Files should be in CIF, FCF or HKL format and may be included in a ZIP file · At least one CIF file must be included in the submission · For more ...
  40. [40]
    Deposit Structures | ICSD DEPOT - FIZ Karlsruhe
    Each submission must include at least one CIF file; Additional files may be in HKL, RES, FCF, Word format and may be included in a ZIP file; Each submission ...
  41. [41]
    CIF Deposition Guidelines - CCDC
    This page is designed to help you deposit your data, check your CIFs and provide you with more information as you go through our deposition process.
  42. [42]
    [PDF] Crystallography and Databases | Data Science Journal
    Aug 7, 2017 · Structures are typically submitted to the CSD by researchers in CIF format via deposition services provided by the CCDC. These deposition ...
  43. [43]
    checkCIF validation ALERTS: what they mean and how to respond
    The introduction of the CIF standard also opened the way for the automated checking of the archived data for their internal consistency and completeness, which ...
  44. [44]
    (IUCr) Validation of the Crystallography Open Database using the ...
    As a result, formal validation using CIF should be viewed as an essential step of crystallographic data quality assurance.
  45. [45]
    MS15: Database developments, validation and data mining
    Dec 4, 2018 · Its Crystallographic Information Framework (CIF) provides precise data definitions and led to automated criteria (checkCIF) for testing the ...
  46. [46]
    PLATON/VALIDATION
    Structural data provided in CIF-format may be validated in terms of consistency, missing relevant data, proper procedure and overlooked problems. In ...<|separator|>
  47. [47]
    Combined crystal structure prediction and high-pressure ... - NIH
    Jul 22, 2015 · This study illustrates how CSP and high-pressure crystallization can provide elements for rational pharmaceutical polymorph screening and in ...
  48. [48]
    A robust crystal structure prediction method to support small ... - Nature
    Mar 5, 2025 · Several studies have demonstrated the ability of computational methods to accurately predict the crystal structures of small molecules, ...
  49. [49]
    crystal structure prediction in the context of pharmaceutical ...
    Aug 6, 2025 · Molecular simulation is increasingly used by medicinal chemists in the process and product development. Reliable computational predictions ...
  50. [50]
    [PDF] Cambridge Structural Database 1 January 2025 CSD Entries - CCDC
    Jan 1, 2025 · Total No. of structures. 1,362,809. 100.0. No. of different compounds. 1,217,887. -. No. of literature sources. 2,557. -. Organic structures.
  51. [51]
    wwPDB Deposition
    All four-character PDB IDs will be exhausted by 2028. Entries with 12-character PDB IDs (e.g., pdb_00002hyv) will only be available in PDBx/mmCIF format. More ...
  52. [52]
    Structures Without Legacy PDB Format Files - RCSB PDB
    Jul 22, 2025 · Users must transition to PDBx/mmCIF before 2028, when 4-character IDs are expected to be fully assigned.
  53. [53]
    Data Item _pdbx_database_status.status_code - PDBx/mmCIF
    PDBx/mmCIF Data Dictionary Data Item _pdbx_database_status ... Required in PDB entries: yes; Used in current PDB entries: Yes, in about 100.0 ...Missing: annotations | Show results with:annotations
  54. [54]
    [PDF] wwPDB Policies and Processing Procedures Document - RCSB PDB
    wwPDB processes PDB data using mmCIF format, covering entry title, macromolecules, polymer sequences, and more, following the PDB Exchange Dictionary.
  55. [55]
    FAQs - AlphaFold Protein Structure Database
    How can I download a structure prediction? Coordinate files can be downloaded from the menu in the top right of the structure page in mmCIF or PDB format.
  56. [56]
    providing structure coverage for over 214 million protein sequences
    Nov 2, 2023 · The data generation process for the AlphaFold DB is carried out by Google DeepMind, with all predictions stored in PDB, mmCIF and binaryCIF ...
  57. [57]
    Comprehensive encoding of conformational and compositional ...
    The proposed mmCIF enhancements aim to capture complex conformational and compositional heterogeneity of biomolecules, which current models do not robustly ...<|control11|><|separator|>
  58. [58]
  59. [59]
    PDB Statistics: Overall Growth of Released Structures Per Year
    Year, Total Number of Entries Available, Number of Structures Released Annually. 2025, 245,074, 15,412. 2024, 229,662, 15,471. 2023, 214,191, 14,500.
  60. [60]
    Future Planning: PDB entries with extended CCD or PDB IDs will be ...
    Extended PDB IDs and corresponding PDB DOIs have been included in the PDBx/mmCIF formatted atomic coordinate files for all new and re-released entries since ...
  61. [61]
    Software for CIF and STAR - International Union of Crystallography
    The software listed on this page includes copies of, or links to, many of the programs and libraries available for use with CIF and STAR Files.Missing: enhancements extensibility
  62. [62]
    file | OlexSys
    Saves current model to a file. By default an ins file is saved and loaded. It is also possible to save the current file in cif,ins,p4p,mol,mol2,pdb,xyz formats.
  63. [63]
    Finalising a Structure | OlexSys
    Direct access to the IUCr CheckCif is available through Olex2 under Work|Report|CheckCif-Report with the option to include the .fcf (Send FCF) in the submission ...Before Writing a Final CIF · Editing Report / CIF parameters · Validating the CIF
  64. [64]
    THE PLATON HOMEPAGE
    PLATON is designed as a Multipurpose Crystallographic Tool program. It includes multiple tools to be used as part of a single crystal structure determination.Missing: export | Show results with:export
  65. [65]
    jamesrhester/pycifrw - GitHub
    PyCIFRW provides support for reading and writing CIF (Crystallographic Information Format) files using Python. It was developed at the Australian National ...Introduction · Conformance · ExampleMissing: Olex2 JavaCIF cctbx Mercury CCDC LAMMPS<|control11|><|separator|>
  66. [66]
    PDBx/mmCIF Software Resources - wwPDB
    PyCifRW Python CIF read and write methods (Updated 2025); PDBeCIF PDBe Python package to work with mmCIF formatted files including tutorial and examples.Missing: Olex2 JavaCIF cctbx Mercury CCDC LAMMPS
  67. [67]
    (IUCr) CIFLIB
    CIFLIB is a class library that was developed by the Nucleic Acid Database Project (NDB) to provide an application program interface (API) to macromolecular CIF ...
  68. [68]
    rcsb/ciftools-java: A Java library for handling text and BinaryCIF files.
    CIFTools implements reading and writing of CIF files (specification) as well as their efficiently encoded counterpart, called BinaryCIF.Missing: JavaCIF | Show results with:JavaCIF
  69. [69]
    iotbx.cif — CCTBX Developer documentation
    The iotbx.cif.reader class reads the CIF string and builds a Python representation of the CIF. We then call the build_crystal_structures() method which ...
  70. [70]
    Mercury - CCDC
    See chemistry in 3D and generate high-definition, customized images, animations, and 3D models of molecules using the crystal structure visualization and ...
  71. [71]
    Free Crystal Structure Visualization Software - CCDC
    What file formats does Mercury support? Mercury works with most molecular structure and crystal structure files including .mol, .mol2, .cif, .pdb and many more.
  72. [72]
    Pre/Post Processing Tools for use with LAMMPS
    CIF2Cell is a tool to generate the geometrical setup for various electronic structure codes from a CIF (Crystallographic Information Framework) file. The ...
  73. [73]
    andeplane/cif2cell-lammps - GitHub
    Dec 27, 2020 · A tool to generate the geometrical setup for various electronic structure codes from a CIF (Crystallographic Information Framework) file.
  74. [74]
    CheckCIF
    checkCIF reports on the consistency and integrity of crystal structure determinations reported in CIF format. Please upload your CIF using the form below.
  75. [75]
    checkCIF/PLATON (full publication check) - IUCr Journals
    checkCIF/PLATON checks CIF syntax, cell and geometry, space-group symmetry, anisotropic displacement parameters, publication items, and structure factors.
  76. [76]
    checkCIF validation ALERTS: what they mean and how to respond
    Jan 1, 2020 · The IUCr CIF/FCF validation facility is available through https://checkcif.iucr.org along with related tools. A Microsoft Windows executable for ...
  77. [77]
    checkCIF data validation tests - IUCr Journals
    PURPOSE: To check that the total sum formulae derived from the _chemical_formula_sum _chemical_formula_moiety, _atom_site_ and _atom_type_ data all match.<|control11|><|separator|>
  78. [78]
    EnCIFer - CCDC
    EnCIFer enables users to validate CIFs and ensure their files are format-compliant for deposition with journals and databases or for storage in laboratory ...
  79. [79]
    Avogadro - Free cross-platform molecular editor - Avogadro
    Avogadro is a free, cross-platform molecule editor and visualizer for Windows, Linux, and Mac OS X, designed for computational chemistry and related areas.Manual · Avogadro Discussion · Avogadro 1.95 Released · Teach
  80. [80]
    Building a Crystal Surface (Slab) - Avogadro
    May 23, 2022 · Import a crystal structure, then use Crystallography > Build > Slab, specify Miller plane indices and dimensions, and click 'Build' to generate ...
  81. [81]
    (IUCr) Documentation
    Specification of the Crystallographic Information File format, version 2.0. J. Appl. Cryst. 49, 277-284; details J. C. Bollinger (2016). A portable general ...
  82. [82]
    Jmol: an open-source Java viewer for chemical structures in 3D
    Jmol is a free, open source viewer of molecular structures useful for students, educators and researchers in chemistry, biochemistry and other fields.Download Jmol · Jmol Files · Jmol download · Jmol Documentation
  83. [83]
    Jmol/JSmol Interactive Script Documentation
    CIF, International Union of Crystallography Crystallographic Information File, versions CIF 1.1 and CIF 2.0, including Macromolecular Crystallographic ...
  84. [84]
    VESTA - JP-Minerals
    VESTA is a 3D visualization program for structural models, volumetric data such as electron/nuclear densities, and crystal morphologies.Download · Crystallographic Software · Documentation · Features
  85. [85]
    VESTA: A Three-Dimensional Visualization System for Electronic ...
    Aug 7, 2025 · A cross-platform program, VESTA, has been developed to visualize both structural and volumetric data in multiple windows with tabs.
  86. [86]
    ORTEP-3 for Windows
    Currently supported formats are SHELX, GX, CIF, SPF ... Several style-templates are supplied, but the graphical representations of thermal ellipsoids ...
  87. [87]
  88. [88]
    Mol* Viewer
    Download structure, source, PDB, PDB Id(s), options, apply, download density, download file, open files, load trajectory, download, load genome 3D (G3D), ...
  89. [89]
    Mol* Viewer: modern web app for 3D visualization and analysis of ...
    The web-native Mol* Viewer enables 3D visualization and streaming of macromolecular coordinate and experimental data, together with capabilities for displaying ...
  90. [90]
    Calculation Details - Materials Project Documentation
    Aug 18, 2025 · We use DFT as implemented in the Vienna Ab Initio Simulation Package (VASP) software [1] to evaluate the total energy of compounds.
  91. [91]
    Report of the Executive Committee for 2022 - IUCr Journals
    ... Committee for the Maintenance of the CIF Standard (COMCIFS), which maintains the integrity of the Crystallographic Information Framework (CIF) on behalf of the ...Missing: responsibilities | Show results with:responsibilities
  92. [92]
    Re: Proposed COMCIFS terms of reference
    Oct 3, 2005 · 1. To establish policies defining the use of the Crystallographic Information Framework (CIF). · 2. To receive and examine all proposed ...
  93. [93]
    IUCr Committee for the Maintenance of the CIF standard - GitHub
    IUCr Committee for the Maintenance of the CIF standard has 25 repositories available. Follow their code on GitHub.Missing: pdCIF _pd_proc_ls_profile
  94. [94]
    (IUCr) Statement of policy
    Sep 30, 2000 · The syntax of a CIF is a subset of the more general STAR File[2] format. The CIF and STAR File approaches are used increasingly in the ...Missing: rules | Show results with:rules
  95. [95]
    CIF operations and applications at the CCDC - IUCr Journals
    For such reports, the. IUCr is collaborating with the Protein Data Bank (PDB) to facilitate the deposition and publication procedures by extracting as much.
  96. [96]
    A standard format for reporting atomic positions in measured or ...
    Jun 15, 2010 · Essentially every journal requires deposition of data using the “Crystallographic Information Format” or CIF, established by the International ...
  97. [97]
    A new chapter for RCSB Protein Data Bank Molecule of the Month in ...
    Apr 18, 2025 · It currently provides access to >230 000 structures and supports a broad community of users numbering in the many millions, including the ...
  98. [98]
    (IUCr) cif2pdb
    cif2pdb (Bernstein & Bernstein, 1996) is a program for converting an mmCIF data set to a partial pseudo-PDB entry with HEADER, TITLE, COMPND, SOURCE, KEYWRD, ...
  99. [99]
    COMCIFS/CIF_JSON: A JSON schema for transferring CIF information.
    This specification contains some features to allow straightforward conversion to JSON from files in CIF syntax. However, round-tripping through CIF-JSON ...
  100. [100]
    OPENBABEL - Chemical file format converter - Cheminfo.org
    Use open babel to convert most of the chemical formats. How to proceed ? Enter an input value, for example a SMILES like "CCCC"; Select ...Missing: crystallography | Show results with:crystallography<|control11|><|separator|>
  101. [101]
    Validation of the Crystallography Open Database using the ...
    Feb 14, 2021 · In the field of crystallography, however, the Crystallographic Information Framework (CIF) (Hall & McMahon, 2006 ▸; Brown & McMahon, 2002 ▸) is ...
  102. [102]
    Workshops - IUCR - 2017
    This one-day, free tutorial workshop will provide an overview of the physics and chemistry of X-ray Absorption Spectroscopy with a particular emphasis on ...
  103. [103]
    (IUCr) External educational resources
    A web-portal to high quality open access crystallography resources (access to large collections of CIF, services of the Bilbao crystallographic server, ...
  104. [104]
    (IUCr) Teaching pamphlets
    Provide tutorials introducing different aspects of symmetry, diffraction techniques, structural science and crystallography.Missing: workshops | Show results with:workshops
  105. [105]
    ModelCIF: An extension of PDBx/mmCIF data representation ... - NIH
    The first approach is template-based structure prediction (also known as homology modeling or comparative modeling), in which the structure of an unknown ...
  106. [106]
  107. [107]
    Big data in contemporary electron microscopy: challenges and ...
    This includes the possible incompatibility between data formats or format versions and the lack of interoperability of data formats and software applications, ...
  108. [108]
    COD::CIF::Parser: an error-correcting CIF parser for the Perl language
    It turns out, however, that the CIF 2.0 extensions are backwards incompatible with the CIF 1. x series of formats, and therefore a second grammar and a second ...
  109. [109]
    FAIR Data Principles - NFDI4Chem Knowledge Base
    In chemistry, the deposition of crystallographic data in a standardized file format (CIF) into a repository such as the Cambridge Structural Database (CSD) ...
  110. [110]
    COMCIFS Dictionary Writing Workshop
    Aug 21, 2023 · Topics covered will include an introduction to CIF dictionaries, writing individual definitions for existing dictionaries, writing new ...
  111. [111]
    (IUCr) CIF Developers Discussion List
    cif-developers is an open discussion list aimed at providing a focus for developers of software using the Crystallographic Information File (CIF) exchange ...Missing: serial | Show results with:serial