Fact-checked by Grok 2 weeks ago

Skeletal formula

A skeletal formula, also known as a line-angle formula or bond-line structure, is a simplified graphical representation of the molecular structure of compounds, particularly those based on carbon chains or rings, in which carbon atoms are implied at the vertices and endpoints of lines, atoms bonded to carbon are omitted, and single bonds between carbons are depicted as straight lines. This notation assumes that each carbon atom forms exactly four bonds, with any unspecified valences filled by hydrogen atoms, making it an efficient way to illustrate the carbon skeleton without explicitly showing every atom in simple hydrocarbons like alkanes. In more complex molecules, heteroatoms such as oxygen, , or are explicitly labeled at the appropriate positions, while multiple bonds are represented by parallel double or triple lines, and functional groups are drawn according to standard conventions. Hydrogen atoms attached to heteroatoms, such as in hydroxyl or amino groups, are typically shown explicitly to clarify their presence. Skeletal formulas are widely adopted in organic chemistry textbooks, research papers, and educational materials because they balance brevity and clarity, allowing chemists to quickly visualize molecular connectivity, (via wedges or dashes for 3D arrangements), and reaction sites without the clutter of full structures. Originating as a in the late alongside the growth of structural , this representation has become standardized for depicting everything from simple alkanes like (a zigzag line of three implied carbons) to intricate biomolecules and polymers.

Introduction

Definition and Purpose

A skeletal formula, also known as a line-angle or bond-line formula, is a type of used in to represent the connectivity of atoms in a . In this notation, covalent bonds are depicted as straight lines, with carbon atoms implied at the endpoints and intersections of these lines, while hydrogen atoms are omitted except when necessary to satisfy requirements or when attached to heteroatoms. This shorthand emphasizes the carbon skeleton and functional groups without explicitly labeling every carbon or hydrogen, allowing for a streamlined visualization of molecular architecture. The primary purpose of skeletal formulas is to facilitate the rapid sketching and communication of complex molecules, particularly in , and publication contexts where full or condensed structures would be cumbersome. By focusing on bond arrangements and key substituents, this representation highlights molecular topology, positions, and stereochemical features when indicated, enabling chemists to prioritize reactivity patterns over atomic minutiae. Skeletal formulas are ubiquitous in textbooks and due to their efficiency in conveying structural information without clutter. Advantages of skeletal formulas include significant time savings in drawing and interpreting structures, as well as enhanced clarity for visualizing the overall framework of large biomolecules like steroids or polymers. For instance, (C₃H₈) is represented simply as a zigzag line with two segments, implying three carbon atoms connected by single bonds and eight hydrogens filling the valences. However, limitations exist: this notation is primarily suited for carbon-based compounds and may not adequately convey precise without additional conventions like wedges or dashes; it can also be ambiguous for novices unfamiliar with the implied atoms, potentially requiring supplementary details for full comprehension./01%3A_Bonding_in_organic_compounds/1.04%3A_Representing_organic_compounds)

Historical Development

The skeletal formula, a representation of molecular structures in , emerged during the mid-19th century amid the foundational developments in structural theory. August Kekulé's proposal of carbon's tetravalency and chain-forming ability in 1858 laid the groundwork for diagrammatic notations, while his 1865 depiction of as a cyclic structure employed lines to indicate bonds between carbon atoms, marking an early shift toward simplified visual representations./01%3A_Introduction_to_Organic_Chemistry/1.02%3A_A_Bit_of_History) Independently, Scott Couper introduced dashed lines to symbolize carbon-carbon bonds in organic molecules that same year, emphasizing connectivity over explicit atomic positions. A pivotal advancement came from Alexander Crum Brown, who in 1864–1867 refined these ideas by using circles for atoms connected by solid lines, effectively creating the precursor to modern line-angle or skeletal formulas that omit explicit carbon labels at vertices./01%3A_Introduction_to_Organic_Chemistry/1.02%3A_A_Bit_of_History) This diagrammatic approach, influenced by Crum Brown's mathematical background, allowed chemists to convey complex structures more intuitively than verbose textual descriptions or fully expanded formulas, facilitating the rapid growth of and theory in the late . By the early , these minimalist conventions had evolved further, with lines representing bonds and implied carbons at intersections becoming standard in chemical literature, as seen in the works of on sugar projections in 1891. Throughout the mid-20th century, skeletal formulas gained widespread adoption in textbooks and research publications, prized for their efficiency in depicting polycyclic and functionalized molecules without sacrificing essential structural information./01%3A_Introduction_to_Organic_Chemistry/1.02%3A_A_Bit_of_History) This period coincided with the expansion of , where simplified diagrams supported analyses of reaction mechanisms and . Formal standardization arrived with the IUPAC Recommendations on Graphical Representation Standards for Diagrams in 2008, which codified conventions for line-bond depictions to ensure consistency across printed and digital media. In the late 20th and early 21st centuries, the rise of introduced alternatives like the Simplified Molecular Input Line Entry System (SMILES), developed by David Weininger in 1988 as a linear string notation for machine-readable structures. Despite such innovations enabling algorithmic processing and database storage, skeletal formulas persisted as the predominant visual tool due to their intuitive alignment with human spatial reasoning, further enhanced by digital rendering software in the that automated their generation from models.

Core Conventions

Carbon Skeleton and Basic Structure

The skeletal formula represents the of organic molecules by depicting carbon-carbon bonds as straight lines, with carbon atoms implied at the endpoints, intersections, and vertices of these lines, assuming bonds unless otherwise indicated. This simplifies the visualization of molecular frameworks by omitting explicit carbon symbols and focusing on , where each line segment corresponds to a between adjacent carbons. Linear carbon chains are typically drawn using a pattern to approximate the three-dimensional tetrahedral around each carbon , with near 109.5 degrees. This arrangement aids in conveying the spatial of the while maintaining clarity in two dimensions; for instance, a straight chain of n carbons requires n-1 line segments. Branching is shown by lines diverging from points along the main chain, indicating substituent carbon at those junctions. Cyclic structures are depicted as closed polygons, where the number of sides corresponds to the and carbons are implied at each . For example, a five-sided represents , with five implied carbons connected by single bonds. Implicit atoms are assumed to fill the remaining valences of each carbon to achieve tetravalency. A representative example is n-hexane (C₆H₁₄), portrayed as a chain of five line segments, implying six carbon atoms at the ends and four intermediate vertices.

Terminology

The , also referred to as the bond-line formula or line-angle formula, represents the carbon framework of an using lines to depict bonds between atoms, with carbon atoms implied at the endpoints and intersections of these lines. The term "" specifically denotes this underlying carbon framework, emphasizing the chain of carbon-carbon bonds that forms the molecular backbone, while omitting explicit notation for most hydrogen atoms. In skeletal formulas, valence rules dictate that each implicit carbon atom possesses exactly four bonds, with any unspecified valences filled by atoms to satisfy this tetravalency. For instance, a carbon atom at the end of a line is understood to have three implicit s, while an intersection point with three lines implies one . Certain conventions extend skeletal formulas to indicate stereochemistry, such as the wedge symbol, which represents a bond projecting out of the plane toward the viewer, and the dashed line, which denotes a bond receding behind the plane. Common abbreviations simplify complex substituents in skeletal formulas; for example, "Ph" denotes the (C₆H₅-), a ring attached via one carbon, and "tBu" represents the tert-butyl group ((CH₃)₃C-), a branched alkyl moiety. A frequent misconception is interpreting skeletal formulas as three-dimensional models rather than two-dimensional projections that abstractly convey connectivity and basic spatial relationships, potentially leading to errors in visualizing .

Graphical Standards

Skeletal formulas adhere to established graphical conventions that prioritize clarity, consistency, and ease of interpretation in depicting the carbon of molecules. These standards emphasize the use of straight lines to represent single bonds between carbon atoms, with intersections and endpoints implying carbon atoms unless otherwise specified. Bond angles in representations approximate a zig-zag pattern, typically at about 120° for adjacent single bonds to simulate tetrahedral while maintaining readability, though actual tetrahedral angles of 109.5° are idealized for purposes. Uniform bond lengths are maintained throughout a structure to ensure visual consistency, with no explicit depiction of atom sizes or varying scales for different elements in the skeleton. Bonds should be drawn long enough to be clearly visible, generally exceeding twice the height of atom labels, and resized proportionally when the entire . orientation is preferred for linear chains to facilitate left-to-right reading, while vertical arrangements may be used for emphasis in branched or cyclic structures; substituents on double bonds are conventionally placed upwards. The International Union of Pure and Applied Chemistry (IUPAC) provides comprehensive guidelines in its recommendations on graphical representation, which build on the 1993 and were incorporated into the 2013 edition, stressing clarity over rigid adherence to exact geometries. These rules advocate avoiding overlapping lines, atoms, or labels, recommending adjustments such as bond lengthening or slight angle modifications to resolve ambiguities. Multiple bonds are represented by adjacent to s, with separation less than one-third of the single bond length. In digital rendering, software like , introduced in the 1980s, standardizes these conventions by automating zig-zag chain drawing, uniform , and overlap prevention, enabling precise printed outputs that align with IUPAC guidelines. Hand-drawn skeletal formulas, while more flexible, often exhibit greater variation in line thickness and angles but still follow core principles to minimize overlaps and ensure interpretability, particularly in educational or preliminary sketching contexts.

Implicit and Explicit Atoms

Implicit Carbon and Hydrogen Atoms

In skeletal formulas, carbon atoms are implied at every , , and change of the lines representing carbon-carbon bonds, simplifying the depiction of molecules while assuming standard connectivity. This convention positions carbons at vertices or line termini without labels, where terminal positions represent methyl groups (\ce{CH3-}), linear chain segments indicate methylene groups (\ce{-CH2-}), vertices with three connections denote methine groups (\ce{>CH-}), and vertices with four connections represent quaternary carbons (\ce{>C<}). Each line segment thus signifies a between two carbons, streamlining the representation of the molecular skeleton. Hydrogen atoms bonded to carbon are entirely implicit and omitted from the drawing to emphasize the carbon framework. Their presence and count are inferred to fulfill carbon's tetravalency, ensuring each carbon achieves four bonds through connections to other carbons and sufficient s. For unbranched alkanes, this implicit addition yields the general \ce{C_nH_{2n+2}}, as the linear chain structure dictates the hydrogen distribution without explicit notation. Standard rules prohibit showing s on carbon unless attached to heteroatoms or needed for stereochemical clarity; the total hydrogen count derives exclusively from satisfaction. A straightforward example is (\ce{C2H6}), depicted as a single horizontal line connecting two implied carbons, each with three implicit hydrogens to complete their bonds, equivalent to \ce{H3C-CH3}. For branched hydrocarbons like (\ce{C4H10}), the skeletal formula shows a central implied carbon at an intersection with three short lines extending to terminal carbons, representing a methine carbon bonded to three methyl groups, or \ce{(CH3)3CH}, where all ten hydrogens are inferred from tetravalency. Despite their efficiency, skeletal formulas for purely all-carbon structures carry limitations, as the line-based alone may not distinguish between constitutional isomers without supplementary details like molecular or context. This ambiguity arises because multiple arrangements of lines can imply the same bond count but different topologies, necessitating careful in . In molecules with heteroatoms, adjustments apply, but carbon atoms and their attendant hydrogens remain implicit.

Explicit Heteroatoms

In skeletal formulas, heteroatoms such as oxygen (O), nitrogen (N), sulfur (S), and are represented explicitly using their standard atomic symbols, positioned at the intersections or ends of lines that denote bonds within the carbon skeleton. This explicit depiction distinguishes them from the implied carbon atoms, ensuring clarity in the molecular structure./02%3A_Representing_organic_structures/2.01%3A_Drawing_organic_structures) Bonds to these heteroatoms are illustrated as straight lines connecting to the carbon framework, following the same conventions as carbon-carbon bonds, with , , or bonds indicated by one, two, or three parallel lines, respectively. Valence adjustments for heteroatoms align with their standard bonding capacities in compounds; oxygen typically forms two bonds, three, and two or more depending on the context, while form one bond. These representations maintain the overall tetravalency of carbon while accommodating the heteroatom's electron requirements. Explicit heteroatoms commonly appear in functional groups, such as the oxygen in alcohols or the in , where the symbol is placed directly at the site of attachment to the carbon chain. For instance, is depicted in skeletal form as a two-line chain representing the , with an O symbol at the end connected by a , corresponding to the condensed formula CH₃CH₂OH. Similarly, a simple like features an N symbol bonded to a single carbon line. Halogens are handled as explicit heteroatoms in special cases, shown with their symbols (e.g., for , for ) attached via a single line to the appropriate carbon position, as in (CH₃), where the skeletal representation is a single carbon implied at the end of the line with attached. This convention ensures that substituents like are immediately identifiable without implying additional carbon atoms.

Explicit Hydrogen Atoms

In skeletal formulas, hydrogen atoms are explicitly depicted when attached to heteroatoms such as oxygen, , or to clearly indicate their presence and satisfy the requirements of these atoms, which can vary and lead to ambiguity if omitted. This convention ensures that functional groups like alcohols (-), amines (-NH₂), and thiols (-SH) are unambiguously represented, as the number of s on heteroatoms directly influences chemical reactivity and properties. For instance, in the skeletal formula of , the structure is drawn as a single line terminating in -, where the hydrogen is explicitly shown attached to the oxygen atom. Hydrogen atoms in acidic positions, such as the hydroxyl group in carboxylic acids (-COOH), are also shown explicitly to highlight their in proton donation and distinguish them from esters or other derivatives. In these cases, the is placed adjacent to the oxygen symbol, often without a drawn bond line for brevity, but the attachment is implied by proximity. For , a simple molecule, the skeletal formula is fully explicit as H-O-H, with bonds indicated by lines to emphasize the . Explicit hydrogens on carbon atoms are generally omitted in skeletal formulas, as their positions are inferred from the carbon's and bonding pattern, but they may be included rarely for emphasis, such as in stereochemical representations where a hydrogen must be distinguished at a chiral using or dashed lines. In such depictions, the symbol is placed near the carbon with a directional to indicate . Exceptions to the default omission occur in organometallic compounds or scenarios where is unclear, such as certain coordination complexes, where explicit hydrogens on carbon (e.g., in metal hydrides) are shown to avoid misinterpretation of the . The symbol is typically written as "H" and positioned adjacent to the attached atom, with an optional short line representing the for clarity in complex diagrams.

Symbolic Representations

Pseudoelement Symbols Overview

Pseudoelement symbols serve as non-standard notations in skeletal formulas, representing complex substituents that would otherwise require detailed structural drawings to convey the same information. These symbols facilitate the depiction of intricate molecules by substituting elaborate group representations with compact labels, enhancing the readability of chemical diagrams without sacrificing accuracy. In practice, they are integrated into the carbon by connecting via a line, often distinguished through typographic conventions such as boldface or enclosure in parentheses to avoid confusion with standard element symbols. The primary purpose of pseudoelement symbols is to minimize visual clutter in skeletal formulas, particularly when dealing with repetitive or bulky groups such as alkyl chains or aromatic systems that appear frequently in structures. By condensing these elements into simple symbols, chemists can focus on the core molecular framework and key functional relationships, which is especially valuable in illustrating reaction mechanisms, synthetic pathways, and molecular interactions. This approach promotes efficiency in both hand-drawn sketches and digital renderings, allowing for quicker communication of structural details in research and education. Pseudoelement symbols are broadly categorized into alkyl, aromatic, and functional types, reflecting their roles in denoting hydrocarbon chains, ring systems, and reactive moieties, respectively; these categories enable systematic application in diverse molecular contexts, with further details provided in dedicated sections.

Alkyl and Substituent Groups

In skeletal formulas, alkyl and substituent groups derived from s are often represented using standardized abbreviations or simplified line drawings to denote branches attached to the main carbon . These conventions streamline the depiction of complex molecules by avoiding the need to draw every atom explicitly, focusing instead on and key features. Common alkyl groups, which are saturated hydrocarbon chains, are abbreviated with short symbols placed at the attachment point, such as "Me" for the (-CH₃), "Et" for the (-CH₂CH₃), "iPr" for the isopropyl group (-CH(CH₃)₂), and "tBu" for the tert-butyl group (-C(CH₃)₃). These symbols are positioned adjacent to the bond line representing the connection to the core structure, ensuring clarity in diagrams. Aromatic substituents, such as the (-C₆H₅), are typically abbreviated as "Ph" and visualized as a simple (often with an inscribed to indicate ) attached via a single line to the main chain. Similarly, the naphthyl group (-C₁₀H₇), derived from , is abbreviated as "Nap" and represented by its fused-ring skeletal outline connected at the appropriate position (e.g., 1- or 2-naphthyl). For unsaturated substituents, the (-CH=CH₂) is shown as a short line ending in a stub (CH₂=CH-), while the (-CH₂CH=CH₂) extends this with an additional methylene unit, depicted as -CH₂-CH=CH₂ in line notation. These groups are attached directly to the primary carbon skeleton via bond lines, illustrating their role as pendant chains or rings. For instance, is represented as a ring () connected to a single carbon (implicit -CH₃), simplifying the full structure C₆H₅CH₃. Generic conventions use lowercase letters like "r" or uppercase "R" to denote unspecified alkyl substituents, allowing for broad applicability in schematic representations without specifying the exact chain length or branching.
Group TypeExampleAbbreviationSkeletal Representation
AlkylMethylMe(single line or label)
AlkylEthylEt-- (two-line chain or label)
AlkylIsopropyliPr/-- (branched line or label)
Alkyltert-ButyltBu*-- (tertiary branch or label)
AlkylBenzylBnPh-CH₂- (label or line to hexagon)
AlkylCyclohexylCyHexagon (aliphatic) attached by line
AromaticPhenylPhHexagon attached by line
AromaticNaphthylNapFused rings attached by line
UnsaturatedVinyl(none standard)=-- (double bond stub)
UnsaturatedAllyl(none standard)-=- (methylene with double bond)

Functional and Special Groups

In skeletal formulas, functional groups containing heteroatoms or carbonyl moieties are often represented using pseudosymbols to simplify depiction while maintaining clarity in diagrams. The , abbreviated as Ac (CH₃CO–), is commonly used to denote acetate esters or amides. Similarly, the , denoted as Bz (C₆H₅CO–), serves as a shorthand for benzoyl esters or protecting moieties in complex molecules. These symbols are integrated into line-angle representations by attaching them directly to the relevant bond, emphasizing the functional group's role over exhaustive atomic detail. Sulfonyl and sulfonate groups, essential for activation and protection in , employ pseudosymbols such as Ts for p-toluenesulfonyl (CH₃C₆H₄SO₂–) and Ms for mesyl (CH₃SO₂–). These are frequently shown in skeletal diagrams as Ts– or Ms– attached to oxygen or . Protecting groups for temporary blockage of reactive sites are denoted by symbols like Boc for tert-butoxycarbonyl ((CH₃)₃COCO–) and Fmoc for 9-fluorenylmethyloxycarbonyl (9-fluorenyl-CH₂OCO–). While these pseudosymbols are prevalent in scientific literature for efficiency, IUPAC guidelines advise their use only with explicit definitions to ensure unambiguous interpretation, discouraging overuse in formal structural representations to prioritize complete atomic depiction.

Advanced Representations

Multiple Bonds

In skeletal formulas, double bonds are represented by two parallel lines or the "=" symbol connecting the relevant atoms, a convention that distinguishes them from single bonds depicted by a single line. This representation is standard in to indicate the presence of pi bonds alongside bonds, commonly appearing in structures such as alkenes and carbonyl compounds. Triple bonds are similarly denoted by three parallel lines or the "≡" symbol, signifying one and two pi bonds, and are typically found in functional groups. The parallel alignment of these lines ensures clarity, with the segments drawn in close proximity—separated by less than 33% of the —to prevent misinterpretation as separate bonds. Placement of multiple bonds follows specific conventions to minimize ambiguity in the two-dimensional . For double bonds, the lines may be offset based on atomic : toward the more substituted side in asymmetric cases, centered in symmetrical ones like carbonyls (C=O), or adjusted for trans configurations by offsetting one segment. Triple bonds adhere to similar parallelism but are less frequently offset due to their linear . In chains, bonds are oriented horizontally or at standard angles to maintain readability. Representative examples illustrate these depictions. Ethene (C₂H₄) is shown as two carbon atoms connected by a double bond, with two implicit hydrogens on each carbon: H₂C=CH₂. Acetylene (C₂H₂), an alkyne, features a triple bond between two carbons, each bearing one implicit hydrogen: HC≡CH. Multiple bonds affect the calculation of implicit hydrogens on carbon atoms, which are assumed to satisfy tetravalency (four bonds total). A double bond counts as two connections for each attached carbon, and a triple as three, thereby reducing the number of implicit hydrogens compared to saturated structures. For instance, simple alkenes follow the general formula CₙH₂ₙ, reflecting two fewer hydrogens per double bond than the alkane formula CₙH₂ₙ₊₂, while alkynes are CₙH₂ₙ₋₂ due to the triple bond's higher bond order.

Benzene and Aromatic Rings

In skeletal formulas, is conventionally represented as a regular with a circle inscribed within it, symbolizing the delocalized π-electrons across the ring rather than localized double bonds. This depiction emphasizes the aromatic nature of the molecule, where each vertex implies a carbon atom bonded to one hydrogen, with no explicit hydrogens shown. An alternative, older representation uses alternating single and double bonds around the (the Kekulé structure), but this is less favored in modern skeletal notation as it may imply fixed bond orders inconsistent with delocalization. For fused aromatic systems like , skeletal formulas depict the rings as abutting regular hexagons sharing a common side (two bonds), with circles inscribed in each ring to indicate . Standard orientations position the fusion horizontally for readability, minimizing overlap and aligning with conventional depictions in chemical literature. In larger polycyclic aromatic hydrocarbons, such as , additional hexagons are fused linearly or angularly, maintaining the shared-side convention without explicit carbon labels at fusion points. Substituted aromatic compounds attach functional groups directly to the vertices of the ring in skeletal formulas, with positions numbered starting from at the carbon bearing the principal , proceeding or counterclockwise to give the lowest numbers to other substituents. For example, phenol is shown as a with an inscribed and an -OH group attached to one , implying the five remaining carbons each bear a . Numbering ensures substituents are placed to avoid bond overlap, often with the ring oriented such that the principal group is at the bottom or left. Variations in aromatic ring depiction include a thick or bold circle to further highlight delocalization, particularly in hand-drawn skeletal formulas, though thin solid circles are standard in printed diagrams. Contemporary usage increasingly avoids explicit double bonds in aromatic systems to prevent misinterpretation of bond localization, favoring the circle or sometimes a dashed circle for clarity in complex structures.

Stereochemistry

Skeletal formulas incorporate stereochemical information to depict the three-dimensional arrangement of atoms, addressing the inherent limitations of two-dimensional representations. The primary method for indicating tetrahedral at chiral centers is the wedge-and-dash notation, where a solid wedge represents a projecting forward (toward the viewer) out of the plane of the paper, and a dashed line indicates a projecting backward (away from the viewer). Bonds lying in the plane are shown with ordinary lines. This convention, standardized by IUPAC, allows chemists to specify the configuration at sp³-hybridized carbon atoms without altering the simplified skeletal structure. In cyclic compounds, cis/trans isomerism is conveyed through relative positioning or by using up/down wedges and dashes to show substituents on the same or opposite sides of the ring plane. For example, in cyclohexane derivatives, a cis-1,2-disubstituted ring might depict both substituents with solid wedges (both forward) or one wedge and one dash (one forward, one back), while trans would use one of each in opposing directions. This notation relies on the ring being drawn in a standard chair or flat projection, with the plane defined implicitly by the skeletal lines. For alkenes, double bond geometry is represented using Z/E descriptors based on Cahn-Ingold-Prelog priority rules, often visualized in skeletal formulas with zigzag chains where the higher-priority groups are shown on the same (Z) or opposite (E) sides of the . The itself is indicated by two parallel lines, with substituents positioned to reflect their spatial relationships without additional wedges. This approach ensures clarity in linear representations while adhering to IUPAC guidelines for stereodescriptors. A representative example is (R)-2-butanol, depicted in skeletal form as a zigzag chain with the chiral carbon at the second position: the (CH₃) as a line, the hydroxyl (OH) as a solid wedge forward, the ethyl (CH₂CH₃) as a dashed line back, and the implicit in the plane. This explicitly shows the tetrahedral arrangement, contrasting with achiral representations that omit such indicators; skeletal formulas do not imply without these additions. The two-dimensional nature of skeletal formulas limits their ability to fully capture complex , such as in large biomolecules or multiple chiral centers, often requiring supplementary explicit atoms or 3D software for precision. Despite these constraints, wedge-dash augmented skeletal formulas are widely used in pharmaceutical chemistry to illustrate and drug-receptor interactions, where stereoisomers can exhibit vastly different biological activities.

Specialized Features

Hydrogen Bonds

In skeletal formulas, hydrogen bonds are represented using dotted or dashed lines to distinguish them from covalent bonds, which are depicted with solid lines. These non-covalent interactions occur between a attached to an electronegative donor atom (typically oxygen in O-H or in N-H groups) and an electronegative acceptor atom (such as oxygen or ) possessing a of electrons. This notation allows for the visualization of intermolecular or intramolecular associations without altering the core covalent framework of the skeletal structure. The convention emphasizes that hydrogen bonds are not integral to the covalent skeleton and are primarily employed in diagrams illustrating intermolecular interactions, such as in molecular assemblies or biochemical complexes. Unlike the solid lines representing single, double, or triple covalent bonds in skeletal formulas, the dotted lines for hydrogen bonds indicate weaker, directional attractions, often spanning distances of 2.5–3.5 between the donor and acceptor atom. If ambiguity arises, labels such as "H-bond" may be added adjacent to the dotted line for clarity. In cases where the on the donor is critical, it is explicitly shown in the skeletal formula, departing from the usual omission of hydrogens on carbon. A classic example is the water dimer, where skeletal formulas of two H₂O molecules are connected by dotted lines between the O-H donor of one and the oxygen acceptor of the other, highlighting the tetrahedral arrangement of hydrogen bonds in liquid water. In biochemical contexts, this notation is essential for depicting DNA base pairing, such as the two or three dotted lines between adenine and thymine (involving N-H donors and O/N acceptors) or guanine and cytosine, which stabilize the double helix structure. These representations find applications in illustrating mechanisms, where dotted lines show s between substrate functional groups and residues, facilitating through stabilization of transition states. In , skeletal formulas with dotted lines depict host-guest complexes, such as those in molecular recognition events driven by multiple O-H···O or N-H···N interactions.

Protecting and Sulfonyl Groups

In skeletal formulas, sulfonyl groups are depicted as -SO_2-R attachments to heteroatoms, where the sulfur is bonded to two oxygens (one with a ) and an R , serving dual roles in for reactions and protection of nucleophilic sites like alcohols and amines. The tosyl (Ts) group, with R as 4-methylphenyl, is commonly shown with the SO_2 linked to a simplified ring bearing a para-methyl branch, while the mesyl (Ms) group features R as methyl and is represented as -SO_2CH_3. These sulfonates (-OSO_2R) and sulfonamides (-NSO_2R) enhance ability in substitution reactions and stabilize adjacent functional groups during . Protecting groups act as reversible masks for reactive functional groups, such as hydroxyl (-OH) and amino (-NH_2), allowing selective manipulation of other sites in multifunctional molecules; in skeletal notation, they are integrated directly onto the carbon framework via abbreviated labels on the protected atom. Silyl ethers, a prevalent class for protection, employ notations like tert-butyldimethylsilyl (TBDMS or ), rendered as -OTBDMS to imply the full (CH_3)_2C(CH_3)_3 structure without explicit drawing, offering stability under basic conditions and facile removal with . This convention originated in seminal work on for complex molecule assembly. In multi-step synthetic schemes, protecting and sulfonyl groups are routinely abbreviated in skeletal formulas to prioritize the core transformations, with full structures referenced only when ambiguity arises, ensuring diagrams remain uncluttered for tracing reactivity. For instance, in , Ts or Ms groups on are shown as -NTs or -NMs along the chain, facilitating orthogonal deprotection during coupling sequences. A modern example is the 2-(trimethylsilyl)ethoxymethyl (SEM) group, abbreviated as -SEM and attached as -OSEM or -NSEM, which has been employed post-2000 in total syntheses of alkaloids like asperazine for mild, selective protection of indoles and alcohols, removable under acidic conditions without affecting sensitive motifs. These notations underscore the essential role of protecting and sulfonyl groups in , enabling precise control over and in constructing intricate natural products and pharmaceuticals.

References

  1. [1]
  2. [2]
    Optional Supplementary Chapter: Organic Chemistry – Astrobiology
    Here are simple rules for how to interpret a skeletal formula: Each vertex in a skeletal formula is a carbon atom. Double bonds and triple bonds are shown ...
  3. [3]
    Skeletal Diagrams in Organic Chemistry - HSCprep
    Mar 4, 2025 · Heteroatoms (non-carbon/hydrogen atoms) are shown explicitly; Multiple bonds appear as parallel lines. Reading Skeletal Diagrams. To interpret ...
  4. [4]
    A Brief Guide to Types of Organic Chemistry Formulae
    Apr 11, 2014 · ... organic molecule. The skeletal formula is the simplest way of representing organic molecules, and as such is commonly seen in both textbooks ...<|control11|><|separator|>
  5. [5]
    CH103 - Chapter 5: Covalent Bonds and Introduction to Organic ...
    Line or Skeletal Formulae. In a line or skeletal formula, all the hydrogen atoms are not shown and all the carbons are not labeled but rather are indicated at ...
  6. [6]
  7. [7]
  8. [8]
    A History of Molecular Representation Part One: 1800 to the 1960s
    This is the first in a series of articles dealing with the historical development of molecular representations.
  9. [9]
    [PDF] Graphical Representation Standards for Chemical Structure Diagrams
    These depiction styles are generally applicable across many classes of chemical structures. © 2008 IUPAC, Pure and Applied Chemistry 80, 277–410. Graphical ...
  10. [10]
    SMILES, a chemical language and information system. 1 ...
    SMILES, a chemical language and information system. 1. Introduction to methodology and encoding rules.
  11. [11]
    [PDF] Chapter 23: Organic Chemistry
    Page 14. • Skeletal Structures: Consist of straight lines that represent carbon-carbon bonds. -- Heteroatoms (atoms other than C or H) shown explicitly.Missing: skeleton | Show results with:skeleton
  12. [12]
    Structure and Bonding – Organic Chemistry: Fundamental Principles ...
    Organic chemistry is the study of the synthesis, structure, reactivity and properties of a large group of chemical compounds primarily consisting of carbon.
  13. [13]
    [PDF] 11.2 Alkanes - De Anza College
    • as a skeletal formula. Chemistry: An Introduction to General, Organic, and Biological Chemistry, Twelfth Edition. © 2015 Pearson Education, Inc. Structural ...
  14. [14]
    Representing Organic Compounds - Student Academic Success
    IUPAC naming is a system of naming organic compounds. that identifies the number of carbons, presence of functional groups, and the homologous series. the ...
  15. [15]
  16. [16]
    [PDF] Skeletal Structure Organic Chemistry
    Instead of drawing out every atom and bond explicitly, skeletal formulas emphasize the carbon backbone and the arrangement of other atoms attached to it. In ...
  17. [17]
    skeletal formula (08208) - IUPAC Gold Book
    skeletal formula. synonym: bond-line formula. Copy. https://doi.org ... Citation: 'skeletal formula' in IUPAC Compendium of Chemical Terminology, 5th ed.
  18. [18]
    Illustrated Glossary of Organic Chemistry - Skeletal formula (skeletal ...
    Skeletal formula (skeletal structure; line-angle formula): A representation of molecular structure in which covalent bonds are shown as lines.
  19. [19]
    11.3: Condensed Structural and Line-Angle Formulas
    Aug 8, 2019 · Line-angle formulas imply a carbon atom at the corners and ends of lines. Each carbon atom is understood to be attached to enough hydrogen atoms ...
  20. [20]
    Hidden Hydrogens, Hidden Lone Pairs, Hidden Counterions
    Jun 27, 2011 · The key thing in recognizing hidden hydrogens is that wherever you see less than 4 bonds to carbon, that carbon is going to have hydrogens ...Missing: formula | Show results with:formula
  21. [21]
    Wedge and Dash Representation - Chemistry Steps
    Wedge and dash notation shows atoms in front of the plane with a wedge line, and atoms behind with a dashed line, but it's not absolute.
  22. [22]
    1.8: Structural Formulas - Lewis, Kekule, Bond-line, Condensed, and ...
    May 30, 2020 · Dashed lines show atoms and bonds that go into the page, behind the plane, away from you. In the above example, the OH group is going into the ...
  23. [23]
    Abbreviations and acronyms of chemical compounds
    Standard abbreviations and acronyms of chemical compounds used by the ... Ph, phenyl, C6H5, 77.104, radical derived from benzene. 70, piv, pivaloyl, (CH3)3CCO ...<|separator|>
  24. [24]
    Common alkyl substituents - University of Calgary
    the prefixes sec- or s- and tert- or t- are commonly used to indicate specific types of branched substituents. The less common prefix n- (or neo-) is ...Missing: skeletal Ph
  25. [25]
    [PDF] Examining the Relationship between 2D Diagrammatic Conventions ...
    Nov 7, 2014 · Within the context of the research presented here, we focus on three 2D diagrams that novices are asked to use routinely throughout the course ...
  26. [26]
    Graphical Representation Standards for Chemical Structure Diagrams
    May 2, 2025 · Recommendations are provided for the display of two-dimensional chemical structure diagrams in ways that avoid ambiguity and are likely to be understood ...
  27. [27]
    [PDF] Brief Guide to the Nomenclature of Organic Chemistry - IUPAC
    The structural formulae of organic-chemical compounds are usually drawn in accordance with the zig-zag convention as used widely above.12 In this convention ...
  28. [28]
    Blue Book | International Union of Pure and Applied Chemistry
    The rules given in the Nomenclature of Organic Chemistry, commonly known as the “Blue Book”, emphasize the generation of unambiguous names.Missing: formula conventions
  29. [29]
    4.3: Condensed Formula and Skeletal Structures
    May 12, 2024 · Skeletal formulas imply a carbon atom at the corners and ends of lines. Each carbon atom is understood to be attached to enough hydrogen atoms ...
  30. [30]
    Getting to grips with skeletal formula in A Level Chemistry
    Feb 20, 2025 · This article breaks down the rules and provides step-by-step examples to help you master skeletal formulae in A level Chemistry.
  31. [31]
    Skeletal Structure: Videos & Practice Problems - Pearson
    Heteroatoms are non-carbon atoms in an organic molecule, such as nitrogen, oxygen, and fluorine. In bond line structures, lone pairs on heteroatoms are implied ...Missing: multiple | Show results with:multiple
  32. [32]
    Drawing Bond-Line (Skeletal) Structures - Organic Chemistry Tutor
    Drawing Bond-Line structures efficiently requires adhering to certain guidelines. These include: Zig-zag lines: Always draw structures in neat zig-zags, as ...
  33. [33]
    [PDF] Skeletal Structure Organic Chemistry
    This article explores the intricacies of skeletal structures, their advantages, limitations, and their broader impact on organic chemistry comprehension.
  34. [34]
    2.1 Drawing and Interpreting Organic Formulas
    Hydrogens bonded to nitrogen, oxygen, sulfur, or anything other than carbon are shown, but are usually drawn without showing the bond. The following examples ...
  35. [35]
    6.2: Representing Organic Molecules - Chemistry LibreTexts
    Oct 25, 2025 · Structural formulas can uniquely identify an organic molecule because they show all the atoms and how they are bonded to one another. They are ...Missing: heteroatoms | Show results with:heteroatoms
  36. [36]
    10.2: Organic Structural Representations: Molecular Formulas
    May 28, 2024 · ... heteroatoms are written in the final position, or positions, in a molecular formula. If multiple heteroatoms are present in a single organic ...Missing: skeletal | Show results with:skeletal
  37. [37]
    Skeletal Formula: Videos & Practice Problems - Pearson
    Skeletal formulas, also known as bond line or line formulas, are streamlined representations of complex organic structures.
  38. [38]
    4.8: Line Drawings - Chemistry LibreTexts
    Oct 4, 2022 · In line structures, we drop the label "C" for the carbon atoms. Anytime there is a joint between two bonds (drawn as a vertex in a zig-zag line) ...
  39. [39]
    Abbreviations - The Journal of Organic Chemistry
    Standard abbreviations include: observed optical rotation [ ], specific rotation [ ], Å angstrom(s), Ac acetyl, and FD field desorption.<|control11|><|separator|>
  40. [40]
    Organic Nomenclature - MSU chemistry
    In general, for a hydrocarbon composed of n carbon atoms associated with m rings the formula is: CnH(2n + 2 - 2m). The structural relationship of rings in a ...Alkanes · Cycloalkanes · Alkenes And Alkynes
  41. [41]
    A historical study of structures for communication of organic ...
    Oct 11, 2004 · This paper will explore the origins of structure diagrams, their representation in the chemistry literature and the resulting effect on chemical information ...Missing: skeletal formula
  42. [42]
    Functional Groups In Organic Chemistry
    Oct 6, 2010 · A functional group is what we call specific groupings of certain atoms within molecules that have their own characteristic properties.<|control11|><|separator|>
  43. [43]
    3.1. Nomenclature of alkanes & related structures - Lumen Learning
    In general formulas and structures, alkyl and aryl groups are often abbreviated as R. Naming complex alkanes. Playing in picture-in-picture. (opens a new ...
  44. [44]
    [PDF] Common Abbreviations in Organic Chemistry
    Page 1. COMMON ABREVIATIONS IN ORGANIC CHEMISTRY. Ac. = Acetyl. AcO. = Acetate. Ac2O. = Acetic anhydride. AIBN. = a,a'-Azoisobyronitrile. All. = Allyl. Alloc.Missing: skeletal | Show results with:skeletal
  45. [45]
    Peptide Synthesis- Protein Facility at Iowa State University
    Typical labile protecting groups include t-Boc (tert-butyloxycarbonyl) and FMOC (9-flourenylmethloxycarbonyl). t-Boc is a very satisfactory labile group ...
  46. [46]
    [PDF] use of abbreviations in the chemical literature | iupac
    IUPAC recommends few, well-established abbreviations, defining them in a paper, and not using them in titles/abstracts. Avoid abbreviating physical quantities.
  47. [47]
    1.1: Line Angle Notation as the Language of Chemical Structure
    ### Summary of Multiple Bonds in Line-Angle (Skeletal) Formulas
  48. [48]
    How to draw organic molecules - Chemguide
    To overcome this possible confusion, the convention is that you always look for the longest possible chain of carbon atoms, and then draw it horizontally.Missing: wedge dashed
  49. [49]
    [PDF] STEREOCHEMISTRY - iupac
    Hydrogen or other atoms or groups attached at steri- cally designated positions should never be omitted. In chemical formulae, rings are usually drawn with ...
  50. [50]
    Stereochemistry in Drug Action - PMC - NIH
    Stereochemistry in drug action refers to how chiral drugs, with two forms (enantiomers), can behave differently in biological systems, impacting drug action.
  51. [51]
  52. [52]
    [PDF] Chapter 2: Structure and Bonding II - Organic Chemistry
    Chapter 2 covers sketching atomic orbitals, forming sigma/pi bonds, identifying hybridization, resonance structures, non-covalent interactions, and hydrogen ...Missing: skeletal | Show results with:skeletal
  53. [53]
  54. [54]
    [PDF] 1 Protection Reactions - Wiley-VCH
    The 4-toluenesulfonyl (tosyl or Ts) group 25 is the first example of this type, which was described by Emil Fischer. However, its application to peptide ...
  55. [55]
    [PDF] Protective Groups - Harvard
    Protective Groups In Organic Synthesis, 3rd ed. John Wiley & Sons: New York, 1991. Important Silyl Ether Protective Groups: Trimethylsilyl (TMS).
  56. [56]
    Total Synthesis of Asperazine | Journal of the American Chemical ...
    Removal of the SEM-protecting groups 17 from 12 gave 13. At this point, 1H NOE analysis revealed the E stereochemistry of the exocyclic double bond ...Missing: post- | Show results with:post-
  57. [57]
    Protection (and Deprotection) of Functional Groups in Organic ...
    In this review we describe the application of heterogeneous catalysis in protecting group chemistry, focusing mainly on that developed during the past decade.2.2. Silyl Ethers 2.2. 1... · 2.3. 3. Deprotection · 10. ReferencesMissing: post- | Show results with:post-<|control11|><|separator|>