Fact-checked by Grok 2 weeks ago

Signal peptide

A signal peptide is a short , typically 15–30 residues in length, situated at the of precursor proteins targeted for or membrane integration across cellular . It directs these proteins to the secretory pathway by binding the (SRP), which pauses and facilitates co-translational translocation through protein-conducting channels, such as the Sec61 translocon in eukaryotes or SecYEG in prokaryotes. Following successful targeting, the signal peptide is generally cleaved by a signal peptidase at a specific site, yielding the mature functional protein. The canonical structure of signal peptides consists of three domains: an N-region (1–5 residues, positively charged with basic amino acids like or ), a central H-region (7–15 hydrophobic residues forming an α-helix), and a C-region (3–7 polar or neutral residues including the cleavage motif). The N-region ensures correct topological by its positive charge, which repels the negatively charged plasma membrane and promotes SRP . The H-region drives membrane insertion via hydrophobic interactions, while the C-region adheres to the "-3, -1 rule," favoring small uncharged residues (e.g., , ) at positions -3 and -1 relative to the scissile bond for peptidase recognition. Variations exist, such as longer peptides in certain viral proteins or twin-arginine signal peptides for folded protein export in . Signal peptides are conserved and ubiquitous in all kingdoms of life, accounting for the targeting of 10–30% of eukaryotic and prokaryotic proteomes to secretory or membrane destinations. Beyond initial targeting, they can retain intracellular roles post-cleavage, including modulation of , membrane anchoring, or even nuclear localization in some cases. In , signal peptides are engineered for enhanced recombinant protein secretion in hosts like Escherichia coli or Pichia pastoris, improving yields for biopharmaceuticals, enzymes, and vaccines. Their prediction and design rely on computational tools analyzing hydrophobicity and cleavage motifs, underscoring their foundational role in cellular protein trafficking.

Fundamentals

Definition and Role

A signal peptide is a short N-terminal sequence, typically 16-30 residues in length, that directs nascent proteins to the secretory pathway for translocation across cellular membranes or insertion into them. These peptides are essential components of precursor proteins destined for secretion or membrane localization in both eukaryotic and prokaryotic cells. In eukaryotic cells, signal peptides are recognized by the (SRP), which binds to the nascent polypeptide emerging from the and targets it to the (ER) membrane, where translocation occurs through the Sec61 translocon. In prokaryotes, analogous mechanisms involve SRP-mediated targeting to the SecYEG translocon for co-translational translocation or chaperone-assisted post-translational pathways, enabling proteins to cross the cytoplasmic membrane into the or outer membrane. Signal peptides often feature a hydrophobic core region that facilitates their interaction with these targeting and translocation machineries. Following successful translocation, the signal peptide is cleaved by signal peptidase, releasing the mature protein for its final destination within the secretory pathway or on the surface. Representative examples include the signal peptide of preproinsulin in eukaryotic , which guides the precursor to the for processing into insulin, and those of bacterial periplasmic proteins, such as enzymes involved in nutrient uptake, which direct translocation across the inner membrane.

Historical Discovery

The concept of signal peptides emerged from efforts to understand how proteins are targeted to specific cellular locations, particularly for secretion. In 1971, Günter Blobel and David Sabatini proposed the initial version of the signal hypothesis, suggesting that secretory proteins synthesized on membrane-bound ribosomes contain an N-terminal "signal" sequence that directs their insertion into the endoplasmic reticulum (ER) membrane. This idea was based on observations of rough microsomes and the asymmetric distribution of nascent secretory proteins. Experimental validation came in 1975 through studies by Blobel and Bernhard Dobberstein, who used in vitro translation systems with canine pancreatic microsomes and mRNA from immunoglobulin light chains. They demonstrated that preproteins are translocated across the membrane in a vectorial manner, with the N-terminal signal sequence emerging first from the ribosome and facilitating co-translational transfer, while subsequent proteolytic processing removes the signal. Further advancements in the late 1970s and 1980s solidified the role of signal peptides in eukaryotic systems. Blobel and colleagues identified the (SRP), a ribonucleoprotein complex that binds the signal sequence as it emerges from the , halting until docking at the membrane via the SRP receptor; this was purified and characterized between 1979 and 1982. Confirmation of signal peptide cleavage came in 1980, when Robert C. Jackson and Blobel isolated and characterized a signal peptidase from canine pancreatic rough microsomes, showing its ability to process full-length presecretory proteins post-translationally . The involvement of a protein-conducting translocon channel in the membrane, initially proposed in the 1975 hypothesis, was further evidenced in the 1980s through fractionation studies revealing complexes essential for translocation. Parallel discoveries in prokaryotes during the established evolutionary conservation of signal peptide mechanisms. In , Masayori Inouye and colleagues sequenced the prolipoprotein precursor of the major outer membrane lipoprotein in , identifying a 20-amino-acid N-terminal extension as the first prokaryotic signal sequence, which is cleaved during export to the outer membrane. This was followed by the detection of prokaryotic signal peptidase activity in E. coli membrane fractions in 1978, confirming endoproteolytic cleavage of nascent pre-coat proteins from bacteriophage f1. Blobel's contributions culminated in the 1999 Nobel Prize in Physiology or Medicine, awarded solely to him "for the discovery that 'proteins have intrinsic signals that govern their transport and localization in the cell'," recognizing the signal hypothesis and its experimental foundations.

Structural Characteristics

Amino Acid Composition

Signal peptides typically exhibit a tripartite structure consisting of an N-region, a central H-region, and a C-region. The N-region, spanning 1 to 5 residues at the amino terminus, is characteristically positively charged due to the presence of basic amino acids such as lysine (Lys) and arginine (Arg), which contribute to the overall polarity and orientation during translocation. The H-region, comprising 7 to 15 residues, forms a hydrophobic core dominated by non-polar amino acids including leucine (Leu), alanine (Ala), valine (Val), and isoleucine (Ile), enabling interaction with the lipid bilayer of the membrane. The C-region, 3 to 7 residues long, is polar and terminates in a cleavage site motif where small, uncharged amino acids—often alanine—at positions -1 and -3 relative to the cleavage point follow von Heijne's rule, such as the Ala-X-Ala sequence, facilitating precise enzymatic removal by signal peptidase. Variations in signal peptide composition occur across organisms, influencing length and hydrophobicity while preserving functional motifs. In , signal peptides tend to be shorter, averaging 15 to 20 residues, compared to 20 to 30 residues in and eukaryotes, with reduced hydrophobicity in the H-region to adapt to their environments. Eukaryotic signal peptides, particularly those targeting the , often display greater polarity in the H-region with occasional charged residues, contrasting with the stricter hydrophobic profiles in prokaryotes, though subtle differences exist between Gram-positive and . These adaptations reflect evolutionary pressures on translocation efficiency without compromising the core hydrophobic character essential for insertion. A representative example is the signal peptide of the bacterial outer OmpA from , which spans 21 residues: MKKTAIAIAVALAGFATVAQA. This sequence illustrates the tripartite , with the N-region (MKKT) featuring positive charges, the H-region (AIAIAVALAGFATVA) rich in hydrophobic leucines and alanines for spanning, and the C-region (QA) adhering to the cleavage motif at the -1 and -3 positions. The hydrophobic core of the H-region remains highly conserved across signal peptides, serving as the primary determinant for initial interaction and translocation initiation, with disruptions in hydrophobicity severely impairing function.

Secondary and Tertiary Features

The secondary and tertiary features of signal peptides are primarily centered on the H-region's conformational preferences. The H-region, typically comprising 7-15 hydrophobic residues, predominantly adopts an α-helical conformation that enables its insertion into the during protein translocation. This helical structure positions hydrophobic side chains to interact favorably with the interior, stabilizing the peptide's orientation parallel to the bilayer plane. Flexibility at the junctions between the N-, H-, and C-regions allows the signal peptide to adapt to binding partners and environments, while the C-region often features β-turns or loops near the site to facilitate access by signal peptidase. These structural elements provide the necessary mobility for the peptide to transition from a soluble state to membrane-embedded configurations without rigid constraints. Recent biophysical studies using cryo-electron microscopy (cryo-EM) and (NMR) have provided detailed insights into these conformations in complex with targeting factors. For instance, cryo-EM structures from 2022 of ribosome-nascent chain complexes reveal how the signal peptide's α-helix interacts with the (SRP), undergoing localized deformations to fit into the SRP's hydrophobic groove for specific recognition and handover during co-translational targeting. These findings highlight dynamic adjustments in helical integrity upon binding, enhancing targeting efficiency. The amphipathic nature of the H-region α-helix further supports interactions with , where one face rich in hydrophobic residues partitions into the bilayer core while polar elements on the opposite face interact with headgroups or aqueous interfaces. This property promotes stable bilayer insertion and orientation, as demonstrated in model studies.

Functional Mechanisms

Protein Translocation Process

The protein translocation process initiated by the begins with its recognition during . In eukaryotes, as the signal peptide emerges from the , it is bound by the (SRP), a ribonucleoprotein complex consisting of SRP RNA and six proteins, which pauses to prevent misfolding. This interaction occurs with high affinity due to the hydrophobic core of the signal peptide fitting into the SRP54 subunit's binding groove. In prokaryotes, the analogous involves the Ffh protein (the bacterial SRP54 homolog) and Ffs RNA binding the signal peptide on the nascent chain, similarly arresting elongation. The targeting phase follows, where the SRP-ribosome-nascent chain complex docks to the SRP receptor on the (ER) membrane in eukaryotes or the plasma membrane in prokaryotes. In eukaryotes, the SRP receptor (SR), composed of SRα and SRβ subunits, recognizes the SRP via GTP-dependent interactions, facilitating delivery of the complex to the Sec61 translocon. Prokaryotic targeting employs the FtsY receptor, which interacts with the Ffh-SRP complex through similar GTP-binding mechanisms, directing it to the SecYEG translocon embedded in the membrane. Translocation proper occurs upon handover of the nascent chain to the protein-conducting . In eukaryotes, the ribosome associates directly with the Sec61 complex, a heterotrimeric (Sec61α, β, γ), allowing the signal peptide to insert into the lateral gate formed by transmembrane helices, opening an aqueous pore through which the polypeptide threads into the lumen. The prokaryotic SecYEG , evolutionarily conserved with Sec61, functions analogously, with the signal peptide binding to SecY's transmembrane domains to initiate channel gating and vectorial translocation across the inner . Cleavage of the signal peptide happens co- or post-translationally once a sufficient length of the mature protein has entered the or . This is mediated by signal peptidase I (SPase I), an membrane-embedded complex in eukaryotes (comprising subunits SPC12, SPC18, SPC21, SPC22/23, and SPC25) or a similar in prokaryotes (LepB), which recognizes the cleavage site at the of the signal peptide and excises it, releasing the mature protein. The process is energetically driven by GTP in both domains of . In eukaryotes and prokaryotes, GTP binding and subsequent by SRP and its receptor (SRP54/SRα in eukaryotes; Ffh/FtsY in prokaryotes) power targeting and complex disassembly, releasing the signal peptide for translocon engagement. In , translocation through SecYEG additionally requires the proton motive force across the inner to drive further chain movement, particularly for post-translational aspects, while eukaryotes rely on ribosome-driven pushing and BiP-mediated pulling in the ER .

Co-Translational vs Post-Translational Pathways

Signal peptides mediate protein translocation across cellular membranes through two primary timing-based pathways: co-translational and post-translational. In the co-translational pathway, predominant in eukaryotes, translocation occurs concurrently with protein synthesis on cytosolic ribosomes. The (SRP) binds to the emerging hydrophobic signal peptide on the nascent polypeptide chain, pausing and directing the ribosome-nascent chain complex to the (ER) membrane via the SRP receptor. This pathway is particularly suited for larger secretory or membrane proteins, preventing premature folding in the that could hinder translocation. Energy for targeting and insertion is provided by GTP through SRP and its receptor components. In contrast, the post-translational pathway involves complete synthesis of the protein in the before translocation begins, making it the dominant mechanism in prokaryotes. In bacteria such as , the chaperone SecB binds to the fully translated preprotein, maintaining it in an unfolded, translocation-competent state, and delivers it to the SecA at the SecYEG translocon in the plasma membrane. SecA uses cycles of to drive the preprotein through the translocon, with additional energy from the proton motive force across the membrane facilitating later stages. This pathway is common for smaller proteins that can remain unfolded without aggregating. In eukaryotes, post-translational translocation of signal peptide-bearing proteins occurs via the Sec62/Sec63 complex in the membrane, typically for short secretory proteins, and is ATP-dependent involving chaperones like and BiP. Examples include the precursor prepro-α-factor, which translocates across ER membranes after synthesis completion in an ATP-dependent manner. In prokaryotes, (PhoA) serves as a classic post-translational substrate in E. coli, dependent on SecB and SecA for export to the . Key differences between the pathways include their temporal coupling to , chaperone systems, and energetic requirements. Co-translational translocation relies on SRP for direct targeting and GTP as the primary energy source, ensuring efficiency for complex proteins, whereas post-translational translocation uses dedicated chaperones like SecB or /BiP and ATP (plus proton motive force in prokaryotes) to handle fully synthesized chains, better accommodating smaller or prokaryotic proteins. Protein size influences pathway selection: larger proteins favor co-translational to avoid cytosolic folding issues, while smaller ones use post-translational routes. These pathways ensure targeted protein localization while adapting to organism-specific needs.

Variations in Secretion

Efficiency Determinants

The efficiency of signal peptide-directed protein is profoundly influenced by the physicochemical properties of the signal peptide sequence, particularly the hydrophobicity and length of its central hydrophobic (H) region. An optimal average hydrophobicity, quantified by a grand average of hydropathy () index of approximately 1.0, promotes efficient insertion and translocation, whereas values below 0 indicate poor performance and severely limit yields. Conversely, excessively high hydrophobicity in the H-region can promote protein misfolding or aggregation during translocation, thereby compromising overall . The length of the H-region typically ranges from 7 to 15 , with deviations leading to suboptimal specificity for the secretion machinery; for instance, shorter or longer H-regions reduce translocation rates in bacterial systems by altering the of the signal peptide- . Cleavage site accessibility represents a critical bottleneck in , governed by the signal peptidase recognition at positions -3 and -1 relative to the point, which favors small, uncharged residues such as (A-X-A ). Mutations at these positions, including substitutions with larger or charged , disrupt peptidase binding and can significantly reduce , resulting in accumulation of unprocessed precursors and reduced mature protein yields. For example, altering the -1 in certain signal peptides has been shown to drop from nearly 90% to around 30%. The basic composition around this site, including polar residues in the C-region, further modulates accessibility by influencing the conformational flexibility required for peptidase approach. Contextual factors, including interactions between the signal peptide and the mature , significantly modulate secretion success, as incompatible pairings can induce premature folding that sterically hinders translocation. Species-specific adaptations also play a role; for instance, bacterial signal peptides often exhibit lower efficiency than native signals when expressed in eukaryotic hosts, due to differences in secretion pathway components. In industrial biotechnology, environmental variables such as and further impact efficiency, with deviations from neutral (around 7) or physiological temperatures (30-37°C) impairing translocation machinery activity and reducing yields in recombinant systems. Secretion yield assays across diverse signal peptides reveal striking quantitative variations, with differences spanning 10- to over 100-fold for the same target protein, underscoring the need for tailored selection to optimize production.

Alternate and Nonclassical Pathways

Unconventional protein secretion (UPS) refers to mechanisms that export proteins without relying on the classical endoplasmic reticulum (ER)-Golgi pathway, often involving leaderless proteins that lack N-terminal signal peptides. These pathways enable the release of cytokines, growth factors, and other bioactive molecules in response to cellular stress or specific physiological needs, bypassing the need for ER targeting. In contrast to classical co- or post-translational translocation via the Sec61 translocon, UPS routes typically avoid signal peptide recognition and cleavage. A prominent example of UPS involves leaderless proteins such as and interleukin-1β (IL-1β), which are exported directly across the . FGF2 occurs through a Type I mechanism, where at the inner leaflet induces a transient pore for direct translocation, facilitated by the protein's interaction with the . IL-1β, a pro-inflammatory , follows a Type II pathway mediated by ABC transporters, such as ABCC1 or ANO6-dependent mechanisms, allowing its release without or ER involvement. Another route includes blebbing, where cytoplasmic proteins are packaged into blebs that pinch off to form microvesicles, as observed in some stress-induced secretions. Nonclassical targeting signals direct proteins to organelles via pathways distinct from ER signal peptides, often featuring internal or C-terminal motifs. Mitochondrial presequences, typically 20-80 amino acids long and forming amphipathic α-helices with positively charged and hydrophobic faces, guide matrix proteins across the outer and inner membranes via the and TIM complexes; while most are N-terminal, some proteins employ internal signals for intermembrane space localization. Peroxisomal targeting utilizes C-terminal signals like the SKL tripeptide (PTS1 ), recognized by the PEX5 receptor for import without membrane translocation or cleavage, or N-terminal PTS2 motifs for non-cleavable targeting. In bacteria, Type II, III, and IV secretion systems facilitate signal-less protein export using specialized apparatuses. The Type II system employs a pseudopilus driven by ATPases to push folded proteins across the outer membrane, independent of Sec-dependent signals, as seen in cholera toxin secretion by Vibrio cholerae. Type III systems form needle-like injectisomes powered by ATP hydrolysis, translocating effectors directly into host cells without classical signals, relying on chaperone-mediated recruitment, as in Salmonella's SPI-1 apparatus. Type IV systems, including conjugation machines and type IV pili, use ATPase-driven pilus assembly for protein or DNA transfer, exporting substrates like relaxases without N-terminal signals. Autophagy-mediated secretion in eukaryotes represents a vesicular UPS route (Type III), where leaderless proteins are sequestered into autophagosomes that fuse with the plasma membrane, releasing cargo extracellularly; this pathway is crucial for exporting binding protein (AcbA) in and has been implicated in mammalian IL-1β release under inflammatory conditions. Recent studies highlight viral exploitation of , such as SARS-CoV-2's ORF8 protein, which undergoes unglycosylated unconventional to modulate host immune responses and enhance pathogenicity. These alternate pathways often lack signal peptide cleavage and do not require canonical translocons like Sec61 or SecYEG, instead relying on direct membrane interactions, transporters, or vesicular intermediates for specificity and efficiency.

Prediction and Analysis

Computational Methods

Computational methods for predicting signal peptides primarily rely on bioinformatics tools that analyze sequences to identify characteristic motifs, such as the positively charged n-region, hydrophobic h-region, and polar c-region, often using algorithms. One of the seminal and widely adopted tools is SignalP, developed by the von Heijne laboratory, which has evolved from rule-based hidden Markov models (HMMs) and neural networks (NNs) in early versions to architectures in recent iterations. SignalP 6.0, released in 2021, employs a protein (ProtTrans) with a for to identify all five known types of signal peptides across , , and eukaryotes, achieving high accuracy on benchmark datasets. This tool outputs probability scores for signal peptide presence, type classification, and cleavage site position, enabling high-throughput annotation in pipelines. Recent advancements also incorporate structural predictions from tools like AlphaFold2 to validate signal peptide motifs by assessing 3D conformation compatibility with translocation. Advancements in have further enhanced prediction accuracy, particularly through transformer-based models that capture long-range dependencies in protein sequences. For instance, TSignal (2023) utilizes BERT-like protein models with dot-product to outperform traditional methods on eukaryotic and prokaryotic datasets, demonstrating improved for rare signal peptide variants. In , parameter-efficient (PEFT) frameworks applied to large protein models, such as PEFT-SP, have boosted signal peptide and cleavage site by leveraging low-rank adaptations on pretrained embeddings, achieving up to 5% higher specificity than SignalP 6.0 on diverse eukaryotic sequences without requiring full model retraining. These approaches prioritize conceptual features like hydrophobicity patterns over exhaustive benchmarks, focusing on transferable representations from vast protein corpora. Recent models have begun integrating nucleotide-level features to refine predictions, especially for eukaryotes, where signal sequence coding regions (SSCRs) exhibit low adenine (A) content and codon biases that facilitate mRNA nuclear export and co-translational targeting. By encoding DNA sequences alongside , these tools account for synonymous codon usage that influences efficiency and secretion pathway selection, leading to better performance in predicting context-dependent cleavage in mammalian systems. Training datasets for such models are drawn from curated resources like , which annotates over 12 million signal peptide entries across species (as of October 2025), and specialized databases such as SignalPDB, containing experimentally verified sequences from prokaryotes and eukaryotes. Performance is typically evaluated using (true positive rate) and specificity (true negative rate), with modern tools like SignalP 6.0 reporting balanced metrics above 0.95 on classical signal peptides in independent test sets. Despite these advances, computational methods exhibit limitations in handling nonclassical secretion signals, which lack canonical n-h-c motifs and rely on unconventional pathways; accuracy for these drops below 70% in standard predictors like SignalP, necessitating specialized tools such as for leaderless proteins. This shortfall underscores the need for hybrid approaches combining sequence motifs with structural predictions to improve overall reliability.

Experimental Approaches

Experimental approaches to identify and characterize signal peptides primarily involve and assays that monitor protein translocation, cleavage, and structural properties. These methods allow researchers to validate the presence and functionality of signal peptides in nascent proteins, often using model systems to dissect the mechanisms of . Key techniques include cell-free systems, reporter assays, proteomic analyses, studies, and biophysical . In vitro translation systems, such as wheat germ extracts or rabbit reticulocyte lysates supplemented with canine pancreatic microsomes, enable the study of co-translational translocation and . These cell-free setups incorporate radiolabeled amino acids, like [³⁵S]-methionine, to track the and of preproteins. Upon addition of microsomal membranes, translocation-competent nascent chains are sequestered into the , protected from digestion, and undergo signal peptidase-mediated , resulting in a size shift observable by autoradiography. This approach, pioneered in seminal reconstitution experiments, demonstrated that signal sequences direct ribosomal attachment to membranes and facilitate vectorial discharge of polypeptides. Fusion assays in vivo utilize reporter proteins, such as (GFP), fused to candidate signal peptides and expressed in eukaryotic cells like HEK293 or . Secretion efficiency is assessed by measuring extracellular GFP fluorescence via microscopy or, more quantitatively, by to detect surface-displayed or secreted fusions. For instance, N-terminal signal peptide-GFP constructs translocated to the lead to glycosylated, cleaved products that can be secreted, with quantifying the proportion of fluorescent cells or supernatant intensity. These assays confirm targeting specificity and cleavage in living systems, often revealing variations in translocation rates among different signal sequences. Proteomic methods, particularly liquid chromatography-tandem mass spectrometry (LC-MS/MS), facilitate large-scale identification of signal peptides through N-terminome analysis. Enrichment strategies, such as combined fractional diagonal chromatography (COFRADIC) or amine-reactive tags, isolate N-terminal peptides from complex proteomes, allowing detection of both cleaved mature proteins and uncleaved signal peptides retained in non-secreted forms. In lines, this approach has mapped thousands of N-termini, distinguishing signal peptide processing events and identifying atypical cleavages that affect . Such analyses provide for signal peptide diversity across proteomes. Mutagenesis studies employ to alter specific residues in signal peptides, testing their impact on translocation and cleavage motifs. or substitution of hydrophobic core residues, followed by translation or reporter assays, validates the necessity of amphipathicity and the -3, -1 cleavage rule. For example, mutations disrupting the alpha-helical propensity abolish ER targeting, as measured by reduced protease protection. Recent CRISPR-based functional genomic screens, including those perturbing signal peptidase components or accessory factors, have systematically identified genetic modifiers of secretion efficiency in mammalian cells since 2021. These high-throughput approaches complement targeted mutagenesis by revealing pathway-wide regulators. Biophysical techniques, including (CD) and , probe the structural features of signal peptides in membrane-mimetic environments. CD spectra in trifluoroethanol or lipid vesicles reveal alpha-helical content, with characteristic minima at 208 and 222 nm indicating amphipathic helix formation essential for SRP recognition. assays, using tryptophan-labeled peptides and membrane quenchers like , assess insertion depth and helix-membrane interactions by monitoring emission changes upon binding. These methods quantify secondary structure propensity, showing that efficient signal peptides adopt helical conformations in non-polar solvents.

Nomenclature and Applications

Terminology and Distinctions

The term signal peptide (often abbreviated as SP), interchangeably referred to as signal sequence or leader peptide, denotes the short N-terminal amino acid sequence that directs nascent proteins to the secretory or export pathway across cellular membranes in both prokaryotes and eukaryotes. This nomenclature emphasizes its role as a targeting motif, distinct from the polynucleotide leader sequences found in prokaryotic operons, such as the attenuator region of the trp operon, where short translated peptides regulate transcription termination rather than protein localization. To avoid confusion with these regulatory contexts, "signal peptide" is the preferred standard term for export-related sequences. According to IUPAC recommendations, a signal peptide is defined as any sequence of residues that, when linked to a newly synthesized protein, identifies it for transport mechanisms guiding it to a specific location among the organelles of a eukaryotic or from the to the periplasmic space of prokaryotic cells. These are generally cleaved presequences in the classical secretory pathway, removed by signal peptidase upon translocation. In contrast, uncleaved or differently processed transit peptides direct proteins to organelles like chloroplasts or mitochondria, serving as presequences that may remain partially attached or be cleaved by distinct proteases. Organism-specific variations in nomenclature reflect structural and processing differences. In , the full precursor is commonly called a preprotein or prepeptide, with the signal peptide cleaved to yield the mature form during export via Sec or Tat pathways. In eukaryotes, particularly for secreted hormones and neuropeptides, the precursor is termed preproprotein, where the "pre" segment is the cleavable signal peptide and the "pro" denotes an intervening prodomain processed later in the secretory pathway, as seen in preproparathyroid hormone or prepro-α-factor. Signal peptides must be distinguished from mitochondrial targeting signals (MTS), which are N-terminal presequences with overlapping amphipathic α-helical potential but distinct physicochemical properties: signal peptides exhibit a highly hydrophobic H-region optimized for membrane insertion in the or plasma membrane, whereas MTS are enriched in positively charged residues forming amphipathic helices for import into the . Signal peptides trace an ancient evolutionary origin, conserved across all domains of life—, , and Eukarya—as essential zip codes for protein sorting, underscoring their fundamental role predating eukaryotic endosymbiosis.

Therapeutic Targeting

In cancer therapy, inhibitors of protein translocation across the secretory pathway, such as CAM741, can disrupt of pro-angiogenic factors like (VEGF), potentially impeding tumor . These agents block signal peptide insertion into the Sec61 translocon, leading to cytosolic accumulation of VEGF precursors and downregulation of its expression in preclinical models. Biotechnological applications leverage engineered signal peptides to enhance recombinant protein production, particularly in bacterial systems like . Optimized signal peptides, such as variants of the PelB or OmpA sequences, have been designed through and screening libraries to improve translocation efficiency, resulting in up to 5-fold increases in periplasmic yields of therapeutic proteins like antibodies and enzymes. This approach minimizes intracellular aggregation and simplifies downstream purification. A key challenge in therapeutic targeting of signal peptides lies in achieving specificity to prevent off-target disruption of essential host pathways, which could lead to ER , immune activation, or in non-diseased tissues. Strategies like tumor-selective activation and structure-based design of inhibitors aim to mitigate these effects, though clinical translation requires further optimization to balance efficacy with safety profiles.

References

  1. [1]
    Post-Targeting Functions of Signal Peptides - NCBI - NIH
    They are usually 16 to 30 amino acid residues in length and comprised of a hydrophilic, usually positively charged N-terminal region, a central hydrophobic ...Introduction · Signal Peptides That Have a... · Signal Peptides Released...
  2. [2]
    A comprehensive review of signal peptides: Structure, roles, and ...
    Signal peptides (SP) are short peptides located in the N-terminal of proteins, carrying information for protein secretion. They are ubiquitous to all ...
  3. [3]
    Signal Peptides: From Molecular Mechanisms to Applications in ...
    Jun 18, 2025 · Typically, SPs are 15–30 amino acids in length and are recognized and cleaved by type I signal peptidase (SPI).2. Sp Structure · 4. Sp Applications · 4.3. Sps In Vaccine...
  4. [4]
    SignalP 6.0 predicts all five types of signal peptides using protein ...
    Jan 3, 2022 · Signal peptides (SPs) are short amino acid sequences that control protein secretion and translocation in all living organisms.
  5. [5]
    SPSED: A Signal Peptide Secretion Efficiency Database - PMC - NIH
    Dec 20, 2021 · Signal peptides (SPs) are short amino acid sequences that direct the linked proteins into the secretory pathway. SPs are found in the N-terminus ...Missing: definition | Show results with:definition
  6. [6]
    Bacterial Signal Peptides- Navigating the Journey of Proteins - PMC
    This review highlights the function of signal peptides in Gram-negative bacteria in protein sorting and targeting to the inner membrane, and translocation.
  7. [7]
    Mechanism of Protein Translocation by the Sec61 Translocon ...
    The signal sequence is cleaved by the signal peptidase (not shown) at some point during the translocation. The endoplasmic reticulum (ER)-resident Hsp70 BiP ...
  8. [8]
    Sec-Secretion and Sortase-Mediated Anchoring of Proteins in Gram ...
    Signal peptide-driven secretion of precursor proteins directs polypeptides across the plasma membrane of bacteria. Two pathways, Sec- and SRP-dependent, ...
  9. [9]
    Biosynthesis, structure, and folding of the insulin precursor protein
    Preproinsulin is comprised of the signal peptide (light blue), insulin-B chain (red), C-peptide (white), and insulin-A chain (green). Signal peptide mutations ...
  10. [10]
    [PDF] blobel-lecture.pdf - Nobel Prize
    Signal hypothesis 1975. Proposal of a protein-conducting channel (PCC). The PCC was envisaged to be assembled from integral membrane proteins. The signal ...
  11. [11]
    [PDF] transfer of proteins across membranes - UC Berkeley MCB
    Aug 23, 2007 · GONTER BLOBEL and BERNHARD DOBBERSTEIN. From The Rockefeller University, New York 10021. ABSTRACT. Fractionation of MOPC 41 DL-I tumors ...
  12. [12]
    Post-translational processing of full-length presecretory proteins with ...
    Post-translational processing of full-length presecretory proteins with canine pancreatic signal peptidase. ... Authors. R C Jackson, G Blobel. PMID: 6994559; DOI ...
  13. [13]
    Amino acid sequence for the peptide extension on the prolipoprotein ...
    The significance of the amino acid sequence is discussed in terms of the mechanism of biosynthesis and assembly of the lipoprotein in the E. coli outer membrane ...
  14. [14]
    Signal sequences. The limits of variation - PubMed
    The results show subtle differences between eukaryotic and prokaryotic sequences, but the general impression of signal sequences as being highly variable is ...
  15. [15]
    Patterns of amino acids near signal-sequence cleavage sites
    In this paper, some such patterns, based on a sample of 78 eukaryotic signal sequences, are presented and discussed, and a first attempt at formulating rules ...
  16. [16]
    Archaeal signal peptides—A comparative survey at the genome level
    In all cases, the shortest average signal peptides were predicted by the eukaryotic training system and the longest by the Gram-positive bacterial trained data ...Missing: variations | Show results with:variations
  17. [17]
    Amino acid sequence of the signal peptide of ompA protein, a major ...
    The structural gene for ompA protein, a major outer membrane protein of Escherichia coli, was cloned using the plasmid cloning vehicle, pMF21.
  18. [18]
    Signal Peptide - an overview | ScienceDirect Topics
    Signal peptides (SP) are short peptides located in the N-terminal of proteins, carrying information for protein secretion. They are ubiquitous to all ...
  19. [19]
    Extensive sequence turnover of the signal peptides of members of ...
    Jul 16, 2009 · A typical signal peptide (SP) involves a hydrophobic alpha-helical region which is called the h-region. This hydrophobic segment is ...
  20. [20]
    Functional limits of conformation, hydrophobicity, and steric ...
    Although previous studies based on Chou-Fasman analysis have suggested that the cleavage region forms a beta-turn which is required for transport, our results ...
  21. [21]
    Secondary Structure as a Conformational Determinant in Processing ...
    Turns or loops, often associated to exposed regions in proteins [11,. 12], possess a greater degree of flexibility and local mobility, compared to more rigid ...
  22. [22]
    Signal Peptide-rheostat Dynamics Delay Secretory Preprotein Folding
    All are composed of 3 parts: N-terminal (N-), hydrophobic helical core (H-) and flexible C-terminal (C-) regions (Figure 1(A);.20 N-regions are ∼ 5 residues ...
  23. [23]
    Structure analysis of the membrane-bound PhoD signal peptide of ...
    Our present results suggest that instead an amphiphilic α-helix may get embedded in the membrane as the first step of protein translocation. Such an amphiphilic ...
  24. [24]
    Comparison of In Silico Signal Sequence-Phospholipid Results with ...
    Mar 25, 2024 · The study presented here indicates not only that there is a direct signal sequence–phospholipid interaction but also that the corresponding signal peptides can ...
  25. [25]
    Clearing tail-anchored proteins from mitochondria - PNAS
    May 22, 2014 · The transmembrane domains of mitochondrial tail-anchored proteins are shorter than their ER counterparts and are often flanked by positively ...Missing: peptides non-
  26. [26]
    tail-anchored proteins and their membrane insertion pathways
    Tail-anchored (TA) proteins have a TMD near their C-terminal end, and their C-terminal stretch is no longer than 30 amino acids. They are released when their ...Missing: non- | Show results with:non-
  27. [27]
    Review Co-translational targeting and translocation of proteins to ...
    ▻ SRP receptor transfers the ribosome nascent chain complex from SRP to the translocon. ▻ The Sec61 channel conveys the nascent chain from the ribosome directly ...
  28. [28]
    Article SecA Interacts with Ribosomes in Order to Facilitate ...
    Feb 4, 2011 · The posttranslational translocation machinery consists minimally of two components: the SecYEG translocon and the ATPase SecA. SecYEG forms an ...
  29. [29]
    kinetic and thermodynamic role of ATP and the protonmotive force
    Translocation requires ATP for initiation and SecA protein. Most further steps are driven by the protonmotive force (Δp).
  30. [30]
  31. [31]
    [PDF] In Vitro Protein Translocation across the Yeast Endoplasmic Reticulum
    The in vitro synthesized precursor of the a-factor pheromone, prepro-a-factor, of Saccharomyces cere- visiae was translocated across yeast microsomal.
  32. [32]
    Cotranslational folding of alkaline phosphatase in the periplasm of ...
    Aug 13, 2020 · Post-translational export commences when SecA recognizes a secretory protein via its signal sequence, and targets it to the inner membrane.
  33. [33]
    Signal Peptide Efficiency: From High-Throughput Data to Prediction ...
    Jan 17, 2023 · The Escherichia coli genome encodes at least 29 putative signal peptides contg. a twin arginine motif characteristic of proteins exported via ...
  34. [34]
    Signal peptides for recombinant protein secretion in bacterial ...
    Mar 29, 2018 · Signal peptides are short amino-terminal parts of exported precursor proteins that direct the respective proteins to the protein export systems ...Missing: seminal | Show results with:seminal
  35. [35]
    Signal peptide cleavage regions: limits & implications
    Analysis of these mutants revealed that cleavage regions ranging from 3 to 9 residues are completely and efficiently processed. The extent of processing drops ...
  36. [36]
    Characterization of missense mutations in the signal peptide and ...
    Aug 7, 2020 · Mutations in the signal peptidase recognition site interfere with protein secretion, expression, and γ-carboxylation efficiency by preventing ...Abstract · Materials and methods · Results · Discussion
  37. [37]
    A New Type of Signal Peptidase Cleavage Site Identified in an RNA ...
    Insertion of alanine at position 191 of the Erns sequence was shown before to reduce the efficiency of membrane binding from about 87% (wild type) to ∼30% (32).
  38. [38]
    Novel signal peptides improve the secretion of recombinant ...
    May 12, 2017 · The data showed that some of the modified signal peptides improved the secretion efficiency of AT H35L compared with E. coli signal peptides from dsbA, pelB ...<|control11|><|separator|>
  39. [39]
    Effects of Temperature and pH on Recombinant Thaumatin II ... - MDPI
    The environmental conditions selected for fermentation can have a direct effect on cell growth, secretion yield of the recombinant protein and product stability ...
  40. [40]
    Comparison of secretory signal peptides for heterologous protein ...
    Feb 6, 2018 · ... secretion from C. reinhardtii is deeply influenced by the SP used; our results showed that secretion varied 10-fold. Therefore, the ...
  41. [41]
    Unconventional protein secretion: an evolving mechanism
    May 10, 2013 · The process by which proteins are secreted without entering the classical endoplasmic reticulum (ER)–Golgi complex pathway, in eukaryotic ...
  42. [42]
    Diversity in unconventional protein secretion | Journal of Cell Science
    Nov 15, 2012 · Eukaryotic cells use the endoplasmic reticulum (ER)-to-Golgi membrane pathway for the secretion of the vast majority of secretory proteins.
  43. [43]
    Unconventional Pathways of Protein Secretion: Mammals vs. Plants
    Apr 27, 2022 · In this review we describe the mammals and plants UPS pathways identified so far highlighting commonalities and differences.
  44. [44]
    The unconventional secretory machinery of fibroblast growth factor 2
    FGF2 represents an example of a non-vesicular mechanism of unconventional secretion. Here, the author discusses the current knowledge about the molecular ...
  45. [45]
    [PDF] Unconventional protein secretion: From basic mechanisms to ... - HAL
    Jun 9, 2023 · In types I and II UPS, cargo proteins are directly translocated across the PM by pore formation or. ABC transporters, respectively. In type III ...<|control11|><|separator|>
  46. [46]
    [PDF] Pathways of Unconventional Protein Secretion - Hubrecht Institute
    Leaderless proteins are secreted along three pathways mediated by distinct mechanisms: Type I, or pore-mediated transloca- tion across the plasma membrane; Type ...Missing: blebbing | Show results with:blebbing
  47. [47]
    An internal targeting signal directing proteins into the mitochondrial ...
    The ability of most presequence peptides to form amphipathic α-helices is thought to be important for their recognition by the translocation machineries in the ...
  48. [48]
    Import mechanism of peroxisomal proteins with an N-terminal signal ...
    May 9, 2025 · Most peroxisomal matrix proteins contain a type 1 peroxisomal targeting signal (PTS1) that is located at their carboxy (C) terminus and ...Missing: internal presequences
  49. [49]
    The structure and mechanism of the bacterial type II secretion system
    Dec 7, 2020 · The type II secretion system (T2SS) is a multi-protein complex used by many bacteria to move substrates across their cell membrane.
  50. [50]
    Primary architecture and energy requirements of Type III ... - Frontiers
    Many pathogens use Type III and Type IV protein secretion systems to secrete virulence factors from the bacterial cytosol into host cells.
  51. [51]
    Type IV secretion systems: from structures to mechanisms
    Oct 9, 2025 · Type IV secretion systems (T4SS) are complex multiprotein machineries found in many bacteria, with the unique ability to translocate both DNA ...
  52. [52]
    Facets of Autophagy Based Unconventional Protein Secretion–The ...
    Dec 8, 2020 · Autophagy-based unconventional protein secretion (UCPS) is a form of autophagy where cargo is secreted out instead of being degraded, unlike ...Abstract · Introduction · Autophagy Dependent... · Secretory Versus Degradative...
  53. [53]
    Unconventional secretion of unglycosylated ORF8 is critical for the ...
    We show that open reading frame 8 (ORF8) acts as messenger for inter-cellular communication between alveolar epithelial cells and macrophages during SARS-CoV-2 ...
  54. [54]
    Unconventional protein secretion - new insights into the ...
    Aug 6, 2025 · Four UPS secretion mechanisms have been proposed; direct secretion through a pore at the plasma membrane, secretion through an ABC transporter, ...
  55. [55]
    TSignal: a transformer model for signal peptide prediction
    Jun 30, 2023 · We introduce TSignal, a deep transformer-based neural network architecture that utilizes BERT language models and dot-product attention techniques.Abstract · Introduction · Materials and methods · Results
  56. [56]
    Parameter-efficient fine-tuning on large protein language models ...
    We present a parameter-efficient fine-tuning (PEFT) framework for SP prediction, PEFT-SP, to effectively utilize pretrained PLMs.
  57. [57]
    The Signal Sequence Coding Region Promotes Nuclear Export of ...
    Dec 4, 2007 · The SSCR export signal appears to be characterized in vertebrates by a low content of adenines, as demonstrated by genome-wide sequence ...
  58. [58]
    Signal peptide | UniProt help
    May 18, 2025 · Signal peptides are found in proteins that are targeted to the endoplasmic reticulum and eventually destined to be either secreted/extracellular/periplasmic/ ...
  59. [59]
    Signal Peptide Database
    Signal sequences have a tripartite structure, consisting of a hydrophobic core region (h-region) flanked by an n- and c-region. The latter contains the ...Database search · Search my Protein Search the... · UniProt KnowledgebaseMissing: NHC | Show results with:NHC
  60. [60]
    Transfer of proteins across membranes. II. Reconstitution of ...
    Dec 1, 1975 · Taken together, these results establish unequivocally that the information for segregation of a translation product is encoded in the mRNA ...
  61. [61]
    Signal Sequence Is Sufficient for Green Fluorescent Protein to Be ...
    The studies of routing using GFP fused to secreted proteins and peptides have yielded mixed results. ... Flow Cytometric Analysis and Sorting of Live Male Rat ...
  62. [62]
    Comprehensive analysis of human protein N-termini enables ...
    Jul 26, 2017 · Cleavage of the signal peptide generates a new N-terminus. The isoforms arising from alternative splicing, promoter and translation initiation ...
  63. [63]
    Defining a Conformational Consensus Motif in Cotransin-Sensitive ...
    Defining a Conformational Consensus Motif in Cotransin-Sensitive Signal Sequences: A Proteomic and Site-Directed Mutagenesis Study. Wolfgang Klein, Carolin ...
  64. [64]
    A genome-wide CRISPR screen identifies host factors that regulate ...
    Feb 11, 2021 · A CRISPR screen defines a signal peptide processing pathway required by flaviviruses. Nature 535, 164–168 (2016). Article ADS CAS PubMed ...Missing: secretion | Show results with:secretion
  65. [65]
    Helix formation and stability in a signal sequence - PubMed - NIH
    ... circular dichroism spectra to mimic the conformational distribution of this peptide in membranelike environments has provided a description of specific residue
  66. [66]
    signal peptide (13677) - IUPAC Gold Book
    Any sequence of amino acid residues that, when linked to a newly synthesized protein, identifies it to transport mechanisms that guide the protein to a ...
  67. [67]
    Leader sequences are not signal peptides | Nature Biotechnology
    Leader sequences are polynucleotide regions located between the promoter and the coding region and are involved in the regulation of gene expression.
  68. [68]
    Signal Peptide - an overview | ScienceDirect Topics
    Signal peptides are short peptides located at the N-terminal of proteins that carry information for protein secretion. They are found in both prokaryotes ...
  69. [69]
    Characterization of signal and transit peptides based on motif ...
    The most common targeting peptides are signal peptides (SPs) and transit peptides (TPs), hereafter also collectively referred to as presequences.
  70. [70]
    Bacterial Signal Peptides- Navigating the Journey of Proteins
    This review highlights the function of signal peptides in Gram-negative bacteria in protein sorting and targeting to the inner membrane, and translocation.Missing: definition | Show results with:definition
  71. [71]
    Synthetic Substrate for Eukaryotic Signal Peptidase. Cleavage of a ...
    Sep 25, 1989 · A synthetic peptide analog of the precursor region of preproparathyroid hormone has been shown to be a specific substrate for hen oviduct ...
  72. [72]
    The similarity between N-terminal targeting signals for protein import ...
    However, the targeting signals for mitochondria, chloroplasts, or the ER appear structurally similar, because they all involve an α-helical domain in proximity ...
  73. [73]
    Mitochondrial Targeting Signal - an overview | ScienceDirect Topics
    Mitochondrial targeting signals are rich in basic amino acids, which can form amphipathic α-helices. Majority of mitochondrial proteins have N-terminal ...
  74. [74]
    A comprehensive review of signal peptides: Structure, roles, and ...
    Signal peptides (SP) are short peptides located in the N-terminal of proteins, carrying information for protein secretion. They are ubiquitous to all ...
  75. [75]
    Protein lipidation in health and disease: molecular basis ... - Nature
    Mar 15, 2024 · For instance, the recently identified compound IMP-1088 acts as a novel NMT inhibitor. It effectively hinders the assembly and replication of ...
  76. [76]
  77. [77]
    Inhibitors of protein translocation across membranes of the secretory ...
    Jan 5, 2018 · The lumenal signal peptidase complex (SPC) cleaves signal peptides (orange box) from the mature protein, while uncleaved signal anchors ...<|control11|><|separator|>
  78. [78]
    Enhancing Recombinant Protein Yields in the E. coli Periplasm by ...
    We present a combined screen involving different signal peptides and varying production rates that enabled the identification of more optimal conditions for ...
  79. [79]
    CRISPR Advancements in Correcting Protein Misfolding
    By directly targeting specific genetic mutations that cause protein misfolding, CRISPR could help address the core causes of neurodegenerative diseases, leading ...
  80. [80]
    Evaluation of cationic peptide-based nanogels as delivery systems ...
    Oct 22, 2025 · This study explores the formulation, decoration strategies, and structural properties of NGs derived from mixed hydrogel (HG) matrices of Fmoc- ...