Spider silks: recombinant synthesis, assembly, spinning, and engineering of synthetic proteins
© Scheibel. 2004
Received: 11 October 2004
Accepted: 16 November 2004
Published: 16 November 2004
Skip to main content
© Scheibel. 2004
Received: 11 October 2004
Accepted: 16 November 2004
Published: 16 November 2004
Since thousands of years humans have utilized insect silks for their own benefit and comfort. The most famous example is the use of reeled silkworm silk from Bombyx mori to produce textiles. In contrast, despite the more promising properties of their silk, spiders have not been domesticated for large-scale or even industrial applications, since farming the spiders is not commercially viable due to their highly territorial and cannibalistic nature. Before spider silks can be copied or mimicked, not only the sequence of the underlying proteins but also their functions have to be resolved. Several attempts to recombinantly produce spider silks or spider silk mimics in various expression hosts have been reported previously. A new protein engineering approach, which combines synthetic repetitive silk sequences with authentic silk domains, reveals proteins that closely resemble silk proteins and that can be produced at high yields, which provides a basis for cost-efficient large scale production of spider silk-like proteins.
Among the various spider silks the major ampullate (MA) silk, which forms the primary dragline, is extremely tough. MA silk reveals a tensile strength that is comparable to Kevlar (4 × 109 N/m2) coupled with a reasonable viscoelasticity (dragline 35 %, Kevlar 5 %). Spiders use dragline silk as a strong yet flexible structural element in the web, providing a framework to which other silks are attached, and as a life line when a spider is dropping off to escape an enemy. Minor ampullate (MI) silk, used for structural reinforcement in construction of the web, has a similar high tensile strength in comparison to major ampullate silk but has little elasticity [7, 8]. Due to the low elasticity of MI silk it is irreversibly deforming when stretched. An orb web's capture spiral, in part composed of viscid silk formed by the flagelliform gland, which is therefore named flagelliform silk, is stretchy and can triple in length before breaking, but provides only half the tensile strength of major ampullate silk . The combination of strength and stretchiness gives the capture spiral a toughness (energy to break) greater than elastin, tendon, silkworm silk, bone, synthetic rubber, Kevlar, and high-tensile steel.
Spider silks are protein polymers that display extraordinary physical properties [1–4, 8], but there is only limited information on the composition of the various silks produced by different spiders. Among the different types of spider silks, draglines from the golden orb weaver Nephila clavipes and the garden cross spider Araneus diadematus are most intensely studied. Dragline silks are generally composed of two major proteins [5, 10–13] and it remains unclear whether additional proteins play a significant role in silk assembly and the final silk structure. The two major protein components of draglines from Nephila clavipes are termed MaSp1 and MaSp2 (Major ampullate Spidroins) and from Araneus diadematus ADF-3 and ADF-4 (Araneus Diadematus Fibroin). The dragline silk proteins have apparent molecular masses between 180 kDa and 720 kDa depending on the conditions of analysis [14–16]. It is assumed that, based on amino acid composition, within the dragline fiber the molecular ratio between MaSp1 and MaSp2 and between ADF-4 and ADF-3 is approximately 3 to 2 [10, 11, 17].
So far the largest sequence information could be obtained for flagelliform silk from Nephila clavipes (Figure 2B). This flagelliform silk protein is translated from a ~15.5 kb mRNA transcript originating from a 30 kb Flag locus [9, 23]. The coding sequence is divided into 13 exons. The NR amino-terminal region is split between exons 1 and 2. All of the other exons are found to encode exactly one repeating unit, built from the described motifs (Figure 2B). The final exon 13 in addition includes the NR carboxyl-terminal region.
On the basis of several studies, the major categories of peptide motifs in spider silk proteins have been assigned structural roles [24–28]. The GPGXX motif has been suggested to be involved in a β-turn spiral, probably providing elasticity, based on structures of comparable proteins [29–32]. If elasticity is due to GPGXX β-spirals, then this motif should be found in the more elastic silks. Flagelliform silks, which show the highest elasticity with more than 200 %, consist of contiguous repeats of this motif for at least 43 times in each repeating unit (Figure 2B). The only non-flagelliform silk proteins with GPGXX motifs are MA proteins MaSp2, ADF-3, and ADF-4, which also display some viscoelasticity. In accordance to the lower elasticity of dragline silk in comparison to flagelliform silk the number of tandemly arrayed repeats depicts at most 9 concatenated GPGXX motifs before interruption by another motif [1, 21]. Alanine-rich motifs contain typically 6–9 alanine residues and have been found to form crystalline β-sheet stacks leading to tensile strength [6, 24, 25, 12]. The MA and MI silks are both very strong, and at least one protein in each silk (there are always pairs) contains the An or (GA)n motif. Interestingly, this motif is not found in flagelliform silks. A glycine-rich 31-helix is adopted by the GGX motif forming an amorphous matrix that connects crystalline regions and that provides elasticity [26, 33, 34]. The postulated GGX motif is widely distributed and this motif can be found in MA, MI and flagelliform silks (Figure 2A). Several groups have suggested that the motifs GPGXX and GGX might be involved in forming an amorphous matrix, which would provide the elasticity of the fiber. The spacers contain charged groups and separate the iterated peptide motifs into clusters. Non-repetitive termini are common to all sequenced MA, MI and flagelliform silks belonging to the Araneoidea family with highly conserved carboxyl-terminal sequences [19, 35, 36]. The structural impact of the spacer and terminal regions is so far undetermined . Recent findings on single NR-regions of ADF-3 and ADF-4 (without additional repeating units) revealed a secondary structure comprising α-helices as determined by Circular Dichroism and they seem to retain this structural feature in proteins that additionally contain repeating units . It can be speculated that the α-helical NR carboxyl-termini might play a crucial role during assembly of the silk fiber [19, 36, 38].
Silk assembly in vivo is a remarkable process. For instance, dragline silk proteins are stored at concentrations up to 50 % (w/v) in the respective glands . This highly concentrated protein solution forms the silk dope (spinning solution), which displays properties of a liquid crystal [40–42]. Therein, the polyalanine motifs are thought to adopt an α-helical conformation, while the glycine-rich motifs form either β-turns or random coil conformation [39, 43, 44].
Thread assembly is initiated during a passage of the silk dope through the spinning duct accompanied by extraction of water, sodium and chloride [45, 46]. Simultaneous secretion of potassium and hydrogen ions into the lumen of the duct lowering the pH from 6.9 to 6.3 is thought to initiate partly unfolding of the proteins by disrupting their water shell and altering coulombic forces [42, 45–48]. The silk proteins are thought to extend somewhat, align and get packed much closer in the extensional flow-field of the draw-down taper found in the distal part of the duct. As the hydrophobic polyalanine segments of the silk proteins align and are drawn closer together by extensional flow, they are exposed to an increasingly hydrophobic environment, which might trigger their conversion from an α-helical to a β-pleated structure resulting in the formation of numerous interchain hydrogen bonds. The latter would act as multifunctional crosslinks at nodes between the more mobile glycine-rich segments. Thus the assembly of the thread can be seen as a liquid-crystalline phase transition involving separation into polymer-rich and solvent-rich phases .
While some aspects of spider silk assembly have been unraveled, the contribution of the individual silk proteins to the assembly process still needs to be resolved in more detail. Comparative studies of the two major dragline silk proteins of Araneus diadematus, ADF-3 and ADF-4, revealed that, although their amino acid sequences are rather similar , they display remarkably different solubility and assembly characteristics: While ADF-3 is soluble even at high concentrations , ADF-4 is virtually insoluble and self-assembles into filamentous structures under specific conditions . At a closer look, the different solubilities of ADF-3 and ADF-4 could be explained by the hydrophobicities of the two proteins. The hydrophilic ADF-3 interacts favourably with the aqueous solvent and thus remains soluble under most conditions. In contrast, the more hydrophobic ADF-4 favours interactions with other protein molecules and thus tends to aggregate. Interestingly, all pairs of dragline silk proteins from different spider species display a common distinct distribution of hydrophobicity. In direct comparison, MaSp1 / ADF-4 proteins generally display relatively high hydrophobicity, while the corresponding MaSp2 / ADF-3 partner protein is more hydrophilic .
Laboratory-scale production of spider silk would initiate a new generation of ecological materials. Spider silk is for instance a promising tool with broad usability in medical devices. In the middle ages spider webs were used as wound dressing – some reports are even dated back to ancient Greek and Roman cultures. Silkworm silk does not cause allergic reactions and it is thought that spider silk behaves similarly . The unmatched toughness of spider silk would allow to improve several medical products such as wound closure systems, band-aids, and extremely thin sutures for neurosurgery. Additionally, spider silks can be further used in artificial ligaments and tendons for durable implants. High performance fibers built from spider silks can be employed in several technical and industrial applications. In addition to specialty ropes and fishing nets, spider silk can be utilized for parachutes, ballistic applications (body armor), sporting goods, textiles, and lightweight constructions for airplanes [52, 53]. Therefore, one day industrially produced spider silk could out-compete man-made fibers.
Recombinant production of spider silk proteins has been complicated by the highly repetitive nature of the underlying genes, by their high gc-content, by the length of the constructs, and by the specific codon usage of spiders. In first studies, in vitro translation of mRNA from excised major ampullate glands of Nephila clavipes was performed using tRNA from E. coli, but translation was discontinuous [14, 54]. In the era of recombinant proteins and genetic engineering one would envisage to easily produce spider silk proteins (mainly from draglines) in microbes or cell culture. Unfortunately, no dragline silk gene has been cloned in its entirety and only sequence data from the 3' end of partial cDNA clones of dragline genes from Nephila clavipes and Araneus diadematus and other spiders have been reported [10, 11, 20–22]. Therefore, all recent studies used partial cDNA constructs of dragline silk genes to produce recombinant silk proteins in E. coli , in MAC-T (bovine) and BHK (hamster) cells , or in insect cell lines from Spodoptera frugiperda using the baculovirus expression system . The most promising expression system seems to be the baculovirus system, since it was possible to efficiently produce dragline silk components at a high yield.
Cloning strategies for designing genes for bacterial, yeast or plant expression have been developed to produce recombinant silk-like proteins closely resembling natural dragline [29, 36, 56–62] or flagelliform silk proteins . Since gene manipulation and amplification of spider silks is difficult by PCR due to the repetitive nature of silk genes, cloning strategies involved engineering of synthetic DNA modules. These modules were optimized for the codon usage adapted by the corresponding expression host. The use of synthetic modules constructed from small size oligonucleotides repeats has allowed control over primary gene and protein sequence and final protein size. Tobacco, potatoes, the yeast Pichia pastoris and mainly E. coli have been utilized as expression hosts for synthetic genes yielding proteins with up to 150 kDa [29, 56–62]. Unfortunately, expression levels from the synthetic genes have been low and mostly the recombinant silk proteins represented only up to 5% of the total protein in the cell . Although once production levels of up to 1000 mg/l of cell culture have been reported , large losses in yield are encountered during purification due to precipitation and non-specific interactions. For the microbial expression systems, yields of purified proteins have been generally in the 10 to 40 mg/l range (>90% purity) [[55, 56, 59, 60], summarized in ].
To mimic the repetitive sequence of ADF-3, two modules have been designed. The sequence of one module, termed A, was derived from the poly-alanine containing consensus sequence of ADF-3 (Figure 3A). The sequence of a second module termed Q contained four repeats of the GPGQQ motif. In a first cloning step the spacer region of the cloning vector was replaced by one of the synthesized DNA modules. Subsequently two modules could be joined in a site-directed way. To study different length repeat units, one or two Q modules were combined with one A module to obtain (AQ) (Figure 3B) or (QAQ) (Figure 3C). The complementary 3'-single strand extensions gg (sense) and cc (antisense) were used for connecting two modules (Figure 3B). Thus the DNA sequence required to link two modules was confined to a glycine codon (ggn). Glycine is naturally abundant in spider silk proteins (~30%), therefore modules could be designed to match authentic amino acid sequences. Since the arrangement of the cloning cassette's elements remained unchanged upon cloning, repeat units could be multimerized to generate synthetic genes coding for the repetitive proteins (rep-proteins) (AQ)12 and (QAQ)8 (Figure 3C).
The repetitive part of ADF-4 is generally composed of a single conserved repeat unit displaying only slight variations. These variations were combined and one consensus module termed C has been designed (Figure 3A), which was multimerized to obtain the rep-protein C16 (Figure 3C).
ADF-3 and ADF-4 both display NR-regions at their carboxyl termini, comprising 124 and 109 amino acids respectively. Gene sequences coding for these regions were amplified by PCR, and codons problematic for bacterial expression were changed to more suitable codons by site directed mutagenesis. In the described system, all synthetic genes could be combined with the appropriate authentic NR-regions. Additionally NR3 and NR4 could be expressed individually. All constructs could be purified by a heat step followed by an ammonium sulfate precipitation , which has been employed in previous studied for purifying spider silk proteins [35, 62].
Based on this protein engineering approach, which combines synthetic repetitive sequences with authentic NR-regions, proteins closely resembling authentic silk proteins could be produced at high yields. Bacterial production in Erlenmeyer flasks yielded similar protein amounts for all constructs. Yields of individual preparations ranged from 10 to 30 mg of purified protein per liter of culture medium. Fermentation of cells increased the yield of purified protein to 140 and 360 mg/l (purity >95%). Therefore, the established bacterial expression system provides the basis for cost-efficient large scale production of spider silk-like proteins.
Phosphate, like other lyotropic ions, is known to increase the surface tension of water, promoting hydrophobic interactions . In the case of spider silk proteins it is likely that the addition of phosphate initiates interactions between the hydrophobic poly-alanine motifs, causing the aggregation of the proteins. Accordingly aggregation of polyalanine-rich proteins is pronounced in comparison to synthetic silks which contained one third less poly-alanine motifs . Strikingly, recombinant spider silk proteins are highly soluble in most aqueous solutions, but form nanometer-sized fibers upon addition of methanol, phosphate or other suitable ions (Figure 4A).
A remaining critical step concerning commercial production of silk fibers is the successful spinning of recombinant proteins into fibers resembling the natural silks in their microstructure and in their mechanical properties, which are outstanding by any measure. Besides the chemical parameters discussed above, several mechanical parameters play important roles in generating silk. To draw silk under natural spinning conditions, spiders attach their dragline to an object with glue from the piriform glands, before drawing the silk out by moving away or by descending and using their weight to draw the silk. It is common practice to take advantage of the drawing process by the forced silking of captive animals to collect silk for experiments. Analysis of the differences between naturally and forcibly spun dragline silk provided evidence for discrepancies in their material properties [44, 68, 69]. Forced spinning under spinning speeds ranging from 0.1 to 400 mm/s and temperatures ranging from 5 to 40°C revealed dramatic differences in strain at breaking, breaking energy, initial Young's modulus and point of yielding . Therefore, in case of spinning recombinant silk proteins in vitro several aspects have to be taken into account to gain materials with expected properties.
Several attempts are reported in the literature and even more have been performed to wet-spin recombinant spider silk proteins. In a first attempt, microfabricated spinnerets were constructed using silicon microfabrication methods [71–73]. These spinnerets allowed for the production of meters of silk fibers from solutions containing as little as 10 mg of protein. First the spinneret was validated and tested by producing fibers from dissolved silk from the silkworm Bombyx mori , before solubilized dragline silk from Nephila clavipes was wet-spun . The diameters and mechanical properties of the regenerated silkworm silks converged the native silk ones. However, the wet-spun spider silks exhibited diameters of about 40 μm compared to the natural fiber diameter of 2.5 to 4.0 μm with mechanical properties that did not match the natural ones . Other attempts of wet-spinning revealed fiber diameters of approximately 10 – 60 μm [49, 74]. These fibers were subjected to either single or double postspinning draw, first in 70 to 80 % methanol (single and double draw) and then in water (double draw only) to increase their mechanical properties. Fibers subjected to higher draw ratios displayed greater toughness, tenacity, and modulus values . However, even the best values obtained by such technique were in the range of the regenerated Nephila fibers , but lower than the reported values for natural dragline silks .
Since the physical and chemical properties of bio-polymers and their assembly processes depend on the amino acid composition of the underlying polypeptide, engineering "synthetic" proteins with specific structural features will create a new class of fibrous proteins. However, to design new biomaterials based on spider silk, all properties of the underlying proteins have to be analyzed and in the best case successfully mimicked . Therefore, the crucial design features of both the feedstock of the dope and the spinning process have to be closely adopted, which would allow for managing the commercial production of new materials.
The work is supported by the Deutsche Forschungsgemeinschaft and the Fonds der Chemischen Industrie. I would like to thank Bettina Richter for scanning electron microscopy and Daniel Huemmerich for atomic force microscopy. Further I acknowledge Daniel Hümmerich and Christian Ackerschott for critical comments on the manuscript and the members of the fiberlab for inspiring discussions.
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.