Research | Open | Published:
Engineering an efficient secretion of leech carboxypeptidase inhibitor in Escherichia coli
Microbial Cell Factoriesvolume 8, Article number: 57 (2009)
Despite advances in expression technologies, the efficient production of heterologous secreted proteins in Escherichia coli remains a challenge. One frequent limitation relies on their inability to be exported to the E. coli periplasm. However, recent studies have suggested that translational kinetics and signal sequences act in concert to modulate the export process.
In order to produce leech carboxypeptidase inhibitor (LCI) in the bacterial periplasm, we compared expression of the natural and optimized gene sequences, and evaluated export efficiency of LCI fused to different signal sequences. The best combination of these factors acting on translation and export was obtained when the signal sequence of DsbA was fused to an E. coli codon-optimized mature LCI sequence. When tested in high cell density cultures, the protein was primarily found in the growth medium. Under these conditions, the engineered expression system yields over 470 mg.l-1 of purified active LCI.
These results support the hypothesis that heterologous secreted proteins require proper coupling between translation and translocation for optimal high-level production in E. coli.
Escherichia coli is by far the simplest, but one of the most widely used host cell for the production of recombinant proteins . Nevertheless, the efficient translocation across the inner membrane and proper periplasmic folding of eukaryotic proteins stabilized by multiple disulfide bonds remains challenging for this organism . Unfortunately, many proteins of which there is a great biotechnological or biomedical interest are secreted proteins containing essential disulfide bonds for their native structure. Either premature cytoplasmic protein folding or incorrect disulfide bond formation in the bacterial periplasm are two known limitations in the overproduction of secreted proteins . Recently, it has been reported that signal sequences promoting co-translational translocation improved the translocation of heterologous proteins . Therefore, targeting these recombinant precursors to the cotranslational signal recognition particle (SRP) dependent pathway conceivably could result in much higher levels of periplasmic proteins than directing them posttranslationally to the SecYEG translocase . Strategies to overcome folding problems due to disulfide bond formation have primarily focused on the co-production of protein disulfide isomerases . For example, the overproduction of DsbC, a periplasmic thiol isomerase, resulted in large amounts of native human tissue plasminogen activator .
In the present study, we have investigated the production of leech carboxypeptidase inhibitor (LCI) in the periplasm of E. coli. This protein is composed of 66 amino acid residues forming a globular domain with five-stranded β-sheet and a short α-helix that are stabilized by four disulfide bonds . Like other small disulfide-rich proteins, the active conformation of LCI is strictly dependent upon the correct formation of disulfide bonds . Found in the digestive track of leeches, LCI is a strong inhibitor of human pancreatic and plasma carboxypeptidases, and thus has considerable biomedical interest . Indeed, by targeting the thrombin-activatable fibrinolysis inhibitor (TAFI) involved in hemostasis, LCI could play an important role in thrombotic disorder therapy . The binding and inhibition activity of LCI is primarily exerted by its C-terminal extremity that interacts with the active site of metallo-carboxypeptidases. In order to overproduce LCI in the periplasm, an E. coli codon-optimized sequence and different signal sequences were evaluated using a tightly controlled expression vector, suitable for high cell density cultures.
Results and Discussion
Construction of LCI precursors
Native LCI has been previously produced in E. coli using the signal sequence of OmpA (OmpAss), but the low yield of secreted protein and plasmid instability precluded any development requiring large-scale production . We decided to investigate several expression parameters in order to improve the production of LCI in the periplasm of E. coli. The gene encoding the mature LCI protein (LCIN), as determined from the medical leech Hirudo medicinalis , contains 7 codons that are used at a frequency below 8 per 1000 in E. coli . We hypothesized that a biased codon usage might limit its expression level. Since other non-optimal codons for E. coli are also present in LCIN, a whole synthetic coding sequence with codon usage optimized for E. coli was designed. The resulting LCIO sequence contains 44 codon changes over the 66 codons (figure 1). Both LCIN and LCIO were subcloned under the control of the tightly regulated araB promoter from the pLCB vector encoding chloramphenicol resistance. This expression vector, derived from the pBAD33 plasmid , was constructed for achieving high cell density cultures. In a first attempt, we retained the original OmpAss to evaluate the effect of codon optimization. Next, we investigated whether the limiting expression levels were only governed by its mature sequence, and tested the production of LCIN and LCIO fused either to the signal sequence of DsbA (DsbAss) or MalE (MalEss), two well-studied periplasmic proteins of E. coli. We chose these signal sequences because there are known to direct co- and post-translational export of heterologous precursors through the SRP- and Sec-dependent pathways, respectively . Besides translocation modes, it was recently shown that a biased codon usage in signal sequences may also play a role in the coupling of translation to protein export, by slowing down the translation rate . Therefore, we decided to use the natural signal sequences of E. coli for targeting the various LCI precursors (preLCI) to the SecYEG translocase.
MalEss and DsbAss promote high-level of LCI in the periplasm
All expression vectors were transformed into the LMG194 strain in which arabinose is not catabolized because of a deletion encompassing most of the araBAD operon . The corresponding cells were cultured at 37°C in LB media supplemented with chloramphenicol. After 2 h of induction, LCI expression appeared to slow down cell growth progressively (figure 2). When growth stops, a higher density was achieved for cells expressing LCI fused to MalEss or DsbAss than for those expressing LCI fused to OmpAss, independent of codon optimization. This observation suggested that the physiological state of the corresponding cells could be affected. We took advantage of the carboxypeptidase inhibitory activity of LCI as a reliable indicator of active protein to detect its presence in both whole cell lysates and culture supernatants. Indeed, it was previously reported that periplasmic LCI could be released into the growth medium , as frequently observed for small exported proteins . Table 1 shows the distribution of active LCI between cells and culture supernatants determined from the six expression vectors. Cells expressing LCI fused either to DsbAss or MalEss displayed higher levels of activity than those expressing LCI with OmpAss (Table 1). If the LCI protein is unable to fold within the reductive environment of the cytoplasm , these data could reflect differences in the efficiency of export of the different LCI precursors (see below). However, active LCI was found in the culture supernatants when cells expressed the E. coli optimized gene, suggesting also a translational effect. To assess the steady-state production and cellular location of LCI, cells were fractionated from spheroplasts. Regardless of the signal sequence used, figure 3 shows the accumulation of large amounts of preLCI for all expression vectors. In addition, expression from LCIopt was markedly higher than from LCIN, consistent with increased translation rate. Nevertheless, for cells expressing LCI fused to OmpAss, little or no protein was detectable in the periplasmic fractions while preLCIs were correctly produced. This result indicated that LCI export did not occur to any significant extent when OmpAss was used, and also confirmed previous findings . It is noteworthy that growth of the corresponding cells was severely inhibited. The relative amounts of precursor (p) and mature (m) LCI were assessed, and export efficiency (m/p) evaluated for each expression vectors (Table 2). To ensure that all periplasmic contents were released during spheroplast preparation, we compared m/p ratios determined from whole cell extracts to those determined from subcellular fractions (Table 2). Because both values are similar, it can be concluded that: (i) a complete release of the periplasmic mature LCI had been attained, and (ii) all preLCIs remained in the soluble cytoplasmic fractions. Besides OmpAss, our results indicated that the two other preLCIs with optimized E. coli codons could give rise to a high level of periplasmic expression (figure 3C). The highest export efficiency was observed with LCIO fused to DsbAss. Interestingly, amounts of LCI in the culture supernatants (Table 1) correlated with their levels in the corresponding periplasmic fractions, suggesting that the presence of extracellular LCI resulted from a direct leakage of the outer membrane. Although the level of LCI when fused to MalEss was somewhat lower than when fused to DsbAss, the efficient export of preLCI may require a co-translocation mode. Since pDsbALCIo seemed to be a good expression vector, we checked its stability on selection pressure. After different cultivation times, bacteria were plated onto solid LB media with and without chloramphenicol in the presence of arabinose. After overnight growth, the number of colony forming units (CFU) determined from these plates indicated no plasmid loss (see below).
High level production of LCI in a fermentor
The batch production of LMG194 cells transformed with pDsbALCIo was performed at 37°C in the HDM medium, a balanced complex medium previously optimized for high cell density cultures in multiple microfermentors . Addition of arabinose was performed at an OD600 value of 16, equivalent to 5.5 g dry cell weight (DCW) per liter, and the culture continued to grow during the next 8-h period (figure 4). The level of periplasmic LCI increased up to 5 h after induction, then decreased progressively. In contrast, the level of extracellular LCI increased continuously until a cultivation time of 8 h, resulting in most of the overproduced protein being found in the culture supernatants. This result suggested that a high concentration of periplasmic LCI is required before being released into the growth medium, presumably because of an increased permeability of the outer membrane. Although that the mechanism of this non- or semi-specific  protein secretion is still unclear, it was proposed that the high level production of secreted protein could inhibit the synthesis of outer membrane proteins, and compromise the permeability barrier of outer membrane. Therefore, we checked the viability of cells during cultivation in the fermentor, and in parallel experiments we monitored CFU counts for plasmid loss. The data shown in figure 5 indicate that about 90% of cells were still alive and able to form colonies on selective plates. The accumulated LCI in the growth medium during high cell density culture did not result from cell lysis or death. Finally, after 8 h of induction, about 470 mg of active LCI could be purified from 1 liter of culture by a single step reverse-phase chromatography.
In this study, we found that E. coli codon optimization in the LCI gene when combined to the signal sequence of DsbA allowed the production/purification of 470 mg of active LCI per liter of culture. While codon usage may be an important criterion for translation rate [17, 18] and/or protein folding [18–20], our studies indicate that, besides the nature of signal sequence, it is also an important parameter to ensure an efficient export of heterelogous precursors. If the nature of signal sequence determines the targeting pathways , the correct combination of both parameters appears to be necessary for optimal coupling of translation to protein translocation in E. coli.
Bacterial strain and plasmids
The E. coli LMG194 strain [F- ΔlacX74 galE galK thi rpsL ΔphoA Δara714 leu::Tn10] carrying the araBAD deletion  was used as the expression host throughout the experiments. Recombinant DNA manipulations were performed as described in established protocols . Plasmid pLCB was constructed in two steps from the pBAD33 expression vector . First, the residual bla sequence was deleted by Bgl I-Tth 111I digestion and filling in with Klenow fragment. Second, a DNA fragment which contained the Shine-Dalgarno sequence comprising an ATG start codon within a Nde I site from the pIVEX2.3MCS vector  was amplified using 5'-AAGAGCTCGAATTCCATATGTATATCTCCTTGCTAGCCCAAAAAAACGGGTATGG-3' and 5'-GTAACAAAGCGGGACCAAAGCC-3' as primers, and pBAD33 as DNA template. The PCR product was digested with Mlu I and Sac I, and cloned into the same restriction sites of the previous pBAD33 derivative. The structure of the resulting plasmid was confirmed by sequencing and designated as pLCB. The mature LCI sequence was codon optimized for E. coli expression and chemically synthesized by Geneart (Regensburg, Germany). The substitution of malE or dsbA signal sequence was generated by overlap extension PCR as previously described .
For shake flask cultures, cells were grown in 100 ml of LB medium supplemented with chloramphenicol (30 μg.ml-1). Induction of the araB promoter was accomplished by addition of L-arabinose to a final concentration of 0.2%. After 6 h at 37°C, cells were harvested by centrifugation at 6,000 rpm for 15 min. For high cell density cultures, bacteria were grown in a Sartorius Biostat B® 2-L fermentor at 37°C. The aeration rate and stirrer speed were regulated to keep the dissolved oxygen concentration at 60% of its saturation value. Precultures (80 ml) were prepared in shake flasks at 37°C to mid-log phase, and then added into the fermentor containing 800 ml of the HDM medium  supplemented with chloramphenicol (30 μg.ml-1). Induction was accomplished by addition of L-arabinose (0.5%). Cell biomass was monitored by measuring both the optical density at 600 nm (OD600) and dry cell weight (DCW) as previously described . Cell viability was determined by using the LIVE/DEAD BacLight kit (Invitrogen) in combination with flow cytometry as described by the manufacturer . Plasmid stability was assessed by plating properly diluted amounts of culture samples on LB-agar plates containing 0.5% arabinose without antibiotic and with 30 μg.ml-1 chloramphenicol. After overnight growth at 37°C the numbers of colony forming unit (CFU) were determined.
Cell fractionation and protein assays
Cells carrying the pLCB derivatives, normalized to the same OD600, were fractionated by spheroplast preparation as previously described . To analyse secreted LCI in the culture media, culture supernatants were applied to SepPak Plus C18 cartridges (Waters) pre-equilibrated by 10% acetonitrile. Then, the columns were washed with 10% acetonitrile, and proteins were eluted by 30% isopropanol. Total protein content was determined by the Bradford assay using bovine serum albumin as a standard. Cellular fractions were separated on 10% Bis-Tris polyacrylamide NuPage gels (Invitrogen), and proteins were visualized by Coomassie blue staining. For quantitative analysis, gels were scanned with Gel Doc XR imaging system (Biorad).
After cultivation, cells were centrifuged as described above and supernatants were filtered through a 0.22 μm syringe filter (Millipore). LCI was purified by reverse phase chromatography using a Ultimate 300 HPLC system (Dionex) and a Vydak C4 column, with a linear gradient ranging from 20 to 80% acetonitrile at a flow rate of 1 ml.min-1 as previously described . To quantify the concentration of native LCI found in periplasmic and culture supernatants, a calibration curve was constructed by using purified active protein as a standard. The LCI activity was assayed using the Carboxypeptidase A assay kit (Sigma Aldrich) in 50 mM Tris-HCl buffer, pH 7.5; containing 100 mM NaCl.
Baneyx F, Mujacic M: Recombinant protein folding and misfolding in Escherichia coli. Nat Biotechnol. 2004, 22 (11): 1399-1408. 10.1038/nbt1029.
Georgiou G, Segatori L: Preparative expression of secreted proteins in bacteria: status report and future prospects. Curr Opin Biotechnol. 2005, 16 (5): 538-545. 10.1016/j.copbio.2005.07.008.
de Marco A: Strategies for successful recombinant expression of disulfide bond-dependent proteins in Escherichia coli. Microb Cell Fact. 2009, 8: 26- 10.1186/1475-2859-8-26.
Steiner D, Forrer P, Stumpp MT, Pluckthun A: Signal sequences directing cotranslational translocation expand the range of proteins amenable to phage display. Nat Biotechnol. 2006, 24 (7): 823-831. 10.1038/nbt1218.
Schierle CF, Berkmen M, Huber D, Kumamoto C, Boyd D, Beckwith J: The DsbA signal sequence directs efficient, cotranslational export of passenger proteins to the Escherichia coli periplasm via the signal recognition particle pathway. J Bacteriol. 2003, 185 (19): 5706-5713. 10.1128/JB.185.19.5706-5713.2003.
Qiu J, Swartz JR, Georgiou G: Expression of active human tissue-type plasminogen activator in Escherichia coli. Appl Environ Microbiol. 1998, 64 (12): 4891-4896.
Reverter D, Fernandez-Catalan C, Baumgartner R, Pfander R, Huber R, Bode W, Vendrell J, Holak TA, Aviles FX: Structure of a novel leech carboxypeptidase inhibitor determined free in solution and in complex with human carboxypeptidase A2. Nat Struct Biol. 2000, 7 (4): 322-328. 10.1038/74092.
Arolas JL, Castillo V, Bronsoms S, Aviles FX, Ventura S: Designing out disulfide bonds of leech carboxypeptidase inhibitor: implications for its folding, stability and function. J Mol Biol. 2009, 392 (2): 529-546. 10.1016/j.jmb.2009.06.049.
Reverter D, Vendrell J, Canals F, Horstmann J, Aviles FX, Fritz H, Sommerhoff CP: A carboxypeptidase inhibitor from the medical leech Hirudo medicinalis. Isolation, sequence analysis, cDNA cloning, recombinant expression, and characterization. J Biol Chem. 1998, 273 (49): 32927-32933. 10.1074/jbc.273.49.32927.
Sanglas L, Valnickova Z, Arolas JL, Pallares I, Guevara T, Sola M, Kristensen T, Enghild JJ, Aviles FX, Gomis-Ruth FX: Structure of activated thrombin-activatable fibrinolysis inhibitor, a molecular link between coagulation and fibrinolysis. Mol Cell. 2008, 31 (4): 598-606. 10.1016/j.molcel.2008.05.031.
Nakamura Y, Gojobori T, Ikemura T: Codon usage tabulated from international DNA sequence databases: status for the year 2000. Nucleic Acids Res. 2000, 28 (1): 292- 10.1093/nar/28.1.292.
Guzman LM, Belin D, Carson MJ, Beckwith J: Tight regulation, modulation, and high-level expression by vectors containing the arabinose PBAD promoter. J Bacteriol. 1995, 177 (14): 4121-4130.
Zalucki YM, Beacham IR, Jennings MP: Biased codon usage in signal peptides: a role in protein export. Trends Microbiol. 2009, 17 (4): 146-150. 10.1016/j.tim.2009.01.005.
Choi JH, Lee SY: Secretory and extracellular production of recombinant proteins using Escherichia coli. Appl Microbiol Biotechnol. 2004, 64 (5): 625-635. 10.1007/s00253-004-1559-9.
Arolas JL, Aviles FX, Chang JY, Ventura S: Folding of small disulfide-rich proteins: clarifying the puzzle. Trends Biochem Sci. 2006, 31 (5): 292-301. 10.1016/j.tibs.2006.03.005.
Frachon E, Bondet V, Munier-Lehmann H, Bellalou J: Multiple microfermentor battery: a versatile tool for use with automated parallel cultures of microorganisms producing recombinant proteins and for optimization of cultivation protocols. Appl Environ Microbiol. 2006, 72 (8): 5225-5231. 10.1128/AEM.00239-06.
Gustafsson C, Govindarajan S, Minshull J: Codon bias and heterologous protein expression. Trends Biotechnol. 2004, 22 (7): 346-353. 10.1016/j.tibtech.2004.04.006.
Komar AA: A pause for thought along the co-translational folding pathway. Trends Biochem Sci. 2009, 34 (1): 16-24. 10.1016/j.tibs.2008.10.002.
Rosano GL, Ceccarelli EA: Rare codon content affects the solubility of recombinant proteins in a codon bias-adjusted Escherichia coli strain. Microb Cell Fact. 2009, 8: 41- 10.1186/1475-2859-8-41.
Marin M: Folding at the rhythm of the rare codon beat. Biotechnol J. 2008, 3 (8): 1047-1057. 10.1002/biot.200800089.
Hegde RS, Bernstein HD: The surprising complexity of signal sequences. Trends Biochem Sci. 2006, 31 (10): 563-571. 10.1016/j.tibs.2006.08.004.
Sambrook J, Russell DW: Molecular cloning: a laboratory manual. 2001, Cold Spring Harbor: Cold Spring Harbor Laboratory Press, 3
Roge J, Betton J-M: Use of pIVEX plasmids for protein overproduction in Escherichia coli. Microb Cell Fact. 2005, 4: 18- 10.1186/1475-2859-4-18.
Miot M, Betton JM: Optimization of the inefficient translation initiation region of the cpxP gene from Escherichia coli. Protein Sci. 2007, 16 (11): 2445-2453. 10.1110/ps.073047807.
Vidal L, Pinsach J, Striedner G, Caminal G, Ferrer P: Development of an antibiotic-free plasmid selection system based on glycine auxotrophy for recombinant protein overproduction in Escherichia coli. J Biotechnol. 2008, 134 (1-2): 127-136. 10.1016/j.jbiotec.2008.01.011.
Berney M, Hammes F, Bosshard F, Weilenmann HU, Egli T: Assessment and interpretation of bacterial viability by using the LIVE/DEAD BacLight Kit in combination with flow cytometry. Appl Environ Microbiol. 2007, 73 (10): 3283-3290. 10.1128/AEM.02750-06.
Betton J-M, Boscus D, Missiakas D, Raina S, Hofnung M: Probing the structural role of an alpha beta loop of maltose-binding protein by mutagenesis: heat-shock induction by loop variants of the maltose-binding protein that form periplasmic inclusion bodies. J Mol Biol. 1996, 262 (2): 140-150. 10.1006/jmbi.1996.0504.
Sharp PM, Li WH: The codon Adaptation Index--a measure of directional synonymous codon usage bias, and its potential applications. Nucleic Acids Res. 1987, 15 (3): 1281-1295. 10.1093/nar/15.3.1281.
We thank FX Avilès and his collaborators for many helpful discussions, and E. Johnson for critical reading of the manuscript. JM Puertas is a recipient of the Spanish Ministry of Science and Innovation (MICINN). This work was supported in part by the Institut Pasteur and the Centre National de la Recherche Scientifique (CNRS), and by a grant from the Agence Nationale de la Recherche (O6-BLAN-023904).
The authors declare that they have no competing interests.
JMP and JMB designed and performed experiments, interpreted the data and wrote the manuscript.