Skip to main content

A genetic replacement system for selection-based engineering of essential proteins



Essential genes represent the core of biological functions required for viability. Molecular understanding of essentiality as well as design of synthetic cellular systems includes the engineering of essential proteins. An impediment to this effort is the lack of growth-based selection systems suitable for directed evolution approaches.


We established a simple strategy for genetic replacement of an essential gene by a (library of) variant(s) during a transformation.

The system was validated using three different essential genes and plasmid combinations and it reproducibly shows transformation efficiencies on the order of 107 transformants per microgram of DNA without any identifiable false positives. This allowed for reliable recovery of functional variants out of at least a 105-fold excess of non-functional variants. This outperformed selection in conventional bleach-out strains by at least two orders of magnitude, where recombination between functional and non-functional variants interfered with reliable recovery even in recA negative strains.


We propose that this selection system is extremely suitable for evaluating large libraries of engineered essential proteins resulting in the reliable isolation of functional variants in a clean strain background which can readily be used for in vivo applications as well as expression and purification for use in in vitro studies.


About eight percent of E. coli genes are essential for the cell [1]. Essential genes are of particular scientific interest as they encode proteins required for important biological functions, thereby building the minimal core of cellular viability which tends to be conserved across species. Knowledge about essential genes and their protein products is important for drug design [2, 3], biotechnological applications [4], minimal genome approaches [58] and, in general, crucial for understanding and engineering the basic cellular functions required for life [9]. While the construction of the Keio-collection, a collection of single gene knock-outs in E. coli[1], enormously facilitated the systematic investigation of the physiology of E. coli as well as protein and strain-engineering approaches, it is still restricted to non-essential genes and their protein products. Engineering approaches involving essential genes and proteins are complicated because knock-outs cause lethality. This means that phenotypes of engineered proteins cannot be easily evaluated in vivo as suitable clean strain backgrounds are not available.

In the last decade several approaches have been investigated to identify essential genes and to study their function in vivo by conditional elimination of the protein from the cell. This was achieved by triggering interference of the synthesis of the target protein on either the transcriptional or translational level [1015]. However, these “bleach-out” methods rely on conditional protein elimination rather than elimination of the target gene itself and thus retain a wild-type copy of the essential gene in the cell. This sets limitations for the utility of these systems as ready-to-use selection systems for directed evolution experiments since recombination of library members with the chromosomal wild-type gene or mutations in the system regulating the expression of the wild-type protein can lead to the selection of false positive variants. This is particularly true when using a library for which only a small fraction of variants is expected to be functional. In this case recombination events are preferentially selected over functional library members, which results in every selection effort turning into a laborious screen for bona fide functional library members.

Besides evaluation of large libraries, another desire during protein engineering of essential genes is to replace the wild-type gene by a single engineered or heterologous variant for in vivo functional studies or for the construction of specialized strains which can be used to purify the mutant protein free of wild-type protein. Phage P1-mediated transduction of a chromosomal knock-out into a strain expressing a variant of the essential gene of interest from a plasmid is the current method of choice to achieve genetic replacement of an essential target gene by a variant (e.g [16]). The knock-out was thereby created while complementing the chromosomal gene loss by a plasmid-encoded version of the essential gene.

Although P1-transduction is widely used, the protocol is time-consuming and restricted to a few variants at a time as efficiencies of successful transductions are low, often requiring empirical testing for the proper phage concentration followed by re-plating and PCR-screening for correct genotypes. Therefore, it was our aim to develop a general genetic set-up which turns working with essential genes and the engineering of their gene products into a straight-forward approach as facile as working with non-essential genes. Here, we present a simple transformation-based system. Establishment of the system begins with the chromosomal replacement of the essential gene of interest by a PCR-derived selection marker [17], in conjunction with a complementary vector-encoded version of the target. A central element of the method is that the complementation vector carries an I-Sce I nuclease recognition site and can thus be rapidly and conditionally eliminated in the presence of an I-Sce I nuclease-expressing helper plasmid. During elimination of the complementation vector, cells are made electrocompetent and transformed with a vector-encoded library or a specific variant of the essential gene. Thus, the actual step of gene exchange is reduced to a single transformation plus preparation of a suitable knock-out strain.


Overview of the replacement system

The selection system can be flexibly assembled in various user-defined ways, but will be discussed in its simplest version first. It relies on two vectors: the knock-out and complementation vector pKOCOMP and the helper plasmid pI-Sce I (Figure 1A and Table 1). Plasmid pKOCOMP is a derivative of pKD46 [17] and encodes the arabinose-inducible λ red recombination system (genes β γ and exo) as well as the essential gene of interest. It also contains an 18 bp I-Sce I cleavage site which allows for the conditional elimination of pKOCOMP in the presence of I-Sce I. I-Sce I is expressed from the helper plasmid pI-Sce I under the control of the rhamnose-inducible RhaSR/Prha BAD-system [18]. A variant of this helper plasmid, pP ara I-Sce I, carries an arabinose-inducible regulatory system AraC/Para BAD[19] instead of the rhamnose-based system. After testing various set-ups, these promoters were chosen as they exhibited high expression level in the presence of rhamnose or arabinose but could be efficiently switched off in the presence of glucose due to catabolite repression [19, 20]. It should be noted that the specific set-up of the complementation vector is flexible as long as it carries the I-Sce I site, which can easily be introduced by PCR, is compatible with pKD46, which is required for λ red-mediated knock-out of the target gene and can be cured by growth at 42°C. The system can then be completed by subsequent transformation with pI-Sce I or pP ara I-Sce I.

Figure 1

Overview on the transformation-based replacement of essential genes. A: Plasmids required for establishment of the selection system B: General procedure: The essential target gene adk is replaced by an antibiotic resistance cassette while pKOCOMP-adk complements for the chromosomal loss. Plasmid pKOCOMP can be conditionally eliminated by co-expression of I-Sce I nuclease from helper plasmid pI-Sce I. During elimination, cells are made electrocompetent and are transformed with a variant or library under investigation to evaluate functionality or to select for functional library members which can complement for the loss of pKOCOMP.

Table 1 Bacterial strains and plasmids used in this study

Validation and characterization of the system using the essential gene product adenylate kinase (Adk)

The first target gene chosen for validation and characterization of the system was adk, encoding E. coli’ s Adk, an essential gene product required for the biosynthesis of purine ribonucleotides and for the regulation of intracellular nucleotide availability [1, 21, 22]. For complementation, adk under control of P tac was inserted into pKOCOMP, giving rise to pKOCOMP-adk. Upon induction of pKOCOMP-adk with IPTG, the chromosomal copy of adk was replaced by a PCR-generated kanamycin resistance cassette [17]. The genotype of the resulting strain E. coli adk::kan [pKOCOMP-adk was confirmed by PCR using primers binding to chromosomal regions up- and downstream of the adk-locus ( Additional file 1: Table S1 and Additional file 2 Figure S1). To complete the selection system, strain adk::kan [pKOCOMP-adk was transformed with helper plasmid pI-Sce I. The resulting strain adk::kan [pKOCOMP-adk; pI-Sce I] was grown in the presence of glucose during maintenance to repress I-Sce I nuclease production. However, when cells were transferred to glucose-free LB medium, I-Sce I production was apparently efficiently induced by the addition of rhamnose, as no colony-forming units could be recovered after an induction-8period of 180 minutes (Figure 2A). Importantly, even without a copy of the adk gene, cells remained viable for about two generations, presumably until all mRNA and protein was depleted. This was expected as linear DNA is rapidly degraded by cellular exonucleases and has a half-life in the range of minutes [23] whereas most proteins are relatively stable with half-lives in the range of hours [24].

Figure 2

Characterization of the pKOCOMP- adk /pI- Sce I and the pCOMP- adk /pP ara I- Sce I systems. A: Elimination of pKOCOMP-adk from adk::kan due to rhamnose-induced expression of I-Sce I nuclease from helper plasmid pI-Sce I. Red arrow: Time point for harvest and competent cell preparation. The inlet gives a better resolution of the elimination dynamics. The chromosomal adk-replacement was established using pKOCOMP-adk. B: Elimination of pCOMP-adk from adk::kan due to arabinose-induced expression of I-Sce I nuclease from helper plasmid pP ara I-Sce I. The chromosomal adk-replacement was established using pKD46 and pCOMP-adk.

During this intermediate period cells were made electrocompetent by washing with H2O and glycerol and transformed with various test plasmids (Table 1) in order to determine transformation efficiencies and the frequency of false positive variants. Vector pSEVA132-adk encoding the wild-type adenylate kinase under control of its natural promoter was used for determination of transformation efficiencies. Transformation efficiencies of 107 μg-1 DNA were routinely achieved. The system was intensively characterized regarding the frequency of false positive variants which could arise due to recombination of the library plasmid with residual linearized pKOCOMP-adk or due to an uninduced subpopulation of cells that maintained pKOCOMP (Table 2). Recombination-based false positives were tested by transformation with pSEVA132-adkstop, harboring a stop codon in the adk gene that could be repaired by recombination allowing growth of the corresponding carrier cell. False positives due to incomplete induction were examined by transformation with the empty vector pSEVA132. In both cases no false positive colonies were detected when I-Sce I expression was induced and pKOCOMP was eliminated before competent cell preparation (Table 2), corresponding to a frequency of less than 2 x 10-4 false positives per transformed cell. However, there was a detectable frequency of recombination events when pKOCOMP was eliminated only after transformation with the test plasmids or when we tried to eliminate pKOCOMP solely by growth at the non-permissive temperature. In the former case, I-Sce I nuclease expression was induced only after transformation by plating on arabinose-supplemented agar plates, such that both plasmids would be simultaneously present in the cells for a short period of time. Here we found illegitimate events with a frequency of about 3 x 10-3 per transformed cell (Table 2). This demonstrates the importance of careful management of the plasmid elimination step.

Table 2 Transformation efficiencies and frequency of false positive variants for the Adk-specific selection systems

Flexibility of the replacement system

To verify that the selection system can be set up with alternative combinations, we constructed pCOMP-adk. This vector is based on the pBBR1 ori with an expected copy number of 10-20 per cell [25], carrying a P tac promoter-controlled adk gene, the lac-repressor LacI, and an I-Sce I cleavage site. While the previous complementation vector pKOCOMP-adk used a temperature-sensitive pSC101 ori with about 2-3 copies per cell when grown at 37°C [26], the increased copy number of pCOMP-adk allowed for examination of whether plasmid elimination was sufficiently efficient at higher intracellular plasmid concentration. We used pCOMP-adk in combination with pKD46 to replace adk by a kanamycin cassette. After curing cells of pKD46 by growth at 43°C, the resulting strain adk::kan [pCOMP-adk was transformed with pP ara I-Sce I and used for gene replacement as described before using arabinose to induce I-SceI production. Despite the higher copy number of pCOMP, the plasmid was again rapidly eliminated from the cells in the presence of arabinose (Figure 2B). Probably due to the faster on-set and the possibly higher expression levels of the arabinose responsive promoter Para BAD[19, 20], elimination-dynamics of the pCOMP/pP ara I-Sce I system were faster than those of the pKOCOMP/pI-Sce I system. Although the elimination data of the two systems are difficult to compare due to differences in copy number and promoters, these results indicate that the described approach can be set up in multiple ways, making it easy to adapt to plasmid strategies for specific purposes. Transformation efficiencies of the pCOMP/pP ara I-Sce I system (~107 colonies μg-1 DNA) were comparable to the pKOCOMP/pI-Sce I system (Table 1) and we could not identify false positive transformants.

Generality of the system

To confirm that the utility of the system was not limited to adk but could be easily extended to other essential genes, we constructed in vivo selection systems for other essential gene products: the chaperonin GroEL and its co-chaperonin GroES (encoded by the groE operon containing the genes groL and groS), and glycerol-3-phosphate dehydrogenase (GpsA encoded by gpsA). For establishment of the GroEL-specific system we introduced an I-Sce I cleavage site into the vector pSEVA431-groE by PCR giving rise to vector pCOMP-groE. Plasmid pSEVA431-groE encodes the groE-operon under control of the IPTG-inducible P tac promoter. It also harbors the lacI gene, a spectinomycin resistance cassette and replicates with a pBBR1 ori. Plasmid pCOMP-groE was used in combination with pKD46 to replace the chromosomal groE-operon by a kanamycin resistance cassette ( Additional file 2 Figure S1). The resulting strain groE::kan [pCOMP-groE] was cured from pKD46 at 43°C and then transformed with the helper plasmid pP ara I-Sce I.

The GpsA-specific system was constructed by cloning the natural gpsA transcriptional unit (consisting of genes secB and gpsA under control of their natural promoter) into vector pSEVA431 using primers encoding for an I-Sce I restriction site, resulting in vector pCOMP-secBgpsA. Chromosomally encoded secB and gpsA were then replaced by a kanamycin resistance cassette using pKD46 ( Additional file 2 Figure S1). To complete the set-up, strain secBgpsA::kan [pCOMP-secBgpsA] was cured of pKD46 and transformed with helper plasmid pP ara I-Sce I. Both systems were characterized regarding elimination dynamics of the complementing plasmids pCOMP-groE and pCOMP-secBgpsA after I-Sce I induction, as well as regarding transformation efficiencies of electrocompetent cells prepared during pCOMP-elimination and frequency of false positive variants. Both pCOMP-type plasmids were lost at a comparable rate to the pCOMP-plasmid carrying adk (Figure 3). After that, both systems routinely showed transformation efficiencies of 106-107 colonies μg-1 DNA when transformed with the positive control vectors pSEVA132-groE or pSEVA132-secBgpsA. No false positive variants could be detected after transformation with the test plasmids pSEVA132-groEstop and pSEVA132-secBgpsAstop, constructed in analogy to pSEVA132-adkstop before. Importantly, it seems to be a general feature that cells stay viable - as judged by the doubling time in comparison to a non-induced control culture that did not produce I-Sce I - for one or more generations, depending on the target gene, after loss of the complementing plasmid. This is an important characteristic of the system as cells can be made competent for transformation with a library or variant of an essential gene during a period where the complementing plasmid has already been lost and can no longer contribute to recombination.

Figure 3

Characterization of the pCOMP- groE /pP ara I- Sce I and the pCOMP-sec BgpsA /pP ara I- Sce I system. Elimination of (A) pCOMP-groE from groE::kan and (B) pCOMP-secBgpsA from secBgpsA::kan in the presence of arabinose and the helper plasmid pParaI-Sce I. Elimination is induced by arabinose-induced expression of I-Sce I nuclease from helper plasmid pParaI-Sce I. Red arrow: Time point for harvest and competent cell preparation.

Performance of the system during selection

To test the system regarding its performance in reliably identifying functional variants from a large library without contamination by false positive variants we challenged the GroEL- and the Adk-specific selection system with mock libraries with various excesses of non-functional variants. As there is a conventional bleach-out system available for GroEL (the E. coli MGM100 strain [10]), we also compared the performance of the replacement system developed here to the bleach-out system. In strain MGM100, the groE promoter has been replaced by the araBAD promoter and the regulatory gene araC. The strain can therefore be maintained in the presence of arabinose but GroEL production from the chromosomal locus can be fully repressed in the presence of glucose. This way, functional variants can be selected from a library in the presence of glucose. However, as a copy of the wild-type gene is still present during selection, recombination can lead to selection of false positive variants, which can dominate even in stringent selection conditions. To improve the strain regarding the expected recombination frequency we deleted recA leading to the strain SBΔrecA ( Additional file 2: Figure S1).

The created mock libraries contained decreasing ratios of functional to non-functional gene variants from 1:102 to 1:105 (functional:non-functional) mimicking libraries where a decreasing number of variants is expected to be functional within a large pool of non-functional variants, as it is typical e.g. for libraries created by error-prone PCR with different error rates. As non-functional variants we used pSEVA132-groEstop or pSEVA132-adkstop. As functional variants we used pSEVA132-groEwatermark or pSEVA132-adkwatermark. These plasmids encode for a GroEL or Adk- variant with an in-frame insertion of a short peptide at previously identified permissive sites. Both variants are fully functional and can be identified by PCR using watermark-specific primers. To compare the GroEL-specific selection system with the GroEL bleach-out system SBΔrecA, electrocompetent W3110 groE::kan [pCOMP-groE, pP ara I-Sce I] cells or electrocompetent SBΔrecA cells were transformed with the different mock libraries in separate experiments. As positive and negative controls, cells were transformed with only functional or only non-functional variants. After plating and incubation overnight on LB-agar plates supplemented with arabinose, the corresponding antibiotics, and, in the case of W3110 groE::kan [pCOMP-groE, pP ara I-Sce I], glucose, colonies were scored and a subset of the colonies was genotyped with watermark-specific primers (Table 3).

Table 3 Recovery of functional GroEL-variants and Adk-variants from mock libraries with increasing excess of non-functional variants using the established selection system in comparison with a conventional bleach-out system

In the GroEL-specific system, the numbers of colonies correlated well with the numbers expected from transformation efficiencies and the decreasing number of functional variants in each library. Even from the most stringent library (functional variants at a frequency of 10-5) only functional variants carrying the watermark were recovered after transformation of ~ 81,000 cells. No colony was detected after transformation of approximately the same number of cells with only non-functional variants. In contrast when using SBΔrecA for the same experiment, false positive variants were detected after transformation with the negative control (transformation with only non-functional variants) with a frequency of ~ 2 x 10-4. In addition, we identified false positive variants after genotyping a subset of the colonies which had been selected from the different mock libraries (Table 3). Selection stringency positively correlated with the false positive rate, thus requiring laborious orthogonal assays to differentiate true from false positives. In a directed evolution experiment this would result in the requirement for intensive post-screening of selected variants for true functional library members. Plasmids of three of the false positive variants, which had been isolated after transformation with only non-functional variants, were further analyzed by sequencing. For two of them the stop codon had been reverted to the wild-type codon, probably due to recombination with the chromosomal groL copy. The third analyzed variant still had the stop codon within groL indicating that the ability to grow must have arisen from a mutation in the araBAD promoter preventing full repression. This phenomenon had been described before for strain MGM100 [16].

The same experiment was performed with the Adk-specific selection system using mock libraries with decreasing ratios of functional Adk-variants to non-functional Adk-variants. Also for this set-up we did not identify any false positive variant and after transformation with the most stringent library we could reliably identify a functional variant containing the watermark after transformation of ~ 138,000 cells (Table 3).


In this study we present a facile and efficient set-up for a (within the tested boundaries) background-free selection system for functional, engineered, essential proteins. It is based on the conditional elimination of a complementary plasmid-based copy of an essential gene in a knock-out strain in order to replace the essential gene by genes from a library (for example).

We show that I-Sce I nuclease-based cleavage proves to be a suitable strategy for the fast and efficient elimination of a complementing plasmid from a knock-out strain which can occur while the strain is being made transformation-competent. Efficient plasmid elimination is an essential prerequisite for the high performance of the system during evaluation of large libraries under stringent conditions because elimination of the wild-type gene from the cells prior to introduction of variants prevents recombination-based background growth. The major shortcoming of conventionally used bleach-out systems is indeed due to recombination with a silenced wild-type gene during selection. Even in a recA strain, RecA-independent recombination between homologous regions occurs with frequencies between 10-3-10-4[27]. As soon as the number of expected functional variants within a library drops below 10-3 selection is primarily for recombination events instead of functional library members. This can clearly be seen in our selection experiment using the GroEL bleach-out strain SBΔrecA and mock libraries with decreasing ratio of functional variants. Even though our strain was deficient for recA, we frequently isolated false positive variants.

Besides being recombination-free, another advantage of the presented set-up over bleach-out systems is that selected variants are directly expressed in a clean strain-background circumventing laborious post-transformation work such as P1-transductions. This allows for subsequent purification of engineered proteins for in vitro characterization and user-defined in vivo applications without running the risk of wild-type contamination.

We also show that even after elimination of the complementing vector from the knock-out strain, cells remain viable for one or more generations - depending on the gene - and electrocompetent cells prepared after elimination yield up to 107 transformants μg-1 DNA.

As in our set-up the essential target genes are expressed from an inducible promoter during competent cell preparation, appropriate bleach-out times can be adjusted for individual gene products by tuning the inducer concentration.

The system was validated with three different essential E. coli proteins: Adk, glyceraldehyde-3-phosphate dehydrogenase and the chaperonin GroEL. Transformation efficiencies and the absence of detectable recombination events proved to be independent of the essential target gene.


In the current work we present a straightforward, transformation-based system which enables the genetic replacement of a wild-type essential gene of interest by a library or variant. It thereby directly results in the isolation of functional variants in a clean strain background with considerably reduced effort.

Furthermore, it substantially facilitates working with and engineering of essential genes and their protein products making it an experimentally easy, fast and scalable task.

Finally, it should be possible to adapt the here introduced replacement strategy to other hosts – like e.g. yeast, Bacillus subtilis or Clustridium spec. - for which homologous recombination-based knock-out strategies are available [2830].

Materials and methods

Chemical and enzymes

Restriction enzymes and ligase were obtained from New England Biolabs (Ipswich, MA, USA) and used according to manufacturers’ instructions. Chemicals were purchased in the highest purity available from Sigma-Aldrich, Fluka (Buchs, Switzerland) or Roth (Lauterbourg, France). Trypton and yeast extract were from BD Bioscience (Basel, Switzerland). Oligonucleotides and Sanger-sequencing service were purchased from Microsynth (Balgach, Switzerland).

Strains and plasmids

E. coli DH10B was used for general cloning procedures. E. coli W3110 was used as the chassis for all chromosomal deletions (see Table 1 for an overview on strains used in this study). SBΔrecA is a derivative of strain MGM100 [10]. It was constructed from BW25113 recA:: FRT [1] by P1-phage transduction using a lysate from MGM100 and selecting for KmR. The final clone was confirmed by PCR analysis of the groE and recA::FRT locus and by its inability to grow on glucose after GroEL-bleach-out. Plasmid pKOCOMP-adk is a derivative of pKD46 [17] and was constructed by first cloning the adk gene into the multiple cloning site of expression vector pACT3 [31] via restriction sites Bam HI and Hin dIII using primers pKOCOMP-adk-fw and pKOCOMP-adk-rv (see Table S1 for primer sequences). The resulting vector pACT-adk was used as template to amplify the Ptac promoter-controlled adk gene with primers pACT-forward and pACT-reverse, encoding for a I-Sce I recognition site, and cloned into pKD46 via its unique Nco I-site. For construction of pCOMP-adk the P tac promoter-controlled adk gene was amplified with primers pACT-SceI-Spe and pACT-Pac and the PCR product was cloned into the unique Spe I and Pac I sites of pSEVA432 (ori pBBR1, SpecR resistance). Plasmid pSEVA132-adk is a derivative of pSEVA132 (ori pBBR1, ApR) and encodes adk controlled by its natural promoter and fused to a C-terminal 6xHis-tag. It was amplified from genomic E. coli DNA using primers adk-forward and adk-reverse and cloned into pSEVA132 via restriction sites Xma I and Sac I. Plasmid pCOMP-gro E was constructed by amplification of the P tac -controlled groE-operon from pACT-ESL using primers pACT-SceI-Spe and pACT-Pac and cloned into the unique SpeI and PacI sites of pSEVA431. pACT-ESL is pACT3 derived and encodes the P tac controlled groE operon. groS was amplified from W3110 by PCR with primers groS-fw and groS-rv and groL was PCR-amplified with primers groL-fw and groL-rv and then sequentially cloned into the Kpn I and Hin dIII sites of pACT3. Plasmid pSEVA132-groE was constructed by amplification of the natural groE operon from E. coli genomic DNA using primers groE-forward and groE-reverse and cloning them into the unique sites Xma I and Xba I. Plasmid pCOMP-secBgpsA was constructed by cloning the natural secB-gpsA transcriptional unit, amplified with primers secBgpsA-forward and gpsA _I-Sce I-reverse, into pSEVA431 via restriction sites Xma I and Xba I. Plasmids pSEVA132-adkstop, pSEVA132-adkwatermark, pSEVA132-groEstop, pSEVA132-groEwatermark and pSEVA132-secBgps Astop were constructed by amplification and re-ligation of pSEVA132-adk, pSEVA132-groE or pSEVA132-secBgps A using primers adk-stop-fw/adk-stop-rv, adk-watermark-fw/adk-watermark-rv, groE-watermark-fw/groE-watermark-rv, groE-stop-fw/groE-stop-rv and secBgpsA-stop-fw/secBgpsA-stop-rv.

Helper plasmids pI-Sce I and pP ara I-Sce I are derivatives of pSEVA671 (ori p15A, GmR). The gene for I-SceI nuclease was amplified from plasmid pSTKST [32] using primers I-SceI-fw and I-SceI-rv and cloned into pSEVA671 via Pac I and Eco RI restriction sites. The RhaR-RhaS/Prha BAD regulatory system was amplified from E. coli genomic DNA using primers Rha-forward and Rha-reverse and cloned in front of I-SceI via restriction sites Nsi I and Spe I. The AraC/Para BAD regulatory system was amplified from E. coli genomic DNA using primers ParaBAD-fw and ParaBAD-rv and used to exchange RhaR-RhaS/P rha using sites Nsi I and Spe I.

Preparation of competent cells and transformation

E. coli adk::kan [pKOCOMP-adk, pI-Sce I] or [pCOMP-adk, pP ara I-Sce I] were grown overnight in LB liquid broth supplemented with 50 μg mL-1 kanamycin, 10 μg mL-1 gentamicin, 100 μM IPTG and 0.5% (wt/vol) glucose (to efficiently repress I-Sce I production) at 30°C or 37°C. Cells were pelleted, washed once with LB and diluted 1:100 in fresh LB broth, supplemented with the same antibiotics as before but without glucose. Cells were grown at 37°C. At an OD600 of 0.2, 10 mM rhamnose or 0.2% arabinose (wt/vol) was added to induce I-Sce I nuclease production. When reaching an OD600 of 0.4-0.5, cells were chilled on ice for 30 min, harvested and washed twice with chilled water and once with 10% glycerol as described [26]. For transformation with the test plasmids and the mock libraries, 50 μL cells (OD600 around 100) were mixed with 1.5 ng DNA, exposed to an electrical pulse of 1.3 kV and recovered in 1 mL LB broth supplemented with 10 mM rhamnose or 0.2% arabinose for 1 h at 37°C. Selection was done overnight at 37°C on LB agar plates containing 50 μg mL-1 kanamycin, 10 μg mL-1 gentamicin, 100 μg mL-1 ampicillin and 10 mM rhamnose or 0.2% arabinose. The GroEL-specific selection system based on W3110 groE::kan [pCOMP-groE, pP ara I-Sce I] and the GpsA-specific selection system based on W3110 secBgpsA::kan [pCOMP-secBgpsA, pP ara I-Sce I] were treated the same way. For preparation of competent SBΔrec A, cells were grown overnight in LB supplemented with 50 μg mL-1 kanamycin and 0.2% arabinose. Cells were washed twice with water and diluted 1:100 in fresh LB medium. When cells reached an OD600 of 0.1 0.5% glucose was added to repress chromosomal GroEL production. Cellular GroEL was bleached for two generations before cells were harvested for competent cell preparation (at OD600 of 0.4).

Determination of plasmid loss

Loss of pKOCOMP-adk or pCOMP-adk, pCOMP-groE and pCOMP-secBgpsA was determined as follows: The corresponding knock-out strains adk::kan, groE::kan and secBgpsA::kan containing helper plasmid pI-Sce I or pP ara I-Sce I were grown in LB liquid broth supplemented with 50 μg mL-1 kanamycin and 10 μg mL-1 gentamicin until exponential growth was reached. Next, I-Sce I nuclease production was induced with 10 mM rhamnose or 0.2% arabinose. After induction aliquots were taken after 0, 10, 20, 40, 60, 120 and 180 min and normalized to OD600. Serial dilutions were subsequently plated on LB agar supplemented with 0.5% glucose. The next day the number of colony forming units (CFU) was counted. The number of colonies resulting from aliquots which had been taken from a control culture grown in the absence of rhamnose or arabinose were set to 100% CFU.

Chromosomal knock-out of essential genes

Knock-outs were done by λ red-based recombination with a PCR-encoded kanamycin resistance cassette as described earlier [17]. The kanamycin-cassettes were generated with pKD13 as a template and primers adk- H1 and adk-H2, groE-H1 and groE-H2 or secBgpsA- H1 and secBgpsA-H2. Knock-outs were PCR-verified with primers P1-P6 as indicated in Additional file2 Figure S1 and Additional file 1 Table S1.


  1. 1.

    Baba T, Ara T, Hasegawa M, Takai Y, Okumura Y, Baba M, Datsenko KA, Tomita M, Wanner BL, Mori H: Construction of Escherichia coli K-12 in-frame, single-gene knockout mutants: the Keio collection. Mol Syst Biol. 2006, 2: 2006-0008.

    Article  Google Scholar 

  2. 2.

    Rosamond J, Allsop A: Harnessing the power of the genome in the search for new antibiotics. Science. 2000, 287: 1973-1976. 10.1126/science.287.5460.1973

    CAS  Article  Google Scholar 

  3. 3.

    Haselbeck R, Wall D, Jiang B, Ketela T, Zyskind J, Bussey H, Foulkes JG, Roemer T: Comprehensive essential gene identification as a platform for novel anti-infective drug discovery. Curr Pharm Des. 2002, 8: 1155-1172. 10.2174/1381612023394818

    CAS  Article  Google Scholar 

  4. 4.

    Johnson DBF, Xu JF, Shen ZX, Takimoto JK, Schultz MD, Schmitz RJ, Xiang Z, Ecker JR, Briggs SP, Wang L: RF1 knockout allows ribosomal incorporation of unnatural amino acids at multiple sites. Nat Chem Biol. 2011, 7: 779-786. 10.1038/nchembio.657

    CAS  Article  Google Scholar 

  5. 5.

    Gil R, Silva FJ, Pereto J, Moya A: Determination of the core of a minimal bacterial gene set. Microbiol Mol Biol Rev. 2004, 68: 518-537. 10.1128/MMBR.68.3.518-537.2004

    CAS  Article  Google Scholar 

  6. 6.

    Posfai G, Plunkett G, Feher T, Frisch D, Keil GM, Umenhoffer K, Kolisnychenko V, Stahl B, Sharma SS, de Arruda M, et al: Emergent properties of reduced-genome Escherichia coli. Science. 2006, 312. 10.4-1046.

    CAS  Article  Google Scholar 

  7. 7.

    Trinh CT, Unrean P, Srienc F: Minimal Escherichia coli cell for the most efficient production of ethanol from hexoses and pentoses. Appl Environ Microbiol. 2008, 74: 3634-3643. 10.1128/AEM.02708-07

    CAS  Article  Google Scholar 

  8. 8.

    Lee JH, Sung BH, Kim MS, Blattner FR, Yoon BH, Kim JH, Kim SC: Metabolic engineering of a reduced-genome strain of Escherichia coli for L-threonine production. Microb Cell Fact. 2009, 8: 2- 10.1186/1475-2859-8-2

    Article  Google Scholar 

  9. 9.

    Danchin A: Natural selection and immortality. Biogerontology. 2009. 10. 503-516. 10.1007/s10522-008-9171-5

    Article  Google Scholar 

  10. 10.

    McLennan N, Masters M: GroE is vital for cell-wall synthesis. Nature. 1998, 392: 139-139. 10.1038/32317

    CAS  Article  Google Scholar 

  11. 11.

    Herring CD: Introduction of conditional lethal amber mutations in Escherichia coli. Methods Mol Biol. 2008, 416: 323-334. 10.1007/978-1-59745-321-9_21

    CAS  Article  Google Scholar 

  12. 12.

    Jin Y, Watt RM, Danchin A, Huang JD: Use of a riboswitch-controlled conditional hypomorphic mutation to uncover a role for the essential csrA gene in bacterial autoaggregation. J Biol Chem. 2009, 284: 28738-28745. 10.1074/jbc.M109.028076

    CAS  Article  Google Scholar 

  13. 13.

    Ji Y, Zhang B, Van SF, Horn , Warren P, Woodnutt G, Burnham MK, Rosenberg M: Identification of critical staphylococcal genes using conditional phenotypes generated by antisense RNA. Science. 2001, 293: 2266-2269. 10.1126/science.1063566

    CAS  Article  Google Scholar 

  14. 14.

    Tang YC, Chang HC, Chakraborty K, Hartl FU, Hayer-Hartl M: Essential role of the chaperonin folding compartment in vivo. EMBO J. 2008, 27: 1458-1468.

    CAS  Google Scholar 

  15. 15.

    Herring CD, Blattner FR: Conditional lethal amber mutations in essential Escherichia coli genes. J Bacteriol. 2004, 186: 2673-2681. 10.1128/JB.186.9.2673-2681.2004

    CAS  Article  Google Scholar 

  16. 16.

    van der Vies SM, Lund PA: Determination of chaperonin activity in vivo. Methods Mol Biol. 2000, 140: 75-96.

    CAS  Google Scholar 

  17. 17.

    Datsenko KA, Wanner BL: One-step inactivation of chromosomal genes in Escherichia coli K-12 using PCR products. Proc Natl Acad Sci U S A. 2000, 97: 6640-6645. 10.1073/pnas.120163297

    CAS  Article  Google Scholar 

  18. 18.

    Tobin JF, Schleif RF: Transcription from the rha operon psr promoter. J Mol Biol. 1990, 211: 1-4. 10.1016/0022-2836(90)90003-5

    CAS  Article  Google Scholar 

  19. 19.

    Guzman LM, Belin D, Carson MJ, Beckwith J: Tight regulation, modulation, and high-level expression by vectors containing the arabinose PBAD promoter. J Bacteriol. 1995, 177: 4121-4130.

    CAS  Google Scholar 

  20. 20.

    Egan SM, Schleif RF: A regulatory cascade in the induction of rhaBAD. J Mol Biol. 1993, 234: 87-98. 10.1006/jmbi.1993.1565

    CAS  Article  Google Scholar 

  21. 21.

    Cousin D, Buttin G: Thermosensitive mutants of K12 Escherichia coli. 3. A lethal mutation of E. coli affecting the activity of the adenylate kinase. Ann Inst Pasteur (Paris). 1969, 117: 612-630.

    CAS  Google Scholar 

  22. 22.

    Glaser M, Nulty W, Vagelos PR: Role of adenylate kinase in the regulation of macromolecular biosynthesis in a putative mutant of Escherichia coli defective in membrane phospholipid biosynthesis. J Bacteriol. 1975, 123: 128-136.

    CAS  Google Scholar 

  23. 23.

    Kuzminov A, Schabtach E, Stahl FW: Chi sites in combination with RecA protein increase the survival of linear DNA in Escherichia coli by inactivating exoV activity of RecBCD nuclease. EMBO J. 1994, 13: 2764-2776.

    CAS  Google Scholar 

  24. 24.

    Tobias JW, Shrader TE, Rocap G, Varshavsky A: The N-end rule in bacteria. Science. 1991, 254: 1374-1377. 10.1126/science.1962196

    CAS  Article  Google Scholar 

  25. 25.

    Antoine R, Locht C: Isolation and molecular characterization of a novel broad-host-range plasmid from Bordetella bronchiseptica with sequence similarities to plasmids from gram-positive organisms. Mol Microbiol. 1992, 6: 1785-1799. 10.1111/j.1365-2958.1992.tb01351.x

    CAS  Article  Google Scholar 

  26. 26.

    Sambrook J, Russell DW: Molecular cloning : a laboratory manual. 2001, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 3

    Google Scholar 

  27. 27.

    Bi X, Liu LF: RecA-independent and RecA-dependent intramolecular plasmid recombination - Differential homology requirement and distance effect. J Mol Biol. 1994, 235: 414-423. 10.1006/jmbi.1994.1002

    CAS  Article  Google Scholar 

  28. 28.

    Wang Y, Weng J, Waseem R, Yin X, Zhang R, Shen Q: Bacillus subtilis genome editing using ssDNA with short homology regions. Nucleic Acids Res. 2012, 40: e91- 10.1093/nar/gks248

    CAS  Article  Google Scholar 

  29. 29.

    Kuehne SA, Minton NP: ClosTron-mediated engineering of Clostridium. Bioengineered. 2012, 3: 245-252.

    Article  Google Scholar 

  30. 30.

    Wach A, Brachat A, Pohlmann R, Philippsen P: New heterologous modules for classical or PCR-based gene disruptions in Saccharomyces cerevisiae. Yeast. 1994. 10. 1793-1808.

    CAS  Article  Google Scholar 

  31. 31.

    Dykxhoorn DM, St Pierre R, Linn T: A set of compatible tac promoter expression vectors. Gene. 1996, 177: 133-136. 10.1016/0378-1119(96)00289-2

    CAS  Article  Google Scholar 

  32. 32.

    Kolisnychenko V, Plunkett G, Herring CD, Feher T, Posfai J, Blattner FR, Posfai G: Engineering a reduced Escherichia coli genome. Genome Res. 2002, 12: 640-647. 10.1101/gr.217202

    CAS  Article  Google Scholar 

  33. 33.

    Hayashi K, Morooka N, Yamamoto Y, Fujita K, Isono K, Choi S, Ohtsubo E, Baba T, Wanner BL, Mori H, Horiuchi T: Highly accurate genome sequences of Escherichia coli K-12 strains MG1655 and W3110. Mol Syst Biol. 2006, 2: 0007-

    Article  Google Scholar 

  34. 34.

    Durfee T, Nelson R, Baldwin S, Plunkett G, Burland V, Mau B, Petrosino JF, Qin X, Muzny DM, Ayele M, et al: The complete genome sequence of Escherichia coli DH10B: insights into the biology of a laboratory workhorse. J Bacteriol. 2008, 190: 2597-2606. 10.1128/JB.01695-07

    CAS  Article  Google Scholar 

Download references


This work was funded by the EU FP6 projects Eurobiosyn and NANOMOT and the ESF project Nanocell. The authors are indebted to Tania Roberts for proofreading of the manuscript.

Author information



Corresponding author

Correspondence to Sven Panke.

Additional information

Competing interests

The authors declare that there are no competing interests.

Authors’ contributions

SB performed the experiments, SB and SP designed the experimental approach and wrote the manuscript, SP supervised the research. All authors read and approved the final manuscript.

Electronic supplementary material

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and Permissions

About this article

Cite this article

Billerbeck, S., Panke, S. A genetic replacement system for selection-based engineering of essential proteins. Microb Cell Fact 11, 110 (2012).

Download citation


  • Essential Gene
  • Kanamycin Resistance Cassette
  • Chaperonin GroEL
  • Complementation Vector
  • araBAD Promoter