Skip to main content

Restriction site free cloning (RSFC) plasmid family for seamless, sequence independent cloning in Pichia pastoris



Tagging proteins is a standard method facilitating protein detection, purification or targeting. When tagging a certain protein of interest, it is challenging to predict which tag will give optimal results and will not interfere with protein folding, activity or yields. Ideally, multiple tags and positions are tested which however complicates molecular cloning and expression vector generation. In conventional cloning, tags are either added on PCR primers (requiring a distinct primer and PCR product per tag) or provided on the vector (typically leaving a restriction site scar).


Here we report a vector family of 40 plasmids allowing simple, seamless fusions of a single PCR product with various N- and C-terminal tags, signal sequences and promoters. The restriction site free cloning (RSFC) strategy presented in this paper relies on seamless cloning using type IIS restriction endonucleases. After cutting out a stuffer (placeholder) fragment from the vectors, a single PCR product can be directly inserted in frame into all 40 plasmids using blunt end or TA ligations, requiring only verification of the orientation. We have established a RSFC vector family for the commonly used protein expression host Pichia pastoris and demonstrated the system with the secretory expression of horseradish peroxidase (HRP). HRP fusions to four tags (Myc, FLAG, His, Strep) and two fusion proteins (GFP and MBP) showed a 31-fold difference in volumetric activities. C-terminal tagging caused in some cases almost a complete loss of function, whereas N-terminal tags showed moderate differences.


The RSFC vectors provide an unprecedented toolbox for expression optimization in P. pastoris. The results obtained with HRP underline the importance of comparing different tags to maximize activities of fusion proteins. In a similar fashion the RSFC strategy can be applied in other expression hosts to screen for optimal promoters, signal sequences or to facilitate the evaluation of (iso-) enzyme families.


Protein tags are commonly applied tools facilitating purification (affinity tags), enabling immuno-detection (epitope tags) or increasing solubility. Fusions to fluorescent proteins help elucidating the cellular localization and fusions to signal sequences provide specific intracellular targeting or secretion [1, 2]. However, as an extrinsic addition to a protein of interest (POI), such fusions may also show detrimental effects by affecting protein conformation, yields, activity or stability [1, 3]. The specific interactions of the POI with a certain tag are generally hard to foresee and may also depend on the position of the tag (N- or C-terminal). Unknown proteolytic processing or intracellular targeting of the POI may also influence the suitability of a specific fusion site. In addition, the same tagged protein may behave differently depending on the host system used (e.g. bacteria, yeast, higher eukaryotes) [4]. As there are large numbers of affinity, epitope tags and fusion proteins available it is challenging to predict the optimal choice for a certain POI. Therefore, commonly multiple tags are tested in N- or C- terminal positions and screened for optimal results [47].

However, preparing expression constructs containing multiple tags may require tedious cloning work. Tags are commonly provided on the plasmid adjacent to the multiple cloning site (MCS). This requires unique vectors for each tag and N-/C-terminal position. The gene of interest (GOI) needs to be cloned into the MCS via unique restriction endonuclease (RE) recognition sites. These restriction site scars remain in the protein coding sequence (CDS) and are later translated into additional amino acids, which may interfere with the POI’s properties. Also cloning strategies based on recombination such as Gateway (e.g. [8]) leave the recombination sequence as a scar in the CDS.

Ideally, tags should be fused seamlessly to the GOI i.e. without any restriction site scars or additional sequences from the MCS. Seamless cloning can be achieved by various strategies [9]. Frequently, tags are directly added by PCR as a 5′ overhang of a primer and thereby seamlessly attached to the CDS. This approach requires however a unique primer for each tag, N-/C-terminal position and each GOI.

We aimed to design a simple, seamless system to facilitate testing of multiple tags in N-/C-terminal position at minimal cost and effort (e.g. without the need to order numerous primers).

Several novel cloning methods are completely independent of REs and allow simple assembly of multiple fragments solely by short overlaps (around 25 bp) relying on in vitro ‘recombination’ (e.g. annealing of single stranded overhangs and enzymatic linkage). These methods include SLIC (sequence and ligation–independent cloning) [10], SLiCE (Seamless Ligation Cloning Extract) [11], Gibson assembly [12], CPEC (Circular Polymerase Extension Cloning) [13] and are concisely compared on the website of the Joint BioEnergy Institute (JBEI), Emeryville, CA, USA [14, 15]. All these methods may be used to seamlessly add a tag to a protein by adding the tag sequence to a PCR primer. However, there is an additional overhang required for in vitro recombination with the vector, requiring relatively long primers. Most inconveniently a new primer is needed for each tag, each position and each POI to be tested.

Therefore we have based our strategy on type IIS REs. In contrast to type II REs, which recognize and cut within a palindromic sequence, type IIS REs cut outside of a non-palindromic recognition sequence [16, 17]. Thereby RE site scars can be circumvented making type IIS REs prominent tools for seamless cloning [9]. There are various type IIS enzymes available that create different types of overhangs including up to 4 bp overhangs suitable for sticky end cloning (e.g. Eam1104I [18], BsaI [19, 20]), single base pair overhangs that can be applied for TA cloning (e.g. XcmI [21, 22], Eam1105I [23, 24], BciVI [25]) or blunt end cloning (MlyI/SchI [25, 26]), see Figure 1a.

Figure 1

Detailed outline of the restriction site free cloning (RSFC) strategy. a Recognition sites of various type IIS REs. The cleavage patterns are indicated as red lines. b Schematic workflow of restriction site free cloning. After removal of a stuffer fragment using type IIS REs, a single PCR product can be ligated into all vectors in a seamless, sequence independent fashion. The strategy is shown for four vectors but can be extended to as many as desired. c Design of the MlyI stuffer fragment for blunt end ligations. The MlyI recognition sequence is written in italics, the entire cleavage pattern is underlined. Variable bp are denoted as ‘N’. Upstream sequences may include promoters, N-terminal tags and signal sequences, downstream sequences may include C-terminal tags and stop codons. d Design of the BmrI stuffer fragment for TA cloning. Same explanation as (c), in addition the incorporation of the dA and dT residues for TA cloning via Start- and Stop/Tyr-codons are shown (red). By varying the last nucleotide ‘X’ of the Stop/Tyr codon, either translation can be terminated or a C-terminal tag linked in frame. A dA-tailed PCR product suitable for ligation is also shown.

In this study we have evaluated type IIS REs for blunt end and TA cloning and designed a restriction site free cloning (RSFC) strategy that enables simple, seamless cloning of a PCR product in frame with any desired upstream or downstream sequence in a vector. Based on this strategy, we have designed a RSFC vector family of 40 plasmids for the methylotrophic yeast Pichia pastoris, a commonly used protein production host for industrially relevant biocatalysts and biopharmaceuticals [2729]. The vectors feature different epitope and affinity tags (Myc, FLAG, His, Strep) and fusion proteins (eGFP and MBP) in N- and C-terminal position that are provided for intracellular and secretory expression.

Results and discussion

Restriction site free cloning (RSFC)

Blunt end vs. TA cloning concept

We aimed to design a vector system in which a single PCR product of a GOI can be directly fused, sequence independently to various N- or C-terminal tags provided on different plasmids. Thereby only two primers are required to test seamless fusions of multiple tags with the GOI. This design is achieved by inserting a stuffer (placeholder) fragment flanked by two type IIS RE sites in opposite orientations in all vectors (Figure 1b–d). The CDSs of different N- and/or C-terminal tags or fusion proteins are provided upstream/downstream of the stuffer fragment. By digestion using the respective type IIS RE, the stuffer fragment including the RE sites is cut out, resulting in RE site free vector backbones that can be directly ligated with the same PCR product (Figure 1b).

Commonly used type IIS RE based cloning strategies such as Golden Gate cloning [19, 20] cannot be used for this purpose as they rely on type IIS enzymes creating short overhangs such as Eam1104I or BsaI (Figure 1a). The use of these enzymes requires also RE digestion of the PCR product and the overhangs created on the vectors would differ between tags and impede seamless fusions.

Direct, sequence independent cloning of PCR products is in this context only possible by using TA cloning or blunt end ligations. These methods are in general not directional (with a few exceptions e.g. [25, 26]) and require verification of the orientation (e.g. by colony PCR, cPCR). TA cloning is based on the property of Taq-Polymerase to add a single deoxyadenine (dA) nucleotide at the 3′ ends of amplified DNA [21]. These PCR products can be directly cloned using a vector with a single 3′ deoxythymidine (dT) overhang. TA cloning works more efficiently than blunt end cloning [21], however the required dA nucleotide complicates seamless fusions to tags as it must be universally incorporated in the transitions between tag and vector. In this respect, blunt end ligations, that are completely sequence independent, are more favorable.

We designed test vectors based on type IIS REs for blunt end and TA cloning to compare their suitability. There is only one blunt end type IIS RE available that cuts outside of its recognition sequence (MlyI). MlyI has also been established for directional blunt end ligations of PCR products using a lacO, lacZ based blue-white screening in Escherichia coli [25, 26]. There are several type IIS REs available, that create a single base 3′ overhang (e.g. BmrI, BciVI, HphI, see Figure 1a). We tested commercially available preparations of these three enzymes all of which showed sufficient cleavage efficiencies (data not shown). HphI and BciVI have been previously used for TA cloning [21, 25], yet these restriction sites were present more frequently in the vector backbones we wanted to use. Therefore we used BmrI.

The basic sequence design of the transitions between the vector, the type IIS restriction sites and the stuffer fragment are shown in Figure 1c, d. For blunt end cloning using MlyI, the design is completely sequence independent (Figure 1c). For TA cloning, 3′ dT residues must be provided on the vector backbone and incorporated in the transition between vector and GOI. We solved this by using the dT nucleotide of the start codon (ATG) and the dA nucleotide of a partial stop codon (TAX), creating a 3′ dT base on the reverse strand (Figure 1d). Depending on the desired sequence context, ‘X’ may be provided on the vector side as A/G for a stop codon (translation termination) or T/C (coding for tyrosine, for linkage of C-terminal tags).

Cloning efficiencies

We compared the basic blunt end and TA cloning based system at first with expression vectors for Schizosaccharomyces pombe as these plasmids required fewer modifications in the vector backbones than the P. pastoris plasmids we intended to use. See Additional file 1: Figure S1 for plasmid maps and the “Materials and methods” section for details on the design. After cutting out the stuffer fragment using MlyI or BmrI, the vector backbones were dephosphorylated to counter act self-ligation. Primers for insert amplification were phosphorylated prior to ligation (see “Materials and methods” section for experimental details and a simple, cost effective protocol). Both cloning strategies resulted in similar transformation efficiencies (via electroporation), approximately 102–103 colony forming units (cfu)/µg DNA (in the ligation reaction) with self-made competent cells (competence with circular, supercoiled plasmids: 106–107 cfu/µg DNA) and in both cases all 10 out of 10 clones tested contained an insert. We verified the orientation by cPCR; as statistically expected approximately half the clones contained an insert in the correct orientation (blunt end/MlyI: 5 of 10, TA cloning/BmrI: 7 of 10). Additional file 2: Figure S2 outlines a simple cPCR strategy to test the correct orientation (using sequencing primers of the vector and the primers used for amplifying the insert). The vector/insert transitions were confirmed by sequencing and did not show any mutations. However, when cloning an insert into a larger set of vectors using blunt end ligations (see P. pastoris vectors below) we noticed occasionally single bp deletions of the insert adjacent to the vector transition (<5% of constructs). Sequencing of additional transformants resulted in all cases in correct sequences. Notably, the deletions were always in the 5′ ends of the insert and occurred more often after repeated freeze/thaw cycles of the PCR product. We therefore recommend aliquoting the PCR product and vector backbones and using them only once.

In general these RSFC ligations resulted in lower efficiencies (cfu/µg DNA) than comparable sticky end ligations, but still yielded sufficient numbers of transformants for our standard cloning applications. MlyI based blunt end ligations worked similarly efficient as BmrI based TA cloning. Previously, TA cloning has been reported to be more efficient than blunt end cloning [21], however the difference may arise from the different enzymes used for vector preparation in our study.

We mutated MlyI sites present in the vector backbones to enable the stuffer removal (see “Materials and methods” section for details). All mutations but one resulted in no differences in DNA yields compared to the parental plasmids. Mutating a MlyI site in the E. coli pUC origin of replication to a sequence previously reported [25, 26] decreased plasmid yields to approximately 30% of the unmutated parental plasmid (wildtype pUC: ~400 ng/µl, MlyI mutated pUC: ~120 ng/µl). The MlyI site appears also in other high copy number origins of replication (ori) and switching to a lower copy number ori would also result in lower plasmid yields. We intended to use the RSFC plasmids only for sub cloning and aimed to perform expression in P. pastoris. To this end mini prep yields (typically >5 µg) were sufficient. However if similar plasmids should be used for expression in E. coli, we would recommend to screen other possible mutations of the MlyI site using degenerate primers to restore wild type plasmid yields.

However, the blunt end/MlyI based strategy allowed completely seamless cloning whereas seamless TA cloning was hindered by the requirement for dT/dA bases in the insert/vector transition. This problem is similar to the use of typeIIS enzymes creating longer sticky end overhangs that need to be complementary between all plasmids of a vector family (for example in plasmids by BioGrammatics, Inc., Carlsbad, CA, USA and ‘Electra’ plasmids by DNA2.0, Inc., Menlo Park, CA, USA). As outlined in Figure 1d, the TA strategy can be still used for fusion of the same PCR product to different tags, however N-terminal tags are always linked via an ATG (coding for methionine/start), whereas C-terminal tags must be linked via tyrosine codons. Tyrosine is naturally a relatively rarely occurring and bulky amino acid, making it structurally rather unfavorable as a linker to an adjacent tag. In ‘Electra’ plasmids by DNA2.0 this issue is solved by adding an additional C-terminal ‘linker’ amino acid to all vectors, whereas in the RSFC strategy only vectors with C-terminal tags require a linker amino acid. Still we have solely focused on the blunt end/MlyI based strategy in the following plasmid design for P. pastoris. The blunt end/MlyI based ligations required no A-tailing step of PCR products but reached similar ligation efficiencies as TA cloning and allowed completely sequence independent fusions.

In summary, our cloning approach, relying on blunt end or TA ligations between a phosphorylated PCR product and a dephosphorylated vector backbone created by type IIS RE digestion, allowed seamless, sequence independent cloning at reasonable efficiencies. PCR products can be directly used for ligations and do not need RE digestion, therefore any insert sequence can be used (TA cloning with proof reading polymerases requires a separate dA-tailing step). There have previously been type IIS based cloning efforts using blunt end and TA ligations for the cloning of PCR fragments [2126]. However, these strategies did not allow seamless fusions and are in part with lacO, lacZ based blue white screening [25, 26], despite the convenience of directional cloning, even incompatible with seamless fusions. To distinguish our approach from these efforts and other type IIS based strategies such as Golden Gate cloning [19, 20], we have termed our approach restriction site free cloning (RSFC).

RSFC plasmids for P. pastoris as toolbox for optimizing protein production

Tags and fusion proteins

We applied the RSFC cloning strategy to design a plasmid family for P. pastoris allowing seamless fusions of a GOI with various tags and fusion proteins in N- and C-terminal position. There are different expression plasmids available for P. pastoris based on various cloning strategies such as Gateway [8], TA cloning [22, 25], sticky end type IIS ligations (plasmids by BioGrammatics, ‘Electra’ plasmids by DNA2.0) and ‘classical’ typeII RE/ligation based systems ([3032] and P. pastoris plasmids by Life Technologies, Carlsbad, CA, USA). The pCri vector family [32] is a multi-host platform, allowing to clone a single PCR product via restriction digestion and a MCS into different vectors. For P. pastoris only three pCri plasmids with a His tag are available. Therefore none of the vector systems currently available for P. pastoris offer different tags and only the BioGrammatics and Electra plasmids by DNA2.0 vectors allow seamless, yet sequence dependent cloning still requiring restriction digestion of the insert.

We designed a set of 40 RSFC plasmids for P. pastoris (termed pPpRSFC) offering different tags (Myc, FLAG, His, Strep) and fusion proteins (enhanced green fluorescent protein, eGFP and maltose binding protein, MBP) in N- and C-terminal position, see Table 1 for exact properties and Figure 2 for a schematic vector map. We have assigned numbers (#1 to #40) to the plasmids and are using them hereafter when referring to a specific construct.

Table 1 RSFC vector family designed for P. pastoris
Figure 2

Representative map of P. pastoris RSFC plasmids. The features of all RSFC plasmids designed are summarized in this schematic map. Different promoters, N/C-terminal tags, resistance markers are shown. pGAP plasmids do not contain the 3′ AOX1 homologous sequence for recombination (3′ AOX1 HS). HIS4 vectors contain in addition an ampicillin resistance cassette. The mating factor alpha signal sequence (MF alpha) is optional and only present in secretory plasmids. See Table 1 for the part combinations created in this study. Features are not drawn to scale. Exact plasmid maps are provided in the Additional file 3 in GenBank format.

After stuffer removal by MlyI digestion, a single PCR product can be cloned in a seamless and sequence independent fashion into all vectors, fused to tags and fusion proteins ranging from 18 to 1,101 bp in length. Epitope and affinity tags are included and constructs with affinity tags contain a TEV protease cleavage site to allow tag removal. The hexameric His tag is provided with and without TEV protease cleavage site. MBP is provided as a fusion protein with the potential to improve solubility and act as a purification tag, although in P. pastoris problems with proteolytic degradation have been reported [33]. eGFP is an enhanced version of GFP allowing simple fluorescence detection of tagged proteins.

When cloning a GOI into the pPpRSFC vectors, the forward primer must be designed starting at the DNA sequence coding for the 2nd amino acid of the POI (omitting the ATG start codon). The reverse primer must be designed starting (on the reverse strand) at the DNA sequence coding for the last amino acid/penultimate codon (omitting the stop codon). Especially a stop codon on the PCR product would interfere with tag fusions, therefore the start and stop codon are always provided on the vectors and must be omitted from PCR inserts.

P. pastoris is not only suitable for intracellular expression but can also produce secreted heterologous proteins at high titers while secreting only little endogenous protein [2729]. Therefore we designed all plasmids also for secretory expression using the S. cerevisiae mating factor alpha pre-pro signal sequence (MF alpha), the most commonly applied signal sequence in P. pastoris. The MF alpha sequence is processed by two proteases (Ste13 and Kex2) that cleave the amino acid sequence KREAEA at the end of MF alpha [34]. Kex2 cleaves efficiently after KR whereas the Ste13 cleavage after the EA repeat may be incomplete, depending on the following amino acids of the POI. In several cases removal of the EAEA repeats has led to a more homogenous product [35, 36]. Therefore we designed the basic MF alpha pPpRSFC plasmids (#2, 3, 5, 6, 36, 37, 39, 40) with and without the EAEA sequence. Plasmids bearing tags always contain the EAEA repeat (Table 1).

Promoters, integration events and resistance markers

The pPpRSFC plasmids are based on the pPpT4 vector family reported by Näätsaari et al. [30] and also used as a platform for the P. pastoris Electra vectors by DNA2.0. The majority of pPpRSFC plasmids contain the promoter of the alcohol oxidase 1 gene (pAOX1). This strong, tightly regulated methanol inducible promoter is most commonly used in P. pastoris [37]. We have also designed basic plasmids bearing the glyceraldehyde-3-phosphate dehydrogenase promoter (pGAP) to enable methanol free, constitutive expression (see Table 1).

In contrast to the yeast S. cerevisiae, where stable, autonomously replicating plasmids are available, circular plasmids bearing a yeast ARS (autonomously replicating sequence) are not stable in P. pastoris and genomic integration of plasmid cassettes is the method of choice for heterologous gene expression [27, 38]. Most commonly P. pastoris integration cassettes are created by linearizing plasmids or generation of linear cassettes by PCR [39, 40] and targeted to the AOX1 locus via homologous sequences. Depending on the linearization site in the plasmid, different homologous recombination events can be targeted [38]. The pPpRSFC plasmids allow linearization to target gene replacement at the AOX1 locus. Thereby the endogenous AOX1 gene is deleted and the minor AOX2 gene must take over the function of oxidizing methanol to formaldehyde. Due to the lower expression levels of AOX2, these aox1 knockout strains show a MutS (methanol utilization slow) phenotype, which may result in higher yields than a Mut+ phenotype [41, 42]. This can be achieved by linearization using BglII. If the BglII site is present in the insert, the vectors can still be linearized using the rare 8 bp SwaI sites as a failsafe backup. If a Mut+ phenotype is desired, the vectors can be linearized using unique REs cleaving in the 5′ or 3′ homologous sequence (e.g. PmeI, AseI or EcoNI, BsrBI). However, due to low homologous recombination frequencies in P. pastoris wildtype strains [30], even when targeting a gene replacement at the AOX1 locus (expected MutS phenotype), still the majority of transformants are Mut+. Therefore it is necessary to validate the Mut phenotype by growth on methanol plates.

The RSFC plasmids are based on a modular design, the promoter, N- or C-terminal tags, terminator, the resistance marker and the 3′ homologous sequence can be easily exchanged using unique restriction enzyme sites (Figure 2). Most plasmids are based on Zeocin selection, however basic expression plasmids (#35–40) were also constructed with a histidine marker to be used with auxotrophic strains. The tagged expression cassettes from the Zeocin plasmids can be easily shuffled to the histidine plasmid backbones using unique PciI and BamHI sites.

In the pPpRSFC plamids not only the transition between the insert and the vector is seamless, also the transition of the promoter to the start codon and the stop codon to the terminator are seamless. In standard RE based cloning, the MCS may interfere with translation initiation [43] and this appears relevant for the AOX1 promoter as extensions of the 5′ UTR (also caused by a MCS) were shown to negatively affect expression [44].

Applications of RSFC vectors to optimize HRP expression in P. pastoris

Effects of tags and fusions proteins in N- and C-terminal position

With the set of pPpRSFC plasmids available, we aimed to validate the system with a typical application. We tested expression of horseradish peroxidase (HRP), a commonly used reporter enzyme for signal amplification in diagnostic kits and immunohistochemistry. Secretory expression of HRP has been previously demonstrated in P. pastoris [42, 4547]. Cytoplasmic expression promised little chance of success as HRP is a secretory plant peroxidase that requires formation of disulfide bridges and is typically glycosylated in the secretory pathway [48, 49]. Still, we tested the basic pPpRSFC plasmid (#1, untagged, pAOX1) for cytoplasmic HRP expression. This construct showed neither activity in the supernatant (Figure 3) nor in the cytoplasm (data not shown). Therefore different tags were only evaluated for secretory expression. A single PCR fragment of HRP was cloned into the vectors as outlined above. All pAOX1 plasmids were linearized via BglII sites to target a gene replacement event at the AOX1 locus, and screened for a MutS phenotype, which has been reported to be more favorable for HRP expression than Mut+ [42].

Figure 3

Type of tag and position (N/C-terminal) strongly affect volumetric HRP activities. Volumetric HRP activities in the supernatant with ABTS as substrate after cultivation on methanol for 72 h are shown. The pPpRSFC plasmids used are indicated via the numbers given in Table 1, the use of a signal sequence (MFalpha), different promoter than pAOX1 and if applicable tag and position (N/C) are given. Mean values and standard deviations of biological triplicates are shown.

The different tags and positions had diverse effects on volumetric HRP activities (Figure 3) and led to valuable insights. For all tags, the N-terminal version was giving higher activities than the C-terminal version. For the larger fusion proteins (eGFP and MBP), C-terminal tagging even led to almost complete loss of activity (#10 and #30). Comparing the tagged construct with the highest activity (#21) with the construct of the lowest activity (#10) gives a 31 fold difference. N-terminal tagging with the relatively large eGFP (and MBP) did not strongly affect activity, whereas shorter tags (Myc, FLAG, His, Strep) showed varying effects. The N-terminally His tagged construct with TEV protease cleavage site (#25) showed the lowest activity of all N-terminally tagged proteins. However, the N-terminal His tagged construct without TEV protease cleavage site (#21) showed activity similar to other tags, hinting a negative effect of the TEV protease cleavage site in this context. Changes of the MFalpha sequence by removal of the EAEA sequence decreased activity 1.6 fold with the methanol inducible AOX1 promoter (#2 vs. #3). With the constitutive GAP promoter (#5 vs. #6), removal of the EAEA sequence even led to a 17 fold decrease in activity. A possible mechanistic explanation would be that the EAEA repeats improved secretion due to increased Kex2 cleavage efficiencies [50, 51]. pGAP driven HRP expression was therefore, depending on the presence of EAEA repeats, competitive to the methanol inducible pAOX1. Due to shorter process times (not requiring methanol induction) pGAP driven expression may even be more favorable for HRP production regarding space time yields and process setup.

The effects seen on volumetric activities by using different tags may be caused by various reasons. On the one hand the tags may have interfered to a different extent with protein folding or access of the substrate to the active site thereby negatively affecting activity. On the other hand they also may have affected the protein yields by altering the protein stability, interfering with the secretion process or even on the mRNA level with transcript stability. Also the tags or linker sequence may have targeted proteolytic degradation, as previously reported for MBP in P. pastoris [33]. However, as we aimed only to evaluate the suitability of the RSFC strategy for screening different tags, we did not further investigate the underlying causes. The pPpRSFC plasmid family proved to be a simple tool to optimize volumetric activities of tagged HRP, showing that especially the tag positions and presence of EAEA repeats are crucial factors.

Fluorescence microscopy of strains expressing eGFP tagged HRP

GFP has routinely been used in P. pastoris as an intracellular reporter for comparing promoter activities [5254] and to facilitate screening of protein production by testing fusions proteins [55], especially for membrane proteins [5658]. Concerning GFP fusions of secretory proteins, conflicting results were obtained. In some cases GFP was successfully used as secretion reporter and for protein fusions [5962]. In other cases problems with secretion (e.g. intracellular retention) were noticed [6366]. As we had also designed N- and C-terminal fusions with eGFP (including the MFalpha signal sequence for secretion, #9 and #10), we performed fluorescence microscopy to investigate possible cellular retention and bottlenecks in the HRP secretion process.

The N-terminal eGFP-HRP fusion exhibited largely unchanged HRP activity, whereas the C-terminally tagged version had almost completely lost activity (Figure 3). We also included controls of intracellular eGFP expression (#1) and secretory eGFP alone (without an HRP fusion, created by self-ligating #9). Fluorescence microscopy images of methanol grown cells are shown in (Figure 4). While cytoplasmic expression showed bright fluorescence of the whole cell (Figure 4d), all secretory constructs (Figure 4a–c) showed punctate structures. These structures appeared somewhat similar to ER or Golgi mistargeting observed previously when expressing a GFP tagged membrane protein (human µ-opioid receptor, a G-protein coupled receptor) [65]. Most notably also the control of eGFP alone (Figure 4c), without an HRP fusion showed this retention. eGFP was apparently poorly secreted in P. pastoris and effects evoked by HRP may be masked and outweighed by the poor eGFP secretion.

Figure 4

Fluorescence microscopy of fusions of HRP to GFP. Bright field images are shown on top, fluorescence images below. a HRP N-terminally tagged with eGFP (#9-MFalpha-N-eGFP-HRP), b HRP C-terminally tagged with eGFP (#10-MFalpha-C-eGFP-HRP), c control of eGFP with MFalpha (self-ligated #9), d control of cytoplasmic eGFP expression (#1-eGFP), e negative control of empty MutS strain. For the bright field image of panel (c) brightness was decreased −11%, contrast was increased +44% for better comparability with the other panels.

We also measured eGFP fluorescence in the supernatant and the cell fraction (Additional file 4: Figure S3). Fluorescence in the supernatant could be detected for secretory constructs (Additional file 4: Figure S3a–c), while the cytoplasmic eGFP expression control (Additional file 4: Figure S3d) showed only marginal fluorescence in the supernatant. However, also for the secretory constructs (Additional file 4: Figure S3a–c) intracellular fluorescence surpassed fluorescence in the supernatant approximately 5- to 12-fold. These results suggested together with the microscopy images, that large amounts of eGFP were withheld in the secretion process. In this respect, eGFP fusion proteins may be used with caution when investigating secretory processes in P. pastoris. However, these effects may also be protein dependent, as there were cases reported where GFP was successfully used to evaluate signal sequences [59, 60] and some GFP fusion proteins were sufficiently secreted [61, 62].

Conclusions and outlook

The RSFC cloning strategy outlined here and the pPpRSFC plasmid family are simple tools to optimize expression of tagged proteins with little cloning efforts. RSFC requires at first the design and assembly of the vector family to be used. However, subsequent screening is drastically facilitated as large amounts of vector backbones can be prepared at once by MlyI digestion. Subsequently, the backbones ready for cloning can be frozen as aliquots and thawed when needed.

There have been systems reported previously that allow testing of the expression of a POI in different expression hosts by using only two PCR products [67]. This approach is based on ligation-independent cloning (LIC) similar to [1013, 68]. While these methods allow highly efficient, seamless cloning, they rely on the annealing of single stranded overhangs, thereby requiring identical sequences between vector and insert. Therefore these methods are not suitable for seamless, sequence independent fusions possible with RSFC. However, as a downside of RSFC the blunt end ligations work less efficiently than annealing based in vitro recombination methods [1013] and confirmation of the correct orientation is required. Otherwise only about 50% of the transformants show the desired orientation which is a disadvantage for library approaches. Nevertheless, after stuffer removal, inserts can also be cloned directionally into RSFC plasmids by in vitro recombination methods (such as Gibson assembly [12]). However this task requires the design of a separate primer for every tag and position to be tested as the overhang required for annealing changes between the vectors. We recommend using in vitro recombination methods with pPpRSFC plasmids when testing only a few constructs. When testing a larger number of constructs, the increased costs for primers and materials associated with in vitro recombination methods outweigh the costs for cPCRs to test the orientation of blunt end ligations. When performing a single experiment and cloning a low number of GOIs with only one tag, it will be more effort to set up a RSFC vector than to order a few long primers. However, if routinely a large number of GOIs should be screened with a set of tags in different positions, RSFC vectors are a powerful strategy.

A limitation of the RSFC system reported here is the use of MlyI, the only type IIS enzyme performing a blunt end cleavage. The recognition sequence of MlyI is five bp long (Figure 1a), thereby posing a problem as it appears statistically once per 512 bp (45/2) [69]. This may require frequent removal of MlyI sites in the vector backbones to be used. MlyI sites in CDSs of tags, fusion proteins and resistance markers can be easily removed by mutating the MlyI sequence to synonymous codons. However, mutating sites present in promoters, terminators or origins of replication have to be validated for unchanged functionality (or must be exchanged for parts free of MlyI sites). These issues could be solved by using artificial type IIS REs with longer recognition sequences. The catalytic domain of the archetypical type IIS enzyme FokI has been fused to I-SceI, a homing endonuclease with an 18 bp recognition sequence. This chimeric meganuclease showed sufficient cleavage resulting in 4 bp ‘sticky’ overhangs that could be ligated at 90% fidelity [69]. Following this strategy, the catalytic domain of MlyI (which is similar to FokI [70]) could also be fused to I-SceI. Statistically an 18 bp recognition sequence would appear approximately once in 1011 (418) bp, however I-SceI recognizes also slightly degenerate sequences leading to an estimated appearance once in 108 bp [69, 71, 72], which would still surpass the specificity of wildtype MlyI by several orders of magnitude.

Most vectors for P. pastoris have been conceptualized solely as straightforward expression vectors ([8, 22, 25, 30, 31] and P. pastoris plasmids by Life Technologies, BioGrammatics and DNA2.0) and few plasmid families allow to fine-tune expression [30, 31]. The 40 plasmids reported here extend the scope of applications and facilitate characterization and optimization of the production of heterologous proteins in P. pastoris. The RSFC strategy outlined here is not limited to tags and fusions proteins, but could also be applied to compare different promoters or signal sequences in other expression systems. Similarly, isoenzymes or families of homologous enzymes can be fused to tags to screen for better expression, solubility or other properties to identify enzymes combining desired biological, chemical and technological features.

Materials and methods

Chemicals and media

Phusion DNA Polymerase, restriction endonucleases and other DNA modifying enzymes were acquired from Thermo Fisher Scientific (Waltham, MA, USA) or New England Biolabs (Ipswich, MA, USA). Miscellaneous chemicals were purchased from Becton, Dickinson and Company (Franklin Lakes, NJ, USA), Carl Roth (Karlsruhe, Germany) and Fresenius Kabi Austria (Graz, Austria).

Plasmids were isolated using a GeneJET Plasmid Miniprep Kit by Thermo Fisher Scientific. The standard protocol was optimized for MlyI based constructs to compensate the decreased plasmid yields. A single colony of a strain bearing the respective plasmid was streaked on an agar plate containing the respective antibiotic. After incubation overnight, a cell pellet (approximately 0.1 g wet cells) was scratched of the plate and used for the isolations (final elution volume: 100 µl of ddH2O).

Agarose embedded DNA, restriction digests and PCRs were purified using a Wizard SV Gel and PCR Clean-Up System by Promega.

P. pastoris strains were grown on full medium (yeast extract, peptone, 2% glucose, YPD), buffered minimal dextrose (BMD) and buffered minimal methanol medium with 0.5% methanol (BMM) as described by Weis et al. [16]. As only exception we used 2% glucose in the BMD medium and for HRP expression, media were supplemented with 1 mmol/l ferrous sulfate heptahydrate (FeSO4.7H2O). Escherichia coli strains were selected on LB-medium containing 25 μg/ml Zeocin™ (Life Technologies, Carlsbad, CA, USA). P. pastoris transformants were selected on YPD agar plates containing 100 μg/ml Zeocin. Primers were ordered from Integrated DNA Technologies (Leuven, Belgium), see Additional file 5: Table S1 for the sequences.

Plasmid construction

pombe RSFC test vectors pGAZ2-TA-BmrI-stuffer and pGAZ2-Blunt-MlyI-stuffer

For all cloning work an E. coli Top10 F’ strain was used. The vectors for initially comparing blunt end and TA cloning were based on a replicative S. pombe vector pGAZ2 (Additional file 1: Figure S1, unpublished results). For the TA-cloning vector ‘pGAZ2-TA-BmrI-stuffer‘, a stuffer fragment was amplified using primers TA_fwd_HindIII+BmrI+stuffer and TA_rev_BamHI+BmrI+stuffer (see Additional file 5: Table S1) and cloned into pGAZ2 via HindIII and BamHI sites. The stuffer fragment was selected as a sequence that has no significant homology to E. coli and S. pombe genomes and lacks MlyI, BmrI, HindIII and BamHI RE sites; we used a part of a P. pastoris alpha, alpha trehalase gene. The ‘pGAZ2-Blunt-MlyI-stuffer’ vector required mutating two MlyI sites in the vector backbone. This was done by PCR amplifying the vector using primers pUC_mut_MlyI_fwd + pUC_mut_MlyI_rev and ZeoCDS_mut_MlyI_fwd + ZeoCDS_mut_MlyI_rev using Pfu Ultra polymerase (Agilent Technologies, Santa Clara, CA) followed by DpnI digestion to remove template vector. The MlyI site in the pUC was mutated to the sequence reported by Rao et al. [25], the MlyI site in the zeocin resistance gene was mutated to a synonymous codon. After transformation, introduction of the correct mutations were confirmed by Sanger sequencing. Both plasmids do not provide seamless fusions, as the stuffer fragments were for convenience inserted via HindIII and BamHI sites leaving RE site scars. For test purposes the gene coding for Thermomyces lanuginosus endo-beta-1,4-D-xylanase was amplified using primers Xyla_fwd and Xyla_rev and cloned into the two vectors (detailed protocol see below).

P. pastoris pPpRSFC plasmid family

The expression plasmids used in this study are based on the pPpT4 P. pastoris/E. coli shuttle vector family (e.g. GenBank accession number JQ519690.1) reported by Näätsaari et al. [30]. Two MlyI sites in the backbone (pUC and zeocin resticane gene) were mutated in the same way as in the S. pombe plasmids of this study (Additional file 1: Figure S1; same primers as in Additional file 5: Table S1) and confirmed by sequencing. The AOX1 promoter, MlyI stuffer fragment and AOX1 terminator were amplified in separate PCR reactions using primers PAOX1_PciIF/OePAox1StufferR, OestufferF/OeStufferR and OeAox1TTstufferF/Aox1TT_BamHIR respectively. In the subsequent overlap extension PCR reactions the fragments were joined together using primer pair PAOX1_PciIF/AOX1TT_BamHIR followed by restriction with PciI/BamHI and were cloned in a vector backbone with mutated MlyI sites to create an intermediatory plasmid backbone termed ‘pPp’. The stuffer fragment sequence was selected from as a sequence that has no homology to E. coli and P. pastoris and lacks unique RE used in the pPpRSFC plasmid family. We selected a part of a gene involved in the S. cerevisiae biotin metabolism. An EcoRI site in the stuffer was mutated using primers pairs OeEcoRIF and OeEcoRIR. There appeared a few additional mutations in the stuffer that had no functional consequences and where therefore left unchanged (see plasmid sequences in Additional file 3).

For constitutive plasmids, the GAP promoter was amplified via primers GAP_PciIF/OeGapStuffR and was cloned into the pPp backbone using PciI/EcoRI to create #4 (pPpRSFC-pGAP). The 3′ AOX1 homologous sequence was amplified via primers 3′AOX1_PstIASCIF/3′AOX1_KpnISwaIR and was cloned into pPp using KpnI/PstI restriction sites to create #1 (pPpRSFC). For secretory expression plasmids, the MFAlpha sequence was amplified using primer pair AlphaFSSF/AlphaEcoRIR (or aEAEAEcoRIR for insertion of Glu-Ala repeats). The AOX1/GAP promoters were amplified via primers PAOX1_PciIF + OeAlphaPAox1R/GAP_PciIF+ OeGapAlphaR. The MFAlpha sequence was fused with pAOX1/pGAP using primers PAOX1_PciIF+ AlphaEcoRIR (expression cassette for #3) or PAOX1_PciIF+ aEAEAEcoRIR (expression cassette for #2),/GAP_PciIF+ AlphaEcoRIR (expression cassette for #6) or GAP_PciIF+ aEAEAEcoRIR (expression cassette for #5). The pAOX1-MFAlpha PCR products were cloned into pPpRFSC via PciI/EcoRI sites to create #3 and #2. The pGap-MFAlpha fusion construct was cloned into pPpRSFC-pGAP via PciI/EcoRI restriction site to construct pPpRSFC-#6 and #5.

pGAP plasmids do not contain the 3′ AOX1 sequence for homologous integration in the AOX1 locus. Plasmids #1 to #6 were made initially and completely sequenced. In the creation of the following plasmids, only newly inserted parts (and the RE sites used for cloning) were confirmed by sequencing. A full description of how the 28 plasmids (#7–#40) with the N- and C- terminal tags were created would be beyond the scope of this section and is provided in the Additional file 5: Table S1 (spreadsheets on plasmid construction). For further details consult the annotated plasmid sequences provided in Additional file 3.

The HRP gene (isoenzyme A2A [46, 47]) was amplified using primers HRP-A2-RSFC-fwd and HRP-A2-RSFC-rev and cloned in the respective vectors (detailed protocol see below).

RSFC cloning of inserts and colony PCRs

For blunt end cloning, the vector backbone was dephosphorylated using either Thermo Scientific shrimp alkaline phosphatase or FastAP according to the manufacturer’s recommendations. The backbone was gel purified and used for ligations with phosphorylated PCR products. Prior, PCR primers were phosphorylated using Thermo Scientific T4 Polynucleotide Kinase according to the manufacturer’s recommendations. Subsequently the reaction mixtures containing the phosphorylated primers were desalted on nitrocellulose filters (Merck Millipore, Darmstadt, Germany) and added to the PCR (Phusion polymerase). Ligations were performed using the blunt end protocol provided for Thermo Scientific T4 DNA Ligase.

For TA cloning, phosphorylated Phusion PCR products were purified (Promega Wizard SV Gel and PCR Clean-Up System) and dA-tailed using Taq-Polymerase (GoTaq Flexi, Promega [Fitchburg, WI, USA], standard buffer, 0.2 mmol/l dATP, 30 min incubation at 72°C) and directly used for ligation (blunt end protocol provided for Thermo Scientific T4 DNA Ligase).

To verify the correct orientation by colony PCR, primers were selected as outlined in Additional file 2: Figure S2. A tiny amount of an E. coli colony from a transformation of the respective ligation was added to a GoTaq Flexi reaction. The manufacturer’s protocol was followed except reducing the reaction volume to 20 µl and increasing the initial denaturing step to five min to break the cells.

P. pastoris transformations and screening

For testing the pPpRSFC plasmids, the P. pastoris CBS7435 wildtype strain was used. Plasmids bearing the AOX1 promoter were linearized with BglII, plasmids with pGAP were linearized with SwaI. All linearized plasmids were transformed into competent P. pastoris cells prepared by the condensed protocol reported by Lin-Cereghino et al. [73]. Only low amounts of DNA (0.5 µg) were used for the transformations to avoid multicopy integration. A landscape of 80 clones was screened and checked for the desired MutS phenotype on minimal methanol plates. Ten MutS clones were rescreened for uniform expression; a single representative clone was used for the subsequent characterizations. Screenings, rescreening and characterizations were performed in deep well plates as described previously [74]. BMD 2% was used instead of BMD 1% (giving higher yields, data not shown) and the methanol induction was performed in 12 h intervals for 72 h.

HRP activity assay, eGFP fluorescence microscopy and measurements

HRP activity assays with 2,2′-azino-bis(3-ethylbenzthiazoline-6-sulphonic acid) diammonium salt (ABTS) were performed as described previously [42]. For intracellular HRP activity measurements, cells were broken using Yeast Protein Extraction Reagent (Y-PER from Thermo Scientific).

The cell suspensions of eGFP expressing strains were centrifuged and washed in an equal amount of water before fluorescence microscopy (Leica Microsystems, Germany, DM LB2, DFC350FX) at 1,000-fold magnification, fluorescence images were taken using filter set ‘I3’ [excitation filter BP 450–490]. eGFP fluorescence (ex/em 488/507 nm) and OD600 were measured in a Synergy MX plate reader (Biotek, Winooski, VT, USA) using micro titer plates (Nunc MicroWell 96-Well Optical-Bottom Plates with Polymer Base, Black; Thermo Fisher Scientific). Cell suspensions were diluted to be within the linear range. The background measurements of diluted medium were subtracted. Subsequently the relative fluorescence units were normalized per OD600.


  1. 1.

    Malhotra A (2009) Tagging for protein expression. Methods Enzymol 463:239–258

    CAS  Article  Google Scholar 

  2. 2.

    Su WW (2005) Fluorescent proteins as tools to aid protein production. Microb Cell Fact 4:12

    Article  Google Scholar 

  3. 3.

    Arnau J, Lauritzen C, Petersen GE, Pedersen J (2006) Current strategies for the use of affinity tags and tag removal for the purification of recombinant proteins. Protein Expr Purif 48:1–13

    CAS  Article  Google Scholar 

  4. 4.

    Lichty JJ, Malecki JL, Agnew HD, Michelson-Horowitz DJ, Tan S (2005) Comparison of affinity tags for protein purification. Protein Expr Purif 41:98–105

    CAS  Article  Google Scholar 

  5. 5.

    Niiranen L, Espelid S, Karlsen CR, Mustonen M, Paulsen SM, Heikinheimo P et al (2007) Comparative expression study to increase the solubility of cold adapted Vibrio proteins in Escherichia coli. Protein Expr Purif 52:210–218

    CAS  Article  Google Scholar 

  6. 6.

    Gordon E, Horsefield R, Swarts HGP, de Pont JJHHM, Neutze R, Snijder A (2008) Effective high-throughput overproduction of membrane proteins in Escherichia coli. Protein Expr Purif 62:1–8

    CAS  Article  Google Scholar 

  7. 7.

    Mohanty AK, Wiener MC (2004) Membrane protein expression and production: effects of polyhistidine tag length and position. Protein Expr Purif 33:311–325

    CAS  Article  Google Scholar 

  8. 8.

    Sasagawa T, Matsui M, Kobayashi Y, Otagiri M, Moriya S, Sakamoto Y et al (2011) High-throughput recombinant gene expression systems in Pichia pastoris using newly developed plasmid vectors. Plasmid 65:65–69

    CAS  Article  Google Scholar 

  9. 9.

    Lu Q (2005) Seamless cloning and gene fusion. Trends Biotechnol 23:199–207

    CAS  Article  Google Scholar 

  10. 10.

    Li MZ, Elledge SJ (2007) Harnessing homologous recombination in vitro to generate recombinant DNA via SLIC. Nat Methods 4:251–256

    CAS  Article  Google Scholar 

  11. 11.

    Zhang Y, Werling U, Edelmann W (2012) SLiCE: a novel bacterial cell extract-based DNA cloning method. Nucleic Acids Res 40:e55

    CAS  Article  Google Scholar 

  12. 12.

    Gibson DG, Young L, Chuang R, Venter JC, Hutchison CA, Smith HO (2009) Enzymatic assembly of DNA molecules up to several hundred kilobases. Nat Methods 6:343–345

    CAS  Article  Google Scholar 

  13. 13.

    Quan J, Tian J (2009) Circular polymerase extension cloning of complex gene libraries and pathways. PLoS ONE 4:e6441

    Article  Google Scholar 

  14. 14.

    Online manual to ‘j5 automated DNA assembly’. Accessed 26 Mar 2015

  15. 15.

    Hillson NJ, Rosengarten RD, Keasling JD (2012) j5 DNA assembly design automation software. ACS Synth Biol 1:14–21

    CAS  Article  Google Scholar 

  16. 16.

    Bath AJ, Milsom SE, Gormley NA, Halford SE (2002) Many type IIs restriction endonucleases interact with two recognition sites before cleaving DNA. J Biol Chem 277:4024–4033

    CAS  Article  Google Scholar 

  17. 17.

    Roberts RJ, Vincze T, Posfai J, Macelis D (2010) REBASE–a database for DNA restriction and modification: enzymes, genes and genomes. Nucleic Acids Res 38(Database issue):D234–D236

    CAS  Article  Google Scholar 

  18. 18.

    Padgett KA, Sorge JA (1996) Creating seamless junctions independent of restriction sites in PCR cloning. Gene 168:31–35

    CAS  Article  Google Scholar 

  19. 19.

    Engler C, Kandzia R, Marillonnet S (2008) A one pot, one step, precision cloning method with high throughput capability. PLoS One 3:e3647

    Article  Google Scholar 

  20. 20.

    Engler C, Gruetzner R, Kandzia R, Marillonnet S (2009) Golden gate shuffling: a one-pot DNA shuffling method based on type IIs restriction enzymes. PLoS One 4:e5553

    Article  Google Scholar 

  21. 21.

    Mead DA, Pey NK, Herrnstadt C, Marcil RA, Smith LM (1991) A universal method for the direct cloning of PCR amplified nucleic acid. Biotechnology (N Y) 9:657–663

    CAS  Article  Google Scholar 

  22. 22.

    Lan D, Wang W, Sun L, Wang Y, Yang B (2011) A novel vector for expression cloning of large numbers of polymerase chain reaction products in Pichia pastoris. Afr J Microbiol Res 5:211–216

    CAS  Google Scholar 

  23. 23.

    Ido E, Hayami M (1997) Construction of T-tailed vectors derived from a pUC plasmid: a rapid system for direct cloning of unmodified PCR products. Biosci Biotechnol Biochem 61:1766–1767

    CAS  Article  Google Scholar 

  24. 24.

    Reisinger C, Kern A, Fesko K, Schwab H (2007) An efficient plasmid vector for expression cloning of large numbers of PCR fragments in Escherichia coli. Appl Microbiol Biotechnol 77:241–244

    CAS  Article  Google Scholar 

  25. 25.

    Rao B, Zhong X, Wang Y, Wu Q, Jiang Z, Ma L (2010) Efficient vectors for expression cloning of large numbers of PCR fragments in P. pastoris. Yeast 27(5):285–292. doi:10.1002/yea.1753

    CAS  Article  Google Scholar 

  26. 26.

    Wu Q, Zhong X, Zhai C, Yang J, Chen X, Chen L et al (2010) A series of novel directional cloning and expression vectors for blunt-end ligation of PCR products. Biotechnol Lett 32:439–443

    CAS  Article  Google Scholar 

  27. 27.

    Cregg JM, Tolstorukov I, Kusari A, Sunga J, Madden K, Chappell T (2009) Expression in the yeast Pichia pastoris. Methods Enzymol 463:169–189

    CAS  Article  Google Scholar 

  28. 28.

    Gasser B, Prielhofer R, Marx H, Maurer M, Nocon J, Steiger M et al (2013) Pichia pastoris: protein production host and model organism for biomedical research. Futur Microbiol 8:191–208

    CAS  Article  Google Scholar 

  29. 29.

    Vogl T, Hartner FS, Glieder A (2013) New opportunities by synthetic biology for biopharmaceutical production in Pichia pastoris. Curr Opin Biotechnol 24:1094–1101

    CAS  Article  Google Scholar 

  30. 30.

    Näätsaari L, Mistlberger B, Ruth C, Hajek T, Hartner FS, Glieder A (2012) Deletion of the Pichia pastoris KU70 homologue facilitates platform strain generation for gene expression and synthetic biology. PLoS One 7:e39720

    Article  Google Scholar 

  31. 31.

    Sears IB, O’Connor J, Rossanese OW, Glick BS (1998) A versatile set of vectors for constitutive and regulated gene expression in Pichia pastoris. Yeast 14:783–790

    CAS  Article  Google Scholar 

  32. 32.

    Goulas T, Cuppari A, Garcia-Castellanos R, Snipas S, Glockshuber R, Arolas JL et al (2014) The pCri system: a vector collection for recombinant protein expression and purification. PLoS One 9:e112643

    Article  Google Scholar 

  33. 33.

    Li Z, Leung W, Yon A, Nguyen J, Perez VC, Vu J et al (2010) Secretion and proteolysis of heterologous proteins fused to the Escherichia coli maltose binding protein in Pichia pastoris. Protein Expr Purif 72:113–124

    CAS  Article  Google Scholar 

  34. 34.

    Julius D, Brake A, Blair L, Kunisawa R, Thorner J (1984) Isolation of the putative structural gene for the lysine-arginine-cleaving endopeptidase required for processing of yeast prepro-alpha-factor. Cell 37:1075–1089

    CAS  Article  Google Scholar 

  35. 35.

    Briand L, Perez V, Huet JC, Danty E, Masson C, Pernollet JC (1999) Optimization of the production of a honeybee odorant-binding protein by Pichia pastoris. Protein Expr Purif 15:362–369

    CAS  Article  Google Scholar 

  36. 36.

    Treerattrakool S, Eurwilaichitr L, Udomkit A, Panyim S (2002) Secretion of Pem-CMG, a peptide in the CHH/MIH/GIH family of Penaeus monodon, in Pichia pastoris is directed by secretion signal of the alpha-mating factor from Saccharomyces cerevisiae. J Biochem Mol Biol 35:476–481

    CAS  Article  Google Scholar 

  37. 37.

    Vogl T, Glieder A (2013) Regulation of Pichia pastoris promoters and its consequences for protein production. New Biotechnol 30:385–404

    CAS  Article  Google Scholar 

  38. 38.

    Daly R, Hearn MTW (2005) Expression of heterologous proteins in Pichia pastoris: a useful experimental tool in protein engineering and production. J Mol Recognit 18:119–138

    CAS  Article  Google Scholar 

  39. 39.

    Liu Z, Pscheidt B, Avi M, Gaisberger R, Hartner FS, Schuster C et al (2008) Laboratory evolved biocatalysts for stereoselective syntheses of substituted benzaldehyde cyanohydrins. ChemBioChem 9:58–61

    CAS  Article  Google Scholar 

  40. 40.

    Pscheidt B, Liu Z, Gaisberger R, Avi M, Skranc W, Gruber K et al (2008) Efficient Biocatalytic Synthesis of (R)-Pantolactone. Adv Synth Catal 350:1943–1948

    CAS  Article  Google Scholar 

  41. 41.

    Macauley-Patrick S, Fazenda ML, McNeil B, Harvey LM (2005) Heterologous protein production using the Pichia pastoris expression system. Yeast 22:249–270

    CAS  Article  Google Scholar 

  42. 42.

    Krainer FW, Dietzsch C, Hajek T, Herwig C, Spadiut O, Glieder A (2012) Recombinant protein expression in Pichia pastoris strains with an engineered methanol utilization pathway. Microb Cell Fact 11:22

    CAS  Article  Google Scholar 

  43. 43.

    Crook NC, Freeman ES, Alper HS (2011) Re-engineering multicloning sites for function and convenience. Nucleic Acids Res 39:e92

    CAS  Article  Google Scholar 

  44. 44.

    Staley CA, Huang A, Nattestad M, Oshiro KT, Ray LE, Mulye T et al (2012) Analysis of the 5′ untranslated region (5′UTR) of the alcohol oxidase 1 (AOX1) gene in recombinant protein expression in Pichia pastoris. Gene 496:118–127

    CAS  Article  Google Scholar 

  45. 45.

    Morawski B, Lin Z, Cirino P, Joo H, Bandara G, Arnold FH (2000) Functional expression of horseradish peroxidase in Saccharomyces cerevisiae and Pichia pastoris. Protein Eng 13:377–384

    CAS  Article  Google Scholar 

  46. 46.

    Krainer FW, Pletzenauer R, Rossetti L, Herwig C, Glieder A, Spadiut O (2014) Purification and basic biochemical characterization of 19 recombinant plant peroxidase isoenzymes produced in Pichia pastoris. Protein Expr Purif 95:104–112

    CAS  Article  Google Scholar 

  47. 47.

    Näätsaari L, Krainer FW, Schubert M, Glieder A, Thallinger GG (2014) Peroxidase gene discovery from the horseradish transcriptome. BMC Genom 15:227

    Article  Google Scholar 

  48. 48.

    Veitch NC (2004) Horseradish peroxidase : a modern view of a classic enzyme. Phytochemistry 65:249–259

    CAS  Article  Google Scholar 

  49. 49.

    Lin-Cereghino GP, Stark CM, Kim D, Chang JWJ, Shaheen N, Poerwanto H et al (2013) The effect of α-mating factor secretion signal mutations on recombinant protein expression in Pichia pastoris. Gene 519:311–317

    CAS  Article  Google Scholar 

  50. 50.

    Kjeldsen T (2000) Yeast secretory expression of insulin precursors. Appl Microbiol Biotechnol 54:277–286

    CAS  Article  Google Scholar 

  51. 51.

    Thim L, Hansen MT, Norris K, Hoegh I, Boel E, Forstrom J et al (1986) Secretion and processing of insulin precursors in yeast. Proc Natl Acad Sci USA 83:6766–6770

    CAS  Article  Google Scholar 

  52. 52.

    Hartner FS, Ruth C, Langenegger D, Johnson SN, Hyka P, Lin-Cereghino GP et al (2008) Promoter library designed for fine-tuned gene expression in Pichia pastoris. Nucleic Acids Res 36:e76

    Article  Google Scholar 

  53. 53.

    Xuan Y, Zhou X, Zhang W, Zhang X, Song Z, Zhang Y (2009) An upstream activation sequence controls the expression of AOX1 gene in Pichia pastoris. FEMS Yeast Res 9:1271–1282

    CAS  Article  Google Scholar 

  54. 54.

    Vogl T, Ruth C, Pitzer J, Kickenweiz T, Glieder A (2014) Synthetic core promoters for Pichia pastoris. ACS Synth Biol 3:188–191

    CAS  Article  Google Scholar 

  55. 55.

    Al-Samarrai TH, Kirk CA, Jones WT, Harvey D, Templeton MD (2010) A simple and rapid method for selecting high producers of recombinant proteins in individual clones of P. pastoris. Comb Chem High Throughput Screen 13:377–382

    CAS  Article  Google Scholar 

  56. 56.

    Brooks CL, Morrison M, Lemieux JM (2013) Rapid expression screening of eukaryotic membrane proteins in Pichia pastoris. Protein Sci 22:425–433

    CAS  Article  Google Scholar 

  57. 57.

    Mizutani K, Yoshioka S, Mizutani Y, Iwata S, Mikami B (2011) High-throughput construction of expression system using yeast Pichia pastoris, and its application to membrane proteins. Protein Expr Purif 77:1–8

    CAS  Article  Google Scholar 

  58. 58.

    Vogl T, Thallinger GG, Zellnig G, Drew D, Cregg JM, Glieder A et al (2014) Towards improved membrane protein production in Pichia pastoris: General and specific transcriptional response to membrane protein overexpression. N Biotechnol 31:538–552

    CAS  Article  Google Scholar 

  59. 59.

    Eiden-Plach A, Zagorc T, Heintel T, Carius Y, Breinig F, Schmitt MJ (2004) Viral preprotoxin signal sequence allows efficient secretion of green fluorescent protein by Candida glabrata, Pichia pastoris, Saccharomyces cerevisiae, and Schizosaccharomyces pombe. Appl Environ Microbiol 70:961–966

    CAS  Article  Google Scholar 

  60. 60.

    Kottmeier K, Ostermann K, Bley T, Rödel G (2011) Hydrophobin signal sequence mediates efficient secretion of recombinant proteins in Pichia pastoris. Appl Microbiol Biotechnol 91:133–141

    CAS  Article  Google Scholar 

  61. 61.

    Tang J-B, Zhu P, Yang H, Sun L-M, Song S, Ji A (2008) Expression and secretion of recombinant ZZ-EGFP fusion protein by the methylotrophic yeast Pichia pastoris. Biotechnol Lett 30:1409–1414

    CAS  Article  Google Scholar 

  62. 62.

    Panjideh H, Coelho V, Dernedde J, Fuchs H, Keilholz U, Thiel E et al (2008) Production of bifunctional single-chain antibody-based fusion proteins in Pichia pastoris supernatants. Bioprocess Biosyst Eng 31:559–568

    CAS  Article  Google Scholar 

  63. 63.

    Cha HJ, Dalal NN, Bentley WE (2005) Secretion of human interleukin-2 fused with green fluorescent protein in recombinant Pichia pastoris. Appl Biochem Biotechnol 126:1–11

    CAS  Article  Google Scholar 

  64. 64.

    Passolunghi S, Brocca S, Cannizzaro L, Porro D, Lotti M (2003) Monitoring the transport of recombinant Candida rugosa lipase by a green fluorescent protein-lipase fusion. Biotechnol Lett 25:1945–1948

    CAS  Article  Google Scholar 

  65. 65.

    Sarramegna V, Talmont F, Seree de Roch M, Milon A, Demange P (2002) Green fluorescent protein as a reporter of human mu-opioid receptor overexpression and localization in the methylotrophic yeast Pichia pastoris. J Biotechnol 99:23–39

    CAS  Article  Google Scholar 

  66. 66.

    Surribas A, Resina D, Ferrer P, Valero F (2007) Rivoflavin may interfere with on-line monitoring of secreted green fluorescence protein fusion proteins in Pichia pastoris. Microb Cell Fact 6:15

    Article  Google Scholar 

  67. 67.

    Dortay H, Akula UM, Westphal C, Sittig M, Mueller-Roeber B (2011) High-throughput protein expression using a combination of ligation-independent cloning (LIC) and infrared fluorescent protein (IFP) detection. PLoS One 6:e18900

    CAS  Article  Google Scholar 

  68. 68.

    De Jong RN, Daniëls MA, Kaptein R, Folkers GE (2006) Enzyme free cloning for high throughput gene cloning and expression. J Struct Funct Genomics 7:109–118

    CAS  Article  Google Scholar 

  69. 69.

    Lippow SM, Aha PM, Parker MH, Blake WJ, Baynes BM, Lipovsek D (2009) Creation of a type IIS restriction endonuclease with a long recognition sequence. Nucleic Acids Res 37:3061–3073

    CAS  Article  Google Scholar 

  70. 70.

    Besnier CE, Kong H (2001) Converting MlyI endonuclease into a nicking enzyme by changing its oligomerization state. EMBO Rep 2:782–786

    CAS  Article  Google Scholar 

  71. 71.

    Scalley-Kim M, McConnell-Smith A, Stoddard BL (2007) Coevolution of a homing endonuclease and its host target sequence. J Mol Biol 372:1305–1319

    CAS  Article  Google Scholar 

  72. 72.

    Doyon JB, Pattanayak V, Meyer CB, Liu DR (2006) Directed evolution and substrate specificity profile of homing endonuclease I-SceI. J Am Chem Soc 128:2477–2484

    CAS  Article  Google Scholar 

  73. 73.

    Lin-Cereghino J, Wong WW, Xiong S, Giang W, Luong LT, Vu J et al (2005) Condensed protocol for competent cell preparation and transformation of the methylotrophic yeast Pichia pastoris. Biotechniques 38:44, 46, 48

  74. 74.

    Weis R, Luiten R, Skranc W, Schwab H, Wubbolts M, Glieder A (2004) Reliable high-throughput screening with Pichia pastoris by limiting yeast cell death phenomena. FEMS Yeast Res 5:179–189

    CAS  Article  Google Scholar 

Download references

Authors’ contributions

TV, MA, FWK designed the experiments, analyzed and interpreted the data. MA, TV and FWK performed the experiments. TV and MA wrote the manuscript. TV, MA, FWK, HS and AG conceived of the study. AG and HS supervised the research. All authors read and approved the final manuscript.


We gratefully acknowledge the Austrian Science Fund (FWF) project number W901 (DK ‘Molecular Enzymology’ Graz), the Higher Education Commission of Pakistan and support from NAWI Graz. We would like to thank Harald Pichler and Christine M. Winkler for fruitful discussions regarding eGFP tagging experiments and Sarah Trunk for excellent technical assistance. We would also like to thank Rafael E. Luna for critical reading of the manuscript and helpful comments in frame of a scientific story telling workshop held by the DK ‘Molecular Enzymology’ Graz.

Compliance with ethical guidelines

Competing interests The authors declare that they have no competing interests.

Author information



Corresponding author

Correspondence to Thomas Vogl.

Additional information

Thomas Vogl and Mudassar Ahmad contributed equally

Additional files

Additional file 1:

Figure S1. Vector maps of the S. pombe vectors used in this study

Additional file 2:

Figure S2. Simple strategy for confirming the orientation of the insert. The forward or reverse primer used for amplifying the insert can be used together with the forward or reverse sequencing primer of the vector to confirm the correct orientation. Upon correct primer choice only the forward orientation gives a PCR fragment. The sequencing primers designed for Sanger sequencing allow sequencing of the insert from both sides. Depending on the vector, different primers should be used (e.g. when the MFalpha signal sequence or a fusion protein is present, see the primer list for all sequencing primers available).

Additional file 3:

Plasmid sequences of the constructs used in this study in Genbank format.

Additional file 4:

Figure S3. Fluorescence measurements of fusions of HRP to eGFP. Samples are labeled in the same way as in Figure 4. eGFP fluorescence of supernatants and cell pellets of methanol induced cells were normalized per cell density (OD600).

Additional file 5:

Table S1. Primers used in this study and detailed plasmid construction. Primers used for creating the S. pombe and P. pastoris plasmids are separated. Also primers for construction of the plasmids are separated from primers for sequencing and insertion of GOIs. In addition separate spreadsheets are providing information on the exact construction of the plasmids by listing the PCR products and restriction enzymes used for assembly.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Vogl, T., Ahmad, M., Krainer, F.W. et al. Restriction site free cloning (RSFC) plasmid family for seamless, sequence independent cloning in Pichia pastoris . Microb Cell Fact 14, 103 (2015).

Download citation


  • Protein tagging
  • Protein tags
  • Seamless cloning
  • Pichia pastoris
  • Expression optimization
  • Cloning strategy
  • Type IIS restriction endonucleases