An estimate is worth about a thousand experiments: using order-of-magnitude estimates to identify cellular engineering targets

Metcalf, Kevin James; Slininger Lee, Marilyn F.; Jakobson, Christopher Matthew; Tullman-Ercek, Danielle

doi:10.1186/s12934-018-0979-7

Review
Open access
Published: 30 August 2018

Kevin James Metcalf (ORCID: orcid.org/0000-0002-2721-3378), Marilyn F. Slininger Lee, Christopher Matthew Jakobson & Danielle Tullman-Ercek

Microbial Cell Factories volume 17, Article number: 135 (2018)

Abstract

Biotechnological processes use microbes to convert abundant molecules, such as glucose, into high-value products, such as pharmaceuticals, commodity and fine chemicals, and energy. However, from the outset of the development of a new bioprocess, it is difficult to determine the feasibility, expected yields, and targets for engineering. In this review, we describe a methodology that uses rough estimates to assess the feasibility of a process, approximate the expected product titer of a biological system, and identify variables to manipulate in order to achieve the desired performance. This methodology uses estimates from literature and biological intuition, and can be applied in the early stages of a project to help plan future engineering. We highlight recent literature examples, as well as two case studies from our own work, to demonstrate the use and power of rough estimates. Describing and predicting biological function using estimates guides the research and development phase of new bioprocesses and is a useful first step to understand and build a new microbial factory.

Background

Every microbial production process begins with an exciting but daunting set of problems—we want to know if the process is feasible, and how to increase production of a complex biological system or pathway when the best approach may be non-obvious. These problems are particularly difficult if the system or pathway in question has never been used in an engineering context before. In the following review, we outline the methodology our group developed to inform these decisions. We combine simple biological and biochemical observations [1] with intuitive estimates to identify the aspects of the biological system that will have the greatest impact on product yield to guide engineering efforts (Fig. 1). Many approximate values come from the BioNumbers database [2], and we cite the BioNumber identification number (BNID), where applicable. We also use significant figures in the estimates to signify the degree of precision in an estimate, following rules outlined in the text Cell Biology by the Numbers [3]. Simultaneously, we identify other aspects of a system that are unlikely to significantly impact our targets, in order to exclude these from our engineering efforts. This exercise is a useful addition to the planning stages of any biological engineering endeavor. Below, we discuss examples from the literature, and we use case studies from our work to outline the molecular-level estimates of achievable cellular behaviors. At the end, we describe a decision scheme for applying rough estimates to a new bioprocess. We encourage others in the field to include their rough estimates for process feasibility and engineering targets in their published work.

Estimates for production of artemisinin

Rough estimates help to characterize complex biological systems and improve product yield. In a landmark study, Keasling and coworkers used Saccharomyces cerevisiae to produce artemisinic acid, a precursor to the antimalarial drug artemisinin [4]. After an economic analysis of the process, they targeted a titer of 25 g/L in production, a feat they recently achieved [5]. We show a rough estimate to justify, in retrospect, this effort. The enzyme amorphadiene synthase (ADS) from Artemisia annua catalyzes the cyclization of farnesyl diphosphate to amorpha-4,11-diene with a turnover number of 0.2 s⁻¹ [6], which is relatively slow compared to the other enzymes in the pathway [7] (Table 1).

Table 1 Turnover number (k_cat) for enzymes with published kinetic data used in the artemisinic acid pathway in S. cerevisiae [8]

We therefore assume that this enzyme is substrate-saturated and catalyzes the rate-determining reaction step. Using BioNumbers estimates for S. cerevisiae, as well as experimental values from their recent publication [5], we estimate that this process can yield up to 16 g/L artemisinic acid (calculation 1).

Estimated maximum titer of artemisinic acid in S. cerevisiae (BNIDs 104150 and 100986)

$$\begin{aligned} & \left( 1.3 \times 10^{6}\,\frac{\text{ADS}}{\text{cell}} \right)\left( \frac{1\,\text{mole ADS}}{6.02 \times 10^{23}\,\text{molecules ADS}} \right) \\ & \times \left( 15\,\text{OD} \right)\left( \frac{3 \times 10^{7}\,\text{cells}}{\text{OD} \times \text{mL}} \right)\left( 0.2\,\text{s}^{-1} \right)\left( \frac{60^{2}\,\text{s}}{1\,\text{h}} \right) \\ & \times \left( 100\,\text{h} \right)\left( \frac{10^{3}\,\text{mL}}{1\,\text{L}} \right)\left( \frac{234\,\text{g}}{\text{mol}} \right) \approx 16\,\frac{\text{g}}{\text{L}} \end{aligned}$$

(1)

This estimate assumes that ADS has an abundance of 1.3 × 10⁶ enzymes per cell, which is the upper limit of native protein copies in a yeast cell (BNID 104150). This expression level is ~ 3% of the total protein (BNID 110550), and overexpression to 2 × 10⁶ ADS/cell would result in the target titer of 25 g/L. This analysis gives us an intuition for the bioprocess and serves as a benchmark for the physical limitations of this cellular process. Further increases to the product titer could be achieved by increasing the cell density, culture time, and reaction rate of ADS. A limitation of the analysis presented here is that the rate-determining enzyme must be known, and its kinetic parameters measured or estimated. When faced with a poorly defined set of enzymes, there is no substitute for intuition and accurate guesses of enzyme activity. To aid in assessing processes with limited kinetic information, a lower bound for k_cat of 0.1 s⁻¹ captures the large majority of known activities [15].
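Calculation 1 is simple enough to reproduce in a few lines of code. The Python sketch below is an illustration, not part of the original analysis: the function name `max_titer_g_per_l` and its argument names are hypothetical labels introduced here, while every numerical input is taken from calculation 1. It assumes, as the text does, that a single substrate-saturated enzyme sets the pathway rate.

```python
AVOGADRO = 6.02e23  # molecules per mole, rounded as in calculation 1


def max_titer_g_per_l(enzymes_per_cell, od, cells_per_od_ml,
                      kcat_per_s, hours, product_mw):
    """Upper-bound titer (g/L), assuming one substrate-saturated enzyme
    catalyzes the rate-determining step for the whole culture time."""
    mol_enzyme_per_ml = enzymes_per_cell / AVOGADRO * od * cells_per_od_ml
    mol_product_per_ml = mol_enzyme_per_ml * kcat_per_s * 3600 * hours
    return mol_product_per_ml * 1e3 * product_mw  # mL -> L, mol -> g


# Inputs from calculation 1 (ADS in S. cerevisiae).
baseline = max_titer_g_per_l(1.3e6, 15, 3e7, 0.2, 100, 234)

# Titer scales linearly with enzyme copy number, so the copy number needed
# for the 25 g/L target follows by proportion.
copies_for_target = 1.3e6 * 25 / baseline

print(f"maximum titer: {baseline:.0f} g/L")
print(f"ADS copies per cell for 25 g/L: {copies_for_target:.1e}")
```

With these inputs the sketch gives a maximum titer of about 16 g/L and about 2 × 10⁶ ADS copies per cell for the 25 g/L target. Because every factor enters as a simple product, the same function shows how cell density, culture time, or k_cat would each change the bound.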

Estimates for production of electrical energy

This type of analysis is not limited to the production of molecules; microbial production of high-energy electrons is also amenable to analysis using rough estimates. For example, in a microbial fuel cell, bacteria are used as a catalyst to convert carbon-based chemical energy to electrical energy. Chaudhuri and Lovley [16] showed that the rate of metabolism, efficiency of electron transfer, and microbial density on the electrode are determining factors for predicting the current density of a microbial fuel cell. In order to improve fuel cell performance, which parameter should an engineer first modify? The authors used a straightforward calculation to determine that electron transfer is already quite efficient: the bacterium Rhodoferax ferrireducens achieved a yield of 740 coulombs (C) out of the theoretical limit of 900 C that can be extracted by the complete oxidation of the 0.39 mmol of glucose fed. On the other hand, they found that current density is improved by selecting appropriate electrode materials. Interestingly, the authors observed a twofold increase in current density with a new anode material, which correlated with a twofold increase in microbial density on the electrode. Using the authors’ measurement of 0.086 mg protein/cm², we estimate that this new material resulted in a cell density of 6 × 10⁸ cells/cm² on the anode (calculation 2).

Experimental cell density on anode (BNIDs 109352 and 103904)

$$\left( \frac{0.086\,\text{mg protein}}{\text{cm}^{2}} \right)\left( \frac{2\,\text{mg DCW}}{\text{mg protein}} \right)\left( \frac{\text{cell}}{280\,\text{fg}} \right)\left( \frac{10^{12}\,\text{fg}}{1\,\text{mg}} \right) \approx 6 \times 10^{8}\,\frac{\text{cells}}{\text{cm}^{2}}$$

(2)

Assuming that ~ 1/6 of the 6 μm² bacterial surface area (BNID 101792) is in contact with the electrode, we estimate up to 1 × 10⁸ cells/cm² could be attached to the anode (calculation 3).

Maximum cell density on anode

$$\left( {\frac{\text{cell}}{{1\,\upmu{\text{m}}^{2} }}} \right)\left( {\frac{{\left( {10^{6} } \right)^{2} {\mkern 1mu}\upmu{\text{m}}^{2} }}{{\left( {10^{2} } \right)^{2} {\mkern 1mu} {\text{cm}}^{2} }}} \right) \approx 1 \times \,10^{8} {\mkern 1mu} \frac{{\text{cells}}}{{{{\text{cm}}}^{2} }}$$

(3)
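The comparison between calculations 2 and 3 can be sketched in Python as follows. This is an illustrative sketch only: the function names are hypothetical, the inputs are those given in the two calculations, and the monolayer model assumes that each attached cell covers exactly its contact area on the electrode.

```python
def density_from_protein(mg_protein_per_cm2, mg_dcw_per_mg_protein=2.0,
                         fg_dcw_per_cell=280.0):
    """Cells per cm^2 implied by a measured protein loading (calculation 2)."""
    fg_dcw_per_cm2 = mg_protein_per_cm2 * mg_dcw_per_mg_protein * 1e12
    return fg_dcw_per_cm2 / fg_dcw_per_cell


def density_from_footprint(cell_surface_um2=6.0, contact_fraction=1 / 6):
    """Cells per cm^2 for a monolayer in which each cell covers its contact
    area on the electrode (calculation 3)."""
    um2_per_cm2 = (1e4) ** 2  # 1 cm = 10^4 micrometers
    return um2_per_cm2 / (cell_surface_um2 * contact_fraction)


measured = density_from_protein(0.086)
monolayer = density_from_footprint()

print(f"measured density:  {measured:.1e} cells/cm^2")
print(f"monolayer limit:   {monolayer:.1e} cells/cm^2")
print(f"measured / limit:  {measured / monolayer:.1f}")
```

The two densities differ by a factor of about six, which is within one order of magnitude, so the two estimates describe the same regime: an anode that is roughly covered with cells.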

This theoretical estimate is within one order of magnitude of the experimental estimate and suggests that the current density cannot be improved by increasing microbial density on the anode, as the anode surface is likely already saturated with bacteria. Further improvements to this system may also focus on the rate of metabolism of the bacterial cell. We support this claim using values estimated for Escherichia coli, which is similar in size and shape to R. ferrireducens [17]. E. coli can take up 12 mmol glucose/g dry cell weight (DCW)/h (BNID 109686) [18]. Such uptake could yield a current density of 1 mA/cm², over two orders of magnitude greater than the observed 7.4 × 10⁻³ mA/cm², assuming a similar cellular attachment density (calculation 4).

Current density at maximum glucose uptake (BNIDs 109686 and 109352)

$$\begin{aligned} \left( {12\frac{{\text{mmol glucose}}}{{{{\text{g DCW}}} \times {\text{h}}}}} \right)\left( {\frac{{{\text{g}}\,{\text{DCW}}}}{{10^{3} \,{\text{mg}}\,{\text{DCW}}}}} \right)\left( {\frac{{2\,{\text{mg}}\,{\text{DCW}}}}{\text{mg protein}}} \right) \times \left( {\frac{{0.086\,{\text{mg}}\,{\text{protein}}}}{{{\text{cm}}^{ 2} }}} \right)\left( {\frac{{1\,{\text{h}}}}{{60^{2} \,{\text{s}}}}} \right)\left( {\frac{{24\,{\text{mmol}}\,e^{ - } }}{{1\,{\text{mmol}}\,{\text{glucose}}}}} \right) \hfill \\ \times \left( {\frac{{9.65 \times 10^{4} \,{\text{mC}}}}{{1\,{\text{mmol}}\,e^{ - } }}} \right)\left( {\frac{\text{C}}{{10^{3} \,{\text{mC}}}}} \right) \approx 1\frac{{\text{mA}}}{{{{\text{cm}}}^{ 2} }} \hfill \\ \end{aligned}$$

(4)
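The charge balance and calculation 4 can be checked with a short Python sketch. The names below are hypothetical, the Faraday constant is rounded to 96.5 C per mmol of electrons as in calculation 4, and the model assumes complete oxidation of glucose (24 electrons per molecule) by every attached cell.

```python
FARADAY_C_PER_MMOL = 96.5   # C per mmol of electrons, as in calculation 4
ELECTRONS_PER_GLUCOSE = 24  # complete oxidation of glucose to CO2


def current_density_ma_per_cm2(uptake_mmol_per_g_dcw_h, mg_protein_per_cm2,
                               mg_dcw_per_mg_protein=2.0):
    """Current density (mA/cm^2) if every attached cell oxidizes glucose
    completely at the given specific uptake rate (calculation 4)."""
    g_dcw_per_cm2 = mg_protein_per_cm2 * mg_dcw_per_mg_protein / 1e3
    mmol_glucose_per_s = uptake_mmol_per_g_dcw_h * g_dcw_per_cm2 / 3600
    amps = mmol_glucose_per_s * ELECTRONS_PER_GLUCOSE * FARADAY_C_PER_MMOL
    return amps * 1e3  # A -> mA


# Charge available from the 0.39 mmol of glucose fed, and fraction recovered.
theoretical_charge_c = 0.39 * ELECTRONS_PER_GLUCOSE * FARADAY_C_PER_MMOL
recovered_fraction = 740 / theoretical_charge_c

predicted = current_density_ma_per_cm2(12, 0.086)
observed = 7.4e-3  # mA/cm^2, reported value

print(f"theoretical charge: {theoretical_charge_c:.0f} C")
print(f"fraction recovered: {recovered_fraction:.2f}")
print(f"predicted current density: {predicted:.1f} mA/cm^2")
print(f"predicted / observed: {predicted / observed:.0f}")
```

The sketch recovers a theoretical charge of about 900 C, of which roughly 80% was collected, and a predicted current density of about 1 mA/cm², more than 100 times the observed value. Electron transfer therefore leaves little room for gain, whereas metabolic rate leaves a great deal.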

Clearly, future improvements to this system should focus on increasing metabolic flux. In a related study, Nocera and colleagues showed how rough estimates can be used to improve the design of bioelectrochemical cells for fuel production from sunlight. Here, increased bacterial viability and a redesigned apparatus were offered by the authors as future system improvements [19]. Indeed, in a recent paper, the authors show how a redesigned electrode catalyst increases bacterial viability and improves efficiency of biofuel production by over 20-fold [20]. Together, these examples reveal how rough estimates of cell metabolism and physiology provide important insight into improving a bioprocess.

Case study 1: evaluation of the capacity of a protein secretion system

Many bioprocesses take advantage of existing, natural biological functions that are engineered with a top-down approach to improve function in the bioreactor environment. For such systems, estimates evaluate native function and guide experiments to modify the system for improved performance. In the following example, we describe how a protein secretion apparatus might be modified for increased protein production. Our targets are difficult-to-produce heterologous proteins. Engineering bacteria to secrete the protein product to the extracellular space is expected to improve production of these toxic or hard-to-purify proteins [21]. To achieve this activity, we adapted the type III secretion system of Salmonella enterica. When we started working on this problem, we used estimates to answer three key questions:

1. Can secretion improve the production of proteins with toxic effects?
2. What is the native capacity of the secretion system?
3. If the native capacity is below the desired production level, how should the secretion system be manipulated to achieve increased protein yield?

First, we predict the steady-state intracellular concentration of the toxic protein of interest. Cellular fitness may be increased if the rate of protein secretion is matched with the rate of protein production, such that a low intracellular concentration of the toxic protein is maintained at steady-state, while the toxic protein accumulates in the extracellular space. As an example, we consider that a 50 kDa protein of interest is produced by the ribosomes at a rate of up to 10³ proteins/s/cell (calculation 5).

Maximal translation rate per cell (BNIDs 100059 and 101441)

$$\left( 10^{1}\,\frac{\text{amino acids}}{\text{s} \times \text{ribosome}} \right)\left( \frac{1\,\text{protein}}{450\,\text{amino acids}} \right) \times \left( 10^{4}{-}10^{5}\,\frac{\text{ribosomes}}{\text{cell}} \right) \approx 10^{2}{-}10^{3}\,\frac{\text{proteins}}{\text{s} \times \text{cell}}$$

(5)

Note that in this example, we assume that all ribosomes are actively translating the protein of interest. We also estimate the secretion rate per cell from the type III secretion system using known parameters [1, 22]. Proteins are secreted at a rate of 10³–10⁴ amino acids per second per apparatus [23, 24], and each cell has 10¹–10² secretion apparatus [25]. Therefore, we estimate a secretion rate of 10¹–10³ proteins per second per cell for a 50 kDa protein (calculation 6) (Fig. 2a).

Maximal secretion rate per cell

$$\left( {10^{3} - 10^{4} \frac{{{\text{amino}}\,{\text{acids}}}}{{{\text{s}} \times {\text{apparatus}}}}} \right)\left( {10^{1} - 10^{2} \frac{{\text{apparatus}}}{{\text{cell}}}} \right) \times \left( {\frac{{1\,{\text{protein}}}}{{450\,{\text{amino}}\,{\text{acids}}}}} \right) \approx 10^{1} - 10^{3} \frac{{\text{proteins}}}{{{{\text{s}}} \times {\text{cell}}}}$$

(6)
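Calculations 5 and 6 propagate ranges rather than single values, which is easy to mirror in code. The Python sketch below is illustrative only; the function names are hypothetical, and the inputs are the order-of-magnitude bounds quoted in the two calculations.

```python
PROTEIN_LENGTH_AA = 450  # approximate length of a 50 kDa protein


def translation_rate(ribosomes_per_cell, aa_per_s_per_ribosome=10):
    """Proteins/s/cell if every ribosome translates the product (calc. 5)."""
    return aa_per_s_per_ribosome * ribosomes_per_cell / PROTEIN_LENGTH_AA


def secretion_rate(aa_per_s_per_apparatus, apparatus_per_cell):
    """Proteins/s/cell exported by the type III secretion system (calc. 6)."""
    return aa_per_s_per_apparatus * apparatus_per_cell / PROTEIN_LENGTH_AA


# Evaluate each estimate at the low and high end of its input range.
translation = (translation_rate(1e4), translation_rate(1e5))
secretion = (secretion_rate(1e3, 1e1), secretion_rate(1e4, 1e2))

# The two ranges overlap if neither lies entirely above the other.
ranges_overlap = (secretion[0] <= translation[1]
                  and translation[0] <= secretion[1])

print(f"translation: {translation[0]:.0f} to {translation[1]:.0f} proteins/s/cell")
print(f"secretion:   {secretion[0]:.0f} to {secretion[1]:.0f} proteins/s/cell")
print(f"ranges overlap: {ranges_overlap}")
```

Carrying the low and high bounds through separately makes it clear that the secretion range spans the translation range at its upper end, which is the basis for the comparison that follows.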

Our estimate of the maximal secretion rate is on the same order of magnitude as the maximal translation rate. This suggests that a low intracellular concentration of protein can be maintained by controlling the rate of translation to match the rate of secretion [26]. Thus, we expect that increased production of a toxic protein can be achieved by mitigating cytotoxic effects through maintaining a low steady-state intracellular concentration.

Now we address the second question: What is the capacity of the native protein secretion system? We desire a product titer of 10 g/L in a 72 h batch in order to compete with current industry performance [27, 28]. We estimate the secreted titer by integrating the estimated secretion rate per cell across all cells in the culture. Cultures reach an optical density of ~ 1 OD, equivalent to about 10⁹ cells/mL (BNID 104831) [29]. Only 30% of cells secrete product in this environment [30, 31], such that the predicted secreted protein titer of a 50 kDa protein is 10¹–10³ mg/L in an 8 h batch (calculation 7) (Fig. 2b).

Native production capacity (BNID 104831)

$$\begin{aligned} & \left( 8\,\text{h} \right)\left( \frac{60^{2}\,\text{s}}{1\,\text{h}} \right)\left( 10^{3}{-}10^{4}\,\frac{\text{amino acids}}{\text{s} \times \text{apparatus}} \right) \times \left( 10^{1}{-}10^{2}\,\frac{\text{apparatus}}{\text{cell}} \right)\left( 10^{9}\,\frac{\text{cells}}{\text{mL}} \right) \\ & \times \left( \frac{0.3\,\text{secretion-active cells}}{1\,\text{total cells}} \right)\left( \frac{1\,\text{protein}}{450\,\text{amino acids}} \right) \times \left( \frac{1\,\text{mole protein}}{6.02 \times 10^{23}\,\text{proteins}} \right)\left( \frac{50 \times 10^{3}\,\text{g protein}}{1\,\text{mole protein}} \right) \\ & \times \left( \frac{10^{3}\,\text{mL}}{1\,\text{L}} \right)\left( \frac{10^{3}\,\text{mg}}{1\,\text{g}} \right) \approx 10^{1}{-}10^{3}\,\frac{\text{mg}}{\text{L}} \end{aligned}$$

(7)
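Calculation 7 can be written as a single function of its parameters, which also makes the scaling with batch time explicit. The Python sketch below is an illustration under the same assumptions as calculation 7 (constant secretion rate over the whole batch, a fixed culture density of 10⁹ cells/mL, and 30% secretion-active cells); the function and argument names are hypothetical.

```python
AVOGADRO = 6.02e23  # molecules per mole, rounded as in calculation 7


def secreted_titer_mg_per_l(hours, aa_per_s_per_apparatus, apparatus_per_cell,
                            cells_per_ml=1e9, active_fraction=0.3,
                            protein_length_aa=450, protein_mw=50e3):
    """Secreted protein titer (mg/L) for a constant per-cell secretion rate."""
    seconds = hours * 3600
    proteins_per_ml = (seconds * aa_per_s_per_apparatus * apparatus_per_cell
                       * cells_per_ml * active_fraction / protein_length_aa)
    g_per_ml = proteins_per_ml / AVOGADRO * protein_mw
    return g_per_ml * 1e3 * 1e3  # mL -> L, g -> mg


# Low and high ends of the parameter ranges, for 8 h and 72 h batches.
low_8h = secreted_titer_mg_per_l(8, 1e3, 1e1)
high_8h = secreted_titer_mg_per_l(8, 1e4, 1e2)
low_72h_g = secreted_titer_mg_per_l(72, 1e3, 1e1) / 1e3
high_72h_g = secreted_titer_mg_per_l(72, 1e4, 1e2) / 1e3

print(f"8 h batch:  {low_8h:.0f} to {high_8h:.0f} mg/L")
print(f"72 h batch: {low_72h_g:.2f} to {high_72h_g:.1f} g/L")
```

The 8 h range spans roughly 10¹ to 10³ mg/L, and extending the batch to 72 h shifts the range to roughly 10⁻¹ to 10¹ g/L. Each keyword argument corresponds to one engineering variable, so the effect of changing any single parameter can be read off directly.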

This range of values agrees well with published titers of ~ 10 mg/L from an 8 h batch [32], supporting the validity of our analysis. Because the published titer falls at the low end of the estimated range, some of the parameters used in our analysis may be overestimated. This estimate also corresponds to a titer of 10⁻¹–10¹ g/L in a 72 h batch. This analysis reveals that our engineering goal of 10 g/L secreted protein might be achieved by optimizing the native secretion capacity of the type III secretion system, and identifies five parameters that contribute to secreted protein titer:

1. Fraction of cells that are secretion-active.
2. Culture density.
3. Number of apparatus per cell.
4. Secretion rate.
5. Culture time with which proteins are secreted.

This list helps us address our third question: we can now identify parameters to manipulate to achieve the target titer of 10 g/L. An increase in any of the five parameters will result in a proportional change in the product titer. Some parameters, such as culture density [33] or the fraction of cells that are secretion-active [30], can be easily manipulated. Improving the culturing conditions for high cell density culture, while maintaining secretion activity, will cause a concomitant increase in secreted protein titer. Further, the secretion activity on a per cell basis can be manipulated using transcriptional control to increase expression of type III secretion system genes [23]. Other parameters are harder to manipulate experimentally due to physiological limits. For example, if we approximate a cross-sectional area of 1000 nm²/apparatus [25], the average S. enterica cell (6 µm², BNID 103711) has 0.1–1% of its inner membrane surface area occupied by type III secretion system apparatus (calculation 8).

Surface area occupied by secretion system apparatus (BNID 103711)

$$\frac{\text{surface area of apparatus}}{\text{cellular surface area}} = \left( 1000\,\frac{\text{nm}^{2}}{\text{apparatus}} \right)\left( \frac{1\,\upmu\text{m}^{2}}{(10^{3})^{2}\,\text{nm}^{2}} \right)\left( 10{-}100\,\frac{\text{apparatus}}{\text{cell}} \right)\left( \frac{\text{cell}}{6\,\upmu\text{m}^{2}} \right) \times 100 \approx 0.1{-}1\%$$

(8)

If the number of apparatus were increased to 10³ per cell, this would increase the fraction of the inner membrane occupied by the type III secretion apparatus to 10%, likely decreasing cell viability, as this is a very high fraction of the membrane to devote to a large structure that spans both the inner and outer membrane. Thus, attempting to manipulate this variable would not likely be fruitful in achieving the desired process goal.
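The membrane occupancy argument can be sketched in Python as follows. The function name is hypothetical, and the sketch makes the division by the 6 µm² cell surface area quoted in the text explicit; its outputs should be read as order-of-magnitude values only.

```python
def membrane_fraction_percent(apparatus_per_cell, nm2_per_apparatus=1000.0,
                              cell_surface_um2=6.0):
    """Percent of the cell surface occupied by secretion apparatus."""
    um2_per_apparatus = nm2_per_apparatus / (1e3) ** 2  # 1 um = 10^3 nm
    occupied_um2 = apparatus_per_cell * um2_per_apparatus
    return 100 * occupied_um2 / cell_surface_um2


# Native range (10 to 100 per cell) and a hypothetical tenfold increase.
fractions = {n: membrane_fraction_percent(n) for n in (10, 100, 1000)}

for n, percent in fractions.items():
    print(f"{n:>4} apparatus/cell -> {percent:.2f}% of the cell surface")
```

The native range gives values of order 0.1–1%, whereas 10³ apparatus per cell gives a value of order 10%. The linear dependence on copy number is what makes this parameter expensive to push further.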

In our work, we controlled expression of the secretion system to increase the fraction of cells that are secretion-active by ~ threefold and enable a ~ threefold increase in culture density. By introducing transcriptional control, we manipulated these two key variables simultaneously and achieved a ~ tenfold increase in secreted protein titer [34]. Engineering improvements that were identified by rough estimates resulted in a bacterial strain that was able to produce and secrete heterologous proteins at high titer and enabled the production of difficult-to-express repetitive proteins [35]. We expect increased product titer by further manipulation of the five aforementioned variables. With a goal of 10 g/L in 72 h, we expect that a fivefold further increase in culture density will achieve the target titer of secreted proteins.

Case study 2: feasibility of enzyme pathway compartmentalization

Estimates can also be used to understand the physical limits of cellular properties and thus establish the upper limit on production of a desired product. Here, we consider the design of subcellular nanoreactors based on naturally occurring organelles. Subcellular structures, such as the carboxysome and the mitochondrion, are spatially and chemically segregated from the rest of the cell to create a specialized metabolic environment [36]. Inspired by these examples from Nature, a subcellular compartment optimized for production of a desired molecule could increase titer, as bioproduction and metabolic homeostasis are decoupled through spatial separation. Towards this goal, our group has sought to repurpose the native bacterial microcompartment (MCP) complex of S. enterica for metabolic engineering of diverse bioproducts. When we began this project, we first asked, “Can MCPs be used in the production of an industrially relevant compound at sufficient titer?” Again, we obtain an order-of-magnitude estimate of physical requirements for a desired product yield using rough estimates of the relevant parameters.

We calculate the feasibility of a desired titer from the amount of enzyme that can physically fit within the MCP compartment volume. We note that the MCP is approximately spherical, with a diameter of ~ 100 nm, and that the maximum number of MCPs per cell is likely to be around 100 [37, 38], such that up to 1–10% of the cell volume is occupied by MCPs. At a culture density of 1 OD, MCPs represent 0.005% of the culture volume (calculation 9) (Fig. 3a).

Culture volume fraction of microcompartments (BNID 104831)

$$\left( {\frac{{\frac{4}{3}\pi \left( {50\,{\text{nm}}} \right)^{3} }}{\text{MCP}}} \right)\left( {100\frac{{\text{MCP}}}{{\text{cell}}}} \right)\left( {\frac{{10^{9} \,{\text{cells}}}}{\text{mL}}} \right)\left( {\frac{{\,{\text{mL}}}}{{\left( {10^{7} } \right)^{3} \,{\text{nm}}^{3} }}} \right) \times 100 \approx 0.005\%$$

(9)
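The volume fractions quoted above can be reproduced with a short Python sketch. The function names are hypothetical, and the cell volume of 1 µm³ used for the per-cell fraction is an assumption introduced here for illustration; it is not a value stated in the text.

```python
import math


def mcp_volume_per_cell_nm3(diameter_nm=100.0, mcps_per_cell=100):
    """Total volume of spherical MCPs in one cell, in cubic nanometers."""
    radius_nm = diameter_nm / 2
    return 4 / 3 * math.pi * radius_nm ** 3 * mcps_per_cell


def culture_volume_percent(cells_per_ml=1e9):
    """Percent of culture volume occupied by MCPs (calculation 9)."""
    nm3_per_ml = (1e7) ** 3  # 1 cm = 10^7 nm and 1 mL = 1 cm^3
    return 100 * mcp_volume_per_cell_nm3() * cells_per_ml / nm3_per_ml


# Assumption for illustration only: a cell volume of about 1 cubic micrometer.
ASSUMED_CELL_VOLUME_NM3 = 1e9

cell_percent = 100 * mcp_volume_per_cell_nm3() / ASSUMED_CELL_VOLUME_NM3
culture_percent = culture_volume_percent()

print(f"MCP share of cell volume:    {cell_percent:.1f}%")
print(f"MCP share of culture volume: {culture_percent:.4f}%")
```

Under these assumptions MCPs occupy a few percent of the cell volume and about 0.005% of the culture volume at 1 OD. Since `cells_per_ml` enters linearly, the culture-level fraction scales directly with culture density.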

From this calculation, it is clear that the fractional volume of MCPs in a culture is most significantly determined by the culture density. Does the estimated MCP volume afford enough space for enzymes inside the MCP to produce industrially relevant amounts of a compound of interest? Commodity chemicals are typically produced at concentrations of 50–150 g/L after 48 h of fermentation, and desired titers are dictated by process economics [39, 40]. For this estimate, we set a target of 50 g/L in 48 h of the commodity product 1,2-propanediol (1,2-PD). We calculate the quantity needed of the enzyme with the lowest k_cat from the 1,2-propanediol production pathway, GldA (Table 2), under substrate saturating conditions to see if this target is physically possible. The specific activity of GldA has been experimentally determined to be 5 μmol 1,2-PD/min/mg GldA at saturation [41]. Assuming saturation of GldA, we calculate the minimum concentration of GldA required to achieve this product titer is 50 mg/L (calculation 10) (Fig. 3b).

Table 2 Turnover number (k_cat) for enzymes in the 1,2-propanediol pathway [41]

Concentration of rate-determining enzyme to achieve desired product titer

$$\left( 50\,\frac{\text{g 1,2-PD}}{\text{L culture}} \right)\left( \frac{1\,\upmu\text{mol 1,2-PD}}{76\,\upmu\text{g 1,2-PD}} \right)\left( \frac{\min \cdot \text{mg GldA}}{5\,\upmu\text{mol 1,2-PD}} \right)\left( \frac{10^{6}\,\upmu\text{g}}{1\,\text{g}} \right)\left( \frac{1\,\text{h}}{60\,\min} \right)\left( \frac{1}{48\,\text{h}} \right) \approx 50\,\frac{\text{mg GldA}}{\text{L culture}}$$

(10)
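Calculation 10 inverts the usual question: instead of predicting a titer from an enzyme level, it asks how much enzyme a target titer requires. A minimal Python sketch, with a hypothetical function name and the inputs from calculation 10, assuming the enzyme stays saturated for the entire batch:

```python
def enzyme_needed_mg_per_l(target_g_per_l, hours, product_mw,
                           specific_activity_umol_per_min_mg):
    """Enzyme concentration (mg/L) needed to reach a target titer when the
    rate-determining enzyme is substrate-saturated for the whole batch."""
    umol_product_per_l = target_g_per_l * 1e6 / product_mw  # ug / (ug/umol)
    minutes = hours * 60
    return umol_product_per_l / (specific_activity_umol_per_min_mg * minutes)


# 50 g/L of 1,2-PD (76 g/mol) in 48 h with GldA at 5 umol/min/mg.
glda_mg_per_l = enzyme_needed_mg_per_l(50, 48, 76, 5)

print(f"GldA required: {glda_mg_per_l:.0f} mg/L")
```

The result is about 46 mg/L, which rounds to the 50 mg/L used in the text at one significant figure. The required enzyme concentration is inversely proportional to both specific activity and batch time.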

Does this concentration of enzyme physically fit inside the MCPs? The approximate density of the GldA enzyme was calculated from the amino acid sequence using the Northwestern peptide properties calculator [44]. The volume of the GldA protein molecule is 5 × 10⁴ Å³ and the molecular weight is 40 × 10³ g/mol, giving a density of 1.4 g/cm³. We then calculate the volume fraction of GldA in MCPs required for our desired product titer (calculation 11).

Fraction of MCP required for GldA

$$\frac{\left( \frac{50\,\text{mg enzyme}}{\text{L culture}} \right)\left( \frac{\text{cm}^{3}\,\text{enzyme}}{1.4\,\text{g enzyme}} \right)\left( \frac{\text{g}}{10^{3}\,\text{mg}} \right)\left( \frac{\text{L}}{10^{3}\,\text{cm}^{3}} \right)}{\frac{0.00005\,\text{L MCP}}{\text{L culture}}} \approx 0.7\,\frac{\text{L enzyme}}{\text{L MCP}}$$

(11)

To produce the desired titer, 70% of the MCP volume must be occupied by GldA. While this fractional loading is high, it shows that the process is feasible and that modest improvements to the process would improve titer. For example, if the cell density was increased to 10 OD, only 7% of the MCP volume would need to be occupied by GldA. This fractional loading of the rate-determining enzyme suggests that MCPs are large enough to fit multiple enzymes in a pathway. The rest of the MCP volume is available for other enzymes in the pathway, as well as metabolites.
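The loading fraction and its dependence on culture density can be sketched in Python. The function name is hypothetical, and the sketch assumes that the MCP volume per liter of culture scales linearly with optical density, starting from the value of 0.00005 L MCP per L culture at 1 OD used in calculation 11.

```python
def mcp_loading_fraction(enzyme_mg_per_l, od, enzyme_density_g_per_cm3=1.4,
                         mcp_l_per_l_culture_per_od=5e-5):
    """Fraction of total MCP volume that the enzyme must occupy."""
    enzyme_cm3_per_l = enzyme_mg_per_l / 1e3 / enzyme_density_g_per_cm3
    enzyme_l_per_l_culture = enzyme_cm3_per_l / 1e3  # cm^3 -> L
    return enzyme_l_per_l_culture / (mcp_l_per_l_culture_per_od * od)


loading_od1 = mcp_loading_fraction(50, 1)
loading_od10 = mcp_loading_fraction(50, 10)

print(f"loading at 1 OD:  {loading_od1:.0%}")
print(f"loading at 10 OD: {loading_od10:.0%}")
```

At 1 OD the enzyme would need to fill roughly 70% of the MCP volume, and at 10 OD roughly 7%. This shows why culture density is such an effective lever for this system.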

The variables we found to affect this calculation are the:

1. Size of MCPs.
2. Number of MCPs per cell.
3. Saturation of the encapsulated enzymes by cognate substrates.
4. Loading of enzymes within the MCP.
5. Maximum reaction rates of the loaded enzymes.
6. Culture density.

Of these variables, the saturation of encapsulated enzymes, loading of enzymes within the shell, and improved culture density all make attractive targets. For the 1,2-PD example, we assumed saturation of the enzyme and an enzyme loading of 70%. Saturation is a feature that depends on culture conditions and shell permeability, as well as on the kinetic constants of the pathway enzymes, and the impact of engineering enzyme turnover and saturation will vary depending on the system [45]. Further, if we consider enzyme engineering to increase the specific activity, we would expect a concomitant increase in product titer, though this is often not trivial. Thus, to increase product titer tenfold, we predict that increasing culture density to 10 OD would suffice. Changing the size or number of MCPs is a much more challenging target, as the mechanisms controlling these phenotypes are unknown, and would not improve titers by orders of magnitude.

Therefore, in our work we first set out to improve control over MCP expression, the permeability of the MCP shell, and enzyme loading. Controlling expression of MCP genes enables increased culture density [46]. Further, controlling permeability of the protein shell to metabolites changes the concentration of substrates in the MCP [45, 47], enabling operation at substrate-saturating regimes. Finally, the loading of enzymes in the MCP can be modulated via the targeting sequence and expression levels [48], and the low fractional volume of enzymes in MCPs enables modulation of enzyme loading. If further improvements are needed, increasing the specific activity of the enzyme—by engineering or identifying more active homologs—might be the next best target because it could improve yield by an additional one or more orders of magnitude.

Using rough estimates in new bioprocesses

The above examples highlight the value of rough estimates in successful bioprocess engineering projects. At the project outset, estimates help to determine if the process is feasible. Further along in the project, estimates identify properties to improve. In all cases, successful use of estimates will require a keen biological intuition. To help build an intuition, we encourage others working in this area to use a decision scheme (Fig. 4) to guide their process analysis and engineering efforts, and emphasize that moving through this scheme requires balancing the potential payoff with the amount of effort it will require. Moreover, we expect that the majority of successful projects used such a decision scheme, yet few have discussed their rough estimates in the literature alongside the project results. Given its importance, we strongly promote the inclusion of rough estimates in all future published work in the synthetic biology field. It will benefit new researchers learning to analyze new processes, as well as everyone interested in understanding the context of the work, including why certain parameters were not chosen for optimization.

Abbreviations

BNID: BioNumber identification number
DCW: dry cell weight
MCP: bacterial microcompartment

References

Phillips R, Milo R. A feeling for the numbers in biology. Proc Natl Acad Sci. 2009;106:21465–71.
Article PubMed PubMed Central Google Scholar
Milo R, Jorgensen P, Moran U, Weber G, Springer M. BioNumbers—the database of key numbers in molecular and cell biology. Nucleic Acids Res. 2010;38:D750–3.
Article CAS PubMed Google Scholar
Milo R, Phillips R. Cell biology by the numbers. 1st ed. New York: Garland Science; 2015.
Google Scholar
Ro D-K, Paradise EM, Ouellet M, Fisher KJ, Newman KL, Ndungu JM, et al. Production of the antimalarial drug precursor artemisinic acid in engineered yeast. Nature. 2006;440:940–3.
Article CAS PubMed Google Scholar
Paddon CJ, Westfall PJ, Pitera DJ, Benjamin K, Fisher K, McPhee D, et al. High-level semi-synthetic production of the potent antimalarial artemisinin. Nature. 2013;496:528–32.
Article CAS PubMed Google Scholar
Fang X, Li J-X, Huang J-Q, Xiao Y-L, Zhang P, Chen X-Y. Systematic identification of functional residues of Artemisia annua amorpha-4,11-diene synthase. Biochem J. 2017;474:2191–202.
Article CAS PubMed Google Scholar
Han J, Wang H, Kanagarajan S, Hao M, Lundgren A, Brodelius PE. Promoting artemisinin biosynthesis in Artemisia annua plants by substrate channeling. Mol Plant. 2016;9:946–8.
Article CAS PubMed Google Scholar
Paddon CJ, Keasling JD. Semi-synthetic artemisinin: a model for the use of synthetic biology in pharmaceutical development. Nat Rev Microbiol. 2014;12:355–67.
Article CAS PubMed Google Scholar
Basson ME, Thorsness M, Rine J. Saccharomyces cerevisiae contains two functional genes encoding 3-hydroxy-3-methylglutaryl-coenzyme A reductase. Proc Natl Acad Sci U S A. 1986;83:5563–7.
Article CAS PubMed PubMed Central Google Scholar
Kazieva E, Yamamoto Y, Tajima Y, Yokoyama K, Katashkina J, Nishio Y. Characterization of feedback-resistant mevalonate kinases from the methanogenic archaeons Methanosaeta concilii and Methanocella paludicola. Microbiol Read Engl. 2017;163:1283–91.
Article CAS Google Scholar
Garcia DE, Keasling JD. Kinetics of phosphomevalonate kinase from Saccharomyces cerevisiae. PLoS ONE. 2014;9:e87112.
Article CAS PubMed PubMed Central Google Scholar
Krepkiy D, Miziorko HM. Identification of active site residues in mevalonate diphosphate decarboxylase: implications for a family of phosphotransferases. Protein Sci. 2004;13:1875–81.
Article CAS PubMed PubMed Central Google Scholar
Anderson MS, Muehlbacher M, Street IP, Proffitt J, Poulter CD. Isopentenyl diphosphate:dimethylallyl diphosphate isomerase. An improved purification of the enzyme and isolation of the gene from Saccharomyces cerevisiae. J Biol Chem. 1989;264:19169–75.
CAS PubMed Google Scholar
Teoh KH, Polichuk DR, Reed DW, Covello PS. Molecular cloning of an aldehyde dehydrogenase implicated in artemisinin biosynthesis in Artemisia annua. Botany. 2009;87:635–42.
Article CAS Google Scholar
Bar-Even A, Noor E, Savir Y, Liebermeister W, Davidi D, Tawfik DS, et al. The moderately efficient enzyme: evolutionary and physicochemical trends shaping enzyme parameters. Biochemistry. 2011;50:4402–10.
Article CAS PubMed Google Scholar
Chaudhuri SK, Lovley DR. Electricity generation by direct oxidation of glucose in mediatorless microbial fuel cells. Nat Biotechnol. 2003;21:1229–32.
Article CAS PubMed Google Scholar
Finneran KT, Johnsen CV, Lovley DR. Rhodoferax ferrireducens sp. nov., a psychrotolerant, facultatively anaerobic bacterium that oxidizes acetate with the reduction of Fe(III). Int J Syst Evol Microbiol. 2003;53:669–73.
Article CAS PubMed Google Scholar
Jain R, Srivastava R. Metabolic investigation of host/pathogen interaction using MS2-infected Escherichia coli. BMC Syst Biol. 2009;3:121.
Torella JP, Gagliardi CJ, Chen JS, Bediako DK, Colón B, Way JC, et al. Efficient solar-to-fuels production from a hybrid microbial–water-splitting catalyst system. Proc Natl Acad Sci. 2015;112:2337–42.
Liu C, Colón BC, Ziesack M, Silver PA, Nocera DG. Water splitting–biosynthetic system with CO₂ reduction efficiencies exceeding photosynthesis. Science. 2016;352:1210–3.
Schein CH. Production of soluble recombinant proteins in bacteria. Nat Biotechnol. 1989;7:1141–9.
Flamholz A, Phillips R, Milo R. The quantified cell. Mol Biol Cell. 2014;25:3497–500.
Schlumberger MC, Müller AJ, Ehrbar K, Winnen B, Duss I, Stecher B, et al. Real-time imaging of type III secretion: Salmonella SipA injection into host cells. Proc Natl Acad Sci U S A. 2005;102:12548–53.
Singer HM, Erhardt M, Steiner AM, Zhang M-M, Yoshikami D, Bulaj G, et al. Selective purification of recombinant neuroactive peptides using the flagellar type III secretion system. MBio. 2012;3:e00115-12.
Kubori T, Matsushima Y, Nakamura D, Uralil J, Lara-Tejero M, Sukhan A, et al. Supramolecular Structure of the Salmonella typhimurium type III protein secretion system. Science. 1998;280:602–5.
Huang D, Gore PR, Shusta EV. Increasing yeast secretion of heterologous proteins by regulating expression rates and post-secretory loss. Biotechnol Bioeng. 2008;101:1264–75.
Li F, Vijayasankaran N, Shen A, Kiss R, Amanullah A. Cell culture processes for monoclonal antibody production. mAbs. 2010;2:466–77.
Reilly DE, Yansura DG. Production of monoclonal antibodies in E. coli. In: Shire S, Gombotz W, Bechtold-Peters K, Andya J, editors. Current trends in monoclonal antibody development and manufacturing. Biotechnology: pharmaceutical aspects, vol XI. New York, NY: Springer; 2010. p. 295–308.
Moran U, Phillips R, Milo R. SnapShot: key numbers in biology. Cell. 2010;141:1262–1262.e1.
Sturm A, Heinemann M, Arnoldini M, Benecke A, Ackermann M, Benz M, et al. The cost of virulence: retarded growth of Salmonella typhimurium cells expressing type III secretion system 1. PLoS Pathog. 2011;7:e1002143.
Hautefort I, Proença MJ, Hinton JCD. Single-copy green fluorescent protein gene fusions allow accurate measurement of Salmonella gene expression in vitro and during infection of mammalian cells. Appl Environ Microbiol. 2003;69:7480–91.
Widmaier DM, Tullman-Ercek D, Mirsky EA, Hill R, Govindarajan S, Minshull J, et al. Engineering the Salmonella type III secretion system to export spider silk monomers. Mol Syst Biol. 2009;5:309.
Lee SY. High cell-density culture of Escherichia coli. Trends Biotechnol. 1996;14:98–105.
Metcalf KJ, Finnerty C, Azam A, Valdivia E, Tullman-Ercek D. Using transcriptional control to increase titers of secreted heterologous proteins by the type III secretion system. Appl Environ Microbiol. 2014;80:5927–34.
Azam A, Li C, Metcalf KJ, Tullman-Ercek D. Type III secretion as a generalizable strategy for the production of full-length biopolymer-forming proteins. Biotechnol Bioeng. 2016;113:2313–20.
Polka JK, Silver PA. Building synthetic cellular organization. Mol Biol Cell. 2013;24:3585–7.
Parsons JB, Frank S, Bhella D, Liang M, Prentice MB, Mulvihill DP, et al. Synthesis of empty bacterial microcompartments, directed organelle protein incorporation, and evidence of filament-associated organelle movement. Mol Cell. 2010;38:305–15.
Bobik TA, Havemann GD, Busch RJ, Williams DS, Aldrich HC. The propanediol utilization (pdu) operon of Salmonella enterica serovar typhimurium LT2 includes genes necessary for formation of polyhedral organelles involved in coenzyme B12-dependent 1,2-propanediol degradation. J Bacteriol. 1999;181:5967–75.
Lechner A, Brunk E, Keasling JD. The need for integrated approaches in metabolic engineering. Cold Spring Harb Perspect Biol. 2016;8:a023903.
Shanmugam KT, Ingram LO. Engineering biocatalysts for production of commodity chemicals. J Mol Microbiol Biotechnol. 2008;15:8–15.
Subedi KP, Kim I, Kim J, Min B, Park C. Role of GldA in dihydroxyacetone and methylglyoxal metabolism of Escherichia coli K12. FEMS Microbiol Lett. 2008;279:180–7.
Saadat D, Harrison DH. Identification of catalytic bases in the active site of Escherichia coli methylglyoxal synthase: cloning, expression, and functional characterization of conserved aspartic acid residues. Biochemistry. 1998;37:10074–86.
Ko J, Kim I, Yoo S, Min B, Kim K, Park C. Conversion of methylglyoxal to acetol by Escherichia coli aldo-keto reductases. J Bacteriol. 2005;187:5782–9.
Chazan A. Peptide properties calculator. http://biotools.nubic.northwestern.edu/proteincalc.html. Accessed 8 Aug 2018.
Jakobson CM, Tullman-Ercek D, Slininger MF, Mangan NM. A systems-level model reveals that 1,2-Propanediol utilization microcompartments enhance pathway flux through intermediate sequestration. PLoS Comput Biol. 2017;13:e1005525.
Kim EY, Jakobson CM, Tullman-Ercek D. Engineering transcriptional regulation to control Pdu microcompartment formation. PLoS ONE. 2014;9:e113814.
Slininger Lee MF, Jakobson CM, Tullman-Ercek D. Evidence for improved encapsulated pathway behavior in a bacterial microcompartment through shell protein engineering. ACS Synth Biol. 2017;6:1880–91.
Jakobson CM, Kim EY, Slininger MF, Chien A, Tullman-Ercek D. Localization of proteins to the 1,2-Propanediol utilization microcompartment by non-native signal sequences is mediated by a common hydrophobic motif. J Biol Chem. 2015;290:24519–33.
Jakobson CM, Chen Y, Slininger MF, Valdivia E, Kim EY, Tullman-Ercek D. Tuning the catalytic activity of subcellular nanoreactors. J Mol Biol. 2016;428:2989–96.

Authors’ contributions

All authors contributed to writing this review. All authors read and approved the final manuscript.

Acknowledgements

The authors would like to thank Shobhit Patoria and Michael Vincent for critical reading of the manuscript, as well as past and present members of the Tullman-Ercek group for stimulating discussions on the use of rough estimates in synthetic biology.

Competing interests

The authors declare that they have no competing interests.

Availability of data and materials

All data generated or analyzed during this study are included in this published article.

Consent for publication

Not applicable.

Ethics approval and consent to participate

Not applicable.

Funding

This work was supported by the National Science Foundation (award MCB1150567 to D.T.E. and a graduate research fellowship to K.J.M.), the Army Research Office (grant W911NF-15-1-0144 to D.T.E.), and fellowships from the University of California (to K.J.M. and C.M.J.).

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Author information

Kevin James Metcalf
Present address: Department of Biomedical Engineering, Northwestern University, Evanston, IL, 60208, USA
Marilyn F. Slininger Lee
Present address: U.S. Army Edgewood Chemical Biological Center, Gunpowder, MD, 21010, USA
Christopher Matthew Jakobson
Present address: Department of Chemical and Systems Biology, Stanford University School of Medicine, Stanford, CA, 94305, USA

Authors and Affiliations

Department of Chemical and Biomolecular Engineering, University of California, Berkeley, CA, 94720, USA
Kevin James Metcalf, Marilyn F. Slininger Lee & Christopher Matthew Jakobson
Department of Chemical and Biological Engineering, Northwestern University, Evanston, IL, 60208, USA
Danielle Tullman-Ercek

Authors

Kevin James Metcalf
Marilyn F. Slininger Lee
Christopher Matthew Jakobson
Danielle Tullman-Ercek

Corresponding author

Correspondence to Kevin James Metcalf.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

About this article

Cite this article

Metcalf, K.J., Slininger Lee, M.F., Jakobson, C.M. et al. An estimate is worth about a thousand experiments: using order-of-magnitude estimates to identify cellular engineering targets. Microb Cell Fact 17, 135 (2018). https://doi.org/10.1186/s12934-018-0979-7

Received: 29 May 2018
Accepted: 21 August 2018
Published: 30 August 2018
DOI: https://doi.org/10.1186/s12934-018-0979-7
