Genome-scale metabolic network reconstruction and in silico flux analysis of the thermophilic bacterium Thermus thermophilus HB27
Microbial Cell Factories volume 13, Article number: 61 (2014)
Thermus thermophilus, an extremely thermophilic bacterium, has been widely recognized as a model organism for studying how microbes can survive and adapt under high temperature environment. However, the thermotolerant mechanisms and cellular metabolism still remains mostly unravelled. Thus, it is highly required to consider systems biological approaches where T. thermophilus metabolic network model can be employed together with high throughput experimental data for elucidating its physiological characteristics under such harsh conditions.
We reconstructed a genome-scale metabolic model of T. thermophilus, i TT548, the first ever large-scale network of a thermophilic bacterium, accounting for 548 unique genes, 796 reactions and 635 unique metabolites. Our initial comparative analysis of the model with Escherichia coli has revealed several distinctive metabolic reactions, mainly in amino acid metabolism and carotenoid biosynthesis, producing relevant compounds to retain the cellular membrane for withstanding high temperature. Constraints-based flux analysis was, then, applied to simulate the metabolic state in glucose minimal and amino acid rich media. Remarkably, resulting growth predictions were highly consistent with the experimental observations. The subsequent comparative flux analysis under different environmental conditions highlighted that the cells consumed branched chain amino acids preferably and utilized them directly in the relevant anabolic pathways for the fatty acid synthesis. Finally, gene essentiality study was also conducted via single gene deletion analysis, to identify the conditional essential genes in glucose minimal and complex media.
The reconstructed genome-scale metabolic model elucidates the phenotypes of T. thermophilus, thus allowing us to gain valuable insights into its cellular metabolism through in silico simulations. The information obtained from such analysis would not only shed light on the understanding of physiology of thermophiles but also helps us to devise metabolic engineering strategies to develop T. thermophilus as a thermostable microbial cell factory.
Thermus thermophilus is a gram-negative, obligate aerobic bacterium, representing one of the best-studied thermophiles. It usually colonizes the terrestrial volcanic hot springs (grows optimally between 65 and 72°C) and was originally isolated from a Japanese thermal spa . In addition to the ability of surviving at such high temperatures, T. thermophilus is resistant to other stress such as harsh chemical conditions . These properties motivated researchers to extract or isolate numerous proteins from T. thermophilus, making it as a model organism in structural genomics with significant industrial potential [3–6]. For example, several thermostable proteins are already used in commercial processes, including the DNA polymerase in PCR techniques, α-amylases and glucose isomerases in starch processing, esterases, lipases and proteases in organic synthesis, and xylanases in paper and pulp manufacturing [7, 8]. Moreover, T. thermophilus is being recognized as a potential microbial cell factory for the low cost ethanol fermentation from lignocellulosic waste materials since it can grow by utilizing most of the C5/C6 carbon sources at relatively high temperatures, i.e. 70–80°C, thus reducing the energy costs: no cooling step is required following enzymatic hydrolysis, rendering it easier to distil subsequent fermentations .
Despite enormous potentials for biotechnological applications, the current knowledge on the unique cellular physiology of T. thermophilus is very limited; to date, the production of distinctive carotenoid molecules  and the use of adaptive protein synthesis strategies  are only two notable traits unravelled at the molecular level. Such limited studies are mostly due to the technical difficulties in cultivating and analysing thermophilic microbes; cell culture experiments require high amount of energy to maintain the optimal growth conditions. Hence, it is indeed required to develop more systematic approaches for improving our understanding of T. thermophilus cellular behaviour. In this regard, constraints-based in silico metabolic modeling and analysis can be considered as one of the promising techniques to characterize the physiological behaviour and metabolic states of an organism upon various environmental/genetic changes as they systematically capture the genotype-phenotype relationships from the entire genome information [12, 13]. As a result, several genome-scale metabolic models (GSMMs) are now available for describing the metabolic organization of various organisms including Escherichia coli, Bacillus subtilis, Saccharomyces cerevisiae, Pichia pastoris, Corynebacterium glutamicum, Ralstonia eutropha, Pseudomonas aeruginosa, and even for multicellular eukaryotes such as Mus musculus and Homo sapiens. Moreover, with the availability of several conveniently accessible constraints-based modeling software tools , these models have been largely utilized to postulate various strain improvement strategies [17–19, 24, 25]. Thus, the development of T. thermophilus GSMM based on the currently available biochemical and genomic information and its subsequent in silico analysis enables us to elucidate its unique metabolic behaviour.
In thermophilic microbes regard, there have been only a few initial attempts to model their cellular metabolisms. First, an in silico model of Thermotoga maritima was presented, covering its central metabolism along with the 3D structures of all the enzymes accounted in the network . Recently, the genome-scale metabolic model of thermophilic archeon, Sulfolobus solfataricus, was also developed, and used to describe its autotrophic growth in bicarbonate via hydroxypropionate-hydroxybutyrate cycle under aerobic conditions . However, both models are not mature enough to explain the molecular mechanisms of high temperature adaptations as they do not consider the detailed biosynthetic machinery of biomolecules which help them to retain the integrity of their cell wall membranes. Therefore, in this work, we reconstructed the genome-scale metabolic model of T. thermophilus based on the genome annotation of HB27 wild-type strain  for investigating unique characteristics of thermophilic microbes. Additionally, the model was functionally characterized by gene essentiality studies to identify essential genes for cellular growth while growing in both glucose minimal and amino acid supplemented complex media.
Reconstruction of T. thermophilus genome-scale metabolic network
The genome-scale metabolic network of T. thermophilus HB27 was reconstructed through a three step procedure: (1) construction of draft network via compilation of genes, reactions and pathway information from biochemical databases based on the genome annotation of T. thermophilus HB27, (2) manual curation of the draft model by verifying the elemental balances in reactions and assigning proper gene-reaction relationships, and (3) gap filling using organism specific knowledge (see Methods). During the reconstruction process, significant efforts were highly required to identify and resolve the network gaps across various metabolic pathways. Such gaps exist due to the incomplete genome annotations which result in missing biochemical reactions and dead ends. These gaps can be appropriately filled by the addition of new reactions based on information obtained from the literature or inferred by the genome annotation of other organisms. For example, the initial model contained several metabolic gaps in the synthetic pathway of thermozeaxanthin and thermobiszeaxanthin, the unique type of carotenoids that are found only in thermophiles, enabling their cellular membrane to retain its fluidity even at very high temperatures [29–31]. Therefore, in order to fill such gaps, we added the reactions corresponding to glycosyltransferase and acyltransferase enzymes from Staphylococcus aureus subsp. aureus 11819–97 and Halobacillus halophilus DSM 2266 for the sequential glycosylation and esterification of zeaxanthin with glucose and branched-chain fatty acids, producing thermozeaxanthin. Similarly, the draft model also had several gaps in the carbon assimilatory pathways, and thus was unable to consume six carbon sources, namely, trehalose, palatinose, isomaltose, cellobiose, glutamate and mannose. However, earlier studies have reported that T. thermophilus can grow on all these carbon sources [32, 33]. Such discrepancies were again resolved by adding new reactions corresponding to the α-glucosidase and mannokinase enzymes based on the information available from KEGG and MetaCyc databases. Overall, we included 74 new reactions for 63 enzymes, thereby improving its network connectivity (see Additional file 1 for complete list of reactions added). The gap filling of draft model was followed by the identification of genetic evidence for the newly added enzymes via sequence-based homology searches. For this purpose, a BLASTp search was performed in NCBI database for the enzymes that could resolve the network gaps using their amino acid sequences collected from various other organisms against the non-redundant protein sequences of T. thermophilus HB27 genome. In such a way, of the 63 enzymes added, we could assign a putative locus for 10 of them, thus providing possible new annotations (see Additional file 1 for the list of new annotations). The final genome-scale metabolic network of T. thermophilus, i TT548, contains 548 unique genes (ORF coverage – 24%), 796 reactions and 635 unique metabolites. In i TT548, all the 796 reactions were classified into seven major metabolic subsystems: carbohydrates, amino acid, energy and cofactors, lipids, nucleotides, carotenoids and transport. Among them, amino acid metabolism has largest number of reactions and genes, followed by carbohydrates and energy and cofactors metabolism (Figure 1). The detailed list of completely curated T. thermophilus HB27 metabolic network containing the various genes, reactions, and metabolites can be obtained from Additional file 1, and also available as Systems Biology Markup Language (SBML) file (level 2, version 1, http://sbml.org/) (Additional file 2).
Network characteristics of i TT548 and its comparison with E. coli GSMM
Figure 2A presents the overall network features of i TT548 and its comparison with E. coli using the metabolic model (i AF1260)  in terms of the EC numbers. Here, it should be noted that we were unable to fairly compare the metabolic characteristics of T. thermophilus with other thermophiles since the T. maritima model is only limited to the central metabolism while S. solfataricus GSMM was not accessible and no link was provided. From the comparison, there are 363 enzymes/genes are common between E. coli and T. thermophilus, mostly belonging to the central metabolic pathways such as glycolysis, pentose phosphate pathway and the TCA cycle. However, certain notable differences were observed in the amino acid synthetic pathways of T. thermophilus. For instance, lysine is synthesized via the alpha-aminoadipate pathway instead of diaminopimelate pathway. Similarly, the upstream of methionine synthetic pathway is not conserved with E. coli: the precursor molecule, homocysteine, is produced from O-acetyl-L-homoserine and hydrogen sulphide via O-acetyl-L-(homo) serine sulfhydrylase in T. thermophilus and from O-succinyl-L-homoserine and cysteine through O-succinyl-homoserine lyase in E. coli. The comparison between transport reactions further revealed that T. thermophilus lacks the phosphoenolpyruvate-dependent phosphotransferase system (PTS), a typical bacterial transport system, and thus consumes most of the carbohydrates including glucose via ATP-binding cassette (ABC) transporters.
Interestingly, as a unique feature of T. thermophilus, i TT548 contains the necessary biosynthetic machinery for synthesizing several molecules which help them in habituating high temperatures. Unlike many other gram-negative bacteria, T. thermophilus does not contain lipopolysaccharides in cell outer membrane . Instead, it embeds complex carotenoid glucoside esters with various branched chain fatty acids, known as thermozeaxanthins and thermobiszeaxanthins, in the lipid bilayers. Such an arrangement offers multiple advantages including the retention of membrane fluidity at high temperatures and reduction of oxygen diffusion through the membrane for preventing oxidation damage [10, 35]. Furthermore, T. thermophilus synthesizes several unique polyamines such as thermine, spermine, thermospermine and caldopentamine using a distinct pathway from L-arginine via aminopropyl agmatine . These polyamines are essential for high temperature protein synthesis by ensuring the proper structure formation of the initiation complex among 30S ribosomal subunit, the messenger, and the initial aminoacyl-tRNA. As a notable exception to gram-negative bacterium, T. thermophilus also synthesizes branched chain fatty acids from amino acids such as valine, leucine and isoleucine via ketoisovalerate oxidoreductase (vorA) as in gram-positive bacteria such as Bacillus. Table 1 list the unique carotenoid and polyamine molecules, and relevant genes of the corresponding synthetic pathways accounted in i TT548; Figure 2 illustrates the central metabolic network of T. thermophilus where the branched chain fatty acids and carotenoid synthetic pathways are highlighted.
In i TT548, we have also included a biomass equation based on our amino acid compositional analysis and the data obtained from literature. Importantly, such biomass composition must be carefully formulated to avoid any erroneous conclusions from the flux balance analysis . It should be noted that the earlier thermophile models, T. maritima and S. solfataricus, adopted the biomass equation from E. coli and M. bakeri, respectively, both grow at 37°C which is well below the optimal growing temperature range of thermophiles (50 ~ 80°C). In this regard, the compositions of some amino acids, valine, lysine, threonine, lysine, glutamine and isoleucine, are very distinctive between T. thermophilus and E. coli (Figure 2C). Similarly, the fatty acid compositions of T. thermophilus were also different from E. coli; the linear chain fatty acids composition are almost negligible in T. thermophilus while the branched-chain fatty acids such as iso-C17:0 and iso-C15:0, which are not present in E. coli, contribute the bulk of total lipid compositions (Figure 2D). Collectively, these results highlight the need for the careful estimation of biomass equation while modeling thermophiles.
Model validation using minimal and complex media during batch cultures
We validated i TT548 using data from the batch cultures of T. thermophilus growing in glucose minimal and complex media. In case of glucose minimal medium, cells were cultured in a DMM containing 0.6% (w/v) glucose at 70°C. The residual concentration of glucose and the cell density were monitored (Figure 3A). Initially, a prolonged lag phase of 2 d was observed, followed by an exponential growth phase of 8 h before the stationary phase was reached. The cell cultures also indicated the presence of acetate, lactate and ethanol in trace amounts during lag and exponential growth phases. In order to analyse the growth behaviour of T. thermophilus in a rich medium, the cells were grown in a complex TM medium at 70°C. It should be noted that this medium was supplemented with all 20 amino acids. The nutrient consumption profiles indicated that glucose was consumed first. Subsequently, other carbohydrates such as trehalose and amino acids were assimilated (Figure 3B). Notably, cells did not consume all the amino acid supplemented in the medium and preferred branched chain amino acids ahead of other amino acids, possibly to synthesize the branched chain fatty acids (Figures 3C and D). Furthermore, the complex medium did not show any appreciable lag phase and the cells grew almost twice as fast as the minimal media.
To simulate the cellular growth in the minimal medium, the biomass equation was maximized in the flux analysis simulations while simultaneously constraining the glucose uptake rates measured during the exponential phase based on the assumption that wild-type organisms typically evolve towards the maximization of cellular growth during exponential phase . The exchange fluxes of NH3, phosphate, sulphite, H2O, Fe2+, Mg and H+ were left unconstrained to provide basic nutrients and minerals for cell growth. The oxygen uptake rate was constrained at the average specific uptake rate of 10 mmol g-1 DCW hr-1 based on previous publication . Additionally, the lactate exchange flux was also constrained at the measured uptake/secretion rates in each phase. A growth associated maintenance (GAM) value of 58.34 mmol g-1 DCW h-1, from E. coli GSMM, and a NGAM requirement of 14 mmol g-1 DCW h-1, calculated based on established methods , were also used for the simulations (see Additional file 3 for detailed calculation of NGAM calculations). Here, it should be noted that while comparing the NGAM requirement of T. thermophilus with E. coli, it possess a very high value (14 compared to that of 8 mmol g-1 DCW h-1) possibly due to the differences in H+ permeability of their cytoplasmic membrane at optimal growth temperature; T. thermophilus cytoplasmic membrane is highly permeable to H+, and thus, leaks protons without ATP synthesis via ATPase . Notably, simulation results were highly consistent with observed growth rates (Figure 4). For complex media, once again the biomass equation was maximized while simultaneously constraining the uptake/secretion rates of all nutrients (glucose, trehalose, lactate, acetate and amino acids) during the exponential growth phase. The in silico predicted cell growth of 0.66 h-1 was very close (within the acceptable error range of 10%) to the experimentally observed specific growth rate of 0.64 h-1, thus clearly indicating the high predictive ability of i TT548 even in complex media.
In silico comparative metabolic flux analysis of minimal and complex media
When comparing the cellular growth rates of T. thermophilus in minimal and complex media, not surprisingly, it was significantly higher in the latter owing to the availability of rich carbon and nitrogen sources in the form of various amino acids. Since the microbe can take up some of the amino acids directly from the medium, it could be possible that the protein biosynthetic demand can be partially fulfilled. In order to confirm such hypothesis, we compared the simulated metabolic fluxes between minimal (phase P1) and complex media, and observed much lower fluxes through the relevant biosynthetic reactions of amino acids in the complex medium (Table 2). Herein, it should be noted that we also conducted the flux variability analysis  to re-assure the confidence of the simulated metabolic states in each environmental condition.
Interestingly, although all 20 amino acids were supplemented in the complex medium, the cells consumed only a few of them preferably. In order to understand this cellular behavior, we compared the uptake rate of individual amino acids with their actual biosynthetic demands. Methionine and cysteine were consumed from the medium according to their biosynthetic demands while alanine, valine, leucine, isoleucine and glutamate were consumed in excess of their individual biosynthetic requirements (Table 3). Further analysis revealed that the surplus valine/leucine/isoleucine and alanine/glutamate were utilized to synthesize the branched chain fatty acids and other amino acids, respectively. Notably, leucine was consumed much more than its biosynthetic demand (almost 7 times higher) mainly to synthesize iso-15 and iso-17 fatty acids, which constitute the major composition in T. thermophilus. Remarkably, this amino acid utilization pattern is completely different from E. coli grown in complex media, which consumed serine, aspartate, glycine and threonine ahead of other amino acids . The subsequent in silico analysis in E. coli revealed that the rapidly depleted serine was primarily converted into pyruvate via serine deaminase, and then to acetate for producing more ATP as similarly observed in several other microbes such as C. glutamicum and Lactococcus lactis. In this regard, it is interesting to note that T. thermophilus possess unique nutrient consumption pattern, preferably synthesizing branched chain fatty acids rather than improving energy production further. Overall, as the in silico analysis highlights that all consumed amino acids contribute either directly or indirectly towards biomass synthesis, strategies to increase the uptake of non-consumed amino acids such as tyrosine, lysine, tryptophan and histidine can be postulated to enhance the cellular growth in complex medium.
Analysis of essential genes T. thermophilus
We analysed the essentiality of individual genes of the T. thermophilus under glucose minimal and complex TT medium conditions using i TT548 model by deleting every gene one at a time (see Methods). The genes were then categorized into three classes: (i) completely essential – genes which are required for cellular growth in both the media, (ii) conditionally essential – required only in one of the media and (iii) non-essential – dispensable in both the media. The gene essentiality study revealed that 23.5% and 19.5% of the total 548 genes in i TT548 are essential for cell growth in minimal and complex media, respectively (see Additional file 4 for complete list of essential genes). A total of 107 genes were essential in both conditions and an additional 21 genes were essential only in minimal media. In order to further understand the knockout of which functional category of genes are more crucial for cell viability in either condition, we identified the distribution of essential genes across various metabolic processes (Figure 5). Interestingly, the carotenoids metabolism contained most of the lethal genes (82%), suggesting that this unique pathway synthesizing the thermozeaxanthin and thermobiszeaxanthin does not have many alternative routes and is quite rigid in T. thermophilus. Following carotenoids metabolism, the nucleotides and lipids metabolism has second and third highest number of completely essential genes (32.7% and 24.6%, respectively). On the other hand, examination of conditionally essential genes, i.e. genes which are essential only in minimal media, revealed that amino acid metabolism contains almost all such genes (Figure 5). Since the complex media is supplemented with all the amino acids, some of them were directly consumed from the media without utilizing their biosynthetic pathways, thus classifying the genes from those pathways as non-essential under such conditions. However, at the same time, distribution of certain completely essential genes in amino acid metabolism also indicates that biosynthetic pathways of tryptophan, proline and tyrosine are crucial for the cell growth since they should be synthesized within the cell albeit their availability in the complex medium. Interestingly, our gene deletion analyses also showed a high number of completely essential genes within the oxidative phosphorylation and TCA cycle. Oxygen is the key electron acceptor in cellular metabolism which can accept electrons from other redox cofactors involved in the TCA cycle and generates energy through oxidative phosphorylation. If oxygen is devoid or any of the oxidative phosphorylation and TCA cycle reactions are perturbed, generally, most of the bacteria regenerate the redox cofactors by switching to fermentative growth with the help of substrate level phosphorylation. In this regard, since T. thermophilus lacks the PTS and utilizes ABC transporters even for glucose uptake, it could be possible that the microbe cannot switch to fermentative metabolism completely and thus, requires oxygen to generate sufficient energy for cell growth via TCA cycle and oxidative phosphorylation.
Thermophilic microbes represent a unique class of organisms with a distinct cell wall assembly that enables them to maintain the cellular membrane integrity even at very high temperatures. It is reported that many thermophilic organisms, including T. maritima and S. solfataricus, synthesize ether lipids from long chain dicarboxylic fatty acids and fatty alcohols . However, Thermus sp. do not synthesize ether lipids but produce unique carotenoid molecules such as thermozeaxanthin and thermobizeaxanthin, and embed them in the lipid bi-layer to attain the required cellular membrane fluidity at high temperatures . In this regard, i TT548 completely captures all the biosynthetic pathways of thermozeaxanthins, in addition to the metabolic routes of other biomass precursors such as amino acids, nucleotides and lipids. Similarly, i TT548 also contains the unique biosynthetic pathways of several unusual polyamines which help in stabilizing the nucleotide strands and proteins synthesis at high temperatures. It has been earlier reported that T. thermophilus is unique in polyamine synthesis: even the extreme thermophiles such as S. solfataricus produces relatively shorter polyamines . Collectively, these results clearly show the detailed metabolic coverage of i TT548 of thermophiles when comparing with its preceding GSMMs.
Furthermore, this work includes a prudently drafted biomass equation that is specific to thermophiles, especially Thermus sp. As mentioned earlier, the comparative analysis of T. thermophilus and E. coli biomass compositions have highlighted significant differences between amino acid and fatty acid compositions. Noticeably, the T. thermophilus biomass analysis revealed that the concentration of some of the thermolabile amino acids such as threonine and histidine are substantially lesser than E. coli whereas the proline concentration is much higher. It should be highlighted that these observations are in good agreement with earlier reports which suggested the selective usage of amino acid residues as one of the key adaptive strategy employed by thermophiles [44, 45]. Arguably, the cellular compositions in thermophilic microbes may change depending on growth temperature; the current biomass equation was derived based on compositional analysis of T. thermophilus grown at 70°C. In order to clarify the temperature dependent compositional change in biomass, we measured amino acid compositions in T. thermophilus at 45°C. Their comparison with compositional data at 70°C clearly indicated that there is no significant difference in both individual and overall amino acid concentrations (Figure 6A). Similarly, we also compared the fatty acid compositions between 40°C and 70°C using the data from literature . Very interestingly, unlike amino acid comparison, fatty acid compositions, both overall and individual were much lower at 40°C (Figure 6B). Although we were not able to make a complete comparison between low and high growth temperatures since no data was available on other cellular constituents such as peptidoglycans and thermotolerant carotenoids at low temperature range, we can still hypothesize that thermophiles are most likely to adjust their biomass composition selectively to better adapt to the growth environment. Therefore, the use of appropriate biomass equations for simulating the cellular growth in corresponding temperature ranges is crucial for reliable prediction.
The gene deletion analyses of i TT548 have revealed several interesting traits about the function of deleted genes with respect to overall cellular metabolism of T. thermophilus. Among them, the most notable is the relatively high percentage of essential genes when compared to E. coli (23% to that of 13%), possibly due to the smaller Open Reading Frames (ORF) content (only 2,263 as compared to 4,623) despite possessing all the necessary modules for the cell to be viable at high temperatures. Furthermore, this observation also highlights the fact that since T. thermophilus thrives at higher temperatures than most other microbes, its fitness might be relatively less competitive with more rigid network organization. Interestingly, the gene essentiality analyses also indicated that the carotenoids metabolism is functionally quite fragile since almost all of its genes are essential for cellular growth. However, it should be noted that the gene deletion analysis results are sensitive to several parameters such as in silico medium setup and biomass composition. In this regard, the current biomass composition is obtain from T. thermophilus at optimum growth temperatures, i.e. 70°C, and thus the gene deletion results of the current study are only applicable to this condition.
We presented the genome-scale metabolic network of T. thermophilus, i TT548, the first ever representing thermophiles, containing 548 unique genes, 796 reactions and 635 unique metabolites. As a unique feature of T. thermophilus, i TT548 contains necessary metabolic pathways for synthesizing several unique carotenoids and polyamines which help them in habituating high temperatures. The reconstructed metabolic model was subsequently validated with the batch culture experiments on glucose minimal and complex medium where the in silico growth predictions of the i TT548 were in good agreement with the observed experimental results. The comparative flux analysis between minimal and complex media highlighted that the consumption and utilization of branched chain amino acids directly in the relevant fatty acids anabolic pathways, thus resulting in higher growth rates in the rich medium. A gene essentiality study was also conducted through in silico simulation studies in both minimal and complex media, highlighting a very high percentage of lethal genes in comparison with E. coli, suggesting that the metabolic backbone of T. thermophilus could to be quite rigid. Overall, the metabolic network presented in the current study is expected to be a significant contribution towards systems analysis of thermophiles where the metabolic model can be utilized along with high throughput datasets for the better understanding of organism.
Microorganism and culture conditions
T. thermophilus HB27 strain was used as a model organism. For fermentation in complex medium, a single colony was cultivated overnight at 70°C with 150 rpm in 5 mL of the TM medium , and the culture was transferred to a 500 mL baffled-flask containing 100 mL of TM broth. In case of cultivation in defined glucose minimal medium, the overnight seed grown at 70°C with 150 rpm in 5 mL of the TM medium was then transferred to a 500 mL baffled-flask containing 100 mL of defined minimal medium (DMM) with 0.6% (w/v) glucose and cultivated for 24 hours. Then, 5 mL of flask culture in DMM was inoculated to 100 mL of fresh DMM. During fermentations, cell growth was monitored by measuring the optical density at 600 nm. The dry cell weight (DCW) was then estimated by a predetermined conversion factor of 0.34.
Concentrations of glucose, organic acids and ethanol in the culture broth were measured by high performance liquid chromatography (HPLC) (Waters, Milford, MA) equipped with an HPX-87H column (Bio-Rad, Hercules, CA), a dual λ absorbance detector. The collected samples were centrifuged at 14,000 g and 4°C for 5 min and the supernatant was analyzed with the column using 5 mM sulfuric acid as a mobile phase at 0.6 mL-1 min. Concentrations of amino acids were determined by gas chromatograph/mass spectrophotometer (GC/MS) (Agilent, Santa Ciara, CA) equipped with an HP-5MS column (Agilent), as previously reported . In brief, the samples were centrifuged, dried and derivatized with methyl-N-t-butyldimethylsilyl-trifluoro-acetamide (MBDSTFA) in DMF at 80°C for 30 min. After centrifugation at 14,000 g for 5 min, the supernatant was injected to GC/MS in split injection mode (1:10 split ratio).
Metabolic network reconstruction
The genome-scale metabolic network of T. thermophilus HB27 was reconstructed using the published genome annotation  and the information collected from various biological and genomic databases on the basis of the established procedure . First, an initial draft model was constructed by compiling the annotated metabolic genes and their corresponding biochemical reactions from KEGG  and MetaCyc . Then, these reactions were corrected for any elemental imbalances and mapped with appropriate genes to devise proper gene-protein-reaction (GPR) relationships. Additionally, some spontaneous as well as non-gene-associated reactions including metabolite transport were also incorporated into the model based on the physiological evidence from literature and databases. The connectivity of the draft network was then checked using the GapFind algorithm to find the gaps . The identified missing links were filled either by introduction of sink reactions to allow for material exchange between the cell and its surrounding environment or by adding reactions from other similar microbes to close the knowledge gaps.
Cellular biomass composition is an important prerequisite for the in silico flux analysis, especially during the exponential growth phase, where the primary cellular objective is to maximize growth. Amino acid composition of T. thermophilus HB27 was estimated by hydrolyzing the cell pellets with 6 N HCl for 24 h at 130°C, and subsequently analysing the hydrolysates using HPLC equipped with UV-detector and C18 column. Cell wall and lipid compositions were obtained from previous publications on Thermus sp. [30, 31, 35, 52]. The overall DNA and RNA composition was assumed to be same as E. coli since no data was available on Thermus sp. The individual weights of nucleotides in the DNA and RNA were calculated based on the reported G + C content of 69.4% . Detailed information on biomass composition calculations could be found in Additional file 3.
Constraints-based flux analysis
We implemented constraints-based flux analysis to simulate the T. thermophilus metabolism under varying environmental conditions. The biomass reaction was maximized to simulate the exponential growth phase as described elsewhere [53–55]. Mathematically, the optimization problem, i.e. maximization of biomass subjected to stoichiometric and capacity constraints, can be formulated as follows:
where S ij refers to the stoichiometric coefficient of metabolite i involved in reaction j, v j denotes to the flux or specific rate of metabolic reaction j, and represent the lower and upper limits on the flux of reaction j, respectively; and Z corresponds to the cellular objective as a linear function of all the metabolic reactions where the relative weights are determined by the coefficient c j . In this study, the constraints-based flux analysis problems were solved using COBRA toolbox .
Flux variability analysis
As constraints-based flux analysis is an optimization based technique, it is often possible to have multiple flux distributions attaining the same physiological state. Therefore, in order to confirm the plausibility of internal metabolic fluxes simulated in minimal and complex media by flux analysis, we performed the flux variability analysis (FVA) to identify the possible range of all fluxes while simulating a particular phenotypic state. Mathematically, the optimization problem specific to FVA can be represented as follows:
where Z obj denotes the value of objective calculated by flux analysis and n is the number of fluxes. The upper range of fluxes is identified by maximizing the objective whereas the lower range is obtained by minimizing the same. In this study, the FVA was implemented using COBRA toolbox.
Gene deletion analysis
Gene deletion simulations were performed by maximizing the cellular biomass while constraining flux through the corresponding reaction(s) to be zero via the GPR relationships under defined nutrient uptake rates. In case of glucose minimal medium, only glucose was fueled as carbon source. On the other hand, glucose, trehalose and amino acids such as valine, leucine and isoleucine were supplied as carbon source based on the complex media based on nutrient consumption profile. The simulation results were subsequently analyzed to identify the essential genes where a gene is classified to be essential if the resulting cell growth prediction for the corresponding mutant is less than or equal to 5% of wild-type. Note that all the gene deletion analysis in this study was performed using COBRA toolbox.
Oshima T, Imahori K: Description of Themus thermophilus (Yoshida and Oshima) comb. nov., a nonsporulation thermophilic bacterium from a Japanese thermal spa. Int J Syst Bacteriol. 1974, 24: 102-112. 10.1099/00207713-24-1-102.
Cava F, Hidalgo A, Berenguer J: Thermus thermophilus as biological model. Extremophiles. 2009, 13: 213-231.
Wimberly BT, Brodersen DE, Clemons WM, Morgan-Warren RJ, Carter AP, Vonrhein C, Hartsch T, Ramakrishnan V: Structure of the 30S ribosomal subunit. Nature. 2000, 407: 327-339.
Sazanov LA, Hinchliffe P: Structure of the hydrophilic domain of respiratory complex I from Thermus thermophilus. Science. 2006, 311: 1430-1436.
Selmer M, Dunham CM, Murphy FV, Weixlbaumer A, Petry S, Kelley AC, Weir JR, Ramakrishnan V: Structure of the 70S ribosome complexed with mRNA and tRNA. Science. 2006, 313: 1935-1942.
Yokoyama K, Ohkuma S, Taguchi H, Yasunaga T, Wakabayashi T, Yoshida M: V-type H + -ATPase/synthase from a thermophilic eubacterium, Thermus thermophilus - Subunit structure and operon. J Biol Chem. 2000, 275: 13955-13961.
Pantazaki AA, Pritsa AA, Kyriakidis DA: Biotechnologically relevant enzymes from Thermus thermophilus. Appl Microbiol Biotechnol. 2002, 58: 1-12.
Niehaus F, Bertoldo C, Kahler M, Antranikian G: Extremophiles as a source of novel enzymes for industrial application. Appl Microbiol Biotechnol. 1999, 51: 711-729.
Riyanti EI: Genetic manipulation of thermophiles for ethanol production. PhD Thesis. 2007, The University of New South Wales: School of biotechnology and biomolecular sciences
Tian B, Hua Y: Carotenoid biosynthesis in extremophilic Deinococcus–Thermus bacteria. Trends Microbiol. 2010, 18: 512-520.
Shigi N, Suzuki T, Tamakoshi M, Oshima T, Watanabe K: Conserved bases in the TPsi C loop of tRNA are determinants for thermophile-specific 2-thiouridylation at position 54. J Biol Chem. 2002, 277: 39128-39135.
Lewis NE, Nagarajan H, Palsson BO: Constraining the metabolic genotype-phenotype relationship using a phylogeny of in silico methods. Nat Rev Microbiol. 2012, 10: 291-305.
Liu L, Agren R, Bordel S, Nielsen J: Use of genome-scale metabolic models for understanding microbial physiology. FEBS Letters. 2010, 584: 2556-2564.
Feist AM, Henry CS, Reed JL, Krummenacker M, Joyce AR, Karp PD, Broadbelt LJ, Hatzimanikatis V, Palsson BO: A genome-scale metabolic reconstruction for Escherichia coli K-12 MG1655 that accounts for 1260 ORFs and thermodynamic information. Mol Syst Biol. 2007, 3: 121-
Oh YK, Palsson BO, Park SM, Schilling CH, Mahadevan R: Genome-scale reconstruction of metabolic network in Bacillus subtilis based on high-throughput phenotyping and gene essentiality data. J Biol Chem. 2007, 282: 28791-28799.
Mo ML, Palsson BO, Herrgard MJ: Connecting extracellular metabolomic measurements to intracellular flux states in yeast. BMC Syst Biol. 2009, 3: 37-
Chung BK, Selvarasu S, Andrea C, Ryu J, Lee H, Ahn J, Lee DY: Genome-scale metabolic reconstruction and in silico analysis of methylotrophic yeast Pichia pastoris for strain improvement. Microb Cell Fact. 2010, 9: 50-
Kjeldsen KR, Nielsen J: In silico genome-scale reconstruction and validation of the Corynebacterium glutamicum metabolic network. Biotechnol Bioeng. 2009, 102: 583-597.
Park JM, Kim TY, Lee SY: Genome-scale reconstruction and in silico analysis of the Ralstonia eutropha H16 for polyhydroxyalkanoate synthesis, lithoautotrophic growth, and 2-methyl citric acid production. BMC Syst Biol. 2011, 5: 101-
Oberhardt MA, Puchalka J, Fryer KE, Martins dosSantos VA, Papin JA: Genome-scale metabolic network analysis of the opportunistic pathogen Pseudomonas aeruginosa PAO1. J Bacteriol. 2008, 190: 2790-2803.
Selvarasu S, Karimi IA, Ghim GH, Lee DY: Genome-scale modeling and in silico analysis of mouse cell metabolic network. Mol bioSyst. 2010, 6: 152-161. 10.1039/b912865d.
Duarte NC, Becker SA, Jamshidi N, Thiele I, Mo ML, Vo TD, Srivas R, Palsson BO: Global reconstruction of the human metabolic network based on genomic and bibliomic data. Proc Natl Acad Sci U S A. 2007, 104: 1777-1782.
Lakshmanan M, Koh G, Chung BK, Lee DY: Software applications for flux balance analysis. Brief Bioinform. 2014, 15: 108-122.
Lee SJ, Lee DY, Kim TY, Kim BH, Lee J, Lee SY: Metabolic engineering of Escherichia coli for enhanced production of succinic acid, based on genome comparison and in silico gene knockout simulation. Appl Environ Microbiol. 2005, 71: 7880-7887.
Matsuda F, Furusawa C, Kondo T, Ishii J, Shimizu H, Kondo A: Engineering strategy of yeast metabolism for higher alcohol production. Microb Cell Fact. 2011, 10: 70-
Zhang Y, Thiele I, Weekes D, Li Z, Jaroszewski L, Ginalski K, Deacon AM, Wooley J, Lesley SA, Wilson IA, Palsson BO, Osterman A, Godzik A: Three-dimensional structure view of the central metabolic network of Thermotoga maritima. Science. 2009, 325: 1544-1549.
Ulas T, Riemer SA, Zaparty M, Siebers B, Schomburg D: Genome-scale reconstruction and analysis of the metabolic network in the hyperthermophilic archaeon Sulfolobus solfataricus. PloS One. 2012, 7: e43401-
Henne A, Bruggemann H, Raasch C, Wiezer A, Hartsch T, Liesegang H, Johann A, Lienard T, Gohl O, Martinez-Arias R, Jacobi C, Starkuviene V, Schlenczeck S, Dencker S, Huber R, Klenk HP, Kramer W, Merkl R, Gottschalk G, Fritz HJ: The genome sequence of the extreme thermophile Thermus thermophilus. Nat Biotechnol. 2004, 22: 547-553.
Kaneda T: Iso- and anteiso-fatty acids in bacteria: biosynthesis, function, and taxonomic significance. Microbiological Rev. 1991, 55: 288-302.
Nordstrom KM, Laakso SV: Effect of growth temperature on fatty acid composition of ten Thermus strains. Appl Environ Microbiol. 1992, 58: 1656-1660.
Pask-Hughes RA, Shaw N: Glycolipids from some extreme thermophilic bacteria belonging to the geus Thermus. J Bacteriol. 1982, 149: 54-58.
Alarico S, da Costa MS, Empadinhas N: Molecular and physiological role of the trehalose-hydrolyzing alpha-glucosidase from Thermus thermophilus HB27. J Bacteriol. 2008, 190: 2298-2305.
Lengsfeld C, Schonert S, Dippel R, Boos W: Glucose- and glucokinase-controlled mal gene expression in Escherichia coli. J Bacteriol. 2009, 191: 701-712.
Holst O: Structure of the Lipopolysaccharide Core Region. Bacterial Lipopolysaccharides. Edited by: Knirel YA, Valvano MA. 2011, 21-39. Wien: Springer-Verlag
Mandelli F, Yamashita F, Pereira JL, Mercadante AZ: Evaluation of biomass production, carotenoid level and antioxidant capacity produced by Thermus filiformis using fractional factorial design. Braz J Microbiol. 2012, 43: 126-134.
Oshima T: Unique polyamines produced by an extreme thermophile, Thermus thermophilus. Amino Acids. 2007, 33: 367-372.
Raghunathan AU, Perez-Correa JR, Bieger LT: Data reconciliation and parameter estimation in flux-balance analysis. Biotechnol Bioeng. 2003, 84: 700-709.
Schuster S, Pfeiffer T, Fell DA: Is maximization of molar yield in metabolic networks favoured by evolution?. J Theor Biol. 2008, 252: 497-504.
Demirtas MU, Kolhatkar A, Kilbane JJ: Effect of aeration and agitation on growth rate of Thermus thermophilus in batch mode. J Biosci Bioeng. 2003, 95: 113-117.
Mckay A, Quilter J, Jones CW: Energy conservation in the extreme thermophile Thermus Thermophilus HB8. Arch Microbiol. 1982, 131: 43-50. 10.1007/BF00451497.
Mahadevan R, Schilling CH: The effects of alternate optimal solutions in constraint-based genome-scale metabolic model. Metab Eng. 2003, 5: 264-276.
Selvarasu S, Ow DS-W, Lee SY, Lee MM, Oh SK-W, Karimi IA, Lee D-Y: Characterizing Escherichia coli DH5α growth and metabolism in a complex medium using genome-scale flux analysis. Biotechnol Bioeng. 2009, 102: 923-934.
Koga Y: Thermal adaptation of the archaeal and bacterial lipid membranes. Archaea. 2012, 2012: 789652-
Kumar S, Tsai CJ, Nussinov R: Factors enhancing protein thermostability. Protein Eng. 2000, 13: 179-191.
Mallik S, Kundu S: A comparison of structural and evolutionary attributes of Escherichia coli and Thermus thermophilus small ribosomal subunits: signatures of thermal adaptation. PloS One. 2013, 8: e69898-
Koyama Y, Hoshino T, Tomizuka N, Furukawa K: Genetic transformation of the extreme thermophile Thermus thermophilus and of other Thermus spp. J Bacteriol. 1986, 166: 338-340.
Yang KM, Lee NR, Woo JM, Choi W, Zimmermann M, Blank LM, Park JB: Ethanol reduces mitochondrial membrane integrity and thereby impacts carbon metabolism of Saccharomyces cerevisiae. FEMS Yeast Res. 2012, 12: 675-684.
Thiele I, Palsson BO: A protocol for generating a high-quality genome-scale metabolic reconstruction. Nat Protocol. 2010, 5: 93-121. 10.1038/nprot.2009.203.
Kanehisa M, Goto S, Hattori M, Aoki-Kinoshita KF, Itoh M, Kawashima S, Katayama T, Araki M, Hirakawa M: From genomics to chemical genomics: new developments in KEGG. Nucleic Acids Res. 2006, 34: D354-D357.
Caspi R, Altman T, Dreher K, Fulcher CA, Subhraveti P, Keseler IM, Kothari A, Krummenacker M, Latendresse M, Mueller LA, Ong Q, Paley S, Pujar A, Shearer AG, Travers M, Weerasinghe D, Zhang P, Karp PD: The MetaCyc database of metabolic pathways and enzymes and the BioCyc collection of pathway/genome databases. Nucleic Acids Res. 2012, 40: D742-D753.
Satish Kumar V, Dasika MS, Maranas CD: Optimization based automated curation of metabolic reconstructions. BMC Bioinform. 2007, 8: 212-10.1186/1471-2105-8-212.
Ray PH, White DC, Brock TD: Effect of growth temperature on the lipid composition of Thermus aquaticus. J Bacteriol. 1971, 108: 227-235.
Lee JM, Gianchandani EP, Papin JA: Flux balance analysis in the era of metabolomics. Brief Bioinform. 2006, 7: 140-150.
Oberhardt MA, Chavali AK, Papin JA: Flux balance analysis: interrogating genome-scale metabolic networks. Meth Mol Biol. 2009, 500: 61-80. 10.1007/978-1-59745-525-1_3.
Orth JD, Thiele I, Palsson BO: What is flux balance analysis?. Nat Biotechnol. 2010, 28: 245-248.
Schellenberger J, Que R, Fleming RM, Thiele I, Orth JD, Feist AM, Zielinski DC, Bordbar A, Lewis NE, Rahmanian S, Kang J, Hyduke DR, Palsson BO: Quantitative prediction of cellular metabolism with constraint-based models: the COBRA Toolbox v2.0. Nat Protocol. 2011, 6: 1290-1307. 10.1038/nprot.2011.308.
This work was supported by the National University of Singapore, Biomedical Research Council of A*STAR (Agency for Science, Technology and Research), Singapore, Korea Research Foundation (KRF- 2010–0009169), Republic of Korea, and a grant from the Next-Generation BioGreen 21 Program (SSAC, No. PJ009520), Rural Development Administration, Republic of Korea.
The authors declare that they have no competing interests.
NRL, ML, JBP and DYL conceived and designed the study. NRL and JWS performed the batch culture experiments. NRL and SA created the draft model. ML refined the model and performed simulations. NRL, ML, IAK, JBP and DYL wrote the manuscript. JBP and DYL coordinated and directed the project. All authors have read and approved the final manuscript.
Na-Rae Lee, Meiyappan Lakshmanan contributed equally to this work.
Electronic supplementary material
Additional file 1: Details of i TT548 containing all genes, reactions, metabolites. A list of reactions added during gap-filling and possible new annotations identified in this study are also provided. (XLS 398 KB)
Authors’ original submitted files for images
About this article
Cite this article
Lee, NR., Lakshmanan, M., Aggarwal, S. et al. Genome-scale metabolic network reconstruction and in silico flux analysis of the thermophilic bacterium Thermus thermophilus HB27. Microb Cell Fact 13, 61 (2014). https://doi.org/10.1186/1475-2859-13-61
- Thermus thermophilus
- Genome-scale metabolic model
- Constraints-based flux analysis