Open Access

The artificial neural network approach based on uniform design to optimize the fed-batch fermentation condition: application to the production of iturin A

Microbial Cell Factories201413:54

https://doi.org/10.1186/1475-2859-13-54

Received: 6 January 2014

Accepted: 8 April 2014

Published: 13 April 2014

Abstract

Background

Iturin A is a potential lipopeptide antibiotic produced by Bacillus subtilis. Optimization of iturin A yield by adding various concentrations of asparagine (Asn), glutamic acid (Glu) and proline (Pro) during the fed-batch fermentation process was studied using an artificial neural network-genetic algorithm (ANN-GA) and uniform design (UD). Here, ANN-GA based on the UD data was used for the first time to analyze the fed-batch fermentation process. The ANN-GA and UD methodologies were compared based on their fitting ability, prediction and generalization capacity and sensitivity analysis.

Results

The ANN model based on the UD data performed well on minimal statistical designed experimental number and the optimum iturin A yield was 13364.5 ± 271.3 U/mL compared with a yield of 9929.0 ± 280.9 U/mL for the control (batch fermentation without adding the amino acids). The root-mean-square-error for the ANN model with the training set and test set was 4.84 and 273.58 respectively, which was more than two times better than that for the UD model (32.21 and 483.12). The correlation coefficient for the ANN model with training and test sets was 100% and 92.62%, respectively (compared with 99.86% and 78.58% for UD). The error% for ANN with the training and test sets was 0.093 and 2.19 respectively (compared with 0.26 and 4.15 for UD). The sensitivity analysis of both methods showed the comparable results. The predictive error of the optimal iturin A yield for ANN-GA and UD was 0.8% and 2.17%, respectively.

Conclusions

The satisfactory fitting and predicting accuracy of ANN indicated that ANN worked well with the UD data. Through ANN-GA, the iturin A yield was significantly increased by 34.6%. The fitness, prediction, and generalization capacities of the ANN model were better than those of the UD model. Further, although UD could get the insight information between variables directly, ANN was also demonstrated to be efficient in the sensitivity analysis. The results of these comparisons indicated that ANN could be a better alternative way for fermentation optimization with limited number of experiments.

Keywords

Artificial neural network Uniform design Genetic algorithm Iturin A Fed-batch fermentation

Background

Iturin A is a nonribosomal lipopeptide antifungal antibiotic produced by Bacillus subtilis. The iturin A structure consists of two major parts: a peptide ring composed of seven amino acid residues (L-Asn-D-Tyr-D-Asn-L-Gln-L-Pro-D-Asn-L-Ser-) and an 11–12 carbons hydrophobic tail [13]. Iturin A is a potential bioresource with broad-spectrum antifungal activity that has been used to treat human and animal mycoses [4]. More recently studies have shown that iturin A could be used as a potential biocontrol agent against harmful plant pathogen that cause crop diseases [5], such as southern corn leaf blight [6] and for controlling fungi in grains [7].

The production of iturin A has gained considerable interest mainly because of its multifarious advantages and diverse potential applications; however, its production on a commercial level has not been successful so far. To increase the yield of iturin A, optimization of the fermentation conditions is one of the primary approaches that can be applied. Fermentation process optimization has been studied intensively by researchers for decades [8], [9]. Traditional optimization approaches are the statistical methods such as the Orthogonal Experiment Method and Response Surface Methodology (RSM) which are used widely on the lab-scale [10]. However, most of these methods require data from a large number of experiments and are not suitable for use on the industrial scale where experimental numbers need to be kept as lower as possible. Therefore, a uniform design (UD) method was developed to meet the industrial needs [11]. Compared with other statistical methods, UD reduces the number of experiments in a multiple-dimension optimization and allows the largest possible number of levels for each factor [12]. UD is an important statistical method that has been used successfully in many process optimizations [1315].

In the last decade, as a kind of artificial intelligence, artificial neural network (ANN) has been applied to the modeling of non-linear systems, simulating the chaos bioprocess and predicting the results [16, 17]. Because ANNs have higher modeling accuracy and generalization capacity, they have become an attractive tool [18]. Additionally, ANN can model all non-linear multivariate functions including the quadratic functions while the statistical methods can only be used for modeling the quadratic functions [19].

ANNs normally require a large number of patterns (experiments) to establish an accurate model, but when the patterns are relatively representative and statistically well distributed, which is the assumed characteristic of statistical methods like UD, ANNs can also establish accurate models with smaller amounts of data. Recent reports indicated that based on the RSM data, the ANN model was more accurate than RSM [17, 1921]. But few studies have reported building ANN models based on UD data, where the experimental numbers were much less than the numbers in RSM. In this work, for the first time an ANN model was established based on UD data and then a comparison between the ANN and UD models was performed to assess their efficiency.

A genetic algorithm (GA) was used to optimize iturin A yield with an ANN model. The GA is an efficient stochastic global optimizing method built on the principles of biological evolution implying survival of the fittest and random changes that has proved to be an outstanding method for multivariable optimizations in biochemical processes [22].

The addition of three amino acids, asparagine (Asn), glutamic acid (Glu), and proline (Pro), to the culture medium during fermentation has been shown to increase significantly the yield of iturin A [23]. In this paper, ANN-GA and UD were used to optimize the yield of iturin A with the same experimental design from UD. Meanwhile, validation experiments using unseen data were carried out to estimate the capacity of both the ANN and UD models as well as the sensitivity analyzing. To the best of our knowledge, this is the first report on optimization by an ANN based on UD data and a comparison of the two methods in antibiotic fermentation.

Results and discussion

Experimental design

A three-factor ten-level experimental design with 10 experiments was performed according to the UD table U 10 * (108). The experimental design and results as well as the predicted outputs calculated by the UD and ANN models are shown in Table 1.
Table 1

UD matrix of variables and their experimental responses and predicted values of iturin A titer

Trail

Factor (mg/L)

Iturin A titer (U/mL)

Asn(X1)

Glu(X2)

Pro(X3)

Experimental

ANN prediction

UD prediction

1

50

280

140

11866.6 ± 287.8

11866

11928.7

2

65

380

80

10967.8 ± 277.3

10966

10978.9

3

80

260

185

12627.2 ± 279.3

12622

12608.0

4

95

360

125

12429.8 ± 289.6

12419

12458.5

5

110

240

65

12496.7 ± 287.3

12492

12493.4

6

125

340

170

13057.1 ± 282.6

13048

13110.6

7

140

220

110

12604.6 ± 276.3

12602

12625.8

8

155

320

50

12519.1 ± 285.5

12513

12589.4

9

170

200

155

11890.0 ± 274.6

11893

11930.7

10

185

300

95

12706.5 ± 276.4

12702

12694.6

UD modeling, analysis and optimization

To determine the relationship between the concentrations of the three added amino acids (variables), regression analysis was performed using the Minitab software. A second order polynomial equation, as in Eq. (1), was obtained to correlate the variables with iturin A titer. The results were analyzed by ANOVA, shown in Table 2. The UD model F-value was 105.55, calculated as the ratio of mean square regression and mean square residual, and the model P-value (Prob > F) was 0.009, which is very low. These two values (F and P) implied that this model was significant. The crossed terms, X 1 X 2 and X 2 X 3 , were deleted for the high correlation with variable X.
Table 2

Analysis of variance (ANOVA) for the quadratic uniform design model

Factor

Coefficients

Sum of squares

Standard error

Degree of freedom

Ta

Pb

Constant

−4864

 

1002

 

−4.86

0.040*

X 1

59.97

642299

4.91

1

12.21

0.007*

X 2

87.57

6239

8.63

1

10.15

0.010*

X 3

21.81

298991

5.45

1

4.00

0.057

X 1 2

−0.23

656908

0.014

1

−15.97

0.004*

X 2 2

−0.15

1445754

0.015

1

−10.10

0.010*

X 3 2

−0.075

111229

0.015

1

−4.95

0.039*

X 1 X 3

0.0011

7

0.028

1

0.04

0.972

Model

F = 105.55 P = 0.009*

R2 = 99.7 % S = 65.4117

aT denotes t-value.

bP denotes P-value.

*denotes significant.

The P-values were also used in the analysis of the independent variables to check the significance of the coefficients, which are necessary to elucidate the pattern of the mutual interactions between them. The P-value of each term, as well as the estimated coefficient and t ratio are given in Table 2. The coefficient was deemed significant when P was less than 0.05, and the smaller the P was, the more significant it indicated. The P-value indicated that X 1 (Asn), X 2 (Glu), X 1 2 (Asn × Asn), X 2 2 (Glu × Glu) and X 3 2 (Pro × Pro) were the significant terms. Among the independent variables, addition of Asn to the culture medium (P = 0.007) had the most significant effect on the iturin A titer followed by Glu (P = 0.01) and Pro (P = 0.057). All the quadratic terms having small P-values indicated that the quadratic terms also had significant effects on the iturin A titer, and the effects of the variables on the responses were not a simple linear relationship. The interactions between the variables were not significant, and only the interaction between Asn and Pro had a small effect.

The regression equation was as follows:
Y = 4864 + 60.0 X 1 + 87.6 X 2 + 21.8 X 3 0 .226 X 1 2 0 .150 X 2 2 0 .0752 X 3 2 + 0.0011 X 1 × X 3

where Y is the titer of iturin A; and X 1 , X 2 and X 3 are the concentrations of added Asn, Glu and Pro respectively.

The fit of model was also indicated by the coefficient of determination R2, which was calculated to be 99.7%, suggesting that the model was efficient and could explain 99.7% of the experimental data.

The optimization was based on taking the partial derivative with respect to X, which were calculated as 133.1 mg/L, 292 mg/L and 145.9 mg/L for the concentrations of added Asn, Glu and Pro, respectively. This gave the maximum predicted result for the iturin A titer (Y) as 13509.1 U/mL.

Three repeat verification experiments were carried out and the average experimental iturin A titer at the optimal feed condition was 13221.7 ± 275.7 U/mL, which was close to the predicted result of 13509.1 U/mL. The prediction deviation was 2.17%.

Artificial neural network and genetic algorithm

ANN modeling and analysis

The same experimental design that was used in the UD method was used as the training set for the ANN modeling, as shown in Table 1. Other experimental data that were not part of the training set were used as the test set (unseen data) to test the generalization capacity of the ANN model, as shown in Table 3. A feed-forward back-propagation (BP) net was chosen with three neurons in the input layer and one neuron in the output layer. The Levenberg-Marquardt Back Propagation training function was chosen and the structure including one hidden layer with seven neurons was decided, which had the minimum mean-square-error (MSE) (0.083 for the scaled data) between the predicted outputs and experimental responses of the test set. Meanwhile, the MSE for the training set was 7.74 × 10−4 (for the scaled data). Thus, the selected ANN model had a 3-7-1 topology, i.e. an input layer with three neurons, a hidden layer with seven neurons, and an output layer with one neuron.
Table 3

ANN and UD prediction for unseen test data

Trail

Factor (mg/L)

Iturin A titer (U/mL)

Asn(X 1 )

Glu(X 2 )

Pro(X 3 )

Experimental

ANN prediction

UD prediction

1

50

200

125

11668.6 ± 276.1

12085

10647.9

2

50

300

50

10763.5 ± 279.8

11481

11255.8

3

50

380

180

10850.6 ± 277.3

10439

10696.4

4

125

200

50

12100.5 ± 277.0

12128

11533.6

5

125

300

125

13064.1 ± 275.9

13074

13451.9

6

125

380

180

12570.8 ± 274.9

12381

12245.0

7

180

200

180

11856.0 ± 171.4

11778

11656.8

8

180

300

125

12878.3 ± 272.9

12950

12968.4

9

180

380

50

12363.2 ± 268.6

11863

11153.5

10

100

333

200

12215.8 ± 272.7

12068

12787.5

11

175

233

175

12143.9 ± 274.1

12288

12527.9

12

50

400

150

10108.0 ± 274.4

9910

10197.3

13

200

200

100

11922.0 ± 275.6

11675

11066.0

14

75

366

75

10914.9 ± 282.2

11232

11551.1

15

150

266

50

12244.3 ± 271.2

12532

12649.5

When the ANN model was built, its fitting capacity and generalization capacity were measured by analyzing the experimental data and model prediction data of the training set and test set, respectively. The average error for the training set and test set were 0.039% and 2.19%, respectively. The correlation coefficient between the model-predicted results and experimental results was 99.998% for the training set and 92.618% for the test set. The small MSE and average error and the high correlation coefficient for the training set indicated that the ANN model processed outstanding approximation ability. Further, the satisfied values of MSE, average error and correlation coefficient for the test set suggested that the ANN model also had good generalization capacity.

Sensitivity analysis

A sensitivity analysis for the ANN model was performed to estimate the effects of input variables on the predicted iturin A titer. The analysis was carried out on the ANN model decided above. Because the ANN model cannot directly give the relationship between input and output, a perturbation analysis based on the mean value of the input data was used to elucidate such insights of the system. Thirty random perturbations of each variable were tested. The pertuibations fluctuated around the mean value of the input data by the amplitude of the standard error. The sensitivity estimation value was calculated as S in Eq. (6). The perturbation curve for each variable and the corresponding S value are shown in Figure 1 and Table 4. The S values for the concentrations of added Asn, Glu, and Pro were 271.1, 54.8 and 114.4, respectively. The sensitivity curve for variations in the concentrations of added Asn had the biggest fluctuation range around their mean values, compared with the fluctuations in the sensitivity curves for variations in the concentrations of added Pro and Glu. This result indicated that the concentration of added Asn had the most significant effect on the titer of iturin A, followed by the concentration of added Pro and Glu, when the variables perturbed around their mean values.
Figure 1

Sensitivity curves of inputs to outputs based on mean value. It indicated the effects of each independent variable when changing around their mean values.

Table 4

Sensitivity analysis of variables to outputs

Variables

Amplitude of perturbation

S value

Asn addition concentration

45.4

271.1

Glu addition concentration

60.6

54.8

Pro addition concentration

45.4

114.4

Interestingly, when an ‘perturb method’ [19] was applied over the entire range of input data using coded values, the sensitivity changed close to the edge. As shown in Figure 2 and Table 5, each series in the graph represented the changing rate of outputs caused by the variation of variables; the higher the slope, the greater the effect of the variable. When the perturbation emerged near the mean value (coded as 0, range from −1 to 1), the sensitivity of each variables was comparable to the results shown in Figure 1. However, when the perturbation emerged near the edge (range from −2 to −1 and 1 to 2), the slopes of curves of the Asn, Glu, and Pro were 660.5, 428.5 and 292.0, respectively, which indicated that the concentration of added Asn had the most significant effect, followed by Glu and Pro, on the iturin A titer. This result suggested that the model could not be explained simply by a linear relationship because the sensitivities were different within different input ranges.
Figure 2

Sensitivity analysis of ANN model using perturb method. It indicated the effects of each independent variable when changing in the entire optimized range. The coded values were shown in Table 5.

Table 5

coded values of variables

Variables

Coded values

−2

−1

0

1

2

Asn

50

83.75

117.5

151.25

185

Glu

200

245

290

335

380

Pro

50

83.75

117.5

151.25

185

Optimization based on GA

A GA-based method was used to optimize the input variables with the goal of maximizing the iturin A titer. The ANN model was used for the fitness function of GA and the optimization process was run several times with different random initial populations. These repetitions ensured that the global optimum to which GA converged under most of the initial conditions was obtained. Based on this ANN-GA optimization process, the maximum iturin A titer was predicted to be 13257 U/mL when the concentrations of added Asn, Glu, and Pro were 155 mg/L, 320 mg/L and 160 mg/L, respectively. Three repeat verification experiments were carried out under the feed condition described above, and the experimental result for the iturin A titer was 13364.5 ± 271.3 U/mL, which was reasonably close to the ANN-GA prediction. Based on this result, the predictive error of the ANN-GA method was 0.8%. The iturin A titer increased significantly by 34.6% from the control (9929.0 ± 280.9 U/mL) in which the batch fermentation was performed without the addition of the three amino acids.

Comparison of UD and ANN-GA

The same experimental design was used to train the UD and ANN models and the comparison was based on the correlation coefficient and the root-mean-square-error (RMSE). The fitting values (prediction) of ANN and UD model are shown in Table 1. Both models had excellent fitting accuracy. The ANN model had a smaller deviation (RMSE) than UD model (Table 6), suggesting that the ANN model had the better approximation capacity.
Table 6

Comparison of predictive capacity of UD and ANN

 

Training data

Validation data

UD

ANN

UD

ANN

RMSEa

32.21

4,84

483.12

237.58

Cb

99.86%

1.0

78.58%

92.62%

error %

0.26

0.039

4.15

2.19

aRMSE is the root-mean-square-error.

bC is the correlation coefficient.

Generalization capacity is an important parameter that is used to evaluate predictive models. Generalization capacity can be estimated only by using input data that is different from the training data set, therefore, 15 additional experiments were carried out to generate a test data set (shown in Table 3). The UD and ANN models were both tested using the new unseen test data. The experimental and predictive results are listed in Table 3. The RMSE between the experimental response and the result predicted by the UD and ANN models were 483.12 and 237.58, respectively, and the correlation coefficients for UD and ANN were 78.58% and 92.62%, respectively. Figure 3 shows the comparative parity plot for UD and ANN predictions. The deviations for both UD and ANN can be observed directly from the figure. The ANN model fitted the experimental data more accurately than the UD model prediction, which showed larger deviation than ANN. The RMSE for ANN was about two times less than the RMSE for UD and the correlation coefficient was higher for the unseen data by ANN, indicating that the ANN model had significantly better generalization capacity than the UD model. The higher predictive ability of the ANN model is likely to be because of its universal capacity to approximate any non-linear situation, while the UD model is limited to the second order polynomial.
Figure 3

Comparison of generalization capacity of UD and ANN model. It showed the parity plot for ANN and UD prediction for the unseen data.

In the sensitivity analysis of the UD model, the significance of the variable effects on the iturin A titer can be observed directly from the equation and the ANOVA analysis, and based on these, the addition of Asn was found to be the most significant factor followed by addition of Glu and Pro. By contrast, for the ANN model, its sensitivity of the input variables cannot be obtained directly, but several methods are available to obtain insights into the relationships between inputs and outputs. In the perturbation analysis based on the mean value of the input data, when the perturbations occurred around the mean values, Asn addition was the most significant factor followed by Pro and Glu addition. In the perturb method based on the entire range of the input data, close to the edge of the range, the significance was Asn, follow by Glu and Pro, which interestingly was quite comparable with the sensitivity analysis results for the UD model. Thus, ANN was also efficient in the sensitivity analysis.

The optimal results for UD and ANN-GA were compared. Although under optimal conditions their experimental values were not statistically significantly different, the prediction error of ANN-GA was only 0.8%, which was quite a bit less than the 2.17% error of UD. Therefore, the optimization ability of ANN-GA was demonstrated to be better than UD.

These comparisons indicated that both the ANN and UD model had satisfactory fitting accuracy. However, the ANN model had better predictive capacity than the UD model. Further, ANN-GA method had better performance than UD method for the iturin A yield optimization process. And more importantly, with higher optimization accuracy, the ANN-GA method needs much less experimental data than any other reported methods, which means the number of experiments can be significantly reduced. Thus, ANN-GA may be a better method, which can reduce the cost and increase the efficiency, for the large-scale optimization.

Conclusions

In the present work, the focus was on using ANN-GA and UD to optimize iturin A production in flask-shaking fed-batch fermentation. For the first time, an ANN model based on UD data was established and comparisons between ANN-GA and UD were carried out. Three-factor-ten-level UD experimental data were used as the training set, and other 15 experimental data that were different from the training set were used as the test set. For the training set, the MRSE, correlation coefficient and error% of ANN model were 4.84, 100% and 0.039 respectively (compared with 32.21, 99.86% and 0.26 for UD). The satisfactory accuracy of the ANN fitting capacity indicated that the ANN model could work well based on the UD data, which had the minimal experimental number required to meet the needs of industrialization optimization. For the test set, the MRSE, correlation coefficient and error% of the ANN model were 237.58, 92.62% and 2.19 respectively (compared with 483.12, 78.58% and 4.15 of UD). Thus, ANN had better fitting, prediction and generalization capacities than UD based on the same experimental design. The sensitivity analysis indicated that when the perturbations in the input data occurred around the mean values, Asn addition was the most significant factor that had an effect on the iturin A yield, followed by Pro and Glu addition. Interestingly, when the perturbations were close to the edge of the range, the significance of Asn addition, followed by Glu and Pro, were comparable in the two models. Thus, although UD could get the insight information between variables directly, ANN was also demonstrated efficient in the sensitivity analysis. Additionally, ANN-GA was found to be more accurate in finding optimum conditions and predicting optimum iturin A titer. The optimum iturin A titer was 13364.5 ± 271.3 U/mL when the concentrations of added Asn, Glu and Pro were 155 mg/L, 320 mg/L and 160 mg/L, respectively. Further, using ANN-GA based on UD data, the iturin A titer was increased significantly by 34.6% (from 9929.0 ± 280.9 U/mL in batch fermentation without amino acids addition). Therefore, ANN methodology could be a better alternative to the UD approach based on the minimal experimental number that will reduce cost and increase efficiency in industrial applications.

Material and methods

Microorganism and medium

The Bacillus subtilis ZK8 strain that produces iturin A, was separated and mutated in our laboratory and stored in 4°C.

The slant culture medium contained 1.8 g/L MgSO4 · 7H2O, 1.5 g/L K2HPO4, 20 g/L peptone, 10 ml/L glycerol, and 1.8 g/L agar. The seed culture medium contained 25 g/L glucose, 30 g/L peptone, 2.86 g/L KH2PO4, and 3 g/L MgSO4. The fermentation culture medium was prepared with 31 g/L glucose, 3.8 g/L MgSO4, 0.79 g/L KH2PO4, 0.8 g/L yeast extract, and 2.4 g/L soybean protein powder hydrolysate.

Fed-batch fermentation of iturin A

Strain ZK8 was activated in slant culture medium twice, and both cultures were incubated at 30°C for 36 h. The activated strain was then inoculated and incubated in the seed culture medium (100 mL culture in 500 mL flask) in a shaker at 30°C with 150 rpm for 20 h. Next, the seed culture was inoculated in 60 mL fermentation culture by 10% amount of inoculum for 48 h at 30°C with 150 rpm. After 24 hours of fermentation, three amino acids (Asn, Glu and Pro) were added to the broth in concentrations that were determined by statistical design. After fermentation, the broth was collected for further analysis.

Iturin A yield evaluation: titer measurement

The yield of iturin A was determined by titer measurement which is the best way to evaluate the capacity of natural biosynthetic antibiotic like iturin A. Titer can elucidate the effects of antibiotics directly and indicate their likely field effects or clinical effects. The fermentation broth was centrifuged at 9000 rpm, and the supernatant was collected and sterilized as measured samples. The cylinder-plate method was used to measure the titer of iturin A. This is a typical method that is described in most pharmacopeia that has proved to be efficient in antibiotic evaluation [2426]. Phytophthora sp. was used as the test organism.

Experimental design

A three-variable ten-level uniform design was carried out to optimize the concentrations of three added amino acids in fed-batch fermentation. The uniform design table U 10 * (108) was chosen for the experimental design. The concentration levels of independent variables X 1 (Asn concentration, mg/L), X 2 (Glu concentration, mg/L) and X 3 (Pro concentration, mg/L) are shown in Table 7. The relationship between the responses and variables was expressed by a second order polynomial Eq. (1) as follows:
Y = β 0 + i = 1 m β i X i + i = 1 m β i i X i 2 + i < j β ij X i X j
(1)
where Y is the predicted response (iturin A titer, U/mL); β 0 is a constant; β i, β ii and β ij are linear coefficient, square coefficient and cross coefficient respectively; and X i and X j are the independent variables.
Table 7

Factors and level values of uniform design

Factor

Levels (mg/L)

1

2

3

4

5

6

7

8

9

10

Asn(X1)

50

65

80

95

110

125

140

155

170

185

Glu(X2)

200

220

240

260

280

300

320

340

360

380

Pro(X3)

50

65

80

95

110

125

140

155

170

185

To estimate and avoid the system errors in the outputs, the experiments were repeated three times. The regression coefficients and ANOVA were calculated by Minitab software.

Artificial neural networks

Common back-propagation (BP) artificial neural networks with a feed-forward structure were used to establish the predictive model for the iturin A titer. The ANN structure was composed of three layers with three neurons (one for each input variable) in the input layer and one neuron (iturin A titer) in the output layer. Several training algorithms (including “Fletcher-Reeves conjugate gradient”, “Ploak-Ribiere conjugate gradient”, “Powell-Beale conjugate gradient”, “BFGS quasi-Newton”, “one step secant”, “Resilient gradient descent”, “gradient descent”, “gradient descent with momentum”, “gradient descent with adaptive lr momentum” and “Levenberg-Marquardt Back Propagation” functions) were tested. For the hidden layer, a cut-and-try method was used to adjust the number of neurons from 3 to 12. The data were divided into two parts; the training set based on UD data, and the validation set, which included different patterns of the training set (shown in Tables 1 and 3). The aim of adjusting the structure was to obtain the fastest network convergence speed and the lowest mean-square-error (MSE), calculated using Eq. (2), between the predicted and actual outputs.
MSE = 1 N i = 1 N y i y i ' 2
(2)

where N is the number of input values; y i is the actual output value; and y i is the predicted output value.

In this architecture, data always flows forward from input layer to hidden layer, and then to output layer. The adjustable parameter matrices, known as weights and biases, were associated with the connection between the nodes of the network. First the data were scaled by Eq. (3), and then the scaled input data were introduced to the hidden layer and summed using Eq. (4). The result was passed through the log-sigmoid activation function shown as Eq. (5), and became the input to the output layer. The activation function of the output layer was the linear function. The neuron in the output layer produced the output using the same procedure as the neurons in the hidden layer.
X scaled = X X min X max X min
(3)
where X scaled is the scaled data matrix; X is the non-scaled data matrix; and X min and X max are the minimum and maximum values of the data matrices respectively.
net = i = 1 N X i W i + b
(4)
where X i is the input data; W i is the weight; and b is the bias.
f net = 1 1 + exp net
(5)

The ANN model was implemented in MATLAB 2012b, trained with the Levenberg-Marquardt back-propagation function, using the learning rate at 0.01. The training was stopped when the performance goal (MSE < 0.00001) was reached. Details of training an optimal ANN model with outstanding generalization capacity have been described in many reports [16, 22, 27].

Sensitivity analysis

A sensitivity analysis was performed to find the effect of the input variables on the outputs. The effect of the addition of the individual amino acids can be captured in more obvious way in UD than in ANN. In the UD model, because of the regular form of variables in the quadratic equation, the importance and interactions of each variable can be read directly from the relative coefficients of the equation and ANOVA analysis. In the ANN model, which is performed as a ‘black box’, the information is not given directly, so other methods need to be used to assess its sensitivity analysis. In this study, a perturbation analysis [28, 29] based on the mean value was chosen for the sensitivity analysis [30]. The sensitivity of each variable was evaluated by the standard deviation of the net output perturbation caused by the perturbation of one input variable when the other variables were constant, according to Eq. (6) as follows:
S i = j = 1 n f X i + ϵ j , W 2 n σ i ϵ j σ i j = 1 , 2 , , n
(6)

where S i is the sensitivity estimating value of input on the output function; X i is the variables; ϵ j is the random value among the deviation area; σ i is the standard deviation of input data; n is the number of random values, which was 30 in this case; W is the weight in the ANN model; and f(X,W) is the output of the ANN model.

Optimization by GA

GA was carried out to optimize the input area of the ANN model with the aim of maximizing the iturin A titer. The GA program was implemented in MATLAB, with the real-coded chromosome length = 1, population size = 20, crossover probability = 0.4, mutation probability = 0.2, and max generations = 100. The ANN model was used as the fitness function of GA. And the GA optimizing procedure was run several times with different random initialized conditions.

For UD, optimization was achieved by taking the partial derivative of the function (Y) with respect to the variable (X).

Abbreviations

(ANN): 

Artificial neural network

(UD): 

Uniform design

(RSM): 

Response surface methodology

(GA): 

Genetic algorithm

(Asn): 

Asparagine

(Glu): 

Glutamic acid

(Pro): 

Proline

(MSE): 

Mean-square-error

(RMSE): 

Root-mean-square-error

(BP): 

Back propagation.

Declarations

Acknowledgements

This work was funded by the National Key Technologies R&D Program (No. 2011BAE06B04-18).

Authors’ Affiliations

(1)
Key Laboratory of Environmental and Applied Microbiology, Chengdu Institute of Biology, Chinese Academy of Sciences
(2)
University of Chinese Academy of Sciences

References

  1. Besson F, Peypoux F, Michel G, Delcambe L: Characterization of Iturin A in antibiotics from various strains of Bacillus Subtilis. J Antibiot. 1976, 10: 1043-1049.View ArticleGoogle Scholar
  2. Delcambe L, Peypoux F, Besson F, Guinand M, Michel G: Structure of iturin and iturin-like substances. Biochem Soc Trans. 1977, 5: 1122-1124.View ArticleGoogle Scholar
  3. Isogai A, Takayama S, Murakoshi S, Suzuki A: Structure of β-amino acids in antibiotics iturin A. Tetrahedron Lett. 1982, 23: 3065-3068. 10.1016/S0040-4039(00)87534-6.View ArticleGoogle Scholar
  4. Blocquiaux S, Delcambe L: Essais de traitement de dermatomycoses par l'iturine. Arch Belg Dermatol Syphiligr. 1956, 12: 224-226.Google Scholar
  5. Mizumoto S, Shoda M: Medium optimization of antifungal lipopeptide, iturin A, production by Bacillus subtilis in solid-state fermentation by response surface methodology. Appl Microbiol Biotechnol. 2007, 76: 101-108.View ArticleGoogle Scholar
  6. Ye YF, Li QQ, Fu G, Yuan GQ, Miao JH, Lin W: Identification of Antifungal Substance (Iturin A2) Produced by Bacillus subtilis B47 and Its Effect on Southern Corn Leaf Blight. J Integr Agric. 2012, 11 (1): 90-99. 10.1016/S1671-2927(12)60786-X.View ArticleGoogle Scholar
  7. Klich MA, Arthur KS, Lax AR, Bland JM: Iturin A: A potential new fungicide for stored grains. Mycopathologia. 1994, 127: 123-127.View ArticleGoogle Scholar
  8. Iwase N, Rahman MS, Ano T: Production of iturin A homologous under different culture conditions. J Environ Sci. 2009, 21 (Suppl 1): S28-S32.View ArticleGoogle Scholar
  9. Zhang XY, Zhou JY, Fu W, Li ZD, Zhong J, Yang J, Xiao L, Tan H: Response surface methodology used for statistical optimization of jiean-peptide production by Bacillus subtilis. Electron J Biotechnol. 2010, 13 (4): fulltext-5-View ArticleGoogle Scholar
  10. Zhou WW, He YL, Niu TG, Zhong JJ: Optimization of fermentation conditions for production of anti-TMV extracellular ribonuclease by Bacillus cereus using response surface methodology. Bioprocess Biosyst Eng. 2010, 33 (6): 657-663.View ArticleGoogle Scholar
  11. Fang KT: Orthogonal and Uniform Experimental Design. 2001, Beijing, China: Science PressGoogle Scholar
  12. Wu J-C, Cheng J, Shi X-l: Preparation of ACE Inhibitory Peptides from Mytilus coruscus Hydrolysate Using Uniform Design. BioMed Res Int. 2013, 2013: 1-5.Google Scholar
  13. Liu D, Wang P, Li F, Li J: Application of uniform design in L-isoleucine fermentation. Chin J Biotechnol. 1991, 7 (3): 207-212.Google Scholar
  14. Liang Y-z, Fang K-t, Xu Q-s: Uniform design and its applications in chemistry and chemical engineering. Chemometr Intell Lab Syst. 2001, 58: 43-57. 10.1016/S0169-7439(01)00139-3.View ArticleGoogle Scholar
  15. Wang Z, Luo D, Ena C: Optimization of polysaccharides extraction from Gynostemma pentaphyllum Makino using Uniform Design. Carbohyd Polym. 2007, 69 (2): 311-317. 10.1016/j.carbpol.2006.10.013.View ArticleGoogle Scholar
  16. Desai KM, Vaidya BK, Singhal RS, Bhagwat SS: Use of an artificial neural network in modeling yeast biomass and yield of b-glucan. Process Biochem. 2004, 39: 2193-2198. 10.1016/j.procbio.2003.11.009.View ArticleGoogle Scholar
  17. Caldeira AT, Arteiro JM, Roseiro JC, Neves J, Vicente H: An artificial intelligence approach to Bacillus amyloliquefaciens CCMI 1051 cultures: Application to the production of anti-fungal compounds. Bioresource Technol. 2011, 2011 (102): 1496-1502.View ArticleGoogle Scholar
  18. Silva R, Ferreira S, Bonifacio MJ, Dias JML, Queiroz JA, Passarinha LA: Optimization of fermentation conditions for the production of human soluble catechol-O-methyltransferase by Escherichia coli using artificial neural network. J Biotechnol. 2012, 2012 (160): 161-168.View ArticleGoogle Scholar
  19. Desai KM, Survase SA, Saudagar PS, Lele SS, Singhal RS: Comparion of artificial neural network (ANN) and response surface methodology (RSM) in fermentation media optimization: Case study of fermentative production of scleroglucan. Biochem Eng J. 2008, 41: 266-273. 10.1016/j.bej.2008.05.009.View ArticleGoogle Scholar
  20. Bas D, Boyac I: Modeling and optimization. II. Comparison of estimation capabilities of response surface methodology with artificial neural networks in a biochemical reaction. J Food Eng. 2007, 78: 846-854. 10.1016/j.jfoodeng.2005.11.025.View ArticleGoogle Scholar
  21. Guo Y, Xu JL, Zhang Y, Xu HJ, Yuan ZH, Li D: Medium optimization for ethanol production with Clostridium autoethanogenum with carbon monoxide as sole carbon source. Bioresource Technol. 2010, 101 (2010): 8784-8789.View ArticleGoogle Scholar
  22. Sarkar D, Modak JM: Optimization of fed-batch bioreactor using genetic algorithm. Chem Eng Sci. 2003, 58: 283-2296.Google Scholar
  23. Xu T, Ren YL, Zhong J, Tan H: Influence of amino acid addition on Jiean-peptide fermentation. Guangdong Agric Sci. 2012, 39 (12): 109-113.Google Scholar
  24. Hamedi J, Shahverdi AR, Samadi N, Mohammadi A, Shiran M, Akhondi S: A cylinder-plate method for microbiological assay of clavulanic acid. Pharmeur Sci Notes. 2006, 2006 (2): 53-54.Google Scholar
  25. Souza MJ, Rolim CM, Melo J, Souza Filho PS, Bergold AM: Development of a microbiological assay to determine the potency of ceftiofur sodium powder. J AOAC Int. 2007, 90 (6): 1724-1728.Google Scholar
  26. Lopes CC, Salgado HR: Development and validation of a stability-indicative agar diffusion assay to determine the potency of linezolid in tablets in the presence of photodegradation products. Talanta. 2010, 82 (3): 918-922.View ArticleGoogle Scholar
  27. Tambe SS, Kulkarni BD, Deshpande PB: Elements of Artificial Neural Network with Selective Applications in Chemical and Biological Sciences. 1996, Louisville: Simulation and Advanced Controls, IncGoogle Scholar
  28. Holtzman JM: On using perturbation analysis to do sensitivity analysis: derivatives versus differences. IEEE T Automat Contr. 1992, 37 (2): 243-247. 10.1109/9.121628.View ArticleGoogle Scholar
  29. Zurada JM, Malinowski A, Usui S: Perturbation method for deleting redundant inputs of Perceptron networks. Neurocomputing. 1997, 14 (2): 177-193. 10.1016/S0925-2312(96)00031-8.View ArticleGoogle Scholar
  30. Hu B-q, Guan K: Study on ANN based construction tender price index prediction model and sensitivity analysis. J Harbin Univ Commer Nat Sci Ed. 2004, 20 (1): 118-122.Google Scholar

Copyright

© Peng et al.; licensee BioMed Central Ltd. 2014

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Advertisement