Fig. 1 | Microbial Cell Factories

Fig. 1

From: Characterization and optimization of 5´ untranslated region containing poly-adenine tracts in Kluyveromyces marxianus using machine-learning model

5´ UTR poly(A)s are linked with mRNA levels and protein abundance. (A) Distributions of 5´ UTR lengths in K. marxianus. Cells were collected after 16 h and 72 h of growth and subjected to nanopore sequencing. A total of 4228 5´ UTRs from the 16 h sample and 4210 5´ UTRs from the 72 h sample were analyzed. The peak at around 50 nt is indicated. (B) Number and length distributions of 5´ UTRs with or without poly(A) in the 16 h and 72 h samples. Significance was assessed by a two-tailed t-test. **** p < 0.0001. (C) Number of 5´ UTRs containing poly(A) tracts of various lengths. (D) Distribution of the distance between the 5´ UTR poly(A) and the start codon (AUG). The peak at around 20 nt is indicated. (E, F) Enrichment and depletion of the four bases in the region from 30 nt preceding AUG to 7 nt after AUG (-30 to +10) in different groups of genes. The genes were grouped by the abundance of the encoded proteins (E) or by mRNA levels (F). Significance was assessed using a two-tailed Fisher’s exact test. Red or blue logos represent p < 0.05, while gray logos represent p > 0.05. (G, H) Correlation between the percentage of genes containing a 5´ UTR poly(A) and gene groups defined by the ratio of protein abundance to mRNA level. Protein abundance and mRNA level are represented by emPAI and TPM values, respectively. Genes containing a 5´ UTR poly(A) located within 30 nt preceding AUG are shown on the left (G), while those containing a 5´ UTR poly(A) at any position are shown on the right (H).
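
The tract lengths in panel C and the distances in panel D can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the function name `find_poly_a`, the minimum tract length `min_len=5`, and the choice to measure distance from the 3´ end of the tract to the first base of AUG are all assumptions made here for illustration, since the caption does not state how poly(A) tracts or distances were defined.

```python
import re


def find_poly_a(utr, min_len=5):
    """Locate runs of consecutive A's in a 5' UTR sequence.

    `utr` is assumed to end immediately before the start codon (AUG).
    `min_len` is a hypothetical threshold; the caption does not give one.
    Returns a list of (start, length, distance_to_aug) tuples, where
    distance_to_aug is the number of nucleotides between the 3' end of
    the tract and the first base of the start codon.
    """
    seq = utr.upper()
    pattern = re.compile(r"A{%d,}" % min_len)
    tracts = []
    for match in pattern.finditer(seq):
        length = match.end() - match.start()
        distance_to_aug = len(seq) - match.end()
        tracts.append((match.start(), length, distance_to_aug))
    return tracts


# Toy UTR: 4 nt, a 7-nt poly(A) tract, then 20 nt before the start codon.
example_utr = "GCUU" + "A" * 7 + "GCUAG" * 4
print(find_poly_a(example_utr))
```

Applied to every annotated 5´ UTR, the tuples returned by such a function could be tallied by length or by distance to produce histograms of the kind shown in panels C and D.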