Skip to main content
Fig.Ā 1 | Microbial Cell Factories

Fig.Ā 1

From: DNA steganography: hiding undetectable secret messages within the single nucleotide polymorphisms of a genome and detecting mutation-induced errors

Fig.Ā 1

Collection of SNP data and exemplary encryption of the ā€œHā€ character. a SNP dataset collection. SNPs were downloaded from NCBI dbSNP (ftp://ftp.ncbi.nih.gov/snp/latest_release/JSON). To use SNPs to store encrypted messages, pathogenic SNPs were discarded, and SNPs that had maximum variation (A/T/G/C) were then selected. In addition, SNPs that were in transposable elements, CpG islands, or conserved regions were discarded. Then, the sequences around SNPs (āˆ’ā€‰10ā€‰~ā€‰+ā€‰10Ā nt, 21 nt in total) that were unique in the genomic sequences were selected. Through this process, 275,967 SNPs were found to be available for DNA steganography. b SNP hotspots. Regions that contained more than 35 SNPs within a 1Ā kb region were selected and named as SNP hotspots. The largest SNP number in a hotspot was 120Ā (Table 2). The SNPs in the hotspots were used to reduce the number of experiments. c An example of DNA steganography. The character ā€œHā€ was encrypted into a DNA sequence according to the encryption table (TableĀ 3), and each encrypted nucleotide replaced the original nucleotide at the predefined SNP positions. To construct a DNA sequence to decrypt, the SNP positions used for DNA insertion should be delivered to a recipient in addition to the encryption table. d Overall scheme of the secret message delivery and decryption

Back to article page