Occurrence analysis of three regions present in the CEN.PK113-7D genome, but not in the S288C genome. A) Venn diagram that represents the occurrence of the three regions over the available S. cerevisiae sequenced strains in Genbank (Additional file 12: Table S7). B and C) Annotation of the regions and RNA-seq expression profiles. RNA-seq data from glucose- and nitrogen limited anaerobic chemostat cultures (red and blue, respectively) were plotted (one bar every 10th base) for the CEN.PK113-7D specific ENA locus (B) and the two specific contigs (C). Expression data, expressed as the number of times a base is covered by a read, are ranged from are [0-750] for contig379 and contig151 and [0-250] for contig596.