Table 1 Characteristics of the P. lambertiana sequence data and 1.0 assembly, compared to known cytometric and cytological properties
Cytometric Genome Size                  31 Gbp
Chromosome number                       12
Assembly V1.0
Total size
 Scaffolds ≥ 200 bp                     4,259,911 scaffolds
                                        27.6 Gbp including gaps
                                        25.5 Gbp without gaps
 Scaffolds ≥ 500 bp                     1,089,992 scaffolds
                                        26.9 Gbp including gaps
                                        24.7 Gbp without gaps
                                        54,147,744 contigs
 Contigs < 200 bp (“chaff”)             6.5 Gbp
N50 scaffold size (31 Gb)               246.6 kbp
N50 contig size (31 Gb)                 4.25 kbp
Sequence data
Number of paired-end libraries          56
Paired-end sequencing depth             1,910 Gbp (61.5×)
 By platform
  HiSeq 2000 (125 bp + 125 bp)          2.8 × 10^11 bp (9.0×)
  HiSeq 2500 (150 bp + 150 bp)          1.4 × 10^12 bp (45.1×)
  GAIIx (160 bp + 156 bp)               1.8 × 10^11 bp (5.8×)
  MiSeq (255 bp + 255 bp)               4.7 × 10^10 bp (1.5×)
 By fragment size
  [200 bp, 400 bp]                      9.6 × 10^11 bp (31.0×)
  [400 bp, 600 bp]                      4.6 × 10^11 bp (15.0×)
  [600 bp, 900 bp]                      4.8 × 10^11 bp (15.6×)
Long fragment libraries (1.5–25 kbp)    34
Long fragment coverage
 Illumina TruSeq                        22.5× physical coverage
 Nextera mate pair                      71.2× physical coverage
  • N50 statistics were calculated using an estimated genome size of 31 Gbp. Paired-end sequencing depth represents the raw output prior to error correction. Physical coverage, as estimated by MaSuRCA (including the inferred DNA fragment), is reported here for all libraries, grouped by chemistry (see Supplementary Methods in File S1).
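
The two derived quantities in Table 1 can be illustrated with a short sketch. Sequencing depth is the number of sequenced bases divided by the estimated genome size, and an N50 computed against an estimated genome size (often called NG50) is the length of the sequence at which the running total, taken from longest to shortest, first reaches half of that genome size. The function names below are hypothetical and the toy lengths are invented for illustration; only the 31 Gbp genome size and the per-platform base counts come from the table. This is not the authors' pipeline.

```python
# Minimal sketch, assuming the 31 Gbp cytometric estimate from Table 1.
GENOME_SIZE_BP = 31e9


def depth(total_bases, genome_size=GENOME_SIZE_BP):
    """Sequencing depth (fold coverage) = sequenced bases / genome size."""
    return total_bases / genome_size


def n50_at_genome_size(lengths, genome_size):
    """Length at which the cumulative sum of sequences, sorted longest
    first, reaches half of genome_size. Returns None if the sequences
    together never cover half of genome_size."""
    half = genome_size / 2
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if running >= half:
            return length
    return None


# Per-platform base counts as reported (rounded) in Table 1.
platform_bases = {
    "HiSeq 2000": 2.8e11,
    "HiSeq 2500": 1.4e12,
    "GAIIx": 1.8e11,
    "MiSeq": 4.7e10,
}
for name, bases in platform_bases.items():
    print(f"{name}: {depth(bases):.1f}x")

# Toy scaffold lengths (invented): the same assembly gives a smaller N50
# when measured against a genome size larger than the assembled total.
toy_lengths = [50, 40, 30, 20, 10]
print(n50_at_genome_size(toy_lengths, 150))  # against the assembled total
print(n50_at_genome_size(toy_lengths, 200))  # against a larger estimate
```

Because the base counts in the table are rounded to two significant figures, the recomputed depths can differ from the reported values in the last digit. The toy example shows why the table states the genome size used: measuring against the 31 Gbp estimate rather than the assembled total yields a more conservative N50.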