Citrus clementina Genome

Source

The Citrus clementina v1.0 (Clementine) assembly and the following text comes from JGI Phytozome

Overview

The Clementine mandarin (Citrus clementina) is related to the sweet orange. Clementine is normally diploid, however this genome sequence was generated from a haploid plant greatly simplifying genomic sequencing and assembly. The haploid plant was generated by in situ parthenogenesis of clementine Clemenules induced by irradiated pollen of Fortune mandarin, followed by direct embryo germination in vitro. The sequencing project is the work of the International Citrus Genome Consortium (ICGC) Draft sequencing of the genome by IGCG, Genoscope, IGA, HudsonAlpha and JGI using a Sanger whole genome shotgun approach generated a high-quality draft assembly which was subsequently integrated with a genetic map to generate a chromosome-scale (pseudo-molecule) assembly.
Assembly details 

Statistics

Genome Size

This version of the assembly (v1.0) is 301.4 Mb spread over 1,398 scaffolds with 2.1% gaps at 7.0x coverage. Over 96% of the assembly is accounted for by the 9 chromosome pseudo-molecules ~21-51 Mbp in length.

Loci

The current gene set (clementine1.0) integrates 1.560 M ESTs with homology and ab initio-based gene predictions (by GenomeScan, Fgenesh, exonerate). 24,533 protein-coding loci have been predicted. Each encodes a primary transcript. There are an additional 9,396 alternative transcripts encoded on the genome generating a total of 33,929 transcripts. 16,963 primary transcripts have EST support over at least 50% of their length. A third of the primary transcripts (8,684) have EST support over 100% of their length.

Sequencing, Assembly, and Annotation

Whole genome sequencing strategy

Genomic sequence was generated by the IGCG, Genoscope, IGA and JGI using a whole genome shotgun approach using Sanger technology sequencing 2-3kb, 6-12kb insert libraries as well as a 39kb fosmid end library totaling 6x coverage.

How was the assembly generated?

The genome was assembled with Arachne by Jeremy Schmutz at HudsonAlpha. Scaffolds were placed on chromosome pseudo-molecules by aligning SSR markers from the Clementine genetic map (Ollitrault, Terol, Chen, et al., 2012) to scaffolds, followed by breaking and rejoining scaffolds. Over 96% of the genome is on a chromosome-scale pseudo-molecule.

Reference Publication

Wu GA, Prochnik S, Jenkins J, Salse J, Hellsten U, Murat F, Perrier X, Ruiz M, Scalabrin S, Terol J, Takita MA, Labadie K, Poulain J, Couloux A, Jabbari K, Cattonaro F, Del Fabbro C, Pinosio S, Zuccolo A, Chapman J, Grimwood J, Tadeo FR, Estornell LH, Muñoz-Sanz JV, Ibanez V, Herrero-Ortega A, Aleza P, Pérez-Pérez J, Ramón D, Brunel D, Luro F, Chen C, Farmerie WG, Desany B, Kodira C, Mohiuddin M, Harkins T, Fredrikson K, Burns P, Lomsadze A, Borodovsky M, Reforgiato G, Freitas-Astúa J, Quetier F, Navarro L, Roose M, Wincker P, Schmutz J, Morgante M, Machado MA, Talon M, Jaillon O, Ollitrault P, Gmitter F, Rokhsar D, Sequencing of diverse mandarin, pummelo and orange genomes reveals complex history of admixture during citrus domestication., Nature biotechnology. 2014 Jul; 32 7 656-62

Data sets 
The genome assembly, pseudomolecules, annotations and genome browser are available through the links below.
Bulk Datasets (via Phytozome) Genome browser
BLAST Pathways