Annotation of Psyllid Genome
This project includes annotators across multiple sites including Kansas State University, Indian River State College and Cornell University. We are using a Basecamp project to organize our activities. We have training webinars and regular online lab meetings. You are most welcome to join us. Please let us know if you are interested using the contact form. Please include information about your annotation experience and genes of interest, if any. We will add you to the Basecamp project and invite you to the next annotation video conference.
A screenshot of the WebApollo genome annotation platform
State of Psyllid Annotation
Diaci v1.91 is the interim genome version and is being used for annotation on WebApollo. Please get in touch using the contact form for access. All public RNAseq data has been mapped to this genome. We have also mapped DNAseq and RNAseq reads to assist in identification of correct splice sites. Gene models from OGS v1.0, MCOT v1 and NCBI Gnomon have also been made available as tracks.
Diaci1.1 is the current official version of the Diaphorina citri genome. The contig N50 for this assembly is 38 kb. It was submitted to NCBI and called NCBI-diaci1.1. You can access it here. It was annotated with the NCBI Gnomon pipeline.
The Maker annotations from the Diaci1.0 version of the genome have also been mapped to the current version. The mapping program looked for exact matches between the two assemblies, accounting for stretches of the assembly that were converted to N’s. It created an ‘alignment file’, which was used to perform the re-mapping of features in gff3 format. Maker annotations that mapped to sequence that did not exist in Diaci1.1 or only contained N’s in the NCBI-approved assembly were discarded. The Maker annotations that were on retained sequence were preserved in their original format, except for their location (scaffold name, coordinates, and strand if appropriate). 37 Maker gene models from Diaci1.0 did not map to Diaci1.1. Both the Maker annotations and NCBI annotations are available as evidence in Web Apollo.
Official Gene Set v1.0 available (March 2017)
The manual annotation paper describes the workflow for the curation of genes and specific gene families in developmental, physiological, RNAi regulatory, and immunity-related pathways in this gene set. This community effort produced 530 manually curated gene models across developmental, physiological, RNAi regulatory, and immunity-related pathways. As previously shown in the pea aphid, RNAi machinery genes putatively involved in the microRNA pathway have been specifically duplicated. The comprehensive MCOT transcriptome enabled us to identify a number of gene families that are either missing or misassembled in the draft genome. In order to develop biocuration as a training experience, we included undergraduate and graduate students from multiple institutions, as well as experienced annotators from the insect genomics research community. The resulting gene set (OGS v1.0) combines both automatically predicted and manually curated gene models. Please see the supplementary notes and annotation updates for individual gene reports and protein sequences.
There are many training resources available. Monica Munoz-Torres has put together a comprehensive guide on the manual curation process. A recent version of her training slides can be found here. Monica Poelchau at NAL created a tutorial for the new BLAST interface.
- The genome page for Diaphorina citri is available here.
- The organism page for Diaphorina citri at i5k is available here.
- New annotators can register to annotate here.
- The data files for annotation are available on the FTP site.
We are using annotation rules adapted from the i5k pilot. The annotators will use the replaced models field to state what model they are replacing with each annotation. Monica Poelchau at the NAL created the following guide to explain the rationale and details of this process.
The Diaphorina citri data that National Agricultural Library hosts can be downloaded from here. The Maker annotations re-mapped to the NCBI-diaci1.1 assembly are available here.