Dear Annotators,
I have been looking at a number of genes lately that show evidence for
alternative splicing.
Unfortunately, many times those genes got predicted in rather small,
successive gene models, most of the time very close to each other.
I would guess that the system for gene-prediction got overloaded with
transcript information (from the RNAseq), while not being able to sort
out which reads might belong one and the same gene-model.
I would like to bring you to your attention that it would be great, in
the case you would encounter such a case to try to correct the existing
models in order to have the longest variant in the database.
In the case it would be too difficult to come up with a longest
variant, could please put a comment to the genes as they are likely too
short and with which genes-models they should be fused with.
Also take this into account when you count the number of copies of the
genes present in the genome.
best regards
Stephane
--
==================================================================
Stephane Rombauts Principal Scientific staff
Research group of Bioinformatics & Evolutionary Genomics
Tel:+32 (0)9 331 38 21 fax:+32 (0)9 3313809
VIB Department of Plant Systems Biology, Ghent University
Technologiepark 927, 9052 Gent, BELGIUM
strom@psb.vib-ugent.be http://www.psb.vib-ugent.be
==================================================================