Hi,
perhaps you guys can help me with this.
we have 2 tables:
- gf_data: 2 columns (gene_family id`s and gene_id`s) . This table
has about 500k rows
- annotation: lots of columns, but we only need the ones with the
gene_id`s and the species. This table has about 300k rows.
We now need a couple of things:
- For each species, we need a count of genes that are in
gene_families with a size larger than 1
- For each species, we need a count of gene_families.
Now, I've tried already a lot, but run into the problem of having to
use either joins or nested queries, which do not really scale
well apparently :s
I thought about creating a view (by appending the gene_family into the
annotation table), but it's been such a long time
i've done this that I cannot remember whether this will be any help at
all.
So, any suggestions?
Michiel
PS: I'll think about a price for the winner :)
--
==================================================================
Michiel Van Bel
PhD student
Tel:+32 (0)9 331 36 95 fax:+32 (0)9 3313809
VIB Department of Plant Systems Biology, Ghent University
Technologiepark 927, 9052 Gent, BELGIUM
mibel@psb.ugent.be http://www.psb.ugent.be
==================================================================