On Aug 5, 2004, at 12:03 PM, Jason Stajich wrote:
> I believe there are ways of building virtual subsets of the databases
> locally with some tricks with formatdb but I don't really know them.
You can make a file that has the GI numbers of the sequences you want
to search against (say, a GI number dump from the Taxonomy DB), and
then use "-l gilist" with NCBI blast tools
> You can also parse your reports and lookup the GI number using the
> tools
> in Bio::Taxonomy and Bio::DB::Taxonomy to get the species name and or
> restrict hits to a particular subset of the taxonomy hierarchy.
This also works, but your expectation values will be artificially
conservative (as you actually searched a potentially much larger
database).
-Aaron