- This tool allows the user to search the DRSC database
using any sequence the user desires or to search by a non-fly gene.
- The user may provide a sequence to use for the search or the
user may provide a gene name and organism name for the search. The user may not
provide both a sequence and a gene name at the same time.
- If the user decides to use the gene name option, the user may enter a gene name.
- The organism name should be the scientific name of the organism.
- If a particular gene name is tried, and the program cannot find the
gene in GenBank, the user should try an alternative name. The number of gene synonyms
stored locally is not very high. Also, Homologene is somewhat stringent.
If a less stringent search is needed, cut and paste a sequence for a Blast-based search.
- This tool uses NCBI's BLAST to search the DRSC database.
- The user must specify whether the sequence provided is nucleotide or
protein.
- The user must specify an appropriate blast algorithm. Currently, the options are
"blastn", "tblastn", and "tblastx". blastn and tblastx are appropriate for
nucleotide query sequences. tblastn is appropriate for protein query
sequences. tblastx is less stringent than blastn. More information about BLAST can be
found here.
- The user must specify what target to search. Currently, there are
two options: "DRSC Amplicon Set" and "BDGP 3.1 Fly Transcripts". Essentially,
the former includes just the sequences of the amplicons and the latter includes
entire genes. Choose the latter if a broader search is desired.
- The score threshold indicates the BLAST score that should be used
as a cutoff. The more negative the score, the more stringent.
- To maximize the number of results, the user should use tblastx with a
nucleotide sequence, use BDGP 3.1 as the target, and set the threshold
to -5.
|