Term
|
Definition
Variables that can store multiple values, each retrieved using an integer index |
|
|
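A minimal Python sketch of the definition above, assuming a list is used as the indexed variable. The name `bases` and its contents are illustrative only.

```python
# One variable holds several values.
bases = ["A", "C", "G", "T"]

# Each value is retrieved with an integer index; Python counts from 0.
first = bases[0]
last = bases[3]

# An index can also be used to replace a single stored value.
bases[3] = "U"
```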
Term
BLAST |
|
Definition
Basic Local Alignment Search Tool - algorithm for comparing biological sequences, such as the amino acid sequences of proteins or the nucleotide sequences of DNA |
|
|
Term
National Center for Biotechnology Information |
|
Definition
Organization that manages GenBank, BLAST, and Entrez |
|
|
Term
3 parts of a GenBank flat file |
|
Definition
Header - contains lookup info such as the definition, locus, and accession #
Features - contains features of the molecule, such as CDS and gene
Sequence - the actual sequence |
|
|
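A rough Python sketch of how the three parts can be told apart, assuming the standard flat-file keywords `FEATURES`, `ORIGIN`, and the `//` end-of-record marker. The record below is invented for illustration, and `split_flat_file` is a hypothetical helper, not part of any library.

```python
# Toy record, invented for illustration; real records are much longer.
RECORD = """\
LOCUS       TOY0001                   12 bp    DNA     linear   SYN 01-JAN-2000
DEFINITION  Invented example record.
ACCESSION   TOY0001
FEATURES             Location/Qualifiers
     gene            1..12
     CDS             1..12
ORIGIN
        1 atggcctaat ga
//
"""

def split_flat_file(text):
    """Split one flat-file record into header lines, feature lines, and sequence."""
    header, features, sequence = [], [], []
    section = header  # lines before FEATURES belong to the header
    for line in text.splitlines():
        if line.startswith("FEATURES"):
            section = features      # feature table starts after this line
            continue
        if line.startswith("ORIGIN"):
            section = sequence      # sequence lines start after this line
            continue
        if line.startswith("//"):   # end-of-record marker
            break
        section.append(line)
    # Sequence lines carry position numbers and spaces; keep letters only.
    seq = "".join(ch for line in sequence for ch in line if ch.isalpha())
    return header, features, seq

header, features, seq = split_flat_file(RECORD)
```

For real work, an established parser is a safer choice than hand-rolled string handling; this sketch only shows where the section boundaries fall.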
Term
PAM matrices (Percentage of Acceptable point Mutations per 10^8 years) |
|
Definition
Based on global alignments of closely related proteins |
|
|
Term
BLOSUM (BLOcks SUbstitution Matrix) |
|
Definition
based on local multiple alignments of more distantly related sequences |
|
|
Term
|
Definition
Comparisons of closely related sequences should use BLOSUM matrices with higher numbers and PAM matrices with lower numbers.
Comparisons of distantly related proteins should use BLOSUM matrices with lower numbers and PAM matrices with higher numbers. |
|
|
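The rule of thumb can be written as a small Python helper. The function `suggest_matrices` is hypothetical, and the specific matrices it returns (BLOSUM80/PAM30 and BLOSUM45/PAM250) are assumed example choices for the "high" and "low" ends, not values taken from these notes.

```python
def suggest_matrices(closely_related):
    """Return example (BLOSUM, PAM) matrix names for a comparison.

    The concrete names are illustrative picks, not a fixed recommendation.
    """
    if closely_related:
        # Closely related: high BLOSUM number, low PAM number.
        return ("BLOSUM80", "PAM30")
    # Distantly related: low BLOSUM number, high PAM number.
    return ("BLOSUM45", "PAM250")
```

Calling `suggest_matrices(False)` returns the pair intended for distantly related proteins.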
Term
Steps of the blast algorithm |
|
Definition
1. Preprocessing of query: for each position p in the query, find the list of words whose scores are greater than T when paired with the word starting at p (the word w_p)
2. Generation of hits: for each word list, find all exact matches (hits) with the database sequences
3. Extension of hits: for each hit, extend the alignment in both directions to find alignments whose scores are greater than the threshold S |
|
|
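The three steps can be sketched in Python as a toy, not as the real BLAST implementation. The sketch assumes a DNA alphabet, a simple match/mismatch score in place of a substitution matrix, a single database sequence, and ungapped extension with no drop-off cutoff. All function names and the default values of `w`, `T`, and `S` are illustrative.

```python
from itertools import product

ALPHABET = "ACGT"
MATCH, MISMATCH = 5, -4  # toy scores, assumed for illustration

def score(a, b):
    """Score two equal-length strings position by position."""
    return sum(MATCH if x == y else MISMATCH for x, y in zip(a, b))

def neighborhood(query, w, T):
    """Step 1: for each query position p, list words scoring > T against the word at p."""
    words = ["".join(t) for t in product(ALPHABET, repeat=w)]
    return {p: [wd for wd in words if score(wd, query[p:p + w]) > T]
            for p in range(len(query) - w + 1)}

def find_hits(word_lists, subject, w):
    """Step 2: find exact matches of the listed words in the database sequence."""
    index = {}
    for s in range(len(subject) - w + 1):
        index.setdefault(subject[s:s + w], []).append(s)
    return [(p, s) for p, words in word_lists.items()
            for wd in words for s in index.get(wd, [])]

def best_gain(pairs):
    """Return (gain, length) of the best-scoring prefix of aligned letter pairs."""
    best, best_len, run = 0, 0, 0
    for n, (x, y) in enumerate(pairs, start=1):
        run += MATCH if x == y else MISMATCH
        if run > best:
            best, best_len = run, n
    return best, best_len

def extend(query, subject, p, s, w):
    """Step 3: extend a hit without gaps in both directions.

    Returns (score, query_start, subject_start, length).
    """
    seed = score(query[p:p + w], subject[s:s + w])
    r_gain, r_len = best_gain(zip(query[p + w:], subject[s + w:]))
    l_gain, l_len = best_gain(zip(reversed(query[:p]), reversed(subject[:s])))
    return seed + l_gain + r_gain, p - l_len, s - l_len, w + l_len + r_len

def blast_sketch(query, subject, w=3, T=5, S=20):
    """Run the three steps and keep alignments scoring above S."""
    word_lists = neighborhood(query, w, T)
    hits = find_hits(word_lists, subject, w)
    results = {extend(query, subject, p, s, w) for p, s in hits}
    return sorted(r for r in results if r[0] > S)
```

For example, `blast_sketch("ACGTACGT", "TTACGTACGTTT")` includes the full-length alignment starting at query position 0 and database position 2. Several hits can extend to the same alignment, which is why the results are collected in a set.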