|
|
- |
A |
C |
G |
T |
T |
A |
C |
|
- |
|
|
|
|
|
|
|
|
|
C |
|
|
|
|
|
|
|
|
|
G |
|
|
|
|
|
|
|
|
|
A |
|
|
|
|
|
|
|
|
|
T |
|
|
|
|
|
|
|
|
|
A |
|
|
|
|
|
|
|
|
|
A |
|
|
|
|
|
|
|
|
|
C |
|
|
|
|
|
|
|
|
|
|
- |
A |
C |
G |
T |
T |
A |
C |
|
- |
|
|
|
|
|
|
|
|
|
C |
|
|
|
|
|
|
|
|
|
G |
|
|
|
|
|
|
|
|
|
A |
|
|
|
|
|
|
|
|
|
T |
|
|
|
|
|
|
|
|
|
A |
|
|
|
|
|
|
|
|
|
A |
|
|
|
|
|
|
|
|
|
C |
|
|
|
|
|
|
|
|
p(X,Y)
val(X,Y)=log ----------
p(X) p(Y)
where p(X,Y) is estimated as
number of times X aligned with Y
p(X,Y)= -----------------------------------
total number of aligned pairs
and p(X) and p(Y) can be estimated as
number of times seeing X in any position of a sequence
p(X) = -------------------------------------------------------
total number of positions in all the sequences
number of times seeing Y in any position of a sequence
p(Y) = -------------------------------------------------------
total number of positions in all the sequences
A brief note about how to compute a BLOSUM matrix is available here.
BLOCK1
Seq1 = A B C A
Seq2 = A B B B
Seq3 = A B B C
We will have 3 possible pairwise alignments between these sequences and a total of 3x4=12 alignment pairs.
Compute the BLOSUM matrix for these 3 amino acids.
BLOCK2
Seq1 = A B C A
Seq2 = A B B B
Seq3 = A B B C
Seq4 = A B B B
Compute the BLOSUM matrix for BLOCK2. Show all the computations.
Please turn in a hardcopy of your written answers at the class