OFFSET
1,1
COMMENTS
The BLOSUM62 (BLOcks SUbstitution Matrix) matrix was introduced by Steven Henikoff and Jorja G. Henikoff, who scanned the BLOCKS database for highly conserved, ungapped regions of protein families.
Within each block, sequences sharing >= 62% identity were clustered and weighted as a single sequence; the relative frequencies of the amino acids and of their pairwise substitutions were then calculated.
A log-odds score was then calculated for each of the 210 possible pairs of the 20 standard amino acids (20 identities and 190 distinct substitutions).
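The log-odds step can be illustrated with a minimal Python sketch. It assumes the usual formulation in which the observed frequency q_ij of an unordered pair is compared with the frequency expected by chance (p_i*p_j for identical residues, 2*p_i*p_j otherwise), and the base-2 logarithm is scaled by 2 (half-bit units) and rounded. The function names and the frequencies in the usage note are hypothetical and are not taken from the BLOCKS data.

```python
import math

# Number of unordered pairs (including identities) of 20 amino acids.
NUM_PAIRS = 20 * 21 // 2  # 210


def expected_pair_frequency(p_i, p_j, same_residue):
    """Chance frequency of an unordered pair (assumed formulation)."""
    return p_i * p_j if same_residue else 2.0 * p_i * p_j


def log_odds_score(q_ij, e_ij, scale=2.0):
    """Rounded log-odds score; scale=2.0 gives half-bit units."""
    return round(scale * math.log2(q_ij / e_ij))
```

For example, with made-up frequencies, log_odds_score(0.02, expected_pair_frequency(0.1, 0.1, True)) compares an observed 0.02 with an expected 0.01, so the pair is twice as frequent as chance and scores positive; a pair seen less often than chance scores negative.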
The BLOSUM62 matrix is among the best for detecting most weak protein similarities and remains the default substitution matrix for protein searches in BLAST (Basic Local Alignment Search Tool) from the NCBI (National Center for Biotechnology Information).
The software that the original authors used to generate the matrix contained a miscalculation; surprisingly, the miscalculated BLOSUM62 matrix gives better search performance than the corrected version.
LINKS
S. Henikoff and J. G. Henikoff, Amino Acid Substitution Matrices from Protein Blocks, PNAS, (1992), 89 (22): 10915-10919.
EXAMPLE
The symmetric substitution matrix can be written as the following lower triangle, with the 20 amino acids given by their single-letter codes and arranged by biochemical properties; the last row gives the column labels.
A positive score is given to the more likely substitutions, while a negative score is given to the less likely substitutions.
C 9
S -1 4
T -1 1 5
P -3 -1 -1 7
A 0 1 0 -1 4
G -3 0 -2 -2 0 6
N -3 1 0 -2 -2 0 6
D -3 0 -1 -1 -2 -1 1 6
E -4 0 -1 -1 -1 -2 0 2 5
Q -3 0 -1 -1 -1 -2 0 0 2 5
H -3 -1 -2 -2 -2 -2 1 -1 0 0 8
R -3 -1 -1 -2 -1 -2 0 -2 0 1 0 5
K -3 0 -1 -1 -1 -2 0 -1 1 1 -1 2 5
M -1 -1 -1 -2 -1 -3 -2 -3 -2 0 -2 -1 -1 5
I -1 -2 -1 -3 -1 -4 -3 -3 -3 -3 -3 -3 -3 1 4
L -1 -2 -1 -3 -1 -4 -3 -4 -3 -2 -3 -2 -2 2 2 4
V -1 -2 0 -2 0 -3 -3 -3 -2 -2 -3 -3 -2 1 3 1 4
F -2 -2 -2 -4 -2 -3 -3 -3 -3 -3 -1 -3 -3 0 0 0 -1 6
Y -2 -2 -2 -3 -2 -3 -2 -3 -2 -1 2 -2 -2 -1 -1 -1 -1 3 7
W -2 -3 -2 -4 -3 -2 -4 -4 -3 -2 -2 -3 -3 -1 -3 -2 -3 1 2 11
C S T P A G N D E Q H R K M I L V F Y W
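As a minimal Python sketch, the triangle above can be stored row by row and used for score lookups. The names ORDER, ROWS, FLAT, score and ungapped_score are hypothetical helpers introduced for illustration; the numbers are copied from the triangle above.

```python
# Row and column order used by the triangle above.
ORDER = "CSTPAGNDEQHRKMILVFYW"

TRIANGLE = """
9
-1 4
-1 1 5
-3 -1 -1 7
0 1 0 -1 4
-3 0 -2 -2 0 6
-3 1 0 -2 -2 0 6
-3 0 -1 -1 -2 -1 1 6
-4 0 -1 -1 -1 -2 0 2 5
-3 0 -1 -1 -1 -2 0 0 2 5
-3 -1 -2 -2 -2 -2 1 -1 0 0 8
-3 -1 -1 -2 -1 -2 0 -2 0 1 0 5
-3 0 -1 -1 -1 -2 0 -1 1 1 -1 2 5
-1 -1 -1 -2 -1 -3 -2 -3 -2 0 -2 -1 -1 5
-1 -2 -1 -3 -1 -4 -3 -3 -3 -3 -3 -3 -3 1 4
-1 -2 -1 -3 -1 -4 -3 -4 -3 -2 -3 -2 -2 2 2 4
-1 -2 0 -2 0 -3 -3 -3 -2 -2 -3 -3 -2 1 3 1 4
-2 -2 -2 -4 -2 -3 -3 -3 -3 -3 -1 -3 -3 0 0 0 -1 6
-2 -2 -2 -3 -2 -3 -2 -3 -2 -1 2 -2 -2 -1 -1 -1 -1 3 7
-2 -3 -2 -4 -3 -2 -4 -4 -3 -2 -2 -3 -3 -1 -3 -2 -3 1 2 11
"""

# ROWS[i] holds the scores of residue ORDER[i] against ORDER[0..i].
ROWS = [[int(x) for x in line.split()] for line in TRIANGLE.strip().splitlines()]

# The triangle read by rows, i.e., the terms of the sequence.
FLAT = [value for row in ROWS for value in row]


def score(a, b):
    """Look up the score of residues a and b (the matrix is symmetric)."""
    i, j = ORDER.index(a), ORDER.index(b)
    if i < j:
        i, j = j, i
    return ROWS[i][j]


def ungapped_score(s, t):
    """Sum of pairwise scores for two aligned sequences of equal length."""
    if len(s) != len(t):
        raise ValueError("sequences must have equal length")
    return sum(score(a, b) for a, b in zip(s, t))
```

For instance, score("W", "W") returns 11, the last entry of the triangle, and score("C", "S") equals score("S", "C") because only the lower triangle is stored.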
CROSSREFS
KEYWORD
sign,fini,full
AUTHOR
T. Jeffrey Cole, Jun 12 2020
STATUS
approved