M.M. Dalkilic, PhD
Monday, September 08, 2008
Class V
Indiana University, Bloomington, IN
Sequence Homology
Outline
New Programming Assignment and homework will be posted today
New Reading Posted on Website
Readings: [R] Chap. 5
Most Important Aspect of Bioinformatics: homology search through sequence similarity (cont’d)
Sequence Alignment (Theoretical)
Sequence Alignment (Practical): FASTA and BLAST
Quick detour first
Brief Review of Probability
Review cont’d
We know from the previous slides that P is a measure over a Boolean algebra.
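For reference, one common way to spell out what this means is given below. The symbols \mathcal{B} (the Boolean algebra of events) and \Omega (the sure event) are notation assumed here and may differ from the notation on the earlier slides.

```latex
\begin{align*}
P(A) &\ge 0 \quad \text{for every event } A \in \mathcal{B},\\
P(\Omega) &= 1,\\
P(A \cup B) &= P(A) + P(B) \quad \text{whenever } A \cap B = \emptyset .
\end{align*}
```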
Review cont’d
Odds (or subjective probability) play a significant role in bioinformatics
We either have “odds for (or on)” or “odds against”
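A minimal sketch of how odds relate to probabilities. It assumes the usual convention that odds for an event with probability p are p / (1 - p) and odds against are (1 - p) / p. The function names are illustrative and not from the slides.

```python
from fractions import Fraction


def odds_for(p):
    """Odds for (on) an event with probability p, assuming odds = p / (1 - p)."""
    p = Fraction(p)
    if not 0 <= p < 1:
        raise ValueError("p must satisfy 0 <= p < 1")
    return p / (1 - p)


def odds_against(p):
    """Odds against an event with probability p, assuming odds = (1 - p) / p."""
    p = Fraction(p)
    if not 0 < p <= 1:
        raise ValueError("p must satisfy 0 < p <= 1")
    return (1 - p) / p


def probability_from_odds_for(odds):
    """Invert odds_for: recover p from odds = p / (1 - p)."""
    odds = Fraction(odds)
    if odds < 0:
        raise ValueError("odds must be non-negative")
    return odds / (1 + odds)
```

For example, an event with probability 1/4 has odds for of 1/3 (1 to 3) and odds against of 3 (3 to 1).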
Review cont’d
Increasing information about an event can lead to a change in probability, since for some (Bayesians) a probability is a degree of belief.
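As a sketch of how new information changes a probability, the update can be written with Bayes' rule. The function name, the scenario, and all numbers below are made up for illustration and do not come from the slides.

```python
from fractions import Fraction


def posterior(prior, p_evidence_given_h, p_evidence_given_not_h):
    """Bayes' rule: P(H | E) = P(E | H) P(H) / P(E).

    P(E) is expanded over H and not-H (law of total probability).
    """
    prior = Fraction(prior)
    numerator = Fraction(p_evidence_given_h) * prior
    evidence = numerator + Fraction(p_evidence_given_not_h) * (1 - prior)
    if evidence == 0:
        raise ValueError("evidence has probability zero under these inputs")
    return numerator / evidence


# Hypothetical numbers: H = "two sequences are homologous",
# E = "their alignment score is high".
belief_before = Fraction(1, 100)
belief_after = posterior(belief_before, Fraction(9, 10), Fraction(1, 20))
```

With these assumed inputs the degree of belief rises from 1/100 to 2/13 (about 0.15) after the evidence is observed.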
FASTA and BLAST: Dot Plots
Simplest means of comparing two sequences
Visualization is easy to understand; can be the basis for explaining both FASTA and BLAST
A dot plot is simply a rectangular grid whose leftmost column and bottom row are sequences. The box for cell (i, j) is checked if the symbol i units from the bottom (counting bottom to top) in the column sequence matches the symbol j units from the left (counting left to right) in the row sequence.
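The definition can be sketched in a few lines of Python. This is a minimal illustration, not a tool from the course: the function names and the example sequences are assumptions, and indices are 0-based here, whereas the definition above counts units from the bottom and from the left.

```python
def dot_plot(column_seq, row_seq):
    """Return grid[i][j] == True when column_seq[i] matches row_seq[j].

    i indexes the column sequence (bottom to top when drawn),
    j indexes the row sequence (left to right).
    """
    return [[a == b for b in row_seq] for a in column_seq]


def render(column_seq, row_seq):
    """Draw the grid as text with the column sequence on the left
    and the row sequence along the bottom."""
    grid = dot_plot(column_seq, row_seq)
    lines = []
    # Emit the highest i first so that i increases from bottom to top.
    for i in range(len(column_seq) - 1, -1, -1):
        cells = "".join("x" if hit else "." for hit in grid[i])
        lines.append(f"{column_seq[i]} {cells}")
    lines.append("  " + row_seq)
    return "\n".join(lines)


plot_text = render("GAT", "GATTA")
```

Printing plot_text shows the shared run G, A, T as a diagonal of x marks rising from the lower-left corner, which is the pattern to look for when reading a dot plot.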
Scoring Matrix
Noise or possible motifs separated by gaps
Dot Plots
Learn to read plots

