Isochore Predictions by IsoFinder | Try
IsoFinder
on the web |
This track shows the partition of chromosome sequences into fairly homogeneous genome regions or isochores (Bernardi et al., 1985; Bernardi, 2000). Isochore prediction was carried out using IsoFinder (Oliver et al., 2004), a new version of the segmentation algorithm described by Oliver et al (2001, 2002). See also Bernaola-Galván et al (1996, 1999) for further theoretical background. In brief, we move a sliding pointer from left to right along the DNA sequence. At each position of the pointer, we compute the mean G+C values to the left and to the right of the pointer. We then determine the position of the pointer for which the difference between left and right mean values (as measured by the t-statistic) reaches its maximum. Next, we determine the statistical significance of this potential cutting point, after filtering out short-scale heterogeneities below 3 kb by applying a coarse-graining technique. Finally, the program checks whether this significance exceeds a probability threshold. If so, the sequence is cut at this point into two subsequences; otherwise, the sequence remains undivided. The procedure continues recursively for each of the two resulting subsequences created by each cut. This leads to the decomposition of a chromosome sequence into long homogeneous genome regions (LHGRs) with well-defined mean G+C contents, each significantly different from the G+C contents of the adjacent LHGRs. Most LHGRs can be identified with Bernardi's isochores, given their correlation with biological features such as gene, SINE and LINE densities, recombination rate or single nucleotide polymorphism variability (e.g. see Oliver et al 2001, 2002).
Since IsoFinder
can only work with finished sequence, the first task was to locate the islands
of Ns in the sequence of each chromosome, then extracting the contigs of finished
sequence.
The GC content was used to determine the level of shading in which the isochore
is displayed in the Genome Browser. The GC content was transformed in an score
in order to enhance the color contrast between different isochores on the screen.
The following expression can be used to get the isochore GC content from the
score given in data files:
Isochore GC = ((0.35 x score) / 1000) + 0.30
For
isochore maps of other complete eukaryotic genomes, please watch our
On-line Resource on
Isochore Mapping
©
Jose L. Oliver, 2004 Universidad de Granada, Spain |