If this is your first visit, be sure to
check out the FAQ by clicking the
link above. You may have to register
before you can post: click the register link above to proceed. To start viewing messages,
select the forum that you want to visit from the selection below.

Correlizer

Description: Genomes are fantastic keepers of genetic information and are the outcome of evolutionary replication, mutation and selection. Genomes organize functions from the cellular level, via the organismic level, up to the complex basis of mind. In human cells the genetic information controlling most processes from the cellular level, over embryogenesis to cognitive ability, manifests in a diploid set of 23 DNA molecules (chromosomes), combined they consist of ~3x10^9 base pairs (bp) stored in ~2.80 GB of data. This whole genome, whose added molecular length totals ~2m, is kept in comparably small cell nuclei with typical diameters of ~10 µm or volumes of 500 µm^3. The sequential organization of genomes, i.e. the relations between distant base pairs and regions within sequences, and its connection to the three-dimensional architectural organization of genomes is still a largely unresolved problem.

http://svahesrv2.bioquant.uni-heidel...cleus-ACGT.jpg
Correlizer has been set up to unravel these mysteries, and we found long-range power-law correlations on almost the entire observable scale of 132 completely sequenced chromosomes of 0.5 x 10^6 to 3.0 x 10^7 bp. Varying from Archaea, Bacteria, Arabidopsis thaliana, Saccharomyces cerevisiae, Schizosaccharomyces pombe, Drosophila melanogaster, and Homo sapiens. The local correlation coefficients show a species-specific multi-scaling behavior: close to random correlations on the scale of a few base pairs, a first maximum from 40 to 3,400 bp (for Arabidopsis thaliana and Drosophila melanogaster divided in two submaxima), and often a region of one or more second maxima from 10^5 to 3 x 10^5 bp. Within this multi-scaling behavior, an additional fine-structure is present and attributable to codon usage in all except the human sequences, where it is related to nucleosomal binding.