Variations of the mononucleotide and short oligonucleotide distributions in the genomes of various organisms
Language English Country England, Great Britain Media print
Document type Journal Article, Research Support, Non-U.S. Gov't
PubMed
10556022
DOI
10.1006/jtbi.1999.1019
PII: S0022-5193(99)91019-6
Knihovny.cz E-resources
- MeSH
- Arabidopsis genetics MeSH
- Electronic Data Processing MeSH
- Drosophila melanogaster genetics MeSH
- Escherichia coli genetics MeSH
- Genome, Bacterial MeSH
- Genome, Fungal MeSH
- Genome, Human MeSH
- Genome, Plant MeSH
- Genome * MeSH
- Humans MeSH
- Evolution, Molecular * MeSH
- Mice MeSH
- Oligonucleotides genetics MeSH
- Saccharomyces cerevisiae genetics MeSH
- Sequence Analysis, DNA * MeSH
- Animals MeSH
- Check Tag
- Humans MeSH
- Mice MeSH
- Animals MeSH
- Publication type
- Journal Article MeSH
- Research Support, Non-U.S. Gov't MeSH
- Names of Substances
- Oligonucleotides MeSH
We calculated the variation coefficients of the mononucleotide and short oligonucleotide distributions in over 1700 long genomic sequences originating from six organisms to demonstrate that the human and Escherichia coli genomic sequences were the least and the most uniform, respectively. The most non-random genomic distributions were exhibited by the four canonical nucleotides, followed by the strong and weak nucleotides, while the distributions of purine or pyrimidine nucleotides and especially the distributions of (A+C) and (G+T) were significantly more uniform even in the human genome. In the human and mouse genomes, the highest coefficients of variation were further observed with the oligonucleotides where CG was combined with the strong nucleotides while its combination with the weak nucleotides significantly decreased the variation which, however, was still very high. High variation was also exhibited by the remaining oligonucleotides composed exclusively of the strong nucleotides or those containing only weak nucleotides. On the other hand, the distributions of oligonucleotides containing similar and especially the same numbers of the strong and weak nucleotides, but no CG or TA dinucleotide, were the most uniform. The information following from the present analysis will be useful not only in the identification of important genomic regions but also in computer simulations of the genomic nucleotide sequences in order to trace and reproduce the pathways of genome evolution.
References provided by Crossref.org
Mosaic structure of the DNA molecules of the human chromosomes 21 and 22