TY - JOUR
T1 - The exceptional genomic word symmetry along DNA sequences
AU - Afreixo, Vera
AU - Rodrigues, João M. O. S.
AU - Bastos, Carlos A. C.
AU - Silva, Raquel M.
N1 - Funding Information:
This work was supported by Portuguese funds through the iBiMED - Institute of Biomedicine, IEETA - Institute of Electronics and Telematics Engineering of Aveiro and the Portuguese Foundation for Science and Technology (“FCT–Fundação para a Ciência e a Tecnologia”), within projects: COMPETE/FEDER UID/BIM/04501/2013 and PEst-OE/EEI/UI0127/2014.
Publisher Copyright:
© 2016 Afreixo et al.
PY - 2016/2/3
Y1 - 2016/2/3
N2 - Background: The second Chargaff's parity rule and its extensions are recognized as universal phenomena in DNA sequences. However, parity of the frequencies of reverse complementary oligonucleotides could be a mere consequence of the single nucleotide parity rule, if nucleotide independence is assumed. Exceptional symmetry (symmetry beyond that expected under an independent nucleotide assumption) was proposed previously as a meaningful measure of the extension of the second parity rule to oligonucleotides. The global exceptional symmetry was detected in long and short genomes. Results: To explore the exceptional genomic word symmetry along the genome sequences, we propose a sliding window method to extract the values of exceptional symmetry (for all words or by word groups). We compare the exceptional symmetry effect size distribution in all human chromosomes against control scenarios (positive and negative controls), testing the differences and performing a residual analysis. We explore local exceptional symmetry in equivalent composition word groups, and find that the behaviour of the local exceptional symmetry depends on the word group. Conclusions: We conclude that the exceptional symmetry is a local phenomenon in genome sequences, with distinct characteristics along the sequence of each chromosome. The local exceptional symmetry along the genomic sequences shows outlying segments, and those segments have high biological annotation density.
AB - Background: The second Chargaff's parity rule and its extensions are recognized as universal phenomena in DNA sequences. However, parity of the frequencies of reverse complementary oligonucleotides could be a mere consequence of the single nucleotide parity rule, if nucleotide independence is assumed. Exceptional symmetry (symmetry beyond that expected under an independent nucleotide assumption) was proposed previously as a meaningful measure of the extension of the second parity rule to oligonucleotides. The global exceptional symmetry was detected in long and short genomes. Results: To explore the exceptional genomic word symmetry along the genome sequences, we propose a sliding window method to extract the values of exceptional symmetry (for all words or by word groups). We compare the exceptional symmetry effect size distribution in all human chromosomes against control scenarios (positive and negative controls), testing the differences and performing a residual analysis. We explore local exceptional symmetry in equivalent composition word groups, and find that the behaviour of the local exceptional symmetry depends on the word group. Conclusions: We conclude that the exceptional symmetry is a local phenomenon in genome sequences, with distinct characteristics along the sequence of each chromosome. The local exceptional symmetry along the genomic sequences shows outlying segments, and those segments have high biological annotation density.
KW - Chargaff's second parity rule
KW - Exceptional symmetry
KW - Genome
KW - Window analysis
UR - http://www.scopus.com/inward/record.url?scp=84957009081&partnerID=8YFLogxK
U2 - 10.1186/s12859-016-0905-0
DO - 10.1186/s12859-016-0905-0
M3 - Article
C2 - 26842742
AN - SCOPUS:84957009081
SN - 1471-2105
VL - 17
JO - BMC Bioinformatics
JF - BMC Bioinformatics
IS - 1
M1 - 59
ER -