Authors:
Daniyah A. Aloqalaa
1
;
Dariusz R. Kowalski
1
;
2
;
Paweł Błażej
3
;
Małgorzata Wnetrzak
3
;
Dorota Mackiewicz
3
and
Paweł Mackiewicz
3
Affiliations:
1
Department of Computer Science University of Liverpool, U.K.
;
2
SWPS University of Social Sciences and Humanities, Warsaw, Poland
;
3
Faculty of Biotechnology, University of Wrocław, Poland
Keyword(s):
Code Degeneracy, Graph Theory, Mutation, Set Conductance, Standard Genetic Code, Transition, Transversion.
Abstract:
The standard genetic code (SGC) is a system of rules ascribing 20 amino acids and stop translation signal to 64 codons, i.e triplets of nucleotides. It was proposed that the structure of the SGC evolved to minimize harmful consequences of mutations and translational errors. To study this problem, we described the SGC structure by a graph, in which codons are vertices and edges correspond to single nucleotide mutations occurring between the codons. We also introduced weights (W) for mutation types to distinguish transversions from transitions. Using this representation, the SGC is a partition of the set of vertices into 21 disjoint subsets. In this case, the question about the potential robustness of the genetic code to the mutations can be reformulated into the optimal graph clustering task. To investigate this problem, we applied an appropriate clustering algorithm, which searched for the codes characterized by the minimum average calculated from the set W-conductance of codon group
s. Our algorithm found three best codes for various ranges of the applied weights. The average W-conductance of the SGC was the most similar to that of the best codes in the range of weights corresponding to the observed transversion/transition ratio in natural mutational pressures. However, it should be noted that the optimization of the SGC was not as perfect as the best codes. It implies that the evolution of the SGC was driven not only by the selection for the robustness against mutations or mistranslations but also other factors, e.g. subsequent addition of amino acids to the code according to the expansion of amino acid metabolic pathways.
(More)