ISSN:
1432-1432
Keywords:
Proteins
;
Compositional Non-randomness in
;
Evolution, Balanced Selective and Stochastic Processes in Molecular
;
Genetic Code, Evolution of
;
Charge Conservation in Proteins
Source:
Springer Online Journal Archives 1860-2000
Topics:
Biology
Notes:
Summary Eight proteins of diverse lengths, functions, and origin, are examined for compositional non-randomness amino acid by amino acid. The proteins investigated are human fibrinopeptide A, guinea pig insulin, rattlesnake cytochromec, MS2 phage coat protein, rabbit triosephosphate isomerase, bovine pancreatic deoxyribonuclease A, bovine glutamate dehydrogenase, andBacillus thermoproteolyticus thermolysin. As a result of this study the experimentally testable hypothesis is put forth that for a large class of proteins the ratio of that fraction of the molecule which exhibits compositional non-randomness to that fraction which does not is on the average, stable about a mean value (estimated as 0.32 ± 0.17) and (nearly) independent of protein length. Stochastic and selective evolutionary forces are viewed as interacting rather than independent phenomena. With respect to amino acid composition, this coupling ameliorates the current controversy over Darwinian vs. non-Darwinian evolution, selectionist vs. neutralist, in favor of neither: Within the context of the quantitative data, the evolution of real proteins is seen as a compromise between the two viewpoints, both important. The compositional fluctuations of the electrically charged amino acids glutamic and aspartic acid, lysine and arginine, are examined in depth for over eighty protein families, both prokaryotic and eukaryotic. For both taxa, each of the acidic amino acids is present in amounts roughly twice that predicted from the genetic code. The presence of an excess of glutamic acid is independent of the presence of an excess of aspartic acid and vice versa. However, about twice as many prokaryotic families show observable deviations from randomness as eukaryotic families, although the magnitude of these deviations is the same for both taxa. The behavior of lysine and arginine is quite different, both qualitatively and quantitatively. For both taxa lysine is present in amounts roughly twice that predicted from the genetic code, but the deficiency of arginine is about twice as great for prokaryotes as for eukaryotes for which the amounts present are only half that expected from the code. Although in prokaryotic families the presence of a lysine excess is not correlated with the presence of an arginine deficit, in eukaryotic families a strong correlation exists. However, as for the acidic amino acids, the magnitude of the lysine excess is uncorrelated with the magnitude of the arginine deficit in both taxa. The greater arginine deficit in prokaryotes is thus due to the fact that fewer prokaryotic families seem to be able to utilize arginine effectively. These rather diverse experimental data with respect to electrical charge in proteins are unified by our finding that the genetic code possesses a hitherto undescribed dynamic property: namely, the transition probabilities for each of the 380 possible amino acid substitutions are quantitatively such as to maintain constant, on the average, under random mutation, the absolute proportions of acidic, basic and neutral amino acids about predictable numerical values which are closely the same independent of the type (minimal 1-, 2- or 3-base) or extent of amino acid substitution. This finding, together with the experimental data, lead to the conclusion that electrostatic stability of the experimentally observed type cannot be maintained in the absence of a certain minimum number of invariant amino acid residues. Theory permits a quantitative estimate of this number to be made for each protein family.
Type of Medium:
Electronic Resource
URL:
http://dx.doi.org/10.1007/BF01732532
Permalink