The size of natural peptide molecules (proteins) can be considered as the number of amino acid residues p (protein length). The aim of the work was to analyze the region… Click to show full abstract
The size of natural peptide molecules (proteins) can be considered as the number of amino acid residues p (protein length). The aim of the work was to analyze the region of existence and occurrence of natural amino acid residue sequences formed as a result of matrix synthesis on the p scale. The object of the study was the Swiss-Prot database consisting of more than 5.6 × 105 primary peptide structures, which were fully determined (complete sequence). Sequences containing non-standard amino acid residues, as well as identical copies of sequences, were removed from them. The remaining 463,450 different sequences with a length of 2–35,213 residues were used for further analysis. It was shown that the protein lengths of different biological domains and kingdoms are characterized by different regions of existence, and the profile shapes of the obtained curves are close to a number of known distributions. At the same time, they have sharp high peaks, indicating the existence of a large number of specific molecules with the same protein length. One of these peaks characterizes more than 1,000 different sequences of mitochondrial cytochrome b molecules at p = 379. Such examples may indicate that the most perfect protein lengths were selected in the evolutionary process to perform this function. As a result, many protein molecules with different sequences of the same length and characterized by the same functions were formed.
               
Click one of the above tabs to view related content.