A statistical method for predicting alpha-helical and beta-sheet regions in proteins from their amino acidic sequences

Cuomo, V.; Macchiato, M. F.; Tramontano, A.

doi:10.1007/BF02457469

A statistical method for predicting alpha-helical and beta-sheet regions in proteins from their amino acidic sequences

Published: February 1984

Volume 3, pages 421–435, (1984)
Cite this article

Il Nuovo Cimento D

V. Cuomo¹,
M. F. Macchiato² &
A. Tramontano³

25 Accesses
3 Citations
Explore all metrics

Summary

In this paper we propose a new method to predict the secondary structure of proteins from sequence data. A satisfactory improvement of the available efficiency of prediction is obtained. The described method takes into account the frequency of each pair of amino acids in alpha-helical, beta-sheet and random coil regions according to previous results that the sequences of amino acidic residues in these regions are autocorrelated. The rules of the method are not derived from the analysis of the regions of proteins with a known secondary structure, but they are instead based on statistical considerations. In such a way the obtained value of efficiency of the method (88%) has a high reliability: in fact, it is correct to test a method only on the data not used to construct it. A new definition of efficiency of a predictive method is given to resolve the ambiguities arising from the previously accepted definitions.

Riassunto

In questo lavoro si propone un nuovo algoritmo per predire la struttura secondaria di una proteina dall’analisi della sua sequenza aminoacidica, con il quale si è ottenuto un significativo miglioramento delle efficienze di previsione della struttura secondaria disponibili fino ad ora. Il metodo descritto tiene conto della frequenza delle coppie adiacenti di aminoacidi nelle regioni ad alpha-helix, beta sheet e random coil, in accordo con il nostro precedente risultato che in tali regioni le sequenze aminoacidiche sono autocorrelate. Le regole usate non derivano dall’analisi delle regioni di proteine con una struttura secondaria nota, ma sono invece basate esclusivamente su considerazioni statistiche. In tal modo il valore ottenuto per l’efficienza del metodo (88%) ha un’alta affidabilità, essendo corretto controllare un metodo solo sui dati non utilizzati per la sua costruzione. Per risolvere le ambiguità esistenti, inoltre, si dà qui una nuova definizione di efficienza per un metodo di previsione di strutture secondarie.

Резюме

В этой работе предлагается новый метод для предсказания вторичной структуры белков, исходя из последовательности аминокислот. Получается существенное улучшение эффективности предсказания. Предложенный метод учитывает частоту каждой пары аминокислот в альфа-спиральной области, бета-слоистой области и в области случайных спиралей, в соответствии с предыдущими результатами, согласно которым последовательности аминокислот в этих областях являются автокоррелированными. Правила метода не выводятся из анализа областей белков с известной вторичной структурой, а основываются на статистических рассмотрениях. Полученное значение эффективности (88%) имеет высокую надежность. Корректность метода проверялась не только на данных, использованных для его конструирования. Предлагается новое определение эффективности для разрешения неоднозначностей, связанных с ранее принятыми определениями.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Distribution of dipeptides in different protein structural classes: an effort to find new similarities

Article 13 June 2017

A hydrophobic spine stabilizes a surface-exposed α-helix according to analysis of the solvent-accessible surface area

Article Open access 22 December 2016

Amino acid torsion angles enable prediction of protein fold classification

Article Open access 10 December 2020

References

C. B. Anfinsen, E. Haber, M. Sela andF. H. White:Proc. Natl. Acad. Sci. USA,47, 1309 (1961).
Article ADS Google Scholar
K. Nagano:J. Mol. Biol.,75, 401 (1973).
Article Google Scholar
V. I. Lim:J. Mol. Biol.,88, 857 (1974).
Article Google Scholar
V. I. Lim:J. Mol. Biol.,88, 873 (1974).
Article Google Scholar
P. Y. Chou andG. D. Fasman:Biochemistry,13, 211 (1974).
Article Google Scholar
P. Y. Chou andG. D. Fasman:Biochemistry,13, 222 (1974).
Article Google Scholar
P. Y. Chou andG. D. Fasman:Adv. Enzimol.,47, 45 (1978).
Google Scholar
R. J. Garnier, D. J. Osguthorpe andB. J. Robson:J. Mol. Biol.,120, 97 (1978).
Article Google Scholar
M. F. Macchiato andA. Tramontano:Lett. Nuovo Cimento,37, 89 (1983).
Google Scholar
C. E. Nockolds, R. H. Kretsinger, C. J. Coffee andR. A. Bradshaw:Proc. Natl. Acad. Sci. USA,69, 581 (1972).
Article ADS Google Scholar
K. K. Kànnan:Proc. Natl. Acad. Sci. USA,72, 51 (1975).
Article ADS Google Scholar
F. A. Quiocho andW. N. Lipscomb:Adv. Protein Chem.,25, 1 (1971).
Google Scholar
J. J. Birktoft andD. M. Blow:J. Mol. Biol.,68, 187 (1972).
Article Google Scholar
A. Jack, J. Weinziern andA. J. Kalb:J. Mol. Biol.,58, 389 (1971).
Article Google Scholar
F. S. Matthews, P. Argos andM. Levine:Cold Spring Harbor Symp. Quant. Biol.,36, 387 (1971).
Google Scholar
T. Ashida, T. Ueki, A. Tsukihara, T. Takano andM. Kakudo:J. Biochem.,70, 913 (1971).
Google Scholar
D. M. Shotton andH. C. Watson:Philos. Trans. R. Soc. London,257, 111 (1970).
ADS Google Scholar
D. M. Shotton andB. S. Hattley:Nature (London),225, 802 (1970).
Article ADS Google Scholar
E. T. Adman, L. C. Sieker andL. H. Jensen:J. Biol. Chem.,248, 3987 (1973).
Google Scholar
P. Y. Chou andG. D. Fasman:Fed. Eur. Biochem. Soc. Meet. Proc.,128, 13 (1977).
Google Scholar
M. F. Perutz, M. G. Rossman, A. F. Cullis, H. Muirhead, G. Will andA. C. T. North:Nature (London),185, 416 (1960).
Article ADS Google Scholar
M. F. Perutz, H. Muirhead, J. M. Cox andL. C. G. Goaman:Nature (London),219, 131 (1968).
Article ADS Google Scholar
W. E. Love, P. A. Klock, E. E. Lattman, E. A. Padlan, K. B. Ward jr. andW. A. Hendrickson:Cold Spring Harbor Symp. Quant. Biol.,36, 349 (1971).
Google Scholar
T. L. Blundell, J. F. Cutfield, E. J. Dodson, G. G. Dodson, D. C. Hodgkin andD. A. Mercola:Cold Spring Harbor Symp. Quant. Biol.,36, 233 (1971).
Google Scholar
M. J. Adams, G. C. Ford, A. Lilijas andM. G. Rossman:Biochem. Biophys. Res. Commun.,53, 46 (1973).
Article Google Scholar
T. Imoto, L. N. Johnson, A. C. T. North, D. C. Philips andJ. A. Rupley:The Enzymes, edited byP. D. Boyer, 3rd edition, Vol.7 (1972), p. 665.
Article Google Scholar
W. A. Hendrickson andK. B. Ward:Biochem. Biophys. Res. Commun.,66, 1349 (1975).
Article Google Scholar
J. C. Kendrew, R. E. Dickerson, B. E. Strandberg, R. G. Hast, D. R. Davies, D. C. Phillips andV. C. Shore:Nature (London),185, 422 (1960).
Article ADS Google Scholar
F. A. Cotton, C. J. Bier, V. W. Day, E. E. Hazen andS. Larsen:Cold Spring Harbor Symp. Quant. Biol.,36, 243 (1971).
Google Scholar
R. Huber, D. Kukla, A. Ruhlmann, O. Epp andH. Formenek:Naturwissenschaften,57, 389 (1970).
Article ADS Google Scholar
J. Drenth, J. N. Jansonius, R. Koekoek andB. G. Wolthers:Adv. Prot. Chem.,25, 79 (1971).
Article Google Scholar
H. W. Wychoff, D. Tsernoglou, A. W. Hanson, J. R. Knox, B. Lee andF. M. Richards:J. Biol. Chem.,245, 305 (1970).
Google Scholar
K. D. Watenpaugh, L. C. Sicker, J. R. Herriott andL. H. Jensen:Cold Spring. Harbor Symp. Quant. Biol.,36, 359 (1971).
Google Scholar
J. Drenth, W. G. Hol, J. N. Jansonius andR. Koekoek:Cold. Spring Harbor Symp. Quant. Biol.,36, 107, (1971).
Google Scholar
J. S. Richardson, K. A. Thomas andD. C. Richardson:Biochem. Biophys. Commun.,63, 286 (1975).
Article Google Scholar
P. M. Colman, J. N. Jansonius andB. W. Matthews:J. Mol. Biol.,70, 701 (1972).
Article Google Scholar
A. Holmgren, B. O. Sodemberg, H. Eklund andC. I. Braden:Proc. Natl. Acad. Sci. USA,72, 2305 (1975).
Article ADS Google Scholar
D. W. Banner, A. C. Bloomen, G. A. Petsko, D. C. Phillips, C. I. Porgson, J. A. Wilson, P. H. Corran, A. J. Furth, J. D. Milman, R. E. Offord, J. D. Priddle andS. G. Weley:Nature (London),255, 609 (1975).
Article ADS Google Scholar
M. O. Dayhoff andL. T. Hunt:Atlas of Protein Sequence and Structure (New York, N. Y., 1972).
W. Feller:An Introduction to Probability Theory and its Applications, Vol. I and II (New York, N. Y., 1970).
A. Cascino, M. Cipollaro, A. M. Guerrini, G. Mastrocinque, A. Spena, V. Scarlato:Nucleic Acids Research,9, 1499 (1981).
Google Scholar
G. Afeltra, M. Macchiato, C. Moscatelli, A. Tramontano, A. Cascino:Atti Assoc. Genet. Ital.,32, 1 (1981).
Google Scholar
M. Rossman andP. Argos:Annu. Rev. Biochem.,50, 497 (1981).
Article Google Scholar

Download references

Author information

Authors and Affiliations

Istituto di Fisica della Facoltà d’Ingegneria dell’Università, Napoli
V. Cuomo
Istituto di Fisica Sperimentale della Facoltà di Scienze dell’Università, Napoli
M. F. Macchiato
Istituto Internazionale di Genetica e Biofisica, Napoli
A. Tramontano

Authors

V. Cuomo
View author publications
You can also search for this author in PubMed Google Scholar
M. F. Macchiato
View author publications
You can also search for this author in PubMed Google Scholar
A. Tramontano
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

About this article

Cite this article

Cuomo, V., Macchiato, M.F. & Tramontano, A. A statistical method for predicting alpha-helical and beta-sheet regions in proteins from their amino acidic sequences. Il Nuovo Cimento D 3, 421–435 (1984). https://doi.org/10.1007/BF02457469

Download citation

Received: 06 July 1983
Issue Date: February 1984
DOI: https://doi.org/10.1007/BF02457469

PACS. 87.10

General, theoretical and mathematical biophysics (including logic of biophysics, quantum biology and relevant aspects of thermodynamics, information theory, cybernetics and bionics)

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A statistical method for predicting alpha-helical and beta-sheet regions in proteins from their amino acidic sequences

Summary

Riassunto

Резюме

Access this article

Similar content being viewed by others

Distribution of dipeptides in different protein structural classes: an effort to find new similarities

A hydrophobic spine stabilizes a surface-exposed α-helix according to analysis of the solvent-accessible surface area

Amino acid torsion angles enable prediction of protein fold classification

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

PACS. 87.10

Navigation

A statistical method for predicting alpha-helical and beta-sheet regions in proteins from their amino acidic sequences

Summary

Riassunto

Резюме

Access this article

Similar content being viewed by others

Distribution of dipeptides in different protein structural classes: an effort to find new similarities

A hydrophobic spine stabilizes a surface-exposed α-helix according to analysis of the solvent-accessible surface area

Amino acid torsion angles enable prediction of protein fold classification

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Share this article

PACS. 87.10

Search

Navigation