Library

feed icon rss

Your email was sent successfully. Check your inbox.

An error occurred while sending the email. Please try again.

Proceed reservation?

Export
  • 1
    Electronic Resource
    Electronic Resource
    New York, NY : Wiley-Blackwell
    Proteins: Structure, Function, and Genetics 33 (1998), S. 88-96 
    ISSN: 0887-3585
    Keywords: fold classification ; substructures ; Dali ; protein families ; structural similarity ; Chemistry ; Biochemistry and Biotechnology
    Source: Wiley InterScience Backfile Collection 1832-2000
    Topics: Medicine
    Notes: The rapid growth in the number of experimentally determined three-dimensional protein structures has sharpened the need for comprehensive and up-to-date surveys of known structures. Classic work on protein structure classification has made it clear that a structural survey is best carried out at the level of domains, i.e., substructures that recur in evolution as functional units in different protein contexts. We present a method for automated domain identification from protein structure atomic coordinates based on quantitative measures of compactness and, as the new element, recurrence. Compactness criteria are used to recursively divide a protein into a series of successively smaller and smaller substructures. Recurrence criteria are used to select an optimal size level of these substructures, so that many of the chosen substructures are common to different proteins at a high level of statistical significance. The joint application of these criteria automatically yields consistent domain definitions between remote homologs, a result difficult to achieve using compactness criteria alone. The method is applied to a representative set of 1,137 sequence-unique protein families covering 6,500 known structures. Clustering of the resulting set of domains (substructures) yields 594 distinct fold classes (types of substructures). The Dali Domain Dictionary (http://www.embl-ebi.ac.uk/dali) not only provides a global structural classification, but also a comprehensive description of families of protein sequences grouped around representative proteins of known structure. The classification will be continuously updated and can serve as a basis for improving our understanding of protein evolution and function and for evolving optimal strategies to complete the map of all natural protein structures. Proteins 33:88-96, 1998. © 1998 Wiley-Liss, Inc.
    Additional Material: 7 Ill.
    Type of Medium: Electronic Resource
    Library Location Call Number Volume/Issue/Year Availability
    BibTip Others were also interested in ...
  • 2
    ISSN: 0749-503X
    Keywords: Saccharomyces cerevisiae ; short ORFs ; computational ORF verification ; ORF properties ; sequence similarity ; Life and Medical Sciences ; Genetics
    Source: Wiley InterScience Backfile Collection 1832-2000
    Topics: Biology
    Notes: We have analysed short open reading frames (between 150 and 300 base pairs long) of the yeast genome (Saccharomyces cerevisiae) with a two-step strategy. The first step selects a candidate set of open reading frames from the DNA sequence based on statistical evaluation of DNA and protein sequence properties. The second step filters the candidate set by selecting open reading frames with high similarity to other known sequences (from any organism). As a result, we report ten new predicted proteins not present in the current sequence databases. These include a new alcohol dehydrogenase, a protein probably related to the cell cycle, as well as a homolog of the prokaryotic ribosomal protein L36 likely to be a mitochondrial ribosomal protein coded in the nuclear genome. We conclude that the analysis of short open reading frames leads to biologically interesting discoveries, even though the quantitative yield of new proteins is relatively low. © 1997 John Wiley & Sons, Ltd.
    Additional Material: 5 Ill.
    Type of Medium: Electronic Resource
    Library Location Call Number Volume/Issue/Year Availability
    BibTip Others were also interested in ...
  • 3
    ISSN: 0749-503X
    Keywords: Saccharomyces cerevisiae ; chromosome IV ; SNQ2 ; SES1 ; GCV1 ; RPL2B ; HEX2/SRN1 ; RPS18A ; tRNA-Val12a ; Life Sciences ; Life Sciences (general)
    Source: Wiley InterScience Backfile Collection 1832-2000
    Topics: Biology
    Notes: The complete DNA sequence of cosmid clone 31A5 containing a 35 452 bp segment from the left arm of chromosome IV from Saccharomyces cerevisiae, was determined from an ordered set of subclones in combination with primer walking on the cosmid. The sequence contains 23 open reading frames (ORFs) of more than 100 amino acid residues and the tRNA-Val2a gene. Five ORFs corresponded to the known yeast genes SNQ2, SES1, GCV1, RPL2B and RPS18A. The DNA sequence for RPS18A is interrupted by an intron. One ORF corresponded to a part of the yeast gene HEX2 at the end of the cosmid insert. Four ORFs encoded putative proteins which showed strong homologies to other previously known proteins, three of yeast origin and one of non-yeast origin. Two ORFs were classified as having borderline homologies: one had similarity to two protein families and another to two protein products of unknown function from other species. The remaining 11 ORFs bore no significant similarity to any published protein. The complete DNA sequence has been submitted to the EMBL data library, Accession Number X95966.
    Additional Material: 2 Ill.
    Type of Medium: Electronic Resource
    Library Location Call Number Volume/Issue/Year Availability
    BibTip Others were also interested in ...
Close ⊗
This website uses cookies and the analysis tool Matomo. More information can be found here...