fbpx
Wikipedia

PSIPRED

PSI-blast based secondary structure PREDiction (PSIPRED) is a method used to investigate protein structure. It uses artificial neural network machine learning methods in its algorithm.[2][3][4] It is a server-side program, featuring a website serving as a front-end interface, which can predict a protein's secondary structure (beta sheets, alpha helixes and coils) from the primary sequence.

PSIPRED
Original author(s)
  • David T. Jones
  • Daniel Buchan
  • Tim Nugent
  • Liam McGuffin
  • Federico Minneci
  • Kevin Bryson
Developer(s)University College London, Bioinformatics Group
Initial release1999; 25 years ago (1999)
Stable release
4.02 / 26 September 2018; 5 years ago (2018-09-26)[1]
Written inC
Operating systemWindows 2000, Unix
Platformx86, Java
Size14 MB
Available inEnglish
TypeBioinformatics secondary structure prediction
LicenseProprietary freeware source code
Websitebioinf.cs.ucl.ac.uk/psipred

PSIPRED is available as a web service and as software. The software is distributed as source code, licensed technically as proprietary software. It allows modifying, but enforces freeware provisions by forbidding for-profit distribution of the software and its results.

Secondary structure edit

Secondary structure is the general three-dimensional form of local segments of biopolymers such as proteins and nucleic acids (DNA, RNA). It does not, however, describe specific atomic positions in three-dimensional space, which are considered to be the tertiary structure. Secondary structure can be formally defined by the hydrogen bonds of the biopolymer, as observed in an atomic-resolution structure. In proteins, the secondary structure is defined by the patterns of hydrogen bond between backbone amino and carboxyl groups. Conversely, for nucleic acids, the secondary structure consists of the hydrogen bonding between the nitrogenous bases. The hydrogen bonding patterns may be significantly distorted, which makes automatic determination of secondary structure difficult. Efforts to use computers to predict protein secondary structures, based only on their given primary structure sequences, have been ongoing since the 1970s.[5]

Secondary structure prediction involves a set of methods in bioinformatics that aim to predict the local secondary structures of proteins and RNA sequences based only on knowledge of their primary structureamino acid or nucleotide sequence, respectively. For proteins, a prediction consists of assigning regions of the amino acid sequence as highly probable alpha helixes, beta strands (often noted as extended conformations), or turns. The success of a prediction is determined by comparing it to the results of the DSSP algorithm applied to the crystal structure of the protein; for nucleic acids, it may be determined from the hydrogen bonding pattern. Specialized algorithms have been developed to detect specific well-defined patterns such as transmembrane helixes and coiled coils in proteins, or canonical micro-RNA structures in RNA.

Basic information edit

The idea of this method is to use the information of the evolutionarily related proteins to predict the secondary structure of a new amino acid sequence. PSIBLAST is used to find related sequences and to build a position-specific scoring matrix. This matrix is processed by an artificial neural network,[3][6] which was constructed and trained to predict the secondary structure of the input sequence;[7] in short, it is a machine learning method.[8]

Prediction algorithm (method) edit

The prediction method or algorithm is split into three stages: generating a sequence profile, predicting initial secondary structure, and filtering the predicted structure.[9] PSIPRED works to normalize the sequence profile generated by PSIBLAST.[3] Then, by using neural networking, initial secondary structure is predicted. For each amino acid in the sequence, the neural network is fed with a window of 15 acids. Added information is attached, indicating if the window spans the N or C terminus of the chain. This results in a final input layer of 315 input units, divided into 15 groups of 21 units. The network has one hidden layer of 75 units and 3 output nodes (one for each secondary structure element: helix, sheet, coil).[6]

A second neural network is used to filter the predicted structure of the first network. This network is also fed with a window of 15 positions. The indicator on the possible position of the window at a chain terminus is also forwarded. This results in 60 input units, divided into 15 groups of four. The network has one hidden layer of 60 units and results in three output nodes (one for each secondary structure element: helix, sheet, coil).[9]

The three final output nodes deliver a score for each secondary structure element for the central position of the window. Using the secondary structure with the highest score, PSIPRED generates the protein prediction.[9] The Q3 value is the fraction of residues predicted correctly in the secondary structure states, namely helix, strand, and coil.[9]

See also edit

References edit

  1. ^ "Index of /downloads/psipred". bioinfadmin.cs.ucl.ac.uk. Retrieved 26 April 2021.
  2. ^ Gajendra P. S. Raghava; Harpreet Kaur. "Prediction of beta turn types". Retrieved 5 May 2014.
  3. ^ a b c Yi-Ping Phoebe Chen (18 January 2005). Bioinformatics Technologies. Springer. p. 107. ISBN 978-3-540-20873-0.
  4. ^ Cuff, James A.; Barton, Geoffrey A. (15 August 2000). "Application of multiple sequence alignment profiles to improve protein secondary structure prediction". Proteins. 40 (3): 502–11. doi:10.1002/1097-0134(20000815)40:3<502::aid-prot170>3.0.co;2-q. PMID 10861942.
  5. ^ Heringa, Jaap (2000). "Computational Methods for Protein Secondary Structure Prediction Using Multiple Sequence Alignments". Current Protein & Peptide Science. 1 (3): 273–301(29). CiteSeerX 10.1.1.470.7673. doi:10.2174/1389203003381324. PMID 12369910.
  6. ^ a b S. C. Rastogi; Namitra Mendiratta; Parag Rastogi (22 May 2013). Bioinformatics: Methods and Applications: (Genomics, Proteomics and Drug Discovery). PHI Learning Pvt. Ltd. pp. 302–. ISBN 978-81-203-4785-4.
  7. ^ "PSIPRED | Bioinformatic Technology". 10 April 2014. Retrieved 7 May 2014.
  8. ^ "PSIPRED overview". Retrieved 7 May 2014.
  9. ^ a b c d Jones, David T. (17 September 1999). "Protein Secondary Structure Prediction Based on Position-specific Scoring Matrices" (PDF). Journal of Molecular Biology. 292 (2): 195–202. doi:10.1006/jmbi.1999.3091. PMID 10493868. Retrieved 7 May 2014.

psipred, blast, based, secondary, structure, prediction, method, used, investigate, protein, structure, uses, artificial, neural, network, machine, learning, methods, algorithm, server, side, program, featuring, website, serving, front, interface, which, predi. PSI blast based secondary structure PREDiction PSIPRED is a method used to investigate protein structure It uses artificial neural network machine learning methods in its algorithm 2 3 4 It is a server side program featuring a website serving as a front end interface which can predict a protein s secondary structure beta sheets alpha helixes and coils from the primary sequence PSIPREDOriginal author s David T Jones Daniel Buchan Tim Nugent Liam McGuffin Federico Minneci Kevin BrysonDeveloper s University College London Bioinformatics GroupInitial release1999 25 years ago 1999 Stable release4 02 26 September 2018 5 years ago 2018 09 26 1 Written inCOperating systemWindows 2000 UnixPlatformx86 JavaSize14 MBAvailable inEnglishTypeBioinformatics secondary structure predictionLicenseProprietary freeware source codeWebsitebioinf wbr cs wbr ucl wbr ac wbr uk wbr psipredPSIPRED is available as a web service and as software The software is distributed as source code licensed technically as proprietary software It allows modifying but enforces freeware provisions by forbidding for profit distribution of the software and its results Contents 1 Secondary structure 2 Basic information 3 Prediction algorithm method 4 See also 5 ReferencesSecondary structure editMain articles Protein secondary structure and Biomolecular structure Secondary structure is the general three dimensional form of local segments of biopolymers such as proteins and nucleic acids DNA RNA It does not however describe specific atomic positions in three dimensional space which are considered to be the tertiary structure Secondary structure can be formally defined by the hydrogen bonds of the biopolymer as observed in an atomic resolution structure In proteins the secondary structure is defined by the patterns of hydrogen bond between backbone amino and carboxyl groups Conversely for nucleic acids the secondary structure consists of the hydrogen bonding between the nitrogenous bases The hydrogen bonding patterns may be significantly distorted which makes automatic determination of secondary structure difficult Efforts to use computers to predict protein secondary structures based only on their given primary structure sequences have been ongoing since the 1970s 5 Secondary structure prediction involves a set of methods in bioinformatics that aim to predict the local secondary structures of proteins and RNA sequences based only on knowledge of their primary structure amino acid or nucleotide sequence respectively For proteins a prediction consists of assigning regions of the amino acid sequence as highly probable alpha helixes beta strands often noted as extended conformations or turns The success of a prediction is determined by comparing it to the results of the DSSP algorithm applied to the crystal structure of the protein for nucleic acids it may be determined from the hydrogen bonding pattern Specialized algorithms have been developed to detect specific well defined patterns such as transmembrane helixes and coiled coils in proteins or canonical micro RNA structures in RNA Basic information editThe idea of this method is to use the information of the evolutionarily related proteins to predict the secondary structure of a new amino acid sequence PSIBLAST is used to find related sequences and to build a position specific scoring matrix This matrix is processed by an artificial neural network 3 6 which was constructed and trained to predict the secondary structure of the input sequence 7 in short it is a machine learning method 8 Prediction algorithm method editThe prediction method or algorithm is split into three stages generating a sequence profile predicting initial secondary structure and filtering the predicted structure 9 PSIPRED works to normalize the sequence profile generated by PSIBLAST 3 Then by using neural networking initial secondary structure is predicted For each amino acid in the sequence the neural network is fed with a window of 15 acids Added information is attached indicating if the window spans the N or C terminus of the chain This results in a final input layer of 315 input units divided into 15 groups of 21 units The network has one hidden layer of 75 units and 3 output nodes one for each secondary structure element helix sheet coil 6 A second neural network is used to filter the predicted structure of the first network This network is also fed with a window of 15 positions The indicator on the possible position of the window at a chain terminus is also forwarded This results in 60 input units divided into 15 groups of four The network has one hidden layer of 60 units and results in three output nodes one for each secondary structure element helix sheet coil 9 The three final output nodes deliver a score for each secondary structure element for the central position of the window Using the secondary structure with the highest score PSIPRED generates the protein prediction 9 The Q3 value is the fraction of residues predicted correctly in the secondary structure states namely helix strand and coil 9 See also editJpred Protein design Protein function prediction De novo protein structure prediction Molecular design software List of protein structure prediction software Comparison of software for molecular mechanics modeling Modelling biological systems Protein fragment library Lattice proteins Statistical potentialReferences edit Index of downloads psipred bioinfadmin cs ucl ac uk Retrieved 26 April 2021 Gajendra P S Raghava Harpreet Kaur Prediction of beta turn types Retrieved 5 May 2014 a b c Yi Ping Phoebe Chen 18 January 2005 Bioinformatics Technologies Springer p 107 ISBN 978 3 540 20873 0 Cuff James A Barton Geoffrey A 15 August 2000 Application of multiple sequence alignment profiles to improve protein secondary structure prediction Proteins 40 3 502 11 doi 10 1002 1097 0134 20000815 40 3 lt 502 aid prot170 gt 3 0 co 2 q PMID 10861942 Heringa Jaap 2000 Computational Methods for Protein Secondary Structure Prediction Using Multiple Sequence Alignments Current Protein amp Peptide Science 1 3 273 301 29 CiteSeerX 10 1 1 470 7673 doi 10 2174 1389203003381324 PMID 12369910 a b S C Rastogi Namitra Mendiratta Parag Rastogi 22 May 2013 Bioinformatics Methods and Applications Genomics Proteomics and Drug Discovery PHI Learning Pvt Ltd pp 302 ISBN 978 81 203 4785 4 PSIPRED Bioinformatic Technology 10 April 2014 Retrieved 7 May 2014 PSIPRED overview Retrieved 7 May 2014 a b c d Jones David T 17 September 1999 Protein Secondary Structure Prediction Based on Position specific Scoring Matrices PDF Journal of Molecular Biology 292 2 195 202 doi 10 1006 jmbi 1999 3091 PMID 10493868 Retrieved 7 May 2014 Retrieved from https en wikipedia org w index php title PSIPRED amp oldid 1189505090, wikipedia, wiki, book, books, library,

article

, read, download, free, free download, mp3, video, mp4, 3gp, jpg, jpeg, gif, png, picture, music, song, movie, book, game, games.