Skip to content

Commit

Permalink
update
Browse files Browse the repository at this point in the history
  • Loading branch information
Yusuke Komiyama committed Sep 5, 2016
1 parent c3bebec commit 522c1e0
Showing 1 changed file with 10 additions and 0 deletions.
10 changes: 10 additions & 0 deletions README.md
Original file line number Diff line number Diff line change
@@ -1,10 +1,12 @@
Sugar-Binding Residue Predictor (SBRP)
=============

by Masaki Banno at BILAB

Sugar-Binding Residue Predictor (SBRP) is sugar binding prediction tool using support vector machine. SBRP can predict sugar-binding residue from amino acid sequence information. Protein Structure information is not necessary.

# Installation

SBRP requires python2.7, libsvm, Legacy Blast and nr database.

You can use following links install and download these programs and libraries and database.
Expand All @@ -14,6 +16,7 @@ You can use following links install and download these programs and libraries an
- Python2.7x

# Usage

SBRP contains the binding residue predictor for acidic sugar and the binding-residue predictor for nonacidic sugar. Acidic sugar has phosphate group or carboxyl group or sulfate group. NonAcidic Sugar is except for acidic sugar. So, You can get prediction what residue in the protein sequence is belonged to acidic sugar binding residue or nonacidic binding residue.

If you download above programs, you can run SBRP like below command.
Expand All @@ -24,6 +27,7 @@ The output example is below.


---

148L:E|PDBID|CHAIN|SEQUENCE 8 -1.67762885786 1.70627963313 False False
148L:E|PDBID|CHAIN|SEQUENCE 9 -0.49179158867 1.40920844499 False False
148L:E|PDBID|CHAIN|SEQUENCE 10 1.65258659258 0.973523358096 True False
Expand All @@ -32,6 +36,7 @@ The output example is below.
148L:E|PDBID|CHAIN|SEQUENCE 13 0.790677700098 0.749706572303 True False
148L:E|PDBID|CHAIN|SEQUENCE 14 -1.51129664951 1.21914159325 False False
148L:E|PDBID|CHAIN|SEQUENCE 15 -0.426752423351 0.890993747856 False False

---


Expand All @@ -42,17 +47,22 @@ The output example is below.


# Method

## Dataset

Training dataset is collected from PDB. Sugar binding residue is defined as residues within 4 Å from sugar. The proteins are collected from PDB if the protein has sugar-binding residue.

After that, these datasets are refined under the following condition.
The PDB datum which is using the additive for the experiment of sugar.
Residue within 1.5 Å from sugar. (This is not interactions but covalent bond.)
Finally, These proteins are removed redundancy using blastclust.


## Feature Vector

SBRP is using the Position Specific Scoring Matrix (PSSM) calculated by PSI-Blast for a feature vector of each residue.

## Training method

Support Vector Machine is needed positive dataset and negative dataset. The positive dataset is sugar-binding residue. The negative dataset is within 25 Å without 5 Å from sugar binding residue.

0 comments on commit 522c1e0

Please sign in to comment.