Input protein sequences
below: The format of this site should be fasta format.
The first line is any text that starts with ">" and the analysis only works for fasta header with "|" character and labels.
Starting from the second line is the sequence itself, allowing only the use of established amino acid encoding symbols.
You can paste the sequence into the following table(or use the example ) or Upload data file
Welcome to RFAmy server
This paper uses 188-D feature representation with the best performance
according to the experimental results. The method entails the following
main steps. First, the original protein sequence was generated from the Uniprot and AmyPro datasets and
then subjected to a de-redundant operation to generate the final protein sequence data called Amy.
The second step is feature extraction of protein sequences.
The third step is to use RF to classify protein sequences. Figure 1 shows the experimentalprocess in this paper.
Figure 1. Overview of the paper framework for a Amyloid classifier.
This article uses a self-built data set named Amy. The source databases for protein
sequences are the Universal Protein (UniProt, http://www.uniprot.org/)and Amyloid (AmyPro, http://www.amypro.net/)databases.
It consists of 165 amyloid proteins(positive samples) and 382 non-amyloid(negative samples).
1. The dataset Amy with FASTA format can be download (positive)and (negative) 。
2. The feature set corresponding to the dataset Amy can be downloaded(Here).
The site is for the identification of protein sequences to determine whether it is amyloid protein.
The format of this site should be fasta format.The first line is any text that starts with ">" and the analysis only
works for fasta header with "|" character and labels. Starting from the second line is the sequence itself, allowing
only the use of established amino acid encoding symbols.