CHAPS Help

Step 1: enter a set of accessions (one per line), or a set of fasta-formatted sequences; make sure you specify what you've entered in the Format: field.

Step 2: Click "Run ClustalW now"; the page should come back after a few seconds with a multiple alignment in the input box. You could have skipped Step 1 if you simply pasted in an already created multiple alignment, in "pseudo-ClustalW" format (i.e. just the alignment, no headers or footers).

Step 3: To generate the corresponding position-specific scoring matrix (PSSM or "profile"), click the "Generate PSSM now"; after a few seconds the PSSM field should be filled with the profile scoring matrix, followed by a second matrix indicating relative amino acid usage at each position, followed by two columns that provide the information content of the position and the fractional weight of the alignments, compared to pseudocounts.

Step 4: To use this profile in a PSI-BLAST seach, click the "Send to PSI-BLAST" button.

Optional step: To save either the multiple alignment, PSSM, or HMM to your local machine, press:
Download Multiple Alignment,
Download PSSM (ASN.1),
Download HMM.

These files can then be used for seaches with PSI-BLAST or HMM Search.


What are the numbers in the PSSM window?

The PSSM profile box is the output of the blastpgp -Q option. The left most 20 numbers are the scores that would be used in a similarity search. The next 20 numbers are the frequencies (percents) of each amino acid at that position. The last two floating point numbers are the information content at the position, and the relative weight of the multiple alignment data and the pseudocounts. If the last number is low, then BLOSUM62 is contributing most of the score information.