Assignment 2 (50 points)
This assignment will be dedicated to protein analysis
The goal of this exercise:
* To learn about the basic protein structure prediction tools available on the WWW.
You will take a protein sequence, and predict features related to its primary, secondary
and tertiary structures.
*To get experience with phylogenic programs
* To use some of the important modeling WWW servers
Part-1: Sequence analysis (Sub-cellular localization)
1. From NCBI or SwissProt databases pick proteins that allow you to test one or several of the following purposes:
-Signal peptide (secretory pathway)
-Chloroplast
-Mitochondria
-Peroxisome
-Transmembrane domains
-GPI-anchoring
-Hydrophobicity
2. define potential posttranslational modifications
3. Use the websites from today’s lecture
4. Submit:
*Graphical presentations whenever is possible.
* Discuss and interpret your data and indicate which program you used.
Part-2: Phylogeny analysis.
1. Pick a protein sequence. Use your favorite protein sequence, or pick any random sequence, or use what you used in Assignment 1. Save it as FASTA format.
2. Search Swiss-Prot or NCBI using BLAST search program
3- Pick randomly between 10 and 15 sequences (proteins). Submit the list of these sequences
(not the sequences).
4-Using this web-based program ( ), perform several trees. You can also look for another program:
-Submit three types of trees.
-Answer:
*What the difference between the methods that are used to create the trees?
* Discuss your trees.
Part-3: 3-D fold prediction.
We will be visiting these websites. The easy one to work with is the first (PHYRE), but feel free to try the two other sites.
Protein Homology/analogYRecognition Engine (Phyre2, NEW!)
( for structuremodeling
1. Use 2 protein sequences from NCBI
* Report some details about the proteins
* For further processing, save the protein sequences on your computer as
FASTA format.
2. Create a 3-D model for your protein sequences
* go to Phyre website and submit your sequences.
* You will get by email the link to the results of the modeling.
* Report top template information.
* How many residues out of the sequence have been modeled? Confidence?
* Submit a picture of the model for each protein that you can view via Jmol program.
* What secondary elements can you identify? at what positions? Can you attempt to change the presentation of the molecule?
1