Analysis

The analysis interface offers two basic ways to select proteins for analysis. 1) A user enter select a valid PDB ID. If the this entry has already been analyzed, the results will be presented, otherwise the server will automatically retrieve the needed sequence data from the PDB and begin processing it. 2) A user may submit custom data in FASTA format. The user will also be required to enter a 4-letter alphanumeric code to identify the submission and will then be provided with a link to the output for the sequence.

Figure 1 is a snapshot from the page and shows the options available to the user. The first part of the form allows the user to select the source of data to use while the second part requires the user to enter an email address so they can be notified should analysis have needed to be performed.

Figure 1: MIR analysis form

  • "PDB ID(s)": Here the user may enter Protein Data Bank (PDB) [1] IDs for analysis. Up to five PDB IDs may be entered here, provided that they are separated by commas. The follow page will either provides links to the existing analysis or provide links that will be populated once automatic analysis has completed. Note that proteins longer than 500 residues cannot be run automatically. If you are interested in running longer proteins, please contact us.
  • "Retrieval Code": Initially, the user must provide a retrieval that will be used to track the protein in our system. This must be a four letter alphanumeric string that is not a valid PDB ID. Hence, the user must remember the code they have selected in order to retrieve its data or save the link provided on the following page.
  • "FASTA file": The user must browse and select a text FASTA file from their local computer.  The FASTA file must contain exactly one sequence and contain the protein in the 1-letter AA format. 
  • "Email:" Optionally the user may enter an email to receive a notification in the case that analysis results are not already available. The address will not be used for any other process.

References

  1. Berman, H.M., Westbrook, J., Feng, Z., Gilliland, G., Bhat, T.N., Weissig, H., Shindyalov, I.N. and Bourne, P.E. (2000) The Protein Data Bank. Nucleic Acids Res., 28, 235-242.
  2. Papandreou, N., Berezovsky, I.N., Lopes, A., Eliopoulos, E. and Chomilier, J. (2004) Universal positions in globular proteins. Eur. J. Biochem., 271, 4762-4768.
  3. Chomilier, J., Lamarine, M., Mornon, J.-P., Torres, J.H., Eliopoulos, E. and Papandreou, N. (2004) Analysis of fragments induced by simulated lattice protein folding. C R Biol., 327, 431-443.
  4. Bostock, M., Ogievetsky, V., and Heer, J. (2011) D3: Data-Drive Documents. IEEE Trans. Visualization & Comp. Graphics (Proc. InfoVis) .