[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [modeller_usage] PDB updates for Modeller

To: Bruno Afonso <brunomiguel AT dequim.ist.utl.pt>
Subject: Re: [modeller_usage] PDB updates for Modeller
From: Eswar Narayanan <eashwar AT salilab.org>
Date: Mon, 4 Oct 2004 17:37:20 -0700
Cc: Modeller Usage mailing list <modeller_usage@listsrv.ucsf.edu>


On Oct 3, 2004, at 7:47 AM, Bruno Afonso wrote:

Eswar Narayanan wrote:
Since I know this PDB is probably a good model and it won't come upin seq_search I was wondering how I could manually updateCHAINS_all.seq or create my own sequence database.
The latest release of MODELLER (version 7v7, released last month) hasa new command called SEQFILTER that can be used to cluster PDBsequences. You can use MAKE_CHAINS (also in the latest release) tocollect the PDB chains prior to running SEQFILTER.
There was a mistake in my previous e-mail. :) The PDB sequence ismissing, which is *bad*, not good. I'm sorry to ask this questions,but I'm still puzzled as to how to deal with this:

If you know exactly what your template(s) is(are) going to be, you donot have to use SEQUENCE_SEARCH to "identify" your template. You canuse any of the alignment commands (ALIGN, ALIGN2D etc) to create youralignment and model your sequence based on that alignment.

1) What's the criteria for make chains_all.seq? I ask this becauseclearly not all of PDB is there :) and there are sequences there withresolutions as high as 5.0 angstroms...

One usually wants to use a non-redundant version of PDB to search fortemplates. One way is to first select sequences of all X-ray structuresthat are solved at a resolution better than 3.5A, that are longer than30aa, have no more than 10 non-standard residues, have at least 30standard residues. These can all be specified as options toMAKE_CHAINS. You can then cluster these sequences using SEQFILTER toremove redundancies with a sequence identity threshold (usually set at30% or 95%).

Ben has put these files on the web athttp://salilab.org/modeller/supplemental.html. These are therepresentative sequences derived PDB files at 30% and 95% sequenceidentity. All x-ray and NMR PDB chains, with no limits on resolution,that are at least 30aa long, have more than 30 standard residues andnot more than 10 non-standard residues were use to get these files.This is just the output of SEQFILTER on last weeks' release (09-28-04)of PDB.

2) Can't I make a chains_all.seq alike with MY criteria without makingmy own script? ie, is there a "right way"(TM) to do it?


See the comments above.

3) I can use MAKE_CHAINS and then load the .chn as a database, butthat involves having me first finding the good PDBs that aren't on themodeller's DB, which is kind of misses the whole point. I was usingmodeller to try to find the good ones in the first place.
Thanks for the tip on seqfilter, but my problem was the sequencemissing in the modeller's default database in the first place ;-)

The reviews listed on the modeller web-site(http://salilab.org/modeller/documentation.html) will help youunderstand the process of identifying a useful template for modelling.


---
Eswar Narayanan, Ph.D
Mission Bay Genentech Hall
600 16th Street, Suite N474Q
University of California - San Francisco
San Francisco, CA 94143-2240 (CA 94158 for courier)
Tel +1 (415) 514-4233; Fax +1 (415) 514-4231
http://www.salilab.org/~eashwar

References:
- [modeller_usage] PDB updates for Modeller
  - From: Bruno Afonso <brunomiguel AT dequim.ist.utl.pt>
- Re: [modeller_usage] PDB updates for Modeller
  - From: Eswar Narayanan <eashwar AT salilab.org>
- Re: [modeller_usage] PDB updates for Modeller
  - From: Bruno Afonso <brunomiguel AT dequim.ist.utl.pt>

Prev by Date: [modeller_usage] Markus Jaritz is out of the office.
Next by Date: Re: [modeller_usage] PDB updates for Modeller
Previous by thread: Re: [modeller_usage] PDB updates for Modeller
Next by thread: [modeller_usage] Hard limits for size of sequence database
Index(es):
- Date
- Thread