Subject: [modeller_usage] Building multi-chain models
From:
Date: Thu, 06 Jun 2024 11:56:31 +0000
Good day,
I would like to build a GABA A receptor with (two alpha2 subunits chain A, D), (two beta2 subunits chain B, E) and (one gamma subunit chain C).
I create an .ali file with template (6HUG) and with target sequences.
I have received the error: 'There should be 10 fields separated by colons, : This line actually contains 15 fields.'
I don't understand how I should correctly change it, if I have 5 chains. Could you please advise, how I should correctly write this 10 field line. I have already read the manual https://salilab.org/modeller/manual/node501.html#alignmentformat, but there is information about 2 chains. I still can not understand how correctly I should write it for 5 chains.
The alignment for my template is look like:
>P1;6HUG
structureX:6HUG:FIRST:A 437:A 473:B 495:C 437:D 473:E:::3.5:-1.00
------------------------------DYKDDD----DKQPSLQDEL---------K
DNTTVFTRILDRLLDGYDNRL---------------RPG----LGERVTEVKTD-IFVTS
FGPVSDHDMEYTIDVFFRQSWKDERLKFKGPMTVLRLNNLMASKIWTPDTFFHNGKKSVA
HNMTMPNKLLRITEDGTLLYTMRLTV----RAECPMHLEDFPMDAHACPLKFGSYAYTRA
EVVYEWTREPARSVVVAEDGSRLNQYDLLGQTVDSGIVQSSTGEYVVMTTHFHLKRKIGY
FVIQTYLPC---------------IMTVILSQVSFWLNRE-SVPARTV-FGVTTVLTMTT
LSISA----RNSLPKVAYATAMDWFIAVCYAFVFSALIEFATVNYFTKRGYAWDGKSVVP
EKPKKVKDPLIKKNN---TY--------------APTA--------TSYT----------
-----------------------------PNLARGD------------------------
---------------PGLATIAKSATIEPKEVK----------------------PETKP
---------PEPKKTFNSVSKIDRLSRIAFPLLFGIFNLVYWAT-----YLNREPQLKAP
TPHQ----------/
-----MCSGLLEL-------------LLPIWLSWTLGTRGSEPRSV-----------NDP
GNMSFVKETVDKLLKGYDIRL---------------RPD----FGGPPVCVGMN-IDIAS
IDMVSEVNMDYTLTMYFQQYWRDKRLAYSGIPLNLTLDNRVADQLWVPDTYFLNDKKSFV
HGVTVKNRMIRLHPDGTVLYGLRITT----TAACMMDLRRYPLDEQNCTLEIESYGYTTD
DIEFYWRGGDKAV--TGVERIELPQFSIVEHRLVSRNVVFATGAYPRLSLSFRLKRNIGY
FILQTYMPS---------------ILITILSWVSFWINYD-ASAARVA-LGITTVLTMTT
INTHL----RETLPKIPYVKAIDMYLMGCFVFVFLALLEYAFVNYIFFGRGPQRQKKLAE
KTAK-AKNDRSKSES--------------------------------------NRVDAHG
NILLTSL--E-VHNE---------MNE--VSGGIGD------------------------
---------------TRNSAISFDNSGI-QYRK-----QSMPREGHGRFLGDRSLPHKKT
HLRRRSSQLKIKIPDLTDVNAIDRWSRIVFPFTFSLFNLVYWLY-----YVN--------
--------------/
MSSPNIWSTGSSVYSTPVFSQKMTVWILLLLSLYPGFTSQKSDDDYEDYASNKTWVLTPK
VPEGDVTVILNNLLEGYDNKL---------------RPD----IGVKPTLIHTD-MYVNS
IGPVNAINMEYTIDIFFAQTWYDRRLKFNSTIKVLRLNSNMVGKIWIPDTFFRNSKKADA
HWITTPNRMLRIWNDGRVLYTLRLTI----DAECQLQLHNFPMDEHSCPLEFSSYGYPRE
EIVYQWKRSSVEV--GDTRSWRLYQFSFVGLRNTTEVVKTTSGDYVVMSVYFDLSRRMGY
FTIQTYIPC---------------TLIVVLSWVSFWINKD-AVPARTS-LGITTVLTMTT
LSTIA----RKSLPKVSYVTAMDLFVSVCFIFVFSALVEYGTLHYFVSNRKPS------K
DKDKKKKNPLLRMFS---FK--------------APTI--------D-I-----------
-----------------------------------R------------------------
---------------PRSATIQMNNATHLQERDEEYGYECLDGKDCASFFCCFEDCRTGA
---------WRHGRIHIRIAKMDSYARIFFPTAFCLFNLVYWVS-----YLYLGGSGGSG
GSGKTETSQVAPA-/
------------------------------DYKDDD----DKQPSLQDEL---------K
DNTTVFTRILDRLLDGYDNRL---------------RPG----LGERVTEVKTD-IFVTS
FGPVSDHDMEYTIDVFFRQSWKDERLKFKGPMTVLRLNNLMASKIWTPDTFFHNGKKSVA
HNMTMPNKLLRITEDGTLLYTMRLTV----RAECPMHLEDFPMDAHACPLKFGSYAYTRA
EVVYEWTREPARSVVVAEDGSRLNQYDLLGQTVDSGIVQSSTGEYVVMTTHFHLKRKIGY
FVIQTYLPC---------------IMTVILSQVSFWLNRE-SVPARTV-FGVTTVLTMTT
LSISA----RNSLPKVAYATAMDWFIAVCYAFVFSALIEFATVNYFTKRGYAWDGKSVVP
EKPKKVKDPLIKKNN---TY--------------APTA--------TSYT----------
-----------------------------PNLARGD------------------------
---------------PGLATIAKSATIEPKEVK----------------------PETKP
---------PEPKKTFNSVSKIDRLSRIAFPLLFGIFNLVYWAT-----YLNREPQLKAP
TPHQ----------/
-----MCSGLLEL-------------LLPIWLSWTLGTRGSEPRSV-----------NDP
GNMSFVKETVDKLLKGYDIRL---------------RPD----FGGPPVCVGMN-IDIAS
IDMVSEVNMDYTLTMYFQQYWRDKRLAYSGIPLNLTLDNRVADQLWVPDTYFLNDKKSFV
HGVTVKNRMIRLHPDGTVLYGLRITT----TAACMMDLRRYPLDEQNCTLEIESYGYTTD
DIEFYWRGGDKAV--TGVERIELPQFSIVEHRLVSRNVVFATGAYPRLSLSFRLKRNIGY
FILQTYMPS---------------ILITILSWVSFWINYD-ASAARVA-LGITTVLTMTT
INTHL----RETLPKIPYVKAIDMYLMGCFVFVFLALLEYAFVNYIFFGRGPQRQKKLAE
KTAK-AKNDRSKSES--------------------------------------NRVDAHG
NILLTSL--E-VHNE---------MNE--VSGGIGD------------------------
---------------TRNSAISFDNSGI-QYRK-----QSMPREGHGRFLGDRSLPHKKT
HLRRRSSQLKIKIPDLTDVNAIDRWSRIVFPFTFSLFNLVYWLY-----YVN--------
--------------*
Thank you for your help.