annotate mayachemtools/docs/scripts/txt/ExtractFromPDBFiles.txt @ 9:ab29fa5c8c1f draft default tip

Uploaded
author deepakjadmin
date Thu, 15 Dec 2016 14:18:03 -0500
parents 73ae111cf86f
children
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
0
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
1 NAME
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
2 ExtractFromPDBFiles.pl - Extract specific data from PDBFile(s)
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
3
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
4 SYNOPSIS
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
5 ExtractFromPDBFiles.pl PDBFile(s)...
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
6
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
7 ExtractFromPDBFiles.pl [-a, --Atoms "AtomNum, [AtomNum...]" |
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
8 "StartAtomNum, EndAtomNum" | "AtomName, [AtomName...]"] [-c, --chains
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
9 First | All | "ChainID, [ChainID,...]"] [<--CombineChains> yes | no]
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
10 [-d, --distance number] [--DistanceMode Atom | Hetatm | Residue | XYZ]
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
11 [--DistanceOrigin "AtomNumber, AtomName" | "HetatmNumber, HetAtmName" |
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
12 "ResidueNumber, ResidueName, [ChainID]" | "X,Y,Z">]
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
13 [<--DistanceSelectionMode> ByAtom | ByResidue] [-h, --help] [-k,
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
14 --KeepOldRecords yes | no] [-m, --mode Chains | Sequences | Atoms |
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
15 CAlphas | AtomNums | AtomsRange | AtomNames | ResidueNums |
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
16 ResiduesRange | ResidueNames | Distance | NonWater | NonHydrogens]
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
17 [--ModifyHeader yes | no] [--NonStandardKeep yes | no]
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
18 [--NonStandardCode character] [-o, --overwrite] [-r, --root rootname]
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
19 --RecordMode *Atom | Hetatm | AtomAndHetatm*] [--Residues
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
20 "ResidueNum,[ResidueNum...]" | StartResidueNum,EndResiduNum ]
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
21 [--SequenceLength number] [--SequenceRecords Atom | SeqRes]
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
22 [--SequenceIDPrefix FileName | HeaderRecord | Automatic]
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
23 [--WaterResidueNames Automatic | "ResidueName, [ResidueName,...]"] [-w,
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
24 --WorkingDir dirname] PDBFile(s)...
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
25
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
26 DESCRIPTION
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
27 Extract specific data from *PDBFile(s)* and generate appropriate PDB or
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
28 sequence file(s). Multiple PDBFile names are separated by spaces. The
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
29 valid file extension is *.pdb*. All other file name extensions are
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
30 ignored during the wild card expansion. All the PDB files in a current
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
31 directory can be specified either by **.pdb* or the current directory
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
32 name.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
33
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
34 During *Chains* and *Sequences* values of -m, --mode option, all
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
35 ATOM/HETAM records for chains after the first model in PDB fils
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
36 containing data for multiple models are ignored.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
37
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
38 OPTIONS
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
39 -a, --Atoms *"AtomNum,[AtomNum...]" | "StartAtomNum,EndAtomNum" |
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
40 "AtomName,[AtomName...]"*
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
41 Specify which atom records to extract from *PDBFiles(s)* during
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
42 *AtomNums*, *AtomsRange*, and *AtomNames* value of -m, --mode
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
43 option: extract records corresponding to atom numbers specified in a
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
44 comma delimited list of atom numbers/names, or with in the range of
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
45 start and end atom numbers. Possible values:
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
46 *"AtomNum[,AtomNum,..]"*, *StartAtomNum,EndAtomNum*, or
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
47 *"AtomName[,AtomName,..]"*. Default: *None*. Examples:
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
48
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
49 10
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
50 15,20
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
51 N,CA,C,O
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
52
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
53 -c, --chains *First | All | ChainID,[ChainID,...]*
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
54 Specify which chains to extract from *PDBFile(s)* during *Chains |
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
55 Sequences* value of -m, --mode option: first chain, all chains, or a
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
56 specific list of comma delimited chain IDs. Possible values: *First
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
57 | All | ChainID,[ChainID,...]*. Default: *First*. Examples:
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
58
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
59 A
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
60 A,B
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
61 All
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
62
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
63 --CombineChains *yes | no*
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
64 Specify whether to combine extracted chains data into a single file
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
65 during *Chains* or *Sequences* value of -m, --mode option. Possible
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
66 values: *yes | no*. Default: *no*.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
67
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
68 During *Chains* value of <-m, --mode> option with *Yes* value of
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
69 <--CombineChains>, extracted data for specified chains is written
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
70 into a single file instead of individual file for each chain.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
71
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
72 During *Sequences* value of <-m, --mode> option with *Yes* value of
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
73 <--CombineChains>, residues sequences for specified chains are
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
74 extracted and concatenated into a single sequence file instead of
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
75 individual file for each chain.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
76
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
77 -d, --distance *number*
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
78 Specify distance used to extract ATOM/HETATM recods during
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
79 *Distance* value of -m, --mode option. Default: *10.0* angstroms.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
80
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
81 --RecordMode option controls type of record lines to extract from
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
82 *PDBFile(s)*: ATOM, HETATM or both.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
83
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
84 --DistanceMode *Atom | Hetatm | Residue | XYZ*
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
85 Specify how to extract ATOM/HETATM records from *PDBFile(s)* during
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
86 *Distance* value of -m, --mode option: extract all the records
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
87 within a certain distance specifed by -d, --distance from an atom or
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
88 hetro atom record, a residue, or any artbitrary point. Possible
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
89 values: *Atom | Hetatm | Residue | XYZ*. Default: *XYZ*.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
90
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
91 During *Residue* value of --distancemode, distance of ATOM/HETATM
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
92 records is calculated from all the atoms in the residue and the
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
93 records are selected as long as any atom of the residue lies with in
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
94 the distace specified using -d, --distance option.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
95
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
96 --RecordMode option controls type of record lines to extract from
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
97 *PDBFile(s)*: ATOM, HETATM or both.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
98
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
99 --DistanceSelectionMode *ByAtom | ByResidue*
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
100 Specify how how to extract ATOM/HETATM records from *PDBFile(s)*
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
101 during *Distance* value of -m, --mode option for all values of
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
102 --DistanceMode option: extract only those ATOM/HETATM records that
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
103 meet specified distance criterion; extract all records corresponding
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
104 to a residue as long as one of the ATOM/HETATM record in the residue
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
105 satisfies specified distance criterion. Possible values: *ByAtom,
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
106 ByResidue*. Default value: *ByAtom*.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
107
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
108 --DistanceOrigin *"AtomNumber,AtomName" | "HetatmNumber,HetAtmName" |
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
109 "ResidueNumber,ResidueName[,ChainID]" | "X,Y,Z"*
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
110 This value is --distancemode specific. In general, it identifies a
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
111 point used to select other ATOM/HETATMS with in a specific distance
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
112 from this point.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
113
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
114 For *Atom* value of --distancemode, this option corresponds to an
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
115 atom specification. Format: *AtomNumber,AtomName*. Example:
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
116
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
117 455,CA
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
118
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
119 For *Hetatm* value of --distancemode, this option corresponds to a
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
120 hetatm specification. Format: *HetatmNumber,HetAtmName*. Example:
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
121
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
122 5295,C1
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
123
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
124 For *Residue* value of --distancemode, this option corresponds to a
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
125 residue specification. Format: *ResidueNumber,
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
126 ResidueName[,ChainID]*. Example:
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
127
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
128 78,MSE
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
129 977,RET,A
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
130 978,RET,B
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
131
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
132 For *XYZ* value of --distancemode, this option corresponds to a
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
133 coordinate of an arbitrary point. Format: *X,Y,X*. Example:
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
134
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
135 10.044,19.261,-4.292
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
136
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
137 --RecordMode option controls type of record lines to extract from
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
138 *PDBFile(s)*: ATOM, HETATM or both.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
139
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
140 -h, --help
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
141 Print this help message.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
142
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
143 -k, --KeepOldRecords *yes | no*
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
144 Specify whether to transfer old non ATOM and HETATM records from
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
145 input PDBFile(s) to new PDBFile(s) during *Chains | Atoms | HetAtms
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
146 | CAlphas | Distance| NonWater | NonHydrogens* value of -m --mode
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
147 option. By default, except for the HEADER record, all other
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
148 unnecessary non ATOM/HETATM records are dropped during the
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
149 generation of new PDB files. Possible values: *yes | no*. Default:
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
150 *no*.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
151
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
152 -m, --mode *Chains | Sequences | Atoms | CAlphas | AtomNums | AtomsRange
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
153 | AtomNames | ResidueNums | ResiduesRange | ResidueNames | Distance |
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
154 NonWater | NonHydrogens*
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
155 Specify what to extract from *PDBFile(s)*: *Chains* - retrieve
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
156 records for specified chains; *Sequences* - generate sequence files
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
157 for specific chains; *Atoms* - extract atom records; *CAlphas* -
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
158 extract atom records for alpha carbon atoms; *AtomNums* - extract
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
159 atom records for specified atom numbers; *AtomsRange* - extract atom
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
160 records between specified atom number range; *AtomNames* - extract
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
161 atom records for specified atom names; *ResidueNums* - extract
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
162 records for specified residue numbers; *ResiduesRange* - extract
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
163 records for residues between specified residue number range;
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
164 *ResidueNames* - extract records for specified residue names;
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
165 *Distance* - extract records with in a certain distance from a
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
166 specific position; *NonWater* - extract records corresponding to
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
167 residues other than water; *NonHydrogens* - extract non-hydrogen
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
168 records.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
169
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
170 Possible values: *Chains, Sequences Atoms, CAlphas, AtomNums,
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
171 AtomsRange, AtomNames, ResidueNums, ResiduesRange, ResidueNames,
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
172 Distance, NonWater, NonHydrogens*. Default value: *NonWater*
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
173
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
174 During the generation of new PDB files, unnecessay CONECT records
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
175 are dropped.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
176
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
177 For *Chains* mode, data for appropriate chains specified by --c
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
178 --chains option is extracted from *PDBFile(s)* and placed into new
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
179 PDB file(s).
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
180
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
181 For *Sequences* mode, residues names using various sequence related
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
182 options are extracted for chains specified by --c --chains option
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
183 from *PDBFile(s)* and FASTA sequence file(s) are generated.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
184
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
185 For *Distance* mode, all ATOM/HETATM records with in a distance
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
186 specified by -d --distance option from a specific atom, residue or a
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
187 point indicated by --distancemode are extracted and placed into new
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
188 PDB file(s).
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
189
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
190 For *NonWater* mode, non water ATOM/HETATM record lines, identified
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
191 using value of --WaterResidueNames, are extracted and written to new
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
192 PDB file(s).
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
193
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
194 For *NonHydrogens* mode, ATOM/HETATOM record lines containing
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
195 element symbol other than *H* are extracted and written to new PDB
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
196 file(s).
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
197
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
198 For all other options, appropriate ATOM/HETATM records are extracted
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
199 to generate new PDB file(s).
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
200
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
201 --RecordMode option controls type of record lines to extract and
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
202 process from *PDBFile(s)*: ATOM, HETATM or both.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
203
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
204 --ModifyHeader *yes | no*
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
205 Specify whether to modify HEADER record during the generation of new
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
206 PDB files for -m, --mode values of *Chains | Atoms | CAlphas |
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
207 Distance*. Possible values: *yes | no*. Default: *yes*. By default,
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
208 Classification data is replaced by *Data extracted using
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
209 MayaChemTools* before writing out HEADER record.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
210
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
211 --NonStandardKeep *yes | no*
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
212 Specify whether to include and convert non-standard three letter
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
213 residue codes into a code specified using --nonstandardcode option
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
214 and include them into sequence file(s) generated during *Sequences*
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
215 value of -m, --mode option. Possible values: *yes | no*. Default:
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
216 *yes*.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
217
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
218 A warning is also printed about the presence of non-standard
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
219 residues. Any residue other than standard 20 amino acids and 5
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
220 nucleic acid is considered non-standard; additionally, HETATM
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
221 residues in chains also tagged as non-standard.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
222
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
223 --NonStandardCode *character*
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
224 A single character code to use for non-standard residues. Default:
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
225 *X*. Possible values: *?, -, or X*.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
226
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
227 -o, --overwrite
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
228 Overwrite existing files.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
229
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
230 -r, --root *rootname*
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
231 New PDB and sequence file name is generated using the root:
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
232 <Root><Mode>.<Ext>. Default new file name:
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
233 <PDBFileName>Chain<ChainID>.pdb for *Chains* mode;
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
234 <PDBFileName>SequenceChain<ChainID>.fasta for *Sequences* mode;
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
235 <PDBFileName>DistanceBy<DistanceMode>.pdb for *Distance* -m, --mode
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
236 <PDBFileName><Mode>.pdb for *Atoms | CAlphas | NonWater |
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
237 NonHydrogens* -m, --mode values. This option is ignored for multiple
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
238 input files.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
239
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
240 --RecordMode *Atom | Hetatm | AtomAndHetatm*
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
241 Specify type of record lines to extract and process from
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
242 *PDBFile(s)* during various values of -m, --mode option: extract
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
243 only ATOM record lines; extract only HETATM record lines; extract
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
244 both ATOM and HETATM lines. Possible values: *Atom | Hetatm |
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
245 AtomAndHetatm | XYZ*. Default during *Atoms, CAlphas, AtomNums,
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
246 AtomsRange, AtomNames* values of -m, --mode option: *Atom*;
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
247 otherwise: *AtomAndHetatm*.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
248
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
249 This option is ignored during *Chains, Sequences* values of -m,
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
250 --mode option.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
251
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
252 --Residues *"ResidueNum,[ResidueNum...]" |
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
253 "StartResidueNum,EndResiduNum" | "ResidueName,[ResidueName...]"*
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
254 Specify which resiude records to extract from *PDBFiles(s)* during
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
255 *ResidueNums*, *ResiduesRange*,and *ResidueNames* value of -m,
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
256 --mode option: extract records corresponding to residue numbers
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
257 specified in a comma delimited list of residue numbers/names, or
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
258 with in the range of start and end residue numbers. Possible values:
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
259 *"ResidueNum[,ResidueNum,..]"*, *StartResidueNum,EndResiduNum*, or
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
260 *<"ResidueName[,ResidueName,..]"*. Default: *None*. Examples:
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
261
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
262 20
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
263 5,10
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
264 TYR,SER,THR
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
265
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
266 --RecordMode option controls type of record lines to extract from
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
267 *PDBFile(s)*: ATOM, HETATM or both.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
268
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
269 --SequenceLength *number*
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
270 Maximum sequence length per line in sequence file(s). Default: *80*.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
271
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
272 --SequenceRecords *Atom | SeqRes*
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
273 Specify which records to use for extracting residue names from
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
274 *PDBFiles(s)* during *Sequences* value of -m, --mode option: use
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
275 ATOM records to compile a list of residues in a chain or parse
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
276 SEQRES record to get a list of residues. Possible values: *Atom |
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
277 SeqRes*. Default: *Atom*.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
278
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
279 --SequenceIDPrefix *FileName | HeaderRecord | Automatic*
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
280 Specify how to generate a prefix for sequence IDs during *Sequences*
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
281 value of -m, --mode option: use input file name prefix; retrieve PDB
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
282 ID from HEADER record; or automatically decide the method for
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
283 generating the prefix. The chain IDs are also appended to the
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
284 prefix. Possible values: *FileName | HeaderRecord | Automatic*.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
285 Default: *Automatic*
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
286
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
287 --WaterResidueNames *Automatic | "ResidueName,[ResidueName,...]"*
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
288 Identification of water residues during *NonWater* value of -m,
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
289 --mode option. Possible values: *Automatic |
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
290 "ResidueName,[ResidueName,...]"*. Default: *Automatic* - corresponds
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
291 to "HOH,WAT,H20". You can also specify a different comma delimited
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
292 list of residue names to use for water.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
293
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
294 -w, --WorkingDir *dirname*
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
295 Location of working directory. Default: current directory.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
296
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
297 EXAMPLES
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
298 To extract non-water records from Sample2.pdb file and generate
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
299 Sample2NonWater.pdb file, type:
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
300
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
301 % ExtractFromPDBFiles.pl Sample2.pdb
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
302
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
303 To extract non-water records corresponding to only ATOM records from
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
304 Sample2.pdb file and generate Sample2NonWater.pdb file, type:
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
305
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
306 % ExtractFromPDBFiles.pl --RecordMode Atom Sample2.pdb
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
307
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
308 To extract non-water records from Sample2.pdb file using HOH or WAT
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
309 residue name for water along with all old non-coordinate records and
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
310 generate Sample2NewNonWater.pdb file, type:
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
311
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
312 % ExtractFromPDBFiles.pl -m NonWater --WaterResidueNames "HOH,WAT"
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
313 -KeepOldRecords Yes -r Sample2New -o Sample2.pdb
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
314
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
315 To extract non-hydrogens records from Sample2.pdb file and generate
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
316 Sample2NonHydrogen.pdb file, type:
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
317
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
318 % ExtractFromPDBFiles.pl -m NonHydrogens Sample2.pdb
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
319
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
320 To extract data for first chain in Sample2.pdb and generate
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
321 Sample2ChainA.pdb, type file, type:
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
322
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
323 % ExtractFromPDBFiles.pl -m chains -o Sample2.pdb
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
324
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
325 To extract data for both chains in Sample2.pdb and generate
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
326 Sample2ChainA.pdb and Sample2ChainB.pdb, type:
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
327
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
328 % ExtractFromPDBFiles.pl -m chains -c All -o Sample2.pdb
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
329
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
330 To extract data for alpha carbons in Sample2.pdb and generate
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
331 Sample2CAlphas.pdb, type:
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
332
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
333 % ExtractFromPDBFiles.pl -m CAlphas -o Sample2.pdb
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
334
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
335 To extract records for specific residue numbers in all chains from
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
336 Sample2.pdb file and generate Sample2ResidueNums.pdb file, type:
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
337
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
338 % ExtractFromPDBFiles.pl -m ResidueNums --Residues "3,6"
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
339 Sample2.pdb
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
340
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
341 To extract records for a specific range of residue number in all chains
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
342 from Sample2.pdb file and generate Sample2ResiduesRange.pdb file, type:
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
343
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
344 % ExtractFromPDBFiles.pl -m ResiduesRange --Residues "10,30"
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
345 Sample2.pdb
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
346
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
347 To extract data for all ATOM and HETATM records with in 10 angstrom of
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
348 an atom specifed by atom serial number and name "1,N" in Sample2.pdb
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
349 file and generate Sample2DistanceByAtom.pdb, type:
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
350
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
351 % ExtractFromPDBFiles.pl -m Distance --DistanceMode Atom
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
352 --DistanceOrigin "1,N" -k No --distance 10 -o Sample2.pdb
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
353
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
354 To extract data for all ATOM and HETATM records for complete residues
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
355 with any atom or hetatm less than 10 angstrom of an atom specifed by
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
356 atom serial number and name "1,N" in Sample2.pdb file and generate
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
357 Sample2DistanceByAtom.pdb, type:
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
358
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
359 % ExtractFromPDBFiles.pl -m Distance --DistanceMode Atom
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
360 --DistanceOrigin "1,N" --DistanceSelectionMode ByResidue
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
361 -k No --distance 10 -o Sample2.pdb
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
362
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
363 To extract data for all ATOM and HETATM records with in 25 angstrom of
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
364 an arbitrary point "0,0,0" in Sample2.pdb file and generate
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
365 Sample2DistanceByXYZ.pdb, type:
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
366
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
367 % ExtractFromPDBFiles.pl -m Distance --DistanceMode XYZ
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
368 --DistanceOrigin "0,0,0" -k No --distance 25 -o Sample2.pdb
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
369
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
370 AUTHOR
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
371 Manish Sud <msud@san.rr.com>
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
372
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
373 SEE ALSO
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
374 InfoPDBFiles.pl, ModifyPDBFiles.pl
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
375
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
376 COPYRIGHT
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
377 Copyright (C) 2015 Manish Sud. All rights reserved.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
378
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
379 This file is part of MayaChemTools.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
380
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
381 MayaChemTools is free software; you can redistribute it and/or modify it
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
382 under the terms of the GNU Lesser General Public License as published by
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
383 the Free Software Foundation; either version 3 of the License, or (at
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
384 your option) any later version.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
385