Mercurial > repos > deepakjadmin > mayatool3_test2
comparison docs/scripts/html/MACCSKeysFingerprints.html @ 0:4816e4a8ae95 draft default tip
Uploaded
author | deepakjadmin |
---|---|
date | Wed, 20 Jan 2016 09:23:18 -0500 |
parents | |
children |
comparison
equal
deleted
inserted
replaced
-1:000000000000 | 0:4816e4a8ae95 |
---|---|
1 <html> | |
2 <head> | |
3 <title>MayaChemTools:Documentation:MACCSKeysFingerprints.pl</title> | |
4 <meta http-equiv="content-type" content="text/html;charset=utf-8"> | |
5 <link rel="stylesheet" type="text/css" href="../../css/MayaChemTools.css"> | |
6 </head> | |
7 <body leftmargin="20" rightmargin="20" topmargin="10" bottommargin="10"> | |
8 <br/> | |
9 <center> | |
10 <a href="http://www.mayachemtools.org" title="MayaChemTools Home"><img src="../../images/MayaChemToolsLogo.gif" border="0" alt="MayaChemTools"></a> | |
11 </center> | |
12 <br/> | |
13 <div class="DocNav"> | |
14 <table width="100%" border=0 cellpadding=0 cellspacing=2> | |
15 <tr align="left" valign="top"><td width="33%" align="left"><a href="./JoinTextFiles.html" title="JoinTextFiles.html">Previous</a> <a href="./index.html" title="Table of Contents">TOC</a> <a href="./MergeTextFiles.html" title="MergeTextFiles.html">Next</a></td><td width="34%" align="middle"><strong>MACCSKeysFingerprints.pl</strong></td><td width="33%" align="right"><a href="././code/MACCSKeysFingerprints.html" title="View source code">Code</a> | <a href="./../pdf/MACCSKeysFingerprints.pdf" title="PDF US Letter Size">PDF</a> | <a href="./../pdfgreen/MACCSKeysFingerprints.pdf" title="PDF US Letter Size with narrow margins: www.changethemargins.com">PDFGreen</a> | <a href="./../pdfa4/MACCSKeysFingerprints.pdf" title="PDF A4 Size">PDFA4</a> | <a href="./../pdfa4green/MACCSKeysFingerprints.pdf" title="PDF A4 Size with narrow margins: www.changethemargins.com">PDFA4Green</a></td></tr> | |
16 </table> | |
17 </div> | |
18 <p> | |
19 </p> | |
20 <h2>NAME</h2> | |
21 <p>MACCSKeysFingerprints.pl - Generate MACCS key fingerprints for SD files</p> | |
22 <p> | |
23 </p> | |
24 <h2>SYNOPSIS</h2> | |
25 <p>MACCSKeysFingerprints.pl SDFile(s)...</p> | |
26 <p>MACCSKeysFingerprints.pl [<strong>--AromaticityModel</strong> <em>AromaticityModelType</em>] | |
27 [<strong>--BitsOrder</strong> <em>Ascending | Descending</em>] | |
28 [<strong>-b, --BitStringFormat</strong> <em>BinaryString | HexadecimalString</em>] | |
29 [<strong>--CompoundID</strong> <em>DataFieldName or LabelPrefixString</em>] [<strong>--CompoundIDLabel</strong> <em>text</em>] | |
30 [<strong>--CompoundIDMode</strong> <em>DataField | MolName | LabelPrefix | MolNameOrLabelPrefix</em>] | |
31 [<strong>--DataFields</strong> <em>"FieldLabel1,FieldLabel2,..."</em>] [<strong>-d, --DataFieldsMode</strong> <em>All | Common | Specify | CompoundID</em>] | |
32 [<strong>-f, --Filter</strong> <em>Yes | No</em>] [<strong>--FingerprintsLabel</strong> <em>text</em>] [<strong>-h, --help</strong>] [<strong>-k, --KeepLargestComponent</strong> <em>Yes | No</em>] | |
33 [<strong>-m, --mode</strong> <em>MACCSKeyBits | MACCSKeyCount</em>] [<strong>--OutDelim</strong> <em>comma | tab | semicolon</em>] | |
34 [<strong>--output</strong> <em>SD | FP | text | all</em>] [<strong>-o, --overwrite</strong>] | |
35 [<strong>-q, --quote</strong> <em>Yes | No</em>] [<strong>-r, --root</strong> <em>RootName</em>] [<strong>-s, --size</strong> <em>number</em>] | |
36 [<strong>-v, --VectorStringFormat</strong> <em>IDsAndValuesString | IDsAndValuesPairsString | ValuesAndIDsString | ValuesAndIDsPairsString</em>] | |
37 [<strong>-w, --WorkingDir</strong> <em>DirName</em>]</p> | |
38 <p> | |
39 </p> | |
40 <h2>DESCRIPTION</h2> | |
41 <p>Generate MACCS (Molecular ACCess System) keys fingerprints [ Ref 45-47 ] for <em>SDFile(s)</em> | |
42 and create appropriate SD, FP or CSV/TSV text file(s) containing fingerprints bit-vector or | |
43 vector strings corresponding to molecular fingerprints.</p> | |
44 <p>Multiple SDFile names are separated by spaces. The valid file extensions are <em>.sdf</em> | |
45 and <em>.sd</em>. All other file names are ignored. All the SD files in a current directory | |
46 can be specified either by <em>*.sdf</em> or the current directory name.</p> | |
47 <p>For each MACCS keys definition, atoms are processed to determine their membership to the key | |
48 and the appropriate molecular fingerprints strings are generated. An atom can belong to multiple | |
49 MACCS keys.</p> | |
50 <p>For <em>MACCSKeyBits</em> value of <strong>-m, --mode</strong> option, a fingerprint bit-vector string containing | |
51 zeros and ones is generated and for <em>MACCSKeyCount</em> value, a fingerprint vector string | |
52 corresponding to number of MACCS keys [ Ref 45-47 ] is generated.</p> | |
53 <p><em>MACCSKeyBits | MACCSKeyCount</em> values for <strong>-m, --mode</strong> option along with two possible | |
54 <em>166 | 322</em> values of <strong>-s, --size</strong> supports generation of four different types of MACCS | |
55 keys fingerprint: <em>MACCS166KeyBits, MACCS166KeyCount, MACCS322KeyBits, MACCS322KeyCount</em>.</p> | |
56 <p>Example of <em>SD</em> file containing MAACS keys fingerprints string data:</p> | |
57 <div class="OptionsBox"> | |
58 ... ... | |
59 <br/> ... ... | |
60 <br/> $$$$ | |
61 <br/> ... ... | |
62 <br/> ... ... | |
63 <br/> ... ... | |
64 <br/> 41 44 0 0 0 0 0 0 0 0999 V2000 | |
65 -3.3652 1.4499 0.0000 C 0 0 0 0 0 0 0 0 0 0 0 0 | |
66 <br/> ... ... | |
67 <br/> 2 3 1 0 0 0 0 | |
68 <br/> ... ... | |
69 <br/> M END | |
70 <br/> > <CmpdID> | |
71 <br/> Cmpd1</div> | |
72 <div class="OptionsBox"> | |
73 > <MACCSKeysFingerprints> | |
74 <br/> FingerprintsBitVector;MACCSKeyBits;166;BinaryString;Ascending;000000000 | |
75 <br/> 00000000000000000000000000000000100100001001000000001001000000001110001 | |
76 <br/> 00101010111100011011000100110110000011011110100110111111111111011111111 | |
77 <br/> 11111111110111000</div> | |
78 <div class="OptionsBox"> | |
79 $$$$ | |
80 <br/> ... ... | |
81 <br/> ... ...</div> | |
82 <p>Example of <em>FP</em> file containing MAACS keys fingerprints string data:</p> | |
83 <div class="OptionsBox"> | |
84 # | |
85 <br/> # Package = MayaChemTools 7.4 | |
86 <br/> # Release Date = Oct 21, 2010 | |
87 <br/> # | |
88 <br/> # TimeStamp = Fri Mar 11 14:57:24 2011 | |
89 <br/> # | |
90 <br/> # FingerprintsStringType = FingerprintsBitVector | |
91 <br/> # | |
92 <br/> # Description = MACCSKeyBits | |
93 <br/> # Size = 166 | |
94 <br/> # BitStringFormat = BinaryString | |
95 <br/> # BitsOrder = Ascending | |
96 <br/> # | |
97 <br/> Cmpd1 00000000000000000000000000000000000000000100100001001000000001... | |
98 <br/> Cmpd2 00000000000000000000000010000000001000000010000000001000000000... | |
99 <br/> ... ... | |
100 <br/> ... ..</div> | |
101 <p>Example of CSV <em>Text</em> file containing MAACS keys fingerprints string data:</p> | |
102 <div class="OptionsBox"> | |
103 "CompoundID","MACCSKeysFingerprints" | |
104 <br/> "Cmpd1","FingerprintsBitVector;MACCSKeyBits;166;BinaryString;Ascending; | |
105 <br/> 00000000000000000000000000000000000000000100100001001000000001001000000 | |
106 <br/> 00111000100101010111100011011000100110110000011011110100110111111111111 | |
107 <br/> 01111111111111111110111000" | |
108 <br/> ... ... | |
109 <br/> ... ...</div> | |
110 <p>The current release of MayaChemTools generates the following types of MACCS keys | |
111 fingerprints bit-vector and vector strings:</p> | |
112 <div class="OptionsBox"> | |
113 FingerprintsBitVector;MACCSKeyBits;166;BinaryString;Ascending;00000000 | |
114 <br/> 0000000000000000000000000000000001001000010010000000010010000000011100 | |
115 <br/> 0100101010111100011011000100110110000011011110100110111111111111011111 | |
116 <br/> 11111111111110111000</div> | |
117 <div class="OptionsBox"> | |
118 FingerprintsBitVector;MACCSKeyBits;166;HexadecimalString;Ascending;000 | |
119 <br/> 000000021210210e845f8d8c60b79dffbffffd1</div> | |
120 <div class="OptionsBox"> | |
121 FingerprintsBitVector;MACCSKeyBits;322;BinaryString;Ascending;11101011 | |
122 <br/> 1110011111100101111111000111101100110000000000000011100010000000000000 | |
123 <br/> 0000000000000000000000000000000000000000000000101000000000000000000000 | |
124 <br/> 0000000000000000000000000000000000000000000000000000000000000000000000 | |
125 <br/> 0000000000000000000000000000000000000011000000000000000000000000000000 | |
126 <br/> 0000000000000000000000000000000000000000</div> | |
127 <div class="OptionsBox"> | |
128 FingerprintsBitVector;MACCSKeyBits;322;HexadecimalString;Ascending;7d7 | |
129 <br/> e7af3edc000c1100000000000000500000000000000000000000000000000300000000 | |
130 <br/> 000000000</div> | |
131 <div class="OptionsBox"> | |
132 FingerprintsVector;MACCSKeyCount;166;OrderedNumericalValues;ValuesStri | |
133 <br/> ng;0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 | |
134 <br/> 0 0 0 0 0 0 0 1 0 0 3 0 0 0 0 4 0 0 2 0 0 0 0 0 0 0 0 2 0 0 2 0 0 0 0 | |
135 <br/> 0 0 0 0 1 1 8 0 0 0 1 0 0 1 0 1 0 1 0 3 1 3 1 0 0 0 1 2 0 11 1 0 0 0 | |
136 <br/> 5 0 0 1 2 0 1 1 0 0 0 0 0 1 1 0 1 1 1 1 0 4 0 0 1 1 0 4 6 1 1 1 2 1 1 | |
137 <br/> 3 5 2 2 0 5 3 5 1 1 2 5 1 2 1 2 4 8 3 5 5 2 2 0 3 5 4 1</div> | |
138 <div class="OptionsBox"> | |
139 FingerprintsVector;MACCSKeyCount;322;OrderedNumericalValues;ValuesStri | |
140 <br/> ng;14 8 2 0 2 0 4 4 2 1 4 0 0 2 5 10 5 2 1 0 0 2 0 5 13 3 28 5 5 3 0 0 | |
141 <br/> 0 4 2 1 1 0 1 1 0 0 2 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 22 5 3 0 0 0 1 0 | |
142 <br/> 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 | |
143 <br/> 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 11 0 2 0 0 0 0 0 0 0 0 0 | |
144 <br/> 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 ...</div> | |
145 <p> | |
146 </p> | |
147 <h2>OPTIONS</h2> | |
148 <dl> | |
149 <dt><strong><strong>--AromaticityModel</strong> <em>MDLAromaticityModel | TriposAromaticityModel | MMFFAromaticityModel | ChemAxonBasicAromaticityModel | ChemAxonGeneralAromaticityModel | DaylightAromaticityModel | MayaChemToolsAromaticityModel</em></strong></dt> | |
150 <dd> | |
151 <p>Specify aromaticity model to use during detection of aromaticity. Possible values in the current | |
152 release are: <em>MDLAromaticityModel, TriposAromaticityModel, MMFFAromaticityModel, | |
153 ChemAxonBasicAromaticityModel, ChemAxonGeneralAromaticityModel, DaylightAromaticityModel | |
154 or MayaChemToolsAromaticityModel</em>. Default value: <em>MayaChemToolsAromaticityModel</em>.</p> | |
155 <p>The supported aromaticity model names along with model specific control parameters | |
156 are defined in <strong>AromaticityModelsData.csv</strong>, which is distributed with the current release | |
157 and is available under <strong>lib/data</strong> directory. <strong>Molecule.pm</strong> module retrieves data from | |
158 this file during class instantiation and makes it available to method <strong>DetectAromaticity</strong> | |
159 for detecting aromaticity corresponding to a specific model.</p> | |
160 </dd> | |
161 <dt><strong><strong>--BitsOrder</strong> <em>Ascending | Descending</em></strong></dt> | |
162 <dd> | |
163 <p>Bits order to use during generation of fingerprints bit-vector string for <em>MACCSKeyBits</em> value of | |
164 <strong>-m, --mode</strong> option. Possible values: <em>Ascending, Descending</em>. Default: <em>Ascending</em>.</p> | |
165 <p><em>Ascending</em> bit order which corresponds to first bit in each byte as the lowest bit as | |
166 opposed to the highest bit.</p> | |
167 <p>Internally, bits are stored in <em>Ascending</em> order using Perl vec function. Regardless | |
168 of machine order, big-endian or little-endian, vec function always considers first | |
169 string byte as the lowest byte and first bit within each byte as the lowest bit.</p> | |
170 </dd> | |
171 <dt><strong><strong>-b, --BitStringFormat</strong> <em>BinaryString | HexadecimalString</em></strong></dt> | |
172 <dd> | |
173 <p>Format of fingerprints bit-vector string data in output SD, FP or CSV/TSV text file(s) specified by | |
174 <strong>--output</strong> used during <em>MACCSKeyBits</em> value of <strong>-m, --mode</strong> option. Possible | |
175 values: <em>BinaryString, HexadecimalString</em>. Default value: <em>BinaryString</em>.</p> | |
176 <p><em>BinaryString</em> corresponds to an ASCII string containing 1s and 0s. <em>HexadecimalString</em> | |
177 contains bit values in ASCII hexadecimal format.</p> | |
178 <p>Examples:</p> | |
179 <div class="OptionsBox"> | |
180 FingerprintsBitVector;MACCSKeyBits;166;BinaryString;Ascending;00000000 | |
181 <br/> 0000000000000000000000000000000001001000010010000000010010000000011100 | |
182 <br/> 0100101010111100011011000100110110000011011110100110111111111111011111 | |
183 <br/> 11111111111110111000</div> | |
184 <div class="OptionsBox"> | |
185 FingerprintsBitVector;MACCSKeyBits;166;HexadecimalString;Ascending;000 | |
186 <br/> 000000021210210e845f8d8c60b79dffbffffd1</div> | |
187 <div class="OptionsBox"> | |
188 FingerprintsBitVector;MACCSKeyBits;322;BinaryString;Ascending;11101011 | |
189 <br/> 1110011111100101111111000111101100110000000000000011100010000000000000 | |
190 <br/> 0000000000000000000000000000000000000000000000101000000000000000000000 | |
191 <br/> 0000000000000000000000000000000000000000000000000000000000000000000000 | |
192 <br/> 0000000000000000000000000000000000000011000000000000000000000000000000 | |
193 <br/> 0000000000000000000000000000000000000000</div> | |
194 <div class="OptionsBox"> | |
195 FingerprintsBitVector;MACCSKeyBits;322;HexadecimalString;Ascending;7d7 | |
196 <br/> e7af3edc000c1100000000000000500000000000000000000000000000000300000000 | |
197 <br/> 000000000</div> | |
198 </dd> | |
199 <dt><strong><strong>--CompoundID</strong> <em>DataFieldName or LabelPrefixString</em></strong></dt> | |
200 <dd> | |
201 <p>This value is <strong>--CompoundIDMode</strong> specific and indicates how compound ID is generated.</p> | |
202 <p>For <em>DataField</em> value of <strong>--CompoundIDMode</strong> option, it corresponds to datafield label name | |
203 whose value is used as compound ID; otherwise, it's a prefix string used for generating compound | |
204 IDs like LabelPrefixString<Number>. Default value, <em>Cmpd</em>, generates compound IDs which | |
205 look like Cmpd<Number>.</p> | |
206 <p>Examples for <em>DataField</em> value of <strong>--CompoundIDMode</strong>:</p> | |
207 <div class="OptionsBox"> | |
208 MolID | |
209 <br/> ExtReg</div> | |
210 <p>Examples for <em>LabelPrefix</em> or <em>MolNameOrLabelPrefix</em> value of <strong>--CompoundIDMode</strong>:</p> | |
211 <div class="OptionsBox"> | |
212 Compound</div> | |
213 <p>The value specified above generates compound IDs which correspond to Compound<Number> | |
214 instead of default value of Cmpd<Number>.</p> | |
215 </dd> | |
216 <dt><strong><strong>--CompoundIDLabel</strong> <em>text</em></strong></dt> | |
217 <dd> | |
218 <p>Specify compound ID column label for FP or CSV/TSV text file(s) used during <em>CompoundID</em> value | |
219 of <strong>--DataFieldsMode</strong> option. Default: <em>CompoundID</em>.</p> | |
220 </dd> | |
221 <dt><strong><strong>--CompoundIDMode</strong> <em>DataField | MolName | LabelPrefix | MolNameOrLabelPrefix</em></strong></dt> | |
222 <dd> | |
223 <p>Specify how to generate compound IDs and write to FP or CSV/TSV text file(s) along with generated | |
224 fingerprints for <em>FP | text | all</em> values of <strong>--output</strong> option: use a <em>SDFile(s)</em> datafield value; | |
225 use molname line from <em>SDFile(s)</em>; generate a sequential ID with specific prefix; use combination | |
226 of both MolName and LabelPrefix with usage of LabelPrefix values for empty molname lines.</p> | |
227 <p>Possible values: <em>DataField | MolName | LabelPrefix | MolNameOrLabelPrefix</em>. | |
228 Default: <em>LabelPrefix</em>.</p> | |
229 <p>For <em>MolNameAndLabelPrefix</em> value of <strong>--CompoundIDMode</strong>, molname line in <em>SDFile(s)</em> takes | |
230 precedence over sequential compound IDs generated using <em>LabelPrefix</em> and only empty molname | |
231 values are replaced with sequential compound IDs.</p> | |
232 <p>This is only used for <em>CompoundID</em> value of <strong>--DataFieldsMode</strong> option.</p> | |
233 </dd> | |
234 <dt><strong><strong>--DataFields</strong> <em>"FieldLabel1,FieldLabel2,..."</em></strong></dt> | |
235 <dd> | |
236 <p>Comma delimited list of <em>SDFiles(s)</em> data fields to extract and write to CSV/TSV text file(s) along | |
237 with generated fingerprints for <em>text | all</em> values of <strong>--output</strong> option.</p> | |
238 <p>This is only used for <em>Specify</em> value of <strong>--DataFieldsMode</strong> option.</p> | |
239 <p>Examples:</p> | |
240 <div class="OptionsBox"> | |
241 Extreg | |
242 <br/> MolID,CompoundName</div> | |
243 </dd> | |
244 <dt><strong><strong>-d, --DataFieldsMode</strong> <em>All | Common | Specify | CompoundID</em></strong></dt> | |
245 <dd> | |
246 <p>Specify how data fields in <em>SDFile(s)</em> are transferred to output CSV/TSV text file(s) along | |
247 with generated fingerprints for <em>text | all</em> values of <strong>--output</strong> option: transfer all SD | |
248 data field; transfer SD data files common to all compounds; extract specified data fields; | |
249 generate a compound ID using molname line, a compound prefix, or a combination of both. | |
250 Possible values: <em>All | Common | specify | CompoundID</em>. Default value: <em>CompoundID</em>.</p> | |
251 </dd> | |
252 <dt><strong><strong>-f, --Filter</strong> <em>Yes | No</em></strong></dt> | |
253 <dd> | |
254 <p>Specify whether to check and filter compound data in SDFile(s). Possible values: <em>Yes or No</em>. | |
255 Default value: <em>Yes</em>.</p> | |
256 <p>By default, compound data is checked before calculating fingerprints and compounds containing | |
257 atom data corresponding to non-element symbols or no atom data are ignored.</p> | |
258 </dd> | |
259 <dt><strong><strong>--FingerprintsLabel</strong> <em>text</em></strong></dt> | |
260 <dd> | |
261 <p>SD data label or text file column label to use for fingerprints string in output SD or | |
262 CSV/TSV text file(s) specified by <strong>--output</strong>. Default value: <em>MACCSKeyFingerprints</em>.</p> | |
263 </dd> | |
264 <dt><strong><strong>-h, --help</strong></strong></dt> | |
265 <dd> | |
266 <p>Print this help message.</p> | |
267 </dd> | |
268 <dt><strong><strong>-k, --KeepLargestComponent</strong> <em>Yes | No</em></strong></dt> | |
269 <dd> | |
270 <p>Generate fingerprints for only the largest component in molecule. Possible values: | |
271 <em>Yes or No</em>. Default value: <em>Yes</em>.</p> | |
272 <p>For molecules containing multiple connected components, fingerprints can be generated | |
273 in two different ways: use all connected components or just the largest connected | |
274 component. By default, all atoms except for the largest connected component are | |
275 deleted before generation of fingerprints.</p> | |
276 </dd> | |
277 <dt><strong><strong>-m, --mode</strong> <em>MACCSKeyBits | MACCSKeyCount</em></strong></dt> | |
278 <dd> | |
279 <p>Specify type of MACCS keys [ Ref 45-47 ] fingerprints to generate for molecules in <em>SDFile(s)</em>. | |
280 Possible values: <em>MACCSKeyBits, MACCSKeyCount</em>. Default value: <em>MACCSKeyBits</em>.</p> | |
281 <p>For <em>MACCSKeyBits</em> value of <strong>-m, --mode</strong> option, a fingerprint bit-vector string containing | |
282 zeros and ones is generated and for <em>MACCSKeyCount</em> value, a fingerprint vector string | |
283 corresponding to number of MACCS keys is generated.</p> | |
284 <p><em>MACCSKeyBits | MACCSKeyCount</em> values for <strong>-m, --mode</strong> option along with two possible | |
285 <em>166 | 322</em> values of <strong>-s, --size</strong> supports generation of four different types of MACCS | |
286 keys fingerprint: <em>MACCS166KeyBits, MACCS166KeyCount, MACCS322KeyBits, MACCS322KeyCount</em>.</p> | |
287 <p>Definition of MACCS keys uses the following atom and bond symbols to define atom and | |
288 bond environments:</p> | |
289 <div class="OptionsBox"> | |
290 Atom symbols for 166 keys [ Ref 47 ]:</div> | |
291 <div class="OptionsBox"> | |
292 A : Any valid periodic table element symbol | |
293 <br/> Q : Hetro atoms; any non-C or non-H atom | |
294 <br/> X : Halogens; F, Cl, Br, I | |
295 <br/> Z : Others; other than H, C, N, O, Si, P, S, F, Cl, Br, I</div> | |
296 <div class="OptionsBox"> | |
297 Atom symbols for 322 keys [ Ref 46 ]:</div> | |
298 <div class="OptionsBox"> | |
299 A : Any valid periodic table element symbol | |
300 <br/> Q : Hetro atoms; any non-C or non-H atom | |
301 <br/> X : Others; other than H, C, N, O, Si, P, S, F, Cl, Br, I | |
302 <br/> Z is neither defined nor used</div> | |
303 <div class="OptionsBox"> | |
304 Bond types:</div> | |
305 <div class="OptionsBox"> | |
306 - : Single | |
307 <br/> = : Double | |
308 <br/> T : Triple | |
309 <br/> # : Triple | |
310 <br/> ~ : Single or double query bond | |
311 <br/> % : An aromatic query bond</div> | |
312 <div class="OptionsBox"> | |
313 None : Any bond type; no explicit bond specified</div> | |
314 <div class="OptionsBox"> | |
315 $ : Ring bond; $ before a bond type specifies ring bond | |
316 <br/> ! : Chain or non-ring bond; ! before a bond type specifies chain bond</div> | |
317 <div class="OptionsBox"> | |
318 @ : A ring linkage and the number following it specifies the | |
319 atoms position in the line, thus @1 means linked back to the first | |
320 atom in the list.</div> | |
321 <div class="OptionsBox"> | |
322 Aromatic: Kekule or Arom5</div> | |
323 <div class="OptionsBox"> | |
324 Kekule: Bonds in 6-membered rings with alternate single/double bonds | |
325 or perimeter bonds | |
326 <br/> Arom5: Bonds in 5-membered rings with two double bonds and a hetro | |
327 atom at the apex of the ring.</div> | |
328 <p>MACCS 166 keys [ Ref 45-47 ] are defined as follows:</p> | |
329 <div class="OptionsBox"> | |
330 Key Description</div> | |
331 <div class="OptionsBox"> | |
332 1 ISOTOPE | |
333 <br/> 2 103 < ATOMIC NO. < 256 | |
334 <br/> 3 GROUP IVA,VA,VIA PERIODS 4-6 (Ge...) | |
335 <br/> 4 ACTINIDE | |
336 <br/> 5 GROUP IIIB,IVB (Sc...) | |
337 <br/> 6 LANTHANIDE | |
338 <br/> 7 GROUP VB,VIB,VIIB (V...) | |
339 <br/> 8 QAAA@1 | |
340 <br/> 9 GROUP VIII (Fe...) | |
341 <br/> 10 GROUP IIA (ALKALINE EARTH) | |
342 <br/> 11 4M RING | |
343 <br/> 12 GROUP IB,IIB (Cu...) | |
344 <br/> 13 ON(C)C | |
345 <br/> 14 S-S | |
346 <br/> 15 OC(O)O | |
347 <br/> 16 QAA@1 | |
348 <br/> 17 CTC | |
349 <br/> 18 GROUP IIIA (B...) | |
350 <br/> 19 7M RING | |
351 <br/> 20 SI | |
352 <br/> 21 C=C(Q)Q | |
353 <br/> 22 3M RING | |
354 <br/> 23 NC(O)O | |
355 <br/> 24 N-O | |
356 <br/> 25 NC(N)N | |
357 <br/> 26 C$=C($A)$A | |
358 <br/> 27 I | |
359 <br/> 28 QCH2Q | |
360 <br/> 29 P | |
361 <br/> 30 CQ(C)(C)A | |
362 <br/> 31 QX | |
363 <br/> 32 CSN | |
364 <br/> 33 NS | |
365 <br/> 34 CH2=A | |
366 <br/> 35 GROUP IA (ALKALI METAL) | |
367 <br/> 36 S HETEROCYCLE | |
368 <br/> 37 NC(O)N | |
369 <br/> 38 NC(C)N | |
370 <br/> 39 OS(O)O | |
371 <br/> 40 S-O | |
372 <br/> 41 CTN | |
373 <br/> 42 F | |
374 <br/> 43 QHAQH | |
375 <br/> 44 OTHER | |
376 <br/> 45 C=CN | |
377 <br/> 46 BR | |
378 <br/> 47 SAN | |
379 <br/> 48 OQ(O)O | |
380 <br/> 49 CHARGE | |
381 <br/> 50 C=C(C)C | |
382 <br/> 51 CSO | |
383 <br/> 52 NN | |
384 <br/> 53 QHAAAQH | |
385 <br/> 54 QHAAQH | |
386 <br/> 55 OSO | |
387 <br/> 56 ON(O)C | |
388 <br/> 57 O HETEROCYCLE | |
389 <br/> 58 QSQ | |
390 <br/> 59 Snot%A%A | |
391 <br/> 60 S=O | |
392 <br/> 61 AS(A)A | |
393 <br/> 62 A$A!A$A | |
394 <br/> 63 N=O | |
395 <br/> 64 A$A!S | |
396 <br/> 65 C%N | |
397 <br/> 66 CC(C)(C)A | |
398 <br/> 67 QS | |
399 <br/> 68 QHQH (&...) | |
400 <br/> 69 QQH | |
401 <br/> 70 QNQ | |
402 <br/> 71 NO | |
403 <br/> 72 OAAO | |
404 <br/> 73 S=A | |
405 <br/> 74 CH3ACH3 | |
406 <br/> 75 A!N$A | |
407 <br/> 76 C=C(A)A | |
408 <br/> 77 NAN | |
409 <br/> 78 C=N | |
410 <br/> 79 NAAN | |
411 <br/> 80 NAAAN | |
412 <br/> 81 SA(A)A | |
413 <br/> 82 ACH2QH | |
414 <br/> 83 QAAAA@1 | |
415 <br/> 84 NH2 | |
416 <br/> 85 CN(C)C | |
417 <br/> 86 CH2QCH2 | |
418 <br/> 87 X!A$A | |
419 <br/> 88 S | |
420 <br/> 89 OAAAO | |
421 <br/> 90 QHAACH2A | |
422 <br/> 91 QHAAACH2A | |
423 <br/> 92 OC(N)C | |
424 <br/> 93 QCH3 | |
425 <br/> 94 QN | |
426 <br/> 95 NAAO | |
427 <br/> 96 5M RING | |
428 <br/> 97 NAAAO | |
429 <br/> 98 QAAAAA@1 | |
430 <br/> 99 C=C | |
431 <br/> 100 ACH2N | |
432 <br/> 101 8M RING | |
433 <br/> 102 QO | |
434 <br/> 103 CL | |
435 <br/> 104 QHACH2A | |
436 <br/> 105 A$A($A)$A | |
437 <br/> 106 QA(Q)Q | |
438 <br/> 107 XA(A)A | |
439 <br/> 108 CH3AAACH2A | |
440 <br/> 109 ACH2O | |
441 <br/> 110 NCO | |
442 <br/> 111 NACH2A | |
443 <br/> 112 AA(A)(A)A | |
444 <br/> 113 Onot%A%A | |
445 <br/> 114 CH3CH2A | |
446 <br/> 115 CH3ACH2A | |
447 <br/> 116 CH3AACH2A | |
448 <br/> 117 NAO | |
449 <br/> 118 ACH2CH2A > 1 | |
450 <br/> 119 N=A | |
451 <br/> 120 HETEROCYCLIC ATOM > 1 (&...) | |
452 <br/> 121 N HETEROCYCLE | |
453 <br/> 122 AN(A)A | |
454 <br/> 123 OCO | |
455 <br/> 124 QQ | |
456 <br/> 125 AROMATIC RING > 1 | |
457 <br/> 126 A!O!A | |
458 <br/> 127 A$A!O > 1 (&...) | |
459 <br/> 128 ACH2AAACH2A | |
460 <br/> 129 ACH2AACH2A | |
461 <br/> 130 QQ > 1 (&...) | |
462 <br/> 131 QH > 1 | |
463 <br/> 132 OACH2A | |
464 <br/> 133 A$A!N | |
465 <br/> 134 X (HALOGEN) | |
466 <br/> 135 Nnot%A%A | |
467 <br/> 136 O=A > 1 | |
468 <br/> 137 HETEROCYCLE | |
469 <br/> 138 QCH2A > 1 (&...) | |
470 <br/> 139 OH | |
471 <br/> 140 O > 3 (&...) | |
472 <br/> 141 CH3 > 2 (&...) | |
473 <br/> 142 N > 1 | |
474 <br/> 143 A$A!O | |
475 <br/> 144 Anot%A%Anot%A | |
476 <br/> 145 6M RING > 1 | |
477 <br/> 146 O > 2 | |
478 <br/> 147 ACH2CH2A | |
479 <br/> 148 AQ(A)A | |
480 <br/> 149 CH3 > 1 | |
481 <br/> 150 A!A$A!A | |
482 <br/> 151 NH | |
483 <br/> 152 OC(C)C | |
484 <br/> 153 QCH2A | |
485 <br/> 154 C=O | |
486 <br/> 155 A!CH2!A | |
487 <br/> 156 NA(A)A | |
488 <br/> 157 C-O | |
489 <br/> 158 C-N | |
490 <br/> 159 O > 1 | |
491 <br/> 160 CH3 | |
492 <br/> 161 N | |
493 <br/> 162 AROMATIC | |
494 <br/> 163 6M RING | |
495 <br/> 164 O | |
496 <br/> 165 RING | |
497 <br/> 166 FRAGMENTS</div> | |
498 <p>MACCS 322 keys set as defined in tables 1, 2 and 3 [ Ref 46 ] include:</p> | |
499 <div class="OptionsBox"> | |
500 . 26 atom properties of type P, as listed in Table 1 | |
501 <br/> . 32 one-atom environments, as listed in Table 3 | |
502 <br/> . 264 atom-bond-atom combinations listed in Table 4</div> | |
503 <p>Total number of keys in three tables is : 322</p> | |
504 <p>Atom symbol, X, used for 322 keys [ Ref 46 ] doesn't refer to Halogens as it does for 166 keys. In | |
505 order to keep the definition of 322 keys consistent with the published definitions, the symbol X is | |
506 used to imply "others" atoms, but it's internally mapped to symbol X as defined for 166 keys | |
507 during the generation of key values.</p> | |
508 <p>Atom properties-based keys (26):</p> | |
509 <div class="OptionsBox"> | |
510 Key Description | |
511 <br/> 1 A(AAA) or AA(A)A - atom with at least three neighbors | |
512 <br/> 2 Q - heteroatom | |
513 <br/> 3 Anot%not-A - atom involved in one or more multiple bonds, not aromatic | |
514 <br/> 4 A(AAAA) or AA(A)(A)A - atom with at least four neighbors | |
515 <br/> 5 A(QQ) or QA(Q) - atom with at least two heteroatom neighbors | |
516 <br/> 6 A(QQQ) or QA(Q)Q - atom with at least three heteroatom neighbors | |
517 <br/> 7 QH - heteroatom with at least one hydrogen attached | |
518 <br/> 8 CH2(AA) or ACH2A - carbon with at least two single bonds and at least | |
519 two hydrogens attached | |
520 <br/> 9 CH3(A) or ACH3 - carbon with at least one single bond and at least three | |
521 hydrogens attached | |
522 <br/> 10 Halogen | |
523 <br/> 11 A(-A-A-A) or A-A(-A)-A - atom has at least three single bonds | |
524 <br/> 12 AAAAAA@1 > 2 - atom is in at least two different six-membered rings | |
525 <br/> 13 A($A$A$A) or A$A($A)$A - atom has more than two ring bonds | |
526 <br/> 14 A$A!A$A - atom is at a ring/chain boundary. When a comparison is done | |
527 with another atom the path passes through the chain bond. | |
528 <br/> 15 Anot%A%Anot%A - atom is at an aromatic/nonaromatic boundary. When a | |
529 comparison is done with another atom the path | |
530 passes through the aromatic bond. | |
531 <br/> 16 A!A!A - atom with more than one chain bond | |
532 <br/> 17 A!A$A!A - atom is at a ring/chain boundary. When a comparison is done | |
533 with another atom the path passes through the ring bond. | |
534 <br/> 18 A%Anot%A%A - atom is at an aromatic/nonaromatic boundary. When a | |
535 comparison is done with another atom the | |
536 path passes through the nonaromatic bond. | |
537 <br/> 19 HETEROCYCLE - atom is a heteroatom in a ring. | |
538 <br/> 20 rare properties: atom with five or more neighbors, atom in | |
539 four or more rings, or atom types other than | |
540 H, C, N, O, S, F, Cl, Br, or I | |
541 <br/> 21 rare properties: atom has a charge, is an isotope, has two or | |
542 more multiple bonds, or has a triple bond. | |
543 <br/> 22 N - nitrogen | |
544 <br/> 23 S - sulfur | |
545 <br/> 24 O - oxygen | |
546 <br/> 25 A(AA)A(A)A(AA) - atom has two neighbors, each with three or | |
547 more neighbors (including the central atom). | |
548 <br/> 26 CHACH2 - atom has two hydrocarbon (CH2) neighbors</div> | |
549 <p>Atomic environments properties-based keys (32):</p> | |
550 <div class="OptionsBox"> | |
551 Key Description | |
552 <br/> 27 C(CC) | |
553 <br/> 28 C(CCC) | |
554 <br/> 29 C(CN) | |
555 <br/> 30 C(CCN) | |
556 <br/> 31 C(NN) | |
557 <br/> 32 C(NNC) | |
558 <br/> 33 C(NNN) | |
559 <br/> 34 C(CO) | |
560 <br/> 35 C(CCO) | |
561 <br/> 36 C(NO) | |
562 <br/> 37 C(NCO) | |
563 <br/> 38 C(NNO) | |
564 <br/> 39 C(OO) | |
565 <br/> 40 C(COO) | |
566 <br/> 41 C(NOO) | |
567 <br/> 42 C(OOO) | |
568 <br/> 43 Q(CC) | |
569 <br/> 44 Q(CCC) | |
570 <br/> 45 Q(CN) | |
571 <br/> 46 Q(CCN) | |
572 <br/> 47 Q(NN) | |
573 <br/> 48 Q(CNN) | |
574 <br/> 49 Q(NNN) | |
575 <br/> 50 Q(CO) | |
576 <br/> 51 Q(CCO) | |
577 <br/> 52 Q(NO) | |
578 <br/> 53 Q(CNO) | |
579 <br/> 54 Q(NNO) | |
580 <br/> 55 Q(OO) | |
581 <br/> 56 Q(COO) | |
582 <br/> 57 Q(NOO) | |
583 <br/> 58 Q(OOO)</div> | |
584 <p>Note: The first symbol is the central atom, with atoms bonded to the central atom listed in | |
585 parentheses. Q is any non-C, non-H atom. If only two atoms are in parentheses, there is | |
586 no implication concerning the other atoms bonded to the central atom.</p> | |
587 <p>Atom-Bond-Atom properties-based keys: (264)</p> | |
588 <div class="OptionsBox"> | |
589 Key Description | |
590 <br/> 59 C-C | |
591 <br/> 60 C-N | |
592 <br/> 61 C-O | |
593 <br/> 62 C-S | |
594 <br/> 63 C-Cl | |
595 <br/> 64 C-P | |
596 <br/> 65 C-F | |
597 <br/> 66 C-Br | |
598 <br/> 67 C-Si | |
599 <br/> 68 C-I | |
600 <br/> 69 C-X | |
601 <br/> 70 N-N | |
602 <br/> 71 N-O | |
603 <br/> 72 N-S | |
604 <br/> 73 N-Cl | |
605 <br/> 74 N-P | |
606 <br/> 75 N-F | |
607 <br/> 76 N-Br | |
608 <br/> 77 N-Si | |
609 <br/> 78 N-I | |
610 <br/> 79 N-X | |
611 <br/> 80 O-O | |
612 <br/> 81 O-S | |
613 <br/> 82 O-Cl | |
614 <br/> 83 O-P | |
615 <br/> 84 O-F | |
616 <br/> 85 O-Br | |
617 <br/> 86 O-Si | |
618 <br/> 87 O-I | |
619 <br/> 88 O-X | |
620 <br/> 89 S-S | |
621 <br/> 90 S-Cl | |
622 <br/> 91 S-P | |
623 <br/> 92 S-F | |
624 <br/> 93 S-Br | |
625 <br/> 94 S-Si | |
626 <br/> 95 S-I | |
627 <br/> 96 S-X | |
628 <br/> 97 Cl-Cl | |
629 <br/> 98 Cl-P | |
630 <br/> 99 Cl-F | |
631 <br/> 100 Cl-Br | |
632 <br/> 101 Cl-Si | |
633 <br/> 102 Cl-I | |
634 <br/> 103 Cl-X | |
635 <br/> 104 P-P | |
636 <br/> 105 P-F | |
637 <br/> 106 P-Br | |
638 <br/> 107 P-Si | |
639 <br/> 108 P-I | |
640 <br/> 109 P-X | |
641 <br/> 110 F-F | |
642 <br/> 111 F-Br | |
643 <br/> 112 F-Si | |
644 <br/> 113 F-I | |
645 <br/> 114 F-X | |
646 <br/> 115 Br-Br | |
647 <br/> 116 Br-Si | |
648 <br/> 117 Br-I | |
649 <br/> 118 Br-X | |
650 <br/> 119 Si-Si | |
651 <br/> 120 Si-I | |
652 <br/> 121 Si-X | |
653 <br/> 122 I-I | |
654 <br/> 123 I-X | |
655 <br/> 124 X-X | |
656 <br/> 125 C=C | |
657 <br/> 126 C=N | |
658 <br/> 127 C=O | |
659 <br/> 128 C=S | |
660 <br/> 129 C=Cl | |
661 <br/> 130 C=P | |
662 <br/> 131 C=F | |
663 <br/> 132 C=Br | |
664 <br/> 133 C=Si | |
665 <br/> 134 C=I | |
666 <br/> 135 C=X | |
667 <br/> 136 N=N | |
668 <br/> 137 N=O | |
669 <br/> 138 N=S | |
670 <br/> 139 N=Cl | |
671 <br/> 140 N=P | |
672 <br/> 141 N=F | |
673 <br/> 142 N=Br | |
674 <br/> 143 N=Si | |
675 <br/> 144 N=I | |
676 <br/> 145 N=X | |
677 <br/> 146 O=O | |
678 <br/> 147 O=S | |
679 <br/> 148 O=Cl | |
680 <br/> 149 O=P | |
681 <br/> 150 O=F | |
682 <br/> 151 O=Br | |
683 <br/> 152 O=Si | |
684 <br/> 153 O=I | |
685 <br/> 154 O=X | |
686 <br/> 155 S=S | |
687 <br/> 156 S=Cl | |
688 <br/> 157 S=P | |
689 <br/> 158 S=F | |
690 <br/> 159 S=Br | |
691 <br/> 160 S=Si | |
692 <br/> 161 S=I | |
693 <br/> 162 S=X | |
694 <br/> 163 Cl=Cl | |
695 <br/> 164 Cl=P | |
696 <br/> 165 Cl=F | |
697 <br/> 166 Cl=Br | |
698 <br/> 167 Cl=Si | |
699 <br/> 168 Cl=I | |
700 <br/> 169 Cl=X | |
701 <br/> 170 P=P | |
702 <br/> 171 P=F | |
703 <br/> 172 P=Br | |
704 <br/> 173 P=Si | |
705 <br/> 174 P=I | |
706 <br/> 175 P=X | |
707 <br/> 176 F=F | |
708 <br/> 177 F=Br | |
709 <br/> 178 F=Si | |
710 <br/> 179 F=I | |
711 <br/> 180 F=X | |
712 <br/> 181 Br=Br | |
713 <br/> 182 Br=Si | |
714 <br/> 183 Br=I | |
715 <br/> 184 Br=X | |
716 <br/> 185 Si=Si | |
717 <br/> 186 Si=I | |
718 <br/> 187 Si=X | |
719 <br/> 188 I=I | |
720 <br/> 189 I=X | |
721 <br/> 190 X=X | |
722 <br/> 191 C#C | |
723 <br/> 192 C#N | |
724 <br/> 193 C#O | |
725 <br/> 194 C#S | |
726 <br/> 195 C#Cl | |
727 <br/> 196 C#P | |
728 <br/> 197 C#F | |
729 <br/> 198 C#Br | |
730 <br/> 199 C#Si | |
731 <br/> 200 C#I | |
732 <br/> 201 C#X | |
733 <br/> 202 N#N | |
734 <br/> 203 N#O | |
735 <br/> 204 N#S | |
736 <br/> 205 N#Cl | |
737 <br/> 206 N#P | |
738 <br/> 207 N#F | |
739 <br/> 208 N#Br | |
740 <br/> 209 N#Si | |
741 <br/> 210 N#I | |
742 <br/> 211 N#X | |
743 <br/> 212 O#O | |
744 <br/> 213 O#S | |
745 <br/> 214 O#Cl | |
746 <br/> 215 O#P | |
747 <br/> 216 O#F | |
748 <br/> 217 O#Br | |
749 <br/> 218 O#Si | |
750 <br/> 219 O#I | |
751 <br/> 220 O#X | |
752 <br/> 221 S#S | |
753 <br/> 222 S#Cl | |
754 <br/> 223 S#P | |
755 <br/> 224 S#F | |
756 <br/> 225 S#Br | |
757 <br/> 226 S#Si | |
758 <br/> 227 S#I | |
759 <br/> 228 S#X | |
760 <br/> 229 Cl#Cl | |
761 <br/> 230 Cl#P | |
762 <br/> 231 Cl#F | |
763 <br/> 232 Cl#Br | |
764 <br/> 233 Cl#Si | |
765 <br/> 234 Cl#I | |
766 <br/> 235 Cl#X | |
767 <br/> 236 P#P | |
768 <br/> 237 P#F | |
769 <br/> 238 P#Br | |
770 <br/> 239 P#Si | |
771 <br/> 240 P#I | |
772 <br/> 241 P#X | |
773 <br/> 242 F#F | |
774 <br/> 243 F#Br | |
775 <br/> 244 F#Si | |
776 <br/> 245 F#I | |
777 <br/> 246 F#X | |
778 <br/> 247 Br#Br | |
779 <br/> 248 Br#Si | |
780 <br/> 249 Br#I | |
781 <br/> 250 Br#X | |
782 <br/> 251 Si#Si | |
783 <br/> 252 Si#I | |
784 <br/> 253 Si#X | |
785 <br/> 254 I#I | |
786 <br/> 255 I#X | |
787 <br/> 256 X#X | |
788 <br/> 257 C$C | |
789 <br/> 258 C$N | |
790 <br/> 259 C$O | |
791 <br/> 260 C$S | |
792 <br/> 261 C$Cl | |
793 <br/> 262 C$P | |
794 <br/> 263 C$F | |
795 <br/> 264 C$Br | |
796 <br/> 265 C$Si | |
797 <br/> 266 C$I | |
798 <br/> 267 C$X | |
799 <br/> 268 N$N | |
800 <br/> 269 N$O | |
801 <br/> 270 N$S | |
802 <br/> 271 N$Cl | |
803 <br/> 272 N$P | |
804 <br/> 273 N$F | |
805 <br/> 274 N$Br | |
806 <br/> 275 N$Si | |
807 <br/> 276 N$I | |
808 <br/> 277 N$X | |
809 <br/> 278 O$O | |
810 <br/> 279 O$S | |
811 <br/> 280 O$Cl | |
812 <br/> 281 O$P | |
813 <br/> 282 O$F | |
814 <br/> 283 O$Br | |
815 <br/> 284 O$Si | |
816 <br/> 285 O$I | |
817 <br/> 286 O$X | |
818 <br/> 287 S$S | |
819 <br/> 288 S$Cl | |
820 <br/> 289 S$P | |
821 <br/> 290 S$F | |
822 <br/> 291 S$Br | |
823 <br/> 292 S$Si | |
824 <br/> 293 S$I | |
825 <br/> 294 S$X | |
826 <br/> 295 Cl$Cl | |
827 <br/> 296 Cl$P | |
828 <br/> 297 Cl$F | |
829 <br/> 298 Cl$Br | |
830 <br/> 299 Cl$Si | |
831 <br/> 300 Cl$I | |
832 <br/> 301 Cl$X | |
833 <br/> 302 P$P | |
834 <br/> 303 P$F | |
835 <br/> 304 P$Br | |
836 <br/> 305 P$Si | |
837 <br/> 306 P$I | |
838 <br/> 307 P$X | |
839 <br/> 308 F$F | |
840 <br/> 309 F$Br | |
841 <br/> 310 F$Si | |
842 <br/> 311 F$I | |
843 <br/> 312 F$X | |
844 <br/> 313 Br$Br | |
845 <br/> 314 Br$Si | |
846 <br/> 315 Br$I | |
847 <br/> 316 Br$X | |
848 <br/> 317 Si$Si | |
849 <br/> 318 Si$I | |
850 <br/> 319 Si$X | |
851 <br/> 320 I$I | |
852 <br/> 321 I$X | |
853 <br/> 322 X$X</div> | |
854 </dd> | |
855 <dt><strong><strong>--OutDelim</strong> <em>comma | tab | semicolon</em></strong></dt> | |
856 <dd> | |
857 <p>Delimiter for output CSV/TSV text file(s). Possible values: <em>comma, tab, or semicolon</em> | |
858 Default value: <em>comma</em>.</p> | |
859 </dd> | |
860 <dt><strong><strong>--output</strong> <em>SD | FP | text | all</em></strong></dt> | |
861 <dd> | |
862 <p>Type of output files to generate. Possible values: <em>SD, FP, text, or all</em>. Default value: <em>text</em>.</p> | |
863 </dd> | |
864 <dt><strong><strong>-o, --overwrite</strong></strong></dt> | |
865 <dd> | |
866 <p>Overwrite existing files.</p> | |
867 </dd> | |
868 <dt><strong><strong>-q, --quote</strong> <em>Yes | No</em></strong></dt> | |
869 <dd> | |
870 <p>Put quote around column values in output CSV/TSV text file(s). Possible values: | |
871 <em>Yes or No</em>. Default value: <em>Yes</em>.</p> | |
872 </dd> | |
873 <dt><strong><strong>-r, --root</strong> <em>RootName</em></strong></dt> | |
874 <dd> | |
875 <p>New file name is generated using the root: <Root>.<Ext>. Default for new file | |
876 names: <SDFileName><MACCSKeysFP>.<Ext>. The file type determines <Ext> value. | |
877 The sdf, fpf, csv, and tsv <Ext> values are used for SD, FP, comma/semicolon, and tab | |
878 delimited text files, respectively.This option is ignored for multiple input files.</p> | |
879 </dd> | |
880 <dt><strong><strong>-s, --size</strong> <em>number</em></strong></dt> | |
881 <dd> | |
882 <p>Size of MACCS keys [ Ref 45-47 ] set to use during fingerprints generation. Possible values: <em>166 or 322</em>. | |
883 Default value: <em>166</em>.</p> | |
884 </dd> | |
885 <dt><strong><strong>-v, --VectorStringFormat</strong> <em>ValuesString | IDsAndValuesString | IDsAndValuesPairsString | ValuesAndIDsString | ValuesAndIDsPairsString</em></strong></dt> | |
886 <dd> | |
887 <p>Format of fingerprints vector string data in output SD, FP or CSV/TSV text file(s) specified by | |
888 <strong>--output</strong> used during <em>MACCSKeyCount</em> value of <strong>-m, --mode</strong> option. Possible | |
889 values: <em>ValuesString, IDsAndValuesString | IDsAndValuesPairsString | ValuesAndIDsString | | |
890 ValuesAndIDsPairsString</em>. Defaultvalue: <em>ValuesString</em>.</p> | |
891 <p>Examples:</p> | |
892 <div class="OptionsBox"> | |
893 FingerprintsVector;MACCSKeyCount;166;OrderedNumericalValues;ValuesStri | |
894 <br/> ng;0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 | |
895 <br/> 0 0 0 0 0 0 0 1 0 0 3 0 0 0 0 4 0 0 2 0 0 0 0 0 0 0 0 2 0 0 2 0 0 0 0 | |
896 <br/> 0 0 0 0 1 1 8 0 0 0 1 0 0 1 0 1 0 1 0 3 1 3 1 0 0 0 1 2 0 11 1 0 0 0 | |
897 <br/> 5 0 0 1 2 0 1 1 0 0 0 0 0 1 1 0 1 1 1 1 0 4 0 0 1 1 0 4 6 1 1 1 2 1 1 | |
898 <br/> 3 5 2 2 0 5 3 5 1 1 2 5 1 2 1 2 4 8 3 5 5 2 2 0 3 5 4 1</div> | |
899 <div class="OptionsBox"> | |
900 FingerprintsVector;MACCSKeyCount;322;OrderedNumericalValues;ValuesStri | |
901 <br/> ng;14 8 2 0 2 0 4 4 2 1 4 0 0 2 5 10 5 2 1 0 0 2 0 5 13 3 28 5 5 3 0 0 | |
902 <br/> 0 4 2 1 1 0 1 1 0 0 2 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 22 5 3 0 0 0 1 0 | |
903 <br/> 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 | |
904 <br/> 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 11 0 2 0 0 0 0 0 0 0 0 0 | |
905 <br/> 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 ...</div> | |
906 </dd> | |
907 <dt><strong><strong>-w, --WorkingDir</strong> <em>DirName</em></strong></dt> | |
908 <dd> | |
909 <p>Location of working directory. Default: current directory.</p> | |
910 </dd> | |
911 </dl> | |
912 <p> | |
913 </p> | |
914 <h2>EXAMPLES</h2> | |
915 <p>To generate MACCS keys fingerprints of size 166 in binary bit-vector string format | |
916 and create a SampleMACCS166FPBin.csv file containing sequential compound IDs along with | |
917 fingerprints bit-vector strings data, type:</p> | |
918 <div class="ExampleBox"> | |
919 % MACCSKeysFingerprints.pl -r SampleMACCS166FPBin -o Sample.sdf</div> | |
920 <p>To generate MACCS keys fingerprints of size 166 in binary bit-vector string format | |
921 and create SampleMACCS166FPBin.sdf, SampleMACCS166FPBin.csv and SampleMACCS166FPBin.csv | |
922 files containing sequential compound IDs in CSV file along with fingerprints bit-vector strings data, type:</p> | |
923 <div class="ExampleBox"> | |
924 % MACCSKeysFingerprints.pl --output all -r SampleMACCS166FPBin | |
925 -o Sample.sdf</div> | |
926 <p>To generate MACCS keys fingerprints of size 322 in binary bit-vector string format | |
927 and create a SampleMACCS322FPBin.csv file containing sequential compound IDs along with | |
928 fingerprints bit-vector strings data, type:</p> | |
929 <div class="ExampleBox"> | |
930 % MACCSKeysFingerprints.pl -size 322 -r SampleMACCS322FPBin -o Sample.sdf</div> | |
931 <p>To generate MACCS keys fingerprints of size 166 corresponding to count of keys in | |
932 ValuesString format and create a SampleMACCS166FPCount.csv file containing sequential | |
933 compound IDs along with fingerprints vector strings data, type:</p> | |
934 <div class="ExampleBox"> | |
935 % MACCSKeysFingerprints.pl -m MACCSKeyCount -r SampleMACCS166FPCount | |
936 -o Sample.sdf</div> | |
937 <p>To generate MACCS keys fingerprints of size 322 corresponding to count of keys in | |
938 ValuesString format and create a SampleMACCS322FPCount.csv file containing sequential | |
939 compound IDs along with fingerprints vector strings data, type:</p> | |
940 <div class="ExampleBox"> | |
941 % MACCSKeysFingerprints.pl -m MACCSKeyCount -size 322 | |
942 -r SampleMACCS322FPCount -o Sample.sdf</div> | |
943 <p>To generate MACCS keys fingerprints of size 166 in hexadecimal bit-vector string format with | |
944 ascending bits order and create a SampleMACCS166FPHex.csv file containing compound IDs | |
945 from MolName along with fingerprints bit-vector strings data, type:</p> | |
946 <div class="ExampleBox"> | |
947 % MACCSKeysFingerprints.pl -m MACCSKeyBits --size 166 --BitStringFormat | |
948 HexadecimalString --BitsOrder Ascending --DataFieldsMode CompoundID | |
949 --CompoundIDMode MolName -r SampleMACCS166FPBin -o Sample.sdf</div> | |
950 <p>To generate MACCS keys fingerprints of size 166 corresponding to count of keys in | |
951 IDsAndValuesString format and create a SampleMACCS166FPCount.csv file containing | |
952 compound IDs from MolName line along with fingerprints vector strings data, type:</p> | |
953 <div class="ExampleBox"> | |
954 % MACCSKeysFingerprints.pl -m MACCSKeyCount --size 166 | |
955 --VectorStringFormat IDsAndValuesString --DataFieldsMode CompoundID | |
956 --CompoundIDMode MolName -r SampleMACCS166FPCount -o Sample.sdf</div> | |
957 <p>To generate MACCS keys fingerprints of size 166 corresponding to count of keys in | |
958 IDsAndValuesString format and create a SampleMACCS166FPCount.csv file containing | |
959 compound IDs using specified data field along with fingerprints vector strings data, type:</p> | |
960 <div class="ExampleBox"> | |
961 % MACCSKeysFingerprints.pl -m MACCSKeyCount --size 166 | |
962 --VectorStringFormat IDsAndValuesString --DataFieldsMode CompoundID | |
963 --CompoundIDMode DataField --CompoundID Mol_ID -r | |
964 SampleMACCS166FPCount -o Sample.sdf</div> | |
965 <p>To generate MACCS keys fingerprints of size 322 corresponding to count of keys in | |
966 ValuesString format and create a SampleMACCS322FPCount.tsv file containing compound | |
967 IDs derived from combination of molecule name line and an explicit compound prefix | |
968 along with fingerprints vector strings data in a column labels MACCSKeyCountFP, type:</p> | |
969 <div class="ExampleBox"> | |
970 % MACCSKeysFingerprints.pl -m MACCSKeyCount -size 322 --DataFieldsMode | |
971 CompoundID --CompoundIDMode MolnameOrLabelPrefix --CompoundID Cmpd | |
972 --CompoundIDLabel MolID --FingerprintsLabel MACCSKeyCountFP --OutDelim | |
973 Tab -r SampleMACCS322FPCount -o Sample.sdf</div> | |
974 <p>To generate MACCS keys fingerprints of size 166 corresponding to count of keys in | |
975 ValuesString format and create a SampleMACCS166FPCount.csv file containing | |
976 specific data fields columns along with fingerprints vector strings data, type:</p> | |
977 <div class="ExampleBox"> | |
978 % MACCSKeysFingerprints.pl -m MACCSKeyCount --size 166 | |
979 --VectorStringFormat ValuesString --DataFieldsMode Specify --DataFields | |
980 Mol_ID -r SampleMACCS166FPCount -o Sample.sdf</div> | |
981 <p>To generate MACCS keys fingerprints of size 322 corresponding to count of keys in | |
982 ValuesString format and create a SampleMACCS322FPCount.csv file containing | |
983 common data fields columns along with fingerprints vector strings data, type:</p> | |
984 <div class="ExampleBox"> | |
985 % MACCSKeysFingerprints.pl -m MACCSKeyCount --size 322 | |
986 --VectorStringFormat ValuesString --DataFieldsMode Common -r | |
987 SampleMACCS322FPCount -o Sample.sdf</div> | |
988 <p>To generate MACCS keys fingerprints of size 166 corresponding to count of keys in | |
989 ValuesString format and create SampleMACCS166FPCount.sdf, SampleMACCS166FPCount.fpf and | |
990 SampleMACCS166FPCount.csv files containing all data fields columns in CSV file | |
991 along with fingerprints vector strings data, type:</p> | |
992 <div class="ExampleBox"> | |
993 % MACCSKeysFingerprints.pl -m MACCSKeyCount --size 166 --output all | |
994 --VectorStringFormat ValuesString --DataFieldsMode All -r | |
995 SampleMACCS166FPCount -o Sample.sdf</div> | |
996 <p> | |
997 </p> | |
998 <h2>AUTHOR</h2> | |
999 <p><a href="mailto:msud@san.rr.com">Manish Sud</a></p> | |
1000 <p> | |
1001 </p> | |
1002 <h2>SEE ALSO</h2> | |
1003 <p><a href="./InfoFingerprintsFiles.html">InfoFingerprintsFiles.pl</a>, <a href="./SimilarityMatricesFingerprints.html">SimilarityMatricesFingerprints.pl</a>, <a href="./AtomNeighborhoodsFingerprints.html">AtomNeighborhoodsFingerprints.pl</a>,  | |
1004 <a href="./ExtendedConnectivityFingerprints.html">ExtendedConnectivityFingerprints.pl</a>, <a href="./PathLengthFingerprints.html">PathLengthFingerprints.pl</a>,  | |
1005 <a href="./TopologicalAtomPairsFingerprints.html">TopologicalAtomPairsFingerprints.pl</a>, <a href="./TopologicalAtomTorsionsFingerprints.html">TopologicalAtomTorsionsFingerprints.pl</a>,  | |
1006 <a href="./TopologicalPharmacophoreAtomPairsFingerprints.html">TopologicalPharmacophoreAtomPairsFingerprints.pl</a>, <a href="./TopologicalPharmacophoreAtomTripletsFingerprints.html">TopologicalPharmacophoreAtomTripletsFingerprints.pl</a> | |
1007 </p> | |
1008 <p> | |
1009 </p> | |
1010 <h2>COPYRIGHT</h2> | |
1011 <p>Copyright (C) 2015 Manish Sud. All rights reserved.</p> | |
1012 <p>This file is part of MayaChemTools.</p> | |
1013 <p>MayaChemTools is free software; you can redistribute it and/or modify it under | |
1014 the terms of the GNU Lesser General Public License as published by the Free | |
1015 Software Foundation; either version 3 of the License, or (at your option) | |
1016 any later version.</p> | |
1017 <p> </p><p> </p><div class="DocNav"> | |
1018 <table width="100%" border=0 cellpadding=0 cellspacing=2> | |
1019 <tr align="left" valign="top"><td width="33%" align="left"><a href="./JoinTextFiles.html" title="JoinTextFiles.html">Previous</a> <a href="./index.html" title="Table of Contents">TOC</a> <a href="./MergeTextFiles.html" title="MergeTextFiles.html">Next</a></td><td width="34%" align="middle"><strong>March 29, 2015</strong></td><td width="33%" align="right"><strong>MACCSKeysFingerprints.pl</strong></td></tr> | |
1020 </table> | |
1021 </div> | |
1022 <br /> | |
1023 <center> | |
1024 <img src="../../images/h2o2.png"> | |
1025 </center> | |
1026 </body> | |
1027 </html> |