annotate mayachemtools/docs/scripts/txt/EStateIndiciesFingerprints.txt @ 9:ab29fa5c8c1f draft default tip

Uploaded
author deepakjadmin
date Thu, 15 Dec 2016 14:18:03 -0500
parents 73ae111cf86f
children
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
0
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
1 NAME
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
2 EStateIndiciesFingerprints.pl - Generate E-state indicies fingerprints
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
3 for SD files
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
4
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
5 SYNOPSIS
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
6 EStateIndiciesFingerprints.pl SDFile(s)...
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
7
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
8 EStateIndiciesFingerprints.pl [--AromaticityModel
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
9 *AromaticityModelType*] [--CompoundID *DataFieldName or
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
10 LabelPrefixString*] [--CompoundIDLabel *text*] [--CompoundIDMode
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
11 *DataField | MolName | LabelPrefix | MolNameOrLabelPrefix*]
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
12 [--DataFields *"FieldLabel1,FieldLabel2,..."*] [-d, --DataFieldsMode
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
13 *All | Common | Specify | CompoundID*] [-e, --EStateAtomTypesSetToUse
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
14 *ArbitrarySize or FixedSize*] [-f, --Filter *Yes | No*]
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
15 [--FingerprintsLabelMode *FingerprintsLabelOnly |
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
16 FingerprintsLabelWithIDs*] [--FingerprintsLabel *text*] [-h, --help]
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
17 [-k, --KeepLargestComponent *Yes | No*] [--OutDelim *comma | tab |
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
18 semicolon*] [--output *SD | FP | text | all*] [-o, --overwrite] [-q,
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
19 --quote *Yes | No*] [-r, --root *RootName*] [-s, --size *number*]
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
20 [--ValuesPrecision *number*] [-v, --VectorStringFormat
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
21 *IDsAndValuesString | IDsAndValuesPairsString | ValuesAndIDsString |
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
22 ValuesAndIDsPairsString*] [-w, --WorkingDir *DirName*]
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
23
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
24 DESCRIPTION
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
25 Generate E-state indicies fingerprints [ Ref 75-78 ] for *SDFile(s)* and
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
26 create appropriate SD, FP, or CSV/TSV text file(s) containing
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
27 fingerprints bit-vector or vector strings corresponding to molecular
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
28 fingerprints.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
29
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
30 Multiple SDFile names are separated by spaces. The valid file extensions
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
31 are *.sdf* and *.sd*. All other file names are ignored. All the SD files
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
32 in a current directory can be specified either by **.sdf* or the current
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
33 directory name.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
34
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
35 E-state atom types are assigned to all non-hydrogen atoms in a molecule
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
36 using module AtomTypes::EStateAtomTypes.pm and E-state values are
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
37 calculated using module AtomicDescriptors::EStateValues.pm. Using
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
38 E-state atom types and E-state values, EStateIndiciesFingerprints
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
39 constituting sum of E-state values for E-sate atom types is generated.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
40
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
41 Two types of E-state atom types set size are allowed:
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
42
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
43 ArbitrarySize - Corresponds to only E-state atom types detected
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
44 in molecule
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
45 FixedSize - Corresponds to fixed number of E-state atom types previously
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
46 defined
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
47
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
48 Module AtomTypes::EStateAtomTypes.pm, used to assign E-state atom types
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
49 to non-hydrogen atoms in the molecule, is able to assign atom types to
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
50 any valid atom group. However, for *FixedSize* value of
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
51 EStateAtomTypesSetToUse, only a fixed set of E-state atom types
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
52 corresponding to specific atom groups [ Appendix III in Ref 77 ] are
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
53 used for fingerprints.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
54
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
55 The fixed size E-state atom type set size used during generation of
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
56 fingerprints contains 87 E-state non-hydrogen atom types in
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
57 EStateAtomTypes.csv data file distributed with MayaChemTools.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
58
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
59 Combination of Type and EStateAtomTypesSetToUse allow generation of 2
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
60 different types of E-state indicies fingerprints:
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
61
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
62 Type EStateAtomTypesSetToUse
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
63
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
64 EStateIndicies ArbitrarySize [ default fingerprints ]
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
65 EStateIndicies FixedSize
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
66
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
67 Example of *SD* file containing E-state indicies fingerprints string
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
68 data:
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
69
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
70 ... ...
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
71 ... ...
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
72 $$$$
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
73 ... ...
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
74 ... ...
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
75 ... ...
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
76 41 44 0 0 0 0 0 0 0 0999 V2000
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
77 -3.3652 1.4499 0.0000 C 0 0 0 0 0 0 0 0 0 0 0 0
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
78 ... ...
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
79 2 3 1 0 0 0 0
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
80 ... ...
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
81 M END
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
82 > <CmpdID>
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
83 Cmpd1
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
84
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
85 > <EStateIndiciesFingerprints>
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
86 FingerprintsVector;EStateIndicies:ArbitrarySize;11;NumericalValues;IDsA
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
87 ndValuesString;SaaCH SaasC SaasN SdO SdssC SsCH3 SsF SsOH SssCH2 SssNH
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
88 SsssCH;24.778 4.387 1.993 25.023 -1.435 3.975 14.006 29.759 -0.073 3.02
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
89 4 -2.270
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
90
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
91 $$$$
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
92 ... ...
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
93 ... ...
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
94
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
95 Example of *FP* file containing E-state indicies fingerprints string
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
96 data:
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
97
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
98 #
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
99 # Package = MayaChemTools 7.4
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
100 # Release Date = Oct 21, 2010
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
101 #
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
102 # TimeStamp = Fri Mar 11 14:35:11 2011
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
103 #
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
104 # FingerprintsStringType = FingerprintsVector
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
105 #
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
106 # Description = EStateIndicies:ArbitrarySize
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
107 # VectorStringFormat = IDsAndValuesString
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
108 # VectorValuesType = NumericalValues
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
109 #
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
110 Cmpd1 11;SaaCH SaasC SaasN SdO SdssC...;24.778 4.387 1.993 25.023 -1...
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
111 Cmpd2 9;SdNH SdO SdssC SsCH3 SsNH...;7.418 22.984 -1.583 5.387 5.400...
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
112 ... ...
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
113 ... ..
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
114
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
115 Example of CSV *Text* file containing E-state indicies fingerprints
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
116 string data:
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
117
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
118 "CompoundID","EStateIndiciesFingerprints"
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
119 "Cmpd1","FingerprintsVector;EStateIndicies:ArbitrarySize;11;NumericalVa
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
120 lues;IDsAndValuesString;SaaCH SaasC SaasN SdO SdssC SsCH3 SsF SsOH SssC
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
121 H2 SssNH SsssCH;24.778 4.387 1.993 25.023 -1.435 3.975 14.006 29.759 -0
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
122 .073 3.024 -2.270"
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
123 "Cmpd2","FingerprintsVector;EStateIndicies:ArbitrarySize;9;NumericalVal
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
124 ues;IDsAndValuesString;SdNH SdO SdssC SsCH3 SsNH2 SsOH SssCH2 SssNH Sss
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
125 sCH;7.418 22.984 -1.583 5.387 5.400 19.852 1.737 5.624 -3.319"
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
126 ... ...
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
127 ... ...
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
128
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
129 The current release of MayaChemTools generates the following types of
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
130 E-state fingerprints vector strings:
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
131
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
132 FingerprintsVector;EStateIndicies:ArbitrarySize;11;NumericalValues;IDs
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
133 AndValuesString;SaaCH SaasC SaasN SdO SdssC SsCH3 SsF SsOH SssCH2 SssN
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
134 H SsssCH;24.778 4.387 1.993 25.023 -1.435 3.975 14.006 29.759 -0.073 3
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
135 .024 -2.270
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
136
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
137 FingerprintsVector;EStateIndicies:FixedSize;87;OrderedNumericalValues;
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
138 ValuesString;0 0 0 0 0 0 0 3.975 0 -0.073 0 0 24.778 -2.270 0 0 -1.435
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
139 4.387 0 0 0 0 0 0 3.024 0 0 0 0 0 0 0 1.993 0 29.759 25.023 0 0 0 0 1
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
140 4.006 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
141 0 0 0 0 0 0 0 0 0 0 0 0 0 0
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
142
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
143 FingerprintsVector;EStateIndicies:FixedSize;87;OrderedNumericalValues;
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
144 IDsAndValuesString;SsLi SssBe SssssBem SsBH2 SssBH SsssB SssssBm SsCH3
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
145 SdCH2 SssCH2 StCH SdsCH SaaCH SsssCH SddC StsC SdssC SaasC SaaaC Sssss
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
146 C SsNH3p SsNH2 SssNH2p SdNH SssNH SaaNH StN SsssNHp SdsN SaaN SsssN Sd
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
147 0 0 0 0 0 0 0 3.975 0 -0.073 0 0 24.778 -2.270 0 0 -1.435 4.387 0 0 0
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
148 0 0 0 3.024 0 0 0 0 0 0 0 1.993 0 29.759 25.023 0 0 0 0 14.006 0 0 0 0
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
149 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0...
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
150
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
151 OPTIONS
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
152 --AromaticityModel *MDLAromaticityModel | TriposAromaticityModel |
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
153 MMFFAromaticityModel | ChemAxonBasicAromaticityModel |
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
154 ChemAxonGeneralAromaticityModel | DaylightAromaticityModel |
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
155 MayaChemToolsAromaticityModel*
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
156 Specify aromaticity model to use during detection of aromaticity.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
157 Possible values in the current release are: *MDLAromaticityModel,
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
158 TriposAromaticityModel, MMFFAromaticityModel,
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
159 ChemAxonBasicAromaticityModel, ChemAxonGeneralAromaticityModel,
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
160 DaylightAromaticityModel or MayaChemToolsAromaticityModel*. Default
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
161 value: *MayaChemToolsAromaticityModel*.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
162
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
163 The supported aromaticity model names along with model specific
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
164 control parameters are defined in AromaticityModelsData.csv, which
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
165 is distributed with the current release and is available under
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
166 lib/data directory. Molecule.pm module retrieves data from this file
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
167 during class instantiation and makes it available to method
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
168 DetectAromaticity for detecting aromaticity corresponding to a
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
169 specific model.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
170
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
171 --CompoundID *DataFieldName or LabelPrefixString*
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
172 This value is --CompoundIDMode specific and indicates how compound
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
173 ID is generated.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
174
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
175 For *DataField* value of --CompoundIDMode option, it corresponds to
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
176 datafield label name whose value is used as compound ID; otherwise,
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
177 it's a prefix string used for generating compound IDs like
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
178 LabelPrefixString<Number>. Default value, *Cmpd*, generates compound
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
179 IDs which look like Cmpd<Number>.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
180
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
181 Examples for *DataField* value of --CompoundIDMode:
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
182
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
183 MolID
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
184 ExtReg
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
185
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
186 Examples for *LabelPrefix* or *MolNameOrLabelPrefix* value of
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
187 --CompoundIDMode:
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
188
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
189 Compound
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
190
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
191 The value specified above generates compound IDs which correspond to
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
192 Compound<Number> instead of default value of Cmpd<Number>.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
193
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
194 --CompoundIDLabel *text*
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
195 Specify compound ID column label for FP or CSV/TSV text file(s) used
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
196 during *CompoundID* value of --DataFieldsMode option. Default:
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
197 *CompoundID*.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
198
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
199 --CompoundIDMode *DataField | MolName | LabelPrefix |
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
200 MolNameOrLabelPrefix*
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
201 Specify how to generate compound IDs and write to FP or CSV/TSV text
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
202 file(s) along with generated fingerprints for *FP | text | all*
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
203 values of --output option: use a *SDFile(s)* datafield value; use
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
204 molname line from *SDFile(s)*; generate a sequential ID with
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
205 specific prefix; use combination of both MolName and LabelPrefix
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
206 with usage of LabelPrefix values for empty molname lines.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
207
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
208 Possible values: *DataField | MolName | LabelPrefix |
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
209 MolNameOrLabelPrefix*. Default: *LabelPrefix*.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
210
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
211 For *MolNameAndLabelPrefix* value of --CompoundIDMode, molname line
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
212 in *SDFile(s)* takes precedence over sequential compound IDs
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
213 generated using *LabelPrefix* and only empty molname values are
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
214 replaced with sequential compound IDs.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
215
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
216 This is only used for *CompoundID* value of --DataFieldsMode option.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
217
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
218 --DataFields *"FieldLabel1,FieldLabel2,..."*
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
219 Comma delimited list of *SDFiles(s)* data fields to extract and
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
220 write to CSV/TSV text file(s) along with generated fingerprints for
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
221 *text | all* values of --output option.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
222
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
223 This is only used for *Specify* value of --DataFieldsMode option.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
224
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
225 Examples:
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
226
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
227 Extreg
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
228 MolID,CompoundName
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
229
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
230 -d, --DataFieldsMode *All | Common | Specify | CompoundID*
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
231 Specify how data fields in *SDFile(s)* are transferred to output
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
232 CSV/TSV text file(s) along with generated fingerprints for *text |
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
233 all* values of --output option: transfer all SD data field; transfer
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
234 SD data files common to all compounds; extract specified data
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
235 fields; generate a compound ID using molname line, a compound
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
236 prefix, or a combination of both. Possible values: *All | Common |
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
237 specify | CompoundID*. Default value: *CompoundID*.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
238
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
239 -e, --EStateAtomTypesSetToUse *ArbitrarySize | FixedSize*
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
240 E-state atom types set size to use during generation of E-state
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
241 indicies fingerprints. Possible values: *ArbitrarySize | FixedSize*;
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
242 Default value: *ArbitrarySize*.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
243
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
244 *ArbitrarySize* corrresponds to only E-state atom types detected in
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
245 molecule; *FixedSize* corresponds to fixed number of previously
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
246 defined E-state atom types.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
247
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
248 For *EStateIndicies*, a fingerprint vector string is generated. The
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
249 vector string corresponding to *EStateIndicies* contains sum of
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
250 E-state values for E-state atom types.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
251
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
252 Module AtomTypes::EStateAtomTypes.pm is used to assign E-state atom
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
253 types to non-hydrogen atoms in the molecule which is able to assign
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
254 atom types to any valid atom group. However, for *FixedSize* value
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
255 of EStateAtomTypesSetToUse, only a fixed set of E-state atom types
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
256 corresponding to specific atom groups [ Appendix III in Ref 77 ] are
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
257 used for fingerprints.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
258
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
259 The fixed size E-state atom type set size used during generation of
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
260 fingerprints contains 87 E-state non-hydrogen atom types in
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
261 EStateAtomTypes.csv data file distributed with MayaChemTools.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
262
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
263 -f, --Filter *Yes | No*
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
264 Specify whether to check and filter compound data in SDFile(s).
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
265 Possible values: *Yes or No*. Default value: *Yes*.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
266
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
267 By default, compound data is checked before calculating fingerprints
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
268 and compounds containing atom data corresponding to non-element
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
269 symbols or no atom data are ignored.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
270
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
271 --FingerprintsLabelMode *FingerprintsLabelOnly |
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
272 FingerprintsLabelWithIDs*
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
273 Specify how fingerprints label is generated in conjunction with
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
274 --FingerprintsLabel option value: use fingerprints label generated
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
275 only by --FingerprintsLabel option value or append E-state atom type
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
276 value IDs to --FingerprintsLabel option value.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
277
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
278 Possible values: *FingerprintsLabelOnly | FingerprintsLabelWithIDs*.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
279 Default value: *FingerprintsLabelOnly*.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
280
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
281 This option is only used for *FixedSize* value of -e,
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
282 --EStateAtomTypesSetToUse option during generation of
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
283 *EStateIndicies* E-state fingerprints.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
284
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
285 E-state atom type IDs appended to --FingerprintsLabel value during
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
286 *FingerprintsLabelWithIDs* values of --FingerprintsLabelMode
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
287 correspond to fixed number of previously defined E-state atom types.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
288
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
289 --FingerprintsLabel *text*
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
290 SD data label or text file column label to use for fingerprints
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
291 string in output SD or CSV/TSV text file(s) specified by --output.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
292 Default value: *EStateIndiciesFingerprints*.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
293
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
294 -h, --help
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
295 Print this help message.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
296
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
297 -k, --KeepLargestComponent *Yes | No*
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
298 Generate fingerprints for only the largest component in molecule.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
299 Possible values: *Yes or No*. Default value: *Yes*.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
300
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
301 For molecules containing multiple connected components, fingerprints
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
302 can be generated in two different ways: use all connected components
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
303 or just the largest connected component. By default, all atoms
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
304 except for the largest connected component are deleted before
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
305 generation of fingerprints.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
306
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
307 --OutDelim *comma | tab | semicolon*
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
308 Delimiter for output CSV/TSV text file(s). Possible values: *comma,
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
309 tab, or semicolon* Default value: *comma*.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
310
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
311 --output *SD | FP | text | all*
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
312 Type of output files to generate. Possible values: *SD, FP, text, or
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
313 all*. Default value: *text*.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
314
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
315 -o, --overwrite
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
316 Overwrite existing files.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
317
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
318 -q, --quote *Yes | No*
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
319 Put quote around column values in output CSV/TSV text file(s).
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
320 Possible values: *Yes or No*. Default value: *Yes*.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
321
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
322 -r, --root *RootName*
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
323 New file name is generated using the root: <Root>.<Ext>. Default for
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
324 new file names: <SDFileName><EStateIndiciesFP>.<Ext>. The file type
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
325 determines <Ext> value. The sdf, fpf, csv, and tsv <Ext> values are
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
326 used for SD, FP, comma/semicolon, and tab delimited text files,
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
327 respectively.This option is ignored for multiple input files.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
328
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
329 --ValuesPrecision *number*
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
330 Precision of values for E-state indicies option. Default value: up
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
331 to *3* decimal places. Valid values: positive integers.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
332
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
333 -v, --VectorStringFormat *ValuesString | IDsAndValuesString |
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
334 IDsAndValuesPairsString | ValuesAndIDsString | ValuesAndIDsPairsString*
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
335 Format of fingerprints vector string data in output SD, FP or
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
336 CSV/TSV text file(s) specified by --output used for
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
337 *EStateIndicies*. Possible values: *ValuesString,
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
338 IDsAndValuesString, IDsAndValuesPairsString, ValuesAndIDsString,
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
339 ValuesAndIDsPairsString*.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
340
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
341 Default value during *ArbitrarySize* value of -e,
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
342 --EStateAtomTypesSetToUse option: *IDsAndValuesString*. Default
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
343 value during *FixedSize* value of -e, --EStateAtomTypesSetToUse
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
344 option: *ValuesString*.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
345
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
346 Examples:
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
347
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
348 FingerprintsVector;EStateIndicies:ArbitrarySize;11;NumericalValues;IDs
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
349 AndValuesString;SaaCH SaasC SaasN SdO SdssC SsCH3 SsF SsOH SssCH2 SssN
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
350 H SsssCH;24.778 4.387 1.993 25.023 -1.435 3.975 14.006 29.759 -0.073 3
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
351 .024 -2.270
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
352
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
353 -w, --WorkingDir *DirName*
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
354 Location of working directory. Default: current directory.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
355
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
356 EXAMPLES
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
357 To generate E-state fingerprints of arbitrary size in vector string
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
358 format and create a SampleESFP.csv file containing sequential compound
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
359 IDs along with fingerprints vector strings data, type:
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
360
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
361 % EStateIndiciesFingerprints.pl -r SampleESFP -o Sample.sdf
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
362
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
363 To generate E-state fingerprints of fixed size in vector string format
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
364 and create a SampleESFP.csv file containing sequential compound IDs
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
365 along with fingerprints vector strings data, type:
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
366
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
367 % EStateIndiciesFingerprints.pl -e FixedSize -r SampleESFP
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
368 -o Sample.sdf
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
369
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
370 To generate E-state fingerprints of fixed size in vector string with
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
371 IDsAndValues format and create a SampleESFP.csv file containing
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
372 sequential compound IDs along with fingerprints vector strings data,
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
373 type:
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
374
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
375 % EStateIndiciesFingerprints.pl -e FixedSize -v IDsAndValuesString
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
376 -r SampleESFP -o Sample.sdf
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
377
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
378 To generate E-state fingerprints of fixed size in vector string format
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
379 and create a SampleESFP.csv file containing compound ID from molecule
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
380 name line along with fingerprints vector strings data, type
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
381
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
382 % EStateIndiciesFingerprints.pl -e FixedSize
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
383 --DataFieldsMode CompoundID --CompoundIDMode MolName
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
384 -r SampleESFP -o Sample.sdf
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
385
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
386 To generate E-state fingerprints of fixed size in vector string format
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
387 and create a SampleESFP.csv file containing compound IDs using specified
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
388 data field along with fingerprints vector strings data, type:
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
389
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
390 % EStateIndiciesFingerprints.pl -e FixedSize
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
391 --DataFieldsMode CompoundID --CompoundIDMode DataField --CompoundID
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
392 Mol_ID -r SampleESFP -o Sample.sdf
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
393
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
394 To generate E-state fingerprints of fixed size in vector string format
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
395 and create a SampleESFP.csv file containing compound ID using
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
396 combination of molecule name line and an explicit compound prefix along
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
397 with fingerprints vector strings data, type:
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
398
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
399 % EStateIndiciesFingerprints.pl -e FixedSize
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
400 --DataFieldsMode CompoundID --CompoundIDMode MolnameOrLabelPrefix
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
401 --CompoundID Cmpd --CompoundIDLabel MolID -r SampleESFP -o Sample.sdf
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
402
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
403 To generate E-state fingerprints of fixed size in vector string format
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
404 and create a SampleESFP.csv file containing specific data fields columns
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
405 along with fingerprints vector strings data, type:
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
406
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
407 % EStateIndiciesFingerprints.pl -e FixedSize
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
408 --DataFieldsMode Specify --DataFields Mol_ID -r SampleESFP
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
409 -o Sample.sdf
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
410
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
411 To generate E-state fingerprints of fixed size in vector string format
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
412 and create a SampleESFP.csv file containing common data fields columns
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
413 along with fingerprints vector strings data, type:
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
414
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
415 % EStateIndiciesFingerprints.pl -e FixedSize
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
416 --DataFieldsMode Common -r SampleESFP -o Sample.sdf
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
417
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
418 To generate E-state fingerprints of fixed size in vector string format
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
419 and create SampleESFP.sdf, SampleESFP.fpf, and SampleESFP.csv files
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
420 containing all data fields columns in CSV file along with fingerprints
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
421 vector strings data, type:
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
422
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
423 % EStateIndiciesFingerprints.pl -e FixedSize
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
424 --DataFieldsMode All --output all -r SampleESFP -o Sample.sdf
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
425
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
426 AUTHOR
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
427 Manish Sud <msud@san.rr.com>
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
428
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
429 SEE ALSO
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
430 InfoFingerprintsFiles.pl, SimilarityMatricesFingerprints.pl,
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
431 AtomNeighborhoodsFingerprints.pl, ExtendedConnectivityFingerprints.pl,
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
432 MACCSKeysFingeprints.pl, PathLengthFingerprints.pl,
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
433 TopologicalAtomPairsFingerprints.pl,
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
434 TopologicalAtomTorsionsFingerprints.pl,
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
435 TopologicalPharmacophoreAtomPairsFingerprints.pl,
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
436 TopologicalPharmacophoreAtomTripletsFingerprints.pl
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
437
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
438 COPYRIGHT
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
439 Copyright (C) 2015 Manish Sud. All rights reserved.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
440
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
441 This file is part of MayaChemTools.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
442
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
443 MayaChemTools is free software; you can redistribute it and/or modify it
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
444 under the terms of the GNU Lesser General Public License as published by
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
445 the Free Software Foundation; either version 3 of the License, or (at
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
446 your option) any later version.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
447