annotate docs/scripts/txt/EStateIndiciesFingerprints.txt @ 3:90ea638ce878 draft default tip

Uploaded
author deepakjadmin
date Wed, 20 Jan 2016 09:11:59 -0500
parents 2abf0d43254d
children
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
1
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
1 NAME
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
2 EStateIndiciesFingerprints.pl - Generate E-state indicies fingerprints
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
3 for SD files
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
4
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
5 SYNOPSIS
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
6 EStateIndiciesFingerprints.pl SDFile(s)...
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
7
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
8 EStateIndiciesFingerprints.pl [--AromaticityModel
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
9 *AromaticityModelType*] [--CompoundID *DataFieldName or
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
10 LabelPrefixString*] [--CompoundIDLabel *text*] [--CompoundIDMode
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
11 *DataField | MolName | LabelPrefix | MolNameOrLabelPrefix*]
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
12 [--DataFields *"FieldLabel1,FieldLabel2,..."*] [-d, --DataFieldsMode
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
13 *All | Common | Specify | CompoundID*] [-e, --EStateAtomTypesSetToUse
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
14 *ArbitrarySize or FixedSize*] [-f, --Filter *Yes | No*]
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
15 [--FingerprintsLabelMode *FingerprintsLabelOnly |
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
16 FingerprintsLabelWithIDs*] [--FingerprintsLabel *text*] [-h, --help]
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
17 [-k, --KeepLargestComponent *Yes | No*] [--OutDelim *comma | tab |
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
18 semicolon*] [--output *SD | FP | text | all*] [-o, --overwrite] [-q,
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
19 --quote *Yes | No*] [-r, --root *RootName*] [-s, --size *number*]
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
20 [--ValuesPrecision *number*] [-v, --VectorStringFormat
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
21 *IDsAndValuesString | IDsAndValuesPairsString | ValuesAndIDsString |
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
22 ValuesAndIDsPairsString*] [-w, --WorkingDir *DirName*]
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
23
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
24 DESCRIPTION
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
25 Generate E-state indicies fingerprints [ Ref 75-78 ] for *SDFile(s)* and
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
26 create appropriate SD, FP, or CSV/TSV text file(s) containing
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
27 fingerprints bit-vector or vector strings corresponding to molecular
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
28 fingerprints.
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
29
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
30 Multiple SDFile names are separated by spaces. The valid file extensions
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
31 are *.sdf* and *.sd*. All other file names are ignored. All the SD files
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
32 in a current directory can be specified either by **.sdf* or the current
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
33 directory name.
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
34
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
35 E-state atom types are assigned to all non-hydrogen atoms in a molecule
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
36 using module AtomTypes::EStateAtomTypes.pm and E-state values are
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
37 calculated using module AtomicDescriptors::EStateValues.pm. Using
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
38 E-state atom types and E-state values, EStateIndiciesFingerprints
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
39 constituting sum of E-state values for E-sate atom types is generated.
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
40
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
41 Two types of E-state atom types set size are allowed:
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
42
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
43 ArbitrarySize - Corresponds to only E-state atom types detected
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
44 in molecule
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
45 FixedSize - Corresponds to fixed number of E-state atom types previously
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
46 defined
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
47
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
48 Module AtomTypes::EStateAtomTypes.pm, used to assign E-state atom types
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
49 to non-hydrogen atoms in the molecule, is able to assign atom types to
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
50 any valid atom group. However, for *FixedSize* value of
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
51 EStateAtomTypesSetToUse, only a fixed set of E-state atom types
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
52 corresponding to specific atom groups [ Appendix III in Ref 77 ] are
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
53 used for fingerprints.
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
54
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
55 The fixed size E-state atom type set size used during generation of
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
56 fingerprints contains 87 E-state non-hydrogen atom types in
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
57 EStateAtomTypes.csv data file distributed with MayaChemTools.
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
58
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
59 Combination of Type and EStateAtomTypesSetToUse allow generation of 2
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
60 different types of E-state indicies fingerprints:
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
61
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
62 Type EStateAtomTypesSetToUse
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
63
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
64 EStateIndicies ArbitrarySize [ default fingerprints ]
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
65 EStateIndicies FixedSize
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
66
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
67 Example of *SD* file containing E-state indicies fingerprints string
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
68 data:
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
69
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
70 ... ...
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
71 ... ...
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
72 $$$$
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
73 ... ...
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
74 ... ...
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
75 ... ...
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
76 41 44 0 0 0 0 0 0 0 0999 V2000
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
77 -3.3652 1.4499 0.0000 C 0 0 0 0 0 0 0 0 0 0 0 0
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
78 ... ...
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
79 2 3 1 0 0 0 0
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
80 ... ...
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
81 M END
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
82 > <CmpdID>
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
83 Cmpd1
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
84
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
85 > <EStateIndiciesFingerprints>
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
86 FingerprintsVector;EStateIndicies:ArbitrarySize;11;NumericalValues;IDsA
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
87 ndValuesString;SaaCH SaasC SaasN SdO SdssC SsCH3 SsF SsOH SssCH2 SssNH
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
88 SsssCH;24.778 4.387 1.993 25.023 -1.435 3.975 14.006 29.759 -0.073 3.02
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
89 4 -2.270
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
90
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
91 $$$$
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
92 ... ...
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
93 ... ...
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
94
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
95 Example of *FP* file containing E-state indicies fingerprints string
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
96 data:
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
97
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
98 #
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
99 # Package = MayaChemTools 7.4
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
100 # Release Date = Oct 21, 2010
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
101 #
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
102 # TimeStamp = Fri Mar 11 14:35:11 2011
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
103 #
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
104 # FingerprintsStringType = FingerprintsVector
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
105 #
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
106 # Description = EStateIndicies:ArbitrarySize
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
107 # VectorStringFormat = IDsAndValuesString
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
108 # VectorValuesType = NumericalValues
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
109 #
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
110 Cmpd1 11;SaaCH SaasC SaasN SdO SdssC...;24.778 4.387 1.993 25.023 -1...
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
111 Cmpd2 9;SdNH SdO SdssC SsCH3 SsNH...;7.418 22.984 -1.583 5.387 5.400...
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
112 ... ...
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
113 ... ..
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
114
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
115 Example of CSV *Text* file containing E-state indicies fingerprints
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
116 string data:
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
117
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
118 "CompoundID","EStateIndiciesFingerprints"
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
119 "Cmpd1","FingerprintsVector;EStateIndicies:ArbitrarySize;11;NumericalVa
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
120 lues;IDsAndValuesString;SaaCH SaasC SaasN SdO SdssC SsCH3 SsF SsOH SssC
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
121 H2 SssNH SsssCH;24.778 4.387 1.993 25.023 -1.435 3.975 14.006 29.759 -0
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
122 .073 3.024 -2.270"
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
123 "Cmpd2","FingerprintsVector;EStateIndicies:ArbitrarySize;9;NumericalVal
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
124 ues;IDsAndValuesString;SdNH SdO SdssC SsCH3 SsNH2 SsOH SssCH2 SssNH Sss
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
125 sCH;7.418 22.984 -1.583 5.387 5.400 19.852 1.737 5.624 -3.319"
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
126 ... ...
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
127 ... ...
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
128
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
129 The current release of MayaChemTools generates the following types of
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
130 E-state fingerprints vector strings:
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
131
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
132 FingerprintsVector;EStateIndicies:ArbitrarySize;11;NumericalValues;IDs
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
133 AndValuesString;SaaCH SaasC SaasN SdO SdssC SsCH3 SsF SsOH SssCH2 SssN
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
134 H SsssCH;24.778 4.387 1.993 25.023 -1.435 3.975 14.006 29.759 -0.073 3
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
135 .024 -2.270
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
136
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
137 FingerprintsVector;EStateIndicies:FixedSize;87;OrderedNumericalValues;
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
138 ValuesString;0 0 0 0 0 0 0 3.975 0 -0.073 0 0 24.778 -2.270 0 0 -1.435
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
139 4.387 0 0 0 0 0 0 3.024 0 0 0 0 0 0 0 1.993 0 29.759 25.023 0 0 0 0 1
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
140 4.006 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
141 0 0 0 0 0 0 0 0 0 0 0 0 0 0
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
142
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
143 FingerprintsVector;EStateIndicies:FixedSize;87;OrderedNumericalValues;
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
144 IDsAndValuesString;SsLi SssBe SssssBem SsBH2 SssBH SsssB SssssBm SsCH3
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
145 SdCH2 SssCH2 StCH SdsCH SaaCH SsssCH SddC StsC SdssC SaasC SaaaC Sssss
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
146 C SsNH3p SsNH2 SssNH2p SdNH SssNH SaaNH StN SsssNHp SdsN SaaN SsssN Sd
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
147 0 0 0 0 0 0 0 3.975 0 -0.073 0 0 24.778 -2.270 0 0 -1.435 4.387 0 0 0
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
148 0 0 0 3.024 0 0 0 0 0 0 0 1.993 0 29.759 25.023 0 0 0 0 14.006 0 0 0 0
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
149 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0...
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
150
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
151 OPTIONS
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
152 --AromaticityModel *MDLAromaticityModel | TriposAromaticityModel |
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
153 MMFFAromaticityModel | ChemAxonBasicAromaticityModel |
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
154 ChemAxonGeneralAromaticityModel | DaylightAromaticityModel |
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
155 MayaChemToolsAromaticityModel*
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
156 Specify aromaticity model to use during detection of aromaticity.
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
157 Possible values in the current release are: *MDLAromaticityModel,
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
158 TriposAromaticityModel, MMFFAromaticityModel,
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
159 ChemAxonBasicAromaticityModel, ChemAxonGeneralAromaticityModel,
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
160 DaylightAromaticityModel or MayaChemToolsAromaticityModel*. Default
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
161 value: *MayaChemToolsAromaticityModel*.
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
162
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
163 The supported aromaticity model names along with model specific
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
164 control parameters are defined in AromaticityModelsData.csv, which
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
165 is distributed with the current release and is available under
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
166 lib/data directory. Molecule.pm module retrieves data from this file
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
167 during class instantiation and makes it available to method
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
168 DetectAromaticity for detecting aromaticity corresponding to a
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
169 specific model.
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
170
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
171 --CompoundID *DataFieldName or LabelPrefixString*
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
172 This value is --CompoundIDMode specific and indicates how compound
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
173 ID is generated.
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
174
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
175 For *DataField* value of --CompoundIDMode option, it corresponds to
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
176 datafield label name whose value is used as compound ID; otherwise,
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
177 it's a prefix string used for generating compound IDs like
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
178 LabelPrefixString<Number>. Default value, *Cmpd*, generates compound
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
179 IDs which look like Cmpd<Number>.
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
180
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
181 Examples for *DataField* value of --CompoundIDMode:
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
182
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
183 MolID
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
184 ExtReg
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
185
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
186 Examples for *LabelPrefix* or *MolNameOrLabelPrefix* value of
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
187 --CompoundIDMode:
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
188
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
189 Compound
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
190
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
191 The value specified above generates compound IDs which correspond to
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
192 Compound<Number> instead of default value of Cmpd<Number>.
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
193
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
194 --CompoundIDLabel *text*
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
195 Specify compound ID column label for FP or CSV/TSV text file(s) used
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
196 during *CompoundID* value of --DataFieldsMode option. Default:
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
197 *CompoundID*.
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
198
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
199 --CompoundIDMode *DataField | MolName | LabelPrefix |
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
200 MolNameOrLabelPrefix*
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
201 Specify how to generate compound IDs and write to FP or CSV/TSV text
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
202 file(s) along with generated fingerprints for *FP | text | all*
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
203 values of --output option: use a *SDFile(s)* datafield value; use
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
204 molname line from *SDFile(s)*; generate a sequential ID with
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
205 specific prefix; use combination of both MolName and LabelPrefix
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
206 with usage of LabelPrefix values for empty molname lines.
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
207
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
208 Possible values: *DataField | MolName | LabelPrefix |
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
209 MolNameOrLabelPrefix*. Default: *LabelPrefix*.
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
210
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
211 For *MolNameAndLabelPrefix* value of --CompoundIDMode, molname line
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
212 in *SDFile(s)* takes precedence over sequential compound IDs
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
213 generated using *LabelPrefix* and only empty molname values are
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
214 replaced with sequential compound IDs.
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
215
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
216 This is only used for *CompoundID* value of --DataFieldsMode option.
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
217
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
218 --DataFields *"FieldLabel1,FieldLabel2,..."*
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
219 Comma delimited list of *SDFiles(s)* data fields to extract and
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
220 write to CSV/TSV text file(s) along with generated fingerprints for
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
221 *text | all* values of --output option.
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
222
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
223 This is only used for *Specify* value of --DataFieldsMode option.
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
224
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
225 Examples:
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
226
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
227 Extreg
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
228 MolID,CompoundName
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
229
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
230 -d, --DataFieldsMode *All | Common | Specify | CompoundID*
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
231 Specify how data fields in *SDFile(s)* are transferred to output
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
232 CSV/TSV text file(s) along with generated fingerprints for *text |
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
233 all* values of --output option: transfer all SD data field; transfer
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
234 SD data files common to all compounds; extract specified data
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
235 fields; generate a compound ID using molname line, a compound
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
236 prefix, or a combination of both. Possible values: *All | Common |
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
237 specify | CompoundID*. Default value: *CompoundID*.
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
238
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
239 -e, --EStateAtomTypesSetToUse *ArbitrarySize | FixedSize*
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
240 E-state atom types set size to use during generation of E-state
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
241 indicies fingerprints. Possible values: *ArbitrarySize | FixedSize*;
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
242 Default value: *ArbitrarySize*.
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
243
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
244 *ArbitrarySize* corrresponds to only E-state atom types detected in
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
245 molecule; *FixedSize* corresponds to fixed number of previously
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
246 defined E-state atom types.
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
247
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
248 For *EStateIndicies*, a fingerprint vector string is generated. The
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
249 vector string corresponding to *EStateIndicies* contains sum of
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
250 E-state values for E-state atom types.
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
251
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
252 Module AtomTypes::EStateAtomTypes.pm is used to assign E-state atom
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
253 types to non-hydrogen atoms in the molecule which is able to assign
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
254 atom types to any valid atom group. However, for *FixedSize* value
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
255 of EStateAtomTypesSetToUse, only a fixed set of E-state atom types
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
256 corresponding to specific atom groups [ Appendix III in Ref 77 ] are
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
257 used for fingerprints.
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
258
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
259 The fixed size E-state atom type set size used during generation of
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
260 fingerprints contains 87 E-state non-hydrogen atom types in
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
261 EStateAtomTypes.csv data file distributed with MayaChemTools.
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
262
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
263 -f, --Filter *Yes | No*
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
264 Specify whether to check and filter compound data in SDFile(s).
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
265 Possible values: *Yes or No*. Default value: *Yes*.
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
266
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
267 By default, compound data is checked before calculating fingerprints
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
268 and compounds containing atom data corresponding to non-element
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
269 symbols or no atom data are ignored.
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
270
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
271 --FingerprintsLabelMode *FingerprintsLabelOnly |
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
272 FingerprintsLabelWithIDs*
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
273 Specify how fingerprints label is generated in conjunction with
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
274 --FingerprintsLabel option value: use fingerprints label generated
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
275 only by --FingerprintsLabel option value or append E-state atom type
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
276 value IDs to --FingerprintsLabel option value.
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
277
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
278 Possible values: *FingerprintsLabelOnly | FingerprintsLabelWithIDs*.
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
279 Default value: *FingerprintsLabelOnly*.
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
280
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
281 This option is only used for *FixedSize* value of -e,
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
282 --EStateAtomTypesSetToUse option during generation of
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
283 *EStateIndicies* E-state fingerprints.
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
284
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
285 E-state atom type IDs appended to --FingerprintsLabel value during
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
286 *FingerprintsLabelWithIDs* values of --FingerprintsLabelMode
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
287 correspond to fixed number of previously defined E-state atom types.
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
288
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
289 --FingerprintsLabel *text*
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
290 SD data label or text file column label to use for fingerprints
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
291 string in output SD or CSV/TSV text file(s) specified by --output.
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
292 Default value: *EStateIndiciesFingerprints*.
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
293
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
294 -h, --help
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
295 Print this help message.
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
296
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
297 -k, --KeepLargestComponent *Yes | No*
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
298 Generate fingerprints for only the largest component in molecule.
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
299 Possible values: *Yes or No*. Default value: *Yes*.
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
300
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
301 For molecules containing multiple connected components, fingerprints
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
302 can be generated in two different ways: use all connected components
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
303 or just the largest connected component. By default, all atoms
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
304 except for the largest connected component are deleted before
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
305 generation of fingerprints.
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
306
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
307 --OutDelim *comma | tab | semicolon*
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
308 Delimiter for output CSV/TSV text file(s). Possible values: *comma,
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
309 tab, or semicolon* Default value: *comma*.
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
310
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
311 --output *SD | FP | text | all*
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
312 Type of output files to generate. Possible values: *SD, FP, text, or
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
313 all*. Default value: *text*.
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
314
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
315 -o, --overwrite
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
316 Overwrite existing files.
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
317
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
318 -q, --quote *Yes | No*
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
319 Put quote around column values in output CSV/TSV text file(s).
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
320 Possible values: *Yes or No*. Default value: *Yes*.
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
321
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
322 -r, --root *RootName*
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
323 New file name is generated using the root: <Root>.<Ext>. Default for
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
324 new file names: <SDFileName><EStateIndiciesFP>.<Ext>. The file type
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
325 determines <Ext> value. The sdf, fpf, csv, and tsv <Ext> values are
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
326 used for SD, FP, comma/semicolon, and tab delimited text files,
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
327 respectively.This option is ignored for multiple input files.
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
328
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
329 --ValuesPrecision *number*
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
330 Precision of values for E-state indicies option. Default value: up
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
331 to *3* decimal places. Valid values: positive integers.
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
332
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
333 -v, --VectorStringFormat *ValuesString | IDsAndValuesString |
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
334 IDsAndValuesPairsString | ValuesAndIDsString | ValuesAndIDsPairsString*
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
335 Format of fingerprints vector string data in output SD, FP or
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
336 CSV/TSV text file(s) specified by --output used for
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
337 *EStateIndicies*. Possible values: *ValuesString,
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
338 IDsAndValuesString, IDsAndValuesPairsString, ValuesAndIDsString,
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
339 ValuesAndIDsPairsString*.
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
340
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
341 Default value during *ArbitrarySize* value of -e,
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
342 --EStateAtomTypesSetToUse option: *IDsAndValuesString*. Default
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
343 value during *FixedSize* value of -e, --EStateAtomTypesSetToUse
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
344 option: *ValuesString*.
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
345
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
346 Examples:
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
347
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
348 FingerprintsVector;EStateIndicies:ArbitrarySize;11;NumericalValues;IDs
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
349 AndValuesString;SaaCH SaasC SaasN SdO SdssC SsCH3 SsF SsOH SssCH2 SssN
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
350 H SsssCH;24.778 4.387 1.993 25.023 -1.435 3.975 14.006 29.759 -0.073 3
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
351 .024 -2.270
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
352
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
353 -w, --WorkingDir *DirName*
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
354 Location of working directory. Default: current directory.
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
355
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
356 EXAMPLES
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
357 To generate E-state fingerprints of arbitrary size in vector string
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
358 format and create a SampleESFP.csv file containing sequential compound
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
359 IDs along with fingerprints vector strings data, type:
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
360
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
361 % EStateIndiciesFingerprints.pl -r SampleESFP -o Sample.sdf
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
362
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
363 To generate E-state fingerprints of fixed size in vector string format
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
364 and create a SampleESFP.csv file containing sequential compound IDs
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
365 along with fingerprints vector strings data, type:
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
366
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
367 % EStateIndiciesFingerprints.pl -e FixedSize -r SampleESFP
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
368 -o Sample.sdf
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
369
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
370 To generate E-state fingerprints of fixed size in vector string with
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
371 IDsAndValues format and create a SampleESFP.csv file containing
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
372 sequential compound IDs along with fingerprints vector strings data,
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
373 type:
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
374
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
375 % EStateIndiciesFingerprints.pl -e FixedSize -v IDsAndValuesString
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
376 -r SampleESFP -o Sample.sdf
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
377
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
378 To generate E-state fingerprints of fixed size in vector string format
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
379 and create a SampleESFP.csv file containing compound ID from molecule
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
380 name line along with fingerprints vector strings data, type
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
381
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
382 % EStateIndiciesFingerprints.pl -e FixedSize
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
383 --DataFieldsMode CompoundID --CompoundIDMode MolName
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
384 -r SampleESFP -o Sample.sdf
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
385
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
386 To generate E-state fingerprints of fixed size in vector string format
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
387 and create a SampleESFP.csv file containing compound IDs using specified
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
388 data field along with fingerprints vector strings data, type:
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
389
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
390 % EStateIndiciesFingerprints.pl -e FixedSize
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
391 --DataFieldsMode CompoundID --CompoundIDMode DataField --CompoundID
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
392 Mol_ID -r SampleESFP -o Sample.sdf
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
393
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
394 To generate E-state fingerprints of fixed size in vector string format
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
395 and create a SampleESFP.csv file containing compound ID using
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
396 combination of molecule name line and an explicit compound prefix along
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
397 with fingerprints vector strings data, type:
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
398
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
399 % EStateIndiciesFingerprints.pl -e FixedSize
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
400 --DataFieldsMode CompoundID --CompoundIDMode MolnameOrLabelPrefix
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
401 --CompoundID Cmpd --CompoundIDLabel MolID -r SampleESFP -o Sample.sdf
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
402
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
403 To generate E-state fingerprints of fixed size in vector string format
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
404 and create a SampleESFP.csv file containing specific data fields columns
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
405 along with fingerprints vector strings data, type:
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
406
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
407 % EStateIndiciesFingerprints.pl -e FixedSize
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
408 --DataFieldsMode Specify --DataFields Mol_ID -r SampleESFP
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
409 -o Sample.sdf
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
410
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
411 To generate E-state fingerprints of fixed size in vector string format
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
412 and create a SampleESFP.csv file containing common data fields columns
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
413 along with fingerprints vector strings data, type:
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
414
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
415 % EStateIndiciesFingerprints.pl -e FixedSize
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
416 --DataFieldsMode Common -r SampleESFP -o Sample.sdf
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
417
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
418 To generate E-state fingerprints of fixed size in vector string format
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
419 and create SampleESFP.sdf, SampleESFP.fpf, and SampleESFP.csv files
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
420 containing all data fields columns in CSV file along with fingerprints
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
421 vector strings data, type:
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
422
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
423 % EStateIndiciesFingerprints.pl -e FixedSize
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
424 --DataFieldsMode All --output all -r SampleESFP -o Sample.sdf
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
425
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
426 AUTHOR
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
427 Manish Sud <msud@san.rr.com>
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
428
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
429 SEE ALSO
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
430 InfoFingerprintsFiles.pl, SimilarityMatricesFingerprints.pl,
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
431 AtomNeighborhoodsFingerprints.pl, ExtendedConnectivityFingerprints.pl,
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
432 MACCSKeysFingeprints.pl, PathLengthFingerprints.pl,
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
433 TopologicalAtomPairsFingerprints.pl,
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
434 TopologicalAtomTorsionsFingerprints.pl,
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
435 TopologicalPharmacophoreAtomPairsFingerprints.pl,
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
436 TopologicalPharmacophoreAtomTripletsFingerprints.pl
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
437
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
438 COPYRIGHT
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
439 Copyright (C) 2015 Manish Sud. All rights reserved.
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
440
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
441 This file is part of MayaChemTools.
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
442
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
443 MayaChemTools is free software; you can redistribute it and/or modify it
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
444 under the terms of the GNU Lesser General Public License as published by
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
445 the Free Software Foundation; either version 3 of the License, or (at
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
446 your option) any later version.
2abf0d43254d Uploaded
deepakjadmin
parents:
diff changeset
447