annotate docs/scripts/txt/EStateIndiciesFingerprints.txt @ 0:4816e4a8ae95 draft default tip

Uploaded
author deepakjadmin
date Wed, 20 Jan 2016 09:23:18 -0500

NAME
    EStateIndiciesFingerprints.pl - Generate E-state indices fingerprints
    for SD files

SYNOPSIS
    EStateIndiciesFingerprints.pl SDFile(s)...

    EStateIndiciesFingerprints.pl [--AromaticityModel
    *AromaticityModelType*] [--CompoundID *DataFieldName or
    LabelPrefixString*] [--CompoundIDLabel *text*] [--CompoundIDMode
    *DataField | MolName | LabelPrefix | MolNameOrLabelPrefix*]
    [--DataFields *"FieldLabel1,FieldLabel2,..."*] [-d, --DataFieldsMode
    *All | Common | Specify | CompoundID*] [-e, --EStateAtomTypesSetToUse
    *ArbitrarySize or FixedSize*] [-f, --Filter *Yes | No*]
    [--FingerprintsLabelMode *FingerprintsLabelOnly |
    FingerprintsLabelWithIDs*] [--FingerprintsLabel *text*] [-h, --help]
    [-k, --KeepLargestComponent *Yes | No*] [--OutDelim *comma | tab |
    semicolon*] [--output *SD | FP | text | all*] [-o, --overwrite] [-q,
    --quote *Yes | No*] [-r, --root *RootName*] [-s, --size *number*]
    [--ValuesPrecision *number*] [-v, --VectorStringFormat
    *IDsAndValuesString | IDsAndValuesPairsString | ValuesAndIDsString |
    ValuesAndIDsPairsString*] [-w, --WorkingDir *DirName*]

DESCRIPTION
    Generate E-state indices fingerprints [ Ref 75-78 ] for *SDFile(s)* and
    create appropriate SD, FP, or CSV/TSV text file(s) containing
    fingerprints bit-vector or vector strings corresponding to molecular
    fingerprints.

    Multiple SDFile names are separated by spaces. The valid file extensions
    are *.sdf* and *.sd*. All other file names are ignored. All the SD files
    in the current directory can be specified either by **.sdf* or the
    current directory name.

    E-state atom types are assigned to all non-hydrogen atoms in a molecule
    using module AtomTypes::EStateAtomTypes.pm and E-state values are
    calculated using module AtomicDescriptors::EStateValues.pm. From the
    E-state atom types and E-state values, an EStateIndiciesFingerprints
    vector is generated in which each component is the sum of the E-state
    values of the atoms assigned a given E-state atom type.
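
    The summation described above can be sketched in a few lines of Python.
    This is only an illustration, not the MayaChemTools implementation: the
    function name, the per-atom input pairs, and the numeric values are
    hypothetical, and sorting the IDs alphabetically is an assumption that
    merely appears consistent with the ID order in the example strings.

```python
from collections import defaultdict


def estate_indices(atoms):
    """Sum per-atom E-state values by E-state atom type.

    `atoms` is an iterable of (atom_type, estate_value) pairs, one per
    non-hydrogen atom. Both the input shape and this helper are hypothetical.
    """
    sums = defaultdict(float)
    for atom_type, value in atoms:
        sums[atom_type] += value
    # Assumption: IDs are reported in plain string sort order.
    return dict(sorted(sums.items()))


# Made-up per-atom assignments; not taken from a real molecule.
atoms = [("SsCH3", 1.950), ("SsCH3", 2.025), ("SdO", 11.250), ("SsOH", 8.500)]
fingerprint = estate_indices(atoms)
```

    Atom types that do not occur in the molecule simply never appear in the
    result, which corresponds to the *ArbitrarySize* behavior.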

    Two E-state atom types set sizes are allowed:

        ArbitrarySize - Corresponds to only E-state atom types detected
                        in molecule
        FixedSize - Corresponds to fixed number of E-state atom types
                    previously defined

    Module AtomTypes::EStateAtomTypes.pm, used to assign E-state atom types
    to non-hydrogen atoms in the molecule, is able to assign atom types to
    any valid atom group. However, for *FixedSize* value of
    EStateAtomTypesSetToUse, only a fixed set of E-state atom types
    corresponding to specific atom groups [ Appendix III in Ref 77 ] are
    used for fingerprints.

    The fixed-size E-state atom type set used during generation of
    fingerprints contains the 87 E-state non-hydrogen atom types listed in
    the EStateAtomTypes.csv data file distributed with MayaChemTools.
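
    The relationship between the two set sizes can be illustrated with a
    short Python sketch. It is not the MayaChemTools code: the function name
    is hypothetical, and the ordered ID list is only the first 20 of the 87
    IDs, copied from the *FixedSize* example vector string shown in this
    document. Atom types outside the supplied ordering are dropped, which
    is an assumption about how unlisted types would be handled.

```python
# First 20 IDs of the fixed-size ordering, copied from the FixedSize example
# vector string in this document. The real set has 87 entries; it is
# truncated here for brevity.
FIXED_IDS_PREFIX = [
    "SsLi", "SssBe", "SssssBem", "SsBH2", "SssBH", "SsssB", "SssssBm",
    "SsCH3", "SdCH2", "SssCH2", "StCH", "SdsCH", "SaaCH", "SsssCH",
    "SddC", "StsC", "SdssC", "SaasC", "SaaaC", "SssssC",
]


def to_fixed_size(arbitrary, ordered_ids):
    """Place arbitrary-size sums into fixed positions, using 0 where absent."""
    return [arbitrary.get(type_id, 0) for type_id in ordered_ids]


# Subset of the ArbitrarySize example values that fall in the first 20 slots.
arbitrary = {
    "SaaCH": 24.778, "SaasC": 4.387, "SdssC": -1.435,
    "SsCH3": 3.975, "SssCH2": -0.073, "SsssCH": -2.270,
}
fixed = to_fixed_size(arbitrary, FIXED_IDS_PREFIX)
```

    The resulting list reproduces the first 20 values of the *FixedSize*
    example string, which suggests that the fixed-size vector is the
    arbitrary-size one laid out over a predefined ID order.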

    The combination of Type and EStateAtomTypesSetToUse allows generation
    of two different types of E-state indices fingerprints:

        Type                 EStateAtomTypesSetToUse

        EStateIndicies       ArbitrarySize      [ default fingerprints ]
        EStateIndicies       FixedSize

    Example of *SD* file containing E-state indices fingerprints string
    data:

        ... ...
        ... ...
        $$$$
        ... ...
        ... ...
        ... ...
        41 44 0 0 0 0 0 0 0 0999 V2000
        -3.3652 1.4499 0.0000 C 0 0 0 0 0 0 0 0 0 0 0 0
        ... ...
        2 3 1 0 0 0 0
        ... ...
        M END
        > <CmpdID>
        Cmpd1

        > <EStateIndiciesFingerprints>
        FingerprintsVector;EStateIndicies:ArbitrarySize;11;NumericalValues;IDsA
        ndValuesString;SaaCH SaasC SaasN SdO SdssC SsCH3 SsF SsOH SssCH2 SssNH
        SsssCH;24.778 4.387 1.993 25.023 -1.435 3.975 14.006 29.759 -0.073 3.02
        4 -2.270

        $$$$
        ... ...
        ... ...

    Example of *FP* file containing E-state indices fingerprints string
    data:

        #
        # Package = MayaChemTools 7.4
        # Release Date = Oct 21, 2010
        #
        # TimeStamp = Fri Mar 11 14:35:11 2011
        #
        # FingerprintsStringType = FingerprintsVector
        #
        # Description = EStateIndicies:ArbitrarySize
        # VectorStringFormat = IDsAndValuesString
        # VectorValuesType = NumericalValues
        #
        Cmpd1 11;SaaCH SaasC SaasN SdO SdssC...;24.778 4.387 1.993 25.023 -1...
        Cmpd2 9;SdNH SdO SdssC SsCH3 SsNH...;7.418 22.984 -1.583 5.387 5.400...
        ... ...
        ... ..

    Example of CSV *Text* file containing E-state indices fingerprints
    string data:

        "CompoundID","EStateIndiciesFingerprints"
        "Cmpd1","FingerprintsVector;EStateIndicies:ArbitrarySize;11;NumericalVa
        lues;IDsAndValuesString;SaaCH SaasC SaasN SdO SdssC SsCH3 SsF SsOH SssC
        H2 SssNH SsssCH;24.778 4.387 1.993 25.023 -1.435 3.975 14.006 29.759 -0
        .073 3.024 -2.270"
        "Cmpd2","FingerprintsVector;EStateIndicies:ArbitrarySize;9;NumericalVal
        ues;IDsAndValuesString;SdNH SdO SdssC SsCH3 SsNH2 SsOH SssCH2 SssNH Sss
        sCH;7.418 22.984 -1.583 5.387 5.400 19.852 1.737 5.624 -3.319"
        ... ...
        ... ...
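
    A CSV text file of this shape can be read with the Python standard
    library, as the sketch below suggests. It assumes that each record
    occupies a single line in the real output (the wrapping in the example
    above is taken to be for display only) and that the default column
    labels are in use. The helper name `read_fingerprints` is hypothetical.

```python
import csv
import io

# One record copied from the CSV example above, joined onto a single line
# (assumption: the wrapping in the example is for display only).
csv_text = (
    '"CompoundID","EStateIndiciesFingerprints"\n'
    '"Cmpd2","FingerprintsVector;EStateIndicies:ArbitrarySize;9;'
    'NumericalValues;IDsAndValuesString;'
    'SdNH SdO SdssC SsCH3 SsNH2 SsOH SssCH2 SssNH SsssCH;'
    '7.418 22.984 -1.583 5.387 5.400 19.852 1.737 5.624 -3.319"\n'
)


def read_fingerprints(handle, id_column="CompoundID",
                      fp_column="EStateIndiciesFingerprints"):
    """Map each compound ID to its raw fingerprints vector string."""
    return {row[id_column]: row[fp_column] for row in csv.DictReader(handle)}


fingerprints = read_fingerprints(io.StringIO(csv_text))
# The third semicolon-separated field appears to hold the component count.
declared_size = int(fingerprints["Cmpd2"].split(";")[2])
```

    Because the fingerprints string itself contains semicolons and spaces,
    relying on the quoted CSV fields rather than splitting lines by hand
    keeps the parsing robust.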

    The current release of MayaChemTools generates the following types of
    E-state fingerprints vector strings:

        FingerprintsVector;EStateIndicies:ArbitrarySize;11;NumericalValues;IDs
        AndValuesString;SaaCH SaasC SaasN SdO SdssC SsCH3 SsF SsOH SssCH2 SssN
        H SsssCH;24.778 4.387 1.993 25.023 -1.435 3.975 14.006 29.759 -0.073 3
        .024 -2.270

        FingerprintsVector;EStateIndicies:FixedSize;87;OrderedNumericalValues;
        ValuesString;0 0 0 0 0 0 0 3.975 0 -0.073 0 0 24.778 -2.270 0 0 -1.435
        4.387 0 0 0 0 0 0 3.024 0 0 0 0 0 0 0 1.993 0 29.759 25.023 0 0 0 0 1
        4.006 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
        0 0 0 0 0 0 0 0 0 0 0 0 0 0

        FingerprintsVector;EStateIndicies:FixedSize;87;OrderedNumericalValues;
        IDsAndValuesString;SsLi SssBe SssssBem SsBH2 SssBH SsssB SssssBm SsCH3
        SdCH2 SssCH2 StCH SdsCH SaaCH SsssCH SddC StsC SdssC SaasC SaaaC Sssss
        C SsNH3p SsNH2 SssNH2p SdNH SssNH SaaNH StN SsssNHp SdsN SaaN SsssN Sd
        0 0 0 0 0 0 0 3.975 0 -0.073 0 0 24.778 -2.270 0 0 -1.435 4.387 0 0 0
        0 0 0 3.024 0 0 0 0 0 0 0 1.993 0 29.759 25.023 0 0 0 0 14.006 0 0 0 0
        0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0...
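
    The vector strings above appear to share a semicolon-separated layout:
    a type tag, a description, the number of values, the values type, the
    string format, and then the IDs and/or values. The Python sketch below
    parses that layout as inferred from the examples only. It is not a
    MayaChemTools API; the function name and the returned dictionary keys
    are hypothetical, and only the two formats shown in full above
    (*IDsAndValuesString* and *ValuesString*) are handled.

```python
def parse_fingerprints_vector(text):
    """Parse a FingerprintsVector string laid out as in the examples above."""
    fields = text.split(";")
    kind, description, size, values_type, string_format = fields[:5]
    if kind != "FingerprintsVector":
        raise ValueError(f"unexpected fingerprints type: {kind}")
    if string_format == "IDsAndValuesString":
        ids = fields[5].split()
        values = [float(v) for v in fields[6].split()]
    elif string_format == "ValuesString":
        ids = None
        values = [float(v) for v in fields[5].split()]
    else:
        # The other formats named in the synopsis are not covered here.
        raise ValueError(f"format not handled in this sketch: {string_format}")
    if len(values) != int(size):
        raise ValueError("declared size does not match number of values")
    return {
        "description": description,
        "values_type": values_type,
        "format": string_format,
        "ids": ids,
        "values": values,
    }


example = (
    "FingerprintsVector;EStateIndicies:ArbitrarySize;11;NumericalValues;"
    "IDsAndValuesString;"
    "SaaCH SaasC SaasN SdO SdssC SsCH3 SsF SsOH SssCH2 SssNH SsssCH;"
    "24.778 4.387 1.993 25.023 -1.435 3.975 14.006 29.759 -0.073 3.024 -2.270"
)
parsed = parse_fingerprints_vector(example)
by_id = dict(zip(parsed["ids"], parsed["values"]))
```

    Pairing IDs with values through `zip` gives direct lookups such as
    `by_id["SsOH"]`. The size check guards against strings that were
    truncated or wrapped mid-value.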

OPTIONS
    --AromaticityModel *MDLAromaticityModel | TriposAromaticityModel |
    MMFFAromaticityModel | ChemAxonBasicAromaticityModel |
    ChemAxonGeneralAromaticityModel | DaylightAromaticityModel |
    MayaChemToolsAromaticityModel*
        Specify the aromaticity model to use during detection of
        aromaticity. Possible values in the current release are:
        *MDLAromaticityModel, TriposAromaticityModel, MMFFAromaticityModel,
        ChemAxonBasicAromaticityModel, ChemAxonGeneralAromaticityModel,
        DaylightAromaticityModel or MayaChemToolsAromaticityModel*. Default
        value: *MayaChemToolsAromaticityModel*.

        The supported aromaticity model names along with model specific
        control parameters are defined in AromaticityModelsData.csv, which
        is distributed with the current release and is available under the
        lib/data directory. The Molecule.pm module retrieves data from this
        file during class instantiation and makes it available to the
        method DetectAromaticity for detecting aromaticity corresponding to
        a specific model.

    --CompoundID *DataFieldName or LabelPrefixString*
        This value is --CompoundIDMode specific and indicates how the
        compound ID is generated.

        For *DataField* value of --CompoundIDMode option, it corresponds to
        the datafield label name whose value is used as compound ID;
        otherwise, it's a prefix string used for generating compound IDs
        like LabelPrefixString<Number>. Default value, *Cmpd*, generates
        compound IDs which look like Cmpd<Number>.

        Examples for *DataField* value of --CompoundIDMode:

            MolID
            ExtReg

        Examples for *LabelPrefix* or *MolNameOrLabelPrefix* value of
        --CompoundIDMode:

            Compound

        The value specified above generates compound IDs which correspond to
        Compound<Number> instead of default value of Cmpd<Number>.

    --CompoundIDLabel *text*
        Specify compound ID column label for FP or CSV/TSV text file(s) used
        during *CompoundID* value of --DataFieldsMode option. Default:
        *CompoundID*.

    --CompoundIDMode *DataField | MolName | LabelPrefix |
    MolNameOrLabelPrefix*
        Specify how to generate compound IDs and write to FP or CSV/TSV text
        file(s) along with generated fingerprints for *FP | text | all*
        values of --output option: use a *SDFile(s)* datafield value; use
        molname line from *SDFile(s)*; generate a sequential ID with
        specific prefix; or use the molname line and fall back to a
        LabelPrefix ID for empty molname lines.

        Possible values: *DataField | MolName | LabelPrefix |
        MolNameOrLabelPrefix*. Default: *LabelPrefix*.

        For *MolNameOrLabelPrefix* value of --CompoundIDMode, molname line
        in *SDFile(s)* takes precedence over sequential compound IDs
        generated using *LabelPrefix* and only empty molname values are
        replaced with sequential compound IDs.

        This is only used for *CompoundID* value of --DataFieldsMode option.
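
        The four modes can be summarized with the following Python sketch.
        It only mirrors the behavior described above and is not the
        script's actual code: the function and parameter names are
        hypothetical, and returning an empty string for a missing data
        field is an assumption, since the text does not say what happens in
        that case.

```python
def compound_id(mode, compound_num, mol_name="", data_fields=None,
                compound_id_option="Cmpd"):
    """Return a compound ID following the --CompoundIDMode description.

    `compound_id_option` stands in for the --CompoundID value: a data field
    label in DataField mode, otherwise a label prefix.
    """
    data_fields = data_fields or {}
    sequential = f"{compound_id_option}{compound_num}"
    if mode == "DataField":
        # Assumption: a missing field yields an empty ID.
        return data_fields.get(compound_id_option, "")
    if mode == "MolName":
        return mol_name
    if mode == "LabelPrefix":
        return sequential
    if mode == "MolNameOrLabelPrefix":
        # The molname line wins; only empty names fall back to the prefix.
        return mol_name if mol_name.strip() else sequential
    raise ValueError(f"unknown CompoundIDMode: {mode}")


default_id = compound_id("LabelPrefix", 1)
fallback_id = compound_id("MolNameOrLabelPrefix", 3, mol_name="",
                          compound_id_option="Compound")
named_id = compound_id("MolNameOrLabelPrefix", 3, mol_name="Aspirin")
field_id = compound_id("DataField", 7, data_fields={"MolID": "M42"},
                       compound_id_option="MolID")
```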

    --DataFields *"FieldLabel1,FieldLabel2,..."*
        Comma delimited list of *SDFile(s)* data fields to extract and
        write to CSV/TSV text file(s) along with generated fingerprints for
        *text | all* values of --output option.

        This is only used for *Specify* value of --DataFieldsMode option.

        Examples:

            Extreg
            MolID,CompoundName

    -d, --DataFieldsMode *All | Common | Specify | CompoundID*
        Specify how data fields in *SDFile(s)* are transferred to output
        CSV/TSV text file(s) along with generated fingerprints for *text |
        all* values of --output option: transfer all SD data fields;
        transfer SD data fields common to all compounds; extract specified
        data fields; generate a compound ID using molname line, a compound
        prefix, or a combination of both. Possible values: *All | Common |
        Specify | CompoundID*. Default value: *CompoundID*.
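
        One way to picture the field selection for each mode is the Python
        sketch below. It is an illustration under assumptions, not the
        script's implementation: the function name and the list-of-dicts
        input are hypothetical, *All* is taken to mean the union of field
        labels across compounds in first-seen order, and *CompoundID* is
        taken to transfer no SD data fields because the ID column is
        generated separately.

```python
def select_data_fields(mode, records, specified=None):
    """Pick the SD data field labels to write for a --DataFieldsMode value.

    `records` is a list of dicts (field label -> value), one per compound.
    """
    if mode == "All":
        labels = []
        for record in records:
            for label in record:
                if label not in labels:
                    labels.append(label)
        return labels
    if mode == "Common":
        if not records:
            return []
        common = set(records[0])
        for record in records[1:]:
            common &= set(record)
        # Keep the order of the first compound's fields.
        return [label for label in records[0] if label in common]
    if mode == "Specify":
        return list(specified or [])
    if mode == "CompoundID":
        return []
    raise ValueError(f"unknown DataFieldsMode: {mode}")


records = [
    {"MolID": "M1", "CompoundName": "A", "Extreg": "E1"},
    {"MolID": "M2", "CompoundName": "B"},
]
all_fields = select_data_fields("All", records)
common_fields = select_data_fields("Common", records)
chosen_fields = select_data_fields("Specify", records, ["MolID", "CompoundName"])
```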

    -e, --EStateAtomTypesSetToUse *ArbitrarySize | FixedSize*
        E-state atom types set size to use during generation of E-state
        indices fingerprints. Possible values: *ArbitrarySize | FixedSize*.
        Default value: *ArbitrarySize*.

        *ArbitrarySize* corresponds to only E-state atom types detected in
        molecule; *FixedSize* corresponds to fixed number of previously
        defined E-state atom types.

        For *EStateIndicies*, a fingerprint vector string is generated. The
        vector string corresponding to *EStateIndicies* contains the sum of
        E-state values for each E-state atom type.

        Module AtomTypes::EStateAtomTypes.pm, used to assign E-state atom
        types to non-hydrogen atoms in the molecule, is able to assign atom
        types to any valid atom group. However, for *FixedSize* value of
        EStateAtomTypesSetToUse, only a fixed set of E-state atom types
        corresponding to specific atom groups [ Appendix III in Ref 77 ] are
        used for fingerprints.

        The fixed-size E-state atom type set used during generation of
        fingerprints contains the 87 E-state non-hydrogen atom types listed
        in the EStateAtomTypes.csv data file distributed with MayaChemTools.

    -f, --Filter *Yes | No*
        Specify whether to check and filter compound data in SDFile(s).
        Possible values: *Yes or No*. Default value: *Yes*.

        By default, compound data is checked before calculating fingerprints
        and compounds containing atom data corresponding to non-element
        symbols or no atom data are ignored.

    --FingerprintsLabelMode *FingerprintsLabelOnly |
    FingerprintsLabelWithIDs*
        Specify how fingerprints label is generated in conjunction with
        --FingerprintsLabel option value: use fingerprints label generated
        only by --FingerprintsLabel option value or append E-state atom type
        value IDs to --FingerprintsLabel option value.

        Possible values: *FingerprintsLabelOnly | FingerprintsLabelWithIDs*.
        Default value: *FingerprintsLabelOnly*.

        This option is only used for *FixedSize* value of -e,
        --EStateAtomTypesSetToUse option during generation of
        *EStateIndicies* E-state fingerprints.

        E-state atom type IDs appended to --FingerprintsLabel value during
        *FingerprintsLabelWithIDs* values of --FingerprintsLabelMode
        correspond to fixed number of previously defined E-state atom types.

    --FingerprintsLabel *text*
        SD data label or text file column label to use for fingerprints
        string in output SD or CSV/TSV text file(s) specified by --output.
        Default value: *EStateIndiciesFingerprints*.

    -h, --help
        Print this help message.

    -k, --KeepLargestComponent *Yes | No*
        Generate fingerprints for only the largest component in molecule.
        Possible values: *Yes or No*. Default value: *Yes*.

        For molecules containing multiple connected components, fingerprints
        can be generated in two different ways: use all connected components
        or just the largest connected component. By default, all atoms
        outside the largest connected component are deleted before
        generation of fingerprints.
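
        Finding the largest connected component is a standard graph
        traversal. The Python sketch below uses a breadth-first search over
        an atom and bond list. It is a generic illustration, not the
        MayaChemTools routine: the function name is hypothetical, "largest"
        is assumed to mean the component with the most atoms, and ties are
        resolved in favor of the component found first.

```python
from collections import deque


def largest_component(num_atoms, bonds):
    """Return sorted atom numbers (1-based) of the largest connected component."""
    adjacency = {atom: set() for atom in range(1, num_atoms + 1)}
    for first, second in bonds:
        adjacency[first].add(second)
        adjacency[second].add(first)

    seen = set()
    best = []
    for start in adjacency:
        if start in seen:
            continue
        # Breadth-first search collects one connected component.
        component = []
        queue = deque([start])
        seen.add(start)
        while queue:
            atom = queue.popleft()
            component.append(atom)
            for neighbor in adjacency[atom]:
                if neighbor not in seen:
                    seen.add(neighbor)
                    queue.append(neighbor)
        if len(component) > len(best):
            best = component
    return sorted(best)


# Hypothetical two-fragment structure: atoms 1-3 bonded, atoms 4-5 bonded.
kept_atoms = largest_component(5, [(1, 2), (2, 3), (4, 5)])
```

        With *Yes*, only the atoms returned by such a search would
        contribute to the fingerprint; with *No*, every fragment would.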
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
306
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
307 --OutDelim *comma | tab | semicolon*
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
308 Delimiter for output CSV/TSV text file(s). Possible values: *comma,
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
309 tab, or semicolon* Default value: *comma*.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
310
    --output *SD | FP | text | all*
        Type of output files to generate. Possible values: *SD, FP, text, or
        all*. Default value: *text*.

    -o, --overwrite
        Overwrite existing files.

    -q, --quote *Yes | No*
        Put quotes around column values in output CSV/TSV text file(s).
        Possible values: *Yes or No*. Default value: *Yes*.

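        The combined effect of --OutDelim and -q, --quote on one row of a
        text output file can be sketched in Python as follows. This is an
        illustration built on Python's csv module, not the script's own
        writer; the function name format_row and the sample row are
        hypothetical, and the "No" case is approximated with minimal quoting.

```python
import csv
import io

# Documented --OutDelim values mapped to the characters they stand for.
DELIMITERS = {"comma": ",", "tab": "\t", "semicolon": ";"}


def format_row(values, out_delim="comma", quote="Yes"):
    """Format one output row for the given --OutDelim and --quote values."""
    # "Yes" quotes every column; "No" is approximated here by quoting only
    # when a value contains the delimiter or a quote character (assumption).
    quoting = csv.QUOTE_ALL if quote.lower() == "yes" else csv.QUOTE_MINIMAL
    buffer = io.StringIO()
    writer = csv.writer(
        buffer,
        delimiter=DELIMITERS[out_delim.lower()],
        quoting=quoting,
        lineterminator="\n",
    )
    writer.writerow(values)
    return buffer.getvalue().rstrip("\n")


# Hypothetical row: a compound ID followed by a short values string.
row = ["Cmpd1", "24.778 4.387"]
quoted_csv = format_row(row)                # '"Cmpd1","24.778 4.387"'
plain_tsv = format_row(row, "tab", "No")    # 'Cmpd1\t24.778 4.387'
```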
    -r, --root *RootName*
        New file name is generated using the root: <Root>.<Ext>. Default for
        new file names: <SDFileName><EStateIndiciesFP>.<Ext>. The file type
        determines <Ext> value. The sdf, fpf, csv, and tsv <Ext> values are
        used for SD, FP, comma/semicolon, and tab delimited text files,
        respectively. This option is ignored for multiple input files.

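        The naming rule above can be sketched in Python. This is one reading
        of the documented behavior, not the script's implementation: the
        function name output_file_names is hypothetical, <SDFileName> is
        assumed to mean the input file name without its extension, and the
        working directory is ignored.

```python
import os

# <Ext> values documented for the SD and FP output types.
EXTENSIONS = {"sd": "sdf", "fp": "fpf"}


def output_file_names(sd_file, output="text", out_delim="comma", root=None):
    """Return the output file names implied by --output, --OutDelim and -r."""
    if root:
        base = root
    else:
        # Assumed reading of <SDFileName><EStateIndiciesFP>.
        stem = os.path.splitext(os.path.basename(sd_file))[0]
        base = stem + "EStateIndiciesFP"

    kind = output.lower()
    kinds = ["sd", "fp", "text"] if kind == "all" else [kind]

    names = []
    for k in kinds:
        if k == "text":
            # Tab-delimited text gets tsv; comma and semicolon get csv.
            ext = "tsv" if out_delim.lower() == "tab" else "csv"
        else:
            ext = EXTENSIONS[k]
        names.append(f"{base}.{ext}")
    return names


default_names = output_file_names("Sample.sdf")
all_names = output_file_names("Sample.sdf", output="all", root="SampleESFP")
```

        Under these assumptions, default_names holds
        SampleEStateIndiciesFP.csv, and all_names holds SampleESFP.sdf,
        SampleESFP.fpf, and SampleESFP.csv.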
    --ValuesPrecision *number*
        Precision of E-state indices values in the fingerprints. Default
        value: up to *3* decimal places. Valid values: positive integers.

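        A minimal Python sketch of what the precision setting means for a
        single value. Fixed-point formatting is an assumption here; the
        helper name format_value is hypothetical.

```python
def format_value(value, precision=3):
    """Format one E-state index value with a fixed number of decimal places."""
    if not isinstance(precision, int) or precision <= 0:
        raise ValueError("precision must be a positive integer")
    # Fixed-point formatting keeps trailing zeros, e.g. -2.27 -> "-2.270".
    return f"{value:.{precision}f}"


three_places = format_value(-2.27)
two_places = format_value(24.778, precision=2)
```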
    -v, --VectorStringFormat *ValuesString | IDsAndValuesString |
    IDsAndValuesPairsString | ValuesAndIDsString | ValuesAndIDsPairsString*
        Format of fingerprints vector string data in output SD, FP or
        CSV/TSV text file(s) specified by --output used for
        *EStateIndicies*. Possible values: *ValuesString,
        IDsAndValuesString, IDsAndValuesPairsString, ValuesAndIDsString,
        ValuesAndIDsPairsString*.

        Default value when -e, --EStateAtomTypesSetToUse is
        *ArbitrarySize*: *IDsAndValuesString*. Default value when -e,
        --EStateAtomTypesSetToUse is *FixedSize*: *ValuesString*.

        Examples:

            FingerprintsVector;EStateIndicies:ArbitrarySize;11;NumericalValues;IDs
            AndValuesString;SaaCH SaasC SaasN SdO SdssC SsCH3 SsF SsOH SssCH2 SssN
            H SsssCH;24.778 4.387 1.993 25.023 -1.435 3.975 14.006 29.759 -0.073 3
            .024 -2.270

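        The example string above is wrapped at a fixed width, so tokens such
        as IDsAndValuesString, SssNH, and 3.024 are split across lines; once
        the lines are rejoined, it consists of seven semicolon-separated
        fields. The Python sketch below reads such a string in the
        IDsAndValuesString format. The function name parse_ids_and_values is
        hypothetical, and the field layout is inferred from this one example
        rather than from a format specification.

```python
def parse_ids_and_values(vector_string):
    """Parse a fingerprints vector string in IDsAndValuesString format.

    Returns (description, {atom_type_id: value}).
    """
    fields = vector_string.split(";")
    if len(fields) != 7 or fields[0] != "FingerprintsVector":
        raise ValueError("not a seven-field FingerprintsVector string")

    _, description, size, _value_type, fmt, ids_field, values_field = fields
    if fmt != "IDsAndValuesString":
        raise ValueError(f"unsupported vector string format: {fmt}")

    ids = ids_field.split()
    values = [float(v) for v in values_field.split()]
    # The third field gives the number of values in the vector.
    if not (len(ids) == len(values) == int(size)):
        raise ValueError("ID/value counts do not match the declared size")
    return description, dict(zip(ids, values))


# The documented example, rejoined at the same points where it is wrapped.
example = (
    "FingerprintsVector;EStateIndicies:ArbitrarySize;11;NumericalValues;IDs"
    "AndValuesString;SaaCH SaasC SaasN SdO SdssC SsCH3 SsF SsOH SssCH2 SssN"
    "H SsssCH;24.778 4.387 1.993 25.023 -1.435 3.975 14.006 29.759 -0.073 3"
    ".024 -2.270"
)
description, indices = parse_ids_and_values(example)
```

        Here indices maps each of the 11 E-state atom type IDs to its value,
        for example SssNH to 3.024.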
    -w, --WorkingDir *DirName*
        Location of working directory. Default: current directory.

EXAMPLES
    To generate E-state fingerprints of arbitrary size in vector string
    format and create a SampleESFP.csv file containing sequential compound
    IDs along with fingerprints vector strings data, type:

        % EStateIndiciesFingerprints.pl -r SampleESFP -o Sample.sdf

    To generate E-state fingerprints of fixed size in vector string format
    and create a SampleESFP.csv file containing sequential compound IDs
    along with fingerprints vector strings data, type:

        % EStateIndiciesFingerprints.pl -e FixedSize -r SampleESFP
          -o Sample.sdf

    To generate E-state fingerprints of fixed size in vector string with
    IDsAndValues format and create a SampleESFP.csv file containing
    sequential compound IDs along with fingerprints vector strings data,
    type:

        % EStateIndiciesFingerprints.pl -e FixedSize -v IDsAndValuesString
          -r SampleESFP -o Sample.sdf

    To generate E-state fingerprints of fixed size in vector string format
    and create a SampleESFP.csv file containing compound IDs from the
    molecule name line along with fingerprints vector strings data, type:

        % EStateIndiciesFingerprints.pl -e FixedSize
          --DataFieldsMode CompoundID --CompoundIDMode MolName
          -r SampleESFP -o Sample.sdf

    To generate E-state fingerprints of fixed size in vector string format
    and create a SampleESFP.csv file containing compound IDs from a
    specified data field along with fingerprints vector strings data, type:

        % EStateIndiciesFingerprints.pl -e FixedSize
          --DataFieldsMode CompoundID --CompoundIDMode DataField --CompoundID
          Mol_ID -r SampleESFP -o Sample.sdf

    To generate E-state fingerprints of fixed size in vector string format
    and create a SampleESFP.csv file containing compound IDs using a
    combination of the molecule name line and an explicit compound prefix
    along with fingerprints vector strings data, type:

        % EStateIndiciesFingerprints.pl -e FixedSize
          --DataFieldsMode CompoundID --CompoundIDMode MolNameOrLabelPrefix
          --CompoundID Cmpd --CompoundIDLabel MolID -r SampleESFP -o Sample.sdf

    To generate E-state fingerprints of fixed size in vector string format
    and create a SampleESFP.csv file containing specific data field columns
    along with fingerprints vector strings data, type:

        % EStateIndiciesFingerprints.pl -e FixedSize
          --DataFieldsMode Specify --DataFields Mol_ID -r SampleESFP
          -o Sample.sdf

    To generate E-state fingerprints of fixed size in vector string format
    and create a SampleESFP.csv file containing common data field columns
    along with fingerprints vector strings data, type:

        % EStateIndiciesFingerprints.pl -e FixedSize
          --DataFieldsMode Common -r SampleESFP -o Sample.sdf

    To generate E-state fingerprints of fixed size in vector string format
    and create SampleESFP.sdf, SampleESFP.fpf, and SampleESFP.csv files,
    with all data field columns in the CSV file along with fingerprints
    vector strings data, type:

        % EStateIndiciesFingerprints.pl -e FixedSize
          --DataFieldsMode All --output all -r SampleESFP -o Sample.sdf

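    The command lines in the examples above can also be assembled from a
    script. The Python sketch below only builds the argument list, using
    options documented here; the helper name build_command and its
    parameters are hypothetical. Running the result would need a working
    MayaChemTools installation with the script on the PATH.

```python
def build_command(sd_file, root, fixed_size=False, data_fields_mode=None,
                  output=None, overwrite=True):
    """Build an argument list for EStateIndiciesFingerprints.pl."""
    command = ["EStateIndiciesFingerprints.pl"]
    if fixed_size:
        command += ["-e", "FixedSize"]
    if data_fields_mode:
        command += ["--DataFieldsMode", data_fields_mode]
    if output:
        command += ["--output", output]
    command += ["-r", root]
    if overwrite:
        command.append("-o")
    command.append(sd_file)
    return command


# Equivalent of the last example: all data fields, all output types.
command = build_command("Sample.sdf", "SampleESFP", fixed_size=True,
                        data_fields_mode="All", output="all")

# To actually run it (requires MayaChemTools to be installed):
# import subprocess
# subprocess.run(command, check=True)
```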
AUTHOR
    Manish Sud <msud@san.rr.com>

SEE ALSO
    InfoFingerprintsFiles.pl, SimilarityMatricesFingerprints.pl,
    AtomNeighborhoodsFingerprints.pl, ExtendedConnectivityFingerprints.pl,
    MACCSKeysFingerprints.pl, PathLengthFingerprints.pl,
    TopologicalAtomPairsFingerprints.pl,
    TopologicalAtomTorsionsFingerprints.pl,
    TopologicalPharmacophoreAtomPairsFingerprints.pl,
    TopologicalPharmacophoreAtomTripletsFingerprints.pl

COPYRIGHT
    Copyright (C) 2015 Manish Sud. All rights reserved.

    This file is part of MayaChemTools.

    MayaChemTools is free software; you can redistribute it and/or modify it
    under the terms of the GNU Lesser General Public License as published by
    the Free Software Foundation; either version 3 of the License, or (at
    your option) any later version.
