annotate docs/scripts/txt/TopologicalPharmacophoreAtomPairsFingerprints.txt @ 0:4816e4a8ae95 draft default tip

Uploaded
author deepakjadmin
date Wed, 20 Jan 2016 09:23:18 -0500
parents
children
NAME
    TopologicalPharmacophoreAtomPairsFingerprints.pl - Generate topological
    pharmacophore atom pairs fingerprints for SD files

SYNOPSIS
    TopologicalPharmacophoreAtomPairsFingerprints.pl SDFile(s)...

    TopologicalPharmacophoreAtomPairsFingerprints.pl [--AromaticityModel
    *AromaticityModelType*] [--AtomPairsSetSizeToUse *ArbitrarySize |
    FixedSize*] [-a, --AtomTypesToUse *"AtomType1, AtomType2..."*]
    [--AtomTypesWeight *"AtomType1, Weight1, AtomType2, Weight2..."*]
    [--CompoundID *DataFieldName or LabelPrefixString*] [--CompoundIDLabel
    *text*] [--CompoundIDMode] [--DataFields *"FieldLabel1,
    FieldLabel2,..."*] [-d, --DataFieldsMode *All | Common | Specify |
    CompoundID*] [-f, --Filter *Yes | No*] [--FingerprintsLabelMode
    *FingerprintsLabelOnly | FingerprintsLabelWithIDs*] [--FingerprintsLabel
    *text*] [--FuzzifyAtomPairsCount *Yes | No*] [--FuzzificationMode
    *FuzzyBinning | FuzzyBinSmoothing*] [--FuzzificationMethodology
    *FuzzyBinning | FuzzyBinSmoothing*] [--FuzzFactor *number*] [-h, --help]
    [-k, --KeepLargestComponent *Yes | No*] [--MinDistance *number*]
    [--MaxDistance *number*] [-n, --NormalizationMethodology *None |
    ByHeavyAtomsCount | ByAtomTypesCount*] [--OutDelim *comma | tab |
    semicolon*] [--output *SD | FP | text | all*] [-o, --overwrite] [-q,
    --quote *Yes | No*] [-r, --root *RootName*] [--ValuesPrecision *number*]
    [-v, --VectorStringFormat *ValuesString | IDsAndValuesString |
    IDsAndValuesPairsString | ValuesAndIDsString | ValuesAndIDsPairsString*]
    [-w, --WorkingDir dirname] SDFile(s)...

DESCRIPTION
    Generate topological pharmacophore atom pairs fingerprints [ Ref 60-62,
    Ref 65, Ref 68 ] for *SDFile(s)* and create appropriate SD, FP or
    CSV/TSV text file(s) containing fingerprints vector strings
    corresponding to molecular fingerprints.

    Multiple SDFile names are separated by spaces. The valid file extensions
    are *.sdf* and *.sd*. All other file names are ignored. All the SD files
    in the current directory can be specified either by **.sdf* or the
    current directory name.

    Based on the values specified for --AtomTypesToUse, pharmacophore atom
    types are assigned to all non-hydrogen atoms in a molecule and a
    distance matrix is generated. A pharmacophore atom pairs basis set is
    initialized for all unique possible pairs within the --MinDistance and
    --MaxDistance range.

    Let:

        P = Valid pharmacophore atom type

        Px = Pharmacophore atom type x
        Py = Pharmacophore atom type y

        Dmin = Minimum distance corresponding to number of bonds between
               two atoms
        Dmax = Maximum distance corresponding to number of bonds between
               two atoms
        D = Distance corresponding to number of bonds between two atoms

        Px-Dn-Py = Pharmacophore atom pair ID for atom types Px and Py at
                   distance Dn

        P = Number of pharmacophore atom types to consider (this is the
            meaning of P in the formulas below)
        PPDn = Number of possible unique pharmacophore atom pairs at a distance Dn

        PPT = Total number of possible pharmacophore atom pairs at all distances
              between Dmin and Dmax

    Then:

        PPDn = (P * (P - 1))/2 + P

        PPT = ((Dmax - Dmin) + 1) * ((P * (P - 1))/2 + P)
            = ((Dmax - Dmin) + 1) * PPDn

    So for default values of Dmin = 1, Dmax = 10 and P = 5,

        PPDn = (5 * (5 - 1))/2 + 5 = 15
        PPT = ((10 - 1) + 1) * 15 = 150

    The pharmacophore atom pairs basis set includes 150 values.

    The atom pair IDs correspond to:

        Px-Dn-Py = Pharmacophore atom pair ID for atom types Px and Py at
                   distance Dn

        For example: H-D1-H, H-D2-HBA, PI-D5-PI and so on

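    The basis set arithmetic above can be checked with a short Python
    sketch. This is an illustration only, not code from MayaChemTools: the
    function name atom_pair_basis_set is hypothetical, and the ordering
    (atom types sorted alphabetically, one block of pairs per distance) is
    an assumption chosen to match the FixedSize example strings shown in
    this document.

```python
from itertools import combinations_with_replacement


def atom_pair_basis_set(atom_types, min_distance=1, max_distance=10):
    """Return every Px-Dn-Py ID for the given types and distance range.

    Hypothetical helper: unordered pairs (including Px == Py) are
    enumerated once per distance, which gives (P * (P - 1))/2 + P IDs
    per distance.
    """
    types = sorted(atom_types)
    ids = []
    for n in range(min_distance, max_distance + 1):
        for px, py in combinations_with_replacement(types, 2):
            ids.append(f"{px}-D{n}-{py}")
    return ids


# Default atom types and distance range named in this document.
basis = atom_pair_basis_set(["HBD", "HBA", "PI", "NI", "H"])

p = 5
pairs_per_distance = (p * (p - 1)) // 2 + p          # 15
total_pairs = ((10 - 1) + 1) * pairs_per_distance    # 150

print(pairs_per_distance, total_pairs, len(basis))
print(basis[:3])
```

    With the defaults, the enumeration yields the same 150 entries as the
    closed-form count.
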
    Using the distance matrix and pharmacophore atom types, the occurrence
    of unique pharmacophore atom pairs is counted. The contribution of each
    atom type to atom pair interaction is optionally weighted by the
    specified --AtomTypesWeight before assigning its count to the
    appropriate distance bin. Based on the --NormalizationMethodology
    option, the pharmacophore atom pairs count is optionally normalized.
    Additionally, the pharmacophore atom pairs count is optionally
    fuzzified before or after normalization, as controlled by the values of
    the --FuzzifyAtomPairsCount, --FuzzificationMode,
    --FuzzificationMethodology and --FuzzFactor options.

    The final pharmacophore atom pairs count along with atom pair
    identifiers involving all non-hydrogen atoms, with optional
    normalization and fuzzification, constitute pharmacophore topological
    atom pairs fingerprints of the molecule.

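    The counting step can be illustrated with a toy Python sketch. It is
    not the MayaChemTools implementation: the molecule, the atom type
    assignments and the function names are all hypothetical, an atom
    carrying several types is assumed to contribute one pair per type
    combination, and weighting, normalization and fuzzification are left
    out entirely.

```python
from collections import Counter, deque


def bond_distances(adjacency):
    """All-pairs shortest path lengths, in bonds, via breadth-first search."""
    dist = {}
    for start in adjacency:
        seen = {start: 0}
        queue = deque([start])
        while queue:
            atom = queue.popleft()
            for nbr in adjacency[atom]:
                if nbr not in seen:
                    seen[nbr] = seen[atom] + 1
                    queue.append(nbr)
        dist[start] = seen
    return dist


def count_atom_pairs(adjacency, atom_types, min_distance=1, max_distance=10):
    """Count Px-Dn-Py occurrences over all distinct heavy-atom pairs."""
    dist = bond_distances(adjacency)
    counts = Counter()
    atoms = sorted(adjacency)
    for i, a in enumerate(atoms):
        for b in atoms[i + 1:]:
            d = dist[a].get(b)
            if d is None or not (min_distance <= d <= max_distance):
                continue
            for ta in atom_types.get(a, ()):
                for tb in atom_types.get(b, ()):
                    px, py = sorted((ta, tb))  # canonical order for the ID
                    counts[f"{px}-D{d}-{py}"] += 1
    return counts


# Hypothetical four-atom chain 1-2-3-4 with made-up type assignments.
adjacency = {1: [2], 2: [1, 3], 3: [2, 4], 4: [3]}
atom_types = {1: ("HBD",), 2: ("H",), 3: ("H",), 4: ("HBA",)}

counts = count_atom_pairs(adjacency, atom_types)
for pair_id in sorted(counts):
    print(pair_id, counts[pair_id])
```

    Each of the six atom pairs in the chain lands in exactly one distance
    bin, so every ID in this toy case has a count of 1.
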
    For the *ArbitrarySize* value of the --AtomPairsSetSizeToUse option,
    the fingerprint vector corresponds to only those topological
    pharmacophore atom pairs which are present and have a non-zero count.
    However, for the *FixedSize* value of the --AtomPairsSetSizeToUse
    option, the fingerprint vector contains all possible valid topological
    pharmacophore atom pairs with both zero and non-zero count values.

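    The relationship between the two set sizes can be sketched in Python.
    The function name to_fixed_size is hypothetical and the basis set
    ordering is an assumption. The four D1 counts are taken from the
    ArbitrarySize example string in this document, and the expanded values
    agree with the first 15 values of its FixedSize example.

```python
from itertools import combinations_with_replacement


def to_fixed_size(sparse_counts, atom_types, min_distance=1, max_distance=10):
    """Expand {pair ID: count} into a full-length vector, zero-filled."""
    types = sorted(atom_types)
    ids = [
        f"{px}-D{n}-{py}"
        for n in range(min_distance, max_distance + 1)
        for px, py in combinations_with_replacement(types, 2)
    ]
    return ids, [sparse_counts.get(pair_id, 0) for pair_id in ids]


# Only the distance-1 entries of the example; the other distances are omitted.
sparse = {"H-D1-H": 18, "H-D1-NI": 1, "HBA-D1-NI": 2, "HBD-D1-NI": 1}

ids, values = to_fixed_size(sparse, ["HBD", "HBA", "PI", "NI", "H"])
print(len(values))
print(values[:15])
```
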
    Example of *SD* file containing topological pharmacophore atom pairs
    fingerprints string data:

        ... ...
        ... ...
        $$$$
        ... ...
        ... ...
        ... ...
         41 44  0  0  0  0  0  0  0  0999 V2000
           -3.3652    1.4499    0.0000 C   0  0  0  0  0  0  0  0  0  0  0  0
        ... ...
          2  3  1  0  0  0  0
        ... ...
        M  END
        >  <CmpdID>
        Cmpd1

        >  <TopologicalPharmacophoreAtomPairsFingerprints>
        FingerprintsVector;TopologicalPharmacophoreAtomPairs:ArbitrarySize:Min
        Distance1:MaxDistance10;54;NumericalValues;IDsAndValuesString;H-D1-H H
        -D1-NI HBA-D1-NI HBD-D1-NI H-D2-H H-D2-HBA H-D2-HBD HBA-D2-HBA HBA-D2-
        HBD H-D3-H H-D3-HBA H-D3-HBD H-D3-NI HBA-D3-NI HBD-D3-NI H-D4-H H-D...;
        18 1 2 1 22 12 8 1 2 18 6 3 1 1 1 22 13 6 5 7 2 28 9 5 1 1 1 36 16 10 3
        4 1 37 10 8 1 35 10 9 3 3 1 28 7 7 4 18 16 12 5 1 2 1

        $$$$
        ... ...
        ... ...

    Example of *FP* file containing topological pharmacophore atom pairs
    fingerprints string data:

        #
        # Package = MayaChemTools 7.4
        # Release Date = Oct 21, 2010
        #
        # TimeStamp = Fri Mar 11 15:32:48 2011
        #
        # FingerprintsStringType = FingerprintsVector
        #
        # Description = TopologicalPharmacophoreAtomPairs:ArbitrarySize:MinDistance1:MaxDistance10
        # VectorStringFormat = IDsAndValuesString
        # VectorValuesType = NumericalValues
        #
        Cmpd1 54;H-D1-H H-D1-NI HBA-D1-NI HBD-D1-NI H-D2-H H-D2-HBA...;18 1 2...
        Cmpd2 61;H-D1-H H-D1-NI HBA-D1-NI HBD-D1-NI H-D2-H H-D2-HBA...;5 1 2 ...
        ... ...
        ... ...

    Example of CSV *Text* file containing topological pharmacophore atom
    pairs fingerprints string data:

        "CompoundID","TopologicalPharmacophoreAtomPairsFingerprints"
        "Cmpd1","FingerprintsVector;TopologicalPharmacophoreAtomPairs:Arbitrary
        Size:MinDistance1:MaxDistance10;54;NumericalValues;IDsAndValuesString;H
        -D1-H H-D1-NI HBA-D1-NI HBD-D1-NI H-D2-H H-D2-HBA H-D2-HBD HBA-D2-HBA H
        BA-D2-HBD H-D3-H H-D3-HBA H-D3-HBD H-D3-NI HBA-D3-NI HBD-D3-NI H-D4...;
        18 1 2 1 22 12 8 1 2 18 6 3 1 1 1 22 13 6 5 7 2 28 9 5 1 1 1 36 16 10 3
        4 1 37 10 8 1 35 10 9 3 3 1 28 7 7 4 18 16 12 5 1 2 1"
        ... ...
        ... ...

    The current release of MayaChemTools generates the following types of
    topological pharmacophore atom pairs fingerprints vector strings:

        FingerprintsVector;TopologicalPharmacophoreAtomPairs:ArbitrarySize:Min
        Distance1:MaxDistance10;54;NumericalValues;IDsAndValuesString;H-D1-H H
        -D1-NI HBA-D1-NI HBD-D1-NI H-D2-H H-D2-HBA H-D2-HBD HBA-D2-HBA HBA-D2-
        HBD H-D3-H H-D3-HBA H-D3-HBD H-D3-NI HBA-D3-NI HBD-D3-NI H-D4-H H-D4-H
        BA H-D4-HBD HBA-D4-HBA HBA-D4-HBD HBD-D4-HBD H-D5-H H-D5-HBA H-D5-...;
        18 1 2 1 22 12 8 1 2 18 6 3 1 1 1 22 13 6 5 7 2 28 9 5 1 1 1 36 16 10
        3 4 1 37 10 8 1 35 10 9 3 3 1 28 7 7 4 18 16 12 5 1 2 1

        FingerprintsVector;TopologicalPharmacophoreAtomPairs:FixedSize:MinDist
        ance1:MaxDistance10;150;OrderedNumericalValues;ValuesString;18 0 0 1 0
        0 0 2 0 0 1 0 0 0 0 22 12 8 0 0 1 2 0 0 0 0 0 0 0 0 18 6 3 1 0 0 0 1
        0 0 1 0 0 0 0 22 13 6 0 0 5 7 0 0 2 0 0 0 0 0 28 9 5 1 0 0 0 1 0 0 1 0
        0 0 0 36 16 10 0 0 3 4 0 0 1 0 0 0 0 0 37 10 8 0 0 0 0 1 0 0 0 0 0 0
        0 35 10 9 0 0 3 3 0 0 1 0 0 0 0 0 28 7 7 4 0 0 0 0 0 0 0 0 0 0 0 18...

        FingerprintsVector;TopologicalPharmacophoreAtomPairs:FixedSize:MinDist
        ance1:MaxDistance10;150;OrderedNumericalValues;IDsAndValuesString;H-D1
        -H H-D1-HBA H-D1-HBD H-D1-NI H-D1-PI HBA-D1-HBA HBA-D1-HBD HBA-D1-NI H
        BA-D1-PI HBD-D1-HBD HBD-D1-NI HBD-D1-PI NI-D1-NI NI-D1-PI PI-D1-PI H-D
        2-H H-D2-HBA H-D2-HBD H-D2-NI H-D2-PI HBA-D2-HBA HBA-D2-HBD HBA-D2...;
        18 0 0 1 0 0 0 2 0 0 1 0 0 0 0 22 12 8 0 0 1 2 0 0 0 0 0 0 0 0 18 6 3
        1 0 0 0 1 0 0 1 0 0 0 0 22 13 6 0 0 5 7 0 0 2 0 0 0 0 0 28 9 5 1 0 0 0
        1 0 0 1 0 0 0 0 36 16 10 0 0 3 4 0 0 1 0 0 0 0

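    A vector string of this kind could be read back with a few lines of
    Python. The sketch below is an assumption-laden illustration rather
    than a MayaChemTools API: the field layout is inferred from the
    examples above, only the ValuesString and IDsAndValuesString formats
    are handled, and the input is a shortened, hypothetical four-entry
    string (its size field is 4, not the 54 of the real example).

```python
def parse_fingerprints_vector(text):
    """Split a semicolon-delimited fingerprints vector string into parts.

    Assumed layout, inferred from the examples:
    FingerprintsVector;description;size;valuesType;format;[IDs;]values
    """
    fields = text.split(";")
    kind, description, size, values_type, fmt = fields[:5]
    if kind != "FingerprintsVector":
        raise ValueError(f"unexpected string type: {kind}")
    if fmt == "ValuesString":
        ids, raw_values = None, fields[5].split()
    elif fmt == "IDsAndValuesString":
        ids, raw_values = fields[5].split(), fields[6].split()
    else:
        raise ValueError(f"format not handled in this sketch: {fmt}")
    values = [float(v) for v in raw_values]
    if len(values) != int(size):
        raise ValueError("size field does not match number of values")
    return {
        "description": description,
        "values_type": values_type,
        "format": fmt,
        "ids": ids,
        "values": values,
    }


# Shortened, hypothetical input built from the first four example entries.
example = (
    "FingerprintsVector;"
    "TopologicalPharmacophoreAtomPairs:ArbitrarySize:MinDistance1:MaxDistance10;"
    "4;NumericalValues;IDsAndValuesString;"
    "H-D1-H H-D1-NI HBA-D1-NI HBD-D1-NI;18 1 2 1"
)

parsed = parse_fingerprints_vector(example)
print(dict(zip(parsed["ids"], parsed["values"])))
```

    Strings copied from SD or CSV files would first need their hard line
    wraps removed, since the wraps can split an ID in the middle.
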
OPTIONS
    --AromaticityModel *MDLAromaticityModel | TriposAromaticityModel |
    MMFFAromaticityModel | ChemAxonBasicAromaticityModel |
    ChemAxonGeneralAromaticityModel | DaylightAromaticityModel |
    MayaChemToolsAromaticityModel*
        Specify the aromaticity model to use during detection of
        aromaticity. Possible values in the current release are:
        *MDLAromaticityModel, TriposAromaticityModel, MMFFAromaticityModel,
        ChemAxonBasicAromaticityModel, ChemAxonGeneralAromaticityModel,
        DaylightAromaticityModel or MayaChemToolsAromaticityModel*. Default
        value: *MayaChemToolsAromaticityModel*.

        The supported aromaticity model names along with model specific
        control parameters are defined in AromaticityModelsData.csv, which
        is distributed with the current release and is available under the
        lib/data directory. The Molecule.pm module retrieves data from this
        file during class instantiation and makes it available to the
        method DetectAromaticity for detecting aromaticity corresponding to
        a specific model.

    --AtomPairsSetSizeToUse *ArbitrarySize | FixedSize*
        Atom pairs set size to use during generation of topological
        pharmacophore atom pairs fingerprints.

        Possible values: *ArbitrarySize | FixedSize*; Default value:
        *ArbitrarySize*.

        For the *ArbitrarySize* value of the --AtomPairsSetSizeToUse
        option, the fingerprint vector corresponds to only those
        topological pharmacophore atom pairs which are present and have a
        non-zero count. However, for the *FixedSize* value of the
        --AtomPairsSetSizeToUse option, the fingerprint vector contains all
        possible valid topological pharmacophore atom pairs with both zero
        and non-zero count values.

    -a, --AtomTypesToUse *"AtomType1,AtomType2,..."*
        Pharmacophore atom types to use during generation of topological
        pharmacophore atom pairs. It's a comma separated list of valid
        pharmacophore atom types.

        Possible values for pharmacophore atom types are: *Ar, CA, H, HBA,
        HBD, Hal, NI, PI, RA*. Default value [ Ref 60-62 ] :
        *HBD,HBA,PI,NI,H*.

        The pharmacophore atom types abbreviations correspond to:

            HBD : HydrogenBondDonor
            HBA : HydrogenBondAcceptor
            PI  : PositivelyIonizable
            NI  : NegativelyIonizable
            Ar  : Aromatic
            Hal : Halogen
            H   : Hydrophobic
            RA  : RingAtom
            CA  : ChainAtom

        The *AtomTypes::FunctionalClassAtomTypes* module is used to assign
        pharmacophore atom types. It uses the following definitions
        [ Ref 60-61, Ref 65-66 ]:

            HydrogenBondDonor: NH, NH2, OH
            HydrogenBondAcceptor: N[!H], O
            PositivelyIonizable: +, NH2
            NegativelyIonizable: -, C(=O)OH, S(=O)OH, P(=O)OH

    --AtomTypesWeight *"AtomType1,Weight1,AtomType2,Weight2..."*
        Weights of specified pharmacophore atom types to use during
        calculation of their contribution to atom pair count. Default value:
        *None*. Valid values: real numbers greater than 0. In general, it's
        a comma delimited list of valid atom types and their weights.

        The weight values allow increasing the importance of a specific
        pharmacophore atom type in the generated fingerprints. A weight
        value of 0 for an atom type eliminates its contribution to atom pair
        count, whereas a weight value of 2 doubles its contribution.

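        As an illustration of the expected value format, the following
        Python sketch turns an --AtomTypesWeight style string into a
        mapping. The function name and the validation rules are
        assumptions, not MayaChemTools behavior. In particular, the text
        above lists values greater than 0 as valid yet also describes a
        weight of 0, so this sketch accepts 0 and rejects only negative
        numbers.

```python
# Atom type abbreviations listed under --AtomTypesToUse in this document.
VALID_ATOM_TYPES = {"Ar", "CA", "H", "HBA", "HBD", "Hal", "NI", "PI", "RA"}


def parse_atom_types_weight(spec):
    """Parse "AtomType1,Weight1,AtomType2,Weight2..." into {type: weight}."""
    tokens = [token.strip() for token in spec.split(",")]
    if len(tokens) % 2 != 0:
        raise ValueError("expected an even number of comma delimited values")
    weights = {}
    for atom_type, weight in zip(tokens[0::2], tokens[1::2]):
        if atom_type not in VALID_ATOM_TYPES:
            raise ValueError(f"unknown pharmacophore atom type: {atom_type}")
        value = float(weight)
        if value < 0:  # assumption: 0 is allowed, negatives are not
            raise ValueError(f"weight must not be negative: {weight}")
        weights[atom_type] = value
    return weights


# Hypothetical specification: double H, halve HBA.
weights = parse_atom_types_weight("H, 2, HBA, 0.5")
print(weights)
```
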
    --CompoundID *DataFieldName or LabelPrefixString*
        This value is --CompoundIDMode specific and indicates how the
        compound ID is generated.

        For the *DataField* value of the --CompoundIDMode option, it
        corresponds to the datafield label name whose value is used as the
        compound ID; otherwise, it's a prefix string used for generating
        compound IDs like LabelPrefixString<Number>. The default value,
        *Cmpd*, generates compound IDs which look like Cmpd<Number>.

        Examples for *DataField* value of --CompoundIDMode:

            MolID
            ExtReg

        Examples for *LabelPrefix* or *MolNameOrLabelPrefix* value of
        --CompoundIDMode:

            Compound

        The value specified above generates compound IDs which correspond to
        Compound<Number> instead of the default value of Cmpd<Number>.

    --CompoundIDLabel *text*
        Specify compound ID column label for CSV/TSV text file(s) used
        during *CompoundID* value of --DataFieldsMode option. Default value:
        *CompoundID*.

--CompoundIDMode *DataField | MolName | LabelPrefix |
MolNameOrLabelPrefix*
Specify how to generate compound IDs and write to FP or CSV/TSV text
file(s) along with generated fingerprints for *FP | text | all*
values of --output option: use a *SDFile(s)* datafield value; use
molname line from *SDFile(s)*; generate a sequential ID with
specific prefix; use combination of both MolName and LabelPrefix
with usage of LabelPrefix values for empty molname lines.

Possible values: *DataField | MolName | LabelPrefix |
MolNameOrLabelPrefix*. Default value: *LabelPrefix*.

For *MolNameOrLabelPrefix* value of --CompoundIDMode, molname line
in *SDFile(s)* takes precedence over sequential compound IDs
generated using *LabelPrefix* and only empty molname values are
replaced with sequential compound IDs.

This is only used for *CompoundID* value of --DataFieldsMode option.

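The four --CompoundIDMode values described above can be sketched in
Python as follows. This is only an illustration of the documented
behavior, not the script's Perl implementation: the function name, its
parameters, and the empty-string fallback for a missing data field are
assumptions made for the example.

```python
def compound_id(mode, index, mol_name="", data_fields=None,
                field_name="", label_prefix="Cmpd"):
    """Return a compound ID for the index-th compound (1-based).

    Hypothetical helper: names and fallbacks are illustrative only.
    """
    data_fields = data_fields or {}
    # Sequential ID such as Cmpd1, Cmpd2, ... (or Compound1 for prefix "Compound").
    sequential_id = f"{label_prefix}{index}"

    if mode == "DataField":
        # Assumption: a missing data field yields an empty ID.
        return data_fields.get(field_name, "")
    if mode == "MolName":
        return mol_name
    if mode == "LabelPrefix":
        return sequential_id
    if mode == "MolNameOrLabelPrefix":
        # The molname line takes precedence; only empty names are replaced.
        return mol_name.strip() or sequential_id
    raise ValueError(f"Unknown CompoundIDMode value: {mode}")
```

For example, `compound_id("MolNameOrLabelPrefix", 2, mol_name="")` gives
`Cmpd2`, while the same call with `mol_name="Aspirin"` keeps the name.
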
--DataFields *"FieldLabel1,FieldLabel2,..."*
Comma delimited list of *SDFile(s)* data fields to extract and
write to CSV/TSV text file(s) along with generated fingerprints for
*text | all* values of --output option.

This is only used for *Specify* value of --DataFieldsMode option.

Examples:

Extreg
MolID,CompoundName

-d, --DataFieldsMode *All | Common | Specify | CompoundID*
Specify how data fields in *SDFile(s)* are transferred to output
CSV/TSV text file(s) along with generated fingerprints for *text |
all* values of --output option: transfer all SD data fields; transfer
SD data fields common to all compounds; extract specified data
fields; generate a compound ID using molname line, a compound
prefix, or a combination of both. Possible values: *All | Common |
Specify | CompoundID*. Default value: *CompoundID*.

-f, --Filter *Yes | No*
Specify whether to check and filter compound data in SDFile(s).
Possible values: *Yes or No*. Default value: *Yes*.

By default, compound data is checked before calculating fingerprints
and compounds containing atom data corresponding to non-element
symbols or no atom data are ignored.

--FingerprintsLabelMode *FingerprintsLabelOnly |
FingerprintsLabelWithIDs*
Specify how fingerprints label is generated in conjunction with
--FingerprintsLabel option value: use fingerprints label generated
only by --FingerprintsLabel option value or append topological atom
pair count value IDs to --FingerprintsLabel option value.

Possible values: *FingerprintsLabelOnly | FingerprintsLabelWithIDs*.
Default value: *FingerprintsLabelOnly*.

Topological atom pairs IDs appended to --FingerprintsLabel value
during *FingerprintsLabelWithIDs* value of --FingerprintsLabelMode
correspond to atom pair count values in fingerprint vector string.

*FingerprintsLabelWithIDs* value of --FingerprintsLabelMode is
ignored during *ArbitrarySize* value of --AtomPairsSetSizeToUse
option and topological atom pairs IDs are not appended to the label.

--FingerprintsLabel *text*
SD data label or text file column label to use for fingerprints
string in output SD or CSV/TSV text file(s) specified by --output.
Default value: *TopologicalPharmacophoreAtomPairsFingerprints*.

--FuzzifyAtomPairsCount *Yes | No*
To fuzzify or not to fuzzify atom pairs count. Possible values: *Yes
or No*. Default value: *No*.

--FuzzificationMode *BeforeNormalization | AfterNormalization*
When to fuzzify atom pairs count. Possible values:
*BeforeNormalization | AfterNormalization*. Default value:
*AfterNormalization*.

--FuzzificationMethodology *FuzzyBinning | FuzzyBinSmoothing*
How to fuzzify atom pairs count. Possible values: *FuzzyBinning |
FuzzyBinSmoothing*. Default value: *FuzzyBinning*.

In conjunction with values for options --FuzzifyAtomPairsCount,
--FuzzificationMode and --FuzzFactor, --FuzzificationMethodology
option is used to fuzzify pharmacophore atom pairs count.

Let:

Px = Pharmacophore atom type x
Py = Pharmacophore atom type y
PPxy = Pharmacophore atom pair between atom type Px and Py

PPxyDn = Pharmacophore atom pairs count between atom type Px and Py
         at distance Dn
PPxyDn-1 = Pharmacophore atom pairs count between atom type Px and Py
           at distance Dn - 1
PPxyDn+1 = Pharmacophore atom pairs count between atom type Px and Py
           at distance Dn + 1

FF = FuzzFactor for FuzzyBinning and FuzzyBinSmoothing

Then:

For *FuzzyBinning*:

PPxyDn = PPxyDn (Unchanged)

PPxyDn-1 = PPxyDn-1 + PPxyDn * FF
PPxyDn+1 = PPxyDn+1 + PPxyDn * FF

For *FuzzyBinSmoothing*:

PPxyDn = PPxyDn - PPxyDn * 2FF for Dmin < Dn < Dmax
PPxyDn = PPxyDn - PPxyDn * FF for Dn = Dmin or Dmax

PPxyDn-1 = PPxyDn-1 + PPxyDn * FF
PPxyDn+1 = PPxyDn+1 + PPxyDn * FF

In both fuzzification schemes, a value of 0 for FF implies no
fuzzification of occurrence counts. A value of 1 during
*FuzzyBinning* corresponds to maximum fuzzification of occurrence
counts; however, a value of 1 during *FuzzyBinSmoothing* ends up
completely distributing the value over the previous and next
distance bins.

So for default value of --FuzzFactor (FF) 0.15, the occurrence count
of pharmacophore atom pairs at distance Dn during *FuzzyBinning* is
left unchanged and the counts at distances Dn - 1 and Dn + 1 are
incremented by PPxyDn * 0.15.

And during *FuzzyBinSmoothing* the occurrence count at distance Dn
is scaled back using a multiplicative factor of (1 - 2*0.15) and the
occurrence counts at distances Dn - 1 and Dn + 1 are incremented by
PPxyDn * 0.15. In other words, the occurrence bin count is smoothed
out by distributing it over the previous and next distance values.

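The two fuzzification formulas above can be tried out with a short
Python sketch. It is a reading of the formulas as written, not the
script's own code: the function name and the list-based layout (one
count per distance bin from Dmin to Dmax for a single atom pair type)
are assumptions, as is the choice to compute every increment from the
original, unfuzzified counts.

```python
def fuzzify_counts(counts, fuzz_factor=0.15, methodology="FuzzyBinning"):
    """Fuzzify occurrence counts for one atom pair type.

    counts[i] is PPxyDn for the i-th distance bin (Dmin .. Dmax).
    Hypothetical helper; increments use the original counts (assumption).
    """
    # Valid FuzzFactor ranges as documented for the --FuzzFactor option.
    limits = {"FuzzyBinning": 1.0, "FuzzyBinSmoothing": 0.5}
    if methodology not in limits:
        raise ValueError(f"Unknown methodology: {methodology}")
    if not 0 <= fuzz_factor <= limits[methodology]:
        raise ValueError("FuzzFactor out of range for " + methodology)

    last = len(counts) - 1
    fuzzified = [float(count) for count in counts]
    for n, count in enumerate(counts):
        spill = count * fuzz_factor          # PPxyDn * FF
        if n > 0:
            fuzzified[n - 1] += spill        # bin at Dn - 1
        if n < last:
            fuzzified[n + 1] += spill        # bin at Dn + 1
        if methodology == "FuzzyBinSmoothing":
            # Interior bins give up 2 * FF, boundary bins give up FF.
            neighbors = (n > 0) + (n < last)
            fuzzified[n] -= spill * neighbors
    return fuzzified
```

With the default FF of 0.15, counts of 0, 10, 0 stay at 10 in the middle
bin under FuzzyBinning while each neighbor gains 1.5; under
FuzzyBinSmoothing the middle bin drops to 7 and the total is preserved.
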
--FuzzFactor *number*
Specify by how much to fuzzify atom pairs count. Default value:
*0.15*. Valid values: For *FuzzyBinning* value of
--FuzzificationMethodology option: *between 0 and 1.0*; For
*FuzzyBinSmoothing* value of --FuzzificationMethodology option:
*between 0 and 0.5*.

-h, --help
Print this help message.

-k, --KeepLargestComponent *Yes | No*
Generate fingerprints for only the largest component in molecule.
Possible values: *Yes or No*. Default value: *Yes*.

For molecules containing multiple connected components, fingerprints
can be generated in two different ways: use all connected components
or just the largest connected component. By default, all atoms
outside the largest connected component are deleted before
generation of fingerprints.

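To illustrate what keeping only the largest connected component means,
here is a minimal Python sketch based on a breadth-first search over
the bond graph. It is not taken from the script: the function name and
input layout are hypothetical, "largest" is assumed to mean the
component with the most atoms, and ties are resolved in favor of the
component found first.

```python
from collections import deque


def largest_component(atom_ids, bonds):
    """Return the sorted atom IDs of the largest connected component.

    atom_ids: iterable of atom identifiers.
    bonds: iterable of (atom_id, atom_id) pairs.
    Hypothetical helper; size is measured by atom count (assumption).
    """
    atom_ids = list(atom_ids)
    adjacency = {atom: set() for atom in atom_ids}
    for first, second in bonds:
        adjacency[first].add(second)
        adjacency[second].add(first)

    seen = set()
    best = []
    for start in atom_ids:
        if start in seen:
            continue
        # Breadth-first search collects one connected component.
        seen.add(start)
        component = []
        queue = deque([start])
        while queue:
            atom = queue.popleft()
            component.append(atom)
            for neighbor in adjacency[atom]:
                if neighbor not in seen:
                    seen.add(neighbor)
                    queue.append(neighbor)
        if len(component) > len(best):
            best = component
    return sorted(best)
```

For a salt-like input with atoms 1 to 5 and bonds 1-2, 2-3 and 4-5, the
sketch keeps atoms 1, 2 and 3 and drops the two-atom fragment.
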
--MinDistance *number*
Minimum bond distance between atom pairs for generating topological
pharmacophore atom pairs. Default value: *1*. Valid values: integers
greater than or equal to 0 and less than --MaxDistance.

--MaxDistance *number*
Maximum bond distance between atom pairs for generating topological
pharmacophore atom pairs. Default value: *10*. Valid values:
positive integers greater than --MinDistance.

-n, --NormalizationMethodology *None | ByHeavyAtomsCount |
ByAtomTypesCount*
Normalization methodology to use for scaling the occurrence count of
pharmacophore atom pairs within specified distance range. Possible
values: *None, ByHeavyAtomsCount or ByAtomTypesCount*. Default
value: *None*.

--OutDelim *comma | tab | semicolon*
Delimiter for output CSV/TSV text file(s). Possible values: *comma,
tab, or semicolon*. Default value: *comma*.

--output *SD | FP | text | all*
Type of output files to generate. Possible values: *SD, FP, text, or
all*. Default value: *text*.

-o, --overwrite
Overwrite existing files.

-q, --quote *Yes | No*
Put quotes around column values in output CSV/TSV text file(s).
Possible values: *Yes or No*. Default value: *Yes*.

-r, --root *RootName*
New file name is generated using the root: <Root>.<Ext>. Default for
new file names:
<SDFileName><TopologicalPharmacophoreAtomPairsFP>.<Ext>. The file
type determines <Ext> value. The sdf, fpf, csv, and tsv <Ext> values
are used for SD, FP, comma/semicolon, and tab delimited text files,
respectively. This option is ignored for multiple input files.

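The naming rule above can be sketched in Python. This is an
illustration only: the function name and parameters are hypothetical,
and it assumes that <SDFileName> is the input file name without its
directory and extension, which the text does not spell out.

```python
import os


def output_file_names(sd_file, output="text", out_delim="comma", root=None):
    """Return the output file names for one input SD file.

    Hypothetical helper mirroring the documented <Root>.<Ext> rule.
    """
    if root is None:
        # Assumption: <SDFileName> drops the directory and the extension.
        base = os.path.splitext(os.path.basename(sd_file))[0]
        root = base + "TopologicalPharmacophoreAtomPairsFP"

    # csv is used for comma and semicolon delimiters, tsv for tab.
    text_ext = "tsv" if out_delim == "tab" else "csv"
    ext_by_type = {"SD": "sdf", "FP": "fpf", "text": text_ext}

    types = ["SD", "FP", "text"] if output == "all" else [output]
    return [f"{root}.{ext_by_type[file_type]}" for file_type in types]
```

Under these assumptions, `-r SampleTPAPFP --output all` corresponds to
SampleTPAPFP.sdf, SampleTPAPFP.fpf and SampleTPAPFP.csv.
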
--ValuesPrecision *number*
Precision of atom pairs count real values which might be generated
after normalization or fuzzification. Default value: up to *2*
decimal places. Valid values: positive integers.

-v, --VectorStringFormat *ValuesString | IDsAndValuesString |
IDsAndValuesPairsString | ValuesAndIDsString | ValuesAndIDsPairsString*
Format of fingerprints vector string data in output SD, FP or
CSV/TSV text file(s) specified by --output option. Possible values:
*ValuesString | IDsAndValuesString | IDsAndValuesPairsString |
ValuesAndIDsString | ValuesAndIDsPairsString*.

Default value during *FixedSize* value of --AtomPairsSetSizeToUse
option: *ValuesString*. Default value during *ArbitrarySize* value
of --AtomPairsSetSizeToUse option: *IDsAndValuesString*.

*ValuesString* option value is not allowed for *ArbitrarySize* value
of --AtomPairsSetSizeToUse option.

Examples:

FingerprintsVector;TopologicalPharmacophoreAtomPairs:ArbitrarySize:Min
Distance1:MaxDistance10;54;NumericalValues;IDsAndValuesString;H-D1-H H
-D1-NI HBA-D1-NI HBD-D1-NI H-D2-H H-D2-HBA H-D2-HBD HBA-D2-HBA HBA-D2-
HBD H-D3-H H-D3-HBA H-D3-HBD H-D3-NI HBA-D3-NI HBD-D3-NI H-D4-H H-D4-H
BA H-D4-HBD HBA-D4-HBA HBA-D4-HBD HBD-D4-HBD H-D5-H H-D5-HBA H-D5-...;
18 1 2 1 22 12 8 1 2 18 6 3 1 1 1 22 13 6 5 7 2 28 9 5 1 1 1 36 16 10
3 4 1 37 10 8 1 35 10 9 3 3 1 28 7 7 4 18 16 12 5 1 2 1

FingerprintsVector;TopologicalPharmacophoreAtomPairs:FixedSize:MinDist
ance1:MaxDistance10;150;OrderedNumericalValues;ValuesString;18 0 0 1 0
0 0 2 0 0 1 0 0 0 0 22 12 8 0 0 1 2 0 0 0 0 0 0 0 0 18 6 3 1 0 0 0 1
0 0 1 0 0 0 0 22 13 6 0 0 5 7 0 0 2 0 0 0 0 0 28 9 5 1 0 0 0 1 0 0 1 0
0 0 0 36 16 10 0 0 3 4 0 0 1 0 0 0 0 0 37 10 8 0 0 0 0 1 0 0 0 0 0 0
0 35 10 9 0 0 3 3 0 0 1 0 0 0 0 0 28 7 7 4 0 0 0 0 0 0 0 0 0 0 0 18...

FingerprintsVector;TopologicalPharmacophoreAtomPairs:FixedSize:MinDist
ance1:MaxDistance10;150;OrderedNumericalValues;IDsAndValuesString;H-D1
-H H-D1-HBA H-D1-HBD H-D1-NI H-D1-PI HBA-D1-HBA HBA-D1-HBD HBA-D1-NI H
BA-D1-PI HBD-D1-HBD HBD-D1-NI HBD-D1-PI NI-D1-NI NI-D1-PI PI-D1-PI H-D
2-H H-D2-HBA H-D2-HBD H-D2-NI H-D2-PI HBA-D2-HBA HBA-D2-HBD HBA-D2...;
18 0 0 1 0 0 0 2 0 0 1 0 0 0 0 22 12 8 0 0 1 2 0 0 0 0 0 0 0 0 18 6 3
1 0 0 0 1 0 0 1 0 0 0 0 22 13 6 0 0 5 7 0 0 2 0 0 0 0 0 28 9 5 1 0 0 0
1 0 0 1 0 0 0 0 36 16 10 0 0 3 4 0 0 1 0 0 0 0

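The example strings above are semicolon-separated records, so they can
be split apart with a few lines of Python. The sketch below handles
only the two layouts shown in the examples (ValuesString and
IDsAndValuesString); the function name and the returned dictionary keys
are hypothetical, and the input is assumed to be a single unwrapped
line, so the wrapped text shown above would first need to be rejoined.

```python
def parse_fingerprints_vector(text):
    """Split a fingerprints vector string into its documented parts.

    Hypothetical helper; supports ValuesString and IDsAndValuesString only.
    """
    fields = text.strip().split(";")
    if len(fields) < 6 or fields[0] != "FingerprintsVector":
        raise ValueError("Not a fingerprints vector string")

    description, size, value_type, vector_format = fields[1:5]
    if vector_format == "ValuesString":
        ids = None
        values = fields[5].split()
    elif vector_format == "IDsAndValuesString":
        if len(fields) < 7:
            raise ValueError("Missing values field")
        ids = fields[5].split()
        values = fields[6].split()
    else:
        raise ValueError(f"Format not handled in this sketch: {vector_format}")

    return {
        "description": description,
        "size": int(size),
        "value_type": value_type,
        "format": vector_format,
        "ids": ids,
        # Counts may be real numbers after normalization or fuzzification.
        "values": [float(value) for value in values],
    }
```

Fed a shortened version of the first example that keeps only the IDs
H-D1-H, H-D1-NI and HBA-D1-NI with values 18, 1 and 2, the sketch pairs
each atom pair ID with its count.
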
-w, --WorkingDir *DirName*
Location of working directory. Default value: current directory.

EXAMPLES
To generate topological pharmacophore atom pairs fingerprints of
arbitrary size corresponding to distances from 1 through 10 using
default atom types with no weighting, normalization, or fuzzification
of atom pairs count and create a SampleTPAPFP.csv file containing
sequential compound IDs along with fingerprints vector strings data in
IDsAndValuesString format, type:

% TopologicalPharmacophoreAtomPairsFingerprints.pl -r SampleTPAPFP
  -o Sample.sdf

To generate topological pharmacophore atom pairs fingerprints of fixed
size corresponding to distances from 1 through 10 using default atom
types with no weighting, normalization, or fuzzification of atom pairs
count and create a SampleTPAPFP.csv file containing sequential compound
IDs along with fingerprints vector strings data in ValuesString format,
type:

% TopologicalPharmacophoreAtomPairsFingerprints.pl
  --AtomPairsSetSizeToUse FixedSize -r SampleTPAPFP -o Sample.sdf

To generate topological pharmacophore atom pairs fingerprints of
arbitrary size corresponding to distances from 1 through 10 using
default atom types with no weighting, normalization, or fuzzification
of atom pairs count and create SampleTPAPFP.sdf, SampleTPAPFP.fpf and
SampleTPAPFP.csv files containing sequential compound IDs in CSV file
along with fingerprints vector strings data in IDsAndValuesString
format, type:

% TopologicalPharmacophoreAtomPairsFingerprints.pl --output all
  -r SampleTPAPFP -o Sample.sdf

To generate topological pharmacophore atom pairs fingerprints of
arbitrary size corresponding to distances from 1 through 10 using
default atom types with no weighting, normalization, or fuzzification
of atom pairs count and create a SampleTPAPFP.csv file containing
sequential compound IDs along with fingerprints vector strings data in
IDsAndValuesPairsString format, type:

% TopologicalPharmacophoreAtomPairsFingerprints.pl --VectorStringFormat
  IDsAndValuesPairsString -r SampleTPAPFP -o Sample.sdf

To generate topological pharmacophore atom pairs fingerprints of
arbitrary size corresponding to distances from 1 through 6 using default
atom types with no weighting, normalization, or fuzzification of atom
pairs count and create a SampleTPAPFP.csv file containing sequential
compound IDs along with fingerprints vector strings data in
IDsAndValuesString format, type:

% TopologicalPharmacophoreAtomPairsFingerprints.pl --MinDistance 1
  --MaxDistance 6 -r SampleTPAPFP -o Sample.sdf

    To generate topological pharmacophore atom pairs fingerprints of
    arbitrary size corresponding to distances from 1 through 10 using
    "HBD,HBA,PI,NI" atom types with double the weighting for "HBD,HBA" and
    normalization by heavy atoms count but no fuzzification of atom pairs
    count, and create a SampleTPAPFP.csv file containing sequential compound
    IDs along with fingerprints vector strings data in ValuesString format,
    type:

        % TopologicalPharmacophoreAtomPairsFingerprints.pl --MinDistance 1
          --MaxDistance 10 --AtomTypesToUse "HBD,HBA,PI,NI" --AtomTypesWeight
          "HBD,2,HBA,2,PI,1,NI,1" --NormalizationMethodology ByHeavyAtomsCount
          --FuzzifyAtomPairsCount No -r SampleTPAPFP -o Sample.sdf

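    The --AtomTypesWeight value is a flat, comma-delimited list that
    alternates atom type and weight. As a rough illustration only (this is
    not the script's own parsing code, and the helper name
    parse_atom_types_weight is made up for this sketch), the pairing can be
    pictured in Python as:

```python
def parse_atom_types_weight(spec):
    """Pair up an 'AtomType1,Weight1,AtomType2,Weight2,...' string.

    Hypothetical helper for illustration; not part of MayaChemTools.
    """
    tokens = [token.strip() for token in spec.split(",")]
    if len(tokens) % 2 != 0:
        raise ValueError("expected alternating atom type and weight values")
    # Even positions hold atom types, odd positions hold their weights.
    return {tokens[i]: float(tokens[i + 1]) for i in range(0, len(tokens), 2)}


weights = parse_atom_types_weight("HBD,2,HBA,2,PI,1,NI,1")
print(weights)
```

    Reading the value this way makes it easy to confirm that every atom
    type named in --AtomTypesToUse has a matching weight before a long run
    is started.
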
    To generate topological pharmacophore atom pairs fingerprints of
    arbitrary size corresponding to distances from 1 through 10 using
    "HBD,HBA,PI,NI,H" atom types with no weighting of atom types and no
    normalization, but with fuzzification of atom pairs count using the
    FuzzyBinning methodology with a FuzzFactor value of 0.5, and create a
    SampleTPAPFP.csv file containing sequential compound IDs along with
    fingerprints vector strings data in ValuesString format, type:

        % TopologicalPharmacophoreAtomPairsFingerprints.pl --MinDistance 1
          --MaxDistance 10 --AtomTypesToUse "HBD,HBA,PI,NI,H" --AtomTypesWeight
          "HBD,1,HBA,1,PI,1,NI,1,H,1" --NormalizationMethodology None
          --FuzzifyAtomPairsCount Yes --FuzzificationMethodology FuzzyBinning
          --FuzzFactor 0.5 -r SampleTPAPFP -o Sample.sdf

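    To give a feel for what fuzzification does to the counts, here is a
    small Python sketch. It rests on an assumption that should be checked
    against the --FuzzificationMethodology option description: that
    FuzzyBinning leaves the count in each distance bin unchanged and adds
    FuzzFactor times that count to each neighboring distance bin. The
    function name and the sample counts are invented for the illustration.

```python
def fuzzy_binning(counts, fuzz_factor):
    """Spread each distance bin's count onto its immediate neighbors.

    Assumed rule (not taken from the script's source): the bin itself is
    unchanged and each existing neighbor gains fuzz_factor times the
    ORIGINAL count of the bin.
    """
    fuzzified = list(counts)
    for index, count in enumerate(counts):
        if index > 0:
            fuzzified[index - 1] += count * fuzz_factor
        if index + 1 < len(counts):
            fuzzified[index + 1] += count * fuzz_factor
    return fuzzified


# Made-up counts for one atom pair type at distances 1 through 4.
original_counts = [0, 4, 0, 2]
print(fuzzy_binning(original_counts, 0.5))
```

    Contributions are computed from the original counts rather than from
    already fuzzified ones, so the result does not depend on the order in
    which the bins are visited.
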
    To generate topological pharmacophore atom pairs fingerprints of
    arbitrary size corresponding to distances from 1 through 10 using
    default atom types with no weighting, normalization, or fuzzification
    of atom pairs count, and create a SampleTPAPFP.csv file containing
    compound IDs from the molecule name line along with fingerprints vector
    strings data, type:

        % TopologicalPharmacophoreAtomPairsFingerprints.pl --DataFieldsMode
          CompoundID --CompoundIDMode MolName -r SampleTPAPFP -o Sample.sdf

    To generate topological pharmacophore atom pairs fingerprints of
    arbitrary size corresponding to distances from 1 through 10 using
    default atom types with no weighting, normalization, or fuzzification
    of atom pairs count, and create a SampleTPAPFP.csv file containing
    compound IDs from the specified data field along with fingerprints
    vector strings data, type:

        % TopologicalPharmacophoreAtomPairsFingerprints.pl --DataFieldsMode
          CompoundID --CompoundIDMode DataField --CompoundID Mol_ID
          -r SampleTPAPFP -o Sample.sdf

    To generate topological pharmacophore atom pairs fingerprints of
    arbitrary size corresponding to distances from 1 through 10 using
    default atom types with no weighting, normalization, or fuzzification
    of atom pairs count, and create a SampleTPAPFP.csv file containing
    compound IDs using a combination of the molecule name line and an
    explicit compound prefix along with fingerprints vector strings data,
    type:

        % TopologicalPharmacophoreAtomPairsFingerprints.pl --DataFieldsMode
          CompoundID --CompoundIDMode MolNameOrLabelPrefix
          --CompoundID Cmpd --CompoundIDLabel MolID -r SampleTPAPFP -o Sample.sdf

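    The three compound ID examples differ only in where the ID comes from.
    The Python sketch below shows one plausible reading of the modes. It is
    an assumption rather than the script's actual logic: in particular, the
    "prefix followed by record number" form and the fallback used when the
    molecule name line is empty are guesses, and the function name is
    hypothetical.

```python
def choose_compound_id(mode, record_number, mol_name, data_fields, compound_id):
    """Pick a compound ID for one SD record (illustrative only).

    mode         value given to --CompoundIDMode
    compound_id  value given to --CompoundID (data field name or prefix)
    """
    if mode == "MolName":
        return mol_name
    if mode == "DataField":
        return data_fields.get(compound_id, "")
    # Assumed label form: prefix immediately followed by the record number.
    label = f"{compound_id}{record_number}"
    if mode == "LabelPrefix":
        return label
    if mode == "MolNameOrLabelPrefix":
        # Assumed fallback: use the label only when the name line is blank.
        return mol_name if mol_name.strip() else label
    raise ValueError(f"unknown compound ID mode: {mode}")


print(choose_compound_id("DataField", 1, "", {"Mol_ID": "M42"}, "Mol_ID"))
print(choose_compound_id("MolNameOrLabelPrefix", 3, "", {}, "Cmpd"))
```
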
    To generate topological pharmacophore atom pairs fingerprints of
    arbitrary size corresponding to distances from 1 through 10 using
    default atom types with no weighting, normalization, or fuzzification
    of atom pairs count, and create a SampleTPAPFP.csv file containing
    the specified data field columns along with fingerprints vector strings
    data, type:

        % TopologicalPharmacophoreAtomPairsFingerprints.pl --DataFieldsMode
          Specify --DataFields Mol_ID -r SampleTPAPFP -o Sample.sdf

    To generate topological pharmacophore atom pairs fingerprints of
    arbitrary size corresponding to distances from 1 through 10 using
    default atom types with no weighting, normalization, or fuzzification
    of atom pairs count, and create a SampleTPAPFP.csv file containing the
    common data field columns along with fingerprints vector strings data,
    type:

        % TopologicalPharmacophoreAtomPairsFingerprints.pl --DataFieldsMode
          Common -r SampleTPAPFP -o Sample.sdf

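    "Common" here is taken to mean the data fields that appear in every
    compound record of the SD file. Under that reading (an interpretation,
    not a statement about the script's internals), the selection amounts to
    a set intersection, sketched in Python with made-up records:

```python
def common_data_fields(records):
    """Return the field labels present in every record, in first-record order."""
    if not records:
        return []
    shared = set(records[0])
    for record in records[1:]:
        shared &= set(record)
    return [label for label in records[0] if label in shared]


# Hypothetical data fields from three SD records.
records = [
    {"Mol_ID": "M1", "MolWeight": "180.2", "Source": "A"},
    {"Mol_ID": "M2", "MolWeight": "151.2"},
    {"Mol_ID": "M3", "MolWeight": "206.3", "Source": "B"},
]
print(common_data_fields(records))
```

    A field such as Source, which is missing from one record, would only be
    written out with --DataFieldsMode All or by naming it explicitly with
    --DataFieldsMode Specify.
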
    To generate topological pharmacophore atom pairs fingerprints of
    arbitrary size corresponding to distances from 1 through 10 using
    default atom types with no weighting, normalization, or fuzzification
    of atom pairs count, and create SampleTPAPFP.sdf, SampleTPAPFP.fpf, and
    SampleTPAPFP.csv files, with the CSV file containing all data field
    columns along with fingerprints data, type:

        % TopologicalPharmacophoreAtomPairsFingerprints.pl --DataFieldsMode
          All --output all -r SampleTPAPFP -o Sample.sdf

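    When these invocations are driven from another program, passing the
    arguments as a list avoids shell quoting problems with comma-delimited
    values. The Python sketch below only assembles the argument list for
    the last example; build_command is a hypothetical helper, and -r and -o
    are passed exactly as in the examples above.

```python
SCRIPT = "TopologicalPharmacophoreAtomPairsFingerprints.pl"


def build_command(sd_file, root, options=None):
    """Return an argv list for one run.

    options maps long option names (without dashes) to their values.
    """
    argv = [SCRIPT]
    for name, value in (options or {}).items():
        argv += [f"--{name}", str(value)]
    # Same trailing arguments as the documented examples.
    argv += ["-r", root, "-o", sd_file]
    return argv


command = build_command(
    "Sample.sdf", "SampleTPAPFP", {"DataFieldsMode": "All", "output": "all"}
)
print(" ".join(command))

# To run it for real (requires the script to be on PATH):
#   import subprocess
#   subprocess.run(command, check=True)
```
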
AUTHOR
    Manish Sud <msud@san.rr.com>

SEE ALSO
    InfoFingerprintsFiles.pl, SimilarityMatricesFingerprints.pl,
    AtomNeighborhoodsFingerprints.pl, ExtendedConnectivityFingerprints.pl,
    MACCSKeysFingerprints.pl, PathLengthFingerprints.pl,
    TopologicalAtomPairsFingerprints.pl,
    TopologicalAtomTorsionsFingerprints.pl,
    TopologicalPharmacophoreAtomTripletsFingerprints.pl

COPYRIGHT
    Copyright (C) 2015 Manish Sud. All rights reserved.

    This file is part of MayaChemTools.

    MayaChemTools is free software; you can redistribute it and/or modify it
    under the terms of the GNU Lesser General Public License as published by
    the Free Software Foundation; either version 3 of the License, or (at
    your option) any later version.
