annotate docs/scripts/txt/TopologicalPharmacophoreAtomTripletsFingerprints.txt @ 0:4816e4a8ae95 draft default tip

Uploaded
author deepakjadmin
date Wed, 20 Jan 2016 09:23:18 -0500
parents
children
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
0
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
1 NAME
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
2 TopologicalPharmacophoreAtomTripletsFingerprints.pl - Generate
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
3 topological pharmacophore atom triplets fingerprints for SD files
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
4
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
5 SYNOPSIS
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
6 TopologicalPharmacophoreAtomTripletsFingerprints.pl SDFile(s)...
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
7
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
8 TopologicalPharmacophoreAtomTripletsFingerprints.pl [--AromaticityModel
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
9 *AromaticityModelType*] [--AtomTripletsSetSizeToUse *ArbitrarySize |
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
10 FixedSize*] [-a, --AtomTypesToUse *"AtomType1, AtomType2..."*]
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
11 [--AtomTypesWeight *"AtomType1, Weight1, AtomType2, Weight2..."*]
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
12 [--CompoundID *DataFieldName or LabelPrefixString*] [--CompoundIDLabel
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
13 *text*] [--CompoundIDMode] [--DataFields *"FieldLabel1,
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
14 FieldLabel2,..."*] [-d, --DataFieldsMode *All | Common | Specify |
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
15 CompoundID*] [--DistanceBinSize *number*] [-f, --Filter *Yes | No*]
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
16 [--FingerprintsLabelMode *FingerprintsLabelOnly |
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
17 FingerprintsLabelWithIDs*] [--FingerprintsLabel *text*] [-h, --help]
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
18 [-k, --KeepLargestComponent *Yes | No*] [--MinDistance *number*]
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
19 [--MaxDistance *number*] [--OutDelim *comma | tab | semicolon*]
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
20 [--output *SD | FP | text | all*] [-o, --overwrite] [-q, --quote *Yes |
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
21 No*] [-r, --root *RootName*] [-u, --UseTriangleInequality *Yes | No*]
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
22 [-v, --VectorStringFormat *ValuesString, IDsAndValuesString |
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
23 IDsAndValuesPairsString | ValuesAndIDsString | ValuesAndIDsPairsString*]
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
24 [-w, --WorkingDir dirname] SDFile(s)...
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
25
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
26 DESCRIPTION
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
27 Generate topological pharmacophore atom triplets fingerprints [ Ref 66,
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
28 Ref 68-71 ] for *SDFile(s)* and create appropriate SD, FP or CSV/TSV
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
29 text file(s) containing fingerprints vector strings corresponding to
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
30 molecular fingerprints.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
31
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
32 Multiple SDFile names are separated by spaces. The valid file extensions
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
33 are *.sdf* and *.sd*. All other file names are ignored. All the SD files
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
34 in a current directory can be specified either by **.sdf* or the current
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
35 directory name.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
36
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
37 Based on the values specified for --AtomTypesToUse, pharmacophore atom
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
38 types are assigned to all non-hydrogen atoms in a molecule and a
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
39 distance matrix is generated. Using --MinDistance, --MaxDistance, and
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
40 --DistanceBinSize values, a binned distance matrix is generated with
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
41 lower bound on the distance bin as the distance in distance matrix; the
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
42 lower bound on the distance bin is also used as the distance between
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
43 atom pairs for generation of atom triplet identifiers.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
44
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
45 A pharmacophore atom triplets basis set is generated for all unique atom
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
46 triplets constituting atom pairs binned distances between --MinDistance
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
47 and --MaxDistance. The value of --UseTriangleInequality determines
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
48 whether the triangle inequality test is applied during generation of
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
49 atom triplets basis set. The lower distance bound, along with specified
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
50 pharmacophore types, is used during generation of atom triplet IDs.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
51
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
52 Let:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
53
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
54 P = Valid pharmacophore atom type
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
55
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
56 Px = Pharmacophore atom x
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
57 Py = Pharmacophore atom y
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
58 Pz = Pharmacophore atom z
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
59
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
60 Dmin = Minimum distance corresponding to number of bonds between two atoms
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
61 Dmax = Maximum distance corresponding to number of bonds between two atoms
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
62 D = Distance corresponding to number of bonds between two atom
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
63
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
64 Bsize = Distance bin size
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
65 Nbins = Number of distance bins
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
66
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
67 Dxy = Distance or lower bound of binned distance between Px and Py
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
68 Dxz = Distance or lower bound of binned distance between Px and Pz
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
69 Dyz = Distance or lower bound of binned distance between Py and Pz
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
70
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
71 Then:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
72
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
73 PxDyz-PyDxz-PzDxy = Pharmacophore atom triplet IDs for atom types Px,
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
74 Py, and Pz
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
75
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
76 For example: H1-H1-H1, H2-HBA-H2 and so on
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
77
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
78 For default values of Dmin = 1 , Dmax = 10 and Bsize = 2:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
79
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
80 the number of distance bins, Nbins = 5, are:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
81
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
82 [1, 2] [3, 4] [5, 6] [7, 8] [9 10]
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
83
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
84 and atom triplet basis set size is 2692.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
85
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
86 Atom triplet basis set size for various values of Dmin, Dmax and Bsize in
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
87 conjunction with usage of triangle inequality is:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
88
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
89 Dmin Dmax Bsize UseTriangleInequality TripletBasisSetSize
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
90 1 10 2 No 4960
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
91 1 10 2 Yes 2692 [ Default ]
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
92 2 12 2 No 8436
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
93 2 12 2 Yes 4494
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
94
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
95 Using binned distance matrix and pharmacohore atom types, occurrence of
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
96 unique pharmacohore atom triplets is counted.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
97
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
98 The final pharmacophore atom triples count along with atom pair
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
99 identifiers involving all non-hydrogen atoms constitute pharmacophore
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
100 topological atom triplets fingerprints of the molecule.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
101
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
102 For *ArbitrarySize* value of --AtomTripletsSetSizeToUse option, the
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
103 fingerprint vector correspond to only those topological pharmacophore
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
104 atom triplets which are present and have non-zero count. However, for
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
105 *FixedSize* value of --AtomTripletsSetSizeToUse option, the fingerprint
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
106 vector contains all possible valid topological pharmacophore atom
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
107 triplets with both zero and non-zero count values.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
108
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
109 Example of *SD* file containing topological pharmacophore atom triplets
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
110 fingerprints string data:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
111
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
112 ... ...
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
113 ... ...
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
114 $$$$
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
115 ... ...
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
116 ... ...
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
117 ... ...
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
118 41 44 0 0 0 0 0 0 0 0999 V2000
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
119 -3.3652 1.4499 0.0000 C 0 0 0 0 0 0 0 0 0 0 0 0
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
120 ... ...
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
121 2 3 1 0 0 0 0
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
122 ... ...
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
123 M END
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
124 > <CmpdID>
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
125 Cmpd1
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
126
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
127 > <TopologicalPharmacophoreAtomTripletsFingerprints>
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
128 FingerprintsVector;TopologicalPharmacophoreAtomTriplets:ArbitrarySize:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
129 MinDistance1:MaxDistance10;696;NumericalValues;IDsAndValuesString;Ar1-
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
130 Ar1-Ar1 Ar1-Ar1-H1 Ar1-Ar1-HBA1 Ar1-Ar1-HBD1 Ar1-H1-H1 Ar1-H1-HBA1 Ar1
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
131 -H1-HBD1 Ar1-HBA1-HBD1 H1-H1-H1 H1-H1-HBA1 H1-H1-HBD1 H1-HBA1-HBA1 H1-
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
132 HBA1-HBD1 H1-HBA1-NI1 H1-HBD1-NI1 HBA1-HBA1-NI1 HBA1-HBD1-NI1 Ar1-...;
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
133 46 106 8 3 83 11 4 1 21 5 3 1 2 2 1 1 1 100 101 18 11 145 132 26 14 23
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
134 28 3 3 5 4 61 45 10 4 16 20 7 5 1 3 4 5 3 1 1 1 1 5 4 2 1 2 2 2 1 1 1
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
135 119 123 24 15 185 202 41 25 22 17 3 5 85 95 18 11 23 17 3 1 1 6 4 ...
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
136
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
137 $$$$
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
138 ... ...
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
139 ... ...
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
140
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
141 Example of *FP* file containing topological pharmacophore atom triplets
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
142 fingerprints string data:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
143
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
144 #
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
145 # Package = MayaChemTools 7.4
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
146 # Release Date = Oct 21, 2010
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
147 #
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
148 # TimeStamp = Fri Mar 11 15:38:58 2011
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
149 #
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
150 # FingerprintsStringType = FingerprintsVector
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
151 #
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
152 # Description = TopologicalPharmacophoreAtomTriplets:ArbitrarySize:M...
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
153 # VectorStringFormat = IDsAndValuesString
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
154 # VectorValuesType = NumericalValues
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
155 #
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
156 Cmpd1 696;Ar1-Ar1-Ar1 Ar1-Ar1-H1 Ar1-Ar1-HBA1 Ar1-Ar1-HBD1...;;46 106...
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
157 Cmpd2 251;H1-H1-H1 H1-H1-HBA1 H1-H1-HBD1 H1-H1-NI1...;4 1 3 1 1 2 2...
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
158 ... ...
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
159 ... ..
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
160
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
161 Example of CSV *Text* file containing topological pharmacophore atom
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
162 triplets fingerprints string data:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
163
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
164 "CompoundID","TopologicalPharmacophoreAtomTripletsFingerprints"
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
165 "Cmpd1","FingerprintsVector;TopologicalPharmacophoreAtomTriplets:Arbitr
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
166 arySize:MinDistance1:MaxDistance10;696;NumericalValues;IDsAndValuesStri
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
167 ng;Ar1-Ar1-Ar1 Ar1-Ar1-H1 Ar1-Ar1-HBA1 Ar1-Ar1-HBD1 Ar1-H1-H1 Ar1-H1-HB
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
168 A1 Ar1-H1-HBD1 Ar1-HBA1-HBD1 H1-H1-H1 H1-H1-HBA1 H1-H1-HBD1 H1-HBA1-HBA
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
169 1 H1-HBA1-HBD1 H1-HBA1-NI1 H1-HBD1-NI1 HBA1-HBA1-NI1 HBA1-HBD1-NI1 A...;
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
170 46 106 8 3 83 11 4 1 21 5 3 1 2 2 1 1 1 100 101 18 11 145 132 26 14 23
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
171 28 3 3 5 4 61 45 10 4 16 20 7 5 1 3 4 5 3 1 1 1 1 5 4 2 1 2 2 2 1 1 1
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
172 119 123 24 15 185 202 41 25 22 17 3 5 85 95 18 11 23 17 3 1 1 6 4 ...
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
173 ... ...
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
174 ... ...
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
175
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
176 The current release of MayaChemTools generates the following types of
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
177 topological pharmacophore atom triplets fingerprints vector strings:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
178
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
179 FingerprintsVector;TopologicalPharmacophoreAtomTriplets:ArbitrarySize:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
180 MinDistance1:MaxDistance10;696;NumericalValues;IDsAndValuesString;Ar1-
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
181 Ar1-Ar1 Ar1-Ar1-H1 Ar1-Ar1-HBA1 Ar1-Ar1-HBD1 Ar1-H1-H1 Ar1-H1-HBA1 Ar1
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
182 -H1-HBD1 Ar1-HBA1-HBD1 H1-H1-H1 H1-H1-HBA1 H1-H1-HBD1 H1-HBA1-HBA1 H1-
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
183 HBA1-HBD1 H1-HBA1-NI1 H1-HBD1-NI1 HBA1-HBA1-NI1 HBA1-HBD1-NI1 Ar1-...;
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
184 46 106 8 3 83 11 4 1 21 5 3 1 2 2 1 1 1 100 101 18 11 145 132 26 14 23
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
185 28 3 3 5 4 61 45 10 4 16 20 7 5 1 3 4 5 3 1 1 1 1 5 4 2 1 2 2 2 1 1 1
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
186 119 123 24 15 185 202 41 25 22 17 3 5 85 95 18 11 23 17 3 1 1 6 4 ...
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
187
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
188 FingerprintsVector;TopologicalPharmacophoreAtomTriplets:FixedSize:MinD
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
189 istance1:MaxDistance10;2692;OrderedNumericalValues;ValuesString;46 106
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
190 8 3 0 0 83 11 4 0 0 0 1 0 0 0 0 0 0 0 0 21 5 3 0 0 1 2 2 0 0 1 0 0 0
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
191 0 0 0 1 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 100 101 18 11 0 0 145 132 26
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
192 14 0 0 23 28 3 3 0 0 5 4 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 61 45 10 4 0
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
193 0 16 20 7 5 1 0 3 4 5 3 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 1 1 0 0 5 ...
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
194
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
195 FingerprintsVector;TopologicalPharmacophoreAtomTriplets:FixedSize:MinD
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
196 istance1:MaxDistance10;2692;OrderedNumericalValues;IDsAndValuesString;
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
197 Ar1-Ar1-Ar1 Ar1-Ar1-H1 Ar1-Ar1-HBA1 Ar1-Ar1-HBD1 Ar1-Ar1-NI1 Ar1-Ar1-P
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
198 I1 Ar1-H1-H1 Ar1-H1-HBA1 Ar1-H1-HBD1 Ar1-H1-NI1 Ar1-H1-PI1 Ar1-HBA1-HB
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
199 A1 Ar1-HBA1-HBD1 Ar1-HBA1-NI1 Ar1-HBA1-PI1 Ar1-HBD1-HBD1 Ar1-HBD1-...;
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
200 46 106 8 3 0 0 83 11 4 0 0 0 1 0 0 0 0 0 0 0 0 21 5 3 0 0 1 2 2 0 0 1
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
201 0 0 0 0 0 0 1 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 100 101 18 11 0 0 145
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
202 132 26 14 0 0 23 28 3 3 0 0 5 4 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 61 ...
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
203
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
204 OPTIONS
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
205 --AromaticityModel *MDLAromaticityModel | TriposAromaticityModel |
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
206 MMFFAromaticityModel | ChemAxonBasicAromaticityModel |
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
207 ChemAxonGeneralAromaticityModel | DaylightAromaticityModel |
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
208 MayaChemToolsAromaticityModel*
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
209 Specify aromaticity model to use during detection of aromaticity.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
210 Possible values in the current release are: *MDLAromaticityModel,
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
211 TriposAromaticityModel, MMFFAromaticityModel,
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
212 ChemAxonBasicAromaticityModel, ChemAxonGeneralAromaticityModel,
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
213 DaylightAromaticityModel or MayaChemToolsAromaticityModel*. Default
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
214 value: *MayaChemToolsAromaticityModel*.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
215
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
216 The supported aromaticity model names along with model specific
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
217 control parameters are defined in AromaticityModelsData.csv, which
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
218 is distributed with the current release and is available under
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
219 lib/data directory. Molecule.pm module retrieves data from this file
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
220 during class instantiation and makes it available to method
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
221 DetectAromaticity for detecting aromaticity corresponding to a
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
222 specific model.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
223
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
224 --AtomTripletsSetSizeToUse *ArbitrarySize | FixedSize*
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
225 Atom triplets set size to use during generation of topological
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
226 pharmacophore atom triplets fingerprints.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
227
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
228 Possible values: *ArbitrarySize | FixedSize*; Default value:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
229 *ArbitrarySize*.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
230
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
231 For *ArbitrarySize* value of --AtomTripletsSetSizeToUse option, the
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
232 fingerprint vector correspond to only those topological
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
233 pharmacophore atom triplets which are present and have non-zero
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
234 count. However, for *FixedSize* value of --AtomTripletsSetSizeToUse
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
235 option, the fingerprint vector contains all possible valid
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
236 topological pharmacophore atom triplets with both zero and non-zero
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
237 count values.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
238
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
239 -a, --AtomTypesToUse *"AtomType1,AtomType2,..."*
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
240 Pharmacophore atom types to use during generation of topological
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
241 phramacophore atom triplets. It's a list of comma separated valid
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
242 pharmacophore atom types.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
243
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
244 Possible values for pharmacophore atom types are: *Ar, CA, H, HBA,
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
245 HBD, Hal, NI, PI, RA*. Default value [ Ref 71 ] :
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
246 *HBD,HBA,PI,NI,H,Ar*.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
247
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
248 The pharmacophore atom types abbreviations correspond to:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
249
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
250 HBD: HydrogenBondDonor
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
251 HBA: HydrogenBondAcceptor
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
252 PI : PositivelyIonizable
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
253 NI : NegativelyIonizable
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
254 Ar : Aromatic
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
255 Hal : Halogen
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
256 H : Hydrophobic
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
257 RA : RingAtom
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
258 CA : ChainAtom
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
259
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
260 *AtomTypes::FunctionalClassAtomTypes* module is used to assign
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
261 pharmacophore atom types. It uses following definitions [ Ref 60-61,
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
262 Ref 65-66 ]:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
263
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
264 HydrogenBondDonor: NH, NH2, OH
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
265 HydrogenBondAcceptor: N[!H], O
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
266 PositivelyIonizable: +, NH2
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
267 NegativelyIonizable: -, C(=O)OH, S(=O)OH, P(=O)OH
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
268
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
269 --CompoundID *DataFieldName or LabelPrefixString*
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
270 This value is --CompoundIDMode specific and indicates how compound
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
271 ID is generated.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
272
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
273 For *DataField* value of --CompoundIDMode option, it corresponds to
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
274 datafield label name whose value is used as compound ID; otherwise,
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
275 it's a prefix string used for generating compound IDs like
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
276 LabelPrefixString<Number>. Default value, *Cmpd*, generates compound
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
277 IDs which look like Cmpd<Number>.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
278
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
279 Examples for *DataField* value of --CompoundIDMode:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
280
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
281 MolID
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
282 ExtReg
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
283
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
284 Examples for *LabelPrefix* or *MolNameOrLabelPrefix* value of
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
285 --CompoundIDMode:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
286
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
287 Compound
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
288
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
289 The value specified above generates compound IDs which correspond to
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
290 Compound<Number> instead of default value of Cmpd<Number>.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
291
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
292 --CompoundIDLabel *text*
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
293 Specify compound ID column label for CSV/TSV text file(s) used
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
294 during *CompoundID* value of --DataFieldsMode option. Default value:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
295 *CompoundID*.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
296
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
297 --CompoundIDMode *DataField | MolName | LabelPrefix |
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
298 MolNameOrLabelPrefix*
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
299 Specify how to generate compound IDs and write to FP or CSV/TSV text
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
300 file(s) along with generated fingerprints for *FP | text | all*
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
301 values of --output option: use a *SDFile(s)* datafield value; use
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
302 molname line from *SDFile(s)*; generate a sequential ID with
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
303 specific prefix; use combination of both MolName and LabelPrefix
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
304 with usage of LabelPrefix values for empty molname lines.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
305
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
306 Possible values: *DataField | MolName | LabelPrefix |
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
307 MolNameOrLabelPrefix*. Default value: *LabelPrefix*.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
308
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
309 For *MolNameAndLabelPrefix* value of --CompoundIDMode, molname line
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
310 in *SDFile(s)* takes precedence over sequential compound IDs
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
311 generated using *LabelPrefix* and only empty molname values are
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
312 replaced with sequential compound IDs.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
313
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
314 This is only used for *CompoundID* value of --DataFieldsMode option.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
315
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
316 --DataFields *"FieldLabel1,FieldLabel2,..."*
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
317 Comma delimited list of *SDFiles(s)* data fields to extract and
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
318 write to CSV/TSV text file(s) along with generated fingerprints for
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
319 *text | all* values of --output option.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
320
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
321 This is only used for *Specify* value of --DataFieldsMode option.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
322
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
323 Examples:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
324
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
325 Extreg
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
326 MolID,CompoundName
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
327
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
328 -d, --DataFieldsMode *All | Common | Specify | CompoundID*
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
329 Specify how data fields in *SDFile(s)* are transferred to output
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
330 CSV/TSV text file(s) along with generated fingerprints for *text |
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
331 all* values of --output option: transfer all SD data field; transfer
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
332 SD data files common to all compounds; extract specified data
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
333 fields; generate a compound ID using molname line, a compound
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
334 prefix, or a combination of both. Possible values: *All | Common |
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
335 specify | CompoundID*. Default value: *CompoundID*.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
336
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
337 --DistanceBinSize *number*
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
338 Distance bin size used to bin distances between atom pairs in atom
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
339 triplets. Default value: *2*. Valid values: positive integers.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
340
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
341 For default --MinDistance and --MaxDistance values of 1 and 10 with
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
342 --DistanceBinSize of 2 [ Ref 70 ], the following 5 distance bins are
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
343 generated:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
344
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
345 [1, 2] [3, 4] [5, 6] [7, 8] [9 10]
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
346
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
347 The lower distance bound on the distance bin is uses to bin the
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
348 distance between atom pairs in atom triplets. So in the previous
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
349 example, atom pairs with distances 1 and 2 fall in first distance
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
350 bin, atom pairs with distances 3 and 4 fall in second distance bin
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
351 and so on.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
352
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
353 In order to distribute distance bins of equal size, the last bin is
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
354 allowed to go past --MaxDistance by up to distance bin size. For
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
355 example, --MinDistance and --MaxDistance values of 2 and 10 with
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
356 --DistanceBinSize of 2 generates the following 6 distance bins:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
357
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
358 [2, 3] [4, 5] [6, 7] [8, 9] [10 11]
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
359
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
360 -f, --Filter *Yes | No*
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
361 Specify whether to check and filter compound data in SDFile(s).
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
362 Possible values: *Yes or No*. Default value: *Yes*.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
363
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
364 By default, compound data is checked before calculating fingerprints
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
365 and compounds containing atom data corresponding to non-element
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
366 symbols or no atom data are ignored.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
367
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
368 --FingerprintsLabelMode *FingerprintsLabelOnly |
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
369 FingerprintsLabelWithIDs*
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
370 Specify how fingerprints label is generated in conjunction with
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
371 --FingerprintsLabel option value: use fingerprints label generated
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
372 only by --FingerprintsLabel option value or append topological atom
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
373 pair count value IDs to --FingerprintsLabel option value.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
374
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
375 Possible values: *FingerprintsLabelOnly | FingerprintsLabelWithIDs*.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
376 Default value: *FingerprintsLabelOnly*.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
377
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
378 Topological atom pairs IDs appended to --FingerprintsLabel value
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
379 during *FingerprintsLabelWithIDs* values of --FingerprintsLabelMode
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
380 correspond to atom pair count values in fingerprint vector string.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
381
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
382 *FingerprintsLabelWithIDs* value of --FingerprintsLabelMode is
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
383 ignored during *ArbitrarySize* value of --AtomTripletsSetSizeToUse
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
384 option and topological atom triplets IDs not appended to the label.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
385
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
386 --FingerprintsLabel *text*
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
387 SD data label or text file column label to use for fingerprints
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
388 string in output SD or CSV/TSV text file(s) specified by --output.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
389 Default value: *TopologicalPharmacophoreAtomTripletsFingerprints*.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
390
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
391 -h, --help
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
392 Print this help message.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
393
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
394 -k, --KeepLargestComponent *Yes | No*
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
395 Generate fingerprints for only the largest component in molecule.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
396 Possible values: *Yes or No*. Default value: *Yes*.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
397
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
398 For molecules containing multiple connected components, fingerprints
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
399 can be generated in two different ways: use all connected components
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
400 or just the largest connected component. By default, all atoms
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
401 except for the largest connected component are deleted before
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
402 generation of fingerprints.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
403
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
404 --MinDistance *number*
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
405 Minimum bond distance between atom pairs corresponding to atom
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
406 triplets for generating topological pharmacophore atom triplets.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
407 Default value: *1*. Valid values: positive integers and less than
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
408 --MaxDistance.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
409
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
410 --MaxDistance *number*
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
411 Maximum bond distance between atom pairs corresponding to atom
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
412 triplets for generating topological pharmacophore atom triplets.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
413 Default value: *10*. Valid values: positive integers and greater
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
414 than --MinDistance.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
415
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
416 --OutDelim *comma | tab | semicolon*
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
417 Delimiter for output CSV/TSV text file(s). Possible values: *comma,
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
418 tab, or semicolon* Default value: *comma*.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
419
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
420 --output *SD | FP | text | all*
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
421 Type of output files to generate. Possible values: *SD, FP, text, or
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
422 all*. Default value: *text*.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
423
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
424 -o, --overwrite
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
425 Overwrite existing files.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
426
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
427 -q, --quote *Yes | No*
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
428 Put quote around column values in output CSV/TSV text file(s).
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
429 Possible values: *Yes or No*. Default value: *Yes*.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
430
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
431 -r, --root *RootName*
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
432 New file name is generated using the root: <Root>.<Ext>. Default for
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
433 new file names:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
434 <SDFileName><TopologicalPharmacophoreAtomTripletsFP>.<Ext>. The file
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
435 type determines <Ext> value. The sdf, fpf, csv, and tsv <Ext> values
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
436 are used for SD, FP, comma/semicolon, and tab delimited text files,
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
437 respectively.This option is ignored for multiple input files.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
438
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
439 -u, --UseTriangleInequality *Yes | No*
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
440 Specify whether to imply triangle distance inequality test to
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
441 distances between atom pairs in atom triplets during generation of
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
442 atom triplets basis set generation. Possible values: *Yes or No*.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
443 Default value: *Yes*.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
444
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
445 Triangle distance inequality test implies that distance or binned
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
446 distance between any two atom pairs in an atom triplet must be less
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
447 than the sum of distances or binned distances between other two
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
448 atoms pairs and greater than the difference of their distances.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
449
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
450 For atom triplet PxDyz-PyDxz-PzDxy to satisfy triangle inequality:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
451
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
452 Dyz > |Dxz - Dxy| and Dyz < Dxz + Dxy
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
453 Dxz > |Dyz - Dxy| and Dyz < Dyz + Dxy
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
454 Dxy > |Dyz - Dxz| and Dxy < Dyz + Dxz
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
455
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
456 -v, --VectorStringFormat *ValuesString, IDsAndValuesString |
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
457 IDsAndValuesPairsString | ValuesAndIDsString | ValuesAndIDsPairsString*
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
458 Format of fingerprints vector string data in output SD, FP or
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
459 CSV/TSV text file(s) specified by --output option. Possible values:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
460 *ValuesString, IDsAndValuesString | IDsAndValuesPairsString |
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
461 ValuesAndIDsString | ValuesAndIDsPairsString*. Defaultvalue:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
462 *ValuesString*.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
463
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
464 Default value during *FixedSize* value of --AtomTripletsSetSizeToUse
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
465 option: *ValuesString*. Default value during *ArbitrarySize* value
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
466 of --AtomTripletsSetSizeToUse option: *IDsAndValuesString*.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
467
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
468 *ValuesString* option value is not allowed for *ArbitrarySize* value
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
469 of --AtomTripletsSetSizeToUse option.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
470
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
471 Examples:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
472
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
473 FingerprintsVector;TopologicalPharmacophoreAtomTriplets:ArbitrarySize:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
474 MinDistance1:MaxDistance10;696;NumericalValues;IDsAndValuesString;Ar1-
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
475 Ar1-Ar1 Ar1-Ar1-H1 Ar1-Ar1-HBA1 Ar1-Ar1-HBD1 Ar1-H1-H1 Ar1-H1-HBA1 Ar1
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
476 -H1-HBD1 Ar1-HBA1-HBD1 H1-H1-H1 H1-H1-HBA1 H1-H1-HBD1 H1-HBA1-HBA1 H1-
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
477 HBA1-HBD1 H1-HBA1-NI1 H1-HBD1-NI1 HBA1-HBA1-NI1 HBA1-HBD1-NI1 Ar1-...;
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
478 46 106 8 3 83 11 4 1 21 5 3 1 2 2 1 1 1 100 101 18 11 145 132 26 14 23
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
479 28 3 3 5 4 61 45 10 4 16 20 7 5 1 3 4 5 3 1 1 1 1 5 4 2 1 2 2 2 1 1 1
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
480 119 123 24 15 185 202 41 25 22 17 3 5 85 95 18 11 23 17 3 1 1 6 4 ...
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
481
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
482 FingerprintsVector;TopologicalPharmacophoreAtomTriplets:FixedSize:MinD
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
483 istance1:MaxDistance10;2692;OrderedNumericalValues;ValuesString;46 106
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
484 8 3 0 0 83 11 4 0 0 0 1 0 0 0 0 0 0 0 0 21 5 3 0 0 1 2 2 0 0 1 0 0 0
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
485 0 0 0 1 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 100 101 18 11 0 0 145 132 26
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
486 14 0 0 23 28 3 3 0 0 5 4 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 61 45 10 4 0
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
487 0 16 20 7 5 1 0 3 4 5 3 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 1 1 0 0 5 ...
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
488
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
489 FingerprintsVector;TopologicalPharmacophoreAtomTriplets:FixedSize:MinD
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
490 istance1:MaxDistance10;2692;OrderedNumericalValues;ValuesAndIDsPairsSt
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
491 ring;46 Ar1-Ar1-Ar1 106 Ar1-Ar1-H1 8 Ar1-Ar1-HBA1 3 Ar1-Ar1-HBD1 0 Ar1
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
492 -Ar1-NI1 0 Ar1-Ar1-PI1 83 Ar1-H1-H1 11 Ar1-H1-HBA1 4 Ar1-H1-HBD1 0 Ar1
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
493 -H1-NI1 0 Ar1-H1-PI1 0 Ar1-HBA1-HBA1 1 Ar1-HBA1-HBD1 0 Ar1-HBA1-NI1 0
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
494 Ar1-HBA1-PI1 0 Ar1-HBD1-HBD1 0 Ar1-HBD1-NI1 0 Ar1-HBD1-PI1 0 Ar1-NI...
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
495
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
496 -w, --WorkingDir *DirName*
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
497 Location of working directory. Default value: current directory.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
498
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
499 EXAMPLES
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
500 To generate topological pharmacophore atom triplets fingerprints of
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
501 arbitrary size corresponding to 5 distance bins spanning distances from
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
502 1 through 10 using default atoms with distances satisfying triangle
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
503 inequality and create a SampleTPATFP.csv file containing sequential
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
504 compound IDs along with fingerprints vector strings data in ValuesString
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
505 format, type:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
506
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
507 % TopologicalPharmacophoreAtomTripletsFingerprints.pl -r SampleTPATFP
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
508 -o Sample.sdf
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
509
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
510 To generate topological pharmacophore atom triplets fingerprints of
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
511 fixed size corresponding to 5 distance bins spanning distances from 1
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
512 through 10 using default atoms with distances satisfying triangle
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
513 inequality and create a SampleTPATFP.csv file containing sequential
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
514 compound IDs along with fingerprints vector strings data in ValuesString
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
515 format, type:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
516
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
517 % TopologicalPharmacophoreAtomTripletsFingerprints.pl
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
518 --AtomTripletsSetSizeToUse FixedSize -r SampleTPATFP -o Sample.sdf
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
519
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
520 To generate topological pharmacophore atom triplets fingerprints of
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
521 arbitrary size corresponding to 5 distance bins spanning distances from
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
522 1 through 10 using default atoms with distances satisfying triangle
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
523 inequality and create SampleTPATFP.sdf, SampleTPATFP.fpf and
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
524 SampleTPATFP.csv files with CSV file containing sequential compound IDs
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
525 along with fingerprints vector strings data in ValuesString format,
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
526 type:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
527
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
528 % TopologicalPharmacophoreAtomTripletsFingerprints.pl --output all
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
529 -r SampleTPATFP -o Sample.sdf
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
530
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
531 To generate topological pharmacophore atom triplets fingerprints of
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
532 arbitrary size corresponding to 5 distance bins spanning distances from
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
533 1 through 10 using default atoms with distances satisfying triangle
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
534 inequality and create a SampleTPATFP.csv file containing sequential
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
535 compound IDs along with fingerprints vector strings data in ValuesString
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
536 format and atom triplets IDs in the fingerprint data column label
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
537 starting with Fingerprints, type:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
538
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
539 % TopologicalPharmacophoreAtomTripletsFingerprints.pl
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
540 --FingerprintsLabelMode FingerprintsLabelWithIDs --FingerprintsLabel
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
541 Fingerprints -r SampleTPATFP -o Sample.sdf
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
542
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
543 To generate topological pharmacophore atom triplets fingerprints of
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
544 arbitrary size corresponding to 5 distance bins spanning distances from
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
545 1 through 10 using default atoms with distances not satisfying triangle
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
546 inequality and create a SampleTPATFP.csv file containing sequential
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
547 compound IDs along with fingerprints vector strings data in ValuesString
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
548 format, type:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
549
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
550 % TopologicalPharmacophoreAtomTripletsFingerprints.pl
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
551 --UseTriangleInequality No -r SampleTPATFP -o Sample.sdf
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
552
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
553 To generate topological pharmacophore atom triplets fingerprints of
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
554 arbitrary size corresponding to 6 distance bins spanning distances from
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
555 1 through 12 using default atoms with distances satisfying triangle
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
556 inequality and create a SampleTPATFP.csv file containing sequential
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
557 compound IDs along with fingerprints vector strings data in ValuesString
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
558 format, type:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
559
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
560 % TopologicalPharmacophoreAtomTripletsFingerprints.pl
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
561 --UseTriangleInequality Yes --MinDistance 1 --MaxDistance 12
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
562 --DistanceBinSIze 2 -r SampleTPATFP -o Sample.sdf
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
563
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
564 To generate topological pharmacophore atom triplets fingerprints of
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
565 arbitrary size corresponding to 6 distance bins spanning distances from
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
566 1 through 12 using "HBD,HBA,PI, NI, H, Ar" atoms with distances
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
567 satisfying triangle inequality and create a SampleTPATFP.csv file
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
568 containing sequential compound IDs along with fingerprints vector
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
569 strings data in ValuesString format, type:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
570
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
571 % TopologicalPharmacophoreAtomTripletsFingerprints.pl
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
572 --AtomTypesToUse "HBD,HBA,PI,NI,H,Ar" --UseTriangleInequality Yes
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
573 --MinDistance 1 --MaxDistance 12 --DistanceBinSIze 2
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
574 --VectorStringFormat ValuesString -r SampleTPATFP -o Sample.sdf
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
575
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
576 To generate topological pharmacophore atom triplets fingerprints of
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
577 arbitrary size corresponding to 5 distance bins spanning distances from
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
578 1 through 10 using default atoms with distances satisfying triangle
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
579 inequality and create a SampleTPATFP.csv file containing sequential
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
580 compound IDs from molecule name line along with fingerprints vector
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
581 strings data in ValuesString format, type:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
582
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
583 % TopologicalPharmacophoreAtomTripletsFingerprints.pl --DataFieldsMode
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
584 CompoundID -CompoundIDMode MolName -r SampleTPATFP -o Sample.sdf
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
585
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
586 To generate topological pharmacophore atom triplets fingerprints of
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
587 arbitrary size corresponding to 5 distance bins spanning distances from
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
588 1 through 10 using default atoms with distances satisfying triangle
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
589 inequality and create a SampleTPATFP.csv file containing sequential
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
590 compound IDs using specified data field along with fingerprints vector
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
591 strings data in ValuesString format, type:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
592
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
593 % TopologicalPharmacophoreAtomTripletsFingerprints.pl --DataFieldsMode
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
594 CompoundID -CompoundIDMode DataField --CompoundID Mol_ID
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
595 -r SampleTPATFP -o Sample.sdf
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
596
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
597 To generate topological pharmacophore atom triplets fingerprints of
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
598 arbitrary size corresponding to 5 distance bins spanning distances from
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
599 1 through 10 using default atoms with distances satisfying triangle
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
600 inequality and create a SampleTPATFP.csv file containing sequential
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
601 compound IDs using combination of molecule name line and an explicit
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
602 compound prefix along with fingerprints vector strings data, type:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
603
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
604 % TopologicalPharmacophoreAtomTripletsFingerprints.pl --DataFieldsMode
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
605 CompoundID -CompoundIDMode MolnameOrLabelPrefix
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
606 --CompoundID Cmpd --CompoundIDLabel MolID -r SampleSampleTPATFP
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
607 -o Sample.sdf
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
608
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
609 To generate topological pharmacophore atom triplets fingerprints of
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
610 arbitrary size corresponding to 5 distance bins spanning distances from
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
611 1 through 10 using default atoms with distances satisfying triangle
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
612 inequality and create a SampleTPATFP.csv file containing specific data
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
613 fields columns along with fingerprints vector strings data, type:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
614
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
615 % TopologicalPharmacophoreAtomTripletsFingerprints.pl --DataFieldsMode
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
616 Specify --DataFields Mol_ID -r SampleTPATFP -o Sample.sdf
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
617
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
618 To generate topological pharmacophore atom triplets fingerprints of
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
619 arbitrary size corresponding to 5 distance bins spanning distances from
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
620 1 through 10 using default atoms with distances satisfying triangle
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
621 inequality and create a SampleTPATFP.csv file containing common data
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
622 fields columns along with fingerprints vector strings data, type:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
623
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
624 % TopologicalPharmacophoreAtomTripletsFingerprints.pl --DataFieldsMode
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
625 Common -r SampleTPATFP -o Sample.sdf
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
626
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
627 To generate topological pharmacophore atom triplets fingerprints of
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
628 arbitrary size corresponding to 5 distance bins spanning distances from
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
629 1 through 10 using default atoms with distances satisfying triangle
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
630 inequality and create SampleTPATFP.sdf, SampleTPATFP.fpf and
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
631 SampleTPATFP.csv files containing all data fields columns in CSV file
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
632 along with fingerprints data, type:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
633
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
634 % TopologicalPharmacophoreAtomTripletsFingerprints.pl --DataFieldsMode
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
635 All --output all -r SampleTPATFP -o Sample.sdf
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
636
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
637 AUTHOR
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
638 Manish Sud <msud@san.rr.com>
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
639
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
640 SEE ALSO
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
641 InfoFingerprintsFiles.pl, SimilarityMatricesFingerprints.pl,
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
642 AtomNeighborhoodsFingerprints.pl, ExtendedConnectivityFingerprints.pl,
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
643 MACCSKeysFingerprints.pl, PathLengthFingerprints.pl,
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
644 TopologicalAtomPairsFingerprints.pl,
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
645 TopologicalAtomTorsionsFingerprints.pl,
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
646 TopologicalPharmacophoreAtomPairsFingerprints.pl
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
647
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
648 COPYRIGHT
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
649 Copyright (C) 2015 Manish Sud. All rights reserved.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
650
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
651 This file is part of MayaChemTools.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
652
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
653 MayaChemTools is free software; you can redistribute it and/or modify it
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
654 under the terms of the GNU Lesser General Public License as published by
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
655 the Free Software Foundation; either version 3 of the License, or (at
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
656 your option) any later version.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
657