annotate docs/scripts/txt/TopologicalAtomPairsFingerprints.txt @ 0:4816e4a8ae95 draft default tip

Uploaded
author deepakjadmin
date Wed, 20 Jan 2016 09:23:18 -0500
parents
children
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
0
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
1 NAME
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
2 TopologicalAtomPairsFingerprints.pl - Generate topological atom pairs
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
3 fingerprints for SD files
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
4
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
5 SYNOPSIS
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
6 TopologicalAtomPairsFingerprints.pl SDFile(s)...
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
7
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
8 TopologicalAtomPairsFingerprints.pl [--AromaticityModel
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
9 *AromaticityModelType*] [-a, --AtomIdentifierType
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
10 *AtomicInvariantsAtomTypes*] [--AtomicInvariantsToUse
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
11 *"AtomicInvariant,AtomicInvariant..."*] [--FunctionalClassesToUse
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
12 *"FunctionalClass1,FunctionalClass2..."*] [--CompoundID *DataFieldName
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
13 or LabelPrefixString*] [--CompoundIDLabel *text*] [--CompoundIDMode]
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
14 [--DataFields *"FieldLabel1,FieldLabel2,..."*] [-d, --DataFieldsMode
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
15 *All | Common | Specify | CompoundID*] [-f, --Filter *Yes | No*]
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
16 [--FingerprintsLabel *text*] [-h, --help] [-k, --KeepLargestComponent
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
17 *Yes | No*] [--MinDistance *number*] [--MaxDistance *number*]
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
18 [--OutDelim *comma | tab | semicolon*] [--output *SD | FP | text | all*]
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
19 [-o, --overwrite] [-q, --quote *Yes | No*] [-r, --root *RootName*] [-v,
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
20 --VectorStringFormat *ValuesString, IDsAndValuesString |
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
21 IDsAndValuesPairsString | ValuesAndIDsString | ValuesAndIDsPairsString*]
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
22 [-w, --WorkingDir dirname] SDFile(s)...
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
23
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
24 DESCRIPTION
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
25 Generate topological atom pairs fingerprints [ Ref 57, Ref 59, Ref 72 ]
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
26 for *SDFile(s)* and create appropriate SD, FP or CSV/TSV text file(s)
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
27 containing fingerprints vector strings corresponding to molecular
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
28 fingerprints.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
29
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
30 Multiple SDFile names are separated by spaces. The valid file extensions
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
31 are *.sdf* and *.sd*. All other file names are ignored. All the SD files
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
32 in a current directory can be specified either by **.sdf* or the current
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
33 directory name.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
34
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
35 The current release of MayaChemTools supports generation of topological
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
36 atom pairs corresponding to following -a, --AtomIdentifierTypes:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
37
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
38 AtomicInvariantsAtomTypes, DREIDINGAtomTypes, EStateAtomTypes,
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
39 FunctionalClassAtomTypes, MMFF94AtomTypes, SLogPAtomTypes,
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
40 SYBYLAtomTypes, TPSAAtomTypes, UFFAtomTypes
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
41
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
42 Based on the values specified for -a, --AtomIdentifierType and
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
43 --AtomicInvariantsToUse, initial atom types are assigned to all
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
44 non-hydrogen atoms in a molecule. Using the distance matrix for the
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
45 molecule and initial atom types assigned to non-hydrogen atoms, all
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
46 unique atom pairs within --MinDistance and --MaxDistance are identified
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
47 and counted. An atom pair identifier is generated for each unique atom
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
48 pair; the format of the atom pair identifier is:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
49
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
50 <AtomType1>-D<n>-<AtomType2>
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
51
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
52 AtomType1, AtomType2: Atom types assigned to atom1 and atom2
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
53 D: Distance between atom1 and atom2
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
54
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
55 where AtomType1 <= AtomType2
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
56
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
57 The atom pair identifiers for all unique atom pairs corresponding to
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
58 non-hydrogen atoms constitute topological atom pairs fingerprints of the
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
59 molecule.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
60
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
61 Example of *SD* file containing topological atom pairs fingerprints
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
62 string data:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
63
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
64 ... ...
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
65 ... ...
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
66 $$$$
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
67 ... ...
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
68 ... ...
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
69 ... ...
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
70 41 44 0 0 0 0 0 0 0 0999 V2000
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
71 -3.3652 1.4499 0.0000 C 0 0 0 0 0 0 0 0 0 0 0 0
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
72 ... ...
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
73 2 3 1 0 0 0 0
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
74 ... ...
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
75 M END
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
76 > <CmpdID>
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
77 Cmpd1
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
78
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
79 > <TopologicalAtomPairsFingerprints>
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
80 FingerprintsVector;TopologicalAtomPairs:AtomicInvariantsAtomTypes:MinDi
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
81 stance1:MaxDistance10;223;NumericalValues;IDsAndValuesString;C.X1.BO1.H
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
82 3-D1-C.X3.BO3.H1 C.X2.BO2.H2-D1-C.X2.BO2.H2 C.X2.BO2.H2-D1-C.X3.BO3.H1
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
83 C.X2.BO2.H2-D1-C.X3.BO4 C.X2.BO2.H2-D1-N.X3.BO3 C.X2.BO3.H1-D1-C.X2...;
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
84 2 1 4 1 1 10 8 1 2 6 1 2 2 1 2 1 2 2 1 2 1 5 1 10 12 2 2 1 2 1 9 1 3 1
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
85 1 1 2 2 1 3 6 1 6 14 2 2 2 3 1 3 1 8 2 2 1 3 2 6 1 2 2 5 1 3 1 23 1 ...
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
86
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
87 $$$$
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
88 ... ...
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
89 ... ...
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
90
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
91 Example of *FP* file containing topological atom pairs fingerprints
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
92 string data:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
93
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
94 #
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
95 # Package = MayaChemTools 7.4
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
96 # Release Date = Oct 21, 2010
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
97 #
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
98 # TimeStamp = Fri Mar 11 15:04:36 2011
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
99 #
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
100 # FingerprintsStringType = FingerprintsVector
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
101 #
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
102 # Description = TopologicalAtomPairs:AtomicInvariantsAtomTypes:MinDi...
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
103 # VectorStringFormat = IDsAndValuesString
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
104 # VectorValuesType = NumericalValues
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
105 #
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
106 Cmpd1 223;C.X1.BO1.H3-D1-C.X3.BO3.H1 C.X2.BO2.H2-D1-C.X2.BO2.H2...;1 1...
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
107 Cmpd2 128;C.X1.BO1.H3-D1-C.X2.BO2.H2 C.X1.BO1.H3-D1-C.X3.BO4...;1 1...
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
108 ... ...
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
109 ... ..
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
110
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
111 Example of CSV *Text* file containing topological atom pairs
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
112 fingerprints string data:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
113
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
114 "CompoundID","TopologicalAtomPairsFingerprints"
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
115 "Cmpd1","FingerprintsVector;TopologicalAtomPairs:AtomicInvariantsAtomTy
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
116 pes:MinDistance1:MaxDistance10;223;NumericalValues;IDsAndValuesString;C
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
117 .X1.BO1.H3-D1-C.X3.BO3.H1 C.X2.BO2.H2-D1-C.X2.BO2.H2 C.X2.BO2.H2-D1-C.X
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
118 3.BO3.H1C.X2.BO2.H2-D1-C.X3.BO4 C.X2.BO2.H2-D1-N.X3.BO3 C.X2.BO3.H1...;
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
119 2 1 4 1 1 10 8 1 2 6 1 2 2 1 2 1 2 2 1 2 1 5 1 10 12 2 2 1 2 1 9 1 3 1
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
120 1 1 2 2 1 3 6 1 6 14 2 2 2 3 1 3 1 8 2 2 1 3 2 6 1 2 2 5 1 3 1 23 1 ...
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
121 ... ...
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
122 ... ...
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
123
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
124 The current release of MayaChemTools generates the following types of
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
125 topological atom pairs fingerprints vector strings:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
126
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
127 FingerprintsVector;TopologicalAtomPairs:AtomicInvariantsAtomTypes:MinD
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
128 istance1:MaxDistance10;223;NumericalValues;IDsAndValuesString;C.X1.BO1
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
129 .H3-D1-C.X3.BO3.H1 C.X2.BO2.H2-D1-C.X2.BO2.H2 C.X2.BO2.H2-D1-C.X3.BO3.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
130 H1 C.X2.BO2.H2-D1-C.X3.BO4 C.X2.BO2.H2-D1-N.X3.BO3 C.X2.BO3.H1-D1-...;
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
131 2 1 4 1 1 10 8 1 2 6 1 2 2 1 2 1 2 2 1 2 1 5 1 10 12 2 2 1 2 1 9 1 3 1
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
132 1 1 2 2 1 3 6 1 6 14 2 2 2 3 1 3 1 8 2 2 1 3 2 6 1 2 2 5 1 3 1 23 1...
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
133
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
134 FingerprintsVector;TopologicalAtomPairs:AtomicInvariantsAtomTypes:MinD
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
135 istance1:MaxDistance10;223;NumericalValues;IDsAndValuesPairsString;C.X
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
136 1.BO1.H3-D1-C.X3.BO3.H1 2 C.X2.BO2.H2-D1-C.X2.BO2.H2 1 C.X2.BO2.H2-D1-
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
137 C.X3.BO3.H1 4 C.X2.BO2.H2-D1-C.X3.BO4 1 C.X2.BO2.H2-D1-N.X3.BO3 1 C.X2
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
138 .BO3.H1-D1-C.X2.BO3.H1 10 C.X2.BO3.H1-D1-C.X3.BO4 8 C.X3.BO3.H1-D1-C.X
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
139 3.BO4 1 C.X3.BO3.H1-D1-O.X1.BO1.H1 2 C.X3.BO4-D1-C.X3.BO4 6 C.X3.BO...
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
140
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
141 FingerprintsVector;TopologicalAtomPairs:DREIDINGAtomTypes:MinDistance1
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
142 :MaxDistance10;157;NumericalValues;IDsAndValuesString;C_2-D1-C_3 C_2-D
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
143 1-C_R C_2-D1-N_3 C_2-D1-O_2 C_2-D1-O_3 C_3-D1-C_3 C_3-D1-C_R C_3-D1-N_
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
144 R C_3-D1-O_3 C_R-D1-C_R C_R-D1-F_ C_R-D1-N_3 C_R-D1-N_R C_2-D2-C_3 C_2
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
145 1 1 1 2 1 7 1 1 2 23 1 1 2 1 3 5 5 2 1 5 28 2 3 3 1 1 1 2 4 1 1 4 9 3
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
146 1 4 24 2 4 3 3 4 5 5 14 1 1 2 3 22 1 3 4 4 1 1 1 1 2 2 5 1 4 21 3 1...
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
147
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
148 FingerprintsVector;TopologicalAtomPairs:EStateAtomTypes:MinDistance1:M
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
149 axDistance10;251;NumericalValues;IDsAndValuesString;aaCH-D1-aaCH aaCH-
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
150 D1-aasC aasC-D1-aasC aasC-D1-aasN aasC-D1-dssC aasC-D1-sF aasC-D1-ssNH
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
151 aasC-D1-sssCH aasN-D1-ssCH2 dO-D1-dssC dssC-D1-sOH dssC-D1-ssCH2 d...;
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
152 10 8 5 2 1 1 1 1 1 2 1 1 1 2 2 1 4 10 12 2 2 6 3 1 3 2 2 1 1 1 1 1 1 1
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
153 1 1 5 2 1 1 6 12 2 2 2 2 6 1 3 2 2 5 2 2 1 2 1 1 1 1 1 1 3 1 3 19 2...
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
154
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
155 FingerprintsVector;TopologicalAtomPairs:FunctionalClassAtomTypes:MinDi
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
156 stance1:MaxDistance10;144;NumericalValues;IDsAndValuesString;Ar-D1-Ar
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
157 Ar-D1-Ar.HBA Ar-D1-HBD Ar-D1-Hal Ar-D1-None Ar.HBA-D1-None HBA-D1-NI H
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
158 BA-D1-None HBA.HBD-D1-NI HBA.HBD-D1-None HBD-D1-None NI-D1-None No...;
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
159 23 2 1 1 2 1 1 1 1 2 1 1 7 28 3 1 3 2 8 2 1 1 1 5 1 5 24 3 3 4 2 13 4
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
160 1 1 4 1 5 22 4 4 3 1 19 1 1 1 1 1 2 2 3 1 1 8 25 4 5 2 3 1 26 1 4 1 ...
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
161
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
162 FingerprintsVector;TopologicalAtomPairs:MMFF94AtomTypes:MinDistance1:M
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
163 axDistance10;227;NumericalValues;IDsAndValuesPairsString;C5A-D1-C5B 2
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
164 C5A-D1-CB 1 C5A-D1-CR 1 C5A-D1-N5 2 C5B-D1-C5B 1 C5B-D1-C=ON 1 C5B-D1-
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
165 CB 1 C=ON-D1-NC=O 1 C=ON-D1-O=CN 1 CB-D1-CB 18 CB-D1-F 1 CB-D1-NC=O 1
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
166 COO-D1-CR 1 COO-D1-O=CO 1 COO-D1-OC=O 1 CR-D1-CR 7 CR-D1-N5 1 CR-D1-OR
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
167 2 C5A-D2-C5A 1 C5A-D2-C5B 2 C5A-D2-C=ON 1 C5A-D2-CB 3 C5A-D2-CR 4 ...
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
168
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
169 FingerprintsVector;TopologicalAtomPairs:SLogPAtomTypes:MinDistance1:Ma
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
170 xDistance10;329;NumericalValues;IDsAndValuesPairsString;C1-D1-C10 1 C1
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
171 -D1-C11 2 C1-D1-C5 1 C1-D1-CS 4 C10-D1-N11 1 C11-D1-C21 1 C14-D1-C18 2
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
172 C14-D1-F 1 C18-D1-C18 10 C18-D1-C20 4 C18-D1-C22 2 C20-D1-C20 3 C20-D
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
173 1-C21 1 C20-D1-N11 1 C21-D1-C21 1 C21-D1-C5 1 C21-D1-N11 1 C22-D1-N4 1
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
174 C5-D1-N4 1 C5-D1-O10 1 C5-D1-O2 1 C5-D1-O9 1 CS-D1-O2 2 C1-D2-C1 3...
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
175
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
176 FingerprintsVector;TopologicalAtomPairs:SYBYLAtomTypes:MinDistance1:Ma
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
177 xDistance10;159;NumericalValues;IDsAndValuesPairsString;C.2-D1-C.3 1 C
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
178 .2-D1-C.ar 1 C.2-D1-N.am 1 C.2-D1-O.2 1 C.2-D1-O.co2 2 C.3-D1-C.3 7 C.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
179 3-D1-C.ar 1 C.3-D1-N.ar 1 C.3-D1-O.3 2 C.ar-D1-C.ar 23 C.ar-D1-F 1 C.a
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
180 r-D1-N.am 1 C.ar-D1-N.ar 2 C.2-D2-C.3 1 C.2-D2-C.ar 3 C.3-D2-C.3 5 C.3
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
181 -D2-C.ar 5 C.3-D2-N.ar 2 C.3-D2-O.3 4 C.3-D2-O.co2 2 C.ar-D2-C.ar 2...
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
182
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
183 FingerprintsVector;TopologicalAtomPairs:TPSAAtomTypes:MinDistance1:Max
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
184 Distance10;64;NumericalValues;IDsAndValuesPairsString;N21-D1-None 3 N7
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
185 -D1-None 2 None-D1-None 34 None-D1-O3 2 None-D1-O4 3 N21-D2-None 5 N7-
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
186 D2-None 3 N7-D2-O3 1 None-D2-None 44 None-D2-O3 2 None-D2-O4 5 O3-D2-O
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
187 4 1 N21-D3-None 7 N7-D3-None 4 None-D3-None 45 None-D3-O3 4 None-D3-O4
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
188 5 N21-D4-N7 1 N21-D4-None 5 N21-D4-O3 1 N21-D4-O4 1 N7-D4-None 4 N...
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
189
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
190 FingerprintsVector;TopologicalAtomPairs:UFFAtomTypes:MinDistance1:MaxD
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
191 istance10;157;NumericalValues;IDsAndValuesPairsString;C_2-D1-C_3 1 C_2
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
192 -D1-C_R 1 C_2-D1-N_3 1 C_2-D1-O_2 2 C_2-D1-O_3 1 C_3-D1-C_3 7 C_3-D1-C
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
193 _R 1 C_3-D1-N_R 1 C_3-D1-O_3 2 C_R-D1-C_R 23 C_R-D1-F_ 1 C_R-D1-N_3 1
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
194 C_R-D1-N_R 2 C_2-D2-C_3 1 C_2-D2-C_R 3 C_3-D2-C_3 5 C_3-D2-C_R 5 C_3-D
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
195 2-N_R 2 C_3-D2-O_2 1 C_3-D2-O_3 5 C_R-D2-C_R 28 C_R-D2-F_ 2 C_R-D2-...
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
196
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
197 OPTIONS
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
198 --AromaticityModel *MDLAromaticityModel | TriposAromaticityModel |
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
199 MMFFAromaticityModel | ChemAxonBasicAromaticityModel |
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
200 ChemAxonGeneralAromaticityModel | DaylightAromaticityModel |
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
201 MayaChemToolsAromaticityModel*
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
202 Specify aromaticity model to use during detection of aromaticity.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
203 Possible values in the current release are: *MDLAromaticityModel,
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
204 TriposAromaticityModel, MMFFAromaticityModel,
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
205 ChemAxonBasicAromaticityModel, ChemAxonGeneralAromaticityModel,
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
206 DaylightAromaticityModel or MayaChemToolsAromaticityModel*. Default
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
207 value: *MayaChemToolsAromaticityModel*.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
208
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
209 The supported aromaticity model names along with model specific
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
210 control parameters are defined in AromaticityModelsData.csv, which
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
211 is distributed with the current release and is available under
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
212 lib/data directory. Molecule.pm module retrieves data from this file
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
213 during class instantiation and makes it available to method
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
214 DetectAromaticity for detecting aromaticity corresponding to a
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
215 specific model.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
216
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
217 -a, --AtomIdentifierType *AtomicInvariantsAtomTypes | DREIDINGAtomTypes
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
218 | EStateAtomTypes | FunctionalClassAtomTypes | MMFF94AtomTypes |
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
219 SLogPAtomTypes | SYBYLAtomTypes | TPSAAtomTypes | UFFAtomTypes*
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
220 Specify atom identifier type to use for assignment of initial atom
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
221 identifier to non-hydrogen atoms during calculation of topological
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
222 atom pairs fingerprints. Possible values in the current release are:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
223 *AtomicInvariantsAtomTypes, DREIDINGAtomTypes, EStateAtomTypes,
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
224 FunctionalClassAtomTypes, MMFF94AtomTypes, SLogPAtomTypes,
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
225 SYBYLAtomTypes, TPSAAtomTypes, UFFAtomTypes*. Default value:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
226 *AtomicInvariantsAtomTypes*.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
227
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
228 --AtomicInvariantsToUse *"AtomicInvariant,AtomicInvariant..."*
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
229 This value is used during *AtomicInvariantsAtomTypes* value of a,
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
230 --AtomIdentifierType option. It's a list of comma separated valid
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
231 atomic invariant atom types.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
232
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
233 Possible values for atomic invariants are: *AS, X, BO, LBO, SB, DB,
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
234 TB, H, Ar, RA, FC, MN, SM*. Default value: *AS,X,BO,H,FC*.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
235
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
236 The atomic invariants abbreviations correspond to:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
237
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
238 AS = Atom symbol corresponding to element symbol
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
239
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
240 X<n> = Number of non-hydrogen atom neighbors or heavy atoms
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
241 BO<n> = Sum of bond orders to non-hydrogen atom neighbors or heavy atoms
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
242 LBO<n> = Largest bond order of non-hydrogen atom neighbors or heavy atoms
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
243 SB<n> = Number of single bonds to non-hydrogen atom neighbors or heavy atoms
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
244 DB<n> = Number of double bonds to non-hydrogen atom neighbors or heavy atoms
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
245 TB<n> = Number of triple bonds to non-hydrogen atom neighbors or heavy atoms
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
246 H<n> = Number of implicit and explicit hydrogens for atom
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
247 Ar = Aromatic annotation indicating whether atom is aromatic
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
248 RA = Ring atom annotation indicating whether atom is a ring
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
249 FC<+n/-n> = Formal charge assigned to atom
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
250 MN<n> = Mass number indicating isotope other than most abundant isotope
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
251 SM<n> = Spin multiplicity of atom. Possible values: 1 (singlet), 2 (doublet) or
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
252 3 (triplet)
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
253
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
254 Atom type generated by AtomTypes::AtomicInvariantsAtomTypes class
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
255 corresponds to:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
256
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
257 AS.X<n>.BO<n>.LBO<n>.<SB><n>.<DB><n>.<TB><n>.H<n>.Ar.RA.FC<+n/-n>.MN<n>.SM<n>
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
258
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
259 Except for AS which is a required atomic invariant in atom types,
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
260 all other atomic invariants are optional. Atom type specification
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
261 doesn't include atomic invariants with zero or undefined values.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
262
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
263 In addition to usage of abbreviations for specifying atomic
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
264 invariants, the following descriptive words are also allowed:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
265
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
266 X : NumOfNonHydrogenAtomNeighbors or NumOfHeavyAtomNeighbors
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
267 BO : SumOfBondOrdersToNonHydrogenAtoms or SumOfBondOrdersToHeavyAtoms
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
268 LBO : LargestBondOrderToNonHydrogenAtoms or LargestBondOrderToHeavyAtoms
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
269 SB : NumOfSingleBondsToNonHydrogenAtoms or NumOfSingleBondsToHeavyAtoms
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
270 DB : NumOfDoubleBondsToNonHydrogenAtoms or NumOfDoubleBondsToHeavyAtoms
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
271 TB : NumOfTripleBondsToNonHydrogenAtoms or NumOfTripleBondsToHeavyAtoms
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
272 H : NumOfImplicitAndExplicitHydrogens
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
273 Ar : Aromatic
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
274 RA : RingAtom
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
275 FC : FormalCharge
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
276 MN : MassNumber
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
277 SM : SpinMultiplicity
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
278
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
279 *AtomTypes::AtomicInvariantsAtomTypes* module is used to assign
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
280 atomic invariant atom types.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
281
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
282 --FunctionalClassesToUse *"FunctionalClass1,FunctionalClass2..."*
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
283 This value is used during *FunctionalClassAtomTypes* value of a,
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
284 --AtomIdentifierType option. It's a list of comma separated valid
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
285 functional classes.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
286
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
287 Possible values for atom functional classes are: *Ar, CA, H, HBA,
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
288 HBD, Hal, NI, PI, RA*. Default value [ Ref 24 ]:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
289 *HBD,HBA,PI,NI,Ar,Hal*.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
290
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
291 The functional class abbreviations correspond to:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
292
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
293 HBD: HydrogenBondDonor
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
294 HBA: HydrogenBondAcceptor
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
295 PI : PositivelyIonizable
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
296 NI : NegativelyIonizable
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
297 Ar : Aromatic
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
298 Hal : Halogen
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
299 H : Hydrophobic
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
300 RA : RingAtom
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
301 CA : ChainAtom
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
302
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
303 Functional class atom type specification for an atom corresponds to:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
304
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
305 Ar.CA.H.HBA.HBD.Hal.NI.PI.RA
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
306
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
307 *AtomTypes::FunctionalClassAtomTypes* module is used to assign
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
308 functional class atom types. It uses following definitions [ Ref
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
309 60-61, Ref 65-66 ]:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
310
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
311 HydrogenBondDonor: NH, NH2, OH
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
312 HydrogenBondAcceptor: N[!H], O
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
313 PositivelyIonizable: +, NH2
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
314 NegativelyIonizable: -, C(=O)OH, S(=O)OH, P(=O)OH
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
315
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
316 --CompoundID *DataFieldName or LabelPrefixString*
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
317 This value is --CompoundIDMode specific and indicates how compound
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
318 ID is generated.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
319
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
320 For *DataField* value of --CompoundIDMode option, it corresponds to
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
321 datafield label name whose value is used as compound ID; otherwise,
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
322 it's a prefix string used for generating compound IDs like
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
323 LabelPrefixString<Number>. Default value, *Cmpd*, generates compound
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
324 IDs which look like Cmpd<Number>.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
325
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
326 Examples for *DataField* value of --CompoundIDMode:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
327
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
328 MolID
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
329 ExtReg
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
330
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
331 Examples for *LabelPrefix* or *MolNameOrLabelPrefix* value of
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
332 --CompoundIDMode:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
333
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
334 Compound
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
335
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
336 The value specified above generates compound IDs which correspond to
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
337 Compound<Number> instead of default value of Cmpd<Number>.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
338
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
339 --CompoundIDLabel *text*
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
340 Specify compound ID column label for CSV/TSV text file(s) used
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
341 during *CompoundID* value of --DataFieldsMode option. Default value:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
342 *CompoundID*.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
343
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
344 --CompoundIDMode *DataField | MolName | LabelPrefix |
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
345 MolNameOrLabelPrefix*
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
346 Specify how to generate compound IDs and write to FP or CSV/TSV text
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
347 file(s) along with generated fingerprints for *FP | text | all*
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
348 values of --output option: use a *SDFile(s)* datafield value; use
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
349 molname line from *SDFile(s)*; generate a sequential ID with
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
350 specific prefix; use combination of both MolName and LabelPrefix
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
351 with usage of LabelPrefix values for empty molname lines.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
352
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
353 Possible values: *DataField | MolName | LabelPrefix |
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
354 MolNameOrLabelPrefix*. Default value: *LabelPrefix*.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
355
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
356 For *MolNameAndLabelPrefix* value of --CompoundIDMode, molname line
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
357 in *SDFile(s)* takes precedence over sequential compound IDs
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
358 generated using *LabelPrefix* and only empty molname values are
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
359 replaced with sequential compound IDs.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
360
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
361 This is only used for *CompoundID* value of --DataFieldsMode option.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
362
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
363 --DataFields *"FieldLabel1,FieldLabel2,..."*
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
364 Comma delimited list of *SDFiles(s)* data fields to extract and
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
365 write to CSV/TSV text file(s) along with generated fingerprints for
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
366 *text | both* values of --output option.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
367
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
368 This is only used for *Specify* value of --DataFieldsMode option.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
369
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
370 Examples:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
371
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
372 Extreg
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
373 MolID,CompoundName
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
374
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
375 -d, --DataFieldsMode *All | Common | Specify | CompoundID*
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
376 Specify how data fields in *SDFile(s)* are transferred to output
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
377 CSV/TSV text file(s) along with generated fingerprints for *text |
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
378 both* values of --output option: transfer all SD data field;
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
379 transfer SD data files common to all compounds; extract specified
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
380 data fields; generate a compound ID using molname line, a compound
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
381 prefix, or a combination of both. Possible values: *All | Common |
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
382 specify | CompoundID*. Default value: *CompoundID*.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
383
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
384 -f, --Filter *Yes | No*
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
385 Specify whether to check and filter compound data in SDFile(s).
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
386 Possible values: *Yes or No*. Default value: *Yes*.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
387
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
388 By default, compound data is checked before calculating fingerprints
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
389 and compounds containing atom data corresponding to non-element
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
390 symbols or no atom data are ignored.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
391
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
392 --FingerprintsLabel *text*
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
393 SD data label or text file column label to use for fingerprints
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
394 string in output SD or CSV/TSV text file(s) specified by --output.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
395 Default value: *TopologicalAtomPairsFingerprints*.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
396
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
397 -h, --help
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
398 Print this help message.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
399
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
400 -k, --KeepLargestComponent *Yes | No*
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
401 Generate fingerprints for only the largest component in molecule.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
402 Possible values: *Yes or No*. Default value: *Yes*.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
403
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
404 For molecules containing multiple connected components, fingerprints
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
405 can be generated in two different ways: use all connected components
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
406 or just the largest connected component. By default, all atoms
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
407 except for the largest connected component are deleted before
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
408 generation of fingerprints.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
409
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
410 --MinDistance *number*
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
411 Minimum bond distance between atom pairs for generating topological
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
412 atom pairs. Default value: *1*. Valid values: positive integers and
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
413 less than --MaxDistance.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
414
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
415 --MaxDistance *number*
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
416 Maximum bond distance between atom pairs for generating topological
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
417 atom pairs. Default value: *10*. Valid values: positive integers and
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
418 greater than --MinDistance.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
419
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
420 --OutDelim *comma | tab | semicolon*
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
421 Delimiter for output CSV/TSV text file(s). Possible values: *comma,
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
422 tab, or semicolon* Default value: *comma*
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
423
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
424 --output *SD | FP | text | all*
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
425 Type of output files to generate. Possible values: *SD, FP, text, or
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
426 all*. Default value: *text*.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
427
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
428 -o, --overwrite
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
429 Overwrite existing files.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
430
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
431 -q, --quote *Yes | No*
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
432 Put quote around column values in output CSV/TSV text file(s).
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
433 Possible values: *Yes or No*. Default value: *Yes*.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
434
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
435 -r, --root *RootName*
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
436 New file name is generated using the root: <Root>.<Ext>. Default for
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
437 new file names: <SDFileName><TopologicalAtomPairsFP>.<Ext>. The file
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
438 type determines <Ext> value. The sdf, fpf, csv, and tsv <Ext> values
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
439 are used for SD, FP, comma/semicolon, and tab delimited text files,
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
440 respectively.This option is ignored for multiple input files.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
441
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
442 -v, --VectorStringFormat *IDsAndValuesString | IDsAndValuesPairsString |
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
443 ValuesAndIDsString | ValuesAndIDsPairsString*
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
444 Format of fingerprints vector string data in output SD, FP or
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
445 CSV/TSV text file(s) specified by --output option. Possible values:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
446 *IDsAndValuesString | IDsAndValuesPairsString | ValuesAndIDsString |
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
447 ValuesAndIDsPairsString*. Default value: *IDsAndValuesString*.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
448
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
449 Examples:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
450
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
451 FingerprintsVector;TopologicalAtomPairs:AtomicInvariantsAtomTypes:MinD
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
452 istance1:MaxDistance10;223;NumericalValues;IDsAndValuesString;C.X1.BO1
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
453 .H3-D1-C.X3.BO3.H1 C.X2.BO2.H2-D1-C.X2.BO2.H2 C.X2.BO2.H2-D1-C.X3.BO3.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
454 H1 C.X2.BO2.H2-D1-C.X3.BO4 C.X2.BO2.H2-D1-N.X3.BO3 C.X2.BO3.H1-D1-...;
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
455 2 1 4 1 1 10 8 1 2 6 1 2 2 1 2 1 2 2 1 2 1 5 1 10 12 2 2 1 2 1 9 1 3 1
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
456 1 1 2 2 1 3 6 1 6 14 2 2 2 3 1 3 1 8 2 2 1 3 2 6 1 2 2 5 1 3 1 23 1...
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
457
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
458 FingerprintsVector;TopologicalAtomPairs:AtomicInvariantsAtomTypes:MinD
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
459 istance1:MaxDistance10;223;NumericalValues;IDsAndValuesPairsString;C.X
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
460 1.BO1.H3-D1-C.X3.BO3.H1 2 C.X2.BO2.H2-D1-C.X2.BO2.H2 1 C.X2.BO2.H2-D1-
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
461 C.X3.BO3.H1 4 C.X2.BO2.H2-D1-C.X3.BO4 1 C.X2.BO2.H2-D1-N.X3.BO3 1 C.X2
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
462 .BO3.H1-D1-C.X2.BO3.H1 10 C.X2.BO3.H1-D1-C.X3.BO4 8 C.X3.BO3.H1-D1-C.X
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
463 3.BO4 1 C.X3.BO3.H1-D1-O.X1.BO1.H1 2 C.X3.BO4-D1-C.X3.BO4 6 C.X3.BO...
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
464
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
465 -w, --WorkingDir *DirName*
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
466 Location of working directory. Default value: current directory.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
467
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
468 EXAMPLES
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
469 To generate topological atom pairs fingerprints corresponding to bond
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
470 distances from 1 through 10 using atomic invariants atom types in
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
471 IDsAndValuesString format and create a SampleTAPFP.csv file containing
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
472 sequential compound IDs along with fingerprints vector strings data,
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
473 type:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
474
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
475 % TopologicalAtomPairsFingerprints.pl -r SampleTAPFP -o Sample.sdf
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
476
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
477 To generate topological atom pairs fingerprints corresponding to bond
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
478 distances from 1 through 10 using atomic invariants atom types in
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
479 IDsAndValuesString format and create SampleTAPFP.sdf, SampleTAPFP.fpf
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
480 and SampleTAPFP.csv files containing sequential compound IDs in CSV file
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
481 along with fingerprints vector strings data, type:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
482
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
483 % TopologicalAtomPairsFingerprints.pl --output all -r SampleTAPFP
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
484 -o Sample.sdf
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
485
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
486 To generate topological atom pairs fingerprints corresponding to bond
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
487 distances from 1 through 10 using DREIDING atom types in
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
488 IDsAndValuesString format and create a SampleTAPFP.csv file containing
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
489 sequential compound IDs along with fingerprints vector strings data,
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
490 type:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
491
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
492 % TopologicalAtomPairsFingerprints.pl -a DREIDINGAtomTypes
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
493 -r SampleTAPFP -o Sample.sdf
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
494
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
495 To generate topological atom pairs fingerprints corresponding to bond
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
496 distances from 1 through 10 using E-state types in IDsAndValuesString
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
497 format and create a SampleTAPFP.csv file containing sequential compound
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
498 IDs along with fingerprints vector strings data, type:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
499
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
500 % TopologicalAtomPairsFingerprints.pl -a EStateAtomTypes
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
501 -r SampleTAPFP -o Sample.sdf
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
502
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
503 To generate topological atom pairs fingerprints corresponding to bond
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
504 distances from 1 through 10 using DREIDING atom types in
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
505 IDsAndValuesString format and create a SampleTAPFP.csv file containing
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
506 sequential compound IDs along with fingerprints vector strings data,
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
507 type:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
508
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
509 % TopologicalAtomPairsFingerprints.pl -a DREIDINGAtomTypes
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
510 -r SampleTAPFP -o Sample.sdf
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
511
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
512 To generate topological atom pairs fingerprints corresponding to bond
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
513 distances from 1 through 10 using functional class atom types in
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
514 IDsAndValuesString format and create a SampleTAPFP.csv file containing
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
515 sequential compound IDs along with fingerprints vector strings data,
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
516 type:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
517
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
518 % TopologicalAtomPairsFingerprints.pl -a FunctionalClassAtomTypes
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
519 -r SampleTAPFP -o Sample.sdf
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
520
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
521 To generate topological atom pairs fingerprints corresponding to bond
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
522 distances from 1 through 10 using MMFF94 atom types in
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
523 IDsAndValuesString format and create a SampleTAPFP.csv file containing
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
524 sequential compound IDs along with fingerprints vector strings data,
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
525 type:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
526
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
527 % TopologicalAtomPairsFingerprints.pl -a MMFF94AtomTypes
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
528 -r SampleTAPFP -o Sample.sdf
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
529
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
530 To generate topological atom pairs fingerprints corresponding to bond
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
531 distances from 1 through 10 using SLogP atom types in IDsAndValuesString
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
532 format and create a SampleTAPFP.csv file containing sequential compound
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
533 IDs along with fingerprints vector strings data, type:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
534
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
535 % TopologicalAtomPairsFingerprints.pl -a SLogPAtomTypes
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
536 -r SampleTAPFP -o Sample.sdf
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
537
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
538 To generate topological atom pairs fingerprints corresponding to bond
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
539 distances from 1 through 10 using SYBYL atom types in IDsAndValuesString
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
540 format and create a SampleTAPFP.csv file containing sequential compound
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
541 IDs along with fingerprints vector strings data, type:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
542
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
543 % TopologicalAtomPairsFingerprints.pl -a SYBYLAtomTypes
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
544 -r SampleTAPFP -o Sample.sdf
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
545
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
546 To generate topological atom pairs fingerprints corresponding to bond
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
547 distances from 1 through 10 using TPSA atom types in IDsAndValuesString
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
548 format and create a SampleTAPFP.csv file containing sequential compound
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
549 IDs along with fingerprints vector strings data, type:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
550
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
551 % TopologicalAtomPairsFingerprints.pl -a TPSAAtomTypes
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
552 -r SampleTAPFP -o Sample.sdf
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
553
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
554 To generate topological atom pairs fingerprints corresponding to bond
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
555 distances from 1 through 10 using UFF atom types in IDsAndValuesString
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
556 format and create a SampleTAPFP.csv file containing sequential compound
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
557 IDs along with fingerprints vector strings data, type:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
558
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
559 % TopologicalAtomPairsFingerprints.pl -a UFFAtomTypes
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
560 -r SampleTAPFP -o Sample.sdf
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
561
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
562 To generate topological atom pairs fingerprints corresponding to bond
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
563 distances from 1 through 10 using atomic invariants atom types in
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
564 IDsAndValuesPairsString format and create a SampleTAPFP.csv file
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
565 containing sequential compound IDs along with fingerprints vector
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
566 strings data, type:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
567
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
568 % TopologicalAtomPairsFingerprints.pl --VectorStringFormat
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
569 IDsAndValuesPairsString -r SampleTAPFP -o Sample.sdf
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
570
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
571 To generate topological atom pairs fingerprints corresponding to bond
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
572 distances from 1 through 6 using atomic invariants atom types in
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
573 IDsAndValuesString format and create a SampleTAPFP.csv file containing
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
574 sequential compound IDs along with fingerprints vector strings data,
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
575 type:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
576
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
577 % TopologicalAtomPairsFingerprints.pl -a AtomicInvariantsAtomTypes
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
578 --MinDistance 1 --MaxDistance 6 -r SampleTAPFP -o Sample.sdf
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
579
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
580 To generate topological atom pairs fingerprints corresponding to bond
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
581 distances from 1 through 10 using only AS,X atomic invariants atom types
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
582 in IDsAndValuesString format and create a SampleTAPFP.csv file
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
583 containing sequential compound IDs along with fingerprints vector
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
584 strings data, type:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
585
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
586 % TopologicalAtomPairsFingerprints.pl -a AtomicInvariantsAtomTypes
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
587 --AtomicInvariantsToUse "AS,X" --MinDistance 1 --MaxDistance 6
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
588 -r SampleTAPFP -o Sample.sdf
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
589
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
590 To generate topological atom pairs fingerprints corresponding to bond
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
591 distances from 1 through 10 using atomic invariants atom types in
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
592 IDsAndValuesString format and create a SampleTAPFP.csv file containing
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
593 compound ID from molecule name line along with fingerprints vector
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
594 strings data, type:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
595
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
596 % TopologicalAtomPairsFingerprints.pl -a AtomicInvariantsAtomTypes
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
597 --DataFieldsMode CompoundID -CompoundIDMode MolName
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
598 -r SampleTAPFP -o Sample.sdf
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
599
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
600 To generate topological atom pairs fingerprints corresponding to bond
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
601 distances from 1 through 10 using atomic invariants atom types in
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
602 IDsAndValuesString format and create a SampleTAPFP.csv file containing
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
603 compound IDs using specified data field along with fingerprints vector
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
604 strings data, type:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
605
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
606 % TopologicalAtomPairsFingerprints.pl -a AtomicInvariantsAtomTypes
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
607 --DataFieldsMode CompoundID -CompoundIDMode DataField --CompoundID
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
608 Mol_ID -r SampleTAPFP -o Sample.sdf
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
609
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
610 To generate topological atom pairs fingerprints corresponding to bond
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
611 distances from 1 through 10 using atomic invariants atom types in
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
612 IDsAndValuesString format and create a SampleTAPFP.csv file containing
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
613 compound ID using combination of molecule name line and an explicit
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
614 compound prefix along with fingerprints vector strings data, type:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
615
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
616 % TopologicalAtomPairsFingerprints.pl -a AtomicInvariantsAtomTypes
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
617 --DataFieldsMode CompoundID -CompoundIDMode MolnameOrLabelPrefix
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
618 --CompoundID Cmpd --CompoundIDLabel MolID -r SampleTAPFP -o Sample.sdf
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
619
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
620 To generate topological atom pairs fingerprints corresponding to bond
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
621 distances from 1 through 10 using atomic invariants atom types in
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
622 IDsAndValuesString format and create a SampleTAPFP.csv file containing
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
623 specific data fields columns along with fingerprints vector strings
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
624 data, type:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
625
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
626 % TopologicalAtomPairsFingerprints.pl -a AtomicInvariantsAtomTypes
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
627 --DataFieldsMode Specify --DataFields Mol_ID -r SampleTAPFP
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
628 -o Sample.sdf
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
629
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
630 To generate topological atom pairs fingerprints corresponding to bond
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
631 distances from 1 through 10 using atomic invariants atom types in
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
632 IDsAndValuesString format and create a SampleTAPFP.csv file containing
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
633 common data fields columns along with fingerprints vector strings data,
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
634 type:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
635
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
636 % TopologicalAtomPairsFingerprints.pl -a AtomicInvariantsAtomTypes
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
637 --DataFieldsMode Common -r SampleTAPFP -o Sample.sdf
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
638
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
639 To generate topological atom pairs fingerprints corresponding to bond
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
640 distances from 1 through 10 using atomic invariants atom types in
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
641 IDsAndValuesString format and create SampleTAPFP.sdf, SampleTAPFP.fpf
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
642 and SampleTAPFP.csv files containing all data fields columns in CSV file
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
643 along with fingerprints data, type:
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
644
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
645 % TopologicalAtomPairsFingerprints.pl -a AtomicInvariantsAtomTypes
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
646 --DataFieldsMode All --output all -r SampleTAPFP
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
647 -o Sample.sdf
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
648
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
649 AUTHOR
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
650 Manish Sud <msud@san.rr.com>
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
651
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
652 SEE ALSO
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
653 InfoFingerprintsFiles.pl, SimilarityMatricesFingerprints.pl,
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
654 AtomNeighborhoodsFingerprints.pl, ExtendedConnectivityFingerprints.pl,
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
655 MACCSKeysFingerprints.pl, PathLengthFingerprints.pl,
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
656 TopologicalAtomTorsionsFingerprints.pl,
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
657 TopologicalPharmacophoreAtomPairsFingerprints.pl,
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
658 TopologicalPharmacophoreAtomTripletsFingerprints.pl
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
659
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
660 COPYRIGHT
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
661 Copyright (C) 2015 Manish Sud. All rights reserved.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
662
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
663 This file is part of MayaChemTools.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
664
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
665 MayaChemTools is free software; you can redistribute it and/or modify it
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
666 under the terms of the GNU Lesser General Public License as published by
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
667 the Free Software Foundation; either version 3 of the License, or (at
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
668 your option) any later version.
4816e4a8ae95 Uploaded
deepakjadmin
parents:
diff changeset
669