0
|
1 <tool id="chemfp_butina_clustering" name="Taylor-Butina Clustering" version="0.1">
|
6
|
2 <description>of molecular fingerprints</description>
|
0
|
3 <requirements>
|
|
4 <requirement type="package" version="1.1p1">chemfp</requirement>
|
|
5 </requirements>
|
|
6 <command interpreter='python'>
|
6
|
7 butina_clustering.py
|
|
8 -i $infile
|
|
9 -t $threshold
|
|
10 -o $outfile
|
|
11 -p 4
|
0
|
12 </command>
|
|
13 <inputs>
|
|
14 <param name="infile" type="data" format="fps" label="Finperprint dataset" help="Dataset missing? See TIP below"/>
|
6
|
15 <param name='threshold' type='float' value='0.8'/>
|
0
|
16 </inputs>
|
|
17 <outputs>
|
6
|
18 <data format="tabular" name="outfile" label="${tool.name} on ${on_string}"/>
|
0
|
19 </outputs>
|
|
20 <tests>
|
|
21 <test>
|
|
22 <param name="infile" ftype="fps" value="q.fps"/>
|
|
23 <param name='threshold' value='0.8' ></param>
|
6
|
24 <output name="outfile" ftype="tabular" file='Taylor-Butina_Clustering_on_data_q.txt'/>
|
0
|
25 </test>
|
|
26 </tests>
|
|
27 <help>
|
|
28
|
6
|
29 **Note**. You need molecular fingerprints in FPS format. Open Babel Fastsearch index is not supported.
|
0
|
30
|
|
31 **What it does**
|
|
32 Molecule library clustering using the Taylor-Butina algorithm.
|
|
33
|
|
34 -----
|
|
35
|
|
36 **Example**
|
|
37
|
|
38 * input::
|
|
39
|
|
40 - fingerprints in FPS format
|
|
41
|
|
42 #FPS1
|
|
43 #num_bits=881
|
|
44 #type=CACTVS-E_SCREEN/1.0 extended=2
|
|
45 #software=CACTVS/unknown
|
|
46 #source=/home/mohammed/galaxy-central/database/files/000/dataset_423.dat
|
|
47 #date=2012-02-09T13:20:37
|
|
48 07ce04000000000000000000000000000080060000000c000000000000001a800f0000780008100000701487e960cc0bed3248000580644626004101b4844805901b041c2e
|
|
49 19511e45039b8b2926101609401b13e40800000000000100200000040080000010000002000000000000 55169009
|
|
50 07ce04000000000000000000000000000080060000000c000000000000001a800f0000780008100000701087e960cc0bed3248000580644626004101b4844805901b041c2e
|
|
51 19111e45039b8b2926105609401313e40800000000000100200000040080000010000002000000000000 55079807
|
|
52 ........
|
|
53
|
|
54 - Tanimoto threshold : 0.8 (between 0 and 1)
|
|
55
|
|
56 * output::
|
|
57
|
|
58 0 true singletons
|
|
59 =>
|
|
60
|
|
61 0 false singletons
|
|
62 =>
|
|
63
|
|
64 1 clusters
|
|
65 55091849 has 12 other members
|
|
66 => 6499094 6485578 55079807 3153534 55102353 55091466 55091416 6485577 55169009 55091752 55091467 55168823
|
|
67
|
|
68 </help>
|
|
69
|
|
70 </tool>
|