annotate execute_dwt_IvC_all.xml @ 0:91fad0f30fd3 draft

Imported from capsule None
author devteam
date Thu, 23 Jan 2014 12:31:01 -0500
parents
children 509993d9fdca
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
0
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
1 <tool id="compute_p-values_second_moments_feature_occurrences_between_two_datasets_using_discrete_wavelet_transfom" name="Compute P-values and Second Moments for Feature Occurrences" version="1.0.0">
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
2 <description>between two datasets using Discrete Wavelet Transfoms</description>
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
3
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
4 <command interpreter="perl">
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
5 execute_dwt_IvC_all.pl $inputFile1 $inputFile2 $outputFile1 $outputFile2
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
6 </command>
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
7
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
8 <inputs>
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
9 <param format="tabular" name="inputFile1" type="data" label="Select the first input file"/>
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
10 <param format="tabular" name="inputFile2" type="data" label="Select the second input file"/>
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
11 </inputs>
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
12
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
13 <outputs>
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
14 <data format="tabular" name="outputFile1"/>
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
15 <data format="pdf" name="outputFile2"/>
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
16 </outputs>
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
17
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
18 <help>
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
19
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
20 .. class:: infomark
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
21
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
22 **What it does**
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
23
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
24 This program generates plots and computes table matrix of second moments, p-values, and test orientations at multiple scales for the correlation between the occurrences of features in one dataset and their occurrences in another using multiscale wavelet analysis technique.
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
25
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
26 The program assumes that the user has two sets of DNA sequences, S1 and S1, each of which consists of one or more sequences of equal length. Each sequence in each set is divided into the same number of multiple intervals n such that n = 2^k, where k is a positive integer and k >= 1. Thus, n could be any value of the set {2, 4, 8, 16, 32, 64, 128, ...}. k represents the number of scales.
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
27
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
28 The program has two input files obtained as follows:
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
29
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
30 For a given set of features, say motifs, the user counts the number of occurrences of each feature in each interval of each sequence in S1 and S1, and builds two tabular files representing the count results in each interval of S1 and S1. These are the input files of the program.
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
31
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
32 The program gives two output files:
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
33
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
34 - The first output file is a TABULAR format file representing the second moments, p-values, and test orientations for each feature at each scale.
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
35 - The second output file is a PDF file consisting of as many figures as the number of features, such that each figure represents the values of the second moment for that feature at every scale.
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
36
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
37 -----
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
38
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
39 .. class:: warningmark
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
40
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
41 **Note**
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
42
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
43 In order to obtain empirical p-values, a random perumtation test is implemented by the program, which results in the fact that the program gives slightly different results each time it is run on the same input file.
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
44
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
45 -----
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
46
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
47 **Example**
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
48
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
49 Counting the occurrences of 5 features (motifs) in 16 intervals (one line per interval) of the DNA sequences in S1 gives the following tabular file::
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
50
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
51 deletionHoptspot insertionHoptspot dnaPolPauseFrameshift topoisomeraseCleavageSite translinTarget
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
52 226 403 416 221 1165
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
53 236 444 380 241 1223
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
54 242 496 391 195 1116
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
55 243 429 364 191 1118
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
56 244 410 371 236 1063
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
57 230 386 370 217 1087
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
58 275 404 402 214 1044
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
59 265 443 365 231 1086
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
60 255 390 354 246 1114
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
61 281 384 406 232 1102
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
62 263 459 369 251 1135
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
63 280 433 400 251 1159
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
64 278 385 382 231 1147
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
65 248 393 389 211 1162
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
66 251 403 385 246 1114
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
67 239 383 347 227 1172
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
68
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
69 And counting the occurrences of 5 features (motifs) in 16 intervals (one line per interval) of the DNA sequences in S2 gives the following tabular file::
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
70
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
71 deletionHoptspot insertionHoptspot dnaPolPauseFrameshift topoisomeraseCleavageSite translinTarget
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
72 235 374 407 257 1159
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
73 244 356 353 212 1128
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
74 233 343 322 204 1110
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
75 222 329 398 253 1054
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
76 216 325 328 253 1129
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
77 257 368 352 221 1115
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
78 238 360 346 224 1102
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
79 225 350 377 248 1107
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
80 230 330 365 236 1132
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
81 241 389 357 220 1120
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
82 274 354 392 235 1120
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
83 250 379 354 210 1102
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
84 254 329 320 251 1080
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
85 221 355 406 279 1127
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
86 224 330 390 249 1129
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
87 246 366 364 218 1176
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
88
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
89
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
90 We notice that the number of scales here is 4 because 16 = 2^4. Runnig the program on the above input files gives the following output:
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
91
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
92 The first output file::
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
93
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
94 motif 1_moment2 1_pval 1_test 2_moment2 2_pval 2_test 3_moment2 3_pval 3_test 4_moment2 4_pval 4_test
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
95
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
96 deletionHoptspot 0.8751 0.376 L 1.549 0.168 R 0.6152 0.434 L 0.5735 0.488 R
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
97 insertionHoptspot 0.902 0.396 L 1.172 0.332 R 0.6843 0.456 L 1.728 0.213 R
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
98 dnaPolPauseFrameshift 1.65 0.013 R 0.267 0.055 L 0.1387 0.124 L 0.4516 0.498 L
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
99 topoisomeraseCleavageSite 0.7443 0.233 L 1.023 0.432 R 1.933 0.155 R 1.09 0.3 R
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
100 translinTarget 0.5084 0.057 L 0.8219 0.446 L 3.604 0.019 R 0.4377 0.492 L
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
101
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
102 The second output file:
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
103
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
104 .. image:: ${static_path}/operation_icons/dwt_IvC_1.png
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
105 .. image:: ${static_path}/operation_icons/dwt_IvC_2.png
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
106 .. image:: ${static_path}/operation_icons/dwt_IvC_3.png
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
107 .. image:: ${static_path}/operation_icons/dwt_IvC_4.png
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
108 .. image:: ${static_path}/operation_icons/dwt_IvC_5.png
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
109
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
110 </help>
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
111
91fad0f30fd3 Imported from capsule None
devteam
parents:
diff changeset
112 </tool>