annotate ttest/stats.xml @ 12:fd8529cd1564 default tip

better t-test
author jingchunzhu
date Mon, 28 Sep 2015 12:36:12 -0700
parents cd4c13ae11ce
children
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
12
fd8529cd1564 better t-test
jingchunzhu
parents: 11
diff changeset
1 <tool id="ucscCancerBrowserStats" description="t-tests of difference in genomic data" name="Difference between categories (t-test)" version="0.0.1">
0
12bb38e187b9 Uploaded, initial check-in
melissacline
parents:
diff changeset
2 <command interpreter="python">
12bb38e187b9 Uploaded, initial check-in
melissacline
parents:
diff changeset
3 stats.py $genomicMatrix $clinicalFeatures $outFile -a="${category1}" -b="${category2}"
12bb38e187b9 Uploaded, initial check-in
melissacline
parents:
diff changeset
4 </command>
12bb38e187b9 Uploaded, initial check-in
melissacline
parents:
diff changeset
5 <inputs>
12bb38e187b9 Uploaded, initial check-in
melissacline
parents:
diff changeset
6 <param format="tabular" name="genomicMatrix" type="data" label="Genomic Matrix"/>
11
jingchunzhu
parents: 9
diff changeset
7 <param format="tabular" name="clinicalFeatures" type="data" label="Phenotype Matrix"/>
0
12bb38e187b9 Uploaded, initial check-in
melissacline
parents:
diff changeset
8 <param type="text" name="category1" label="Category 1" optional="false"/>
12bb38e187b9 Uploaded, initial check-in
melissacline
parents:
diff changeset
9 <param type="text" name="category2" label="Category 2" optional="false"/>
12bb38e187b9 Uploaded, initial check-in
melissacline
parents:
diff changeset
10 </inputs>
12bb38e187b9 Uploaded, initial check-in
melissacline
parents:
diff changeset
11 <outputs>
12bb38e187b9 Uploaded, initial check-in
melissacline
parents:
diff changeset
12 <data format="tabular" name="outFile" />
12bb38e187b9 Uploaded, initial check-in
melissacline
parents:
diff changeset
13 </outputs>
12bb38e187b9 Uploaded, initial check-in
melissacline
parents:
diff changeset
14 <requirements>
9
a04e3c59e117 fix bad xml file
jingchunzhu
parents: 0
diff changeset
15 <requirement type="package" version="1.0" >cancerBrowserStats</requirement>
0
12bb38e187b9 Uploaded, initial check-in
melissacline
parents:
diff changeset
16 </requirements>
12bb38e187b9 Uploaded, initial check-in
melissacline
parents:
diff changeset
17 <tests>
11
jingchunzhu
parents: 9
diff changeset
18 <param name="genomicMatrix" value="sample.genomic.matrix.txt" />
jingchunzhu
parents: 9
diff changeset
19 <param name="clinicalMatrix" value="sample.clinical.matrix.txt" />
0
12bb38e187b9 Uploaded, initial check-in
melissacline
parents:
diff changeset
20 <param name="category1" value="A"/>
12bb38e187b9 Uploaded, initial check-in
melissacline
parents:
diff changeset
21 <param name="category2" value="B"/>
12bb38e187b9 Uploaded, initial check-in
melissacline
parents:
diff changeset
22 <output name="outFile" value="sample.stats.output.txt"/>
12bb38e187b9 Uploaded, initial check-in
melissacline
parents:
diff changeset
23 </tests>
12bb38e187b9 Uploaded, initial check-in
melissacline
parents:
diff changeset
24 <help>
12bb38e187b9 Uploaded, initial check-in
melissacline
parents:
diff changeset
25
12
fd8529cd1564 better t-test
jingchunzhu
parents: 11
diff changeset
26 This tool performs t-test on genomic data between two groups of samples, which can be used to identify for example, differentially expressed genes or probes. The genomic data is in the format of UCSC Xena genomic matrix (a tab-deliminated matrix) with rows representing genes or probes and columns representing samples. The phenotype matrix assigns samples into groups. The tool compares two groups of samples, and computes the t-statistics, p value, and delta of medians for each probe/gene between the two groups. The result can be downloaded to programs such as EXCEL for sorting based on the t-statistics.
fd8529cd1564 better t-test
jingchunzhu
parents: 11
diff changeset
27
fd8529cd1564 better t-test
jingchunzhu
parents: 11
diff changeset
28 The user runs this tool with the following steps:
0
12bb38e187b9 Uploaded, initial check-in
melissacline
parents:
diff changeset
29
12
fd8529cd1564 better t-test
jingchunzhu
parents: 11
diff changeset
30 1. Specify a genomic matrix. The expected format is with rows representing genes and columns representing samples, and the first line contains sample names. Matrix can be obtained from UCSC Xena bulk download. See below for an example.
fd8529cd1564 better t-test
jingchunzhu
parents: 11
diff changeset
31
fd8529cd1564 better t-test
jingchunzhu
parents: 11
diff changeset
32
fd8529cd1564 better t-test
jingchunzhu
parents: 11
diff changeset
33 2. Specify a phenotype matrix. Here, rows indicate samples, columns indicate phenotypes or annotations. Matrix can be obtained from UCSC Xena heatmap download. See below for an example.
0
12bb38e187b9 Uploaded, initial check-in
melissacline
parents:
diff changeset
34
12bb38e187b9 Uploaded, initial check-in
melissacline
parents:
diff changeset
35
12
fd8529cd1564 better t-test
jingchunzhu
parents: 11
diff changeset
36 3. Specify the two categorical values that you want to use for defining the two groups. For example, the two groups could be A and B, 0 and 1, etc.
0
12bb38e187b9 Uploaded, initial check-in
melissacline
parents:
diff changeset
37
12bb38e187b9 Uploaded, initial check-in
melissacline
parents:
diff changeset
38
12
fd8529cd1564 better t-test
jingchunzhu
parents: 11
diff changeset
39 4. The output is, for each probe/gene (in each row), the t-statistics, the p-value, the median value for each group, and the difference between the medians. If it cannot calculate these values, it returns a vector of NAs.
fd8529cd1564 better t-test
jingchunzhu
parents: 11
diff changeset
40
0
12bb38e187b9 Uploaded, initial check-in
melissacline
parents:
diff changeset
41
12
fd8529cd1564 better t-test
jingchunzhu
parents: 11
diff changeset
42 **Input genomic matrix**::
0
12bb38e187b9 Uploaded, initial check-in
melissacline
parents:
diff changeset
43
12
fd8529cd1564 better t-test
jingchunzhu
parents: 11
diff changeset
44 Gene s1 s2 s3 s4 s5 s6 s7 s8 s9 s10
0
12bb38e187b9 Uploaded, initial check-in
melissacline
parents:
diff changeset
45 G1 2.0 2.2 3.2 1.1 5.1 8.1 3.2 1.1 8.1 0.2
12bb38e187b9 Uploaded, initial check-in
melissacline
parents:
diff changeset
46 G2 0.1 8.2 9.1 4.2 6.1 4.9 3.9 2.3 1.1 0.2
12bb38e187b9 Uploaded, initial check-in
melissacline
parents:
diff changeset
47
12
fd8529cd1564 better t-test
jingchunzhu
parents: 11
diff changeset
48 **Input phenotyp matrix**::
0
12bb38e187b9 Uploaded, initial check-in
melissacline
parents:
diff changeset
49
12
fd8529cd1564 better t-test
jingchunzhu
parents: 11
diff changeset
50 sample_id Value
fd8529cd1564 better t-test
jingchunzhu
parents: 11
diff changeset
51 s1 A
fd8529cd1564 better t-test
jingchunzhu
parents: 11
diff changeset
52 s2 A
fd8529cd1564 better t-test
jingchunzhu
parents: 11
diff changeset
53 s3 B
fd8529cd1564 better t-test
jingchunzhu
parents: 11
diff changeset
54 s4 C
fd8529cd1564 better t-test
jingchunzhu
parents: 11
diff changeset
55 s5 B
fd8529cd1564 better t-test
jingchunzhu
parents: 11
diff changeset
56 s6 B
fd8529cd1564 better t-test
jingchunzhu
parents: 11
diff changeset
57 s7 A
fd8529cd1564 better t-test
jingchunzhu
parents: 11
diff changeset
58 s8 A
fd8529cd1564 better t-test
jingchunzhu
parents: 11
diff changeset
59 s9 B
fd8529cd1564 better t-test
jingchunzhu
parents: 11
diff changeset
60 s10 A
0
12bb38e187b9 Uploaded, initial check-in
melissacline
parents:
diff changeset
61
12
fd8529cd1564 better t-test
jingchunzhu
parents: 11
diff changeset
62 **Category 1 : A**
0
12bb38e187b9 Uploaded, initial check-in
melissacline
parents:
diff changeset
63
12
fd8529cd1564 better t-test
jingchunzhu
parents: 11
diff changeset
64 **Category 2 : B**
0
12bb38e187b9 Uploaded, initial check-in
melissacline
parents:
diff changeset
65
12
fd8529cd1564 better t-test
jingchunzhu
parents: 11
diff changeset
66 **Output**::
0
12bb38e187b9 Uploaded, initial check-in
melissacline
parents:
diff changeset
67
12bb38e187b9 Uploaded, initial check-in
melissacline
parents:
diff changeset
68 Gene Statistic pValue Median1 Median2 Delta
12bb38e187b9 Uploaded, initial check-in
melissacline
parents:
diff changeset
69 G1 -4.168999 0.004194 2.000000 6.600000 -4.600000
12bb38e187b9 Uploaded, initial check-in
melissacline
parents:
diff changeset
70 G2 -1.198486 0.269724 2.300000 5.500000 -3.200000
12bb38e187b9 Uploaded, initial check-in
melissacline
parents:
diff changeset
71
12bb38e187b9 Uploaded, initial check-in
melissacline
parents:
diff changeset
72
12bb38e187b9 Uploaded, initial check-in
melissacline
parents:
diff changeset
73 </help>
12bb38e187b9 Uploaded, initial check-in
melissacline
parents:
diff changeset
74 </tool>