annotate READme.md @ 5:68ecc89b6cbb draft

"planemo upload commit 499df83651e020e9106d70ec658edf86355f49d9"
author galaxyp
date Wed, 21 Oct 2020 16:39:02 +0000
parents 8f595cfe51ed
children b483354e4e24
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
3
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
1 MT2MQ
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
2 ==========================================
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
3
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
4 Description
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
5 -----------
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
6
5
68ecc89b6cbb "planemo upload commit 499df83651e020e9106d70ec658edf86355f49d9"
galaxyp
parents: 4
diff changeset
7 In order to enable multi-omics data analysis of microbiome data, the Galaxy-P team has developed a tool – MT2MQ – which processes metatranscriptomics gene families output from [ASaiM](https://training.galaxyproject.org/training-material/topics/metagenomics/tutorials/metatranscriptomics/tutorial.html) workflow and converts it to [Gene Ontology](http://geneontology.org/docs/ontology-documentation/) (GO) or EC terms. The processed metatranscriptomics output can be subsequently used as an input for comparative statistical analysis via [metaQuantome](https://www.mcponline.org/content/18/8_suppl_1/S82) software suite.
3
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
8
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
9 Authors
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
10 -------
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
11
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
12 Authors and contributors:
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
13
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
14 * Marie Crane
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
15 * Praveen Kumar
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
16 * Subina Mehta
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
17 * Dihn Duy An Nguyen
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
18 * Pratik Jagtap
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
19
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
20
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
21 # Instructions to run MT2MQ:
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
22 --------------------------
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
23
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
24 The ASAIM workflow can be run following the training module on the [GTN](https://training.galaxyproject.org/training-material/topics/metagenomics/tutorials/metatranscriptomics/tutorial.html).
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
25 However, for training purposes we have provided inputs in the [test data](https://github.com/galaxyproteomics/tools-galaxyp/tree/master/tools/mt2mq/test-data).
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
26
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
27 ## Data upload
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
28
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
29 - Upload the files mentioned below to the Galaxy Europe instance.
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
30 ```
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
31 https://github.com/galaxyproteomics/tools-galaxyp/blob/master/tools/mt2mq/test-data/T4A.tsv
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
32 https://github.com/galaxyproteomics/tools-galaxyp/blob/master/tools/mt2mq/test-data/T4B.tsv
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
33 https://github.com/galaxyproteomics/tools-galaxyp/blob/master/tools/mt2mq/test-data/T4C.tsv
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
34 https://github.com/galaxyproteomics/tools-galaxyp/blob/master/tools/mt2mq/test-data/T7A.tsv
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
35 https://github.com/galaxyproteomics/tools-galaxyp/blob/master/tools/mt2mq/test-data/T7B.tsv
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
36 https://github.com/galaxyproteomics/tools-galaxyp/blob/master/tools/mt2mq/test-data/T7C.tsv
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
37 https://github.com/galaxyproteomics/tools-galaxyp/blob/master/tools/mt2mq/test-data/T4T7_func.tsv
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
38
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
39 ```
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
40
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
41 ## Functional mode:
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
42
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
43 1. Build a **Dataset list** for the six .tsv files( `T4A`,`T4B`,`T4C`,`T7A`,`T7B`,`T7C`).
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
44 - Click the **Operations on multiple datasets** check box at the top of the history panel.
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
45 - Select the files mentioned above.
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
46 - Click on ** For all selected** drop down menu and select **Build Dataset list**.
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
47 - Once the collection is created, rename the dataset collection as `Input collection`.
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
48
5
68ecc89b6cbb "planemo upload commit 499df83651e020e9106d70ec658edf86355f49d9"
galaxyp
parents: 4
diff changeset
49 2. Download the map_go_uniref50.txt file from [zenodo](https://doi.org/10.5281/zenodo.4115871).
3
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
50
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
51 3. Run the **Regroup a HUMAnN2 generated table by features**(Galaxy Version 0.11.1.0) tool is regrouping table features (abundances or coverage) given a table of feature values and a mapping of groups to component features. It produces a new table with group values in place of feature values.
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
52 - [**Regroup a HUMAnN2 generated table by features**](https://toolshed.g2.bx.psu.edu/repository?repository_id=85391b8d5d7ad39d) with the following parameters:
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
53
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
54 - *"Gene/pathway table"*: `Input collection`
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
55 - *"How to combine grouped features?"*: `Sum`
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
56 - In *"Use built-in grouping options?"*: `No`
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
57 - *"Custom groups file"*: `map_go_uniref50.txt`
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
58 - *"Is the groups file reversed?"*: `No`
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
59 - *"Decimal places to round to after applying function"*: `3`
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
60 - *"Include an 'UNGROUPED' group to capture features that did not belong to other groups?"*: `Yes`
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
61 - *"Carry through protected features, such as 'UNMAPPED'?"*: `Yes`
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
62
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
63 Once this tool is run, rename the dataset collection as `Regrouped collection` .
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
64
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
65 4. Run the **Rename features of a HUMAnN2 generated table** (Galaxy Version 0.11.1.0)tool to change the Uniref-50 values to GO term .
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
66 - [**Rename features of a HUMAnN2 generated table**](https://toolshed.g2.bx.psu.edu/repository?repository_id=c68108109505c2f5) with the following parameters:
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
67
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
68 - *"Gene/pathway table"*: `Regrouped collection`
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
69 - *"Type of renaming"*: `Standard renaming`
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
70 - *"Table features that can be renamed?"*: `Gene Ontology (GO)`
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
71 - *"Remove non-alphanumeric characters from names?"*: `No`
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
72
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
73 Once this tool is run, rename the dataset collection as `Renamed collection`.
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
74
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
75
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
76 5. Run the **Join HUMAnN2 generated tables** (Galaxy Version 0.11.1.1) tool to merge all the files into one.
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
77 - [**Join HUMAnN2 generated tables**](https://toolshed.g2.bx.psu.edu/repository?repository_id=9b27f096128b26ff) with the following parameters:
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
78
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
79 - *"Gene/pathway table"*: `Renamed collection`
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
80
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
81 Once this tool is run, rename the dataset collection as `Joined Data`.
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
82
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
83 6. Run the **Renormalize a HUMAnN2 generated table** (Galaxy Version 0.11.1.0) tool to normalize the data.
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
84 - [**Renormalize a HUMAnN2 generated table**](https://toolshed.g2.bx.psu.edu/repository?repository_id=05a56fcdeac2a25c) with the following parameters:
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
85
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
86 - *"Gene/pathway table"*: `Joined Data`
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
87 - *"Normalization scheme"*: `Copies per million`
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
88 - *"Normalization level"*: `Normalization of all levels by community total`
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
89 - *"Include the special features UNMAPPED, UNINTEGRATED, and UNGROUPED?"*: `Yes`
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
90 - *"Update '-RPK' in sample names to appropriate suffix?"*: `No`
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
91
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
92 Once this tool is run, rename the dataset collection as `Renormalized data`.
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
93
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
94
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
95 7. Now that the data is ready, we can run **MT2MQ Tool to prepare metatranscriptomic outputs from ASaiM for Metaquantome** (Galaxy Version 1.1.0)on this data.
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
96 - [**MT2MQ Tool to prepare metatranscriptomic outputs from ASaiM for Metaquantome**](https://toolshed.g2.bx.psu.edu/repository?repository_id=cab5d81c5f0a2f94) with the
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
97 following parameters:
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
98 - *"Mode"*: `Function`
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
99 - *"GO namespace"*: `Molecular Function` or `Biological Process` or ` Cellular Component`
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
100 - *"File from HUMAnN2 after regrouping, renaming, joining, and renormalizing"*: `Renormalized data`
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
101
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
102 **Note** : The MT2MQ tools can be run will all three GO name space.
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
103
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
104 There are two tabular outputs from this tool.
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
105
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
106 - A f_int.tabular output which mimics the Intensity input file for metaQuantome.
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
107 - A func.tabular output which mimics the Functional input file for metaQuantome.
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
108
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
109 The resulting output files can be used as input for metaQuatome's functional mode.
faab3160cfcf "planemo upload commit 98f9787ed5158179a5baf54cea3dbe70a02aaf59"
galaxyp
parents:
diff changeset
110 To run metaQuantome Function mode. Follow the [GTN](https://github.com/subinamehta/training-material/tree/metaquantome-2-3/topics/proteomics/tutorials/metaquantome-function).