annotate readme.rst @ 3:257453ff1f3d draft default tip

Uploaded
author bgruening
date Tue, 17 Mar 2015 13:39:34 -0400
parents b876c71cc0b1
children
Galaxy workflow for the identification of candidate gene clusters
-----------------------------------------------------------------

This approach screens three proteins against a given genome sequence to find genome positions
where all three genes are located close together. As usual in Galaxy workflows, every
parameter, including the proximity distance, can be changed, and additional steps
can easily be added, for example additional filtering to refine the initial BLAST
hits, or the inclusion of a further query sequence.

.. image:: https://raw.githubusercontent.com/bgruening/galaxytools/master/workflows/ncbi_blast_plus/find_three_genes_located_nearby/find_three_genes_located_nearby.png

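The screening idea described above, looking for a region where hits for all three queries lie close together, can be sketched outside Galaxy as follows. This is only an illustration and not the workflow's actual implementation: the ``Hit`` record, the ``find_clusters`` function, the ``max_distance`` default and the example coordinates are all assumptions made for this sketch, and the hits are assumed to have been parsed already from BLAST output.

```python
from collections import namedtuple

# Hypothetical record for one BLAST hit: query ID, contig name, and
# start/end coordinates on that contig (start <= end).
Hit = namedtuple("Hit", ["query", "contig", "start", "end"])


def find_clusters(hits, queries, max_distance=10000):
    """Return (contig, start, end) regions that contain a hit for every query.

    A window qualifies when hits for all queries fit within max_distance
    bases of the window start. Overlapping windows are merged, so a
    reported region can be longer than max_distance. The default value is
    illustrative, not the one used by the workflow.
    """
    wanted = set(queries)
    by_contig = {}
    for hit in hits:
        if hit.query in wanted:
            by_contig.setdefault(hit.contig, []).append(hit)

    clusters = []
    for contig, contig_hits in sorted(by_contig.items()):
        contig_hits.sort(key=lambda h: h.start)
        for i, first in enumerate(contig_hits):
            if first.end - first.start > max_distance:
                continue  # this hit alone does not fit into a window
            seen = set()
            region_end = first.end
            for other in contig_hits[i:]:
                if other.start - first.start > max_distance:
                    break  # sorted by start, so later hits are further away
                if other.end - first.start <= max_distance:
                    seen.add(other.query)
                    region_end = max(region_end, other.end)
            if seen != wanted:
                continue
            previous = clusters[-1] if clusters else None
            if previous and previous[0] == contig and first.start <= previous[2]:
                # Overlaps the previously reported region: merge the two.
                clusters[-1] = (contig, previous[1], max(previous[2], region_end))
            else:
                clusters.append((contig, first.start, region_end))
    return clusters


# Invented coordinates, for illustration only.
queries = ["P61920", "P61921", "Q6LDH1"]
example_hits = [
    Hit("P61920", "chr1", 100, 200),
    Hit("P61921", "chr1", 5000, 5100),
    Hit("Q6LDH1", "chr1", 9000, 9100),
    Hit("P61920", "chr2", 100, 200),
    Hit("P61921", "chr2", 50000, 50100),
    Hit("Q6LDH1", "chr2", 90000, 90100),
]
print(find_clusters(example_hits, queries))
```

With the default ``max_distance`` only the ``chr1`` hits are close enough to form a cluster; raising the value to 100000 would also report the ``chr2`` region. This mirrors how changing the proximity distance in the workflow changes which candidate clusters are reported.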

Sample Data
===========

As an example, we will use three protein sequences from *Pan troglodytes* (chimpanzee)
that are part of the β-globin cluster.

You can upload all sequences directly into Galaxy using the "Upload" tool
with any of these URLs; Galaxy should recognise them as FASTA files.

Query sequences:

* `P61920.fasta <https://raw.githubusercontent.com/bgruening/galaxytools/master/workflows/ncbi_blast_plus/find_three_genes_located_nearby/P61920.fasta>`_
* `P61921.fasta <https://raw.githubusercontent.com/bgruening/galaxytools/master/workflows/ncbi_blast_plus/find_three_genes_located_nearby/P61921.fasta>`_
* `Q6LDH1.fasta <https://raw.githubusercontent.com/bgruening/galaxytools/master/workflows/ncbi_blast_plus/find_three_genes_located_nearby/Q6LDH1.fasta>`_

Genome sequence:

* http://hgdownload.cse.ucsc.edu/goldenPath/rn6/bigZips/rn6.fa.gz


In addition, you can find the query sequences on the UniProt server:

* http://www.uniprot.org/uniprot/P61920 (Hemoglobin subunit gamma-1)
  ::

    >sp|P61920|HBG1_PANTR Hemoglobin subunit gamma-1 OS=Pan troglodytes GN=HBG1 PE=1 SV=2
    MGHFTEEDKATITSLWGKVNVEDAGGETLGRLLVVYPWTQRFFDSFGNLSSASAIMGNPK
    VKAHGKKVLTSLGDAIKHLDDLKGTFAQLSELHCDKLHVDPENFKLLGNVLVTVLAIHFG
    KEFTPEVQASWQKMVTAVASALSSRYH


* http://www.uniprot.org/uniprot/P61921 (Hemoglobin subunit gamma-2)
  ::

    >sp|P61921|HBG2_PANTR Hemoglobin subunit gamma-2 OS=Pan troglodytes GN=HBG2 PE=1 SV=2
    MGHFTEEDKATITSLWGKVNVEDAGGETLGRLLVVYPWTQRFFDSFGNLSSASAIMGNPK
    VKAHGKKVLTSLGDAIKHLDDLKGTFAQLSELHCDKLHVDPENFKLLGNVLVTVLAIHFG
    KEFTPEVQASWQKMVTGVASALSSRYH


* http://www.uniprot.org/uniprot/Q6LDH1 (Hemoglobin subunit epsilon)
  ::

    >sp|Q6LDH1|HBE_PANTR Hemoglobin subunit epsilon OS=Pan troglodytes GN=HBE1 PE=2 SV=3
    MVHFTAEEKAAVTSLWSKMNVEEAGGEALGRLLVVYPWTQRFFDSFGNLSSPSAILGNPK
    VKAHGKKVLTSFGDAIKNMDNLKPAFAKLSELHCDKLHVDPENFKLLGNVMVIILATHFG
    KEFTPEVQAAWQKLVSAVAIALAHKYH

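If you want to check a query file before uploading it, a FASTA record such as the ones above can be parsed with a few lines of Python. This is a minimal sketch under stated assumptions: ``read_fasta`` and ``accession`` are hypothetical helpers written for this example, not part of Galaxy or the BLAST+ wrappers, and ``accession`` assumes UniProt-style headers of the form ``sp|ACCESSION|NAME description``.

```python
def read_fasta(text):
    """Parse FASTA text into a dict mapping each header line to its sequence."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # ignore blank lines
        if line.startswith(">"):
            header = line[1:]
            records[header] = []
        elif header is None:
            raise ValueError("sequence data found before the first FASTA header")
        else:
            records[header].append(line)
    # Join the wrapped sequence lines of each record into one string.
    return {name: "".join(parts) for name, parts in records.items()}


def accession(header):
    """Return the accession from a UniProt-style header 'sp|ACC|NAME ...'."""
    return header.split("|")[1]


# The gamma-1 record as listed in this README.
example = """\
>sp|P61920|HBG1_PANTR Hemoglobin subunit gamma-1 OS=Pan troglodytes GN=HBG1 PE=1 SV=2
MGHFTEEDKATITSLWGKVNVEDAGGETLGRLLVVYPWTQRFFDSFGNLSSASAIMGNPK
VKAHGKKVLTSLGDAIKHLDDLKGTFAQLSELHCDKLHVDPENFKLLGNVLVTVLAIHFG
KEFTPEVQASWQKMVTAVASALSSRYH
"""

sequences = {accession(h): seq for h, seq in read_fasta(example).items()}
for acc, seq in sequences.items():
    print(acc, len(seq))
```

Keying the result by accession makes it easy to match each sequence to the query IDs that appear in the BLAST results.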

Citation
========

If you use this workflow directly, or a derivative of it, or the associated
NCBI BLAST wrappers for Galaxy, in work leading to a scientific publication,
please cite:

Peter J. A. Cock, John M. Chilton, Björn Grüning, James E. Johnson, Nicola Soranzo
NCBI BLAST+ integrated into Galaxy

* http://biorxiv.org/content/early/2015/01/21/014043
* http://dx.doi.org/10.1101/014043


Availability
============

This workflow is available on the main Galaxy Tool Shed:

http://toolshed.g2.bx.psu.edu/view/bgruening/find_three_genes_located_nearby_workflow

Development is being done on GitHub:

https://github.com/bgruening/galaxytools/tree/master/workflows/ncbi_blast_plus/find_three_genes_located_nearby


Dependencies
============

These dependencies should be resolved automatically via the Galaxy Tool Shed:

* http://toolshed.g2.bx.psu.edu/view/devteam/ncbi_blast_plus