annotate README.txt @ 3:38270a3e474e draft default tip

Uploaded v1.0.1 with updated readme file.
author peterjc
date Wed, 05 Jun 2013 13:40:29 -0400
parents dfd7c3ff3447
rev   line source
3
38270a3e474e Uploaded v1.0.1 with updated readme file.
peterjc
parents: 2
diff changeset
#Created 07/01/2011 - Konrad Paszkiewicz, Exeter Sequencing Service, University of Exeter, UK
Revisions 2013 by Peter Cock, The James Hutton Institute, UK
2
dfd7c3ff3447 Restore original folder structure
peterjc
parents:
diff changeset
The attached is a crude wrapper script for InterProScan. It is typically useful when one wants to produce an annotation which is not based on sequence
similarity. For example, after a de novo transcriptome assembly, each transcript could be translated and run through this tool.

Prerequisites:

1. A working installation of InterProScan on your Galaxy server/cluster.

Limitations:

Currently it is set up to work with PFAM only, due to the heavy computational demands InterProScan makes.

Input formats:

The standard InterProScan input is either genomic or protein sequences. In the case of genomic sequences, InterProScan will run an ORF
prediction tool. However, this tends to lose the ORF information (e.g. start/end co-ordinates) from the header. As such, the requirement here is to input ORF
sequences (e.g. from EMBOSS getorf) and to then replace any spaces in the FASTA header with underscores. This workaround generally preserves the relevant
positional information.
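
The header clean-up described above could be done with a few lines of Python before uploading the file. This is only a minimal sketch and not part of the wrapper itself: the function names and the example header are illustrative assumptions, and it simply replaces every space in each line starting with ">" while leaving sequence lines untouched.

```python
def underscore_fasta_headers(lines):
    """Yield FASTA lines with spaces in header lines replaced by underscores.

    Hypothetical helper for illustration; sequence lines pass through unchanged.
    """
    for line in lines:
        if line.startswith(">"):
            # Header line: strip the line ending, swap spaces for underscores,
            # then restore a newline so the output stays one record per line.
            yield line.rstrip("\r\n").replace(" ", "_") + "\n"
        else:
            yield line


def convert_file(in_path, out_path):
    """Rewrite the FASTA file at in_path to out_path with space-free headers."""
    with open(in_path) as src, open(out_path, "w") as dst:
        dst.writelines(underscore_fasta_headers(src))


if __name__ == "__main__":
    # Made-up header in the style of an ORF caller's output
    example = [">contig1_1 [3 - 50]\n", "MKVAL\n"]
    print("".join(underscore_fasta_headers(example)), end="")
```

Because the whole header becomes a single space-free token, the co-ordinates stay attached to the sequence identifier. If the original spacing is needed later, it would have to be recovered by other means, since any underscores already present in the identifier are indistinguishable from the substituted ones.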