changeset 5:5f3cc8229771 draft

Uploaded v0.0.1e with links to sample data in README
author peterjc
date Tue, 20 Aug 2013 07:37:19 -0400
parents b14c822a37fe
children 36b2c2b5051e
files README.rst
diffstat 1 files changed, 16 insertions(+), 2 deletions(-) [+]
line wrap: on
line diff
--- a/README.rst	Mon Aug 19 11:38:03 2013 -0400
+++ b/README.rst	Tue Aug 20 07:37:19 2013 -0400
@@ -4,12 +4,26 @@
 It runs SignalP v3.0 (Bendtsen et al. 2004) and selects only proteins with a
 strong predicted signal peptide, and then runs TMHMM v2.0 (Krogh et al. 2001)
 on those, and selects only proteins without a predicted trans-membrane helix.
-This workflow was used in Kikuchi et al (2001), and is a simplification of
-the candidate effector protocol described in Jones et al (2009).
+This workflow was used in Kikuchi et al. (2011), and is a simplification of
+the candidate effector protocol described in Jones et al. (2009).
 
 See http://www.galaxyproject.org for information about the Galaxy Project.
 
 
+Sample Data
+===========
+
+This workflow was developed and run on several nematode species. For example,
+try the protein set for Bursaphelenchus xylophilus (Kikuchi et al. 2011):
+
+ftp://ftp.sanger.ac.uk/pub/pathogens/Bursaphelenchus/xylophilus/Assembly-v1.2/BUX.v1.2.genedb.protein.fa.gz
+
+You can upload this directly into Galaxy via this URL. Galaxy will handle
+removing the gzip compression to give you the FASTA protein file which has
+18,074 sequences. The expected result (selecting organism type Eukaryote)
+is a FASTA protein file of 2,297 predicted secreted protein sequences.
+
+
 Citation
 ========