|
19
|
1 <tool id="ukseed_stage_1" name="UK SeeD Dartseq Analysis #1" version="0.0.2">
|
|
|
2 <description>run a dartseq analysis (stage #1)</description>
|
|
15
|
3 <requirements>
|
|
24
|
4 <requirement type="package" version="1.12.2">bioconductor-snprelate</requirement>
|
|
15
|
5 <requirement type="package" version="1.90b4">plink</requirement>
|
|
|
6 <requirement type="package" version="0.8_14">r-amap</requirement>
|
|
|
7 <requirement type="package" version="1.0">r-dartr</requirement>
|
|
24
|
8 <requirement type="package" version="1.8_0">r-labdsv</requirement>
|
|
|
9 <!--<requirement type="package" version="2.10.0">bioconductor-qvalue</requirement>-->
|
|
15
|
10 </requirements>
|
|
|
11
|
|
|
12 <command><![CDATA[
|
|
25
|
13 Rscript --vanilla $__tool_directory__/dartseq_seeduk_1.R $input > $output
|
|
15
|
14 ]]></command>
|
|
|
15
|
|
|
16 <inputs>
|
|
|
17 <param format="csv,txt" name="input" type="data" label="Input file"
|
|
|
18 help="Input file of genotype data"/>
|
|
|
19 </inputs>
|
|
|
20
|
|
|
21 <outputs>
|
|
|
22 <data name="output" format="txt" />
|
|
|
23 </outputs>
|
|
|
24
|
|
|
25 <stdio>
|
|
|
26 <exit_code range="1:" />
|
|
|
27 </stdio>
|
|
|
28
|
|
|
29 <help>
|
|
27
|
30 This pipeline has been developed for loading a DArT SNP or SilicoDArT, report and apply filters to those datasets based on locus and
|
|
|
31 individuals call rates and locus reproducibility. It also allows data export for other formats such as GDS, plink bed and a text file
|
|
|
32 with a header line, and then one line per sample with V+6 where V is the number of variants suitable for loading into R. Finally the
|
|
|
33 pipeline perform a Principal Coordinates Analysis (PCoA, = Multidimensional scaling, MDS) to explore similarities of data and outputs
|
|
|
34 a vcf file suitable for visualizing in CurlyWhirly.
|
|
|
35
|
|
|
36 In **UK-SeeD Data Analysis Infrastructure**, a BBSRC-Newton funded project, we have deployed an advanced computing hardware and software
|
|
|
37 platform for the analysis of large genomics datasets for wheat varieties. The platform integrates computing resources and bioinformatics
|
|
|
38 expertise to enable crop geneticists to implement sophisticated data analysis algorithms to improve the use of genetic resources for
|
|
|
39 wheat and other important crops. The computing platform is distributed across the partners’ sites with hardware deployed at CIMMYT
|
|
|
40 (Mexico) and the Earlham Institute (UK).
|
|
15
|
41
|
|
20
|
42 |LOGOS|
|
|
15
|
43
|
|
20
|
44 .. |LOGOS| image:: https://ics.hutton.ac.uk/resources/ukseed/logos.png
|
|
15
|
45 </help>
|
|
|
46
|
|
|
47 <citations>
|
|
|
48 </citations>
|
|
|
49 </tool> |