comparison data_manager/hisat_index_builder.xml @ 0:8082f04de7ae draft default tip

planemo upload for repository https://github.com/galaxyproject/tools-devteam/tree/master/data_managers/data_manager_hisat_index_builder commit 86cf90107482cab1cb47fc0d42d6705f8077daa7
author devteam
date Fri, 06 Nov 2015 14:17:24 -0500
parents
children
comparison
equal deleted inserted replaced
-1:000000000000 0:8082f04de7ae
1 <tool id="hisat_index_builder_data_manager" name="HISAT index" tool_type="manage_data" version="1.0.0">
2 <description>builder</description>
3 <requirements>
4 <requirement type="package" version="0.1.6">hisat</requirement>
5 </requirements>
6 <stdio>
7 <exit_code range=":-1" />
8 <exit_code range="1:" />
9 </stdio>
10 <command interpreter="python">hisat_index_builder.py "${out_file}" --fasta_filename "${all_fasta_source.fields.path}" --fasta_dbkey "${all_fasta_source.fields.dbkey}" --fasta_description "${all_fasta_source.fields.name}" --data_table_name "hisat_indexes"</command>
11 <inputs>
12 <param label="Source FASTA Sequence" name="all_fasta_source" type="select">
13 <options from_data_table="all_fasta" />
14 </param>
15 <param label="Name of sequence" name="sequence_name" type="text" value="" />
16 <param label="ID for sequence" name="sequence_id" type="text" value="" />
17 </inputs>
18 <outputs>
19 <data format="data_manager_json" name="out_file" />
20 </outputs>
21 <help>
22 <![CDATA[
23 .. class:: infomark
24
25 **Notice:** If you leave name, description, or id blank, it will be generated automatically.
26
27 What is HISAT?
28 --------------
29
30 `HISAT <http://ccb.jhu.edu/software/hisat>`__ is a fast and sensitive
31 spliced alignment program. As part of HISAT, we have developed a new
32 indexing scheme based on the Burrows-Wheeler transform
33 (`BWT <http://en.wikipedia.org/wiki/Burrows-Wheeler_transform>`__) and
34 the `FM index <http://en.wikipedia.org/wiki/FM-index>`__, called
35 hierarchical indexing, that employs two types of indexes: (1) one global
36 FM index representing the whole genome, and (2) many separate local FM
37 indexes for small regions collectively covering the genome. Our
38 hierarchical index for the human genome (about 3 billion bp) includes
39 ~48,000 local FM indexes, each representing a genomic region of
40 ~64,000bp. As the basis for non-gapped alignment, the FM index is
41 extremely fast with a low memory footprint, as demonstrated by
42 `Bowtie <http://bowtie-bio.sf.net>`__. In addition, HISAT provides
43 several alignment strategies specifically designed for mapping different
44 types of RNA-seq reads. All these together, HISAT enables extremely fast
45 and sensitive alignment of reads, in particular those spanning two exons
46 or more. As a result, HISAT is much faster >50 times than
47 `TopHat2 <http://ccb.jhu.edu/software/tophat>`__ with better alignment
48 quality. Although it uses a large number of indexes, the memory
49 requirement of HISAT is still modest, approximately 4.3 GB for human.
50 HISAT uses the `Bowtie2 <http://bowtie-bio.sf.net/bowtie2>`__
51 implementation to handle most of the operations on the FM index. In
52 addition to spliced alignment, HISAT handles reads involving indels and
53 supports a paired-end alignment mode. Multiple processors can be used
54 simultaneously to achieve greater alignment speed. HISAT outputs
55 alignments in `SAM <http://samtools.sourceforge.net/SAM1.pdf>`__ format,
56 enabling interoperation with a large number of other tools (e.g.
57 `SAMtools <http://samtools.sourceforge.net>`__,
58 `GATK <http://www.broadinstitute.org/gsa/wiki/index.php/The_Genome_Analysis_Toolkit>`__)
59 that use SAM. HISAT is distributed under the `GPLv3
60 license <http://www.gnu.org/licenses/gpl-3.0.html>`__, and it runs on
61 the command line under Linux, Mac OS X and Windows.
62 ]]>
63 </help>
64 <citations>
65 <citation type="doi">10.1038/nmeth.3317</citation>
66 </citations>
67 </tool>