comparison htseq-count.xml @ 6:08a11d1eaec6

Updated HTSEQ package to version 0.5.4p1, attempted to fix galaxy install where lib64 directory does not exist
author lparsons
date Mon, 11 Mar 2013 12:42:42 -0400
parents 0a835934d792
children 1615d0b5b809
comparison
equal deleted inserted replaced
5:0a835934d792 6:08a11d1eaec6
1 <tool id="htseq_count" name="htseq-count" version="0.3"> 1 <tool id="htseq_count" name="htseq-count" version="0.3.1">
2 <description> - Count aligned reads in a BAM file that overlap features in a GFF file</description> 2 <description> - Count aligned reads in a BAM file that overlap features in a GFF file</description>
3 <version_command>htseq-count -h | grep version | sed 's/^\(.*\)*\(version .*\)\./\2/'</version_command> 3 <version_command>htseq-count -h | grep version | sed 's/^\(.*\)*\(version .*\)\./\2/'</version_command>
4 <requirements> 4 <requirements>
5 <requirement type="package" version="1.6.2">numpy</requirement> 5 <requirement type="package" version="1.6.2">numpy</requirement>
6 <requirement type="package" version="0.5.3p9">htseq</requirement> 6 <requirement type="package" version="0.5.4p1">htseq</requirement>
7 <requirement type="package" version="0.1.18">samtools</requirement> 7 <requirement type="package" version="0.1.18">samtools</requirement>
8 <requirement type="package" version="1.56.0">picard</requirement> 8 <requirement type="package" version="1.56.0">picard</requirement>
9 </requirements> 9 </requirements>
10 <command> 10 <command>
11 ##set up input files 11 ##set up input files
176 each gene is considered here as the union of all its exons. One may also consider 176 each gene is considered here as the union of all its exons. One may also consider
177 each exon as a feature, e.g., in order to check for alternative splicing. For 177 each exon as a feature, e.g., in order to check for alternative splicing. For
178 comparative ChIP-Seq, the features might be binding regions from a pre-determined 178 comparative ChIP-Seq, the features might be binding regions from a pre-determined
179 list. 179 list.
180 180
181 **Paired-end Data MUST be sorted by QUERY NAME first**
182
183 This tool requires that paired-end data be sorted by query name, which is NOT the default for Galaxy. Using the Picard Paired Read Mate Fixer with Query name sort FIRST is required for paired end data.
184
185 181
186 Overlap Modes 182 Overlap Modes
187 ------------- 183 -------------
188 184
189 Special care must be taken to decide how to deal with reads that overlap more than one feature. 185 Special care must be taken to decide how to deal with reads that overlap more than one feature.
193 The following figure illustrates the effect of these three modes: 189 The following figure illustrates the effect of these three modes:
194 190
195 .. image:: /static/images/count_modes.png 191 .. image:: /static/images/count_modes.png
196 :width: 500 192 :width: 500
197 193
194
198 Strandedness 195 Strandedness
199 ------------ 196 ------------
200 197
201 **Important**: The default for strandedness is yes. If your RNA-Seq data has not been made with a strand-specific protocol, this causes half of the reads to be lost. Hence, make sure to set the option Stranded to 'No' unless you have strand-specific data! 198 **Important**: The default for strandedness is yes. If your RNA-Seq data has not been made with a strand-specific protocol, this causes half of the reads to be lost. Hence, make sure to set the option Stranded to 'No' unless you have strand-specific data!
199
202 200
203 Output 201 Output
204 ------ 202 ------
205 203
206 The script outputs a table with counts for each feature, followed by the special counters, which count reads that were not counted for any feature for various reasons, namely 204 The script outputs a table with counts for each feature, followed by the special counters, which count reads that were not counted for any feature for various reasons, namely