# HG changeset patch # User devteam # Date 1378845708 14400 # Node ID 7d6cc3605cee2d8a6ca9333fb1fecb295ba82e88 # Parent b2814057f28675868f83504510da04649dd3d8ea Uploaded updated tool dependency definition. diff -r b2814057f286 -r 7d6cc3605cee tool_dependencies.xml --- a/tool_dependencies.xml Fri Aug 30 13:51:25 2013 -0400 +++ b/tool_dependencies.xml Tue Sep 10 16:41:48 2013 -0400 @@ -1,8 +1,5 @@ - - - @@ -34,10 +31,20 @@ $INSTALL_DIR/bin + + http://hannonlab.cshl.edu/fastx_toolkit/fastx_toolkit_0.0.13_binaries_MacOSX.10.5.8_i386.tar.bz2 + + ./bin + $INSTALL_DIR/bin + + + + + http://hannonlab.cshl.edu/fastx_toolkit/fastx_toolkit-0.0.13.tar.bz2 - + @@ -50,6 +57,15 @@ +The FASTX-Toolkit is a collection of command line tools for Short-Reads FASTA/FASTQ files preprocessing. + +Next-Generation sequencing machines usually produce FASTA or FASTQ files, containing multiple short-reads sequences (possibly with quality information). + +The main processing of such FASTA/FASTQ files is mapping (aka aligning) the sequences to reference genomes or other databases using specialized programs. Example of such mapping programs are: Blat, SHRiMP, LastZ, MAQ and many many others. + +However, it is sometimes more productive to preprocess the FASTA/FASTQ files before mapping the sequences to the genome - manipulating the sequences to produce better mapping results. + +The FASTX-Toolkit tools perform some of these preprocessing tasks.