# HG changeset patch # User pjbriggs # Date 1421057567 18000 # Node ID 718d2bda28f16a8902725465022b375f3954e011 Uploaded initial copy from main tooshed diff -r 000000000000 -r 718d2bda28f1 trimmomatic-2bd7cdbb6228/README.rst --- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/trimmomatic-2bd7cdbb6228/README.rst Mon Jan 12 05:12:47 2015 -0500 @@ -0,0 +1,104 @@ +Trimmomatic: flexible read trimming tool for Illumina NGS data +============================================================== + +Galaxy tool wrapper for the Trimmomatic program, which provides various functions for +manipluating Illumina FASTQ files (both single and paired-end). + +Trimmomatic has been developed within Bjorn Usadel's group at RWTH Aachen university +http://www.usadellab.org/cms/index.php?page=trimmomatic + +The reference for Trimmomatic is: + +- Bolger, A.M., Lohse, M., & Usadel, B. (2014). Trimmomatic: A flexible trimmer + for Illumina Sequence Data. Bioinformatics, btu170. + +Automated installation +====================== + +Installation via the Galaxy Tool Shed will take of installing the tool wrapper and +the trimmomatic program and data, and setting the appropriate environment variables. + +Manual Installation +=================== + +There are two files to install: + +- ``trimmomatic.xml`` (the Galaxy tool definition) +- ``trimmomatic.sh`` (the shell script wrapper) + +The suggested location is in a ``tools/trimmomatic/`` folder. You will then +need to modify the ``tools_conf.xml`` file to tell Galaxy to offer the tool +by adding the line: + + + +You will also need to install trimmomatic 0.32: + +- http://www.usadellab.org/cms/uploads/supplementary/Trimmomatic/Trimmomatic-0.32.zip + +The tool wrapper uses the following environment variables in order to find the +appropriate files: + +- ``TRIMMOMATIC_DIR`` should point to the directory holding the + ``trimmomatic-0.32.jar`` file +- ``TRIMMOMATIC_ADAPTERS_DIR`` should point to the directory holding the adapter + sequence files (used by the ``ILLUMINACLIP`` option). + +If you want to run the functional tests, copy the sample test files under +sample test files under Galaxy's ``test-data/`` directory. Then: + + ./run_tests.sh -id trimmomatic + +You will need to have set the environment variables above. + +History +======= + +========== ====================================================================== +Version Changes +---------- ---------------------------------------------------------------------- +0.32.1 - Remove ``trimmomatic_adapters.loc.sample`` and hard-code adapter files + into the XML wrapper. +0.32.0 - Add tool_dependencies.xml to install Trimmomatic 0.32 automatically and + set the environment. + - Update tool versioning to use Trimmomatic version number (i.e. ``0.32``) + with tool iteration appended (i.e. ``.1``). +0.0.4 - Specify '-threads 6' in section. +0.0.3 - Added MINLEN, LEADING, TRAILING, CROP and HEADCROP options of trimmomatic. +0.0.2 - Updated ILLUMINACLIP option to use standard adapter sequences (requires + the trimmomatic_adapters.loc file; sample version is supplied) plus + cosmetic updates to wording and help text for some options. +0.0.1 - Initial version +========== ====================================================================== + + +Developers +========== + +This tool is developed on the following GitHub repository: +https://github.com/fls-bioinformatics-core/galaxy-tools/tree/master/trimmomatic + +For making the "Galaxy Tool Shed" http://toolshed.g2.bx.psu.edu/ tarball I use +the ``package_trimmomatic.sh`` script. + + +Licence (MIT) +============= + +Permission is hereby granted, free of charge, to any person obtaining a copy +of this software and associated documentation files (the "Software"), to deal +in the Software without restriction, including without limitation the rights +to use, copy, modify, merge, publish, distribute, sublicense, and/or sell +copies of the Software, and to permit persons to whom the Software is +furnished to do so, subject to the following conditions: + +The above copyright notice and this permission notice shall be included in +all copies or substantial portions of the Software. + +THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR +IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, +FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE +AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER +LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, +OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN +THE SOFTWARE. diff -r 000000000000 -r 718d2bda28f1 trimmomatic-2bd7cdbb6228/test-data/Illumina_SG_R1.fastq --- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/trimmomatic-2bd7cdbb6228/test-data/Illumina_SG_R1.fastq Mon Jan 12 05:12:47 2015 -0500 @@ -0,0 +1,40 @@ +@MISEQ:1:000000000-A2Y1L:1:1101:19264:2433 1:N:0:NAAGGCGATAGATCGC +AGATAGCCGAAGATAAAGAGNTCATAACCGTAAAGGCCAGAGACGAGAACCACTCGCAGGCAAGGTTTGCAAGCCTCGCCAAGAATATTATAGACCCGCTTGCCAACGACTGGCATTACATCACCGTAAAGAAGGTAGGCAAGAATCACTGGATCAGCATAACAGATGAGGGTAGAGACGCCGTATCGTTTCTGGGCTAGGTTATCTGGGCCGCCTTCTTGCGTCGGCGCACAGTCACAAAGGAAATGCC ++ +?????BBB@BBBB?BBFFFF#66EAFHHHCEFEDFGHHHHF@DFHACEEF@FDFHHHHEEEHHHF?FGEHFHDEFGHHCEHEFHHHHFFFCFDFFFFBEEEEEEEEEEEB?EEEEEEEEEEEBBEE?CE?:ACCC*??CAAEECE:?ECA:?A*?EEEE?EEEE?0AAC/?8)00**0:8'8;48*?8/8?AEEEE8:A*8*0:**0**:?C?480?EE?A############################# +@MISEQ:1:000000000-A2Y1L:1:1101:18667:2435 1:N:0:NAAGGCGATAGATCGC +ATATATTCATCCGCCATTATNAGAGTCCGATTACTTTAGAACAGTGCCGCGAATCCTTCGTCAACTGTAATCCAAGCGACATTGGTTCGATTAGCACGCATGAGGCACGAGATCTACGCGTATCTCTGGTAGATACACCAGGAGGGATGAGAGTTGAGCGAGTTAGCTGTGGCCAGAGAGATGGGTCTAATCGTAGTCTACAACTGGGAGTCACATGCCACTACCGACTAATCTGGTCGCCGAACGGGAG ++ +?????BBBDDDDADDDE@FF#66CFCFHHBEHHIIIHIIHHFHHGHHHHHHHHHHIHIHIFHHHHHHEGFHIHHIIIHHHHHHHHCFHHHHHHHFFFFFFEEEEFFEFEEBBEEEFFE@E8;EEFFFEF*:AEEEEEFFFEAAEA?A?ACAEF?CEEFED2<.?AAECEAEFEEE8??A8?CAAE1?EEE?AE*888AEEEEFEEFFA8A8?ACEAEEEAE?C:??ED28.)8?< +@MISEQ:1:000000000-A2Y1L:1:1101:17523:2436 1:N:0:NAAGGCGATAGATCGC +CATCACTACCGCTCAGGAATNTGACGGCAGTCTTAGCGGCGCTCTAGTGCGGGAGGCCGTATCTTGGAATAAGGTGTCGTCAAATGCAAGGCAGGTAACCCTACACGCCGAGGTTACCACTATACTCCCGTTTCTATATTCCTGTCTCTTATACACATCTCCGAGCCCACGAGACTAAGGCGAATATCGGTTGCCGTCTTCTGCTTGAAAAAAAAAAACAAACAAGAGAAGACGACGTACTGCACCACCA ++ +?<,/<@@@@@@>@CCCE#66>ACEEEEGGGGGGGFFFEFDFFFFF=CDEFFFFEDEFFFFFC=AEBF-AA@CE;ECD5<5<==C==DEEE)83CD==@+4@DEE*D?)8):A8?EC?8***0*00:8?*)0*1:?CACC:AA?;)?EEFAEECE88*)1A*10*1:*1?:AC::A)*::*8ACEEE?########### +@MISEQ:1:000000000-A2Y1L:1:1101:15290:2442 1:N:0:NAAGGCGATAGATCGC +AAAATAATCCTAAAAAATAACCTCTATGCCGCCGAACGCTCCGCCTCTATCTTCGTAAAAACTATCTTCTCCTCCTCACCTCCATAATCAAGCATCAAGCGATCGCCCTCCGCCAACTCACCCCGCAAAATCTTATCCGCTAGCGGATTCTCAATCTCCGTCTGAATGACCCGCCTCAGCGGCCGCGCCCCATAAACCGAATCAAATCCACGCCTGTCTCTTATACACATCTCCGAGCCCACGAGACTAA ++ +?????BBBDDDDDDDDGGGGGGIIIHHFFHHHHHHHHHHEHHEHHHHHIIHHHHHFEHIIIHHIHHIHIHIIIIIHHHHHHHHHHHHHHHHDHHHHHHHHGEDFGGGGGGG;CEGEGCEGGGGG8>GGGGEGGEECEGGGGD8EDGGAEGEEGGCE:CGG8CEEGG???CEEDG.89D9DFG;FHHHHGHHFFFFHHHHECDCEFF>EFEEEE@CDFEFHHGHHHHEHDGHHFHDFHFHHHHFFHHH@FDD??DFFFFDBBEEBEEEEEEEEEEEEEEEEE,;?CEC;CEEEECB?CEEEEEEEECEEE?C?CE*0?***/:::?:*/?:A::**/:*:ACE?0:::A::***00::*/?C888??EEE############# +@MISEQ:1:000000000-A2Y1L:1:1101:15892:2446 1:N:0:NAAGGCGATAGATCGC +CTTCCCCACGGCCCAGACACAAGAGACGACCTCCATAAATCTTTTAGAGGGTGACCGCATCTCCGACGCAAACCAGGACGCCGATACCCTCGTGGTGGTGTTCGACCGTACGGATGGCGCAGACACCGACGGCACTAGTGCCACGGTATCAGGTACCACCATAACGTATGATTCGGGCACGCTCAAGGGCCAGCGTGACGGAATCGATAGAATACACTACACGGTGACTGATGGGGCCAGCCAAACCTGT ++ +?????BBBDBDDDDDDFFFFFFHIHIHHHHHHIHIFGGHFHHHHIIFHIHH?EEGHHHHHH-EGEHHCEHHHHH@FDFFEFF5@EEEFFEFE;AECCE;AEEEEEA?8?AEDDEEDFFDE2>>EEFF<<<2>D?DEEE*:C?AA<>8AEFCEE:?C?EEE?CEFEE0?:E?ACEECD8>EE>)8>E:CEEEEEED.)?AE??A?:A?*??:C0?CCE?AAA:88.88?::C:C?*8)4A?A######### +@MISEQ:1:000000000-A2Y1L:1:1101:17903:2450 1:N:0:TAAGGCGATAGATCGC +GTGCAGGGGGTGTGGTCAATCCACACTGTTGCTGAGGTGATTGGGTCTCCGGGGTCAACTCCATGAGGTAGCAGACCAGTGCCAGAGATAGTGACCACAGTGTTGGCCCTGTCTCTTATACACATCTCCGAGCCCACGAGACTAAGGCGAATCTCGTATGCCGTCTTCTGCTTGAAAAAAAAAAAAAGACTTTTAAAAAACCGATCTACACCTTGACAGGCTATTAAAGGAAACTGGCCTGGTATCACAG ++ +=5===<>+5<5<+5=@CC;8CEEEEE;-8ACFDE.7C87CF-A>E>E=FFE**>>5AEDAEED-5-5AADC=D+=DDE<++3+CCD9DCD+=@3:DEEEEDD<@D*9DE99;@;@EE@;@@EEE7@@8@:@@@;9EE'449?<>>@@@@@9@@CCEE@EE+++6C8-++CECE+>DCC>@@EFFFFEEEDEE55+ACEF;-C+>EE++CBA--5A++CDC+4<<+=DE=CDEE@+@9D:D+4;@############################################################################################################################################### diff -r 000000000000 -r 718d2bda28f1 trimmomatic-2bd7cdbb6228/test-data/Illumina_SG_R2.fastq --- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/trimmomatic-2bd7cdbb6228/test-data/Illumina_SG_R2.fastq Mon Jan 12 05:12:47 2015 -0500 @@ -0,0 +1,40 @@ +@MISEQ:1:000000000-A2Y1L:1:1101:19264:2433 2:N:0:NAAGGCGATAGATCGC +GCCGATATGCAGATGCCGTTGATTGCTGCCAGCATAGCAGCCATTGCCGGCATATCCTTTGTGACTGTGCGCCGACGCAAGAAGGCGGCCCAGATAACCTAGCCCAGAAACGATACGGCGTCTCTACCCTCATCTGTTATGCTGATCCAGTGATTCTTGCCTACCTTCTTTACGGTGATGTAATGCCAGTCGTTGGCAAGCGGGTCTATAATATTCTTGGCGAGGCTTGCAAACCTGGCCTGCGAGTGGT ++ +??A??ABBDDDDDDDDFFEEEFHIIIIHHIIHIFHIIHHHFHHIIIIIHHHFHHIIIIIIFFFHHIIHGDEHHAAEHHHHFHHHHD=EBBEEEEEFFFFEEEECEFFEEFFF8*??AA?;>DEEEFFFF*?8AECFEFFFFFFEAEEECFFFCEEEFFFFFFEEFEACEEC?CAACEFEEEFE?0?EACAAAEEAACEEA*.48.'0*1::A?CAA?CC::4''4'00'8:?A8?E############## +@MISEQ:1:000000000-A2Y1L:1:1101:18667:2435 2:N:0:NAAGGCGATAGATCGC +GATGACATCAGCACAGGTTCCGCTTCCGCTGTCGCAACGACGTGGCTGCCATCGATTATATTCAATTCTGAGTGCTGCCGTAGGGCTAGATGTAAGACTCACGTTCGGCGACCAGATTAGTCGGTAGTGGCATGTGACTCCCCGTTGTAGACTACGATTAGACCCATCTCTCTGGCCACAGCTAACTCGCTCAACTCTCATCCCTCCTGGGGTATCTACCAGCGATACGCGTAGAACTCGTGCCTCATGC ++ +?????BBBDDDDDDDDGFFGGGHHHHIHHHHHHHIHHHHIHBCEEHFFHHHIIHFHHHIIIIIIIHIFBGFGFHHHHIHE@EEEHHFFHHFF?CFHHHHHGGGGGGGGGGGGGGGGGGGECGGG;EE:CEGGG?#################################################################################################################### +@MISEQ:1:000000000-A2Y1L:1:1101:17523:2436 2:N:0:NAAGGCGATAGATCGC +GAATATAGAAACGGGAGTATAGTGGTAACCNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNCTAGNNCGCCGCTAAGACTGCCGTCAAATTCCTGAGCGGTAGTGATGCTGTCTCTTATACACATCTGACGCTGCCGACGAGCGATCTAGTGTAGATCTCGGGGGGCGCCGGCACATTCAAAAAAAAAAAACAGGAAAATGACATAGGCGTGTGTGA ++ +??AAABBBDDDDDEDDGGCFGGHHIFFHHH################################################################66?=##466@@EEEGCGGC>CE*0*1C:?################################################################# +@MISEQ:1:000000000-A2Y1L:1:1101:15489:2437 2:N:0:NAAGGCGATAGATCGC +GTCCACGTGAAATCCCGAGCGTCCCTCTCCCTCCCGAGAACCCGGTCTATATCCCACGATTTTTGACCGAACAAGGATTGACCTCATCCAACTCCGAAGCCCGCCGCCTACTTTCCCAAGGCGCGATAAAGATTGATGGAACCCCGCCCAAGGCTGACGAAATCCCCGCCAACGAACTGTCGGGCCGACCCATTCAAGTGGGGAAAAAAACCTTTCCCCGGGTTGACAAACCGCCAAACAACCTCACCAA ++ +????9B???AD9?BDDEEDDDDHHHHHFHFHFFHHHHHHHHHEHH>CEGHHFGHHGHH=FHHHHC+CGEBCCCE6BB)).).:AAAEE?A?A################################################################################################################################## +@MISEQ:1:000000000-A2Y1L:1:1101:18851:2442 2:N:0:NAAGGCGATAGATCGC +TCTCCTCAGTACGGGGGAGGAGGTTCGCTCGGCGGACGTGCGGCCCTACGCCAGACTCTACCACATGGTTCAGCAGGAGCTGGAGCACCGCGCTGTGTCCGGGGACTGGGAATACACCATCAGCCCCGGCGAGGAGTGGCAAGAGACGGAGCCTCTGCAACCAGCAGAGGTTGCCAAGCTGGACTATCACAGCTACGTCACCAGACGCTTCATGAAGGTCACCCAGTCTGACCCTGTGAACCAGGCCGGG ++ +A??A?BBBDDDDDDDDGGGGGGHEFHHGEHHHIHEHHHHHDHBEHGFFFEGGDEEDGGGGGEGGEECE?EGCEGEGGGCEEEGGCEEGG4D;;4AGGGGGG8>GGA?EGGGEEGEEGGGCGGEGEGGGGD81,;;;;3B,3B;;*:B;1;''0::0*1:A*:;>DD'4'8.::AA>D22>DA>''0..08?*:A*1?:*0?########################################################################################################################## +@MISEQ:1:000000000-A2Y1L:1:1101:18106:2444 2:N:0:NAAGGCGATAGATCGC +GAGTTACTATTACAGAGTGGAGCTAATGTACTGGCAGTTGACTGCAAAAGAAATGCTCCAATTCATGTTGCATGTGCAAATATGAGCTTAGAGTGTTTAAAGATAATTTTGTGCCACAAAAAATGCAACCCAAACCAACAAAATGCAGTGGGAGACACTCCACTCCACACCTTATGCAGTTTGGGGACTTGTGATATGAGAATACTACCTGTCTCTTATACACATCTGACGCTGCCGACGAGCGAACAAG ++ +?????BBBDDDDDDDDGCFGFGIIIIIHFHIIIIHIHFG=EHHIGIIIIFHIHDGHGHHHIHIH=CGHGGHFHHHFHFGHHFH/ACDFGG?FE?CDFFHHHHHIIHFHHI>CEDGFHHHHHHHHDFHHFHHHFFAFFGGGGGGEDEG>>DACC;?EGG>CEEA>AEACCEE?:C::CC:::C:CEG######### +@MISEQ:1:000000000-A2Y1L:1:1101:15892:2446 2:N:0:NAAGGCGATAGATCGC +GTATTGCTGGCACCATCAGTCACCGTGTAGTGTATTCTATCGATTCCGTCACGCTGGCCCTTGAGCGTGCCCGAATCATACGTTATGGTGGTACCTGATACCGTGGCACTAGTGCCGTCGGTGTCTGCGCCATCCGTACGGTCGAACACCACCACGAGGGTATCGGCGTCCTGGTTTGCGTCGGGGATGCGGTCACCCTCTAAAAGATTTATGGAGGTCGTCTCTTGTGTCTGGGCCGGGGGGAAGATGT ++ +?????BBBDDDDDDDDGGGGGGIIHIIFHHHIHHIFHGHGGHHIHIIDEFFHEF@DFHIIHIIIGHACDCFFHHHIHIIHIHGHHHBGFGEEHHHHFHFFHHDFDCEGGFGGFGGG@DBEEEEGC>CEEGGGGGGE0?CDGGGD>EE2CE8CEGEE<>>>):CEEGDA'428CE:?9CC4828>'4<)0:>'04)1?E*8:1CC??):CCEECC**8)0548?C*:?:*010*:CC############## +@MISEQ:1:000000000-A2Y1L:1:1101:17903:2450 2:N:0:TAAGGCGATAGATCGC +GGCCAACACTGTGGTCACTATCTCTGGCACTGGTCTGCTACCACATGGAGTTGACACCGGAGACCCAATCACCACAGCAACAGTGTGGATTGACCACACCCCCTGCACCTGTCTCTTATACACATCTGACGCTGCCGACGAGCGATCTAGTGTAGATCTCCGTGGGCGCCGCATCATTAAAAAAAAAAAAAAAAAACATAACTCTCAAACGCGAATAGAGCAAGAAACAGACAGCTAAAGTACAAACAAT ++ +55DDD880:?#################################################################################################### +@MISEQ:1:000000000-A2Y1L:1:1101:15113:2451 2:N:0:TAAGGCGATAGATCGC +GAGGGGAGGAGGGGAAGGGAGAGGGGAAGAGAGGAGAGGAGAGGAGGAGGAGAGAAGGGGGAATGAGCAAGTAACATAATTACATAGCTGTGTCTGAATCTCTTTGAGCACCCTACCTCGGCATAGGAGGCTAGCTGGTGAGGGAGGGAGGGGCAGACTGACAGGCCGAGGATGGCCTTGTGGTTGGCAAACTCCGTGGGGCGAGTCTGGGCTGCTGCATGCGCATCTGAGACTGGCTCTTGTACACATC ++ +?????@9@B?B?BBBBEEEFB@@EEHEC?BF-CE@DDEH,5+5C,>@+CE+/<@@@@@@>@CCCE#6 +@MISEQ:1:000000000-A2Y1L:1:1101:18851:2442 1:N:0:NAAGGCGATAGATCGC +GGTATCCCCCGGCAGTGAGGATGGAGCCATGGTCTGCATCATACTCACCGTAGGTGAGAATATCCACGTCCTTCGACTCCTGGGTGCCGTCTATGGTGCCCTCTGTTACCAGGCAGTGACGGAGGACATGGTGAGGTTTCAGTACCTCTGGCCCGGCCTGG ++ +??,D?)8 +@MISEQ:1:000000000-A2Y1L:1:1101:15290:2442 1:N:0:NAAGGCGATAGATCGC +AAAATAATCCTAAAAAATAACCTCTATGCCGCCGAACGCTCCGCCTCTATCTTCGTAAAAACTATCTTCTCCTCCTCACCTCCATAATCAAGCATCAAGCGATCGCCCTCCGCCAACTCACCCCGCAAAATCTTATCCGCTAGCGGATTCTCAATCTCCGTCTGAATGACCCGCCTCAGCGGCCGCGCCCCATAAACCGAATCAAATCCACGCCTGTCTCTTA ++ +?????BBBDDDDDDDDGGGGGGIIIHHFFHHHHHHHHHHEHHEHHHHHIIHHHHHFEHIIIHHIHHIHIHIIIIIHHHHHHHHHHHHHHHHDHHHHHHHHGEDFGGGGGGG;CEGEGCEGGGGG8>GGGGEGGEECEGGGGD8EDGGAEGEEGGCE:CGG8CEEGG???CEEDG.8>EEFF<<<2>D?DEEE*:C?AA<>8AEFCEE:?C?EEE?CEFEE0?:E?ACEECD8>EE>)8>E:CEEEEEED.)?AE??A?:A?*??:C0?CCE?AAA:88.88?::C:C?*8 +@MISEQ:1:000000000-A2Y1L:1:1101:17903:2450 1:N:0:TAAGGCGATAGATCGC +GTGCAGGGGG ++ +=5===<>+5< diff -r 000000000000 -r 718d2bda28f1 trimmomatic-2bd7cdbb6228/test-data/trimmomatic_pe_r1_unpaired_out1.fastq diff -r 000000000000 -r 718d2bda28f1 trimmomatic-2bd7cdbb6228/test-data/trimmomatic_pe_r2_paired_out1.fastq --- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/trimmomatic-2bd7cdbb6228/test-data/trimmomatic_pe_r2_paired_out1.fastq Mon Jan 12 05:12:47 2015 -0500 @@ -0,0 +1,32 @@ +@MISEQ:1:000000000-A2Y1L:1:1101:19264:2433 2:N:0:NAAGGCGATAGATCGC +GCCGATATGCAGATGCCGTTGATTGCTGCCAGCATAGCAGCCATTGCCGGCATATCCTTTGTGACTGTGCGCCGACGCAAGAAGGCGGCCCAGATAACCTAGCCCAGAAACGATACGGCGTCTCTACCCTCATCTGTTATGCTGATCCAGTGATTCTTGCCTACCTTCTTTACGGTGATGTAATGCCAGTCGTTGGCAAG ++ +??A??ABBDDDDDDDDFFEEEFHIIIIHHIIHIFHIIHHHFHHIIIIIHHHFHHIIIIIIFFFHHIIHGDEHHAAEHHHHFHHHHD=EBBEEEEEFFFFEEEECEFFEEFFF8*??AA?;>DEEEFFFF*?8AECFEFFFFFFEAEEECFFFCEEEFFFFFFEEFEACEEC?CAACEFEEEFE?0?EACAAAEEAACEEA +@MISEQ:1:000000000-A2Y1L:1:1101:18667:2435 2:N:0:NAAGGCGATAGATCGC +GATGACATCAGCACAGGTTCCGCTTCCGCTGTCGCAACGACGTGGCTGCCATCGATTATATTCAATTCTGAGTGCTGCCGTAGGGCTAGATGTAAGACTCACGTTCGGCGACCAGATTAGTCGGTAGTGGCATG ++ +?????BBBDDDDDDDDGFFGGGHHHHIHHHHHHHIHHHHIHBCEEHFFHHHIIHFHHHIIIIIIIHIFBGFGFHHHHIHE@EEEHHFFHHFF?CFHHHHHGGGGGGGGGGGGGGGGGGGECGGG;EE:CEGGG? +@MISEQ:1:000000000-A2Y1L:1:1101:17523:2436 2:N:0:NAAGGCGATAGATCGC +GAATATAGAAACGGGAGTATAGTGGTAACC ++ +??AAABBBDDDDDEDDGGCFGGHHIFFHHH +@MISEQ:1:000000000-A2Y1L:1:1101:15489:2437 2:N:0:NAAGGCGATAGATCGC +GTCCACGTGAAATCCCGAGCGTCCCTCTCCCTCCCGAGAACCCGGTCTATATCCCACGATTTTTGACCGAACAAGGATTGACCTCATCCAACTCCGAAGCCCGCC ++ +????9B???AD9?BDDEEDDDDHHHHHFHFHFFHHHHHHHHHEHH>CEGHHFGHHGHH=FHHHHC+CGEBCCCE6BB +@MISEQ:1:000000000-A2Y1L:1:1101:18851:2442 2:N:0:NAAGGCGATAGATCGC +TCTCCTCAGTACGGGGGAGGAGGTTCGCTCGGCGGACGTGCGGCCCTACGCCAGACTCTACCACATGGTTCAGCAGGAGCTGGAGCACCGCGCTGTGTCCGGGGACTGGGAATACACCATCAGCCCCGGCGAGGAGTGGCAAGAGACGGA ++ +A??A?BBBDDDDDDDDGGGGGGHEFHHGEHHHIHEHHHHHDHBEHGFFFEGGDEEDGGGGGEGGEECE?EGCEGEGGGCEEEGGCEEGG4D;;4AGGGGGG8>GGA?EGGGEEGEEGGGCGGEGEGGGGD81,; +@MISEQ:1:000000000-A2Y1L:1:1101:15892:2446 2:N:0:NAAGGCGATAGATCGC +GTATTGCTGGCACCATCAGTCACCGTGTAGTGTATTCTATCGATTCCGTCACGCTGGCCCTTGAGCGTGCCCGAATCATACGTTATGGTGGTACCTGATACCGTGGCACTAGTGCCGTCGGTGTCTGCGCCATCCGTACGGTCGAACACCACCACGAGGGTATCGGCG ++ +?????BBBDDDDDDDDGGGGGGIIHIIFHHHIHHIFHGHGGHHIHIIDEFFHEF@DFHIIHIIIGHACDCFFHHHIHIIHIHGHHHBGFGEEHHHHFHFFHHDFDCEGGFGGFGGG@DBEEEEGC>CEEGGGGGGE0?CDGGGD>EE2CE8CEGEE<>>>):CEEGDA +@MISEQ:1:000000000-A2Y1L:1:1101:17903:2450 2:N:0:TAAGGCGATAGATCGC +GGCCAACACTGTGGTCACTATCTCTGGCACTGGTCTGCTACCACATGGAGTTGACACCGGAGACCCAATCACCACAGCAACAGTGTGGATTGACCACACCCCCTGCACCTGTCTCTTATACACATCTGACGCTGCCGACGAGCGATCTAG ++ +55DDD880:? diff -r 000000000000 -r 718d2bda28f1 trimmomatic-2bd7cdbb6228/test-data/trimmomatic_pe_r2_unpaired_out1.fastq --- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/trimmomatic-2bd7cdbb6228/test-data/trimmomatic_pe_r2_unpaired_out1.fastq Mon Jan 12 05:12:47 2015 -0500 @@ -0,0 +1,8 @@ +@MISEQ:1:000000000-A2Y1L:1:1101:18106:2444 2:N:0:NAAGGCGATAGATCGC +GAGTTACTATTACAGAGTGGAGCTAATGTACTGGCAGTTGACTGCAAAAGAAATGCTCCAATTCATGTTGCATGTGCAAATATGAGCTTAGAGTGTTTAAAGATAATTTTGTGCCACAAAAAATGCAACCCAAACCAACAAAATGCAGTGGGAGACACTCCACTCCACACCTTATGCAGTTTGGGGACTTGTGATATGAGAATACTACCTGTCTCTTATACACA ++ +?????BBBDDDDDDDDGCFGFGIIIIIHFHIIIIHIHFG=EHHIGIIIIFHIHDGHGHHHIHIH=CGHGGHFHHHFHFGHHFH/ACDFGG?FE?CDFFHHHHHIIHFHHI>CEDGFHHHHHHHHDFHHFHHHFFAFFGGGGGGEDEG>>DACC;?EGG>CEEA>AEACCEE?:C::CC:::C:CE/<@@@@@@>@CCCE#6 +@MISEQ:1:000000000-A2Y1L:1:1101:18851:2442 1:N:0:NAAGGCGATAGATCGC +GGTATCCCCCGGCAGTGAGGATGGAGCCATGGTCTGCATCATACTCACCGTAGGTGAGAATATCCACGTCCTTCGACTCCTGGGTGCCGTCTATGGTGCCCTCTGTTACCAGGCAGTGACGGAGGACATGGTGAGGTTTCAGTACCTCTGGCCCGGCCTGG ++ +??,D?)8 +@MISEQ:1:000000000-A2Y1L:1:1101:15290:2442 1:N:0:NAAGGCGATAGATCGC +AAAATAATCCTAAAAAATAACCTCTATGCCGCCGAACGCTCCGCCTCTATCTTCGTAAAAACTATCTTCTCCTCCTCACCTCCATAATCAAGCATCAAGCGATCGCCCTCCGCCAACTCACCCCGCAAAATCTTATCCGCTAGCGGATTCTCAATCTCCGTCTGAATGACCCGCCTCAGCGGCCGCGCCCCATAAACCGAATCAAATCCACGCCTGTCTCTTA ++ +?????BBBDDDDDDDDGGGGGGIIIHHFFHHHHHHHHHHEHHEHHHHHIIHHHHHFEHIIIHHIHHIHIHIIIIIHHHHHHHHHHHHHHHHDHHHHHHHHGEDFGGGGGGG;CEGEGCEGGGGG8>GGGGEGGEECEGGGGD8EDGGAEGEEGGCE:CGG8CEEGG???CEEDG.8>EEFF<<<2>D?DEEE*:C?AA<>8AEFCEE:?C?EEE?CEFEE0?:E?ACEECD8>EE>)8>E:CEEEEEED.)?AE??A?:A?*??:C0?CCE?AAA:88.88?::C:C?*8 +@MISEQ:1:000000000-A2Y1L:1:1101:17903:2450 1:N:0:TAAGGCGATAGATCGC +GTGCAGGGGG ++ +=5===<>+5< diff -r 000000000000 -r 718d2bda28f1 trimmomatic-2bd7cdbb6228/test-data/trimmomatic_se_out2.fastq --- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/trimmomatic-2bd7cdbb6228/test-data/trimmomatic_se_out2.fastq Mon Jan 12 05:12:47 2015 -0500 @@ -0,0 +1,40 @@ +@MISEQ:1:000000000-A2Y1L:1:1101:19264:2433 1:N:0:NAAGGCGATAGATCGC +AGATAGCCGA ++ +?????BBB@B +@MISEQ:1:000000000-A2Y1L:1:1101:18667:2435 1:N:0:NAAGGCGATAGATCGC +ATATATTCAT ++ +?????BBBDD +@MISEQ:1:000000000-A2Y1L:1:1101:17523:2436 1:N:0:NAAGGCGATAGATCGC +CATCACTACC ++ +?<,/<@@ +@MISEQ:1:000000000-A2Y1L:1:1101:18851:2442 1:N:0:NAAGGCGATAGATCGC +GGTATCCCCC ++ +??,+5< +@MISEQ:1:000000000-A2Y1L:1:1101:15113:2451 1:N:0:TAAGGCGATAGATCGC +TCTCAGATGA ++ +=,,<=>>>@@ diff -r 000000000000 -r 718d2bda28f1 trimmomatic-2bd7cdbb6228/tool_dependencies.xml --- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/trimmomatic-2bd7cdbb6228/tool_dependencies.xml Mon Jan 12 05:12:47 2015 -0500 @@ -0,0 +1,27 @@ + + + + + + http://www.usadellab.org/cms/uploads/supplementary/Trimmomatic/Trimmomatic-0.32.zip + + trimmomatic-0.32.jar + $INSTALL_DIR + + + adapters + $INSTALL_DIR/adapters + + + $INSTALL_DIR + + + $INSTALL_DIR/adapters + + + + Trimmomatic requires the Java Runtime Environment + + + + diff -r 000000000000 -r 718d2bda28f1 trimmomatic-2bd7cdbb6228/trimmomatic.sh --- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/trimmomatic-2bd7cdbb6228/trimmomatic.sh Mon Jan 12 05:12:47 2015 -0500 @@ -0,0 +1,18 @@ +#!/bin/sh +# +# Shell wrapper to run Trimmomatic jar file as a Galaxy tool +echo Arguments: +for i in $@ ; do + echo "*" $i +done +java $@ 2>&1 | tee trimmomatic.log +status=$? +echo "Exit status: $status" +# Check for successful completion +if [ -z "$(tail -1 trimmomatic.log | grep "Completed successfully")" ] ; then + echo "Trimmomatic did not finish successfully" >&2 + exit 1 +fi +exit $status +## +# diff -r 000000000000 -r 718d2bda28f1 trimmomatic-2bd7cdbb6228/trimmomatic.xml --- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/trimmomatic-2bd7cdbb6228/trimmomatic.xml Mon Jan 12 05:12:47 2015 -0500 @@ -0,0 +1,240 @@ + + flexible read trimming tool for Illumina NGS data + trimmomatic.sh + -mx8G + -jar \$TRIMMOMATIC_DIR/trimmomatic-0.32.jar + #if $paired_end.is_paired_end + PE -threads 6 -phred33 $fastq_r1_in $paired_end.fastq_r2_in $fastq_out_r1_paired $fastq_out_r1_unpaired $fastq_out_r2_paired $fastq_out_r2_unpaired + #else + SE -threads 6 -phred33 $fastq_in $fastq_out + #end if + ## ILLUMINACLIP option + #if $illuminaclip.do_illuminaclip + ILLUMINACLIP:\$TRIMMOMATIC_ADAPTERS_DIR/$illuminaclip.adapter_fasta:$illuminaclip.seed_mismatches:$illuminaclip.palindrome_clip_threshold:$illuminaclip.simple_clip_threshold + #end if + ## Other operations + #for $op in $operations + ## SLIDINGWINDOW + #if str( $op.operation.name ) == "SLIDINGWINDOW" + SLIDINGWINDOW:$op.operation.window_size:$op.operation.required_quality + #end if + ## MINLEN:36 + #if str( $op.operation.name ) == "MINLEN" + MINLEN:$op.operation.minlen + #end if + #if str( $op.operation.name ) == "LEADING" + LEADING:$op.operation.leading + #end if + #if str( $op.operation.name ) == "TRAILING" + TRAILING:$op.operation.trailing + #end if + #if str( $op.operation.name ) == "CROP" + CROP:$op.operation.crop + #end if + #if str( $op.operation.name ) == "HEADCROP" + HEADCROP:$op.operation.headcrop + #end if + #end for + + + trimmomatic + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + paired_end['is_paired_end'] + + + paired_end['is_paired_end'] + + + paired_end['is_paired_end'] + + + paired_end['is_paired_end'] + + + not paired_end['is_paired_end'] + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +.. class:: infomark + +**What it does** + +Trimmomatic performs a variety of useful trimming tasks for illumina paired-end and +single ended data. + +This tool allows the following trimming steps to be performed: + + * **ILLUMINACLIP:** Cut adapter and other illumina-specific sequences from the read + * **SLIDINGWINDOW:** Perform a sliding window trimming, cutting once the average + quality within the window falls below a threshold + * **MINLEN:** Drop the read if it is below a specified length + * **LEADING:** Cut bases off the start of a read, if below a threshold quality + * **TRAILING:** Cut bases off the end of a read, if below a threshold quality + * **CROP:** Cut the read to a specified length + * **HEADCROP:** Cut the specified number of bases from the start of the read + +If ILLUMINACLIP is requested then it is always performed first; subsequent options +can be mixed and matched and will be performed in the order that they have been +specified. + +.. class:: warningmark + +Note that trimming operation order is important. + +------------- + +.. class:: infomark + +**Outputs** + +For paired-end data a particular strength of Trimmomatic is that it retains the +pairing of reads (from R1 and R2) in the filtered output files: + + * Two FASTQ files (R1-paired and R2-paired) contain one read from each pair where + both have survived filtering. + * Additionally two FASTQ files (R1-unpaired and R2-unpaired) contain reads where + one of the pair failed the filtering steps. + +Retaining the same order and number of reads in the filtered output fastq files is +essential for many downstream analysis tools. + +For single-end data the output is a single FASTQ file containing just the filtered +reads. + +------------- + +.. class:: infomark + +**Credits** + +This Galaxy tool has been developed within the Bioinformatics Core Facility at the +University of Manchester. It runs the Trimmomatic program which has been developed +within Bjorn Usadel's group at RWTH Aachen university. + +Trimmomatic website (including documentation): + + * http://www.usadellab.org/cms/index.php?page=trimmomatic + +The reference for Trimmomatic is: + + * Bolger, A.M., Lohse, M., & Usadel, B. (2014). Trimmomatic: A flexible trimmer + for Illumina Sequence Data. Bioinformatics, btu170. + +Please kindly acknowledge both this Galaxy tool and the Trimmomatic program if you +use it. + + + + 10.1093/bioinformatics/btu170 + +