Mercurial > repos > test-svm > kmersvm_test
comparison kmersvm/index.html @ 11:3b0c30b3baf1 draft default tip
Uploaded
| author | test-svm |
|---|---|
| date | Wed, 08 Aug 2012 19:20:14 -0400 |
| parents | |
| children |
comparison
equal
deleted
inserted
replaced
| 10:83346c563dcb | 11:3b0c30b3baf1 |
|---|---|
| 1 <!doctype html> | |
| 2 <!-- paulirish.com/2008/conditional-stylesheets-vs-css-hacks-answer-neither/ --> | |
| 3 <!--[if lt IE 7]> <html class="no-js lt-ie9 lt-ie8 lt-ie7" lang="en"> <![endif]--> | |
| 4 <!--[if IE 7]> <html class="no-js lt-ie9 lt-ie8" lang="en"> <![endif]--> | |
| 5 <!--[if IE 8]> <html class="no-js lt-ie9" lang="en"> <![endif]--> | |
| 6 <!-- Consider adding a manifest.appcache: h5bp.com/d/Offline --> | |
| 7 <!--[if gt IE 8]><!--> <html class="no-js" lang="en"> <!--<![endif]--> | |
| 8 <head> | |
| 9 <meta charset="utf-8"> | |
| 10 | |
| 11 <!-- Use the .htaccess and remove these lines to avoid edge case issues. | |
| 12 More info: h5bp.com/i/378 --> | |
| 13 <meta http-equiv="X-UA-Compatible" content="IE=edge,chrome=1"> | |
| 14 | |
| 15 <title></title> | |
| 16 <meta name="description" content=""> | |
| 17 | |
| 18 <!-- Mobile viewport optimized: h5bp.com/viewport --> | |
| 19 <meta name="viewport" content="width=device-width"> | |
| 20 | |
| 21 <!-- Place favicon.ico and apple-touch-icon.png in the root directory: mathiasbynens.be/notes/touch-icons --> | |
| 22 | |
| 23 <link rel="stylesheet" href="css/style.css"> | |
| 24 | |
| 25 <!-- More ideas for your <head> here: h5bp.com/d/head-Tips --> | |
| 26 | |
| 27 <!-- All JavaScript at the bottom, except this Modernizr build. | |
| 28 Modernizr enables HTML5 elements & feature detects for optimal performance. | |
| 29 Create your own custom Modernizr build: www.modernizr.com/download/ --> | |
| 30 <script src="js/libs/modernizr-2.5.3.min.js"></script> | |
| 31 </head> | |
| 32 <body> | |
| 33 <!-- Prompt IE 6 users to install Chrome Frame. Remove this if you support IE 6. | |
| 34 chromium.org/developers/how-tos/chrome-frame-getting-started --> | |
| 35 <!--[if lt IE 7]><p class=chromeframe>Your browser is <em>ancient!</em> <a href="http://browsehappy.com/">Upgrade to a different browser</a> or <a href="http://www.google.com/chromeframe/?redirect=true">install Google Chrome Frame</a> to experience this site.</p><![endif]--> | |
| 36 <header> | |
| 37 | |
| 38 </header> | |
| 39 <div role="main"> | |
| 40 <head> | |
| 41 <h2> Installation for KmerSVM Tool from Galaxy ToolShed</h2> | |
| 42 </head> | |
| 43 <body> | |
| 44 <p> | |
| 45 Christopher Fletez-Brant, Dongwon Lee, Andrew McCallion and Michael A. Beer | |
| 46 </p> | |
| 47 <p> | |
| 48 KmerSVM can be downloaded from the Galaxy Tool Shed, located at <a>Dummy Link</a>. To install KmerSVM, download files from LINK BELOW. Untar/zip the archive and move the resulting file 'kmersvm' to '~/galaxy-dist/tools/'. | |
| 49 </p> | |
| 50 <section> | |
| 51 <h2>Dependencies</h2> | |
| 52 <h3>Mac Users:</h3> | |
| 53 <ol> | |
| 54 <li>Xcode (Mac App Store)</li> | |
| 55 <li>Fortran compiler (http://gcc.gnu.org/wiki/GFortran/)</li> | |
| 56 </ol> | |
| 57 <h3>Everyone:</h3> | |
| 58 <ol> | |
| 59 <li>Swig (http://www.swig.org)</li> | |
| 60 <li>Numpy (numpy.scipy.org)</li> | |
| 61 <li>Shogun Toolbox, v0.9.3 - v1.10 (http://www.shogun-toolbox.org/)</li> | |
| 62 <li>Bitarray (http://pypi.python.org/pypi/bitarray/)</li> | |
| 63 <li>R (http://www.r-project.org)</li> | |
| 64 <li>ROCR R Package (Available through CRAN)</li> | |
| 65 </ol> | |
| 66 <p>Note that for binaries are provided for Mac users. However, if difficulties in installation are encountered, it may be beneficial to compile the Fortran compiler from source. Additionally, be sure to add the location of your Shogun installation to the PYTHONPATH.</p> | |
| 67 <p>Further, KmerSVM has been tested on Python 2.6, 2.7 on Linux and Mac OS X. At this time KmerSVM has not been tested on Windows.</p> | |
| 68 </section> | |
| 69 <section> | |
| 70 <h2>Required Files</h2> | |
| 71 <p>For efficient access to genome-wide data "Generate Null Sequence" and | |
| 72 "Sequence Profiles" rely on access to binary files (indices) generated | |
| 73 by using the script 'nullseq_build_indices.py'. Download the *.tar or | |
| 74 *.zip files for each genome to be analyzed. To create indices for a | |
| 75 specific genome, call nullseq_build_indices.py as follows:</p> | |
| 76 <p><pre> | |
| 77 python nullseq_build_indices.py <GENOME ARCHIVE FILE> <GENOME BUILD> | |
| 78 </pre></p> | |
| 79 <p>Next copy nullseq_indices.loc.sample to tool-data/nullseq_indices.loc | |
| 80 and add the path to the created indices following the instructions | |
| 81 included in that file.</p> | |
| 82 <p>To generate FASTA files for training or scoring purposes, KmerSVM uses | |
| 83 the built-in tool "Fetch Sequences", which looks for genomes in *.nib or | |
| 84 *.2bit format. Download genomes related to your data and update the | |
| 85 'alignseq.loc' file to include the location of these genomes according | |
| 86 to directions in that file. FASTA files can also be provided by the user.</p> | |
| 87 <p>Move the file sample_roc_chen.png to the 'tool-data' file.</p> | |
| 88 </section> | |
| 89 <section> | |
| 90 <h2>tool_conf.xml</h2> | |
| 91 <p>Add the following lines to tool_conf.xml to make tools visible in Galaxy:</p> | |
| 92 <p><pre> | |
| 93 <section name="SVM Tools" id="kmersvm"> | |
| 94 <tool file="kmersvm/classify.xml"/> | |
| 95 <tool file="kmersvm/nullseq.xml"/> | |
| 96 <tool file="kmersvm/rocprcurve.xml"/> | |
| 97 <tool file="kmersvm/train.xml"/> | |
| 98 <tool file="kmersvm/split_genome.xml"/> | |
| 99 <tool file="kmersvm/seqprofile.xml" /> | |
| 100 </section> | |
| 101 </pre></p> | |
| 102 </section> | |
| 103 <section> | |
| 104 <h2>Tool Tests</h2> | |
| 105 <p>To run tests on the tools "Train SVM", "Score Sequences of Interest" and | |
| 106 "Split Genome", move the following files to the 'test-data' directory:</p> | |
| 107 <ul> | |
| 108 <li>classify_output.out</li> | |
| 109 <li>classify_test.fa</li> | |
| 110 <li>kmersvm_output_weights.out</li> | |
| 111 <li>test_positive.fa</li> | |
| 112 <li>test_negative.fa</li> | |
| 113 <li>test_weights.out</li> | |
| 114 <li>train_predictions.out</li> | |
| 115 </ul> | |
| 116 <p>IDs for KmerSVM tests can be found by calling run_functional_tests.sh with the '-list' flag.</p> | |
| 117 </section> | |
| 118 <section> | |
| 119 <h2>Non-Galaxy-Based Usage</h2> | |
| 120 <p>The KmerSVM suite can be ran without using the Galaxy framework. Each tool exists as | |
| 121 a standalone Python script (all located in /scripts) which can be called from the command | |
| 122 line. Specific documentation can be found within each tool's Python file, or by calling the script with no arguments. A general | |
| 123 workflow can be found in INSERT_PAPER_TITLE_HERE, which can be followed by calling | |
| 124 each of the relevant Python scripts as outlined below, with the exception that users will have to provide | |
| 125 needed FASTA files themselves.</p> | |
| 126 <p>A simple worflow for the KmerSVM suite is as follows: </p> | |
| 127 <ol> | |
| 128 <li>python nullseq_build_indices.py <TARBALL OR ZIP OF GENOME> <GENOME BUILD></li> | |
| 129 <li>python nullseq_generate <INPUT FILE> <GENOME BUILD> <PATH TO INDEX FILES>#This assumes no negative data sets. Output will need to be converted to FASTA. Skip if negative data is provided.</li> | |
| 130 <li>python kmersvm_train.py <POSITIVE FASTA> <NEGATIVE FASTA> #Outputs will be WEIGHTS, PREDICTIONS</li> | |
| 131 <li>python split_genome.py <INPUT BED FILE> #Skip if already have a list of regions you want to test. Output is TEST_SEQS, which will need to be converted to FASTA.</li> | |
| 132 <li>python kmersvm_classify.py <WEIGHTS> <TEST_SEQS></li> | |
| 133 </ol> | |
| 134 <p>Additionally, for any BED file, sequence composition (in terms of length, GC content and repeat fraction) can be obtained by calling 'make profile' as follows:</p> | |
| 135 <p>python make_profile.py <BEDFILE> <BUILDNAME> <BASE_DIR> <OUT_FILE></p> | |
| 136 <p>Note that each tool has its own parameters, the manipulation of which allow the user to further customize their analysis. To learn more about a particular tool, simply call it without passing it any arguments.</p> | |
| 137 </section> | |
| 138 </body> | |
| 139 </div> | |
| 140 <footer> | |
| 141 | |
| 142 </footer> | |
| 143 | |
| 144 | |
| 145 <!-- JavaScript at the bottom for fast page loading --> | |
| 146 | |
| 147 <!-- Grab Google CDN's jQuery, with a protocol relative URL; fall back to local if offline --> | |
| 148 <script src="//ajax.googleapis.com/ajax/libs/jquery/1.7.1/jquery.min.js"></script> | |
| 149 <script>window.jQuery || document.write('<script src="js/libs/jquery-1.7.1.min.js"><\/script>')</script> | |
| 150 | |
| 151 <!-- scripts concatenated and minified via build script --> | |
| 152 <script src="js/plugins.js"></script> | |
| 153 <script src="js/script.js"></script> | |
| 154 <!-- end scripts --> | |
| 155 | |
| 156 <!-- Asynchronous Google Analytics snippet. Change UA-XXXXX-X to be your site's ID. | |
| 157 mathiasbynens.be/notes/async-analytics-snippet --> | |
| 158 <script> | |
| 159 var _gaq=[['_setAccount','UA-XXXXX-X'],['_trackPageview']]; | |
| 160 (function(d,t){var g=d.createElement(t),s=d.getElementsByTagName(t)[0]; | |
| 161 g.src=('https:'==location.protocol?'//ssl':'//www')+'.google-analytics.com/ga.js'; | |
| 162 s.parentNode.insertBefore(g,s)}(document,'script')); | |
| 163 </script> | |
| 164 </body> | |
| 165 </html> |
