annotate src/breadcrumbs/README.md @ 25:1a2f01b31cf3 draft

Updated micropita_prepare.py to use the correct directory
author george-weingart
date Thu, 11 Aug 2016 12:45:27 -0400
parents d589875b8125
children
d589875b8125 First version of micropita in this repository
george-weingart
# BreadCrumbs #

BreadCrumbs is an unofficial collection of scripts and code intended to consolidate functions for tool development and to provide scripts for command-line access to commonly used functions. BreadCrumbs tends to include functionality associated with metagenomics analysis, but you never know what you will find!

## Dependencies: ##

1. Cogent https://pypi.python.org/pypi/cogent
2. Matplotlib http://matplotlib.org/downloads.html
3. Mercurial http://mercurial.selenic.com/ (optional for downloading)
4. NumPy http://www.numpy.org/
5. Python 2.x http://www.python.org/download/
6. SciPy http://www.scipy.org/install.html
7. biom support http://biom-format.org/

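Before running the scripts, it can help to check which of the Python dependencies listed above are importable. The following is a minimal sketch, not part of BreadCrumbs: the helper `missing_modules` is hypothetical, and the import names in `ASSUMED_IMPORT_NAMES` are assumptions about how each package is imported.

```python
def missing_modules(module_names):
    """Return the names from module_names that cannot be imported."""
    missing = []
    for name in module_names:
        try:
            __import__(name)
        except ImportError:
            # The module is not installed (or not on the import path).
            missing.append(name)
    return missing


# Assumed import names for the Python packages in the dependency list.
ASSUMED_IMPORT_NAMES = ["cogent", "matplotlib", "numpy", "scipy", "biom"]

if __name__ == "__main__":
    for name in missing_modules(ASSUMED_IMPORT_NAMES):
        print("Missing dependency: " + name)
```

Mercurial is a command-line tool rather than an imported module, so it is left out of this check.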
## How to download ##

To download BreadCrumbs from Bitbucket, use the command:

> hg clone https://bitbucket.org/timothyltickle/breadcrumbs

To update BreadCrumbs, run the following two commands in order from the BreadCrumbs directory:

> hg pull
> hg update

## Scripts: ##

Scripts are included to expose core functionality through the command line. Currently these scripts center on manipulating and visualizing abundance tables.
Brief descriptions of the scripts follow:

* *Hclust.py* Flexible script to create a visualization of hierarchical clustering of abundance tables (or other matrices).

* *scriptBiplotTSV.R* Allows one to plot a tsv file as a biplot using nonmetric multidimensional scaling.

* *scriptPlotFeature.py* Allows one to plot a histogram, boxplot, or scatter plot of a bug or metadata in an abundance table. Will work on any row in a matrix.

* *scriptManipulateTable.py* Allows one to perform common functions on an abundance table, including summing, normalizing, filtering, and stratifying tables.

* *scriptPcoa.py* Allows one to plot a principal coordinates analysis (PCoA) plot of an abundance table.

* *scriptConvertBetweenBIOMAndPCL.py* Allows one to convert between BIOM and PCL file formats.

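To make "normalizing" an abundance table concrete, here is a minimal sketch in plain Python. It is not the code used by scriptManipulateTable.py. The function `normalize_columns`, the dictionary layout (features as rows, samples as columns), and the feature names are all assumptions made for illustration.

```python
def normalize_columns(table):
    """Scale each sample (column) so that its values sum to 1.

    table maps a feature name to a list of counts, one per sample.
    Samples whose counts sum to zero are left as zeros.
    """
    rows = list(table.values())
    n_samples = len(rows[0]) if rows else 0
    # Total count per sample, summed over all features.
    totals = [sum(row[i] for row in rows) for i in range(n_samples)]
    return {
        feature: [
            (count / float(total)) if total else 0.0
            for count, total in zip(counts, totals)
        ]
        for feature, counts in table.items()
    }


# Hypothetical two-feature, two-sample table of raw counts.
counts = {
    "Bacteroides": [30, 0],
    "Prevotella": [10, 5],
}
relative = normalize_columns(counts)
```

After normalization each sample column holds relative abundances, so samples with different sequencing depths can be compared on the same scale.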
## Programming Classes: ##

Brief descriptions of classes are as follows. More detailed descriptions are given in the classes themselves.

* *AbundanceTable* Data structure to contain and perform operations on an abundance table.

* *BoxPlot* Wrapper to plot box plots.

* *CClade* Helper object used in hierarchical summing and normalization.

* *Cladogram* Object that manipulates an early dendrogram visualization. Deprecated; use the GraPhlAn visualization tool on Bitbucket instead.

* *CommandLine* Collection of code to work with the command line. Deprecated; use sfle calls instead.

* *ConstantsBreadCrumbs* Contains generic constants.

* *ConstantsFiguresBreadCrumbs* Contains constants associated with formatting figures.

* *KMedoids* Code from MLPY which performs KMedoids sample selection.

* *MLPYDistanceAdaptor* Used to allow custom distance matrices to be used by KMedoids.

* *Metric* Functions associated with distance and diversity metrics.

* *PCoA* Functionality surrounding the plotting of a PCoA.

* *PlotMatrix* Allows one to plot a matrix of numbers.

* *SVM* Support Vector Machine associated scripts.

* *Utility* Generic functions.

* *UtilityMath* Generic math-related functions.

* *ValidateData* Collection of functions to validate data types when needed.

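As an illustration of the kind of distance function the *Metric* entry refers to, the sketch below computes Bray-Curtis dissimilarity between two samples. This README does not say which metrics the class implements, so the choice of Bray-Curtis and the function name `bray_curtis` are assumptions, and the code is not taken from BreadCrumbs.

```python
def bray_curtis(sample_a, sample_b):
    """Bray-Curtis dissimilarity between two samples of non-negative abundances.

    Returns 0.0 for identical samples and 1.0 for samples that share no features.
    """
    if len(sample_a) != len(sample_b):
        raise ValueError("samples must have the same number of features")
    total = sum(sample_a) + sum(sample_b)
    if total == 0:
        # Two empty samples are treated as identical.
        return 0.0
    differences = sum(abs(a - b) for a, b in zip(sample_a, sample_b))
    return differences / float(total)
```

A pairwise matrix of such values is the usual input to ordination (for example PCoA) or to medoid-based sample selection.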
## Demo input files: ##

* *fastunifrac_Ley_et_al_NRM_2_sample_id_map.txt* Example UniFrac ID mapping file (source http://bmf2.colorado.edu/fastunifrac/tutorial.psp)

* *GreenGenesCore-May09.ref.tre* Example Greengenes core set reference for UniFrac demo (source http://bmf2.colorado.edu/fastunifrac/tutorial.psp)

* *Test.pcl* Example file / Test PCL file to run scripts on.

* *Test.biom* Example file / Test BIOM file to run scripts on.

* *Test_no_metadata.pcl* Example file / Test PCL file, without metadata, to run scripts on.

* *Test_no_metadata.biom* Example file / Test BIOM file, without metadata, to run scripts on.

* *Test-biplot.tsv* Example file / Test file for scriptBiplotTSV.R.

## Contributing Authors: ##
Timothy Tickle, George Weingart, Nicola Segata, Curtis Huttenhower

## Contact: ##
Please feel free to contact ttickle@hsph.harvard.edu with questions.