annotate mayachemtools/docs/scripts/txt/ElementalAnalysisTextFiles.txt @ 9:ab29fa5c8c1f draft default tip

Uploaded
author deepakjadmin
date Thu, 15 Dec 2016 14:18:03 -0500
parents 73ae111cf86f
children
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
0
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
1 NAME
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
2 ElementalAnalysisTextFiles.pl - Perform elemental analysis using formula
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
3 column in TextFile(s)
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
4
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
5 SYNOPSIS
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
6 ElementalAnalysisTextFiles.pl TextFile(s)...
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
7
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
8 ElementalAnalysisTextFiles.pl [-c, --colmode colnum | collabel] [-d,
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
9 --detail infolevel] [-f, --fast] [-f, --formulacol colnum | collabel]
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
10 [-h, --help] [--indelim comma | semicolon] [-m, --mode All |
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
11 "ElementalAnysis, [MolecularWeight, ExactMass]"] [-o, --overwrite]
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
12 [--outdelim comma | tab | semicolon] [-p, --precision number] [-q,
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
13 --quote yes | no] [-r, --root rootname] [-s, --startcol colnum |
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
14 collabel] [--startcolmode before | after] -v --valuecollabels [Name,
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
15 Label, [Name, Label,...]] [-w, --workingdir dirname] TextFile(s)...
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
16
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
17 DESCRIPTION
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
18 Perform elemental analysis using molecular formula column specified by a
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
19 column number or label in *TextFile(s)*.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
20
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
21 In addition to straightforward molecular formulas - H2O, HCl, C3H7O2N -
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
22 other supported variations are: Ca3(PO4)2, [PCl4]+, [Fe(CN)6]4-,
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
23 C37H42N2O6+2, Na2CO3.10H2O, 8H2S.46H2O, and so on. Charges are simply
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
24 ignored. Isotope symbols in formulas specification, including D and T,
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
25 are not supported.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
26
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
27 The valid file extensions are *.csv* and *.tsv* for comma/semicolon and
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
28 tab delimited text files respectively. All other file names are ignored.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
29 All the text files in a current directory can be specified by **.csv*,
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
30 **.tsv*, or the current directory name. The --indelim option determines
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
31 the format of *TextFile(s)*. Any file which doesn't correspond to the
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
32 format indicated by --indelim option is ignored.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
33
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
34 OPTIONS
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
35 -c, --colmode *colnum | collabel*
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
36 Specify how columns are identified in *TextFile(s)*: using column
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
37 number or column label. Possible values: *colnum or collabel*.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
38 Default value: *colnum*.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
39
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
40 -d, --detail *infolevel*
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
41 Level of information to print about lines being ignored. Default:
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
42 *1*. Possible values: *1, 2 or 3*.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
43
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
44 -h, --help
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
45 Print this help message.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
46
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
47 --fast
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
48 In this mode, the formula column specified using -f, --formulacol
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
49 option is assumed to contain valid molecular formula data and
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
50 initial formula validation check is skipped.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
51
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
52 -f, --formulacol *col number | col name*
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
53 This value is mode specific. It specifies molecular formula column
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
54 to use for performing elemental analysis on *TextFile(s)*. Possible
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
55 values: *col number or col label*. Default value: *first column
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
56 containing the word formula in its column label*.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
57
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
58 -m, --mode *All | "ElementalAnalysis,[MolecularWeight,ExactMass]"*
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
59 Specify what values to calculate using molecular formula in
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
60 *TextFile(s)*: calculate all supported values or specify a comma
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
61 delimited list of values. Possible values: *All |
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
62 "ElementalAnalysis, [MolecularWeight, ExactMass]"*. Default: *All*
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
63
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
64 --indelim *comma | semicolon*
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
65 Input delimiter for CSV *TextFile(s)*. Possible values: *comma or
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
66 semicolon*. Default value: *comma*. For TSV files, this option is
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
67 ignored and *tab* is used as a delimiter.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
68
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
69 -o, --overwrite
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
70 Overwrite existing files.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
71
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
72 --outdelim *comma | tab | semicolon*
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
73 Output text file delimiter. Possible values: *comma, tab, or
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
74 semicolon* Default value: *comma*.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
75
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
76 -p, --precision *number*
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
77 Precision of calculated values in the output file. Default: up to
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
78 *2* decimal places. Valid values: positive integers.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
79
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
80 -q, --quote *yes | no*
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
81 Put quotes around column values in output text file. Possible
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
82 values: *yes or no*. Default value: *yes*.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
83
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
84 -r, --root *rootname*
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
85 New text file name is generated using the root: <Root>.<Ext>.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
86 Default new file name: <InitialTextFileName>ElementalAnalysis.<Ext>.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
87 The csv, and tsv <Ext> values are used for comma/semicolon, and tab
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
88 delimited text files respectively. This option is ignored for
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
89 multiple input files.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
90
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
91 -s, --startcol *colnum | collabel*
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
92 This value is mode specific. It specifies the column in text files
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
93 which is used for start adding calculated column values. For
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
94 *colnum* mode, specify column number and for *collabel* mode,
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
95 specify column label.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
96
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
97 Default value: *last*. Start merge after the last column.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
98
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
99 --startcolmode *before | after*
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
100 Start adding calculated column values after the -s, --startcol
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
101 value. Possible values: *before or after*. Default value: *after*.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
102
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
103 -v --valuecollabels *Name,Label,[Name,Label,...]*
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
104 Specify column labels to use for calculated values. In general, it's
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
105 a comma delimited list of value name and column label pairs.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
106 Supported value names: *ElementalAnalysis, MolecularWeight, and
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
107 ExactMass*. Default labels: *ElementalAnalysis, MolecularWeight, and
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
108 ExactMass*.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
109
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
110 -w, --workingdir *dirname*
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
111 Location of working directory. Default: current directory.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
112
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
113 EXAMPLES
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
114 To perform elemental analysis, calculate molecular weight and exact mass
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
115 using formulas in a column with the word Formula in its column label and
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
116 generate a new CSV text file NewSample1.csv, type:
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
117
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
118 % ElementalAnalysisTextFiles.pl -o -r NewSample1 Sample1.csv
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
119
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
120 To perform elemental analysis using formulas in column number two, use
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
121 column label Analysis for calculated data, and generate a new CSV text
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
122 file NewSample1.csv, type:
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
123
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
124 % ElementalAnalysisTextFiles.pl --m ElementalAnalysis --formulacol 2
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
125 --valuecollabels "ElementalAnalysis,Analysis" -o -r NewSample1
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
126 Sample1.csv
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
127
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
128 To calculate molecular weight using formula in column label Formula with
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
129 four decimal precision and generate a new CSV text file NewSample1.csv,
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
130 type
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
131
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
132 % ElementalAnalysisTextFiles.pl --m MolecularWeight --colmode collabel
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
133 --formulacol Formula --precision 4 -o -r NewSample1 Sample1.csv
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
134
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
135 To calculate exact mass using formula in column label Formula with four
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
136 decimal precision, adding column for exact mass right after Formula
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
137 column, and generate a new CSV text file NewSample1.csv, type
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
138
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
139 % ElementalAnalysisTextFiles.pl --m ExactMass --colmode collabel
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
140 --formulacol Formula --precision 4 --startcolmode after
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
141 --startcol Formula -o -r NewSample1 Sample1.csv
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
142
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
143 AUTHOR
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
144 Manish Sud <msud@san.rr.com>
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
145
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
146 SEE ALSO
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
147 AnalyzeTextFilesData.pl, InfoTextFiles.pl, ExtractFromTextFiles.pl
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
148
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
149 COPYRIGHT
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
150 Copyright (C) 2015 Manish Sud. All rights reserved.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
151
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
152 This file is part of MayaChemTools.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
153
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
154 MayaChemTools is free software; you can redistribute it and/or modify it
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
155 under the terms of the GNU Lesser General Public License as published by
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
156 the Free Software Foundation; either version 3 of the License, or (at
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
157 your option) any later version.
73ae111cf86f Uploaded
deepakjadmin
parents:
diff changeset
158