annotate docs/scripts/txt/InfoTextFiles.txt @ 0:4816e4a8ae95 draft default tip

Uploaded
author deepakjadmin
date Wed, 20 Jan 2016 09:23:18 -0500
NAME
    InfoTextFiles.pl - List information about TextFile(s)

SYNOPSIS
    InfoTextFiles.pl TextFile(s)...

    InfoTextFiles.pl [-a, --all] [-c, --count] [--datacheck] [-d, --detail
    infolevel] [-e, --empty] [-h, --help] [--indelim comma | semicolon] [-m,
    --mode colnum | collabel] [-n, --numericaldatacols colnum,[colnum,...] |
    collabel,[collabel,...]] [-w, --workingdir dirname] TextFile(s)...

DESCRIPTION
    List information about *TextFile(s)* contents: number of lines and
    columns, empty column values, and so on. The file names are separated by
    spaces. The valid file extensions are *.csv* and *.tsv* for
    comma/semicolon and tab delimited text files respectively. All other
    file names are ignored. All the text files in the current directory can
    be specified by **.csv*, **.tsv*, or the current directory name. The
    --indelim option determines the format of *TextFile(s)*. Any file which
    doesn't correspond to the format indicated by the --indelim option is
    ignored.

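    The extension and delimiter rules described above can be sketched in
    Python as follows. This is only an illustration, not the script's own
    code: the helper name delimiter_for and the exact, case-sensitive
    extension match are assumptions.

```python
import os

# Hypothetical helper (not part of MayaChemTools): return the delimiter
# implied by a file name, or None when the file would be ignored.
def delimiter_for(filename, indelim="comma"):
    ext = os.path.splitext(filename)[1]
    if ext == ".tsv":
        return "\t"  # --indelim is ignored for TSV files
    if ext == ".csv":
        return "," if indelim == "comma" else ";"
    return None  # any other extension is skipped

names = ["Sample1.csv", "Sample1.tsv", "Notes.txt"]
accepted = [n for n in names if delimiter_for(n) is not None]
```

    Here accepted keeps only the .csv and .tsv names; Notes.txt is a
    made-up file name used to show a skipped entry.
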
OPTIONS
    -a, --all
        List all the available information.

    -c, --count
        List number of rows and columns. This is the default behavior.

    --datacheck
        List number of numerical and non-numerical values for each column.

    -d, --detail *infolevel*
        Level of information to print about lines being ignored. Default:
        *1*. Possible values: *1, 2 or 3*.

    -e, --empty
        List the number of empty row and column values.

    -h, --help
        Print this help message.

    --indelim *comma | semicolon*
        Input delimiter for CSV *TextFile(s)*. Possible values: *comma or
        semicolon*. Default value: *comma*. For TSV files, this option is
        ignored and *tab* is used as a delimiter.

    -m, --mode *colnum | collabel*
        Specify how to identify numerical data columns: using column number
        or column label. Possible values: *colnum or collabel*. Default
        value: *colnum*.

    -n, --numericaldatacols *colnum,[colnum,...] | collabel,[collabel,...]*
        This value is mode specific. It is a list of column numbers or
        labels to check for the presence of numerical data only; any other
        value is flagged. Default value: *all;all;...*.

        For *colnum* mode, input value format is:
        *colnum,...;colnum,...;...*. Example:

            1,3,5
            "2,4,6"

        For *collabel* mode, input value format is:
        *collabel,...;collabel,...;...*. Example:

            "MW,SumNO,SumNHOH"

    -w, --workingdir *dirname*
        Location of working directory. Default: current directory.

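    As a rough illustration of the --numericaldatacols value format, the
    Python sketch below splits a value into one column list per
    semicolon-separated group. It is not the script's own parser: the
    function name parse_numerical_cols is hypothetical, and reading each
    semicolon-separated group as belonging to one input file is an
    assumption drawn from the *all;all;...* default.

```python
# Hypothetical parser (not the script's own code): split a
# --numericaldatacols value into one entry per semicolon-separated group.
def parse_numerical_cols(spec, mode="colnum"):
    per_group = []
    for group in spec.split(";"):
        group = group.strip()
        if group == "all":
            per_group.append("all")  # check every column
        elif mode == "colnum":
            per_group.append([int(c) for c in group.split(",")])
        else:
            per_group.append([c.strip() for c in group.split(",")])
    return per_group

by_number = parse_numerical_cols("1,3,5;2,4,6")
by_label = parse_numerical_cols("MW,SumNO,SumNHOH", mode="collabel")
```

    With these inputs, by_number holds two lists of integers and by_label
    holds a single list of three labels.
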
EXAMPLES
    To count the number of lines and columns in text file(s), type:

        % InfoTextFiles.pl Sample1.csv
        % InfoTextFiles.pl Sample1.csv Sample1.tsv
        % InfoTextFiles.pl *.csv *.tsv

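    For readers who want to see what such a count involves, here is a
    minimal Python sketch. It does not reproduce the script's output or
    logic; the sample text is made up, and treating the first line as
    column labels is an assumption.

```python
import csv
import io

# Illustrative only: count data rows and columns in delimited text.
# Treating the first line as column labels is an assumption.
def count_rows_and_columns(text, delimiter=","):
    rows = list(csv.reader(io.StringIO(text), delimiter=delimiter))
    if not rows:
        return 0, 0
    header, data = rows[0], rows[1:]
    return len(data), len(header)

# Made-up sample content, not taken from Sample1.csv.
sample = "Mol_ID,MolWeight\nMol1,180.16\nMol2,46.07\n"
row_count, col_count = count_rows_and_columns(sample)
```
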
    To count the number of lines, columns, and empty values in the
    Sample1.csv file and print detailed information, type:

        % InfoTextFiles.pl -d 3 -e Sample1.csv

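    The idea behind an empty-value count can be sketched in Python as
    below. This is an illustration under stated assumptions, not the
    script's implementation: the first line is taken as column labels,
    whitespace-only cells are counted as empty, and the sample text is
    invented.

```python
import csv
import io

# Illustrative only: count empty values per column. Treating the first line
# as labels and whitespace-only cells as empty are both assumptions.
def count_empty_values(text, delimiter=","):
    rows = list(csv.reader(io.StringIO(text), delimiter=delimiter))
    if not rows:
        return {}
    labels, data = rows[0], rows[1:]
    empty = {label: 0 for label in labels}
    for row in data:
        for label, value in zip(labels, row):
            if value.strip() == "":
                empty[label] += 1
    return empty

# Made-up sample content with one empty value in each column.
sample = "Mol_ID,MolWeight\nMol1,\nMol2,46.07\n,18.02\n"
empty_counts = count_empty_values(sample)
```
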
    To track all available information and non-numerical values for the
    Mol_ID and MolWeight columns in the Sample1.csv file and print detailed
    information, type:

        % InfoTextFiles.pl -d 3 -a -m collabel -n Mol_ID,MolWeight Sample1.csv

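    A label-based check for non-numerical values might look like the
    Python sketch below. It is a stand-in for the behavior described, not
    the script's code: Python's float() is used as the numeric test (it
    also accepts values such as "nan" and "inf", which the script may
    treat differently), and the sample text is made up.

```python
import csv
import io

# Stand-in numeric test; the script's own rule may differ.
def is_numeric(value):
    try:
        float(value)
        return True
    except (TypeError, ValueError):
        return False

# Illustrative only: collect non-numerical values for the given labels.
def non_numerical_values(text, labels, delimiter=","):
    reader = csv.DictReader(io.StringIO(text), delimiter=delimiter)
    flagged = {label: [] for label in labels}
    for row in reader:
        for label in labels:
            value = row[label]
            if not is_numeric(value):
                flagged[label].append(value)
    return flagged

# Made-up sample content, not taken from Sample1.csv.
sample = "Mol_ID,MolWeight\nMol1,180.16\nMol2,n/a\n"
flagged = non_numerical_values(sample, ["Mol_ID", "MolWeight"])
```

    In this sample every Mol_ID value is flagged because identifiers such
    as Mol1 are not numbers, while only the n/a entry is flagged for
    MolWeight.
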
AUTHOR
    Manish Sud <msud@san.rr.com>

SEE ALSO
    JoinTextFiles.pl, MergeTextFilesWithSD.pl, ModifyTextFilesFormat.pl,
    SplitTextFiles.pl, TextFilesToHTML.pl

COPYRIGHT
    Copyright (C) 2015 Manish Sud. All rights reserved.

    This file is part of MayaChemTools.

    MayaChemTools is free software; you can redistribute it and/or modify it
    under the terms of the GNU Lesser General Public License as published by
    the Free Software Foundation; either version 3 of the License, or (at
    your option) any later version.