NAME
    InfoTextFiles.pl - List information about TextFile(s)

SYNOPSIS
    InfoTextFiles.pl TextFile(s)...

    InfoTextFiles.pl [-a, --all] [-c, --count] [--datacheck] [-d, --detail
    infolevel] [-e, --empty] [-h, --help] [--indelim comma | semicolon] [-m,
    --mode colnum | collabel] [-n, --numericaldatacols colnum,[colnum,...] |
    collabel,[collabel,...]] [-w, --workingdir dirname] TextFile(s)...

DESCRIPTION
    List information about the contents of *TextFile(s)*: number of lines
    and columns, empty column values, and so on. The file names are
    separated by spaces. The valid file extensions are *.csv* and *.tsv* for
    comma/semicolon and tab delimited text files respectively. All other
    file names are ignored. All the text files in the current directory can
    be specified by **.csv*, **.tsv*, or the current directory name. The
    --indelim option determines the format of *TextFile(s)*. Any file that
    does not correspond to the format indicated by the --indelim option is
    ignored.

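    The extension rule above can be illustrated with a short Python sketch.
    It is not the script's own code: the function names delimiter_for and
    select_files are hypothetical, the case-sensitive extension match is an
    assumption, and the sketch covers only the extension test, not the
    check of file contents against --indelim.

```python
import os

# Delimiters named by --indelim for .csv files; .tsv files always use a tab.
CSV_DELIMITERS = {"comma": ",", "semicolon": ";"}


def delimiter_for(filename, indelim="comma"):
    """Return the field delimiter for a file name, or None if it is ignored."""
    # Assumption: extensions are matched exactly (case-sensitive).
    ext = os.path.splitext(filename)[1]
    if ext == ".csv":
        return CSV_DELIMITERS[indelim]
    if ext == ".tsv":
        return "\t"  # --indelim has no effect on TSV files
    return None  # every other file name is skipped


def select_files(filenames, indelim="comma"):
    """Pair each accepted file name with its delimiter, dropping the rest."""
    pairs = []
    for name in filenames:
        delim = delimiter_for(name, indelim)
        if delim is not None:
            pairs.append((name, delim))
    return pairs
```

    Under these assumptions, a list such as Sample1.csv, Sample1.tsv and
    Notes.txt would keep the first two names and drop the third.
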
OPTIONS
    -a, --all
        List all the available information.

    -c, --count
        List the number of rows and columns. This is the default behavior.

    --datacheck
        List the number of numerical and non-numerical values for each
        column.

    -d, --detail *infolevel*
        Level of information to print about lines being ignored. Default:
        *1*. Possible values: *1, 2 or 3*.

    -e, --empty
        List the number of empty row and column values.

    -h, --help
        Print this help message.

    --indelim *comma | semicolon*
        Input delimiter for CSV *TextFile(s)*. Possible values: *comma or
        semicolon*. Default value: *comma*. For TSV files, this option is
        ignored and *tab* is used as the delimiter.

    -m, --mode *colnum | collabel*
        Specify how to identify numerical data columns: using column number
        or column label. Possible values: *colnum or collabel*. Default
        value: *colnum*.

    -n, --numericaldatacols *colnum,[colnum,...] | collabel,[collabel,...]*
        This value is mode specific. It is a list of column numbers or
        labels to check for the presence of numerical data only; any other
        value found in these columns is flagged. Default value:
        *all;all;...*.

        For *colnum* mode, input value format is:
        *colnum,...;colnum,...;...*. Example:

            1,3,5
            "2,4,6"

        For *collabel* mode, input value format is:
        *collabel,...;collabel,...;...*. Example:

            "MW,SumNO,SumNHOH"

    -w, --workingdir *dirname*
        Location of working directory. Default: current directory.

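    As an illustration of the --numericaldatacols value format and the kind
    of check that --datacheck describes, here is a minimal Python sketch. It
    is not taken from the script. The names parse_numerical_cols,
    is_numerical and count_numerical are hypothetical, and two things are
    assumed rather than stated above: that each semicolon-separated group
    applies to one input file, and that "numerical" means a value Python's
    float() accepts (which also admits forms such as "nan" and "inf").

```python
def parse_numerical_cols(spec, mode="colnum"):
    """Split 'a,b;c,d' into one list per group; 'all' stays the string 'all'."""
    groups = []
    for group in spec.split(";"):
        group = group.strip()
        if group == "all":
            groups.append("all")
        elif mode == "colnum":
            groups.append([int(item) for item in group.split(",")])
        else:  # collabel mode keeps the labels as strings
            groups.append([item.strip() for item in group.split(",")])
    return groups


def is_numerical(value):
    """Assumption: a value is numerical if float() can convert it."""
    try:
        float(value)
        return True
    except ValueError:
        return False


def count_numerical(values):
    """Return (numerical, non-numerical) counts for one column's values."""
    numerical = sum(1 for value in values if is_numerical(value))
    return numerical, len(values) - numerical
```

    With this sketch, the value "1,3,5;2,4,6" in colnum mode yields two
    groups of column numbers, and "MW,SumNO,SumNHOH" in collabel mode yields
    a single group of three labels.
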
EXAMPLES
    To count the number of lines and columns in text file(s), type:

        % InfoTextFiles.pl Sample1.csv
        % InfoTextFiles.pl Sample1.csv Sample1.tsv
        % InfoTextFiles.pl *.csv *.tsv

    To count the number of lines, columns and empty values in the
    Sample1.csv file and print detailed information, type:

        % InfoTextFiles.pl -d 3 -e Sample1.csv

    To track all available information and non-numerical values for the
    Mol_ID and MolWeight columns in the Sample1.csv file and print detailed
    information, type:

        % InfoTextFiles.pl -d 3 -a -m collabel -n Mol_ID,MolWeight Sample1.csv

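    To make concrete what counts of this kind involve, the following Python
    sketch tallies rows, columns and empty values for a small in-memory
    sample. It does not reproduce the script's output format. The function
    name summarize and the sample data are hypothetical, and the sketch
    assumes that the first line holds column labels and that the header
    line is not counted as a data row.

```python
import csv
import io


def summarize(text, delimiter=","):
    """Count data rows, columns, and empty values per column label."""
    reader = csv.reader(io.StringIO(text), delimiter=delimiter)
    labels = next(reader)  # assumption: first line holds the column labels
    empty_by_col = {label: 0 for label in labels}
    row_count = 0
    for row in reader:
        row_count += 1
        for label, value in zip(labels, row):
            if value.strip() == "":
                empty_by_col[label] += 1
    return {"rows": row_count, "columns": len(labels), "empty": empty_by_col}


# Hypothetical sample: two columns, three data rows, one empty value in each.
sample = "Mol_ID,MolWeight\nMol1,180.16\nMol2,\n,46.07\n"
info = summarize(sample)
```

    Passing delimiter=";" or delimiter="\t" adapts the same sketch to
    semicolon delimited CSV data or to TSV data.
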
AUTHOR
    Manish Sud <msud@san.rr.com>

SEE ALSO
    JoinTextFiles.pl, MergeTextFilesWithSD.pl, ModifyTextFilesFormat.pl,
    SplitTextFiles.pl, TextFilesToHTML.pl

COPYRIGHT
    Copyright (C) 2015 Manish Sud. All rights reserved.

    This file is part of MayaChemTools.

    MayaChemTools is free software; you can redistribute it and/or modify it
    under the terms of the GNU Lesser General Public License as published by
    the Free Software Foundation; either version 3 of the License, or (at
    your option) any later version.