comparison docs/scripts/txt/InfoTextFiles.txt @ 0:4816e4a8ae95 draft default tip

Uploaded
author deepakjadmin
date Wed, 20 Jan 2016 09:23:18 -0500
parents
children
comparison
equal deleted inserted replaced
-1:000000000000 0:4816e4a8ae95
1 NAME
2 InfoTextFiles.pl - List information about TextFile(s)
3
4 SYNOPSIS
5 InfoTextFiles.pl TextFile(s)...
6
7 InfoTextFiles.pl [-a, --all] [-c, --count] [--datacheck] [-d, --detail
8 infolevel] [-e, --empty] [-h, --help] [--indelim comma | semicolon] [-m,
9 --mode colnum | collabel] [-n, --numericaldatacols colnum,[colnum,...] |
10 collabel,[collabel,...]] [-w, --workingdir dirname] TextFile(s)...
11
12 DESCRIPTION
13 List information about *TextFile(s)* contents: number of lines and
14 columns, empty column values, and so on. The file names are separated by
15 spaces. The valid file extensions are *.csv* and *.tsv* for
16 comma/semicolon and tab delimited text files respectively. All other
17 file names are ignored. All the text files in a current directory can be
18 specified by **.csv*, **.tsv*, or the current directory name. The
19 --indelim option determines the format of *TextFile(s)*. Any file which
20 doesn't correspond to the format indicated by --indelim option is
21 ignored.
22
23 OPTIONS
24 -a, --all
25 List all the available information.
26
27 -c, --count
28 List number of rows and columns. This is default behavior.
29
30 --datacheck
31 List number of numerical and non-numerical values for each column.
32
33 -d, --detail *infolevel*
34 Level of information to print about lines being ignored. Default:
35 *1*. Possible values: *1, 2 or 3*.
36
37 -e, --empty
38 List number of empty row and column values.
39
40 -h, --help
41 Print this help message.
42
43 --indelim *comma | semicolon*
44 Input delimiter for CSV *TextFile(s)*. Possible values: *comma or
45 semicolon*. Default value: *comma*. For TSV files, this option is
46 ignored and *tab* is used as a delimiter.
47
48 -m, --mode *colnum | collabel*
49 Specify how to identify numerical data columns: using column number
50 or column label. Possible values: *colnum or collabel*. Default
51 value: *colnum*.
52
53 -n, --numericaldatacols *colnum,[colnum,...] | collabel,[collabel,...]*
54 This value is mode specific. It is a list of column number or labels
55 to check for presence of numerical data only; otherwise, the value
56 is flagged. Default value: *all;all;...*.
57
58 For *colnum* mode, input value format is:
59 *colnum,...;colnum,...;...*. Example:
60
61 1,3,5
62 "2,4,6"
63
64 For *collabel* mode, input value format is:
65 *collabel,...;collabel,...;...*. Example:
66
67 "MW,SumNO,SumNHOH"
68
69 -w, --workingdir *dirname*
70 Location of working directory. Default: current directory.
71
72 EXAMPLES
73 To count number of lines and columns in Text file(s), type:
74
75 % InfoTextFiles.pl Sample1.csv
76 % InfoTextFiles.pl Sample1.csv Sample1.tsv
77 % InfoTextFiles.pl *.csv *.tsv
78
79 To count number of lines, columns and empty values in Sample1.csv file
80 and print detailed information, type:
81
82 % InfoTextFiles.pl -d 3 -e Sample1.csv
83
84 To track all available information and non-numerical values for Mol_ID
85 and MolWeight columns in Sample1.csv file and print detailed
86 information, type:
87
88 % InfoTextFiles.pl -d 3 -a -m collabel -n Mol_ID,MolWeight Sample1.csv
89
90 AUTHOR
91 Manish Sud <msud@san.rr.com>
92
93 SEE ALSO
94 JoinTextFiles.pl, MergeTextFilesWithSD.pl, ModifyTextFilesFormat.pl,
95 SplitTextFiles.pl, TextFilesToHTML.pl
96
97 COPYRIGHT
98 Copyright (C) 2015 Manish Sud. All rights reserved.
99
100 This file is part of MayaChemTools.
101
102 MayaChemTools is free software; you can redistribute it and/or modify it
103 under the terms of the GNU Lesser General Public License as published by
104 the Free Software Foundation; either version 3 of the License, or (at
105 your option) any later version.
106