annotate mayachemtool/mayachemtools/docs/modules/txt/TextUtil.txt @ 0:68300206e90d draft default tip

Uploaded
author deepakjadmin
date Thu, 05 Nov 2015 02:41:30 -0500
parents
children
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
0
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
1 NAME
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
2 TextUtil
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
3
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
4 SYNOPSIS
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
5 use TextUtil;
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
6
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
7 use TextUtil qw(:all);
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
8
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
9 DESCRIPTION
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
10 TextUtil module provides the following functions:
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
11
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
12 AddNumberSuffix, ContainsWhiteSpaces, GetTextFileDataByNonUniqueKey,
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
13 GetTextFileDataByUniqueKey, GetTextLine, HashCode, IsEmpty, IsFloat,
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
14 IsInteger, IsNotEmpty, IsNumberPowerOfNumber, IsNumerical,
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
15 IsPositiveInteger, JoinWords, QuoteAWord,
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
16 RemoveLeadingAndTrailingWhiteSpaces, RemoveLeadingWhiteSpaces,
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
17 RemoveTrailingWhiteSpaces, SplitWords, WrapText
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
18
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
19 FUNCTIONS
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
20 AddNumberSuffix
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
21 $NumberWithSuffix = AddNumberSuffix($IntegerValue);
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
22
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
23 Returns number with appropriate suffix: 0, 1st, 2nd, 3rd, 4th, and
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
24 so on.
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
25
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
26 ContainsWhiteSpaces
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
27 $Status = ContainsWhiteSpaces($TheString);
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
28
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
29 Returns 1 or 0 based on whether the string contains any white
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
30 spaces.
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
31
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
32 GetTextLine
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
33 $Line = GetTextLine(\*TEXTFILE);
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
34
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
35 Reads next line from an already opened text file, takes out any
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
36 carriage return, and returns it as a string. NULL is returned for
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
37 EOF.
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
38
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
39 GetTextFileDataByNonUniqueKey
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
40 GetTextFileDataByNonUniqueKey($TextDataFile, $TextDataMapRef,
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
41 $DataKeyColNum, $InDelim);
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
42
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
43 Load data from a text file into the specified hash reference using a
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
44 specific column for non-unique data key values.
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
45
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
46 The lines starting with # are treated as comments and ignored. First
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
47 line not starting with # must contain column labels and the number
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
48 of columns in all other data rows must match the number of column
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
49 labels.
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
50
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
51 The first column is assumed to contain data key value by default;
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
52 all other columns contain data as indicated in their column labels.
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
53
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
54 In order to avoid dependence of data access on the specified column
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
55 labels, the column data is loaded into hash with Column<Num> hash
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
56 keys, where column number start from 1. The data key column is not
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
57 available as Colnum<Num> hash key;
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
58
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
59 The format of the data structure loaded into a specified hash
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
60 reference is:
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
61
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
62 @{$TextDataMapRef->{DataKeys}} - Array of unique data keys
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
63 @{$TextDataMapRef->{ColLabels}} - Array of column labels
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
64 @{$TextDataMapRef->{DataColIDs}} - Array of data column IDs
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
65 $TextDataMapRef->{NumOfCols} - Number of columns
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
66 %{$TextDataMapRef->{DataKey}} - Hash keys pair: <DataKey, DataKey>
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
67 @{$TextDataMapRef->{DataCol<Num>}} - Hash keys pair with data as an array:
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
68 <DataCol<Num>, DataKey>
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
69
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
70 GetTextFileDataByUniqueKey
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
71 GetTextFileDataByUniqueKey($TextDataFile, $TextDataMapRef, $DataKeyColNum,
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
72 $InDelim);
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
73
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
74 Load data from a text file into the specified hash reference using a
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
75 a specific column for unique data key values.
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
76
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
77 The lines starting with # are treated as comments and ignored. First
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
78 line not starting with # must contain column labels and the number
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
79 of columns in all other data rows must match the number of column
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
80 labels.
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
81
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
82 The first column is assumed to contain data key value by default;
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
83 all other columns contain data as indicated in their column labels.
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
84
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
85 In order to avoid dependence of data access on the specified column
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
86 labels, the column data is loaded into hash with Column<Num> hash
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
87 keys, where column number start from 1. The data key column is not
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
88 available as Colnum<Num> hash key;
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
89
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
90 The format of the data structure loaded into a specified hash
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
91 reference is:
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
92
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
93 @{$TextDataMapRef->{DataKeys}} - Array of unique data keys
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
94 @{$TextDataMapRef->{ColLabels}} - Array of column labels
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
95 @{$TextDataMapRef->{DataColIDs}} - Array of data column IDs
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
96 $TextDataMapRef->{NumOfCols} - Number of columns
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
97 %{$TextDataMapRef->{DataKey}} - Hash keys pair: <DataKey, DataKey>
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
98 %{$TextDataMapRef->{DataCol<Num>}} - Hash keys pair: <DataCol<Num>, DataKey>
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
99
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
100 HashCode
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
101 $HashCode = HashCode($TheString);
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
102
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
103 Returns a 32 bit integer hash code using One-at-a-time algorithm By
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
104 Bob Jenkins [Ref 38]. It's also implemented in Perl for internal
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
105 hash keys in hv.h include file.
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
106
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
107 IsEmpty
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
108 $Status = IsEmpty($TheString);
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
109
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
110 Returns 1 or 0 based on whether the string is empty.
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
111
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
112 IsInteger
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
113 $Status = IsInteger($TheString);
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
114
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
115 Returns 1 or 0 based on whether the string is a positive integer.
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
116
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
117 IsPositiveInteger
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
118 $Status = IsPositiveInteger($TheString);
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
119
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
120 Returns 1 or 0 based on whether the string is an integer.
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
121
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
122 IsFloat
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
123 $Status = IsFloat($TheString);
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
124
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
125 Returns 1 or 0 based on whether the string is a float.
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
126
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
127 IsNotEmpty
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
128 $Status = IsNotEmpty($TheString);
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
129
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
130 Returns 0 or 1 based on whether the string is empty.
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
131
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
132 IsNumerical
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
133 $Status = IsNumerical($TheString);
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
134
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
135 Returns 1 or 0 based on whether the string is a number.
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
136
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
137 IsNumberPowerOfNumber
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
138 $Status = IsNumberPowerOfNumber($FirstNum, $SecondNum);
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
139
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
140 Returns 1 or 0 based on whether the first number is a power of
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
141 second number.
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
142
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
143 JoinWords
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
144 $JoinedWords = JoinWords($Words, $Delim, $Quote);
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
145
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
146 Joins different words using delimiter and quote parameters, and
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
147 returns it as a string.
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
148
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
149 QuoteAWord
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
150 $QuotedWord = QuoteAWord($Word, $Quote);
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
151
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
152 Returns a quoted string based on *Quote* value.
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
153
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
154 RemoveLeadingWhiteSpaces
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
155 $OutString = RemoveLeadingWhiteSpaces($InString);
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
156
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
157 Returns a string without any leading and traling white spaces.
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
158
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
159 RemoveTrailingWhiteSpaces
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
160 $OutString = RemoveTrailingWhiteSpaces($InString);
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
161
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
162 Returns a string without any trailing white spaces.
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
163
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
164 RemoveLeadingAndTrailingWhiteSpaces
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
165 $OutString = RemoveLeadingAndTrailingWhiteSpaces($InString);
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
166
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
167 Returns a string without any leading and traling white spaces.
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
168
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
169 SplitWords
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
170 @Words = SplitWords($Line, $Delimiter);
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
171
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
172 Returns an array *Words* ontaining unquoted words generated after
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
173 spliting string value *Line* containing quoted or unquoted words.
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
174
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
175 This function is used to split strings generated by JoinWords as
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
176 replacement for Perl's core module funtion
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
177 Text::ParseWords::quotewords() which dumps core on very long
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
178 strings.
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
179
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
180 WrapText
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
181 $OutString = WrapText($InString, [$WrapLength, $WrapDelimiter]);
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
182
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
183 Returns a wrapped string. By default, *WrapLenght* is *40* and
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
184 *WrapDelimiter* is Unix new line character.
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
185
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
186 AUTHOR
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
187 Manish Sud <msud@san.rr.com>
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
188
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
189 SEE ALSO
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
190 FileUtil.pm
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
191
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
192 COPYRIGHT
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
193 Copyright (C) 2015 Manish Sud. All rights reserved.
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
194
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
195 This file is part of MayaChemTools.
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
196
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
197 MayaChemTools is free software; you can redistribute it and/or modify it
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
198 under the terms of the GNU Lesser General Public License as published by
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
199 the Free Software Foundation; either version 3 of the License, or (at
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
200 your option) any later version.
68300206e90d Uploaded
deepakjadmin
parents:
diff changeset
201