diff awk.xml @ 15:74aae7d6cb09 draft

planemo upload for repository https://github.com/bgruening/galaxytools/tree/master/tools/text_processing/text_processing commit 3103ebed1a420c7d3415b67ef532ea579edf9faa
author bgruening
date Wed, 12 Jul 2017 14:06:47 -0400
parents 7725ab6dab67
children 1e974b82380d
line wrap: on
line diff
--- a/awk.xml	Sat May 06 13:09:30 2017 -0400
+++ b/awk.xml	Wed Jul 12 14:06:47 2017 -0400
@@ -80,14 +80,17 @@
 - **/AG..AG/** will match lines that contain the regular expression **AG..AG** (meaning the characeters AG followed by any two characeters followed by AG). (This is the way to specify regular expressions on the entire line, similar to GREP.)
 - **$7 ~ /A{4}U/**  will match lines whose seventh column contains 4 consecutive A's followed by a U. (This is the way to specify regular expressions on a specific field.)
 - **10000 < $4 && $4 < 20000** will match lines whose fourth column value is larger than 10,000 but smaller than 20,000
+- **BEGIN** will be executed once only, before the first input record is read. 
 - If no pattern is specified, all lines match (meaning the **action** part will be executed on all lines).
 
 
+
 **Action Examples**
 
 - **{ print }** or **{ print $0 }**   will print the entire input line (the line that matched in **pattern**). **$0** is a special marker meaning 'the entire line'.
 - **{ print $1, $4, $5 }** will print only the first, fourth and fifth fields of the input line.
 - **{ print $4, $5-$4 }** will print the fourth column and the difference between the fifth and fourth column. (If the fourth column was start-position in the input file, and the fifth column was end-position - the output file will contain the start-position, and the length).
+- **{ FS = "," }** can be used to change the field separator (delimeter) for parsing the input file.
 - If no action part is specified (not even the curly brackets) - the default action is to print the entire line.