view docs/scripts/txt/SplitTextFiles.txt @ 0:4816e4a8ae95 draft default tip

Uploaded
author deepakjadmin
date Wed, 20 Jan 2016 09:23:18 -0500
parents
children
line wrap: on
line source

NAME
    SplitTextFiles.pl - Split CSV or TSV TextFile(s) into multiple text
    files

SYNOPSIS
    SplitTextFiles.pl TextFile(s)...

    SplitTextFiles.pl [-f, --fast] [-h, --help] [--indelim comma |
    semicolon] [-l, --label yes | no] [-n, --numfiles number] [-o,
    --overwrite] [--outdelim comma | tab | semicolon] [-q, --quote yes | no]
    [-r, --root rootname] [-w, --workingdir dirname] TextFile(s)...

DESCRIPTION
    Split CSV or TSV *TextFile(s)* into multiple text files. Each new text
    file contains a subset of similar number of lines from the initial file.
    The file names are separated by space. The valid file extensions are
    *.csv* and *.tsv* for comma/semicolon and tab delimited text files
    respectively. All other file names are ignored. All the text files in a
    current directory can be specified by **.csv*, **.tsv*, or the current
    directory name. The --indelim option determines the format of
    *TextFile(s)*. Any file which doesn't correspond to the format indicated
    by --indelim option is ignored.

OPTIONS
    -f, --fast
        In this mode, --indelim, --outdelim, and -q --quote options are
        ignored. The format of input and output file(s) are assumed to be
        similar. And the text lines from input *TextFile(s)* are just
        transferred to output file(s) without any processing.

    -h, --help
        Print this help message.

    --indelim *comma | semicolon*
        Input delimiter for CSV *TextFile(s)*. Possible values: *comma or
        semicolon*. Default value: *comma*. For TSV files, this option is
        ignored and *tab* is used as a delimiter.

    -l, --label *yes | no*
        First line contains column labels. Possible values: *yes or no*.
        Default value: *yes*.

    -n, --numfiles *number*
        Number of new files to generate for each TextFile(s). Default: *2*.

    -o, --overwrite
        Overwrite existing files.

    --outdelim *comma | tab | semicolon*
        Output text file delimiter. Possible values: *comma, tab, or
        semicolon*. Default value: *comma*

    -q, --quote *yes | no*
        Put quotes around column values in output text file. Possible
        values: *yes or no*. Default value: *yes*.

    -r, --root *rootname*
        New text file names are generated using the root:
        <Root>Part<Count>.<Ext>. Default new file names:
        <InitialTextFileName>Part<Count>.<Ext>. The csv, and tsv <Ext>
        values are used for comma/semicolon, and tab delimited text files
        respectively.This option is ignored for multiple input files.

    -w, --workingdir *dirname*
        Location of working directory. Default: current directory.

EXAMPLES
    To split each CSV text files into 4 different text files type:

        % SplitTextFiles.pl -n 5 -o Sample1.csv Sample2.csv
        % SplitTextFiles.pl -n 5 -o *.csv

    To split Sample1.tsv into 10 different CSV text files, type:

        % SplitTextFiles.pl -n 10 --outdelim comma -o Sample1.tsv

AUTHOR
    Manish Sud <msud@san.rr.com>

SEE ALSO
    JoinTextFiles.pl, MergeTextFiles.pl, ModifyTextFilesFormat.pl

COPYRIGHT
    Copyright (C) 2015 Manish Sud. All rights reserved.

    This file is part of MayaChemTools.

    MayaChemTools is free software; you can redistribute it and/or modify it
    under the terms of the GNU Lesser General Public License as published by
    the Free Software Foundation; either version 3 of the License, or (at
    your option) any later version.