Linux and UNIX Man Pages

Linux & Unix Commands - Search Man Pages

gensprep(8) [centos man page]

gensprep(8)							 ICU 50.1.2 Manual						       gensprep(8)

NAME
gensprep - compile StringPrep data from files filtered by filterRFC3454.pl SYNOPSIS
gensprep [ -h, -?, --help ] [ -v, --verbose ] [ -c, --copyright ] [ -s, --sourcedir source ] [ -d, --destdir destination ] DESCRIPTION
gensprep reads filtered RFC 3454 files and compiles their information into a binary form. The resulting file, <name>.icu, can then be read directly by ICU, or used by pkgdata(8) for incorporation into a larger archive or library. The files read by gensprep are described in the FILES section. OPTIONS
-h, -?, --help Print help about usage and exit. -v, --verbose Display extra informative messages during execution. -c, --copyright Include a copyright notice into the binary data. -s, --sourcedir source Set the source directory to source. The default source directory is specified by the environment variable ICU_DATA. -d, --destdir destination Set the destination directory to destination. The default destination directory is specified by the environment variable ICU_DATA. ENVIRONMENT
ICU_DATA Specifies the directory containing ICU data. Defaults to /usr/share/icu/50.1.2/. Some tools in ICU depend on the presence of the trailing slash. It is thus important to make sure that it is present if ICU_DATA is set. FILES
The following files are read by gensprep and are looked for in the source /misc for rfc3454_*.txt files and in source /unidata for Normal- izationCorrections.txt. rfc3453_A_1.txt Contains the list of unassigned codepoints in Unicode version 3.2.0.... rfc3454_B_1.txt Contains the list of code points that are commonly mapped to nothing.... rfc3454_B_2.txt Contains the list of mappings for casefolding of code points when Normalization form NFKC is specified.... rfc3454_C_X.txt Contains the list of code points that are prohibited for IDNA. NormalizationCorrections.txt Contains the list of code points whose normalization has changed since Unicode Version 3.2.0. VERSION
50.1.2 COPYRIGHT
Copyright (C) 2000-2002 IBM, Inc. and others. SEE ALSO
pkgdata(8) ICU MANPAGE
18 March 2003 gensprep(8)

Check Out this Related Man Page

ICUPKG(8)							 ICU 50.1.2 Manual							 ICUPKG(8)

NAME
icupkg - extract or modify an ICU .dat archive SYNOPSIS
icupkg [ -h, -?, --help ] [ -tl, --type l | -tb, --type b | -te, --type e ] [ -c, --copyright | -C, --comment comment ] [ -a, --add list ] [ -r, --remove list ] [ -x, --extract list ] [ -l, --list ] [ -s, --sourcedir source ] [ -d, --destdir destination ] [ -w, --writepkg ] [ -m, --matchmode mode ] infilename [ outfilename ] DESCRIPTION
icupkg reads the input ICU .dat package file, modify it according to the options, swap it to the desired platform properties (charset & endianness), and optionally write the resulting ICU .dat package to the output file. Items are removed, then added, then extracted and listed. An ICU .dat package is written if items are removed or added, or if the input and output filenames differ, or if the -w, --writepkg option is set. If the input filename is "new" then an empty package is created. If the output filename is missing, then it is automatically generated from the input filename. If the input filename ends with an l, b, or e matching its platform properties, then the output filename will con- tain the letter from the -t, --type option. This tool can also be used to just swap a single ICU data file, replacing the former icuswap tool. For this mode, provide the infilename (and optional outfilename) for a non-package ICU data file. Allowed options include -t, -w, -s and -d The filenames can be absolute, or relative to the source/dest dir paths. Other options are not allowed in this mode. OPTIONS
-h, -?, --help Print help about usage and exit. -tl, --type l Output for little-endian/ASCII charset family. The output type defaults to the input type. -tb, --type b Output for big-endian/ASCII charset family. The output type defaults to the input type. -te, --type e Output for big-endian/EBCDIC charset family. The output type defaults to the input type. -c,--copyright Include the ICU copyright notice in the resulting data. -C, --comment comment Include the specified comment in the resulting data instead of the ICU copyright notice. -a, --add list Add items from the list to the package. The list can be a single filename with a .txt file extension containing a list of item file- names, or an ICU .dat package filename. -r, --remove list Remove items from the list from the package. The list can be a single filename with a .txt file extension containing a list of item filenames, or an ICU .dat package filename. -x, --extract list Extract items from the list from the package. The list can be a single filename with a .txt file extension containing a list of item filenames, or an ICU .dat package filename. -m, --matchmode mode Set the matching mode for item names with wildcards. -s, --sourcedir source Set the source directory to source. The default source directory is the current directory. -d, --destdir destination Set the destination directory to destination. The default destination directory is the current directory. -l, --list List the package items to stdout (after modifying the package). LIST FILE SYNTAX
Items are listed on one or more lines and separated by whitespace (space+tab). Comments begin with # and are ignored. Empty lines are ignored. Lines where the first non-whitespace character is one of "%&'()*+,-./:;<=>?_ are also ignored to reserve for future syntax. Items for removal or extraction may contain a single * wildcard character. The * matches zero or more characters. If -m, --matchmode noslash is set, then the * character does not match the / character. Items must be listed relative to the package, and the --sourcedir or the --destdir path will be prepended. The paths are only prepended to item filenames while adding or extracting items, not to ICU .dat package or list filenames. Paths may contain / instead of the platform's file separator character and are converted as appropriate. AUTHORS
Markus Scherer George Rhoten VERSION
1.0 COPYRIGHT
Copyright (C) 2006 IBM, Inc. and others. SEE ALSO
pkgdata(1) genrb(1) ICU MANPAGE
18 August 2006 ICUPKG(8)
Man Page