Encode::TW(3pm) Perl Programmers Reference Guide Encode::TW(3pm)NAME
Encode::TW - Taiwan-based Chinese Encodings
SYNOPSIS
use Encode qw/encode decode/;
$big5 = encode("big5", $utf8); # loads Encode::TW implicitly
$utf8 = decode("big5", $big5); # ditto
DESCRIPTION
This module implements tradition Chinese charset encodings as used in Taiwan and Hong Kong. Encodings supported are as follows.
Canonical Alias Description
--------------------------------------------------------------------
big5-eten /big-?5$/i Big5 encoding (with ETen extensions)
/big5-?et(en)?$/i
/tca-?big5$/i
big5-hkscs /big5-?hk(scs)?$/i
/hk(scs)?-?big5$/i
Big5 + Cantonese characters in Hong Kong
MacChineseTrad Big5 + Apple Vendor Mappings
cp950 Code Page 950
= Big5 + Microsoft vendor mappings
--------------------------------------------------------------------
To find out how to use this module in detail, see Encode.
NOTES
Due to size concerns, "EUC-TW" (Extended Unix Character), "CCCII" (Chinese Character Code for Information Interchange), "BIG5PLUS" (CMEX's
Big5+) and "BIG5EXT" (CMEX's Big5e) are distributed separately on CPAN, under the name Encode::HanExtra. That module also contains extra
China-based encodings.
BUGS
Since the original "big5" encoding(1984) is not supported anywhere (glibc and DOS-based systems uses "big5" to mean "big5-eten"; Microsoft
uses "big5" to mean "cp950"), a conscious decision was made to alias "big5" to "big5-eten", which is the de facto superset of the original
big5.
The "CNS11643" encoding files are not complete. For common "CNS11643" manipulation, please use "EUC-TW" in Encode::HanExtra, which contains
planes 1-7.
The ASCII region (0x00-0x7f) is preserved for all encodings, even though this conflicts with mappings by the Unicode Consortium.
SEE ALSO
Encode
perl v5.18.2 2013-11-04 Encode::TW(3pm)
Check Out this Related Man Page
PERLTW(1) Perl Programmers Reference Guide PERLTW(1)NAME
perltw - XXXX Perl XX
DESCRIPTION
XXXX Perl XXX!
X 5.8.0 XXX, Perl XXXXXX Unicode (XXX) XX, XXXXXXXXXXXXXXXXXXX; CJK (XXX) XXXXXXXX. Unicode XXXXXXX, XXXXXXXXXXXX: XXXX, XXXX, XXXXXXXX
(XXX, XXXX, XXXX, XXXX, XXX, XXXX, XX). XXXXXXXXXXXXXX (X PC XXXX).
Perl XXX Unicode XXXX. XXX Perl XXXXXXXXX Unicode XX; Perl XXXXXX (XXXXXXXXX) XXX Unicode XXXX. XXXXXXX, XXXXX Unicode XXXXXXXXXXXX, Perl
XXX Encode XXXX, XXXXXXXXXXXXXXXXXXX.
Encode XXXXXXXXXXXXXXXXX ('big5' XX 'big5-eten'):
big5-eten Big5 XX (XXXXXXX)
big5-hkscs Big5 + XXXXX, 2001 XX
cp950 XXX 950 (Big5 + XXXXXXX)
XXXX, X Big5 XXXXXXX Unicode, XXXXXXXX:
perl -Mencoding=big5,STDOUT,utf8 -pe1 < file.big5 > file.utf8
Perl XXXX "piconv", XXXXX Perl XXXXXXXXXXX, XXXX:
piconv -f big5 -t utf8 < file.big5 > file.utf8
piconv -f utf8 -t big5 < file.utf8 > file.big5
XX, XX encoding XX, XXXXXXXXXXXXXXXXX, XXXX:
#!/usr/bin/env perl
# XX big5 XXXX; XXXXXXXXXXXXX big5 XX
use encoding 'big5', STDIN => 'big5', STDOUT => 'big5';
print length("XX"); # 2 (XXXXXXX)
print length('XX'); # 4 (XXXXXXXX)
print index("XXXX", "XX"); # -1 (XXXXXXX)
print index('XXXX', 'XX'); # 1 (XXXXXXXXX)
XXXXXXXX, "X" XXXXXXXX "X" XXXXXXXXXX Big5 XX "X"; "X" XXXXXXXXX "X" XXXXXXXXXX "X". XXXXXX Big5 XXXXXXXXXXX.
XXXXXXX
XXXXXXXXXXX, XXX CPAN (<http://www.cpan.org/>) XX Encode::HanExtra XX. XXXXXXXXXXX:
cccii 1980 XXXXXXXXXXXX
euc-tw Unix XXXXX, XX CNS11643 XX 1-7
big5plus XXXXXXXXXXXXX Big5+
big5ext XXXXXXXXXXXXX Big5e
XX, Encode::HanConvert XXXXXXXXXXXXXXXX:
big5-simp Big5 XXXXX Unicode XXXXXX
gbk-trad GBK XXXXX Unicode XXXXXX
XXX GBK X Big5 XXXX, XXXXXXXXX b2g.pl X g2b.pl XXXX, XXXXXXXXXXX:
use Encode::HanConvert;
$euc_cn = big5_to_gb($big5); # X Big5 XX GBK
$big5 = gb_to_big5($euc_cn); # X GBK XX Big5
XXXXXX
XXX Perl XXXXXXXXX (XXXXXXXXX), XXXXXXX Perl XXX, XX Unicode XXXXX. XX, XXXXXXXXX:
XX Perl XXXXX
<http://www.perl.com/>
Perl XXX (XXXXXXXX)
<http://www.cpan.org/>
Perl XXXXX (Comprehensive Perl Archive Network)
<http://lists.perl.org/>
Perl XXXXXX
XX Perl XXX
<http://www.oreilly.com.tw/product_perl.php?id=index_perl>
XXXXXXXXX Perl XX
<http://groups.google.com/groups?q=tw.bbs.comp.lang.perl>
XX Perl XXXXX (XXXXX BBS X Perl XXX)
Perl XXXXX
<http://www.pm.org/groups/asia.html>
XX Perl XXXXX
<http://irc.elixus.org/>
XXXXXXXX
Unicode XXXX
<http://www.unicode.org/>
Unicode XXXX (Unicode XXXXXX)
<http://www.cl.cam.ac.uk/%7Emgk25/unicode.html>
Unix/Linux XX UTF-8 X Unicode XXX
XXXXX
XXXXXXX
<http://www.cpatch.org/>
Linux XXXXXXX
<http://www.linux.org.tw/CLDP/>
SEE ALSO
Encode, Encode::TW, encoding, perluniintro, perlunicode
AUTHORS
Jarkko Hietaniemi <jhi@iki.fi>
Audrey Tang (XX) <audreyt@audreyt.org>
perl v5.12.5 2012-11-03 PERLTW(1)