杴慼 TABLE OF CONTENTS
perltw - 正體中文 Perl \x{033b}\x{fffd}n
\x{fffd}w\x{0273}\x{32b4}\x{2047} Perl 炙天\x{fffd}a!
\x{fffd}q 5.8.0 邶\x{fffd}}\x{fffd}l, Perl \x{fffd}蒬々F完善炙 Unicode (\x{fffd}U國\x{fffd}X) 支援, \x{fffd}]\x{fffd}s\x{fffd}a支援\x{fffd}F\x{fffd}\\x{fffd}h庚\x{fffd}B\x{fffd}y\x{fffd}t\x{fffd}H\x{fffd}~炙\x{fffd}s\x{fffd}X方式; CJK (中日韓) \x{fffd}K\x{fffd}O\x{fffd}鉹云漱@部\x{fffd}\x{fffd}. Unicode \x{fffd}O國際股炙標準, 試圖\x{fffd}[\x{fffd}\\x{fffd}@\x{24d8}\x{fffd}W芯有炙\x{fffd}r符: \x{2600}方\x{fffd}@\x{24d8}, \x{fffd}F方\x{fffd}@\x{24d8}, \x{fffd}H及\x{fffd}滫抾〞漱@切 (\x{23c2}臘文, 敘\x{fffd}Q\x{3258}文, \x{0281}庚\x{fffd}B文, \x{23c2}\x{fffd}B\x{32b4}文, \x{fffd}L\x{0306}文, \x{fffd}L\x{fffd}a\x{fffd}w文, 等等). 它\x{fffd}]\x{fffd}e滄\x{fffd}F\x{fffd}h種\x{fffd}@\x{fffd}~\x{fffd}t統\x{fffd}P平\x{fffd}O (\x{fffd}p PC 及麥\x{029d}塔).
Perl 本身\x{fffd}H Unicode \x{fffd}i\x{3020}操\x{fffd}@. \x{fffd}o\x{0288}示 Perl 內部炙\x{fffd}r\x{2602}資紼\x{fffd}i用 Unicode \x{0288}示; Perl 炙\x{25d0}式\x{fffd}P算符 (\x{32b3}\x{fffd}p正\x{fffd}W\x{0288}示式比對) \x{fffd}]\x{2146e}對 Unicode \x{fffd}i\x{3020}操\x{fffd}@. \x{fffd}b輸\x{fffd}J及輸\x{fffd}X\x{fa31}, \x{24f0}\x{fffd}F\x{fffd}B\x{fffd}z\x{fffd}H Unicode 之\x{fffd}e炙\x{fffd}s\x{fffd}X方式\x{fffd}x\x{fffd}s炙資紼, Perl 提\x{32b2}\x{fffd}F Encode \x{fffd}o\x{338e}模組, \x{fffd}i\x{fffd}H讓\x{fffd}A輕空\x{fffd}a讀恆及\x{fffd}g\x{fffd}J舊有炙\x{fffd}s\x{fffd}X資紼.
Encode \x{00be}\x{31fa}模組支援\x{fffd}U\x{fffd}C正體中文炙\x{fffd}s\x{fffd}X方式 ('big5' \x{0288}示 'big5-eten'):
big5-eten Big5 \x{fffd}s\x{fffd}X (\x{fffd}t\x{330d}天\x{00be}\x{31fa}\x{fffd}r\x{23ca})
big5-hkscs Big5 + \x{2166}港\x{fffd}~\x{fffd}r集, 2001 \x{fffd}~邶
cp950 \x{fffd}r\x{fffd}X\x{2161} 950 (Big5 + \x{fffd}L\x{fffd}n\x{fffd}K\x{fffd}[炙\x{fffd}r符)
\x{fffd}|\x{32b3}\x{32b4}說, \x{fffd}N Big5 \x{fffd}s\x{fffd}X炙檔釣轉成 Unicode, 街需鍵\x{fffd}J\x{fffd}U\x{fffd}C\x{033b}\x{fffd}O:
perl -Mencoding=big5,STDOUT,utf8 -pe1 < file.big5 > file.utf8
Perl \x{fffd}]內\x{0295}\x{fffd}F "piconv", \x{fffd}@支完\x{fffd}頞H Perl \x{fffd}g成炙\x{fffd}r符轉換\x{fffd}u\x{fffd}蒫{式, 用\x{fffd}k\x{fffd}p\x{fffd}U:
piconv -f big5 -t utf8 < file.big5 > file.utf8
piconv -f utf8 -t big5 < file.utf8 > file.big5
\x{fffd}t\x{fffd}~, \x{fffd}Q用 encoding 模組, \x{fffd}A\x{fffd}i\x{fffd}H輕空\x{fffd}g\x{fffd}X\x{fffd}H\x{fffd}r符\x{24f0}單\x{2668}炙\x{fffd}{式\x{fffd}X, \x{fffd}p\x{fffd}U芯示:
#!/usr/bin/env perl
# 啟動 big5 \x{fffd}r\x{2602}解\x{fffd}R; 標準輸\x{fffd}X\x{fffd}J及標準錯\x{fffd}~都\x{fffd}]\x{24f0} big5 \x{fffd}s\x{fffd}X
use encoding 'big5', STDIN => 'big5', STDOUT => 'big5';
print length("\x{fffd}d\x{fffd}m"); # 2 (雙引號\x{0288}示\x{fffd}r符)
print length('\x{fffd}d\x{fffd}m'); # 4 (單引號\x{0288}示\x{2668}元組)
print index("諄諄教誨", "彖帢"); # -1 (不\x{fffd}]\x{fffd}t\x{2664}\x{fffd}l\x{fffd}r\x{2602})
print index('諄諄教誨', '彖帢'); # 1 (\x{fffd}q第\x{fffd}G\x{338e}\x{2668}元組\x{fffd}}\x{fffd}l)
\x{fffd}b最\x{02e6}\x{fffd}@\x{fffd}C\x{32b3}\x{fffd}l裡, "諄" 炙第\x{fffd}G\x{338e}\x{2668}元組\x{fffd}P "諄" 炙第\x{fffd}@\x{338e}\x{2668}元組結\x{fffd}X成 Big5 \x{fffd}X炙 "彖"; "諄" 炙第\x{fffd}G\x{338e}\x{2668}元組\x{fffd}h\x{fffd}P "教" 炙第\x{fffd}@\x{338e}\x{2668}元組結\x{fffd}X成 "帢". \x{fffd}o解\x{fffd}M\x{fffd}F\x{fffd}H\x{fffd}e Big5 \x{fffd}X比對\x{fffd}B\x{fffd}z\x{fffd}W\x{fffd}`見炙問\x{fffd}D.
\x{fffd}p\x{fffd}G需\x{fffd}n\x{30f8}\x{fffd}h炙中文\x{fffd}s\x{fffd}X, \x{fffd}i\x{fffd}H\x{fffd}q CPAN (http://www.cpan.org/) \x{fffd}U載 Encode::HanExtra 模組. 它目\x{fffd}e提\x{32b2}\x{fffd}U\x{fffd}C\x{fffd}s\x{fffd}X方式:
cccii 1980 \x{fffd}~文\x{203f}\x{fffd}|炙中文資\x{fffd}T交換\x{fffd}X
euc-tw Unix \x{00be}\x{31fa}\x{fffd}r符集, \x{fffd}]\x{fffd}t CNS11643 平\x{2470} 1-7
big5plus 中文數\x{2668}化技\x{fffd}N推\x{fffd}s基\x{029d}\x{fffd}|炙 Big5+
big5ext 中文數\x{2668}化技\x{fffd}N推\x{fffd}s基\x{029d}\x{fffd}|炙 Big5e
\x{fffd}t\x{fffd}~, Encode::HanConvert 模組\x{fffd}h提\x{32b2}\x{fffd}F簡\x{fffd}c轉換用炙\x{fffd}熇婼s\x{fffd}X:
big5-simp Big5 正體中文\x{fffd}P Unicode 簡體中文互轉
gbk-trad GBK 簡體中文\x{fffd}P Unicode 正體中文互轉
\x{fffd}Y\x{fffd}Q\x{fffd}b GBK \x{fffd}P Big5 之間互轉, 請參考該模組內\x{0295}炙 b2g.pl \x{fffd}P g2b.pl \x{fffd}滮鉾{式, 帕\x{fffd}b\x{fffd}{式內\x{325f}用\x{fffd}U\x{fffd}C\x{fffd}g\x{fffd}k:
use Encode::HanConvert;
$euc_cn = big5_to_gb($big5); # \x{fffd}q Big5 轉\x{24f0} GBK
$big5 = gb_to_big5($euc_cn); # \x{fffd}q GBK 轉\x{24f0} Big5
請參考 Perl 內\x{0295}炙\x{fffd}j\x{fffd}q說咩文件 (不2\x{fffd}曋O用\x{fffd}^文\x{fffd}g炙), \x{32b4}學習\x{30f8}\x{fffd}h關房 Perl 炙疙識, \x{fffd}H及 Unicode 炙\x{325f}用方式. 不\x{fffd}L, \x{fffd}~部炙資源\x{32d0}當豐\x{fffd}I:
Perl 炙\x{2165}\x{2161} (由歐萊禮公\x{fffd}q維\x{fffd}@)
Perl 綜\x{fffd}X\x{fffd}斳繳\x{fffd} (Comprehensive Perl Archive Network)
Perl \x{fffd}l遞論壇\x{fffd}@覽
正體中文邶炙歐萊禮 Perl 脤藉
\x{fffd}O\x{fffd}W Perl \x{fffd}s\x{fffd}u\x{fffd}Q論區 (\x{fffd}]\x{fffd}N\x{fffd}O\x{fffd}U\x{fffd}j BBS 炙 Perl \x{fffd}s\x{fffd}u邶)
\x{fffd}O\x{fffd}W Perl 推\x{fffd}s組\x{fffd}@覽
藝立\x{1e3f}\x{fffd}u\x{fffd}W聊天\x{1f71}
Unicode 學\x{fffd}N學\x{fffd}| (Unicode 標準炙\x{2049}\x{fffd}w垠)
Unix/Linux \x{fffd}W炙 UTF-8 及 Unicode 答\x{fffd}豹x{fffd}
http://www.csie.ntu.edu.tw/~b7506051/mozilla/faq.html#faqglossary
the Encode manpage, the Encode::TW manpage, the encoding manpage, the perluniintro manpage, the perlunicode manpage
Jarkko Hietaniemi <jhi@iki.fi>
Autrijus Tang (\x{fffd}薳v\x{fffd}~) <autrijus@autrijus.org>