perltw - 正體中文 Perl \x{033b}\x{fffd}n

杴慼 TABLE OF CONTENTS


抩蟆 NAME

perltw - 正體中文 Perl \x{033b}\x{fffd}n


濩抸 DESCRIPTION

\x{fffd}w\x{0273}\x{32b4}\x{2047} Perl 炙天\x{fffd}a!

\x{fffd}q 5.8.0 邶\x{fffd}}\x{fffd}l, Perl \x{fffd}蒬々F完善炙 Unicode (\x{fffd}U國\x{fffd}X) 支援, \x{fffd}]\x{fffd}s\x{fffd}a支援\x{fffd}F\x{fffd}\\x{fffd}h庚\x{fffd}B\x{fffd}y\x{fffd}t\x{fffd}H\x{fffd}~炙\x{fffd}s\x{fffd}X方式; CJK (中日韓) \x{fffd}K\x{fffd}O\x{fffd}鉹云漱@部\x{fffd}\x{fffd}. Unicode \x{fffd}O國際股炙標準, 試圖\x{fffd}[\x{fffd}\\x{fffd}@\x{24d8}\x{fffd}W芯有炙\x{fffd}r符: \x{2600}方\x{fffd}@\x{24d8}, \x{fffd}F方\x{fffd}@\x{24d8}, \x{fffd}H及\x{fffd}滫抾〞漱@切 (\x{23c2}臘文, 敘\x{fffd}Q\x{3258}文, \x{0281}庚\x{fffd}B文, \x{23c2}\x{fffd}B\x{32b4}文, \x{fffd}L\x{0306}文, \x{fffd}L\x{fffd}a\x{fffd}w文, 等等). 它\x{fffd}]\x{fffd}e滄\x{fffd}F\x{fffd}h種\x{fffd}@\x{fffd}~\x{fffd}t統\x{fffd}P平\x{fffd}O (\x{fffd}p PC 及麥\x{029d}塔).

Perl 本身\x{fffd}H Unicode \x{fffd}i\x{3020}操\x{fffd}@. \x{fffd}o\x{0288}示 Perl 內部炙\x{fffd}r\x{2602}資紼\x{fffd}i用 Unicode \x{0288}示; Perl 炙\x{25d0}式\x{fffd}P算符 (\x{32b3}\x{fffd}p正\x{fffd}W\x{0288}示式比對) \x{fffd}]\x{2146e}對 Unicode \x{fffd}i\x{3020}操\x{fffd}@. \x{fffd}b輸\x{fffd}J及輸\x{fffd}X\x{fa31}, \x{24f0}\x{fffd}F\x{fffd}B\x{fffd}z\x{fffd}H Unicode 之\x{fffd}e炙\x{fffd}s\x{fffd}X方式\x{fffd}x\x{fffd}s炙資紼, Perl 提\x{32b2}\x{fffd}F Encode \x{fffd}o\x{338e}模組, \x{fffd}i\x{fffd}H讓\x{fffd}A輕空\x{fffd}a讀恆及\x{fffd}g\x{fffd}J舊有炙\x{fffd}s\x{fffd}X資紼.

Encode \x{00be}\x{31fa}模組支援\x{fffd}U\x{fffd}C正體中文炙\x{fffd}s\x{fffd}X方式 ('big5' \x{0288}示 'big5-eten'):

    big5-eten	Big5 \x{fffd}s\x{fffd}X (\x{fffd}t\x{330d}天\x{00be}\x{31fa}\x{fffd}r\x{23ca})
    big5-hkscs	Big5 + \x{2166}港\x{fffd}~\x{fffd}r集, 2001 \x{fffd}~邶
    cp950	\x{fffd}r\x{fffd}X\x{2161} 950 (Big5 + \x{fffd}L\x{fffd}n\x{fffd}K\x{fffd}[炙\x{fffd}r符)

\x{fffd}|\x{32b3}\x{32b4}說, \x{fffd}N Big5 \x{fffd}s\x{fffd}X炙檔釣轉成 Unicode, 街需鍵\x{fffd}J\x{fffd}U\x{fffd}C\x{033b}\x{fffd}O:

    perl -Mencoding=big5,STDOUT,utf8 -pe1 < file.big5 > file.utf8

Perl \x{fffd}]內\x{0295}\x{fffd}F "piconv", \x{fffd}@支完\x{fffd}頞H Perl \x{fffd}g成炙\x{fffd}r符轉換\x{fffd}u\x{fffd}蒫{式, 用\x{fffd}k\x{fffd}p\x{fffd}U:

    piconv -f big5 -t utf8 < file.big5 > file.utf8
    piconv -f utf8 -t big5 < file.utf8 > file.big5

\x{fffd}t\x{fffd}~, \x{fffd}Q用 encoding 模組, \x{fffd}A\x{fffd}i\x{fffd}H輕空\x{fffd}g\x{fffd}X\x{fffd}H\x{fffd}r符\x{24f0}單\x{2668}炙\x{fffd}{式\x{fffd}X, \x{fffd}p\x{fffd}U芯示:

    #!/usr/bin/env perl
    # 啟動 big5 \x{fffd}r\x{2602}解\x{fffd}R; 標準輸\x{fffd}X\x{fffd}J及標準錯\x{fffd}~都\x{fffd}]\x{24f0} big5 \x{fffd}s\x{fffd}X
    use encoding 'big5', STDIN => 'big5', STDOUT => 'big5';
    print length("\x{fffd}d\x{fffd}m");	     #  2 (雙引號\x{0288}示\x{fffd}r符)
    print length('\x{fffd}d\x{fffd}m');	     #  4 (單引號\x{0288}示\x{2668}元組)
    print index("諄諄教誨", "彖帢"); # -1 (不\x{fffd}]\x{fffd}t\x{2664}\x{fffd}l\x{fffd}r\x{2602})
    print index('諄諄教誨', '彖帢'); #  1 (\x{fffd}q第\x{fffd}G\x{338e}\x{2668}元組\x{fffd}}\x{fffd}l)

\x{fffd}b最\x{02e6}\x{fffd}@\x{fffd}C\x{32b3}\x{fffd}l裡, "諄" 炙第\x{fffd}G\x{338e}\x{2668}元組\x{fffd}P "諄" 炙第\x{fffd}@\x{338e}\x{2668}元組結\x{fffd}X成 Big5 \x{fffd}X炙 "彖"; "諄" 炙第\x{fffd}G\x{338e}\x{2668}元組\x{fffd}h\x{fffd}P "教" 炙第\x{fffd}@\x{338e}\x{2668}元組結\x{fffd}X成 "帢". \x{fffd}o解\x{fffd}M\x{fffd}F\x{fffd}H\x{fffd}e Big5 \x{fffd}X比對\x{fffd}B\x{fffd}z\x{fffd}W\x{fffd}`見炙問\x{fffd}D.

\x{fffd}B\x{fffd}~炙中文\x{fffd}s\x{fffd}X

\x{fffd}p\x{fffd}G需\x{fffd}n\x{30f8}\x{fffd}h炙中文\x{fffd}s\x{fffd}X, \x{fffd}i\x{fffd}H\x{fffd}q CPAN (http://www.cpan.org/) \x{fffd}U載 Encode::HanExtra 模組. 它目\x{fffd}e提\x{32b2}\x{fffd}U\x{fffd}C\x{fffd}s\x{fffd}X方式:

    cccii	1980 \x{fffd}~文\x{203f}\x{fffd}|炙中文資\x{fffd}T交換\x{fffd}X
    euc-tw	Unix \x{00be}\x{31fa}\x{fffd}r符集, \x{fffd}]\x{fffd}t CNS11643 平\x{2470} 1-7
    big5plus	中文數\x{2668}化技\x{fffd}N推\x{fffd}s基\x{029d}\x{fffd}|炙 Big5+
    big5ext	中文數\x{2668}化技\x{fffd}N推\x{fffd}s基\x{029d}\x{fffd}|炙 Big5e

\x{fffd}t\x{fffd}~, Encode::HanConvert 模組\x{fffd}h提\x{32b2}\x{fffd}F簡\x{fffd}c轉換用炙\x{fffd}熇婼s\x{fffd}X:

    big5-simp	Big5 正體中文\x{fffd}P Unicode 簡體中文互轉
    gbk-trad	GBK 簡體中文\x{fffd}P Unicode 正體中文互轉

\x{fffd}Y\x{fffd}Q\x{fffd}b GBK \x{fffd}P Big5 之間互轉, 請參考該模組內\x{0295}炙 b2g.pl \x{fffd}P g2b.pl \x{fffd}滮鉾{式, 帕\x{fffd}b\x{fffd}{式內\x{325f}用\x{fffd}U\x{fffd}C\x{fffd}g\x{fffd}k:

    use Encode::HanConvert;
    $euc_cn = big5_to_gb($big5); # \x{fffd}q Big5 轉\x{24f0} GBK
    $big5 = gb_to_big5($euc_cn); # \x{fffd}q GBK 轉\x{24f0} Big5

\x{fffd}i\x{fffd}@\x{fffd}B炙資\x{fffd}T

請參考 Perl 內\x{0295}炙\x{fffd}j\x{fffd}q說咩文件 (不2\x{fffd}曋O用\x{fffd}^文\x{fffd}g炙), \x{32b4}學習\x{30f8}\x{fffd}h關房 Perl 炙疙識, \x{fffd}H及 Unicode 炙\x{325f}用方式. 不\x{fffd}L, \x{fffd}~部炙資源\x{32d0}當豐\x{fffd}I:

提\x{32b2} Perl 資源炙網\x{fffd}}

http://www.perl.com/

Perl 炙\x{2165}\x{2161} (由歐萊禮公\x{fffd}q維\x{fffd}@)

http://www.cpan.org/

Perl 綜\x{fffd}X\x{fffd}斳繳\x{fffd} (Comprehensive Perl Archive Network)

http://lists.perl.org/

Perl \x{fffd}l遞論壇\x{fffd}@覽

學習 Perl 炙網\x{fffd}}

http://www.oreilly.com.tw/chinese/perl/index.html

正體中文邶炙歐萊禮 Perl 脤藉

http://groups.google.com/groups?q=tw.bbs.comp.lang.perl

\x{fffd}O\x{fffd}W Perl \x{fffd}s\x{fffd}u\x{fffd}Q論區 (\x{fffd}]\x{fffd}N\x{fffd}O\x{fffd}U\x{fffd}j BBS 炙 Perl \x{fffd}s\x{fffd}u邶)

Perl \x{325f}用垠集\x{fffd}|

http://www.pm.org/groups/asia.shtml#Taiwan

\x{fffd}O\x{fffd}W Perl 推\x{fffd}s組\x{fffd}@覽

http://irc.elixus.org/

藝立\x{1e3f}\x{fffd}u\x{fffd}W聊天\x{1f71}

Unicode \x{32d0}關網\x{fffd}}

http://www.unicode.org/

Unicode 學\x{fffd}N學\x{fffd}| (Unicode 標準炙\x{2049}\x{fffd}w垠)

http://www.cl.cam.ac.uk/%7Emgk25/unicode.html

Unix/Linux \x{fffd}W炙 UTF-8 及 Unicode 答\x{fffd}豹x{fffd}

中文化資\x{fffd}T

\x{24f0}什麼\x{fffd}s "正體中文" 不\x{fffd}s "\x{fffd}c體中文"?

http://www.csie.ntu.edu.tw/~b7506051/mozilla/faq.html#faqglossary

中文化\x{fffd}n體\x{fffd}p盟

http://www.cpatch.org/

Linux \x{fffd}n體中文化\x{fffd}p劃

http://www.linux.org.tw/CLDP/


楮洘嫩杴 SEE ALSO

the Encode manpage, the Encode::TW manpage, the encoding manpage, the perluniintro manpage, the perlunicode manpage


難樊 AUTHORS

Jarkko Hietaniemi <jhi@iki.fi>

Autrijus Tang (\x{fffd}薳v\x{fffd}~) <autrijus@autrijus.org>

perltw - 正體中文 Perl \x{033b}\x{fffd}n

綢婁 INDEX

perltw - 正體中文 Perl \x{033b}\x{fffd}n