php的漢字轉換: Unicode(UTF8)->GBK (轉)

worldblog發表於2007-12-13
php的漢字轉換: Unicode(UTF8)->GBK (轉)[@more@]

秋水無恨 GBK Unicode UTF8 漢字 轉換

的漢字轉換一直是比較麻煩的事

該類內建了四個過濾"[dec];","[hex];","%u[hex]","utf8轉換"
方便的使用,同時也可自定義過濾進行自己喜歡的操作


qswhU.php 從這裡
a.com/user/qswh/qswhU.zip">

class qswhU{
 var $qswhData;
 function qswhU($filename="qswhU.php"){
 $this->qswhData=file($filename);
 }
 
 function decode($str,$pattern=0){
 $arr=array("/(w+);/iU","/((%ww)+)/i","/%u(w{4,5})/iU");
 if(is_integer($pattern)){
 if($pattern>=count($arr))die("Invalid Function");
 $pattern=$arr[$pattern];
 }
 return preg_replace_callback($pattern,array($this,"u2gb"),$str);
 }
 
 function u2gb($arr){
 /******(qiushuiwuhen 2002-8-15)******/
 $ret="";$str=$arr[1];
 if(preg_match_all("/%w{2}/",$str,$matches)){
 for($i=0;$i $chr1=hexdec(substr($matches[0][$i],1));
 $arr=array("f0","e0","c0","0");
 for($j=0;$jhexdec($arr[$j]))break;
 $chr=hexdec(substr($matches[0][$i],1))-hexdec($arr[$j]);
 while(++$j $str=dechex($chr);
 if(strlen($str)==4){
 $p=hexdec(substr($str,0,2))-0x4d;
 $q=hexdec(substr($str,2))*4;
 $ret.=chr(hexdec(substr($this->qswhData[$p],$q,2)));
 $ret.=chr(hexdec(substr($this->qswhData[$p],$q+2,2)));
 }else
 $ret.=chr(hexdec($str));
 }
 }
 else{ 
 if(strtolower($str[0])=="x")
 $str=substr($str,1);
 else
 if(strlen($str)!=4)$str=dechex($str);
 
 if(strlen($str)==4){
 $p=hexdec(substr($str,0,2))-0x4d;
 $q=hexdec(substr($str,2))*4;
 $ret.=chr(hexdec(substr($this->qswhData[$p],$q,2)));
 $ret.=chr(hexdec(substr($this->qswhData[$p],$q+2,2)));
 }else
 $ret.=chr(hexdec($str));
 }
 return $ret;
 }
 
 


使用範例

$qswh=new qswhU("qswhU.php");//如果名是qswhU.php,可省引數
echo "

不帶引數(預設過濾為:[num];):&quot;;<br>echo &quot;n&quot;.$qswh-&gt;decode(&quot;中文Abc&quot;);<br>echo &quot;n&quot;.$qswh-&gt;decode(&quot;中文Abc&quot;);<br>echo &quot;n內建過濾(UTF轉碼):&quot;.$qswh-&gt;decode(&quot;%E4%B8%AD%E6%96%87%20!%22%23%24%25%26'()*%2B%2C%2F%3A%3B%3C%3D%3E%3F%40%5B%5D%5E%60%7B%7C%7D~%25Abc&quot;,1);<br>echo &quot;n呼叫內建過濾unescape(%u[num]):&quot;.$qswh-&gt;decode(&quot;%u4E2D%u6587Abc&quot;,2);<br>echo &quot;n自定義過濾([x+num]):&quot;.$qswh-&gt;decode(&quot;[x4E2D][x6587][x41][x62][x63]&quot;,&quot;/[(w+)]/&quot;); <p>效果如下:<br>不帶引數(預設過濾為:[num];):<br>中文Abc<br>中文Abc<br>呼叫內建過濾(UTF轉碼):中文 !&quot;#$%&amp;'()*+,/:;&lt;=&gt;?@[]^`{|}~%Abc<br>呼叫內建過濾unescape(%u[num]):中文Abc<br>自定義過濾([x+num]):中文Abc</p> <p><br>ps.該文章現僅限在csdn文件中心,如要轉載,請和作者聯絡,否則後果自負。</p><br>

來自 “ ITPUB部落格 ” ,連結:http://blog.itpub.net/10752043/viewspace-992845/,如需轉載,請註明出處,否則將追究法律責任。

相關文章