ANSIç¼ç
å¼å§è®¡ç®æºåªå¨ç¾å½ç¨ãå «ä½çåèä¸å ±å¯ä»¥ç»ååº256ï¼2ç8次æ¹ï¼ç§ä¸åçç¶æãÂ
ä»ä»¬æå ¶ä¸çç¼å·ä»0å¼å§ç32ç§ç¶æåå«è§å®äºç¹æ®çç¨éï¼æè¿äº0X20以ä¸çåèç¶æç§°ä¸º"æ§å¶ç "ã
ä»ä»¬åæææçç©ºæ ¼ãæ ç¹ç¬¦å·ãæ°åã大å°å忝åå«ç¨è¿ç»çåèç¶æè¡¨ç¤ºï¼ä¸ç´ç¼å°äºç¬¬127å·ï¼è¿æ ·è®¡ç®æºå°±å¯ä»¥ç¨ä¸ååèæ¥åå¨è±è¯çæåäºãäºæ¯å¤§å®¶é½æè¿ä¸ªæ¹æ¡å«å ANSI ç"Ascii"ç¼ç ï¼American Standard Code for Information Interchangeï¼ç¾å½ä¿¡æ¯äºæ¢æ å代ç ï¼ã彿¶ä¸ç䏿æçè®¡ç®æºé½ç¨åæ ·çASCIIæ¹æ¡æ¥ä¿åè±ææåã
æ©å±çANSIç¼ç
忥ï¼ä¸çåå°çé½å¼å§ä½¿ç¨è®¡ç®æºï¼ä½æ¯å¾å¤å½å®¶ç¨ç䏿¯è±æï¼ä»ä»¬ç忝éæè®¸å¤æ¯ASCIIéæ²¡æçï¼ä¸ºäºå¯ä»¥å¨è®¡ç®æºä¿åä»ä»¬çæåï¼ä»ä»¬å³å®éç¨127å·ä¹åçç©ºä½æ¥è¡¨ç¤ºè¿äºæ°ç忝ã符å·ï¼è¿å å ¥äºå¾å¤ç»è¡¨æ ¼æ¶éè¦ç¨ä¸å°ç横线ãç«çº¿ã交åçå½¢ç¶ï¼ä¸ç´æåºå·ç¼å°äºæåä¸ä¸ªç¶æ255ãä»128å°255è¿ä¸é¡µçå符éè¢«ç§°âæ©å±å符éâã
GB2312ç¼ç
å½å¤©æäººä»¬å¾å°è®¡ç®æºæ¶ï¼å·²ç»æ²¡æå¯ä»¥å©ç¨çåèç¶ææ¥è¡¨ç¤ºæ±åï¼åµä¸æ6000å¤ä¸ªå¸¸ç¨æ±åéè¦ä¿åã天æäººæ°å°±ä¸å®¢æ°å°æé£äº127å·ä¹åçå¥å¼ç¬¦å·ä»¬ç´æ¥åæ¶æã
è§å®ï¼ä¸ä¸ªå°äº127çå符çæä¹ä¸åæ¥ç¸åï¼ä½ä¸¤ä¸ªå¤§äº127çå符è¿å¨ä¸èµ·æ¶ï¼å°±è¡¨ç¤ºä¸ä¸ªæ±åï¼åé¢çä¸ä¸ªåèï¼ä»ç§°ä¹ä¸ºé«åèï¼ä»0xA1ç¨å°0xF7ï¼åé¢ä¸ä¸ªåèï¼ä½åèï¼ä»0xA1å°0xFEï¼è¿æ ·æä»¬å°±å¯ä»¥ç»ååºå¤§çº¦7000å¤ä¸ªç®ä½æ±åäºã
å¨è¿äºç¼ç éï¼æä»¬è¿ææ°å¦ç¬¦å·ãç½é©¬å¸è çåæ¯ãæ¥æçåå们é½ç¼è¿å»äºï¼è¿å¨ ASCII 鿬æ¥å°±æçæ°åãæ ç¹ã忝é½ç»ç»éæ°ç¼äºä¸¤ä¸ªåèé¿çç¼ç ï¼è¿å°±æ¯å¸¸è¯´çâå ¨è§âå符ï¼è忥å¨127å·ä»¥ä¸çé£äºå°±å«"åè§"å符äºãäºæ¯å°±æè¿ç§æ±åæ¹æ¡å«å âGB2312âãGB2312 æ¯å¯¹ ASCII ç䏿æ©å±ã
GBK å GB18030ç¼ç
使¯å¤©æçæ±å太å¤äºï¼æä»¬å¾å¿«å°±å°±åç°æè®¸å¤äººçäººåæ²¡æåæ³å¨è¿éæåºæ¥ï¼äºæ¯æä»¬ä¸å¾ä¸ç»§ç»æ GB2312 没æç¨å°çç 使¾åºæ¥èå®ä¸å®¢æ°å°ç¨ä¸ãåæ¥è¿æ¯ä¸å¤ç¨ï¼äºæ¯å¹²èä¸åè¦æ±ä½åèä¸å®æ¯127å·ä¹åçå ç ï¼åªè¦ç¬¬ä¸ä¸ªåèæ¯å¤§äº127å°±åºå®è¡¨ç¤ºè¿æ¯ä¸ä¸ªæ±åçå¼å§ï¼ä¸ç®¡åé¢è·çæ¯ä¸æ¯æ©å±å符ééçå 容ãç»ææ©å±ä¹åçç¼ç æ¹æ¡è¢«ç§°ä¸º GBK æ åï¼GBK å æ¬äº GB2312 çææå 容ï¼åæ¶åå¢å äºè¿20000个æ°çæ±åï¼å æ¬ç¹ä½åï¼å符å·ã忥尿°æ°æä¹è¦ç¨çµèäºï¼äºæ¯æä»¬åæ©å±ï¼åå äºå å个æ°çå°æ°æ°æçåï¼GBK æ©æäº GB18030ãå¨è¿ä¸ªæ åéï¼æå¤§çç¹ç¹æ¯ä¸¤åèé¿çæ±åå符åä¸åèé¿çè±æå符并åäºåä¸å¥ç¼ç æ¹æ¡éï¼å æ¤ä»ä»¬åçç¨åºä¸ºäºæ¯æä¸æå¤çï¼å¿ é¡»è¦æ³¨æå串éçæ¯ä¸ä¸ªåèçå¼ï¼å¦æè¿ä¸ªå¼æ¯å¤§äº127çï¼é£ä¹å°±è®¤ä¸ºä¸ä¸ªååèå符ééçå符åºç°äºã
飿¶å塿¯åè¿ç¼ç¨å¦ä¹ çç¨åºåé½è¦æ¯å¤©å¿µä¸é¢è¿ä¸ªåè¯æ°ç¾éçæç£¨ï¼Â
âä¸ä¸ªæ±åç®ä¸¤ä¸ªè±æå符ï¼ä¸ä¸ªæ±åç®ä¸¤ä¸ªè±æå符â¦â¦â
UNICODEç¼ç
å ä¸ºå½æ¶å个å½å®¶é½æä¸å¥èªå·±çç¼ç æ åï¼ç»æäºç¸ä¹é´è°ä¹ä¸æè°çç¼ç ï¼è°ä¹ä¸æ¯æå«äººçç¼ç ãæ£å¨è¿æ¶ï¼ä¸ä¸ªå« ISO ï¼å½é æ è°åç»ç»ï¼çå½é ç»ç»å³å®çæè§£å³è¿ä¸ªé®é¢ãä»ä»¬éç¨çæ¹æ³å¾ç®åï¼åºäºææçå°åºæ§ç¼ç æ¹æ¡ï¼éæ°æä¸ä¸ªå æ¬äºå°ç䏿ææåãææåæ¯å符å·çç¼ç ï¼ä»ä»¬æç®å«å® UCS, ä¿ç§° UNICODEãï¼ Universal Multiple-Octet Coded Character Set ï¼å¨UNICODE ä¸ï¼ä¸ä¸ªæ±åç®ä¸¤ä¸ªè±æåç¬¦çæ¶ä»£å·²ç»å¿«è¿å»äºãæ 论æ¯åè§çè±æåæ¯ï¼è¿æ¯å ¨è§çæ±åï¼å®ä»¬é½æ¯ç»ä¸çâä¸ä¸ªå符âï¼åæ¶ï¼ä¹é½æ¯ç»ä¸çâ两个åè"âã
UTF-8åUTF-16
UNICODE æ¥å°æ¶ï¼ä¸èµ·å°æ¥çè¿æè®¡ç®æºç½ç»çå ´èµ·ï¼UNICODE å¦ä½å¨ç½ç»ä¸ä¼ è¾ä¹æ¯ä¸ä¸ªå¿ é¡»èèçé®é¢ï¼äºæ¯é¢åä¼ è¾çä¼å¤ UTFï¼UCS Transfer Formatï¼æ ååºç°äºï¼é¡¾åæä¹ï¼UTF8å°±æ¯æ¯æ¬¡8个ä½ä¼ è¾æ°æ®ï¼èUTF16å°±æ¯æ¯æ¬¡16个ä½ï¼åªä¸è¿ä¸ºäºä¼ è¾æ¶çå¯é æ§ï¼ä»UNICODEå°UTFæ¶å¹¶ä¸æ¯ç´æ¥ç对åºï¼èæ¯è¦è¿ä¸äºç®æ³åè§åæ¥è½¬æ¢ã
æªæ¥çUCS-4
å¦åæè¿°ï¼UNICODE æ¯ç¨ä¸¤ä¸ªåèæ¥è¡¨ç¤ºä¸ºä¸ä¸ªå符ï¼ä»æ»å ±å¯ä»¥ç»ååº65535ä¸åçå符ï¼è¿å¤§æ¦å·²ç»å¯ä»¥è¦çä¸ç䏿ææåç符å·ã妿è¿ä¸å¤ä¹æ²¡æå ³ç³»ï¼ISOå·²ç»åå¤äºUCS-4æ¹æ¡ï¼è¯´ç®åäºå°±æ¯å个åèæ¥è¡¨ç¤ºä¸ä¸ªå符ï¼è¿æ ·æä»¬å°±å¯ä»¥ç»ååº21亿个ä¸åçåç¬¦åºæ¥ï¼æé«ä½æå ¶ä»ç¨éï¼ã
é»è®¤è¯ç³»ï¼<META CONTENT=âtext/htmlï¼charset=XXXXXâ>ãè¿ä¸ªä¸»è¦æ¯ç±äºç¨åºåæ¯é¢åå½å°ç人å¼åçç½ç«ï¼ç±äºå½å°é½æ¯é»è®¤è¯ç³»ï¼æä»¥æ²¡æä¹±ç ç§æ åµãè³äºåºç°å£å£å£å£å£å£è¿ç§æ åµï¼è¿æ¯ç±äºç½ç«å¹¶æ²¡æéç¨UTF-8ç¼ç èæ¯éç¨çå½å°çç¼ç ï¼å¦èå¤è¯çï¼é¿æä¼¯è¯çç¼ç ï¼ä½ çè®¡ç®æºä¸å¹¶æ²¡æè¿ç§ç¼ç ï¼æä»¥ä¸è½è¯å«ã
è§£å³åæ³æ¯ï¼äºå 为æµè§å¨å®è£ å¤è¯è¨æ¯æå ï¼ä¾å¦å¨å®è£ IEæ¶è¦å®è£ å¤è¯è¨æ¯æå ï¼ï¼è¿æ ·å¨æµè§ç½é¡µåºç°ä¹±ç æ¶ï¼å°±å¯ä»¥å¨æµè§å¨ä¸éæ©èåæ ä¸çâæ¥çâ/âç¼ç â/âèªå¨éæ©â/èå¤ï¼ï¼å¦ä¸ºç¹ä½ä¸æåéæ©âæ¥çâ/âç¼ç â/âèªå¨éæ©â/é¿æä¼¯è¯ï¼å ¶å®è¯è¨ä¾æ¤ç±»æ¨éæ©ç¸åºçè¯ç³»ï¼è¿æ ·å¯æ¶é¤ç½é¡µä¹±ç ç°è±¡ã
ç®åå¼åç½ç«ç¨ä»ä¹ç¼ç æ¯è¾å¥½
ä¸è¬éä¿çç解为ï¼
utf-8æ¯ä¸çæ§éç¨ä»£ç ï¼ä¹å®ç¾çæ¯æä¸æç¼ç ï¼å¦ææä»¬åçç½ç«è½è®©å½å¤ç¨æ·æ£å¸¸ç访é®ï¼å°±æå¥½ç¨utf-8ã
GB2312å±äºä¸æç¼ç ï¼ä¸»è¦é对å½å ç¨æ·ä½¿ç¨ï¼å¦æå½å¤ç¨æ·è®¿é®GB2312ç¼ç çç½ç«å°±ä¼åä¹±ç ã
ä¸è¬è§å¾æ¯ç¨utf-8æ¯GB2312è¦å¤å¾å¤ï¼å¤§å®¶é½æ¯è¾èµåç¨utf-8ã
UTF-8ç¼ç çæä»¶æ¯GB2312æ´å 空é´ä¸äºï¼è½ç¶ç®åç硬件ç¯å¢ä¸å¯ä»¥å¿½ç¥ï¼ä½æ¯è¿äºé¨æ·ç½ç«ä¸ºäºåå°æå¡å¨è´è½½åºæ¬ä¸ææç页é¢é½çæäºéæé¡µï¼UTF-8ä¿åèµ·æ¥æä»¶ä¼æ¯è¾å¤§ï¼å¯¹äºé¨æ·çº§å«çç½ç«æ¯å¤©çæçæä»¶éè¿æ¯é常巨大ï¼å¸¦æ¥çå卿æ¬ç¸åºæé«ãç±äºUTF-8çç¼ç æ¯GB2312è§£ç çç½ç»ä¼ è¾æ°æ®éè¦å¤§ï¼å¯¹äºé¨æ·çº§å«çç½ç«æ¥è¯´ãè¿ä¸ªæ å½¢ä¹é´å°±è¦å¢å¤§å¸¦å®½ï¼ç¨GB2312对ç½ç»æµéæ çæ¯æå¥½çä¼åã
