mybag1 (Muchong, Official Writer)
[Discussion]
Using chardet to detect character encoding
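chardet is a module for detecting the character encoding of strings and files.

1. Downloading and installing chardet

Download: http://pypi.python.org/pypi/chardet

After downloading, unpack the chardet archive and put the chardet folder in your application's directory; you can then start using it with import chardet. You can also copy chardet into Python's system directory, so that all of your Python programs only need import chardet. The unpacked package can also be installed with:

    python setup.py install

2. Examples

chardet.detect() returns a dictionary, in which confidence is how certain the detection is (between 0 and 1) and encoding is the name of the detected encoding.

(1) Detecting the encoding of a web page (a Python 2 session; in Python 3, urlopen lives in urllib.request):

    >>> import urllib
    >>> rawdata = urllib.urlopen('http://www.google.cn/').read()
    >>> import chardet
    >>> chardet.detect(rawdata)
    {'confidence': 0.98999999999999999, 'encoding': 'GB2312'}

(2) Detecting the encoding of a file:

    import chardet

    # Open in binary mode: detect() works on raw bytes, not decoded text
    with open('c:\\111.txt', 'rb') as tt:
        # read(5) also works here, but readlines() raises an error,
        # because it returns a list and detect() expects one byte string
        ff = tt.readline()
    enc = chardet.detect(ff)
    print(enc['encoding'])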
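Once detect() has given you a guess, the usual next step is to decode the bytes with it. Below is a minimal sketch of one way to do that, not an official chardet recipe. The helper names decode_with_guess and read_text, and the min_confidence, fallback and sample_size parameters, are made up for this illustration; the only chardet call used is chardet.detect().

```python
def decode_with_guess(raw, guess, min_confidence=0.5, fallback="utf-8"):
    """Decode raw bytes using a chardet-style result dictionary.

    `guess` is expected to look like what chardet.detect() returns,
    e.g. {'encoding': 'GB2312', 'confidence': 0.99}.
    `min_confidence` and `fallback` are hypothetical knobs for this sketch.
    """
    encoding = guess.get("encoding")
    confidence = guess.get("confidence") or 0.0
    # detect() may report encoding=None, or a guess it is not sure about;
    # in both cases use the fallback encoding instead
    if not encoding or confidence < min_confidence:
        encoding = fallback
    # errors="replace" substitutes U+FFFD for bytes that do not fit
    return raw.decode(encoding, errors="replace")


def read_text(path, sample_size=4096):
    """Guess a file's encoding from its first bytes, then decode all of it."""
    import chardet  # third-party; imported here so the helper above has no dependency

    with open(path, "rb") as f:
        raw = f.read()
    # detect() takes a single bytes object, so pass a slice of the data
    # (when starting from readlines(), join the list first: b"".join(lines))
    guess = chardet.detect(raw[:sample_size])
    return decode_with_guess(raw, guess)
```

For example, read_text('c:\\111.txt') would return the file's contents as a string. Sampling only the first few kilobytes keeps detection fast on large files, at the cost of a less reliable guess when the sample is mostly ASCII.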