用python轉換不同編碼的文字檔案

這本來應該是乙個很簡單的問題，一開始被我想複雜了。弄得繞了很大乙個圈。

先說事例：

得到乙個big5編碼的文字檔案，用記事本開啟顯示為亂碼，想轉換儲存為utf-8或gb-2312編碼文件，就這麼回事！

開始以為要用到encode()和decode()進行內碼轉換後再儲存，結果py總是報錯……後來發現：在python3.x以後，f.read()的文字內容都自動轉換為unicode，文字文件儲存為哪種編碼，由開啟檔案的encoding引數決定。因此，將big5編碼的檔案（本例：big5.txt）轉存為utf-8編碼的檔案（本例：utf8.txt），只需如下操作：

bg=open('big5.txt','r',encoding='big5')
ut=open('utf8.txt','w',encoding='utf-8')
ut.write(bg.read())
bg.close()
ut.close()

Python 編碼轉換

coding utf 8 s abc print type s str utf 8 print len s 3 s unicode s str unicode，其中str的每個字元值必須小於128 print type s unicode print len s 3 s u abc print ty...

python編碼轉換

參見主要介紹了python的編碼機制，unicode,utf 8,utf 16,gbk,gb2312,iso 8859 1 等編碼之間的轉換。常見的編碼轉換分為以下幾種情況 1.自動識別字串編碼 coding utf8 import urllib import chardet rawdata ur...

python 編碼轉換

主要介紹了python的編碼機制，unicode,utf 8,utf 16,gbk,gb2312,iso 8859 1 等編碼之間的轉換。常見的編碼轉換分為以下幾種情況可以使用 chardet 模組自動識別字元創編碼 chardet 使用方法例如 a為unicode編碼要轉為gb2312。a...

用python轉換不同編碼的文字檔案

Python 編碼轉換

python編碼轉換

python 編碼轉換

相關推薦