檔案方式實現完整的英文詞頻統計例項

1.讀入待分析的字串

2.分解提取單詞

3.計數字典

4.排除語法型詞彙

5.排序

6.輸出top(20)

7.對輸出結果的簡要說明。

a = open('
text.txt
','r')
new =aa.read()
aa.close()
exc=
new =new.lower()
for i in
',.-"':
new = new.replace(i,'')
new = new.split('
')    #
分詞，單詞列表
print
(new)
d ={}
keys = set(new)      #
單詞的集合，存入字典
keys = keys-exc
print
(keys)
for i in
keys:
d[i] = new.count(i)     #
統計單詞，出現次數的字典
print
(d)w = list(d.items())       #
將字典鍵值對轉換為列表
w.sort(key = lambda x:x[1],reverse = true)      #
排序for i in range(20):
				檔案方式實現完整的英文詞頻統計例項
1.讀入待分析的字串 fo open test.txt r 讀入待分析的字串 str fo.read fo.close 2.分解提取單詞 for i in n 分解提取單詞 str str.replace i,words str.split 3.計數字典 for i in keys 建立計數字典 d...
				檔案方式實現完整的英文詞頻統計例項
1.讀入待分析的字串 2.分解提取單詞 3.計數字典 4.排除語法型詞彙 5.排序 6.輸出top 20 7.對輸出結果的簡要說明。fo open english.txt r s fo.read s s.lower 大寫轉換為小寫 for i in s s.replace i,所有標點符號替換為空格...
				英文詞頻統計
詞頻統計預處理 將所有,等分隔符全部替換為空格 將所有大寫轉換為小寫 生成單詞列表 生成詞頻統計 排序排除語法型詞彙，代詞 冠詞 連詞 輸出詞頻最大top10 word lately,i ve been,i ve been losing sleep dreaming about the things...

檔案方式實現完整的英文詞頻統計例項

檔案方式實現完整的英文詞頻統計例項

檔案方式實現完整的英文詞頻統計例項

英文詞頻統計

相關推薦