中央一号文件中文词频统计并保存到文件中

时间：2019-07-28 15:31:39 阅读：117 评论：0 收藏：0 [点我收藏+]

 1 # Author :Zcb
 2 
 3 #中文词频统计
 4 import jieba
 5 import sys
 6 
 7 f = open("d:/政府工作报告.txt",‘r‘)
 8 txt = f.read()
 9 f.close()
10 
11 d={}
12 ls=jieba.lcut(txt)
13 exclude =[‘“‘,‘”‘,‘要‘,‘和‘,‘，‘,‘的‘,‘\n‘,‘。‘,‘、‘]  #这里是要exclude 的列表，根据需求可灵活改变
14 for w in ls:
15     if w in exclude or len(w)==1:
16         continue
17     d[w]=d.get(w,0)+1
18 
19 #####################按val排序
20 res = sorted(d.items(),key =lambda x:x[1],reverse=True)
21 fd= open("d:/统计结果.txt",‘w‘)
22 old_stdout = sys.stdout
23 sys.stdout = fd
24 for w in res:
25     if(w[1]<=25):
26         continue
27     print(‘"{}"出现了{}次‘.format(w[0],w[1]))
28 sys.stdout = old_stdout
29 fd.close()

中央一号文件中文词频统计并保存到文件中

标签：auth 并保存中文词频统计文件 res std span eve

原文地址：https://www.cnblogs.com/zach0812/p/11258982.html

踩

(0)

评论一句话评论（0）

分享档案

更多>

2021年07月29日 (22)
2021年07月28日 (40)
2021年07月27日 (32)
2021年07月26日 (79)
2021年07月23日 (29)
2021年07月22日 (30)
2021年07月21日 (42)
2021年07月20日 (16)
2021年07月19日 (90)
2021年07月16日 (35)

周排行