【Python】政府工作报告词云

时间：2020-04-26 13:52:07 阅读：478 评论：0 收藏：0 [点我收藏+]

标签：round file path jpg close wordcloud 图片 odi read

技术图片

2019政府工作报告.txt

https://www.lanzous.com/iby44eh

栗子1：

import wordcloud
import jieba
f=open("2019政府工作报告.txt","r",encoding="utf-8")
t=f.read()
f.close()
ls=jieba.lcut(t)
txt=" ".join(ls)
w=wordcloud.WordCloud(font_path="msyh.ttc",    width=1000,height=700,background_color="white",        )
w.generate(txt)
w.to_file("2019政府工作报告.png")

结果

技术图片

可以添加

max_words=15

限制词语数量

技术图片

栗子2：自定义词云背景样式，背景样式自己找一个图片

技术图片

代码：

# 分词模块
import jieba
# 画图模块
import matplotlib.pyplot as plt
# 文字云模块
from wordcloud import WordCloud
# 这是一个处理图像的函数，读取背景图片
#from scipy.misc import imread  #这句出错imread不用另安装
from matplotlib.pyplot import imread
 
# 词源的文本文件
wf = ‘2019政府工作报告.txt‘
# 读取文件内容
word_content = open(wf,‘r‘, encoding=‘utf-8‘).read().replace(‘\n‘,‘‘)
# 设置背景图片
img_file = ‘bj.jpg‘
# 解析背景图片
mask_img = imread(img_file)
# 进行分词
word_cut = jieba.cut(word_content)
# 把分词用空格连起来
word_cut_join = " ".join(word_cut)
# 设置词云参数
wc = WordCloud(
    #字体
    font_path="msyh.ttc",
    # 允许最大词汇量
    max_words = 2000,
    # 设置最大号字体大小
    max_font_size = 90,
    # 设置使用的背景图片，这个参数不为空时，width和height会被忽略
    mask = mask_img,
    # 设置输出的图片背景色
    background_color = ‘white‘
   )
 
# 生成词云
wc.generate(word_cut_join)
wc.to_file("2019政府工作报告.png")

结果：

技术图片

【Python】政府工作报告词云

标签：round file path jpg close wordcloud 图片 odi read

原文地址：https://www.cnblogs.com/HGNET/p/12778898.html

踩

(0)

评论一句话评论（0）

分享档案

更多>

2021年07月29日 (22)
2021年07月28日 (40)
2021年07月27日 (32)
2021年07月26日 (79)
2021年07月23日 (29)
2021年07月22日 (30)
2021年07月21日 (42)
2021年07月20日 (16)
2021年07月19日 (90)
2021年07月16日 (35)

周排行