Python: Word Cloud

From YS's Notes
Revision as of 22:40, 20 February 2019 (Wednesday) by Yaosong

Word cloud




Installation

pip install wordcloud

Chinese word segmentation

pip install jieba

Loading the mask image


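Option 1: Pillow and NumPy

from PIL import Image
import numpy as np
alice_mask = np.array(Image.open("xxx.png"))

Option 2: scipy.misc.imread (deprecated in SciPy 1.0 and removed in SciPy 1.2; use option 1 with newer versions)

from scipy.misc import imread
alice_mask = imread('xxx.png')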
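
In wordcloud, pure white (255) pixels of the mask are treated as masked out, and words are drawn only on the remaining pixels. When no suitable image is at hand, a mask can also be built directly with NumPy. A minimal sketch of a circular mask follows; the size, the radius, and the name circle_mask are arbitrary assumptions chosen for illustration.

```python
import numpy as np

size, radius = 300, 130  # arbitrary values chosen for illustration

# Open coordinate grids for rows (y) and columns (x).
y, x = np.ogrid[:size, :size]

# True for every pixel outside the circle centered in the image.
outside = (x - size // 2) ** 2 + (y - size // 2) ** 2 > radius ** 2

# 255 (white) marks the masked-out area, 0 marks where words may be placed.
circle_mask = (255 * outside).astype(np.uint8)
```

The array can then be passed as mask=circle_mask to WordCloud in place of a mask loaded from a file.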



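If the script fails with: _tkinter.TclError: no display name and no $DISPLAY environment variable

This typically happens on a machine without a graphical display, such as a server reached over SSH. Select the non-interactive Agg backend by adding the following before "import matplotlib.pyplot as plt":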

import matplotlib as mpl
mpl.use('Agg')
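
If the same script also runs on a desktop, the backend switch can be made conditional. A minimal sketch follows, assuming a Linux/X11 setup where an unset DISPLAY variable means no display is available; the helper name is_headless is hypothetical.

```python
import os

import matplotlib as mpl


def is_headless(environ=os.environ):
    """Return True when no X11 display is configured (hypothetical helper)."""
    return not environ.get("DISPLAY")


# The backend must be chosen before pyplot is imported.
if is_headless():
    mpl.use("Agg")

import matplotlib.pyplot as plt
```

On platforms that do not use the DISPLAY variable this check would select Agg even when a GUI is available, so it is only a starting point.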


Example:

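#!/usr/bin/python
# -*- coding: utf-8 -*-
import jieba
import numpy as np
from PIL import Image
from wordcloud import WordCloud

# Select the non-interactive Agg backend before importing pyplot.
import matplotlib as mpl
mpl.use('Agg')

import matplotlib.pyplot as plt

# Segment the Chinese text with jieba and join the tokens with spaces.
# Python 3 is assumed, and the text file is assumed to be UTF-8 encoded.
with open('论语.txt', 'r', encoding='utf-8') as f:
    text = " ".join(jieba.cut(f.read()))

# Load the mask image as a NumPy array.
# scipy.misc.imread was removed in SciPy 1.2, so Pillow is used instead.
alice_mask = np.array(Image.open('遮罩图.png'))

# font_path must point to a font with Chinese glyphs (msyh.ttf is Microsoft YaHei).
wordCloud = WordCloud(font_path='msyh.ttf', background_color='white', max_words=2000, max_font_size=80, random_state=40, mask=alice_mask)
wordCloud.generate(text)

plt.imshow(wordCloud, interpolation='bilinear')
plt.axis("off")
plt.show()  # No window opens under the Agg backend; save to a file instead.

# plt.savefig('path to output.png')
wordCloud.to_file('path to output.png')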