  • Drawing a word cloud with Python

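    Input text: data (a collection of many text entries; the .apply calls below suggest a pandas Series)

    1. Tokenize:

    import jieba
    # Split every text entry into a list of tokens
    data_cut = data.apply(jieba.lcut)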

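    2. Remove stop words:

    stoplist.txt: link: https://pan.baidu.com/s/1lN1J8aUFOwqXpYMzuqVA7w  extraction code: nk7z

    # Path as it appears in the source (separators may have been lost); adjust it to where your stoplist.txt lives
    with open(r'D:数据文件stoplist.txt', encoding='utf-8') as f:
        txt = f.read()
    stop = txt.split()
    stop = stop + [' ']   # also treat the space character as a stop word
    stop = set(stop)      # set membership tests are faster than list lookups
    data_after = data_cut.apply(
        lambda x: [i for i in x if i not in stop]
    )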
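
    To see what the filtering step does without needing the real stop list, here is a minimal sketch on hand-made token lists. The helper name remove_stopwords, the sample stop words, and the sample tokens are all made up for illustration; only the filtering logic mirrors the lambda above.

```python
def remove_stopwords(tokens, stopwords):
    """Keep only the tokens that are not in the stop-word collection."""
    return [tok for tok in tokens if tok not in stopwords]

# Hypothetical stop list: whitespace-separated words, as in stoplist.txt
stop_text = "的 了 是\n在 和"
stopwords = set(stop_text.split()) | {" "}   # add the space token as well

# Hypothetical tokenized texts (the kind of lists jieba.lcut returns)
docs = [
    ["今天", "的", "天气", " ", "很", "好"],
    ["我", "在", "学习", "和", "工作"],
]
cleaned = [remove_stopwords(doc, stopwords) for doc in docs]
print(cleaned)
```

    The space is added separately because str.split() discards whitespace, so it can never come out of the stop list file itself.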

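    3. Count word frequencies:

    import pandas as pd
    from tkinter import _flatten   # private tkinter helper, used here to flatten nested lists

    tmp = pd.Series(_flatten(list(data_after)))   # flatten the 2-D list of tokens into 1-D
    num = tmp.value_counts()   # occurrences of each word, most frequent first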
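
    Since _flatten is a private helper inside tkinter, it may not be available everywhere. As an alternative (not the original author's approach), the same flatten-and-count step can be sketched with the standard library only. The sample token lists are made up for illustration.

```python
from collections import Counter
from itertools import chain

# Hypothetical result of the stop-word step: one token list per text
token_lists = [["词云", "分词"], ["词云", "停用词"], ["词云"]]

# Flatten the list of lists into a single list of tokens
flat = list(chain.from_iterable(token_lists))

# Count how often each token occurs
counts = Counter(flat)
print(counts.most_common(2))
```

    A Counter is a dict subclass, so it is a plain word-to-count mapping, which is the shape of data the drawing step works from.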

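    4. Draw the word cloud:

    from wordcloud import WordCloud
    import matplotlib.pyplot as plt

    # Path as it appears in the source (separators may have been lost); adjust it to your own image
    pic = plt.imread(r'D:数据文件aixin.jpg')   # mask image; pure white areas are left empty
    wc = WordCloud(
        background_color='white',
        mask=pic,
        font_path=r'C:/Windows/Fonts/simsun.ttc')   # a font that contains Chinese glyphs
    wc2 = wc.fit_words(num)   # size each word by its frequency
    plt.imshow(wc2)
    plt.axis('off')
    plt.show()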
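
    fit_words works from word-to-frequency pairs. If you would rather hand it a plain dict limited to the most frequent words, a small helper along these lines could be used. The name top_frequencies and the sample counts are made up for illustration; this is a sketch, not part of the wordcloud API.

```python
from operator import itemgetter

def top_frequencies(counts, n):
    """Return the n most frequent words as a plain {word: count} dict.

    `counts` can be any object whose .items() yields (word, count) pairs,
    such as a dict, a collections.Counter, or a pandas Series like `num`.
    """
    ranked = sorted(counts.items(), key=itemgetter(1), reverse=True)
    return {word: int(count) for word, count in ranked[:n]}

# Hypothetical word counts
sample = {"停用词": 1, "词云": 3, "分词": 2}
top2 = top_frequencies(sample, 2)
print(top2)
```

    With the variables from the steps above, the call would look like wc.fit_words(top_frequencies(num, 100)).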
  • Original article: https://www.cnblogs.com/DDiamondd/p/11183079.html