zoukankan      html  css  js  c++  java
  • 文件方式实现完整的英文词频统计实例

    1.读入待分析的字符串

    2.分解提取单词 

    3.计数字典

    4.排除语法型词汇

    5.排序

    6.输出TOP(20)

    fo=open('dead romance.txt','w')
    fo.write('''in a rainy night
    can you hear me
    in a rainy night
    can you help me
    man,what are you thinking of 
    man,what do you need
    man,nobody tell you what to do
    man ,you need somebody to hurt
    in a rainy night
    can you hear me
    in a rainy night
    can you help me
    man,you feel so lonely
    man,can you hear the message come from the sky
    man,you are driving into the rain
    man,you know it's time to find the prey
    in a rainy night
    can you hear me
    in a rainy night
    can you help me''')
    fo.close()
    
    fo=open('dead romance.txt','r')
    A= fo.read()
    exc={'the','and','to','of','in','a','for','with',''}
    for i in ',.?!
    "':
        A=A.replace(i,' ')
    A=A.lower()
    A=A.split(" ")
    words=set(A)
    dic={}
    keys=set(A)#出现过单词的集合,字典的KEY
    keys=keys-exc
    for i in keys:
        dic[i]=A.count(i)
    w=list(dic.items())
    w.sort(key=lambda x:x[1],reverse=True)
    for i in range(20):
        print(w[i])
    fo.close()

  • 相关阅读:
    作业01(2020年10月10号)
    C语言I博客作业04
    C语言I博客作业03
    C语言I博客作业02
    第一次学c语言作业
    C语言I博客作业09
    C语言I博客作业08
    C语言I博客作业07
    C语言I博客作业06
    C语言I博客作业05
  • 原文地址:https://www.cnblogs.com/GAODASHANG/p/7598539.html
Copyright © 2011-2022 走看看