zoukankan      html  css  js  c++  java
  • Python 可视化Twitter中指定话题中Tweet的词汇频率

    CODE:

    #!/usr/bin/python 
    # -*- coding: utf-8 -*-
    
    '''
    Created on 2014-7-8
    @author: guaguastd
    @name: plot_frequencies_words.py
    '''
    
    if __name__ == '__main__':
        #import json
        
        # import Counter
        from collections import Counter
        
        # import search
        from search import search_for_tweet
        
        # import visualize
        from visualize import visualize_for_frequencies
        
        # import login, see http://blog.csdn.net/guaguastd/article/details/31706155
        from login import twitter_login
        
        # get the twitter access api
        twitter_api = twitter_login()
        
        # import tweet
        from tweet import extract_tweet_entities
        
        while 1:
            query = raw_input('
    Input the query (eg. #MentionSomeoneImportantForYou, exit to quit): ')
            
            if query == 'exit':
                print 'Successfully exit!'
                break
            
            statuses = search_for_tweet(twitter_api, query)
            status_texts,screen_names,hashtags,words = extract_tweet_entities(statuses)  
            
            word_counts = sorted(Counter(words).values(), reverse=True)
            visualize_for_frequencies(word_counts, "Word Rank", "Freq")

    RESULT:

    Input the query (eg. #MentionSomeoneImportantForYou, exit to quit): #MentionSomeoneImportantForYou
    Length of statuses 100
    Length of statuses 196


  • 相关阅读:
    js笔记4
    js笔记3
    js笔记2
    js笔记1
    前端笔记13
    (7)第3章的开始
    (6)第2章的尾巴~
    (5)自定义数据结构再探
    我的学习方法(6)
    (4)自定义数据结构初探
  • 原文地址:https://www.cnblogs.com/clnchanpin/p/6811448.html
Copyright © 2011-2022 走看看