zoukankan      html  css  js  c++  java
  • twint 安装及使用

    分享这个post是自己方便查,还有中文网界对这个东西介绍太少。
    更多的就看github项目twint吧。

    Installation:

    git+pip3:

    git clone https://github.com/twintproject/twint.git
    pip3 install -r requirements.txt
    pip3 install twint
    

    or pip3+pipenv:

    pip3 install --user --upgrade -e git+https://github.com/twintproject/twint.git@origin/master#egg=twint
    pipenv install -e git+https://github.com/twintproject/twint.git#egg=twint
    

    You may meet module cannot found error when you try to run twint after installation. On ubuntu, add ~/.local/bin into your PATH by:

    export PATH=$PATH:~/.local/bin
    
    

    You may edit ~/.bashrc file to permanately add the '~/.local/bin' into your PATH.

    Usage:

    Running the twint cmd with arguments can give you results. A few simple examples to help you understand the basics:

    • twint -u username - Scrape all the Tweets from user's timeline.
    • twint -u username -s pineapple - Scrape all Tweets from the user's timeline containing pineapple.
    • twint -s pineapple - Collect every Tweet containing pineapple from everyone's Tweets.
    • twint -u username --year 2014 - Collect Tweets that were tweeted before 2014.
    • twint -u username --since "2015-12-20 20:30:15" - Collect Tweets that were tweeted since 2015-12-20 20:30:15.
    • twint -u username --since 2015-12-20 - Collect Tweets that were tweeted since 2015-12-20 00:00:00.
    • twint -u username -o file.txt - Scrape Tweets and save to file.txt.
    • twint -u username -o file.csv --csv - Scrape Tweets and save as a csv file.
    • twint -u username --email --phone - Show Tweets that might have phone numbers or email addresses.
    • twint -s "Donald Trump" --verified - Display Tweets by verified users that Tweeted about Donald Trump.
    • twint -g="48.880048,2.385939,1km" -o file.csv --csv - Scrape Tweets from a radius of 1km around a place in Paris and export them to a csv file.
    • twint -u username -es localhost:9200 - Output Tweets to Elastic search
    • twint -u username -o file.json --json - Scrape Tweets and save as a json file.
    • twint -u username --database tweets.db - Save Tweets to a SQLite database.
    • twint -u username --followers - Scrape a Twitter user's followers.
    • twint -u username --following - Scrape who a Twitter user follows.
    • twint -u username --favorites - Collect all the Tweets a user has favorited (gathers ~3200 tweet).
    • twint -u username --following --user-full - Collect full user information a person follows
    • twint -u username --profile-full - Use a slow, but effective method to gather Tweets from a user's profile (Gathers ~3200 Tweets, Including Retweets).
    • twint -u username --retweets - Use a quick method to gather the last 900 Tweets (that includes retweets) from a user's profile.
    • twint -u username --resume resume_file.txt - Resume a search starting from the last saved scroll-id.

    More detail about the commands and options are located in the wiki

  • 相关阅读:
    极光推送JPush
    dom4j解析xml
    Hibernate4之JPA规范配置详解
    Jquery的$命名冲突
    初识zookeeper(二)之与Dubbo-admin关联
    初识zookeeper(一)之zookeeper的安装及配置
    jenkins maven svn 部署web项目到本地Tomcat
    spring mvc 和junit 4集成的注意点
    String 和 CharSequence 关系与区别
    (转)如何安装 easy installer+pip
  • 原文地址:https://www.cnblogs.com/sonictl/p/11705857.html
Copyright © 2011-2022 走看看