  • Installing Scrapy

    Reference article: http://www.cnblogs.com/txw1958/archive/2012/07/12/scrapy_installation_introduce.html

    I. Introduction to Scrapy

    Scrapy is a fast high-level screen scraping and web crawling framework, used to crawl websites and extract structured data from their pages. It can be used for a wide range of purposes, from data mining to monitoring and automated testing.

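    Official homepage: http://www.scrapy.org/

    Third-party Chinese documentation: http://scrapy-chs.readthedocs.org/zh_CN/latest/intro/overview.html

    II. Installing Python 2.7

    Official homepage: http://www.python.org/

    Download: http://www.python.org/ftp/python/2.7.3/python-2.7.3.msi

    1) Install Python

    Installation directory: D:\Python27

    2) Add the environment variables

    Append D:\Python27;D:\Python27\Scripts; to the PATH variable.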
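To confirm that both directories actually ended up on PATH, a short script can compare the expected entries against the current value. This is a minimal sketch and not part of the original steps: `missing_path_entries` is a hypothetical helper name, and the directory list assumes the D:\Python27 layout used in this guide. It uses the standard `ntpath` module so that Windows path rules (case-insensitive, trailing backslash ignored) apply wherever it runs.

```python
import ntpath
import os


def missing_path_entries(required, path_value):
    """Return the directories in `required` that are absent from a
    Windows-style, semicolon-separated PATH string."""
    def norm(path):
        # Drop trailing separators, then fold case and slash direction.
        return ntpath.normcase(ntpath.normpath(path.strip()))

    present = set(norm(p) for p in path_value.split(";") if p.strip())
    return [d for d in required if norm(d) not in present]


if __name__ == "__main__":
    # Assumed install layout from this guide; adjust if yours differs.
    required = [r"D:\Python27", r"D:\Python27\Scripts"]
    print(missing_path_entries(required, os.environ.get("PATH", "")))
```

An empty list means both entries were found. Remember that a command prompt opened before the variable was edited still holds the old PATH.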

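    3) Verify the environment variables

    Open a command prompt and run the following command to update PATH (use the drive that matches your installation directory; this guide installs to D:\Python27):

    c:\python27\python.exe c:\python27\tools\scripts\win_add2path.py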

    4) Verify Python

    T:\>python
    Python 2.7.3 (default, Apr 10 2012, 23:31:26) [MSC v.1500 32 bit (Intel)] on win32
    Type "help", "copyright", "credits" or "license" for more information.
    >>> exit()

    T:\>
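The same check can be scripted instead of read off the interpreter banner. The snippet below is a small sketch that only inspects `sys.version_info`; `is_python27` is a hypothetical helper name introduced for illustration.

```python
import sys


def is_python27(version_info=None):
    """Return True when the (major, minor) pair is (2, 7)."""
    if version_info is None:
        version_info = sys.version_info
    return tuple(version_info[:2]) == (2, 7)


if __name__ == "__main__":
    print("Python %d.%d detected" % tuple(sys.version_info[:2]))
    print("Matches this guide: %s" % is_python27())
```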

    III. Installing Twisted

    Twisted is an event-driven networking engine written in Python and licensed under the open source MIT license.

    1) Install setuptools

    Download, build, install, upgrade, and uninstall Python packages -- easily!

    Official homepage: http://pypi.python.org/pypi/setuptools

    Download: http://pypi.python.org/packages/2.7/s/setuptools/setuptools-0.6c11.win32-py2.7.exe

    Installation steps: omitted.

    2) Install zope.interface

    Official homepage: http://pypi.python.org/pypi/zope.interface/

    Download: http://pypi.python.org/packages/2.7/z/zope.interface/zope.interface-4.0.1-py2.7-win32.egg

    Take care to choose the build (32-bit or 64-bit) that matches your Python installation. Run the command prompt as administrator, and place the downloaded file in the "D:\Python27\Scripts" folder.
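If it is unclear which build matches the interpreter, the pointer size reported by Python settles it. The following is a rough sketch: `interpreter_bits` and `build_bits` are hypothetical helper names, and `build_bits` only assumes the common `win32` / `amd64` tags in installer file names, so treat its answer as a hint rather than a guarantee.

```python
import struct


def interpreter_bits():
    """Return 32 or 64 depending on the running interpreter."""
    # "P" is a C pointer; its size in bytes times 8 gives the word size.
    return struct.calcsize("P") * 8


def build_bits(filename):
    """Guess the target architecture from an installer or egg file name.

    Returns 64, 32, or None when no known tag is present.
    """
    name = filename.lower()
    if "amd64" in name:
        return 64
    if "win32" in name:
        return 32
    return None


if __name__ == "__main__":
    egg = "zope.interface-4.0.1-py2.7-win32.egg"
    print("interpreter: %d-bit, %s: %s-bit"
          % (interpreter_bits(), egg, build_bits(egg)))
```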

    Installation:

    T:\>d:
    D:\>cd D:\Python27\Scripts
    D:\Python27\Scripts>easy_install.exe zope.interface-4.0.1-py2.7-win32.egg
    Processing zope.interface-4.0.1-py2.7-win32.egg
    creating d:\python27\lib\site-packages\zope.interface-4.0.1-py2.7-win32.egg
    Extracting zope.interface-4.0.1-py2.7-win32.egg to d:\python27\lib\site-packages
    Adding zope.interface 4.0.1 to easy-install.pth file

    Installed d:\python27\lib\site-packages\zope.interface-4.0.1-py2.7-win32.egg
    Processing dependencies for zope.interface==4.0.1
    Finished processing dependencies for zope.interface==4.0.1

    D:\Python27\Scripts>

    Verify the installation:

    D:\Python27\Scripts>python
    Python 2.7.3 (default, Apr 10 2012, 23:31:26) [MSC v.1500 32 bit (Intel)] on win32
    Type "help", "copyright", "credits" or "license" for more information.
    >>> import zope.interface
    >>>

    3) Install Twisted

    Official homepage: http://twistedmatrix.com/trac/wiki/TwistedProject

    Download: http://pypi.python.org/packages/2.7/T/Twisted/Twisted-12.1.0.win32-py2.7.msi

    Installation steps: omitted.

    IV. Installing w3lib

    Official homepage: http://pypi.python.org/pypi/w3lib

    Download: http://pypi.python.org/packages/source/w/w3lib/w3lib-1.2.tar.gz

    Extraction steps: omitted.

    Installation:

    T:\w3lib-1.2>python setup.py install
    running install
    running build
    running build_py
    creating build
    creating build\lib
    creating build\lib\w3lib
    copying w3lib\encoding.py -> build\lib\w3lib
    copying w3lib\form.py -> build\lib\w3lib
    copying w3lib\html.py -> build\lib\w3lib
    copying w3lib\http.py -> build\lib\w3lib
    copying w3lib\url.py -> build\lib\w3lib
    copying w3lib\util.py -> build\lib\w3lib
    copying w3lib\__init__.py -> build\lib\w3lib
    running install_lib
    creating D:\Python27\Lib\site-packages\w3lib
    copying build\lib\w3lib\encoding.py -> D:\Python27\Lib\site-packages\w3lib
    copying build\lib\w3lib\form.py -> D:\Python27\Lib\site-packages\w3lib
    copying build\lib\w3lib\html.py -> D:\Python27\Lib\site-packages\w3lib
    copying build\lib\w3lib\http.py -> D:\Python27\Lib\site-packages\w3lib
    copying build\lib\w3lib\url.py -> D:\Python27\Lib\site-packages\w3lib
    copying build\lib\w3lib\util.py -> D:\Python27\Lib\site-packages\w3lib
    copying build\lib\w3lib\__init__.py -> D:\Python27\Lib\site-packages\w3lib
    byte-compiling D:\Python27\Lib\site-packages\w3lib\encoding.py to encoding.pyc
    byte-compiling D:\Python27\Lib\site-packages\w3lib\form.py to form.pyc
    byte-compiling D:\Python27\Lib\site-packages\w3lib\html.py to html.pyc
    byte-compiling D:\Python27\Lib\site-packages\w3lib\http.py to http.pyc
    byte-compiling D:\Python27\Lib\site-packages\w3lib\url.py to url.pyc
    byte-compiling D:\Python27\Lib\site-packages\w3lib\util.py to util.pyc
    byte-compiling D:\Python27\Lib\site-packages\w3lib\__init__.py to __init__.pyc
    running install_egg_info
    Writing D:\Python27\Lib\site-packages\w3lib-1.2-py2.7.egg-info

    T:\w3lib-1.2>

    Verify the installation:

    T:\>python
    Python 2.7.3 (default, Apr 10 2012, 23:31:26) [MSC v.1500 32 bit (Intel)] on win32
    Type "help", "copyright", "credits" or "license" for more information.
    >>> import w3lib
    >>>

    V. Installing libxml2

    Official homepage: http://users.skynet.be/sbi/libxml-python/

    Download: http://users.skynet.be/sbi/libxml-python/binaries/libxml2-python-2.7.7.win32-py2.7.exe

    Installation: place the installer in "D:\Python27\Scripts", then install it with easy_install.

    D:\Python27\Scripts>easy_install.exe libxml2-python-2.7.7.win32-py2.7.exe

    Verify the installation:

    T:\>python
    Python 2.7.3 (default, Apr 10 2012, 23:31:26) [MSC v.1500 32 bit (Intel)] on win32
    Type "help", "copyright", "credits" or "license" for more information.
    >>> import libxml2
    >>>

    VI. Installing pyOpenSSL

    Official homepage: http://pypi.python.org/pypi/pyOpenSSL

    Download: http://pypi.python.org/packages/2.7/p/pyOpenSSL/pyOpenSSL-0.13.winxp32-py2.7.msi

    Installation: install with pip (this may not have succeeded?)

    pip install pyopenssl

    Verify the installation: note that the module name OpenSSL after import is case-sensitive.

    T:\>python
    Python 2.7.3 (default, Apr 10 2012, 23:31:26) [MSC v.1500 32 bit (Intel)] on win32
    Type "help", "copyright", "credits" or "license" for more information.
    >>> import OpenSSL
    >>>
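The individual import checks in the sections above can be rolled into one pass before moving on to Scrapy itself. This is a sketch only: `check_modules` and `PREREQUISITES` are hypothetical names introduced here, and the list simply repeats the import names used in the verification steps, plus `twisted`, which is assumed to be the import name of the Twisted package.

```python
import importlib

# Import names from the verification steps in this guide.
# "twisted" is an assumption; the guide does not show its import check.
PREREQUISITES = ["zope.interface", "twisted", "w3lib", "libxml2", "OpenSSL"]


def check_modules(names):
    """Try to import each module and return a dict of name -> True/False."""
    results = {}
    for name in names:
        try:
            importlib.import_module(name)
            results[name] = True
        except ImportError:
            # The module is not installed or not importable.
            results[name] = False
    return results
```

Calling `check_modules(PREREQUISITES)` in the Python 2.7 interpreter returns a dictionary; any entry mapped to `False` points at the section to revisit. Module names are case-sensitive, so `OpenSSL` must be spelled exactly as shown.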

    VII. Installing Scrapy

    Official homepage: http://scrapy.org/

    Download: http://pypi.python.org/packages/source/S/Scrapy/Scrapy-0.14.4.tar.gz

    Extraction steps: omitted.

    Installation:

    T:\Scrapy-0.14.4>python setup.py install

    ……
    Installing easy_install-2.7-script.py script to D:\Python27\Scripts
    Installing easy_install-2.7.exe script to D:\Python27\Scripts
    Installing easy_install-2.7.exe.manifest script to D:\Python27\Scripts

    Using d:\python27\lib\site-packages
    Finished processing dependencies for Scrapy==0.14.4

    T:\Scrapy-0.14.4>

    Verify the installation:

    T:\>scrapy
    Scrapy 0.14.4 - no active project

    Usage:
      scrapy <command> [options] [args]

    Available commands:
      fetch         Fetch a URL using the Scrapy downloader
      runspider     Run a self-contained spider (without creating a project)
      settings      Get settings values
      shell         Interactive scraping console
      startproject  Create new project
      version       Print Scrapy version
      view          Open URL in browser, as seen by Scrapy

    Use "scrapy <command> -h" to see more info about a command

    T:\>
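When this check is automated, for example in a setup script, the first line of the `scrapy` output can be parsed to confirm which version was installed. The helper below is a sketch that assumes the banner format shown in the transcript above ("Scrapy 0.14.4 - no active project"); `parse_scrapy_banner` is a hypothetical name, and other Scrapy releases may print a different banner.

```python
import re


def parse_scrapy_banner(line):
    """Extract the version from a banner such as
    'Scrapy 0.14.4 - no active project'.

    Returns a tuple of ints, or None when the line is not a banner.
    """
    match = re.match(r"Scrapy (\d+(?:\.\d+)*)", line.strip())
    if match is None:
        return None
    return tuple(int(part) for part in match.group(1).split("."))
```

A result of `None` usually means the command was not found, which points back to the PATH setup in section II.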
  • Original article: https://www.cnblogs.com/wy1290939507/p/4685705.html