  • XPath and CSS selectors

    https://docs.scrapy.org/en/latest/intro/tutorial.html

    XPath: @ selects an attribute, . selects the current node, and // selects matching nodes at any depth.

    /bookstore/book[position()<3] selects the first two book elements that are children of the bookstore element.
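
To make the XPath notes above concrete, here is a minimal sketch using only Python's standard library. The XML document, its titles, and the variable names are made up for illustration. Note that xml.etree.ElementTree implements only a subset of XPath: it does not support position()<3, so the "first two book children" selection is emulated with a slice, and paths are written relative to the root element.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample document, invented for this sketch.
XML = """<bookstore>
  <book category="cooking"><title lang="en">Everyday Italian</title></book>
  <book category="children"><title lang="en">Harry Potter</title></book>
  <book category="web"><title lang="zh">Learning XML</title></book>
</bookstore>"""

root = ET.fromstring(XML)  # root is the <bookstore> element

# "." is the current node, so "./book" means: book children of bookstore.
# ElementTree lacks position()<3, so a slice keeps the first two matches.
first_two = root.findall("./book")[:2]
first_two_titles = [book.find("title").text for book in first_two]

# "//" matches at any depth below the current node.
all_titles = [t.text for t in root.findall(".//title")]

# "@" refers to an attribute; here it filters titles by their lang value.
english_titles = [t.text for t in root.findall(".//title[@lang='en']")]

print(first_two_titles)
print(all_titles)
print(english_titles)
```

In Scrapy itself the full expression /bookstore/book[position()<3] can be passed to response.xpath(), since Scrapy's selectors support complete XPath 1.0.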

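    CSS: span.text::text selects the text nodes inside span elements that have the class "text".

    response.css("span.text").text().get() fails with AttributeError: 'SelectorList' object has no attribute 'text'. The text is selected with the ::text pseudo-element inside the selector, not with a method call.

    quote.css("span.text::text").get() returns the text content of the first span with class "text" inside quote.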

     

    scrapy crawl quotes -o quotes.json
    

    That will generate a quotes.json file containing all scraped items, serialized in JSON.

    For historic reasons, Scrapy appends to a given file instead of overwriting its contents. If you run this command twice without removing the file before the second time, you’ll end up with a broken JSON file.
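
Why appending breaks the file can be shown with the standard json module alone. The sketch below assumes each run writes one complete JSON array, so two runs leave two arrays back to back in the same file. The item contents are invented for illustration.

```python
import json

# One run's output: a complete JSON array (hypothetical item).
one_run = json.dumps([{"text": "Quote one.", "author": "Author A"}])

# Appending a second run puts a second array after the first.
after_two_runs = one_run + "\n" + one_run

# A single run parses fine.
items = json.loads(one_run)

# Two concatenated arrays are not one valid JSON document.
try:
    json.loads(after_two_runs)
    parsed_ok = True
except json.JSONDecodeError:
    parsed_ok = False

print(len(items), parsed_ok)
```

Deleting quotes.json before running the crawl again avoids the problem.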

  • Original post: https://www.cnblogs.com/bamboozone/p/10371485.html