zoukankan      html  css  js  c++  java
  • Python scrapy

    https://stackoverflow.com/questions/37841409/python-scrapy-login-authenication-issue

    from scrapy.crawler import CrawlerProcess
    import scrapy
    
    from scrapy.http import Request
    
    class FirstSpider(scrapy.Spider):
        name = "first"
        allowed_domains = ["instacart.com"]
        start_urls = [
            "https://www.instacart.com"
        ]
    
        def start_requests(self):
            return [Request(url="https://www.instacart.com", callback=self.login)]
    
        def login(self, response):
            return scrapy.FormRequest('https://www.instacart.com/accounts/login',
                                      headers={"X-Requested-With": "XMLHttpRequest"},
                                      formdata={'user[email]': 'xxxxxxx@gmail.com', 'user[password]': 'xxxxx',
                                                "authenticity_token": response.xpath(
                                                    "//meta[@name='csrf-token']/@content").extract_first()},
                                      callback=self.parse,dont_filter=True)
    
    
        def parse(self, response):
            print(response.body)
            if "Goutam" in response.body:
                print "Successfully logged in. Let's start crawling!"
            else:
                print "Login unsuccessful"
    -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- focus on what you want to be
  • 相关阅读:
    编程思想之正则表达式
    SQL查询顺序
    hibernate inverse属性的作用
    介绍一下Hibernate的二级缓存
    JSON数据
    你没玩过的全新版本!Win10这些骚操作你知多少
    VSCode 小鸡汤 第01期
    Editor REST Client
    k8s常用命令
    【项目3-2】多肉植物网站
  • 原文地址:https://www.cnblogs.com/bamboozone/p/10455063.html
Copyright © 2011-2022 走看看