zoukankan      html  css  js  c++  java
  • Python scrapy

    https://stackoverflow.com/questions/37841409/python-scrapy-login-authenication-issue

    from scrapy.crawler import CrawlerProcess
    import scrapy
    
    from scrapy.http import Request
    
    class FirstSpider(scrapy.Spider):
        name = "first"
        allowed_domains = ["instacart.com"]
        start_urls = [
            "https://www.instacart.com"
        ]
    
        def start_requests(self):
            return [Request(url="https://www.instacart.com", callback=self.login)]
    
        def login(self, response):
            return scrapy.FormRequest('https://www.instacart.com/accounts/login',
                                      headers={"X-Requested-With": "XMLHttpRequest"},
                                      formdata={'user[email]': 'xxxxxxx@gmail.com', 'user[password]': 'xxxxx',
                                                "authenticity_token": response.xpath(
                                                    "//meta[@name='csrf-token']/@content").extract_first()},
                                      callback=self.parse,dont_filter=True)
    
    
        def parse(self, response):
            print(response.body)
            if "Goutam" in response.body:
                print "Successfully logged in. Let's start crawling!"
            else:
                print "Login unsuccessful"
    -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- focus on what you want to be
  • 相关阅读:
    Java项目远程调试
    Linux虚拟机安装
    研究Quarkus出现的各种错误集锦
    debian 安装 ifconfig ping
    adb shell pm list packages 高级用法
    2020年深港澳金融科技师专才计划一级考试报名简章
    k8s组件
    查询Kubernetes中etcd数据
    curl
    kubelet
  • 原文地址:https://www.cnblogs.com/bamboozone/p/10455063.html
Copyright © 2011-2022 走看看