zoukankan      html  css  js  c++  java
  • Python scrapy

    https://stackoverflow.com/questions/37841409/python-scrapy-login-authenication-issue

    from scrapy.crawler import CrawlerProcess
    import scrapy
    
    from scrapy.http import Request
    
    class FirstSpider(scrapy.Spider):
        name = "first"
        allowed_domains = ["instacart.com"]
        start_urls = [
            "https://www.instacart.com"
        ]
    
        def start_requests(self):
            return [Request(url="https://www.instacart.com", callback=self.login)]
    
        def login(self, response):
            return scrapy.FormRequest('https://www.instacart.com/accounts/login',
                                      headers={"X-Requested-With": "XMLHttpRequest"},
                                      formdata={'user[email]': 'xxxxxxx@gmail.com', 'user[password]': 'xxxxx',
                                                "authenticity_token": response.xpath(
                                                    "//meta[@name='csrf-token']/@content").extract_first()},
                                      callback=self.parse,dont_filter=True)
    
    
        def parse(self, response):
            print(response.body)
            if "Goutam" in response.body:
                print "Successfully logged in. Let's start crawling!"
            else:
                print "Login unsuccessful"
    -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- focus on what you want to be
  • 相关阅读:
    垃圾回收机制,正则模块
    日常模块
    文件路径带有字符串的处理方法
    QT进制之间的相互转换
    4-7 selectors模块
    4-5 异步IO模型
    4-4 多路复用IO模型
    4-3 非阻塞IO
    4-2 阻塞IO
    4-1 IO模型介绍
  • 原文地址:https://www.cnblogs.com/bamboozone/p/10455063.html
Copyright © 2011-2022 走看看