zoukankan      html  css  js  c++  java
  • Python scrapy

    https://stackoverflow.com/questions/37841409/python-scrapy-login-authenication-issue

    from scrapy.crawler import CrawlerProcess
    import scrapy
    
    from scrapy.http import Request
    
    class FirstSpider(scrapy.Spider):
        name = "first"
        allowed_domains = ["instacart.com"]
        start_urls = [
            "https://www.instacart.com"
        ]
    
        def start_requests(self):
            return [Request(url="https://www.instacart.com", callback=self.login)]
    
        def login(self, response):
            return scrapy.FormRequest('https://www.instacart.com/accounts/login',
                                      headers={"X-Requested-With": "XMLHttpRequest"},
                                      formdata={'user[email]': 'xxxxxxx@gmail.com', 'user[password]': 'xxxxx',
                                                "authenticity_token": response.xpath(
                                                    "//meta[@name='csrf-token']/@content").extract_first()},
                                      callback=self.parse,dont_filter=True)
    
    
        def parse(self, response):
            print(response.body)
            if "Goutam" in response.body:
                print "Successfully logged in. Let's start crawling!"
            else:
                print "Login unsuccessful"
    -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- focus on what you want to be
  • 相关阅读:
    leetcode16 3-Sum
    leetcode679:24Game
    leetcode621 贪心:任务安排
    SpringMVC中的Controller默认单例
    O(n)复杂度求没有出现的数字(leetcode448)
    SpringBoot获取ApplicationContext
    Largest Number(leetcode 179)
    使用免费ip代理进行投票
    Oracle分页
    Oracle JDBC配置
  • 原文地址:https://www.cnblogs.com/bamboozone/p/10455063.html
Copyright © 2011-2022 走看看