zoukankan      html  css  js  c++  java
  • 拉钩爬虫

    拉钩

    ajax请求,cookies反爬

    # 第一页
    # https://www.lagou.com/jobs/list_python/p-city_6
    '''
    TG-TRACK-CODE=search_code; user_trace_token=20200106214534-53c939b1-10b4-45a1-bb34-daebd661d4ab;
    X_HTTP_TOKEN=acb1a28e7bde8ee74338138751eaff2f5fc5651c92; WEBTJ-ID=20200106214541-16f7b1aa6b61-0363b02d88bb6-2393f61-2073600-16f7b1aa6b719;
    JSESSIONID=ABAAABAABEEAAJAA8CD0EDA72E15C2EF8CEA34B3CEB748A; _ga=GA1.2.1798328149.157831
    '''
    '''
    TG-TRACK-CODE=search_code; user_trace_token=20200106214534-53c939b1-10b4-45a1-bb34-daebd661d4ab; 
    X_HTTP_TOKEN=acb1a28e7bde8ee74338138751eaff2f5fc5651c92; WEBTJ-ID=20200106214541-16f7b1aa6b61-0363b02d88bb6-2393f61-2073600-16f7b1aa6b719;
     JSESSIONID=ABAAABAABEEAAJAA8CD0EDA72E15C2EF8CEA34B3CEB748A; _ga=GA1.2.1798328149.157831
    '''
    import requests
    
    header1 = {
        'user-agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/78.0.3904.108 Safari/537.36',
    }
    
    session = requests.session()
    r = session.get(url='https://www.lagou.com/jobs/list_python/p-city_6', headers=header1)
    
    header2 = {
        'user-agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/78.0.3904.108 Safari/537.36',
        'Referer': 'https://www.lagou.com/jobs/list_python',
    
    }
    
    
    for i in range(10):
        data = {
            'first': False,
            'pn': i+2,
            'kd': 'python'
        }
        jobs = session.post(url='https://www.lagou.com/jobs/positionAjax.json?city=%E6%9D%AD%E5%B7%9E&needAddtionalResult=false',
                        headers=header2,data=data)
        print(jobs.json())
    
    
  • 相关阅读:
    AWK 学习手札, 转载自lovelyarry
    Perl 学习手札之一: introduction
    开发者必看:iOS应用审核的通关秘籍
    Perl 学习手札之三: General syntax
    Perl 学习手札之二: Guide to experienced programmers
    RepotService添加空格符
    CSMS2软件架构
    关于Oracle的动态查询
    CSMS2公共方法
    CSMS2绑定数据
  • 原文地址:https://www.cnblogs.com/zx125/p/12162702.html
Copyright © 2011-2022 走看看