zoukankan      html  css  js  c++  java
  • (一)requests-实战小练习

    1.需求:爬取豆瓣电影分类排行榜 https://movie.douban.com/中的电影详情数据
     (此处以抓取科幻电影分类的信息为例)
    import requests
    import json
    url = 'https://movie.douban.com/j/new_search_subjects'
    params= {
        'sort': 'U',
        'range': '0,10',
        'tags':'' ,
        'start': '40',
        'genres': '科幻'
    }
    headers = {
        'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/81.0.4044.92 Safari/537.36'
    }
    response = requests.get(url=url, params=params, headers=headers)
    ret = response.json()
    fileName = 'files/'+ 'douban.json'
    with open(fileName,'w',encoding='utf-8') as f:
        f.write(json.dumps(ret, ensure_ascii=False))
    print('work is done')

    2.爬取肯德基餐厅查询http://www.kfc.com.cn/kfccda/index.aspx中指定地点的餐厅数据

    import requests
    import json
    if __name__ == '__main__':
        url ='http://www.kfc.com.cn/kfccda/ashx/GetStoreList.ashx?op=keyword'
        keyword = input('请输入搜索地点: ')
        data={
            'cname':'',
            'pid': '',
            'keyword': keyword,
            'pageIndex': 1,
            'pageSize': 10,
        }
        headers = {
            'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/81.0.4044.92 Safari/537.36'
        }
        response = requests.post(url=url, data=data, headers=headers)
        page_info = response.text
        print(page_info)
        fileName = 'files/KFC.json'
        with open(fileName, 'w', encoding='utf-8') as f:
            f.write(json.dumps(page_info, ensure_ascii=False))
        print('work is done')
     
  • 相关阅读:
    Leetcode:42. Trapping Rain Water
    Leetcode: 41. First Missing Positive
    Leetcode: 40. Combination Sum II
    多项式全家桶
    BZOJ 3878 [AHOI&JSOI2014]奇怪的计算器 (线段树)
    BZOJ 2959 长跑 (LCT+并查集)
    BZOJ 3028 食物 (生成函数+数学题)
    luogu P5504 [JSOI2011]柠檬
    hdu 6399 City Development
    luogu P3826 [NOI2017]蔬菜
  • 原文地址:https://www.cnblogs.com/sxy-blog/p/13213853.html
Copyright © 2011-2022 走看看