zoukankan      html  css  js  c++  java
  • (一)requests-实战小练习

    1.需求:爬取豆瓣电影分类排行榜 https://movie.douban.com/中的电影详情数据
     (此处以抓取科幻电影分类的信息为例)
    import requests
    import json
    url = 'https://movie.douban.com/j/new_search_subjects'
    params= {
        'sort': 'U',
        'range': '0,10',
        'tags':'' ,
        'start': '40',
        'genres': '科幻'
    }
    headers = {
        'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/81.0.4044.92 Safari/537.36'
    }
    response = requests.get(url=url, params=params, headers=headers)
    ret = response.json()
    fileName = 'files/'+ 'douban.json'
    with open(fileName,'w',encoding='utf-8') as f:
        f.write(json.dumps(ret, ensure_ascii=False))
    print('work is done')

    2.爬取肯德基餐厅查询http://www.kfc.com.cn/kfccda/index.aspx中指定地点的餐厅数据

    import requests
    import json
    if __name__ == '__main__':
        url ='http://www.kfc.com.cn/kfccda/ashx/GetStoreList.ashx?op=keyword'
        keyword = input('请输入搜索地点: ')
        data={
            'cname':'',
            'pid': '',
            'keyword': keyword,
            'pageIndex': 1,
            'pageSize': 10,
        }
        headers = {
            'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/81.0.4044.92 Safari/537.36'
        }
        response = requests.post(url=url, data=data, headers=headers)
        page_info = response.text
        print(page_info)
        fileName = 'files/KFC.json'
        with open(fileName, 'w', encoding='utf-8') as f:
            f.write(json.dumps(page_info, ensure_ascii=False))
        print('work is done')
     
  • 相关阅读:
    Java NIO开发需要注意的陷阱(转)
    Java Nio注意事项
    NIO的介绍及使用(总结)
    蓝萝卜blu netty3升netty4
    tcp nio 远程主机强迫关闭了一个现有的连接
    java 竖线分割字符串的问题
    15个免费好用的抓包工具
    JSP 基础之 JSTL <c:forEach>用法
    JSP中多条件判断
    怎么不让控制台system.out.println()打印
  • 原文地址:https://www.cnblogs.com/sxy-blog/p/13213853.html
Copyright © 2011-2022 走看看