  • 05 Data Scraping (Supplement)

    Adapted from a script that scrapes the data and lets you look it up by entering a province name.

    import requests
    import json
    import mysql.connector
    import time
    
    def Down_data():  # fetch the JSON
        url = 'https://view.inews.qq.com/g2/getOnsInfo?name=disease_h5'
        headers = {  # spoof a browser request header
            'user-agent': 'Mozilla/5.0 (Linux; Android 6.0; Nexus 5 Build/MRA58N) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/78.0.3904.108 Mobile Safari/537.36'
        }
        # headers must be passed by keyword; the second positional argument of requests.get is params
        r = requests.get(url, headers=headers)
        res = json.loads(r.text)
        data_res = json.loads(res['data'])  # the 'data' field is itself a JSON string
        return data_res
     
    mydb = mysql.connector.connect(  # connect to the database
      host="localhost",
      user="root",
      passwd="Asd1402306745@qq.com",
      database="uu_zaa_db"
    )
    mycursor = mydb.cursor()
    # Main work starts here
    payload = Down_data()  # fetch once instead of downloading the JSON twice
    data = payload['areaTree'][0]['children']  # kept in a variable for convenience below
    lastTime = payload['lastUpdateTime']  # the update time of the scraped JSON is at data -> lastUpdateTime
    sql = "INSERT INTO info1 (Date, Province, City, Confirmed_num, Yisi_num, Cured_num, Dead_num) VALUES (%s, %s, %s, %s, %s, %s, %s)"
    for i in data:
        # if i['children']:  # would filter out foreign entries (analysing the JSON showed they have no children; hard-coding every province name would also work, but I was too lazy)
        # Province row: by the table's convention the City column is an empty string; Yisi_num (suspected cases) is left empty
        ipt = (str(lastTime), str(i['name']), '', str(i['total']['confirm']), '', str(i['total']['heal']), str(i['total']['dead']))
        mycursor.execute(sql, ipt)
        for item in i['children']:  # one row per city
            ipt = (str(lastTime), str(i['name']), str(item['name']), str(item['total']['confirm']), '', str(item['total']['heal']), str(item['total']['dead']))
            mycursor.execute(sql, ipt)
    
    mydb.commit()  # commit the changes
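
The row-building part of the script can be tried out without a network connection or a database. The following is only a sketch under assumptions: `build_rows` is a hypothetical helper that is not in the original script, the sample data is made up to mimic the shape of the scraped JSON, and `.get('children', [])` is my own addition to tolerate entries that have no `children` key.

```python
def build_rows(last_time, provinces):
    """Flatten the province/city tree into tuples in the info1 column order."""
    rows = []
    for province in provinces:
        total = province['total']
        # Province-level row: City is an empty string, Yisi_num is left empty.
        rows.append((str(last_time), str(province['name']), '',
                     str(total['confirm']), '', str(total['heal']), str(total['dead'])))
        # Assumption: entries without a 'children' key simply yield no city rows.
        for city in province.get('children', []):
            city_total = city['total']
            rows.append((str(last_time), str(province['name']), str(city['name']),
                         str(city_total['confirm']), '',
                         str(city_total['heal']), str(city_total['dead'])))
    return rows


# Made-up sample in the same shape as the scraped JSON (not real figures).
sample = [
    {'name': 'ProvinceA', 'total': {'confirm': 10, 'heal': 7, 'dead': 1},
     'children': [{'name': 'CityA1', 'total': {'confirm': 6, 'heal': 4, 'dead': 1}}]},
    {'name': 'ProvinceB', 'total': {'confirm': 3, 'heal': 3, 'dead': 0}},
]
rows = build_rows('2020-07-18 10:00:00', sample)
for row in rows:
    print(row)
```

With the cursor from the script, the resulting list could then be written in one call via `mycursor.executemany(sql, rows)` followed by `mydb.commit()`.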

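    Python is actually quick to pick up. Since I already had (what I consider to be) disciplined indentation habits, Python's mandatory indentation and the like did not bother me at all.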

  • Original article: https://www.cnblogs.com/minadukirinno/p/13337246.html