zoukankan      html  css  js  c++  java
  • 4.22

    re练习⼿刃⾖瓣TOP250电影信息

    import requests
    headers = {
     "user-agent": "Mozilla/5.0 (Macintosh; Intel
    Mac OS X 10_15_4) AppleWebKit/537.36 (KHTML, like
    Gecko) Chrome/87.0.4280.88 Safari/537.36"
    }
    url = "https://movie.douban.com/top250?
    start=0&filter="
    resp = requests.get(url, headers=headers)
    print(resp.text)
    obj = re.compile(r'<li>.*?<div class="item">.*?
    <div class="pic">.*?<em class="">(?P<num>d+)
    </em>'
     r'.*?<span class="title">(?
    P<name>.*?)</span>'
     r'.*?<p class="">.*?<br>
    (?
    P<year>.*?)&nbsp;'
     r'.*?property="v:average">(?
    P<average>.*?)</span>'
     r'.*?<span>(?P<people>d+)⼈评价
    </span>', re.S)
    it = obj.finditer(resp.text)
    with open("movie.csv", mode="w", encoding="utf-8")
    as f:
     csvwriter = csv.writer(f) # 创建csv⽂件写⼊⼯具,
    也可以直接f.write()
     for item in it:
     dic = item.groupdict()
     dic['year'] = dic['year'].strip()
     csvwriter.writerow(dic.values()) # 写⼊数

  • 相关阅读:
    VS中的DataPager分页
    获取select标签选中的值
    JS判断包括IE11在内的IE浏览器
    几款jQuery右键菜单插件
    java day2
    java day1
    转换成(大)小写字母
    模态框扩展
    自定义动画,点赞
    反选
  • 原文地址:https://www.cnblogs.com/zhaoyids/p/14903831.html
Copyright © 2011-2022 走看看