  • Multithreaded collection of Baidu related-search keywords with Python

        Scrape Baidu related-search keywords: read the seed keywords from a txt file and export the collected keywords to a txt file.

      

    # Scrape Baidu related-search keywords: read keywords from a txt file, export keywords to a txt file
     
    # -*- coding=utf-8 -*-
    import requests
    import re
    import time
    from multiprocessing.dummy import Pool as ThreadPool
     
     
    # Look up the Baidu related-search keywords for one results page
    def xgss(url):
        headers = {
            "User-Agent":"Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/68.0.3440.106 Safari/537.36"
        }
        # A timeout keeps a stalled request from blocking its worker thread forever
        html=requests.get(url,headers=headers,timeout=10).text
        # Outer pattern: the "related searches" (相关搜索) table on the results page
        ze=r'<div id="rs"><div class="tt">相关搜索</div><table cellpadding="0">(.+?)</table></div>'
        blocks=re.findall(ze,html,re.S)
        # Inner pattern: one (href, keyword) pair per table cell
        xgze=r'<th><a href="(.+?)">(.+?)</a></th>'
        sj=re.findall(xgze,''.join(blocks),re.S)
        gjc=''
        for x in sj:
            print(x[1])
            gjc=gjc+x[1]+'\n'
     
        # Append the keywords to the output txt file
        with open("gjcsj.txt", 'a', encoding='utf-8') as f:
            f.write(gjc)
        print("-----------------------------------")
        return gjc
     
     
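    print("Program started, importing the keyword list...")
    print("-----------------------------------")
    # Import the txt list of keywords to search for
    urls = []
    data = []
    with open('gjc.txt', "r", encoding='utf-8') as f:
        for line in f:
            # Drop the trailing newline and skip blank lines
            if line.strip():
                data.append(line.strip())
    print("Keyword list imported successfully!")
    print("-----------------------------------")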
     
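    # Convert each keyword into a search URL
    for keyword in data:
        # Strip stray whitespace and percent-encode the keyword before appending it
        url = 'https://www.baidu.com/s?wd=' + requests.utils.quote(keyword.strip())
        urls.append(url)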
     
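    print("Starting collection of Baidu related-search keywords!")
    print("...................")
    # Fetch the related keywords with multiple threads
    try:
        # With no argument the pool size defaults to the number of CPU cores; use ThreadPool(4) for 4 workers
        pool = ThreadPool()
        results = pool.map(xgss, urls)
        pool.close()
        pool.join()
        print("Collection of Baidu related-search keywords finished; results saved to gjcsj.txt!")
    except Exception as e:
        # Any exception raised inside a worker (for example a failed request) is re-raised by pool.map
        print("Error: collection failed:", e)
     
    print("The program will close automatically in 8 s!")
    time.sleep(8)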
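
The script builds each search URL by appending the keyword to a fixed prefix. Below is a minimal sketch of a variant that lets the standard library do the encoding, so that spaces, `&` and trailing newlines in a keyword cannot corrupt the query string. `build_search_url` is a hypothetical helper name and is not part of the original script.

```python
from urllib.parse import urlencode

BASE_URL = "https://www.baidu.com/s"


def build_search_url(keyword):
    """Return a Baidu search URL with the keyword safely encoded in the wd parameter."""
    # strip() removes the newline left over from reading the keyword file line by line
    return BASE_URL + "?" + urlencode({"wd": keyword.strip()})


if __name__ == "__main__":
    print(build_search_url("a b\n"))
```

`urlencode` encodes non-ASCII keywords as UTF-8 percent escapes by default, so Chinese keywords can be passed in unchanged.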
    
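The two-step regular-expression extraction used in `xgss` can be tried offline. The sketch below runs the same two patterns against a hand-written HTML fragment. `extract_related` and `SAMPLE_HTML` are hypothetical names, the fragment is made up to match the patterns, and the markup of the live Baidu page may differ or may have changed since the script was written.

```python
import re

# Outer pattern: the "related searches" table, as assumed by the original script.
BLOCK_RE = re.compile(
    r'<div id="rs"><div class="tt">相关搜索</div><table cellpadding="0">(.+?)</table></div>',
    re.S,
)
# Inner pattern: one (href, keyword) pair per table header cell.
LINK_RE = re.compile(r'<th><a href="(.+?)">(.+?)</a></th>', re.S)


def extract_related(html):
    """Return the related keywords in page order, without duplicates."""
    keywords = []
    for block in BLOCK_RE.findall(html):
        for _href, text in LINK_RE.findall(block):
            if text not in keywords:
                keywords.append(text)
    return keywords


# Hypothetical fragment, hand-written to match the patterns above.
SAMPLE_HTML = (
    '<div id="rs"><div class="tt">相关搜索</div><table cellpadding="0">'
    '<tr><th><a href="/s?wd=a">python tutorial</a></th>'
    '<th><a href="/s?wd=b">python download</a></th></tr>'
    '</table></div>'
)

if __name__ == "__main__":
    print(extract_related(SAMPLE_HTML))
```

If the outer pattern does not match, for example because the page layout changed or an anti-bot page was returned, the function returns an empty list instead of raising.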

      
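In the script every worker thread appends to the output file on its own. One possible alternative, sketched here under stated assumptions, is to let the workers only return their results and to write the file once from the main thread. `fake_related` is a hypothetical stand-in for the network call so that the example runs offline, and `gjcsj_demo.txt` is a made-up file name.

```python
from multiprocessing.dummy import Pool as ThreadPool


def fake_related(keyword):
    """Hypothetical stand-in for the network call; returns made-up suggestions."""
    return [keyword + " tutorial", keyword + " download"]


def collect(keywords, workers=4):
    """Run fake_related over the keywords in a thread pool and flatten the results."""
    with ThreadPool(workers) as pool:
        # map() blocks until all workers finish and keeps the input order.
        per_keyword = pool.map(fake_related, keywords)
    return [item for items in per_keyword for item in items]


if __name__ == "__main__":
    lines = collect(["python", "java"])
    # Written once, from the main thread, after all workers have finished.
    with open("gjcsj_demo.txt", "w", encoding="utf-8") as f:
        f.write("\n".join(lines) + "\n")
```

Because `pool.map` returns results in input order, the output file lists the suggestions in the same order as the keyword list, which is not guaranteed when each thread appends independently.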

  • Original article: https://www.cnblogs.com/68xi/p/11658324.html