  • Python + WebDriver + PhantomJS: setting a different User-Agent yields a different URL

    Without further ado, here is the code:

    u"""
    @author: xieyanfen1990@126.com
    @attention: drives PhantomJS through WebDriver with a custom User-Agent
    """

    from selenium import webdriver
    from selenium.webdriver.common.desired_capabilities import DesiredCapabilities

    url = "http://m.dianping.com/forum/note/7868689"

    # Desktop Firefox User-Agent
    ua1 = "Mozilla/5.0 (Windows NT 6.1; WOW64; rv:25.0) Gecko/20100101 Firefox/25.0"

    # Copy the default PhantomJS capabilities and override the User-Agent
    dcap = dict(DesiredCapabilities.PHANTOMJS)
    dcap["phantomjs.page.settings.userAgent"] = ua1

    driver = webdriver.PhantomJS(desired_capabilities=dcap)
    driver.get(url)
    current_url = driver.current_url
    driver.quit()  # stop this PhantomJS process before starting the next one

    print("the first url is :" + current_url)

    # Mobile Safari (iPhone) User-Agent
    ua2 = "Mozilla/5.0 (iPhone; CPU iPhone OS 7_0 like Mac OS X; en-us) AppleWebKit/537.51.1 (KHTML, like Gecko) Version/7.0 Mobile/11A465 Safari/9537.53"
    dcap["phantomjs.page.settings.userAgent"] = ua2

    # Capabilities are applied when the session is created, so start a new driver
    driver = webdriver.PhantomJS(desired_capabilities=dcap)
    driver.get(url)
    current_url = driver.current_url
    driver.quit()

    print("the second url is :" + current_url)

    Running it produces:

    CindydeMacBook-Pro:httpreader cindy$ python test.py
    the first url is :http://s.dianping.com/topic/7868689
    the second url is :http://m.dianping.com/forum/note/7868689
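
The post does not say why the two URLs differ. One plausible explanation, offered here only as an assumption, is that the server redirects based on the User-Agent header: the desktop Firefox string is sent to another page while the iPhone string stays on the mobile URL. The sketch below reproduces that pattern locally with the Python 3 standard library only. The handler `UARedirectHandler`, the paths `/mobile` and `/desktop`, and the helper `final_url` are all hypothetical names chosen for illustration; nothing here reflects how dianping.com is actually implemented.

```python
import threading
import urllib.request
from http.server import BaseHTTPRequestHandler, HTTPServer

DESKTOP_UA = "Mozilla/5.0 (Windows NT 6.1; WOW64; rv:25.0) Gecko/20100101 Firefox/25.0"
MOBILE_UA = ("Mozilla/5.0 (iPhone; CPU iPhone OS 7_0 like Mac OS X; en-us) "
             "AppleWebKit/537.51.1 (KHTML, like Gecko) Version/7.0 "
             "Mobile/11A465 Safari/9537.53")


class UARedirectHandler(BaseHTTPRequestHandler):
    """Hypothetical server: non-iPhone clients asking for /mobile go to /desktop."""

    def do_GET(self):
        ua = self.headers.get("User-Agent", "")
        if self.path == "/mobile" and "iPhone" not in ua:
            # Assumed behavior: redirect desktop browsers away from the mobile page
            self.send_response(302)
            self.send_header("Location", "/desktop")
            self.end_headers()
            return
        body = b"ok"
        self.send_response(200)
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

    def log_message(self, *args):
        # Silence the default per-request logging
        pass


def final_url(url, user_agent):
    """Fetch url with the given User-Agent and return the URL after redirects."""
    # An empty ProxyHandler keeps the local request from going through a proxy
    opener = urllib.request.build_opener(urllib.request.ProxyHandler({}))
    req = urllib.request.Request(url, headers={"User-Agent": user_agent})
    with opener.open(req) as resp:
        return resp.geturl()


def demo():
    """Start the local server, request /mobile with both User-Agents."""
    server = HTTPServer(("127.0.0.1", 0), UARedirectHandler)  # port 0: pick a free port
    threading.Thread(target=server.serve_forever, daemon=True).start()
    try:
        url = "http://127.0.0.1:%d/mobile" % server.server_port
        return final_url(url, DESKTOP_UA), final_url(url, MOBILE_UA)
    finally:
        server.shutdown()
        server.server_close()


if __name__ == "__main__":
    first, second = demo()
    print("the first url is :" + first)
    print("the second url is :" + second)
```

With this toy server the desktop request ends on `/desktop` and the iPhone request stays on `/mobile`, which mirrors the shape of the result above. A redirect performed by JavaScript in the page rather than by an HTTP status code would look the same from `driver.current_url`, so this sketch cannot tell the two cases apart.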
  • Original article: https://www.cnblogs.com/cindy-2014/p/4323772.html