[Python Crawler] Downloading the virtual tour *.swf files from the Wujiang Tourism website

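Recently a friend and I have been building a small project that needs the virtual tour *.swf files from the Wujiang Tourism website, so I read a few Python crawler tutorials and wrote a very small crawler.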

The steps are as follows:

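1. Analyze the address. Open the browser's developer tools, switch to the Network tab, and then browse the page. You can see that each resource is requested with the GET method and without any parameters.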
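As a rough illustration of what "GET with no parameters" means on the Python side, the sketch below builds a request object for one of the files without sending it. It assumes Python 3, where `urllib.request` replaces `urllib2`, and file number 01 is used only as an example.

```python
import urllib.request
from urllib.parse import urlsplit

# Example URL following the pattern seen in the Network tab (file 01 is an assumption).
url = "http://www.wjtour.gov.cn/virtualtour/jsy/tour01.swf"

# A Request created without a data payload uses the GET method.
request = urllib.request.Request(url)
method = request.get_method()

# An empty query string means no parameters are sent with the request.
query = urlsplit(url).query

print(method, repr(query))
```

Nothing is sent over the network here; the request object is only inspected.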

2. Construct the URLs. I guessed there would be 30 of them.

# range(1,31) covers 1..30; range(1,30) would stop at 29
for no in range(1,31):
	# pad single-digit numbers with a leading zero: 1 -> "01"
	name="%02d" % no
	url="http://www.wjtour.gov.cn/virtualtour/jsy/tour"+name+".swf"
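The zero-padding can also be left entirely to string formatting. A minimal sketch in Python 3, where `build_urls` and `BASE` are names made up for this example and the prefix is copied from the loop above:

```python
# Prefix copied from the loop above; BASE and build_urls are hypothetical names.
BASE = "http://www.wjtour.gov.cn/virtualtour/jsy/tour"

def build_urls(count):
    """Return the candidate URLs tour01.swf through tour<count>.swf."""
    # {no:02d} pads single-digit numbers with a leading zero.
    return [f"{BASE}{no:02d}.swf" for no in range(1, count + 1)]

urls = build_urls(30)
print(urls[0])
print(urls[-1])
```

Building the list first keeps the guess about the file count in one place, so it is easy to change later.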

3. Full code (it turned out there were only 22 files)

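# Python 2 script: urllib2 and the print statement do not exist in Python 3
import urllib2

# try tour01.swf through tour30.swf; only the numbers that exist get saved
for no in range(1,31):
	# pad single-digit numbers with a leading zero: 1 -> "01"
	name="%02d" % no
	url="http://www.wjtour.gov.cn/virtualtour/jsy/tour"+name+".swf"
	try:
		response = urllib2.urlopen(url)
	except urllib2.HTTPError as e:
		# the server answered with an error status, e.g. 404 for a missing file
		print e.code
	except urllib2.URLError as e:
		# the request never reached the server (DNS failure, refused connection, ...)
		print e.reason
	else:
		print "OK"
		# .swf files are binary, so open the output file in "wb" mode
		outfile =open(str(no)+".swf","wb")
		outfile.write(response.read())
		outfile.close()
		print str(no)+".swf saved!"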
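For readers on Python 3, where `urllib2` no longer exists and its functionality lives in `urllib.request` and `urllib.error`, the same loop might look roughly like the sketch below. The `download_all` function and its `opener` and `out_dir` parameters are hypothetical, added so the logic can be exercised without touching the network; this is not the code that was actually run.

```python
import os
import urllib.error
import urllib.request

BASE = "http://www.wjtour.gov.cn/virtualtour/jsy/tour"

def download_all(count=30, opener=urllib.request.urlopen, out_dir="."):
    """Try tour01.swf through tour<count>.swf and return the saved file paths."""
    saved = []
    for no in range(1, count + 1):
        url = f"{BASE}{no:02d}.swf"
        try:
            response = opener(url)
        except urllib.error.HTTPError as e:
            # HTTPError is a subclass of URLError, so it must be caught first.
            print(e.code)
        except urllib.error.URLError as e:
            # Network-level failure: the request never got a response.
            print(e.reason)
        else:
            path = os.path.join(out_dir, f"{no}.swf")
            # SWF files are binary, so write in "wb" mode.
            with open(path, "wb") as outfile:
                outfile.write(response.read())
            print(f"{no}.swf saved!")
            saved.append(path)
    return saved
```

Calling `download_all()` with the defaults would request the real site; passing a stand-in `opener` lets the loop be tried offline.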

4. Run result

Crawlers are great fun. Tomorrow I want to try crawling my school's academic affairs system.


Original post: https://www.cnblogs.com/A-yes/p/9894235.html