A Simple Information-Collection Program Example (Scraper Program) (Repost)
I have been preparing to write an information-collection program. Below is a simple collector, offered as a starting point for beginners.
ASPX page code
<asp:TextBox ID="Txt_Url" runat="server" Width="441px"></asp:TextBox><br />
<asp:Button id="Btn_GetUrlSource" runat="server" Text="Get page source" OnClick="Btn_GetUrlSource_Click"></asp:Button>
<br />
<asp:TextBox id="Txt_UrlSource" runat="server" Width="100%" Height="195px" TextMode="MultiLine"></asp:TextBox><br />
<br />
Collection start marker
<asp:TextBox ID="Txt_First" runat="server" Height="90px" TextMode="MultiLine" Width="280px"></asp:TextBox><br />
<asp:Button ID="Btn_ListCheck" runat="server" OnClick="Btn_ListCheck_Click" Text="Test uniqueness" /><br />
Collection end marker
<asp:TextBox ID="Txt_Last" runat="server" Height="90px" TextMode="MultiLine" Width="280px"></asp:TextBox><br />
<br />
<asp:Button ID="Btn_Result" runat="server" Text="Get collection result" OnClick="Btn_Result_Click" /><br />
<asp:TextBox ID="Txt_Result" runat="server" Height="134px" TextMode="MultiLine" Width="579px"></asp:TextBox>
Code-behind (.cs) page code
using System;
using System.Collections;
using System.ComponentModel;
using System.Data;
using System.Drawing;
using System.Web;
using System.Web.SessionState;
using System.Web.UI;
using System.Web.UI.WebControls;
using System.Web.UI.HtmlControls;
using System.IO;
using System.Net;
using System.Text.RegularExpressions;
using NetShuai.Database;
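// The members below belong inside the page's code-behind class
// (the class declaration is not shown in the original listing).
private string PageUrl = "";

private void Page_Load(object sender, System.EventArgs e)
{
    // Put user code to initialize the page here
}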
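// Downloads the page at the URL entered in Txt_Url and shows its source in Txt_UrlSource.
protected void Btn_GetUrlSource_Click(object sender, EventArgs e)
{
    PageUrl = Txt_Url.Text;
    WebRequest request = WebRequest.Create(PageUrl);
    // The using blocks release the response, stream, and reader even if reading throws.
    using (WebResponse response = request.GetResponse())
    using (Stream resStream = response.GetResponseStream())
    using (StreamReader sr = new StreamReader(resStream, System.Text.Encoding.Default))
    {
        // Encoding.Default is the server's default code page; pages served in
        // another charset will decode incorrectly.
        Txt_UrlSource.Text = sr.ReadToEnd();
    }
}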
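// Extracts every span of text between the start marker (Txt_First) and the end marker (Txt_Last).
protected void Btn_Result_Click(object sender, EventArgs e)
{
    // Regex.Escape makes the markers match literally; without it, characters
    // such as '.', '(' or '?' in a marker would be read as regex syntax.
    string first = Regex.Escape(Server.HtmlEncode(Txt_First.Text));
    string last = Regex.Escape(Server.HtmlEncode(Txt_Last.Text));
    // The lookbehind and lookahead keep the markers out of the match;
    // [\w\W]*? matches lazily across line breaks. Every piece must be a
    // verbatim (@) string so that \w and \W are not treated as C# escapes.
    string strExp = @"(?<=" + first + @")[\w\W]*?(?=" + last + @")";
    MatchCollection mc = Regex.Matches(Server.HtmlEncode(Txt_UrlSource.Text), strExp);
    for (int i = 0; i < mc.Count; i++)
    {
        Txt_Result.Text += Server.HtmlDecode(mc[i].Value);
    }
}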
// Checks that each marker occurs at most once in the fetched page source.
protected void Btn_ListCheck_Click(object sender, EventArgs e)
{
    string source = Server.HtmlEncode(Txt_UrlSource.Text);
    // Regex.Escape makes the marker match as literal text.
    string strExp = Regex.Escape(Server.HtmlEncode(Txt_First.Text));
    MatchCollection mc = Regex.Matches(source, strExp);
    if (mc.Count > 1)
    {
        Response.Write("<script>alert('The list start marker occurs more than once!')</script>");
        return;
    }
    strExp = Regex.Escape(Server.HtmlEncode(Txt_Last.Text));
    mc = Regex.Matches(source, strExp);
    if (mc.Count > 1)
    {
        Response.Write("<script>alert('The list end marker occurs more than once!')</script>");
        return;
    }
}
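The marker-based extraction used by the handlers above can also be tried outside ASP.NET. The following is only a minimal console sketch, not the original author's code: the names MarkerExtractor, CountOccurrences, and ExtractBetween are hypothetical, and it assumes the markers are meant to match as literal text, so it applies Regex.Escape and skips the HtmlEncode/HtmlDecode round trip.

```csharp
using System;
using System.Collections.Generic;
using System.Text.RegularExpressions;

// Hypothetical helper class for illustration; not part of the original post.
public static class MarkerExtractor
{
    // Counts how many times a literal marker occurs in the source text.
    public static int CountOccurrences(string source, string marker)
    {
        if (string.IsNullOrEmpty(marker)) return 0;
        return Regex.Matches(source, Regex.Escape(marker)).Count;
    }

    // Returns each shortest span of text found between startMarker and endMarker.
    public static List<string> ExtractBetween(string source, string startMarker, string endMarker)
    {
        // Lookbehind and lookahead exclude the markers from the match itself;
        // [\s\S]*? matches lazily and also crosses line breaks.
        string pattern = "(?<=" + Regex.Escape(startMarker) + @")[\s\S]*?(?=" + Regex.Escape(endMarker) + ")";
        var results = new List<string>();
        foreach (Match m in Regex.Matches(source, pattern))
        {
            results.Add(m.Value);
        }
        return results;
    }

    public static void Main()
    {
        string html = "<ul><li>first</li><li>second</li></ul>";
        Console.WriteLine(CountOccurrences(html, "<ul>"));   // prints 1
        foreach (string item in ExtractBetween(html, "<li>", "</li>"))
        {
            Console.WriteLine(item);                          // prints first, then second
        }
    }
}
```

Escaping the markers means a start marker such as `<a href="page.aspx?id=` is matched character for character, rather than having `.` and `?` interpreted as regex operators.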
Original article: https://www.cnblogs.com/zhangzheny/p/901722.html