A Simple Information-Scraping Program Example ("Thief" Program) (repost)
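I have recently been preparing to build an information-scraping program. Below is a simple scraper, offered as a starting reference for beginners.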
ASPX page code
<asp:TextBox ID="Txt_Url" runat="server" Width="441px"></asp:TextBox><br />
<asp:Button id="Btn_GetUrlSource" runat="server" Text="Get page source" OnClick="Btn_GetUrlSource_Click"></asp:Button>
<br />
<asp:TextBox id="Txt_UrlSource" runat="server" Width="100%" Height="195px" TextMode="MultiLine"></asp:TextBox><br />
<br />
Start marker
<asp:TextBox ID="Txt_First" runat="server" Height="90px" TextMode="MultiLine" Width="280px"></asp:TextBox><br />
<asp:Button ID="Btn_ListCheck" runat="server" OnClick="Btn_ListCheck_Click" Text="Check uniqueness" /><br />
End marker
<asp:TextBox ID="Txt_Last" runat="server" Height="90px" TextMode="MultiLine" Width="280px"></asp:TextBox><br />
<br />
<asp:Button ID="Btn_Result" runat="server" Text="Get extraction result" OnClick="Btn_Result_Click" /><br />
<asp:TextBox ID="Txt_Result" runat="server" Height="134px" TextMode="MultiLine" Width="579px"></asp:TextBox>
Code-behind (.cs) page code
using System;
using System.Collections;
using System.ComponentModel;
using System.Data;
using System.Drawing;
using System.Web;
using System.Web.SessionState;
using System.Web.UI;
using System.Web.UI.WebControls;
using System.Web.UI.HtmlControls;
using System.IO;
using System.Net;
using System.Text.RegularExpressions;
using NetShuai.Database;
private string PageUrl = "";

private void Page_Load(object sender, System.EventArgs e)
{
    // Put user code to initialize the page here
}
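protected void Btn_GetUrlSource_Click(object sender, EventArgs e)
{
    PageUrl = Txt_Url.Text;
    WebRequest request = WebRequest.Create(PageUrl);
    // Dispose the response, stream, and reader even if reading throws.
    using (WebResponse response = request.GetResponse())
    using (Stream resStream = response.GetResponseStream())
    // Encoding.Default is the server's ANSI code page; pages served in a
    // different encoding (for example UTF-8) will come out garbled.
    using (StreamReader sr = new StreamReader(resStream, System.Text.Encoding.Default))
    {
        Txt_UrlSource.Text = sr.ReadToEnd();
    }
}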
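protected void Btn_Result_Click(object sender, EventArgs e)
{
    // Lazily match whatever lies between the start marker (lookbehind)
    // and the end marker (lookahead). Regex.Escape keeps any regex
    // metacharacters in the markers literal.
    string strExp = @"(?<=" + Regex.Escape(Server.HtmlEncode(Txt_First.Text))
        + @")[\w\W]*?(?=" + Regex.Escape(Server.HtmlEncode(Txt_Last.Text)) + ")";
    MatchCollection mc = Regex.Matches(Server.HtmlEncode(Txt_UrlSource.Text), strExp);
    for (int i = 0; i < mc.Count; i++)
    {
        // The source was HTML-encoded before matching, so decode each hit.
        Txt_Result.Text += Server.HtmlDecode(mc[i].Value);
    }
}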
protected void Btn_ListCheck_Click(object sender, EventArgs e)
{
    // The start marker should occur at most once in the page source.
    string strExp = Regex.Escape(Server.HtmlEncode(Txt_First.Text));
    MatchCollection mc = Regex.Matches(Server.HtmlEncode(Txt_UrlSource.Text), strExp);
    if (mc.Count > 1)
    {
        Response.Write("<script>alert('The list start marker occurs more than once!')</script>");
        return;
    }
    // The same check for the end marker.
    strExp = Regex.Escape(Server.HtmlEncode(Txt_Last.Text));
    mc = Regex.Matches(Server.HtmlEncode(Txt_UrlSource.Text), strExp);
    if (mc.Count > 1)
    {
        Response.Write("<script>alert('The list end marker occurs more than once!')</script>");
        return;
    }
}
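To try the matching logic outside of a web page, the two regex steps used above (extracting the text between a start and an end marker, and counting how often a marker occurs) can be sketched as a small console program. This is only a minimal sketch: the names MarkerExtractor, ExtractBetween and CountOccurrences and the sample HTML string are made up for illustration and are not part of the original program. It also skips the HtmlEncode/HtmlDecode round trip and passes the markers through Regex.Escape so that they are matched literally.

```csharp
using System;
using System.Collections.Generic;
using System.Text.RegularExpressions;

// Hypothetical helper class, not part of the original page code.
static class MarkerExtractor
{
    // Returns every substring of source that lies between the start and end markers.
    public static List<string> ExtractBetween(string source, string start, string end)
    {
        // Lookbehind for the start marker, lazy match, lookahead for the end marker.
        // Regex.Escape makes characters such as '(', '?' or '.' match literally.
        string pattern = "(?<=" + Regex.Escape(start) + @")[\w\W]*?(?=" + Regex.Escape(end) + ")";
        var results = new List<string>();
        foreach (Match m in Regex.Matches(source, pattern))
        {
            results.Add(m.Value);
        }
        return results;
    }

    // Counts literal occurrences of marker in source (the uniqueness test).
    public static int CountOccurrences(string source, string marker)
    {
        return Regex.Matches(source, Regex.Escape(marker)).Count;
    }

    static void Main()
    {
        // Made-up sample input for illustration.
        string html = "<ul><li>first</li><li>second</li></ul>";

        foreach (string item in ExtractBetween(html, "<li>", "</li>"))
        {
            Console.WriteLine(item);   // prints "first", then "second"
        }

        Console.WriteLine(CountOccurrences(html, "<ul>"));   // prints 1
    }
}
```

Because the match is lazy ([\w\W]*?), each hit stops at the nearest end marker, which is why a list with repeated markers yields one result per item instead of one long match.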
Original post: https://www.cnblogs.com/zhangzheny/p/901722.html