zoukankan      html  css  js  c++  java
  • 正则表达式识别js跳转的链接

    用Webclient访问链接后返回了跳转页面,页面代码如下,需要把

    http://kd.szty56.com:8086/xms/client/order_online!print.action?userToken=6f6980b4633c4f03b2eb294054e11f01&trackingNo=LS185519295CN&pageSizeCode=6
    这个链接解析出来
    <?xml version="1.0" encoding="UTF-8"?>
    <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">
    <html xmlns="http://www.w3.org/1999/xhtml">
    <head>
    <meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
    <title>打印</title>
    </head>
    <body>
        <script type="text/javascript">
            var printUrl = "http://kd.szty56.com:8086/xms/client/order_online!print.action?userToken=6f6980b4633c4f03b2eb294054e11f01&trackingNo=LS185519295CN&pageSizeCode=6";
            location.href = printUrl;
        </script>
    </body>
    
    </html>

    //1.解析出跳转链接字符串,如printUrl = "http://kd.szty56.com:8086/xms/client/order_online!print.action?userToken=6f6980b4633c4f03b2eb294054e11f01&trackingNo=LS185519295CN&pageSizeCode=6"
    string link = "";
    MatchCollection matches = Regex.Matches(strHtml, "printUrl\s*=\s*".*?"", RegexOptions.IgnoreCase);
    link =matches[0].Groups[0].Value;
    //2.从上面字符串中解析出链接,解析出双引号包围的内容
    string pattern = "(?<=").*?(?=")";
    string realLink = Regex.Match(link, pattern).Value;

  • 相关阅读:
    java监控Linux磁盘空间的使用情况
    报道篇
    标签详解
    selectors模块
    [Python 多线程] RLock可重入锁
    Mysql中索引的 创建,查看,删除,修改
    openpyxl模块的使用
    Oracle中row_number()、rank()、dense_rank() 的区别
    python面试题
    sql 用Group by分组后,取每组的前几条记录
  • 原文地址:https://www.cnblogs.com/lidaying5/p/7866346.html
Copyright © 2011-2022 走看看