  • PHP regex code for extracting a web page's title, keywords, and description

    Extracting a page's keywords with a PHP regex. The code is as follows:

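    // Relies on replace_word() and textcut(), helpers that are not shown in this post.
    function get_keywords($html)
    {
        // Note: lowercasing the whole page also lowercases the returned keywords.
        $html = strtolower($html);
        // Restrict the search to the <head> section when there is one.
        $headdata = preg_match("@<head[^>]*>(.*?)</head>@si", $html, $regs) ? $regs[1] : "";
        $keywords = "";
        // Quotes inside the character classes must be escaped in a double-quoted string.
        // This pattern expects the name attribute to come before content.
        if (preg_match("/<meta +name *=[\"']?keywords[\"']? *content=[\"']?([^<>\"]+)[\"']?/i", $headdata, $res)) {
            $keywords = $res[1];
        }
        if ($keywords == "") $keywords = "无"; // "无" means "none"
        $keywords = replace_word(textcut($keywords, 250));
        // Normalize common separators to a half-width comma.
        $keywords = str_replace("-", ",", $keywords);
        // Presumably the full-width comma originally; a half-width search string would be a no-op.
        $keywords = str_replace(",", ",", $keywords);
        $keywords = str_replace(" ", ",", $keywords);
        $keywords = str_replace("|", ",", $keywords);
        $keywords = str_replace("、", ",", $keywords);
        $keywords = str_replace(",,", ",", $keywords);
        // Drop stray angle brackets.
        $keywords = str_replace("<", "", $keywords);
        $keywords = str_replace(">", "", $keywords);
        return addslashes(trim($keywords));
    }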
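
    The chain of str_replace() calls in get_keywords() can be written more compactly. The following is a minimal sketch, not the author's code: normalize_separators() is a hypothetical name, and it assumes the source file is saved as UTF-8. It passes an array of search strings to str_replace() and uses preg_replace() to collapse comma runs of any length, whereas a single replacement of ",," still leaves ",," behind when three or more commas are adjacent.

```php
<?php
// Hypothetical helper, not part of the original post.
function normalize_separators(string $text): string
{
    // Map every separator to a half-width comma in one call.
    $text = str_replace(['-', ',', ' ', '|', '、'], ',', $text);
    // Collapse runs of commas of any length into a single comma.
    $text = preg_replace('/,+/', ',', $text);
    // Drop angle brackets, then trim commas and whitespace from both ends.
    return trim(str_replace(['<', '>'], '', $text), ", \t\n\r");
}

echo normalize_separators("php - regex|meta、tags"), "\n"; // prints "php,regex,meta,tags"
```

    Unlike the original, this sketch also trims leading and trailing commas; remove the second argument of trim() if that is not wanted.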
    
    
    

      

    Extracting a page's title with a PHP regex. The code is as follows:

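    // Relies on cut(), replace_word() and textcut(), helpers that are not shown in this post.
    function get_title($html)
    {
        $html = strtolower($html);
        // cut() presumably returns the text between the two markers.
        $title = str_replace(" - ", ",", cut($html, "<title>", "</title>"));
        //if ($title == "") $title = "无标题"; // "无标题" means "untitled"
        if ($title) $title = replace_word(textcut($title, 80));
        // Strip any tags left inside the title.
        if ($title) $title = preg_replace("/<(.*?)>/", "", $title);
        return addslashes(trim($title));
    }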
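
    Because cut(), replace_word() and textcut() are not shown, get_title() cannot run on its own. The following is a rough self-contained sketch of the same idea that uses only a regex; extract_title() is a hypothetical name, not the author's function. It does not lowercase, truncate, or escape the result, so it illustrates the matching step rather than replacing get_title().

```php
<?php
// Hypothetical stand-in for the matching step of get_title(); uses no external helpers.
function extract_title(string $html): string
{
    // Match <title>, allowing attributes, case-insensitively and across newlines.
    if (!preg_match('@<title[^>]*>(.*?)</title>@si', $html, $m)) {
        return '';
    }
    // Remove any nested tags, then collapse whitespace runs into single spaces.
    $title = preg_replace('/<[^>]*>/', '', $m[1]);
    $title = preg_replace('/\s+/', ' ', $title);
    return trim($title);
}

echo extract_title("<html><head><TITLE>\n Example  Page </TITLE></head></html>"), "\n"; // prints "Example Page"
```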
    
    
    

      

    Extracting a page's description with a PHP regex. The code is as follows:

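    // Relies on replace_word() and textcut(), helpers that are not shown in this post.
    function get_description($html)
    {
        // Note: lowercasing the whole page also lowercases the returned description.
        $html = strtolower($html);
        // Restrict the search to the <head> section when there is one.
        $headdata = preg_match("@<head[^>]*>(.*?)</head>@si", $html, $regs) ? $regs[1] : "";
        $description = "";
        // Quotes inside the character classes must be escaped in a double-quoted string.
        // This pattern expects the name attribute to come before content.
        if (preg_match("/<meta +name *=[\"']?description[\"']? *content=[\"']?([^<>\"]+)[\"']?/i", $headdata, $res)) {
            $description = $res[1];
        }
        if ($description == "") $description = "无"; // "无" means "none"
        $description = replace_word(textcut($description, 250));
        // Normalize common separators to a half-width comma.
        $description = str_replace("-", ",", $description);
        // Presumably the full-width comma originally; a half-width search string would be a no-op.
        $description = str_replace(",", ",", $description);
        $description = str_replace(" ", ",", $description);
        $description = str_replace("|", ",", $description);
        $description = str_replace("、", ",", $description);
        $description = str_replace(",,", ",", $description);
        // Drop stray angle brackets.
        $description = str_replace("<", "", $description);
        $description = str_replace(">", "", $description);
        return addslashes(trim($description));
    }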
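
    get_keywords() and get_description() differ only in the meta name they look for, and both patterns match only when name appears before content. The following is a minimal sketch of one shared lookup that accepts either attribute order; extract_meta_content() is a hypothetical name, and the approach (scan each meta tag, then read its content attribute) is an alternative to the patterns above, not the author's method. It assumes no attribute value contains a ">" character.

```php
<?php
// Hypothetical helper, not part of the original post.
function extract_meta_content(string $html, string $name): string
{
    // Collect every <meta ...> tag in the document.
    if (!preg_match_all('/<meta\b[^>]*>/i', $html, $tags)) {
        return '';
    }
    $quotedName = preg_quote($name, '/');
    foreach ($tags[0] as $tag) {
        // Skip tags whose name attribute is not exactly $name (case-insensitive).
        if (!preg_match('/\sname\s*=\s*["\']?' . $quotedName . '["\']?(?=[\s\/>])/i', $tag)) {
            continue;
        }
        // Read content: double-quoted, single-quoted, or unquoted.
        // The branch-reset group (?|...) puts the value in $m[1] in all three cases.
        if (preg_match('/\scontent\s*=\s*(?|"([^"]*)"|\'([^\']*)\'|([^\s"\'>]+))/i', $tag, $m)) {
            return trim($m[1]);
        }
    }
    return '';
}

$html = '<head><meta content="PHP, regex" name="Keywords">'
      . "<meta name=\"description\" content='A short page'></head>";
echo extract_meta_content($html, 'keywords'), "\n";    // prints "PHP, regex"
echo extract_meta_content($html, 'description'), "\n"; // prints "A short page"
```

    The returned value keeps its original case and is not escaped; truncation, separator normalization, and addslashes() would still be applied afterwards if the behavior of the functions above is wanted.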
    
    
    
  • Original article: https://www.cnblogs.com/qhorse/p/5067378.html