zoukankan      html  css  js  c++  java
  • sql删除重复数据

    用爬虫爬了一些数据,但是有些标题是重复的,需要删除,所以找了一下删除重复标题数据的sql。
    
    # 查询所有重复的数据
    select * 
    FROM
    	tb_xici_article 
    WHERE
    	post_title IN ( SELECT post_title FROM tb_xici_article GROUP BY post_title HAVING count( post_title ) > 1 )
    	
    	
    # 查询所有重复并且id不是最小的那些重复数据
    SELECT
    	* 
    FROM
    	tb_xici_article 
    WHERE
    	post_title IN ( SELECT post_title FROM tb_xici_article GROUP BY post_title HAVING count( post_title ) > 1 ) 
    	AND id NOT IN ( SELECT min( id ) FROM tb_xici_article GROUP BY post_title HAVING count( post_title ) > 1 )
    	
    
    # 删除重复数据
    如果直接按下面这样写,mysql会报You can't specify target table for update in FROM clause错误,需要把select出的结果再通过中间表select一遍
    DELETE
    FROM
    	tb_xici_article 
    WHERE
    	post_title IN ( SELECT post_title FROM tb_xici_article GROUP BY post_title HAVING count( post_title )
    	> 1 ) and id not in (select min(id) from tb_xici_article group by post_title HAVING count(post_title) > 1)
    	
    #	最终版
    DELETE 
    FROM
    	tb_xici_article 
    WHERE
    	id IN (
    	SELECT
    		temp.id 
    	FROM
    		(
    		SELECT
    			* 
    		FROM
    			tb_xici_article 
    		WHERE
    			post_title IN ( SELECT post_title FROM tb_xici_article GROUP BY post_title HAVING count( post_title ) > 1 ) 
    			AND id NOT IN ( SELECT min( id ) FROM tb_xici_article GROUP BY post_title HAVING count( post_title ) > 1 ) 
    		) temp 
    	)
    	
    

      

  • 相关阅读:
    go装饰器
    python属性描述符和属性查找过程
    python property动态属性
    python中is 与 ==的区别
    python变量到底是什么
    python对象的可变性
    python sorted函数
    《Note --- UE4 --- Blueprint_Overview_HowTo》
    《Temporal AA , SMAA and MSAA》
    TODO
  • 原文地址:https://www.cnblogs.com/leyi/p/10802501.html
Copyright © 2011-2022 走看看