zoukankan      html  css  js  c++  java
  • 在SQL中删除重复记录

    以表employee为例


    SQL> desc employee
     

     Name                                          Type
     ----------------------------------------- -------- ------------------ 

    emp_id                                                NUMBER(10)
    emp_name                                              VARCHAR2(20)
    salary                                                NUMBER(10,2) 

    可以通过下面的语句查询重复的记录:
     

    SQL> select * from employee;

        EMP_ID EMP_NAME                                  SALARY

    ---------- ---------------------------------------- ----------
     

             1 sunshine                                      10000
             1 sunshine                                      10000
             2 semon                                         20000
             2 semon                                         20000
             3 xyz                                           30000
             2 semon                                         20000 

    SQL> select distinct * from employee;
     

        EMP_ID EMP_NAME                                     SALARY
    ---------- ---------------------------------------- ----------
             1 sunshine                                      10000
             2 semon                                         20000
             3 xyz                                           30000
     

    SQL> select * from employee group by emp_id,emp_name,salary having count (*)>1
     

        EMP_ID EMP_NAME                                     SALARY
    ---------- ---------------------------------------- ----------
             1 sunshine                                      10000
             2 semon                                         20000


    SQL> select * from employee e1
    where rowid in (select max(rowid) from employe e2
     where e1.emp_id=e2.emp_id and
     e1.emp_name=e2.emp_name and e1.salary=e2.salary);

        EMP_ID EMP_NAME                                     SALARY
    ---------- ---------------------------------------- ----------
             1 sunshine                                      10000
             3 xyz                                           30000
             2 semon                                         20000
     

    2. 删除的几种方法:

     (1)通过建立临时表来实现 

    SQL>create table temp_emp as (select distinct * from employee) 

    SQL> truncate table employee; (清空employee表的数据) 

    SQL> insert into employee select * from temp_emp;  (再将临时表里的内容插回来)
     

    ( 2)通过唯一rowid实现删除重复记录.在Oracle中,每一条记录都有一个rowid,rowid在整个数据库中是唯一的,rowid确定了每条记录是在Oracle中的哪一个数据文件、块、行上。在重复的记录中,可能所有列的内容都相同,但rowid不会相同,所以只要确定出重复记录中那些具有最大或最小rowid的就可以了,其余全部删除。


    SQL>delete from employee e2 where rowid not in (
            select max(e1.rowid) from employee e1 where
            e1.emp_id=e2.emp_id and e1.emp_name=e2.emp_name and e1.salary=e2.salary);--这里用min(rowid)也可以。


     

    SQL>delete from employee e2 where rowid <(
            select max(e1.rowid) from employee e1 where
            e1.emp_id=e2.emp_id and e1.emp_name=e2.emp_name and e1.salary=e2.salary);

    (3)也是通过rowid,但效率更高。 

    SQL>delete from employee where rowid not in (
            select max(t1.rowid) from employee t1 group by 1.emp_id,t1.emp_name,t1.salary);
             --这里用min(rowid)也可以。

        EMP_ID EMP_NAME                                     SALARY

    ---------- ---------------------------------------- ----------
             1 sunshine                                      10000
             3 xyz                                           30000
             2 semon                                         20000

  • 相关阅读:
    (转)证券公司信息化——4
    面试常见高频算法题总结
    git常用命令
    JDBC框架——DBUtils
    springboot读取 yaml或者properties文件几种方式
    log4j2配置
    java通过SparkSession连接spark-sql
    列式存储格式之Parquet
    动态规划专题
    CountDownLatch、CyclicBarrier和Semaphore用法
  • 原文地址:https://www.cnblogs.com/conquer/p/807425.html
Copyright © 2011-2022 走看看