zoukankan      html  css  js  c++  java
  • Samtools pick up seq

    Samtools pick up seq

    # grep the name of gene and find the real name in goat_reference_gene_list
    for record in `grep "Re" ../out_No_Pinneped`; do
            grep "${${record##*/}%.*}" ../goat_gene.list >> gene_record
    done
    
    # pick up the gene symbol
    awk '{print $2}' gene_record > ./gene_symbol
    
    # find the overlap
    for i in `les gene_symbol `; do
                    grep "$i" results_Yuan >> overlap
    done
    
    # find specific range of seq
    # Method 1: seqkit
    seqkit faidx ./chr10.fa chr10:47818633-47822767| les
    # Method 2: samtools
    samtools faidx ./trichechus_manatus.fa
    samtools faidx trichechus_manatus.fa NW_004446089.1:0-30000 > NW_004446089.1.fa
    
    #===================================================================================
    
    #get the order of "coevolution site"/"total length" in No-pinneped coevolution list
    
    1. get the total length list
    seqkit head -n 1 ../Re_alignment/NP_001272477.fa | sed --regexp-extended 's/>S+//' | wc -c
    for gene in `grep "Re" ./out_No_Pinneped `; do
                    echo $gene >> out_No_Pinneped_statistics; seqkit head -n 1 $gene| sed --regexp-extended 's/>S_//' | w
    done
    
    2. calculate and sort in excel
    
  • 相关阅读:
    Java基础其他
    java网络编程
    java多线程编程
    正则表达式--位置匹配和组
    设计模式
    深入 Java Web
    mysql 好用的sql语句
    spring boot 发送邮件
    dubbo的spi机制
    原理分析dubbo分布式应用中使用zipkin做链路追踪
  • 原文地址:https://www.cnblogs.com/zhengjm/p/15049230.html
Copyright © 2011-2022 走看看