zoukankan      html  css  js  c++  java
  • 从 NCBI 批量下载基因组的方法

    先下载 assembly summary files

    The assembly_summary files report metadata for the genome assemblies on the NCBI genomes FTP site.

    Four master files reporting data for either GenBank or RefSeq genome assemblies are available under ftp://ftp.ncbi.nlm.nih.gov/genomes/ASSEMBLY_REPORTS/

    assembly_summary_genbank.txt                  - current GenBank genome assemblies

    assembly_summary_genbank_historical.txt  - replaced and suppressed GenBank genome assemblies

    assembly_summary_refseq.txt                      - current RefSeq genome assemblies

    assembly_summary_refseq_historical.txt      - replaced and suppressed RefSeq genome assemblies

    assembly_summary_genbank.txt and assembly_summary_genbank_historical.txt are also available at:

    ftp://ftp.ncbi.nlm.nih.gov/genomes/genbank/assembly_summary_genbank.txt

    ftp://ftp.ncbi.nlm.nih.gov/genomes/genbank/assembly_summary_genbank_historical.txt

    assembly_summary_refseq.txt and assembly_summary_refseq_historical.txt are also available at:

    ftp://ftp.ncbi.nlm.nih.gov/genomes/refseq/assembly_summary_refseq.txt

    ftp://ftp.ncbi.nlm.nih.gov/genomes/refseq/assembly_summary_refseq_historical.txt

    The assembly_summary.txt files in the directories named for taxonomic groups or species contain the relevant subsets of the data from the master files.

    也可以从 ftp://ftp.ncbi.nlm.nih.gov/genomes/genbank/ 下载单独的summary文件(bacteria fungi viral 等)

    也可以从 ftp://ftp.ncbi.nlm.nih.gov/genomes/refseq/ 下载单独的summary文件(bacteria fungi viral 等)

    根据 summary 文件中的 ftp_path 列 可以下载到基因组及相关信息

  • 相关阅读:
    [Machine Learning]Numpy
    [LeetCode]Valid Palindrome
    [LeetCode]Remove Linked List Elements
    [LeetCode]Reverse Linked List
    [LeetCode]Palindrome Number
    Spring绑定请求参数过程以及使用@InitBinder来注册自己的属性处理器
    servlet温故知新
    线程池简单实现
    JAVA NIO学习笔记
    XSS攻击简单介绍
  • 原文地址:https://www.cnblogs.com/0820LL/p/10103293.html
Copyright © 2011-2022 走看看