functional genomics | epigenomic annotations
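Newcomers to the field are usually lost. They have little interest in these epigenetic marks, and there are so many kinds that nobody can keep them straight. Here is a common-sense summary, light on jargon.
There is not that much you need to know. Only a handful of marks are common, and understanding what ENCODE and ROADMAP provide is enough. There are quite a few details, so a little patience helps.
All of these data types can be browsed directly in this genome browser: http://genomebrowser.wustl.edu/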
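Notes:
- All epigenomic and transcriptomic data are highly tissue (cell type) specific
- The defining feature of ChIP-seq is that it needs an input sample as a control
- ChIP-seq can identify both direct and indirect protein-DNA interactions
- ChIP-seq is preferred when you want functional information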
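To make the point about the input control concrete, here is a minimal sketch of how a ChIP signal can be compared against input in a single genomic window. It assumes a simple depth-normalised log2 ratio with a pseudocount; the function name, the counts, and the library sizes are all made up for illustration, and real peak callers use more careful statistical models than this.

```python
import math

def log2_fold_enrichment(chip_reads, input_reads, chip_total, input_total, pseudocount=1.0):
    """log2 ratio of depth-normalised ChIP signal over input signal in one window.

    chip_total and input_total are the library sizes (total mapped reads).
    The pseudocount avoids division by zero in windows with no input reads.
    """
    chip_cpm = (chip_reads + pseudocount) / chip_total * 1e6
    input_cpm = (input_reads + pseudocount) / input_total * 1e6
    return math.log2(chip_cpm / input_cpm)

# Hypothetical windows: the ChIP library is twice as deep as the input library.
enriched = log2_fold_enrichment(chip_reads=399, input_reads=49,
                                chip_total=20_000_000, input_total=10_000_000)
background = log2_fold_enrichment(chip_reads=99, input_reads=49,
                                  chip_total=20_000_000, input_total=10_000_000)

print(f"enriched window:   {enriched:.2f}")
print(f"background window: {background:.2f}")
```

The second window has twice as many ChIP reads as input reads, yet its enrichment is about zero once sequencing depth is taken into account, which is why the input sample matters.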
Raw data:
- DHSs
- H3K4me3
- H3K9ac
- H3K27ac
- H3K4me1
Processed data:
- Enhancer
- TFBSs
ChIP-seq (an immunological assay) accounts for most of this, so understanding it is the main thing.
The other class is non-immunological assays: ATAC-seq, MNase-seq, DNase-seq, and FAIRE-seq.
DHSs
DNase I hypersensitive site
DNase-seq
FAIRE-seq is a successor to DNase-seq
genome-wide DNA footprints
DNase: deoxyribonuclease
DNase I hypersensitive sites (DHSs) are regions of chromatin that are sensitive to cleavage by the DNase I enzyme. In these specific regions of the genome, chromatin has lost its condensed structure, exposing the DNA and making it accessible. This raises the availability of DNA to degradation by enzymes, such as DNase I. These accessible chromatin zones are functionally related to transcriptional activity, since this remodeled state is necessary for the binding of proteins such as transcription factors.
ChIP-seq
Basically,
- "encc-enhancer.bed" is enhancers defined with H3K27ac & H3K4me1 activity
- "encc-enhancer-atac.bed" is enhancers defined with H3K27ac & H3K4me1 activity as well as open chromatin (ATAC-seq) signal summits.
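The two definitions above can be sketched with plain interval logic. This is only an illustration under stated assumptions: the source does not say how the files were actually built, so the sketch assumes an enhancer is the overlapping part of an H3K27ac peak and an H3K4me1 peak, and that the ATAC version keeps only those regions containing an ATAC-seq summit. All coordinates and helper names are hypothetical.

```python
def overlaps(a, b):
    """True if two (chrom, start, end) half-open intervals share at least one base."""
    return a[0] == b[0] and a[1] < b[2] and b[1] < a[2]

def intersect(peaks_a, peaks_b):
    """Overlapping portions of two peak lists (simple O(n*m) scan, fine for toy data)."""
    out = []
    for a in peaks_a:
        for b in peaks_b:
            if overlaps(a, b):
                out.append((a[0], max(a[1], b[1]), min(a[2], b[2])))
    return sorted(out)

def with_summit(regions, summits):
    """Keep regions that contain at least one (chrom, position) summit."""
    return [r for r in regions
            if any(c == r[0] and r[1] <= p < r[2] for c, p in summits)]

# Toy peaks in BED-style coordinates (0-based, end-exclusive).
h3k27ac = [("chr1", 100, 500), ("chr1", 1000, 1400), ("chr2", 50, 300)]
h3k4me1 = [("chr1", 300, 700), ("chr1", 1200, 1600), ("chr2", 400, 600)]
atac_summits = [("chr1", 1250), ("chr2", 100)]

enhancers = intersect(h3k27ac, h3k4me1)               # both marks present
enhancers_atac = with_summit(enhancers, atac_summits)  # both marks plus open chromatin

print(enhancers)
print(enhancers_atac)
```

On real data a tool such as bedtools would normally do this work; the nested loop here is only meant to show the logic.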
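What the different ChIP-seq marks tell you; a picture is worth a thousand words: [we used the first and last rows, which is the most efficient choice]
Comparison of different epigenomic annotations:
To be continued~
Quick use of epigenomic annotation data:
There is a file set called baseline_v1.1 that contains all kinds of curated epigenomic annotation data.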
https://data.broadinstitute.org/alkesgroup/LDSCORE/baseline_v1.1_bedfiles.tgz
~/project2/CPloci/Evo/ENCODE/
Data types included:
- Coding
- Intron
- Transcribed
- Conserved
- DGF
- DHS
- H3K9ac
- H3K27ac
- H3K4me1
- H3K4me3
- CTCF
- TFBS
- TSS
- Promoter
- Enhancer
- SuperEnhancer
- WeakEnhancer
- Repressed
- UTR_5
- UTR_3
That is a lot of categories. If you have no particular precision requirements you can use them directly; they are all in BED format.
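As a rough sketch of what "use them directly" could look like, the snippet below checks which annotation files cover a given position. It assumes a directory of plain BED files with one file per annotation; the file names and coordinates in the demo are invented, so adjust them to whatever the unpacked archive actually contains. Remember that BED coordinates are 0-based with an exclusive end.

```python
import os
import tempfile

def read_bed(path):
    """Read the first three BED columns as (chrom, start, end) tuples."""
    intervals = []
    with open(path) as handle:
        for line in handle:
            if not line.strip() or line.startswith(("#", "track", "browser")):
                continue
            chrom, start, end = line.split()[:3]
            intervals.append((chrom, int(start), int(end)))
    return intervals

def annotate(chrom, pos0, bed_dir):
    """Names of BED files in bed_dir that have an interval covering the 0-based position pos0."""
    hits = []
    for name in sorted(os.listdir(bed_dir)):
        if not name.endswith(".bed"):
            continue
        for c, start, end in read_bed(os.path.join(bed_dir, name)):
            if c == chrom and start <= pos0 < end:
                hits.append(name[:-4])  # strip the ".bed" suffix
                break
    return hits

# Toy demo with hypothetical file names standing in for the real annotation files.
demo_dir = tempfile.mkdtemp()
toy = {
    "DHS.bed": "chr1\t100\t200\n",
    "Enhancer.bed": "chr1\t150\t400\n",
    "Promoter.bed": "chr2\t0\t50\n",
}
for name, text in toy.items():
    with open(os.path.join(demo_dir, name), "w") as handle:
        handle.write(text)

print(annotate("chr1", 160, demo_dir))
```

This linear scan is slow for genome-wide queries. For many positions, a sorted index or an existing interval tool is the better choice.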
References:
Chromatin accessibility and the regulatory epigenome
Identifying and mitigating bias in next-generation sequencing methods for chromatin biology - Xiaole Liu
Chromatin Structure Research Methods
Introduction to ChIP-seq and ATAC-seq - excellent
Mapping DNA-protein interactions via ChIP-seq - very detailed
How to identify gene promoters and enhancers with ChIP-seq analysis (如何通过CHIP-seq分析鉴别基因启动子和增强子, in Chinese) - a detailed ChIP-seq explainer