Downstream ChIP-seq Analysis
GitHub Classroom Assignment
Data
You’ve been given some data from a reanalysis of the paper Genome-wide maps of chromatin state in pluripotent and lineage-committed cells using the nf-core/chipseq pipeline.
You’ve also been given a file containing the MysteryFactorX regions.
Instructions
IGV
- Visualize the regions in IGV.
- Combine the MysteryFactorX Regions with Peaks for H3K4me3 and H3K27me3 from Mikkelsen ‘07. Reproduce Figure 1 (without the ChIP-chip data) to the best of your ability using IGV, including the MysteryFactorX.
Figure 1
If you're using IGV Desktop
Select mm10
as the genome, or download a fasta for just chr11 for mm10.
MEME-ChIP
- Perform motif analysis on the MysteryFactorX Regions. Add a image of the motif to your README.
- What TF might the MysteryFactorX be?
Example motif logo:
Running MEME-ChIP on chr1 only
If running MEME-ChIP is taking a long time, you might create a sequences fasta of only chr1
.
Bedtools
- How many MysteryFactorX regions overlap with H3K4me3 peaks?
- How many MysteryFactorX regions overlap with H3K27me3 peaks?
Feel free to use igv-notebook, the IGV web app, or the IGV desktop application! igv-reports will also work!
You may write a bash script, Jupyter Notebook, Snakemake, Galaxy or a combination of all of them. Just document how to reproduce your analysis!
Useful Docs
- nf-core/chipseq 2.0.0
- Biostar Handbook: Using the Integrative Genomics Viewer Chapter 41 - Page 367
- Biostar Handbook: Chip-Seq motifs Chapter 114 - Page 922
- Biostar Handbook: Downstream Analysis 1 Chapter 116 - Page 949
Hints
- Search a Gene name that’s in the center of the figure to get the location of Figure 1
- Read the Figure description to get the size of the window used in IGV