RNA-seq_analysis
: RNA-seq_analysis.sh
RNA-seq data is applied quality and adapter trimming by Trim Galore v0.6.4_dev and mapped to the mm9 reference genome using STAR v2.7.2d. And we performed reads summarization for genomic features via featureCounts v2.0.0.
ChIP-seq_analysis
: ChIP-seq_analysis.sh
Adapters and low-quality bases were removed using Trim Galore v0.6.4_dev. ChIP-seq reads were mapped to the mm9 reference genome using Bowtie2 v2.3.5.1. Samtools v1.10 was used to filter and convert file formats. To remove PCR duplicate reads, which might occur false positive results, MarkDuplicates (v4.1.4.1, Picard) was used to locate and tag duplicate reads. And sequencing duplicates were removed using the --REMOVE_SEQUENCING_DUPLICATES
options. ChIP-seq peaks were called using MACS2 v2.2.6.
Repetitive Elements binding analysis
For the repetitive Elements binding analysis, the reads were then aligned to the mm9 genome assembly using STAR v2.7.2d with the options --alignIntronMax 1
and --alignEndsType EndToEnd
as previously reported. The parameter --outFilterMultimapNmax 1
was applied to include only the uniquely mapped reads. Duplicate reads were then removed using MarkDuplicates (v4.1.4.1, Picard). Replicate samples were merged using the Samtools v1.10.
The raw read counts normalization and differential expression were determined by RNA-seq_rif1.R.
coding_genes.R is for further analysis of genes
repeats.R is for further analysis of repeats
featureCounts sample.repeats.Aligned.sortedByCoord.out.bam \
-a ~/project/rif1/mm9_repeats_single_locus.gtf \
-g gene_id2 -o sample.counts -T 40
All data in this article has been quality controlled, the qc reports are as follows: