Details
-
Type:
Task
-
Status: Closed (View Workflow)
-
Priority:
Major
-
Resolution: Done
-
Affects Version/s: None
-
Fix Version/s: None
-
Labels:None
-
Story Points:4
-
Epic Link:
-
Sprint:Fall 1, Fall 2, Fall 3
Description
GOAL: To look at how many reads align in 3' UTR regions of interest mentioned by Kausik.
Rob's prevailing thought: No reads align to these areas. Prove him wrong.
- Get file from Kausik's email with the gene of interest
- Find relevant reference annotation (GFF, GTF, refflat file, bed) that contains the gene name.
- Within this file, identify the coordinates for the 3'UTR
- Get coordinates for m6A sites related to this gene.
- Check how many reads for these sites and for the 3' UTR in general.
- Repeat for each gene.
- Repeat for each of the 10 files (Ring, troph, Schitz, etc)
To figure out how to get the depth of reads using samtools.
Can it all be done in Python but we use the samtools.
Grab random genome chunks and hunt for m6A:
1. Typical 3' UTR length (random spot, grab 300 Nucleotides from that spot)
2. Random piece on same chrom
3. m6A predictions in those sections
4. Get bam file counts, do we see anything.
LOTS OF ITERATIONS (Like 1000)