Details
Type: Task
Status: Closed
Priority: Major
Resolution: Done
Affects Version/s: None
Fix Version/s: None
Labels: None
Story Points: 3
Epic Link:
Sprint: Fall 6, Fall 7, Spring 1
Description
Re-run the mark 2022 time series data, available from SRA under the study accession SRP441343, against both the SL4 and SL5 genomes.
For this task, we need to confirm and sanity-check the mark 2022 time series data that Rob uploaded and submitted to the Sequence Read Archive.
If the data are good, we will replace all the existing files (BAM, junctions, etc.) deployed in the "hotpollen" quickload site with the newly processed data.
Steps:
- Check the SRP on NCBI and review the submission
- Download the data onto the cluster using the SRP accession
- Run the nf-core/rnaseq pipeline
- Run our coverage graph and junctions scripts on the data
Note that all files should now use their "SRR" names instead of the existing file names.
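The download step in the list above can be sketched as a dry run that only prints the SRA Toolkit commands for each run. This is a minimal sketch, not the script used for this ticket. The accession list file (for example SRR_Acc_List.txt exported from the SRA Run Selector), the sra/ and fastq/ subdirectories, the thread count, and the accession SRR0000001 in the usage line are all assumptions for illustration.

```shell
#!/usr/bin/env bash
# Dry-run sketch: print (do not run) the SRA Toolkit commands per run accession.
# Assumed, not taken from this ticket: the accession list file, the sra/ and
# fastq/ subdirectories, and the thread count of 8.

# Print the prefetch and fasterq-dump commands for one run accession.
download_commands() {
  local srr="$1" outdir="$2"
  echo "prefetch ${srr} --output-directory ${outdir}/sra"
  echo "fasterq-dump ${outdir}/sra/${srr} --outdir ${outdir}/fastq --split-files --threads 8"
}

# Read accessions (one per line) from a list file and print the commands
# for every line that looks like an SRR accession.
plan_downloads() {
  local list="$1" outdir="$2" srr
  # "|| [ -n ... ]" keeps a final line that has no trailing newline.
  while IFS= read -r srr || [ -n "$srr" ]; do
    case "$srr" in
      SRR*[!0-9]*) continue ;;   # something other than digits after "SRR"
      SRR[0-9]*) ;;              # looks like a run accession
      *) continue ;;             # blank line or unrelated text
    esac
    download_commands "$srr" "$outdir"
  done < "$list"
}

# Usage (hypothetical file name and output directory):
#   plan_downloads SRR_Acc_List.txt /path/to/SRP441343
```

Because the output files are written per accession, this naturally produces files named by SRR, which matches the naming note above. Piping the printed lines to a shell, or pasting them into a batch job, would run the actual downloads.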
Re-run Directory: /projects/tomato_genome/fnb/dataprocessing/SRP441343
SL4: /projects/tomato_genome/fnb/dataprocessing/SRP441343/nfcore-SL4
SL5: /projects/tomato_genome/fnb/dataprocessing/SRP441343/nfcore-SL5
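Both pipeline runs can read the same samplesheet, since only the reference genome differs between them. The sketch below writes one in the column layout that nf-core/rnaseq expects (sample, fastq_1, fastq_2, strandedness), using the SRR accession as the sample name. It assumes paired-end reads that have already been gzip-compressed and named <SRR>_1.fastq.gz and <SRR>_2.fastq.gz, and it sets strandedness to "auto". None of these details come from the ticket.

```shell
#!/usr/bin/env bash
# Sketch: write an nf-core/rnaseq samplesheet keyed by SRR accession.
# Assumed, not taken from this ticket: paired-end reads, gzip-compressed,
# named <SRR>_1.fastq.gz and <SRR>_2.fastq.gz; strandedness set to "auto".

make_samplesheet() {
  local fastq_dir="$1" r1 r2 srr
  echo "sample,fastq_1,fastq_2,strandedness"
  for r1 in "$fastq_dir"/SRR*_1.fastq.gz; do
    [ -e "$r1" ] || continue                 # no match: the glob stays literal
    srr="$(basename "$r1" _1.fastq.gz)"      # strip the read-1 suffix
    r2="${fastq_dir}/${srr}_2.fastq.gz"
    if [ ! -e "$r2" ]; then
      echo "missing read 2 file for ${srr}, skipping" >&2
      continue
    fi
    echo "${srr},${r1},${r2},auto"
  done
}

# Usage (hypothetical FASTQ directory):
#   make_samplesheet /path/to/fastq > samplesheet.csv
```

The resulting file would be passed to the pipeline through its --input parameter, once in the nfcore-SL4 directory and once in nfcore-SL5, each with its own genome files.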
Prefetch SRR Script:
Execute:
Faster Dump Script:
Execute: