[IGBF-3739] Re-run SRA muday 2022 timeseries data again with SL4 - JIRA UNCC

Details

Type: Task
Status: Closed (View Workflow)
Priority: Major
Resolution: Done
Affects Version/s: None
Fix Version/s: None
Labels:
None

Story Points:
2
Epic Link:
Support NSF pollen grant
Sprint:
Spring 10, Summer 1

Description

SRP460750
Directory: /projects/tomato_genome/fnb/dataprocessing/SRP460750/nfcore-SL4-2024-05-21
Previously we noticed that SRA had mismatched some of the data incorrectly and 16 of the sample-names were mislabeled. Dr. Reid reached out and had SRA change everything to the correct sample names. Now we must rerun the muday SRA data again on the cluster with nextflow and make sure the data is correctly labeled.
For this task, we need to confirm and sanity-check the muday time course data that Rob recently uploaded and submitted to the Sequence Read Archive.
If the data are good, we will replace all the existing BAM, junctions, etc. files deployed in the "hotpollen" quickload site with newly processed data.
For this task:
Check SRP on NCBI and review submission
Download the data onto the cluster by using the SRP name
Run nf-core/rnaseq pipeline
Run our coverage graph and junctions scripts on the data
Note that all files should now use their "SRR" names instead of the existing file names.

Attachments

Issue Links

relates to

IGBF-3720 Re-run Nextflow Muday time course data again with SL5 and data downloaded from SRA

Closed

Activity

Ascending order - Click to sort in descending order

Hide

Permalink

Molly Davis added a comment - 21/May/24 2:10 PM

Used this document to run the pipeline: https://docs.google.com/document/d/1ig9ET-ykXF5nAX3P487cXWmZDGUlQpcwrvFXpbyP5vw/edit

Show

Molly Davis added a comment - 21/May/24 2:10 PM Used this document to run the pipeline: https://docs.google.com/document/d/1ig9ET-ykXF5nAX3P487cXWmZDGUlQpcwrvFXpbyP5vw/edit

Hide

Permalink

Molly Davis added a comment - 28/May/24 1:19 PM - edited

Branch: https://bitbucket.org/mdavis4290/molly5-flavonoid-rnaseq/branch/IGBF-3739
Directory: /projects/tomato_genome/fnb/dataprocessing/SRP460750/nfcore-SL4-2024-05-21/results/star_salmon
Reviewer:
Check that files have reasonable sizes (no "zero" size files, for example)
Check that every "FJ.bed.gz" file has a corresponding "FJ.bed.gz.tbi" index file
Check that every bam file has a corresponding "FJ.bed.gz" file
Check that every bam file has a corresponding "scaled.bedgraph.gz" file
Check that every "scaled.bedgraph.gz" has a corresponding "scaled.bedgraph.gz.tbi"

Show

Molly Davis added a comment - 28/May/24 1:19 PM - edited Branch : https://bitbucket.org/mdavis4290/molly5-flavonoid-rnaseq/branch/IGBF-3739 Directory : /projects/tomato_genome/fnb/dataprocessing/SRP460750/nfcore-SL4-2024-05-21/results/star_salmon Reviewer : Check that files have reasonable sizes (no "zero" size files, for example) Check that every "FJ.bed.gz" file has a corresponding "FJ.bed.gz.tbi" index file Check that every bam file has a corresponding "FJ.bed.gz" file Check that every bam file has a corresponding "scaled.bedgraph.gz" file Check that every "scaled.bedgraph.gz" has a corresponding "scaled.bedgraph.gz.tbi"

Hide

Permalink

Robert Reid added a comment - 28/May/24 2:43 PM

I found 72

bam files (~ 400MB in size)
bai files (kb in size)
scaled.bedgraph.gz files (40MB in size)
more bedgraph tbi files (70k in size)

72 bed.gz files, 40MB in size
and 72 tbi files for these bad files

Looks like everything is here!

Show

Robert Reid added a comment - 28/May/24 2:43 PM I found 72 bam files (~ 400MB in size) bai files (kb in size) scaled.bedgraph.gz files (40MB in size) more bedgraph tbi files (70k in size) 72 bed.gz files, 40MB in size and 72 tbi files for these bad files Looks like everything is here!

Hide

Permalink

Molly Davis added a comment - 28/May/24 4:09 PM

PR: https://bitbucket.org/hotpollen/flavonoid-rnaseq/pull-requests/48

Show

Molly Davis added a comment - 28/May/24 4:09 PM PR : https://bitbucket.org/hotpollen/flavonoid-rnaseq/pull-requests/48

Hide

Permalink

Ann Loraine added a comment - 04/Jun/24 11:33 AM

PR is merged. Ready for testing.

Show

Ann Loraine added a comment - 04/Jun/24 11:33 AM PR is merged. Ready for testing.

Hide

Permalink

Molly Davis added a comment - 05/Jun/24 3:19 PM

Review:

All quick load files are accounted for in SL4 directory
I made sure permission were accessible for group chmod -R g+w *
The merged files on bitbucket are correct

Moving to Done!

Show

Molly Davis added a comment - 05/Jun/24 3:19 PM Review : All quick load files are accounted for in SL4 directory I made sure permission were accessible for group chmod -R g+w * The merged files on bitbucket are correct Moving to Done!

People

Assignee:

Molly Davis

Reporter:

Molly Davis

Votes:

0 Vote for this issue

Watchers:

3 Start watching this issue

Dates

Created:

21/May/24 10:01 AM

Updated:

05/Jun/24 3:19 PM

Resolved:

05/Jun/24 3:19 PM