Details
-
Type:
Task
-
Status: Closed (View Workflow)
-
Priority:
Major
-
Resolution: Done
-
Affects Version/s: None
-
Fix Version/s: None
-
Labels:None
-
Story Points:3
-
Epic Link:
-
Sprint:Fall 2
Description
Goal:
To run the Deseq2 script that Molly perfected on the next flow results of the reference free assemblies. The goal here is get Brandon more acquainted with R and to see what difference we can observe within a variety.
e.g., Heinz versus Heinz de novo
To start, Brandon has a copy of Molly's script, and the tomato data for the Muday lab. the salmon counts table and the metafile needed to run the script.
1. Get this to run as intended.
2. Make a metafile for the Heinz versus Heinz de novo run.
3. Tweak the Heinz versus Heinz de novo nextflow table to be suitable for this script.
4. Run the script!
Attachments
Activity
| Field | Original Value | New Value |
|---|---|---|
| Epic Link | IGBF-2993 [ 21429 ] |
| Status | To-Do [ 10305 ] | In Progress [ 3 ] |
| Status | In Progress [ 3 ] | Needs 1st Level Review [ 10005 ] |
| Status | Needs 1st Level Review [ 10005 ] | First Level Review in Progress [ 10301 ] |
| Status | First Level Review in Progress [ 10301 ] | Ready for Pull Request [ 10304 ] |
| Status | Ready for Pull Request [ 10304 ] | Pull Request Submitted [ 10101 ] |
| Status | Pull Request Submitted [ 10101 ] | Reviewing Pull Request [ 10303 ] |
| Status | Reviewing Pull Request [ 10303 ] | Merged Needs Testing [ 10002 ] |
| Status | Merged Needs Testing [ 10002 ] | Post-merge Testing In Progress [ 10003 ] |
| Resolution | Done [ 10000 ] | |
| Status | Post-merge Testing In Progress [ 10003 ] | Closed [ 6 ] |
Script was pulled down via the github tools built into Posit's Rstudio software.
The original code ran smoothly producing the expected results.
Trying the code on the de novo assembly data, Brandon ran into a few issues related to the headers of the columns.
We resolved this via a few carefully placed greps within the R code (thanks Steven).
At quick glance, de novo read counds and ref based read counts are entirely different due to the process each dataset goes through. These are not really comparable via EdgeR or deseq2.
So new plan to highlight bias has been devised and this task is no longer important.
Case closed.