[IGBF-3740] Reorganize the Muday Lab folder in the RNA-Seq Quickload - JIRA UNCC

Details

Type: Task
Status: Closed (View Workflow)
Priority: Major
Resolution: Done
Affects Version/s: None
Fix Version/s: None
Labels:
None

Story Points:
1
Epic Link:
Publish our work
Sprint:
Spring 10, Summer 1, Summer 2

Description

Situation: The RNA-Seq Quickload needs reorganizing prior to the ASPB workshop. Specifically, the Muday Lab folder is currently organized in a way that makes sense, but I think that we could optimize this folder for the workshop by organizing it a bit differently.

Tasks:
1. Reorganize the Muday Lab folder:

List all of the A (are) samples first, then V (VF36), then F (VF36-F3H-T3).
Within each of these subgroups, keep them listed in order first by time point, then by temperature, then by replicate.
Do this for all three subfolders (Reads, Coverage Graphs, and Junctions).

2. Give the last folder in the tomato RNA-Seq Quickload ("SRP328042 - anther development under simulated drought stress") a name that matches with the other folders. I believe the PI associated with that SRP data is Gang Lu. Here's a link to their paper: https://www.mdpi.com/2073-4409/10/7/1809.

With all of this information, the title of the folder after renaming should be something like "Lu Lab - anther, # varieties, drought stress, # minutes (SRP328042)".

Attachments

Issue Links

relates to

IGBF-3672 Draft a workshop outline for ASPB 2024

Closed

Activity

Ascending order - Click to sort in descending order

Paige Kulzer (Inactive) created issue - 21/May/24 10:50 AM

Paige Kulzer (Inactive) made changes - 21/May/24 10:50 AM

Field	Original Value	New Value
Epic Link		IGBF-2809 [ 19325 ]

Paige Kulzer (Inactive) made changes - 21/May/24 10:50 AM

Link

This issue relates to ~~IGBF-3672~~ [ ~~IGBF-3672~~ ]

Paige Kulzer (Inactive) made changes - 21/May/24 10:50 AM

Rank

Ranked higher

Ann Loraine made changes - 27/May/24 10:03 AM

Sprint

Spring 10 [ 194 ]

Spring 10, Summer 1 [ 194, 195 ]

Ann Loraine made changes - 27/May/24 10:03 AM

Rank

Ranked higher

Ann Loraine made changes - 28/May/24 2:47 PM

Status

To-Do [ 10305 ]

In Progress [ 3 ]

Hide

Permalink

Ann Loraine added a comment - 29/May/24 10:27 AM - edited

I have made many changes to the organization and naming schemes for the data in IGB RNA-Seq quickload.
First main change is that I sorted the "are" data from the Muday Lab as requested.
Second main change is that all the data sets created for the NSF project - including data harvested from the SRA - now reside in a folder named for the project, with subfolders named for individual laboratories and tissue types investigated.
I did this because I felt it is useful to separate different lab's datasets into folders because where a dataset was produced (the laboratory location) is a confounder in data analysis.
Similarly, some experiments were done at different times, by different people. Those datasets are also separated into folders.
Thus, I am separating data collections into folders using confounders. The idea is to make it super clear to users which collections of samples can be compared to each others in the same collection using standard methods, and which would need to be compared with the known confounders taken into account.

Please review by visiting tomato genome assemblies from June 2022 (SL5) and Sept. 2019 (SL4).

Show

Ann Loraine added a comment - 29/May/24 10:27 AM - edited I have made many changes to the organization and naming schemes for the data in IGB RNA-Seq quickload. First main change is that I sorted the "are" data from the Muday Lab as requested. Second main change is that all the data sets created for the NSF project - including data harvested from the SRA - now reside in a folder named for the project, with subfolders named for individual laboratories and tissue types investigated. I did this because I felt it is useful to separate different lab's datasets into folders because where a dataset was produced (the laboratory location) is a confounder in data analysis. Similarly, some experiments were done at different times, by different people. Those datasets are also separated into folders. Thus, I am separating data collections into folders using confounders. The idea is to make it super clear to users which collections of samples can be compared to each others in the same collection using standard methods, and which would need to be compared with the known confounders taken into account. Please review by visiting tomato genome assemblies from June 2022 (SL5) and Sept. 2019 (SL4).

Ann Loraine made changes - 29/May/24 10:27 AM

Status

In Progress [ 3 ]

Needs 1st Level Review [ 10005 ]

Ann Loraine made changes - 29/May/24 10:27 AM

Assignee

Ann Loraine [ aloraine ]

Paige Kulzer [ pkulzer ]

Paige Kulzer (Inactive) made changes - 04/Jun/24 8:58 AM

Status

Needs 1st Level Review [ 10005 ]

First Level Review in Progress [ 10301 ]

Paige Kulzer (Inactive) made changes - 04/Jun/24 10:56 AM

Status

First Level Review in Progress [ 10301 ]

Needs 1st Level Review [ 10005 ]

Paige Kulzer (Inactive) made changes - 06/Jun/24 10:17 AM

Status

Needs 1st Level Review [ 10005 ]

First Level Review in Progress [ 10301 ]

Ann Loraine made changes - 09/Jun/24 4:44 PM

Sprint

Spring 10, Summer 1 [ 194, 195 ]

Spring 10, Summer 1, Summer 2 [ 194, 195, 196 ]

Ann Loraine made changes - 09/Jun/24 4:44 PM

Rank

Ranked higher

Hide

Permalink

Paige Kulzer (Inactive) added a comment - 11/Jun/24 9:15 AM

The Muday lab folder is now organized well for the upcoming ASPB workshop. I noticed several things that I'd like to make note of, but these don't need to be fixed right now:

Some folders have SRP numbers, some have a longer string of numbers, and others have no number attached to them at all.
Similarly, folder names follow slightly different patterns. For example, some names follow the pattern, "[part of tomato] heat stress, [#] varieties, [time] min". Others follow a different pattern like "heat stress [time] min".
Some file names begin with a sample code (e.g., Muday > Reads), whereas others end with this sample code (e.g., Johnson Lab > 30 - 120 min timeseries > Reads)
File names from the Reproduction-related data... > Reads folder have SRR numbers, but no other file names in the Quickload have them.

Thank you for these changes! Moving this ticket to done now that the Muday lab folder has been reorganized.

Show

Paige Kulzer (Inactive) added a comment - 11/Jun/24 9:15 AM The Muday lab folder is now organized well for the upcoming ASPB workshop. I noticed several things that I'd like to make note of, but these don't need to be fixed right now: Some folders have SRP numbers, some have a longer string of numbers, and others have no number attached to them at all. Similarly, folder names follow slightly different patterns. For example, some names follow the pattern, " [part of tomato] heat stress, [#] varieties, [time] min". Others follow a different pattern like "heat stress [time] min". Some file names begin with a sample code (e.g., Muday > Reads), whereas others end with this sample code (e.g., Johnson Lab > 30 - 120 min timeseries > Reads) File names from the Reproduction-related data... > Reads folder have SRR numbers, but no other file names in the Quickload have them. Thank you for these changes! Moving this ticket to done now that the Muday lab folder has been reorganized.

Paige Kulzer (Inactive) made changes - 11/Jun/24 9:15 AM

Status

First Level Review in Progress [ 10301 ]

Ready for Pull Request [ 10304 ]

Paige Kulzer (Inactive) made changes - 11/Jun/24 9:15 AM

Status

Ready for Pull Request [ 10304 ]

Pull Request Submitted [ 10101 ]

Paige Kulzer (Inactive) made changes - 11/Jun/24 9:15 AM

Status

Pull Request Submitted [ 10101 ]

Reviewing Pull Request [ 10303 ]

Paige Kulzer (Inactive) made changes - 11/Jun/24 9:15 AM

Status

Reviewing Pull Request [ 10303 ]

Merged Needs Testing [ 10002 ]

Paige Kulzer (Inactive) made changes - 11/Jun/24 9:15 AM

Status

Merged Needs Testing [ 10002 ]

Post-merge Testing In Progress [ 10003 ]

Paige Kulzer (Inactive) made changes - 11/Jun/24 9:15 AM

Resolution		Done [ 10000 ]
Status	Post-merge Testing In Progress [ 10003 ]	Closed [ 6 ]

People

Assignee:

Paige Kulzer (Inactive)

Reporter:

Paige Kulzer (Inactive)

Votes:

0 Vote for this issue

Watchers:

2 Start watching this issue

Dates

Created:

21/May/24 10:50 AM

Updated:

11/Jun/24 9:15 AM

Resolved:

11/Jun/24 9:15 AM