[IGBF-3251] Process and deploy Palanivelu Lab data - JIRA UNCC

Ann Loraine created issue - 07/Feb/23 10:41 AM

Ann Loraine made changes - 07/Feb/23 10:41 AM

Field	Original Value	New Value
Epic Link		IGBF-2993 [ 21429 ]

Ann Loraine made changes - 08/Feb/23 11:21 AM

Description

RR to describe location, etc.

For this task, process new data set from the Palanivelu lab.

RR has downloaded the data onto the UNCC cluster and saved it here: /projects/tomato_genome/rnaseq/30-804059537-kelsie.
Report from the sequencer (Azenta) is attached.

To-do:

* Run nf-core/rnaseq pipeline with SL5/2022 target genome assembly on just one sample to get a preliminary MultiQC report. Check the strandedness parameter.
* Run nf-core/rnaseq pipeline with SL5/2022 target genome assembly on all the samples using the correct strandedness parameter.
* Rename BAM files to not included "sorted" in the name.
* Create scaled coverage graphs.
* Create junction files.
* Migrate data to an on-line location for IGB visualization
* Create annots.xml metadata file for visualization

Ann Loraine made changes - 08/Feb/23 11:21 AM

Attachment

Azenta_30-804059537_Data_Report.html [ 17672 ]

Ann Loraine made changes - 08/Feb/23 11:25 AM

Description

For this task, process new data set from the Palanivelu lab.

RR has downloaded the data onto the UNCC cluster and saved it here: /projects/tomato_genome/rnaseq/30-804059537-kelsie.
Report from the sequencer (Azenta) is attached.

To-do:

* Run nf-core/rnaseq pipeline with SL5/2022 target genome assembly on just one sample to get a preliminary MultiQC report. Check the strandedness parameter.
* Run nf-core/rnaseq pipeline with SL5/2022 target genome assembly on all the samples using the correct strandedness parameter.
* Rename BAM files to not included "sorted" in the name.
* Create scaled coverage graphs.
* Create junction files.
* Migrate data to an on-line location for IGB visualization
* Create annots.xml metadata file for visualization

For this task, process new data set from the Palanivelu lab.

RR has downloaded the data onto the UNCC cluster and saved it here: /projects/tomato_genome/rnaseq/30-804059537-kelsie.
Report from the sequencer (Azenta) is attached.
Please do the data processing in this directory:

* /nobackup/tomato_genome/30-804059537-kelsie

To-do:

* Run nf-core/rnaseq pipeline with SL5/2022 target genome assembly on just one sample to get a preliminary MultiQC report. Check the strandedness parameter.
* Run nf-core/rnaseq pipeline with SL5/2022 target genome assembly on all the samples using the correct strandedness parameter.
* Rename BAM files to not included "sorted" in the name.
* Create scaled coverage graphs.
* Create junction files.
* Migrate data to an on-line location for IGB visualization
* Create annots.xml metadata file for visualization

Ann Loraine made changes - 08/Feb/23 11:28 AM

Description

For this task, process new data set from the Palanivelu lab.

RR has downloaded the data onto the UNCC cluster and saved it here: /projects/tomato_genome/rnaseq/30-804059537-kelsie.
Report from the sequencer (Azenta) is attached.
Please do the data processing in this directory:

* /nobackup/tomato_genome/30-804059537-kelsie

To-do:

* Run nf-core/rnaseq pipeline with SL5/2022 target genome assembly on just one sample to get a preliminary MultiQC report. Check the strandedness parameter.
* Run nf-core/rnaseq pipeline with SL5/2022 target genome assembly on all the samples using the correct strandedness parameter.
* Rename BAM files to not included "sorted" in the name.
* Create scaled coverage graphs.
* Create junction files.
* Migrate data to an on-line location for IGB visualization
* Create annots.xml metadata file for visualization

For this task, process new data set from the Palanivelu lab.

RR has downloaded the data onto the UNCC cluster and saved it here: /projects/tomato_genome/rnaseq/30-804059537-kelsie.
Report from the sequencer (Azenta) is attached.
Please do the data processing in this directory:

* /nobackup/tomato_genome/30-804059537-kelsie

To-do:

* Run nf-core/rnaseq pipeline with SL5/2022 target genome assembly on all the samples "reverse" strandedness parameter.
* Check the multi-qc report (attack to this ticket). Re-run processing as necessary.
* Rename BAM files to not included "sorted" in the name.
* Create scaled coverage graphs.
* Create junction files.
* Migrate data to an on-line location for IGB visualization
* Create annots.xml metadata file for visualization

Ann Loraine made changes - 08/Feb/23 11:29 AM

Attachment

30-804059537.pdf [ 17673 ]

Ann Loraine made changes - 08/Feb/23 11:30 AM

Description

For this task, process new data set from the Palanivelu lab.

RR has downloaded the data onto the UNCC cluster and saved it here: /projects/tomato_genome/rnaseq/30-804059537-kelsie.
Report from the sequencer (Azenta) is attached.
Please do the data processing in this directory:

* /nobackup/tomato_genome/30-804059537-kelsie

To-do:

* Run nf-core/rnaseq pipeline with SL5/2022 target genome assembly on all the samples "reverse" strandedness parameter.
* Check the multi-qc report (attack to this ticket). Re-run processing as necessary.
* Rename BAM files to not included "sorted" in the name.
* Create scaled coverage graphs.
* Create junction files.
* Migrate data to an on-line location for IGB visualization
* Create annots.xml metadata file for visualization

For this task, process new data set from the Palanivelu lab.

RR has downloaded the data onto the UNCC cluster and saved it here: /projects/tomato_genome/rnaseq/30-804059537-kelsie.
Please do the data processing in this directory:

* /nobackup/tomato_genome/30-804059537-kelsie

To-do:

* Run nf-core/rnaseq pipeline with SL5/2022 target genome assembly on all the samples "reverse" strandedness parameter.
* Check the multi-qc report (attack to this ticket). Re-run processing as necessary.
* Rename BAM files to not included "sorted" in the name.
* Create scaled coverage graphs.
* Create junction files.
* Migrate data to an on-line location for IGB visualization
* Create annots.xml metadata file for visualization

Attached:
* Azenta (sequencing provider) data report, with numbers of sequences produced
* Quote from Azenta indicating strand-specific RNA-Seq, 2x150 bp paired end sequencing

Ann Loraine made changes - 08/Feb/23 11:39 AM

Description

For this task, process new data set from the Palanivelu lab.

RR has downloaded the data onto the UNCC cluster and saved it here: /projects/tomato_genome/rnaseq/30-804059537-kelsie.
Please do the data processing in this directory:

* /nobackup/tomato_genome/30-804059537-kelsie

To-do:

* Run nf-core/rnaseq pipeline with SL5/2022 target genome assembly on all the samples "reverse" strandedness parameter.
* Check the multi-qc report (attack to this ticket). Re-run processing as necessary.
* Rename BAM files to not included "sorted" in the name.
* Create scaled coverage graphs.
* Create junction files.
* Migrate data to an on-line location for IGB visualization
* Create annots.xml metadata file for visualization

Attached:
* Azenta (sequencing provider) data report, with numbers of sequences produced
* Quote from Azenta indicating strand-specific RNA-Seq, 2x150 bp paired end sequencing

For this task, process new data set from the Palanivelu lab. These data are from Kelsey

RR has downloaded the data onto the UNCC cluster and saved it here: /projects/tomato_genome/rnaseq/30-804059537-kelsie.
Please do the data processing in this directory:

* /nobackup/tomato_genome/30-804059537-kelsie

To-do:

* Run nf-core/rnaseq pipeline with SL5/2022 target genome assembly on all the samples "reverse" strandedness parameter.
* Check the multi-qc report (attack to this ticket). Re-run processing as necessary.
* Rename BAM files to not included "sorted" in the name.
* Create scaled coverage graphs.
* Create junction files.
* Migrate data to an on-line location for IGB visualization
* Create annots.xml metadata file for visualization

Attached:
* Azenta (sequencing provider) data report, with numbers of sequences produced
* Quote from Azenta indicating strand-specific RNA-Seq, 2x150 bp paired end sequencing

Ann Loraine made changes - 08/Feb/23 11:44 AM

Description

For this task, process new data set from the Palanivelu lab. These data are from Kelsey

RR has downloaded the data onto the UNCC cluster and saved it here: /projects/tomato_genome/rnaseq/30-804059537-kelsie.
Please do the data processing in this directory:

* /nobackup/tomato_genome/30-804059537-kelsie

To-do:

* Run nf-core/rnaseq pipeline with SL5/2022 target genome assembly on all the samples "reverse" strandedness parameter.
* Check the multi-qc report (attack to this ticket). Re-run processing as necessary.
* Rename BAM files to not included "sorted" in the name.
* Create scaled coverage graphs.
* Create junction files.
* Migrate data to an on-line location for IGB visualization
* Create annots.xml metadata file for visualization

Attached:
* Azenta (sequencing provider) data report, with numbers of sequences produced
* Quote from Azenta indicating strand-specific RNA-Seq, 2x150 bp paired end sequencing

For this task, process new data set from the Palanivelu lab. These data are from Kelsey

RR has downloaded the data onto the UNCC cluster and saved it here: /projects/tomato_genome/rnaseq/30-804059537-kelsie.
Please do the data processing in this directory:

* /nobackup/tomato_genome/30-804059537-kelsie

Bitbucket repo: https://bitbucket.org/hotpollen/ovary-rnaseq/src/main/

To-do:

* Run nf-core/rnaseq pipeline with SL5/2022 target genome assembly on all the samples "reverse" strandedness parameter.
* Check the multi-qc report (attack to this ticket). Re-run processing as necessary.
* Rename BAM files to not included "sorted" in the name.
* Create scaled coverage graphs.
* Create junction files.
* Migrate data to an on-line location for IGB visualization
* Create annots.xml metadata file for visualization

Attached:
* Azenta (sequencing provider) data report, with numbers of sequences produced
* Quote from Azenta indicating strand-specific RNA-Seq, 2x150 bp paired end sequencing

Ann Loraine made changes - 08/Feb/23 11:48 AM

Description

For this task, process new data set from the Palanivelu lab. These data are from Kelsey

RR has downloaded the data onto the UNCC cluster and saved it here: /projects/tomato_genome/rnaseq/30-804059537-kelsie.
Please do the data processing in this directory:

* /nobackup/tomato_genome/30-804059537-kelsie

Bitbucket repo: https://bitbucket.org/hotpollen/ovary-rnaseq/src/main/

To-do:

* Run nf-core/rnaseq pipeline with SL5/2022 target genome assembly on all the samples "reverse" strandedness parameter.
* Check the multi-qc report (attack to this ticket). Re-run processing as necessary.
* Rename BAM files to not included "sorted" in the name.
* Create scaled coverage graphs.
* Create junction files.
* Migrate data to an on-line location for IGB visualization
* Create annots.xml metadata file for visualization

Attached:
* Azenta (sequencing provider) data report, with numbers of sequences produced
* Quote from Azenta indicating strand-specific RNA-Seq, 2x150 bp paired end sequencing

For this task, process new data set from the Palanivelu lab. These data are from Kelsey

RR has downloaded the data onto the UNCC cluster and saved it here: /projects/tomato_genome/rnaseq/30-804059537-kelsie.
Please do the data processing in this directory:

* /nobackup/tomato_genome/30-804059537-kelsie

Bitbucket repo: https://bitbucket.org/hotpollen/ovary-rnaseq/src/main/

To-do:

* Run nf-core/rnaseq pipeline with SL5/2022 target genome assembly using "reverse" strandedness parameter.
* Check the multi-qc report (attack to this ticket). Re-run processing as necessary.
* Rename BAM files to not included "sorted" in the name.
* Create scaled coverage graphs.
* Create junction files.
* Migrate data to an on-line location for IGB visualization
* Create annots.xml metadata file for visualization

Attached:
* Azenta (sequencing provider) data report, with numbers of sequences produced
* Quote from Azenta indicating strand-specific RNA-Seq, 2x150 bp paired end sequencing

Ann Loraine made changes - 08/Feb/23 12:03 PM

Description

For this task, process new data set from the Palanivelu lab. These data are from Kelsey

RR has downloaded the data onto the UNCC cluster and saved it here: /projects/tomato_genome/rnaseq/30-804059537-kelsie.
Please do the data processing in this directory:

* /nobackup/tomato_genome/30-804059537-kelsie

Bitbucket repo: https://bitbucket.org/hotpollen/ovary-rnaseq/src/main/

To-do:

* Run nf-core/rnaseq pipeline with SL5/2022 target genome assembly using "reverse" strandedness parameter.
* Check the multi-qc report (attack to this ticket). Re-run processing as necessary.
* Rename BAM files to not included "sorted" in the name.
* Create scaled coverage graphs.
* Create junction files.
* Migrate data to an on-line location for IGB visualization
* Create annots.xml metadata file for visualization

Attached:
* Azenta (sequencing provider) data report, with numbers of sequences produced
* Quote from Azenta indicating strand-specific RNA-Seq, 2x150 bp paired end sequencing

For this task, process new data set from the Palanivelu lab. These data are from Kelsey

RR has downloaded the data onto the UNCC cluster and saved it here: /projects/tomato_genome/rnaseq/30-804059537-kelsie.
Please do the data processing in this directory:

* /nobackup/tomato_genome/30-804059537-kelsie

Bitbucket repo: https://bitbucket.org/hotpollen/pistil-rna-seq

To-do:

* Run nf-core/rnaseq pipeline with SL5/2022 target genome assembly using "reverse" strandedness parameter.
* Check the multi-qc report (attack to this ticket). Re-run processing as necessary.
* Rename BAM files to not included "sorted" in the name.
* Create scaled coverage graphs.
* Create junction files.
* Migrate data to an on-line location for IGB visualization
* Create annots.xml metadata file for visualization

Attached:
* Azenta (sequencing provider) data report, with numbers of sequences produced
* Quote from Azenta indicating strand-specific RNA-Seq, 2x150 bp paired end sequencing

Ann Loraine made changes - 08/Feb/23 12:04 PM

Description

For this task, process new data set from the Palanivelu lab. These data are from Kelsey

RR has downloaded the data onto the UNCC cluster and saved it here: /projects/tomato_genome/rnaseq/30-804059537-kelsie.
Please do the data processing in this directory:

* /nobackup/tomato_genome/30-804059537-kelsie

Bitbucket repo: https://bitbucket.org/hotpollen/pistil-rna-seq

To-do:

* Run nf-core/rnaseq pipeline with SL5/2022 target genome assembly using "reverse" strandedness parameter.
* Check the multi-qc report (attack to this ticket). Re-run processing as necessary.
* Rename BAM files to not included "sorted" in the name.
* Create scaled coverage graphs.
* Create junction files.
* Migrate data to an on-line location for IGB visualization
* Create annots.xml metadata file for visualization

Attached:
* Azenta (sequencing provider) data report, with numbers of sequences produced
* Quote from Azenta indicating strand-specific RNA-Seq, 2x150 bp paired end sequencing

For this task, process new data set from the Palanivelu lab. These data are from Kelsey

RR has downloaded the data onto the UNCC cluster and saved it here: /projects/tomato_genome/rnaseq/30-804059537-kelsie.
Please do the data processing in this directory:

* /nobackup/tomato_genome/30-804059537-kelsie

Bitbucket repo: https://bitbucket.org/hotpollen/pistil-rna-seq

To-do:

* Run nf-core/rnaseq pipeline with SL5/2022 target genome assembly using "reverse" strandedness parameter.
* Check the multi-qc report (attack to this ticket). Re-run processing as necessary.
* Rename BAM files to not included "sorted" in the name.
* Create scaled coverage graphs.
* Create junction files.
* Migrate data to an on-line location for IGB visualization
* Create annots.xml metadata file for visualization

Attached:
* Azenta (sequencing provider) data report, with numbers of sequences produced
* Quote from Azenta indicating strand-specific RNA-Seq, 2x150 bp paired end sequencing

Contact:
* Kelsey Pryze - kelseypryze@email.arizona.edu

Ann Loraine made changes - 08/Feb/23 12:05 PM

Description

For this task, process new data set from the Palanivelu lab. These data are from Kelsey

RR has downloaded the data onto the UNCC cluster and saved it here: /projects/tomato_genome/rnaseq/30-804059537-kelsie.
Please do the data processing in this directory:

* /nobackup/tomato_genome/30-804059537-kelsie

Bitbucket repo: https://bitbucket.org/hotpollen/pistil-rna-seq

To-do:

* Run nf-core/rnaseq pipeline with SL5/2022 target genome assembly using "reverse" strandedness parameter.
* Check the multi-qc report (attack to this ticket). Re-run processing as necessary.
* Rename BAM files to not included "sorted" in the name.
* Create scaled coverage graphs.
* Create junction files.
* Migrate data to an on-line location for IGB visualization
* Create annots.xml metadata file for visualization

Attached:
* Azenta (sequencing provider) data report, with numbers of sequences produced
* Quote from Azenta indicating strand-specific RNA-Seq, 2x150 bp paired end sequencing

Contact:
* Kelsey Pryze - kelseypryze@email.arizona.edu

For this task, process new data set from the Palanivelu lab. These data are from Kelsey

RR has downloaded the data onto the UNCC cluster and saved it here: /projects/tomato_genome/rnaseq/30-804059537-kelsie.
Please do the data processing in this directory:

* /nobackup/tomato_genome/30-804059537-KP (kp for "Kelsey Pryze")

Bitbucket repo: https://bitbucket.org/hotpollen/pistil-rna-seq

To-do:

* Run nf-core/rnaseq pipeline with SL5/2022 target genome assembly using "reverse" strandedness parameter.
* Check the multi-qc report (attack to this ticket). Re-run processing as necessary.
* Rename BAM files to not included "sorted" in the name.
* Create scaled coverage graphs.
* Create junction files.
* Migrate data to an on-line location for IGB visualization
* Create annots.xml metadata file for visualization

Attached:
* Azenta (sequencing provider) data report, with numbers of sequences produced
* Quote from Azenta indicating strand-specific RNA-Seq, 2x150 bp paired end sequencing

Contact:
* Kelsey Pryze - kelseypryze@email.arizona.edu

Molly Davis made changes - 08/Feb/23 2:52 PM

Attachment

KP_samples.csv [ 17674 ]

Molly Davis made changes - 08/Feb/23 4:27 PM

Status

To-Do [ 10305 ]

In Progress [ 3 ]

Molly Davis made changes - 08/Feb/23 4:28 PM

Assignee

Molly Davis [ molly ]

Molly Davis made changes - 09/Feb/23 9:40 AM

Attachment

Screen Shot 2023-02-09 at 9.40.46 AM.png [ 17678 ]

Hide

Permalink

Molly Davis added a comment - 09/Feb/23 9:40 AM - edited

Pipeline successfully ran:
Unable to render embedded object: File (Screen Shot 2023-02-09 at 9.40.46 AM.png) not found.

Directory: /nobackup/tomato_genome/30-804059537-KP

Comment: There are no errors in the report but the number of sequences mapped is pretty low. Might need to look into that! Double check sample sheet I made maybe or the wrong reference genome was used to map data.

Sequence Duplication levels might be an issue
Unable to render embedded object: File (Screen Shot 2023-02-09 at 10.28.15 AM.png) not found.
Per sequence GC Content also poor
Alignment scores are poor because unmapped reads are too short.

Link to interpret report: https://nf-co.re/eager/2.2.2/output#multiqc-report

Next steps:

remove sorted names
make coverage graphs

Show

Molly Davis added a comment - 09/Feb/23 9:40 AM - edited Pipeline successfully ran: Unable to render embedded object: File (Screen Shot 2023-02-09 at 9.40.46 AM.png) not found. Directory: /nobackup/tomato_genome/30-804059537-KP Comment: There are no errors in the report but the number of sequences mapped is pretty low. Might need to look into that! Double check sample sheet I made maybe or the wrong reference genome was used to map data. Sequence Duplication levels might be an issue Unable to render embedded object: File (Screen Shot 2023-02-09 at 10.28.15 AM.png) not found. Per sequence GC Content also poor Alignment scores are poor because unmapped reads are too short. Link to interpret report: https://nf-co.re/eager/2.2.2/output#multiqc-report Next steps: remove sorted names make coverage graphs

Molly Davis made changes - 09/Feb/23 9:45 AM

Attachment

KP_multiqc_report.html [ 17679 ]

Hide

Permalink

Ann Loraine added a comment - 09/Feb/23 10:24 AM - edited

The "star alignment scores" section looks informative. According to the plot, a high percentage of sequences in some samples were reported as "Unmapped: too short." I think this can happen when the library contained a lot of very short inserts. I think we should go ahead and proceed with the rest of the pipeline, but keep an eye on how those samples with the largest percentage of "shorties" perform in subsequent analyses.

attn: [~molly]

Show

Ann Loraine added a comment - 09/Feb/23 10:24 AM - edited The "star alignment scores" section looks informative. According to the plot, a high percentage of sequences in some samples were reported as "Unmapped: too short." I think this can happen when the library contained a lot of very short inserts. I think we should go ahead and proceed with the rest of the pipeline, but keep an eye on how those samples with the largest percentage of "shorties" perform in subsequent analyses. attn: [~molly]

Molly Davis made changes - 09/Feb/23 10:28 AM

Attachment

Screen Shot 2023-02-09 at 10.28.15 AM.png [ 17680 ]

Molly Davis made changes - 09/Feb/23 10:35 AM

Attachment

Screen Shot 2023-02-09 at 10.28.15 AM.png [ 17681 ]

Molly Davis made changes - 09/Feb/23 10:35 AM

Attachment

Screen Shot 2023-02-09 at 10.28.15 AM.png [ 17681 ]

Molly Davis made changes - 09/Feb/23 10:35 AM

Attachment

Screen Shot 2023-02-09 at 10.35.27 AM.png [ 17682 ]

Hide

Permalink

Molly Davis added a comment - 09/Feb/23 10:52 AM - edited

Other people were having the same issue and said:

"I encountered this issue when in the two paired-end input FASTQ files mates are out-of-order, i.e. mates are not found at the same line of the two files. This leads to a lot of not properly mapped read pairs that STAR throws into the "too short" bucket." https://github.com/alexdobin/STAR/issues/169

Not sure if this is the same but I will look more into it and check the fastq file directory. The naming of the files might be confusing nextflow as well because R1 and R2 are used twice in some file names.

[~aloraine]

Show

Molly Davis added a comment - 09/Feb/23 10:52 AM - edited Other people were having the same issue and said: "I encountered this issue when in the two paired-end input FASTQ files mates are out-of-order, i.e. mates are not found at the same line of the two files. This leads to a lot of not properly mapped read pairs that STAR throws into the "too short" bucket." https://github.com/alexdobin/STAR/issues/169 Not sure if this is the same but I will look more into it and check the fastq file directory. The naming of the files might be confusing nextflow as well because R1 and R2 are used twice in some file names. [~aloraine]

Hide

Permalink

Ann Loraine added a comment - 10/Feb/23 10:20 AM

Renaming code: https://bitbucket.org/hotpollen/splicing-analysis/src/main/src/renameBams.sh

Show

Ann Loraine added a comment - 10/Feb/23 10:20 AM Renaming code: https://bitbucket.org/hotpollen/splicing-analysis/src/main/src/renameBams.sh

Molly Davis made changes - 10/Feb/23 2:49 PM

Attachment

KP_samples.csv [ 17686 ]

Hide

Permalink

Molly Davis added a comment - 10/Feb/23 2:49 PM - edited

Update:

I have changed the names of the fastq files so instead of having R1, R2, R3 it is now Rep1, Rep2, Rep3. This is due to nextflow possibly confusing paired end files and not being matched together correctly.
Here is the new csv samples file: [^KP_samples.csv]
Rerunning nextflow.

Show

Molly Davis added a comment - 10/Feb/23 2:49 PM - edited Update: I have changed the names of the fastq files so instead of having R1, R2, R3 it is now Rep1, Rep2, Rep3. This is due to nextflow possibly confusing paired end files and not being matched together correctly. Here is the new csv samples file: [^KP_samples.csv] Rerunning nextflow.

Molly Davis made changes - 13/Feb/23 9:44 AM

Attachment

KP_multiqc_report.html [ 17687 ]

Hide

Permalink

Molly Davis added a comment - 13/Feb/23 9:49 AM - edited

Update:

New MutliQC report:
[^KP_multiqc_report.html]

Comment: Unfortunately, even after changing the fastq file names the mutliqc report is the exact same as the last one with Unmapped: too short # of reads for the alignment scores.

Here is the txt file to see the alignment scores instead of using the actual graph from the report:
multiqc_star.txt

Directory: /nobackup/tomato_genome/30-804059537-KP/results/multiqc/star_salmon/multiqc_data

Show

Molly Davis added a comment - 13/Feb/23 9:49 AM - edited Update: New MutliQC report: [^KP_multiqc_report.html] Comment: Unfortunately, even after changing the fastq file names the mutliqc report is the exact same as the last one with Unmapped: too short # of reads for the alignment scores. Here is the txt file to see the alignment scores instead of using the actual graph from the report: multiqc_star.txt Directory: /nobackup/tomato_genome/30-804059537-KP/results/multiqc/star_salmon/multiqc_data

Hide

Permalink

Ann Loraine added a comment - 13/Feb/23 10:43 AM

Next steps:

Proceed with using the output from most recent run of nextflow nf-core/rnaseq pipeline which used the renamed samples (e.g., Rep1, Rep2, Rep3)
[~aloraine] to review samples file for possible problems

Show

Ann Loraine added a comment - 13/Feb/23 10:43 AM Next steps: Proceed with using the output from most recent run of nextflow nf-core/rnaseq pipeline which used the renamed samples (e.g., Rep1, Rep2, Rep3) [~aloraine] to review samples file for possible problems

Molly Davis made changes - 13/Feb/23 11:31 AM

Attachment

Screen Shot 2023-02-09 at 10.28.15 AM.png [ 17680 ]

Molly Davis made changes - 13/Feb/23 11:31 AM

Attachment

multiqc_star.txt [ 17688 ]

Hide

Permalink

Molly Davis added a comment - 13/Feb/23 3:34 PM

Update:

I moved forward with results and renamed sorted bam files and made coverage graphs.

Directory: /nobackup/tomato_genome/30-804059537-KP/results/star_salmon

Show

Molly Davis added a comment - 13/Feb/23 3:34 PM Update: I moved forward with results and renamed sorted bam files and made coverage graphs. Directory: /nobackup/tomato_genome/30-804059537-KP/results/star_salmon

Molly Davis made changes - 14/Feb/23 10:38 AM

Attachment

KP_samples.csv [ 17674 ]

Molly Davis made changes - 14/Feb/23 10:38 AM

Attachment

KP_multiqc_report.html [ 17679 ]

Hide

Permalink

Molly Davis added a comment - 14/Feb/23 10:42 AM - edited

Update: After speaking with Nowlan, we agree that running fastqc on the files would be beneficial to check the quality of the data before running it with nextflow.

Script:

#!/bin/bash


#SBATCH --job-name=fastqc               #job name after submission
#SBATCH -p Orion                        #partition being used
#SBATCH -N 1                            #number of nodes to use
#SBATCH --ntasks-per-node=8             #max number of tasks per node
#SBATCH --mem=60gb                      #memory required per node
#SBATCH -t 0-50:00                      #time (D-HH:MM)
#SBATCH -o fastqc.%j.out                #standard output file
#SBATCH -e fastqc.%j.err                #standard error file
#SBATCH --mail-type=END,FAIL            #Notifications for job complete/failure
#SBATCH --mail-user=mdavi258@uncc.edu   #Send to user email


module load fastqc

for i in /nobackup/tomato_genome/30-804059537-KP/*.fastq.gz
do
  	fastqc -o /nobackup/tomato_genome/30-804059537-KP/fastQC_dataQuality $i
done

Directory: /nobackup/tomato_genome/30-804059537-KP/fastQC_dataQuality

For quick reference here are two of the fastqc reports to check data quality:
[^Heinz-Ovary-Rep1-0hr-25C-unpol_R1_001_fastqc.html]
[^Heinz-Ovary-Rep2-0hr-25C-unpol_R1_001_fastqc.html]

Let me know what you think Nowlan Freese

Show

Molly Davis added a comment - 14/Feb/23 10:42 AM - edited Update: After speaking with Nowlan, we agree that running fastqc on the files would be beneficial to check the quality of the data before running it with nextflow. Script: #!/bin/bash #SBATCH --job-name=fastqc #job name after submission #SBATCH -p Orion #partition being used #SBATCH -N 1 #number of nodes to use #SBATCH --ntasks-per-node=8 #max number of tasks per node #SBATCH --mem=60gb #memory required per node #SBATCH -t 0-50:00 #time (D-HH:MM) #SBATCH -o fastqc.%j.out #standard output file #SBATCH -e fastqc.%j.err #standard error file #SBATCH --mail-type=END,FAIL #Notifications for job complete/failure #SBATCH --mail-user=mdavi258@uncc.edu #Send to user email module load fastqc for i in /nobackup/tomato_genome/30-804059537-KP/*.fastq.gz do fastqc -o /nobackup/tomato_genome/30-804059537-KP/fastQC_dataQuality $i done Directory: /nobackup/tomato_genome/30-804059537-KP/fastQC_dataQuality For quick reference here are two of the fastqc reports to check data quality: [^Heinz-Ovary-Rep1-0hr-25C-unpol_R1_001_fastqc.html] [^Heinz-Ovary-Rep2-0hr-25C-unpol_R1_001_fastqc.html] Let me know what you think Nowlan Freese

Molly Davis made changes - 14/Feb/23 11:15 AM

Attachment

Heinz-Ovary-Rep1-0hr-25C-unpol_R1_001_fastqc.html [ 17689 ]

Molly Davis made changes - 14/Feb/23 11:20 AM

Attachment

Heinz-Ovary-Rep2-0hr-25C-unpol_R1_001_fastqc.html [ 17690 ]

Hide

Permalink

Molly Davis added a comment - 14/Feb/23 11:26 AM - edited

Notes about High Duplication:

High duplication is either going to be the result of technical duplication (too many PCR cycles), or over-sequencing (very high fold coverage).
How many PCR cycles were done during the protocol? Also, how many reads were their total?
Dimer Contamination?
Should we just let it through for downstream analysis?

Source: https://wiki.bits.vib.be/index.php/Quality_control_of_NGS_data

Show

Molly Davis added a comment - 14/Feb/23 11:26 AM - edited Notes about High Duplication: High duplication is either going to be the result of technical duplication (too many PCR cycles), or over-sequencing (very high fold coverage). How many PCR cycles were done during the protocol? Also, how many reads were their total? Dimer Contamination? Should we just let it through for downstream analysis? Source: https://wiki.bits.vib.be/index.php/Quality_control_of_NGS_data

Hide

Permalink

Molly Davis added a comment - 14/Feb/23 4:35 PM - edited

Validating mate pair files

Script:

#!/bin/bash


#SBATCH --job-name=validate_script      #job name after submission
#SBATCH -p Orion                        #partition being used
#SBATCH -N 1                            #number of nodes to use
#SBATCH --ntasks-per-node=8             #max number of tasks per node
#SBATCH --mem=60gb                      #memory required per node
#SBATCH -t 0-50:00                      #time (D-HH:MM)
#SBATCH -o validate.%j.out              #standard output file
#SBATCH -e validate.%j.err              #standard error file
#SBATCH --mail-type=END,FAIL            #Notifications for job complete/failure
#SBATCH --mail-user=mdavi258@uncc.edu   #Send to user email
#SBATCH --array=1-63


#setting up where to grab files from
file=$(sed -n -e "${SLURM_ARRAY_TASK_ID}p"  /nobackup/tomato_genome/30-804059537-KP/kp_runlist.txt)

#The command to validate each pair:
cd /nobackup/tomato_genome/30-804059537-KP

perl /projects/tomato_genome/scripts/validateHiseqPairs.pl ${file}_R1_001.fastq.gz ${file}_R2_001.fastq.gz


echo "Done"

Error: mate-pair files are not ordered

Ann's note: We need to understand what exactly this script actually is doing and assessing.

Show

Molly Davis added a comment - 14/Feb/23 4:35 PM - edited Validating mate pair files Script: #!/bin/bash #SBATCH --job-name=validate_script #job name after submission #SBATCH -p Orion #partition being used #SBATCH -N 1 #number of nodes to use #SBATCH --ntasks-per-node=8 #max number of tasks per node #SBATCH --mem=60gb #memory required per node #SBATCH -t 0-50:00 #time (D-HH:MM) #SBATCH -o validate.%j.out #standard output file #SBATCH -e validate.%j.err #standard error file #SBATCH --mail-type=END,FAIL #Notifications for job complete/failure #SBATCH --mail-user=mdavi258@uncc.edu #Send to user email #SBATCH --array=1-63 #setting up where to grab files from file=$(sed -n -e "${SLURM_ARRAY_TASK_ID}p" /nobackup/tomato_genome/30-804059537-KP/kp_runlist.txt) #The command to validate each pair: cd /nobackup/tomato_genome/30-804059537-KP perl /projects/tomato_genome/scripts/validateHiseqPairs.pl ${file}_R1_001.fastq.gz ${file}_R2_001.fastq.gz echo "Done" Error: mate-pair files are not ordered Ann's note: We need to understand what exactly this script actually is doing and assessing.

Hide

Permalink

Ann Loraine added a comment - 16/Feb/23 10:09 AM

[~RobertReid] : MD5 checking shows that the fastq files were not corrupted by the transfer.
Nowlan Freese : Suggests comparing the first few lines per file (via "head" function) to see if pairs are present

Show

Ann Loraine added a comment - 16/Feb/23 10:09 AM [~RobertReid] : MD5 checking shows that the fastq files were not corrupted by the transfer. Nowlan Freese : Suggests comparing the first few lines per file (via "head" function) to see if pairs are present

Hide

Permalink

Ann Loraine added a comment - 16/Feb/23 10:10 AM

Problem is: The mate pair records in the "1" and "2" fastq files per sample are not in the same order in the two files. According to the above script (validating mate pairs) the records are out of order.

Show

Ann Loraine added a comment - 16/Feb/23 10:10 AM Problem is: The mate pair records in the "1" and "2" fastq files per sample are not in the same order in the two files. According to the above script (validating mate pairs) the records are out of order.

Hide

Permalink

Robert Reid added a comment - 16/Feb/23 10:10 AM

MD5 Check:

All the MD5 check out.

Details on this are saved to Google Drive in Kelsie's experiment folder.
https://drive.google.com/drive/folders/1TxUDhJHr9mrXOVysrcceS9YGyTXFHRmo?usp=share_link

Show

Robert Reid added a comment - 16/Feb/23 10:10 AM MD5 Check: All the MD5 check out. Details on this are saved to Google Drive in Kelsie's experiment folder. https://drive.google.com/drive/folders/1TxUDhJHr9mrXOVysrcceS9YGyTXFHRmo?usp=share_link

Hide

Permalink

Ann Loraine added a comment - 16/Feb/23 10:12 AM

Please add validateMatePairs.pl to "src" directory in the repository:

https://bitbucket.org/hotpollen/pistil-rna-seq/src/main/.

Show

Ann Loraine added a comment - 16/Feb/23 10:12 AM Please add validateMatePairs.pl to "src" directory in the repository: https://bitbucket.org/hotpollen/pistil-rna-seq/src/main/ .

Hide

Permalink

Ann Loraine added a comment - 16/Feb/23 10:26 AM

NF suggestion: Run fastqc on output of trim galore to observe new and possibly aberrant size distribution of read sequence

Show

Ann Loraine added a comment - 16/Feb/23 10:26 AM NF suggestion: Run fastqc on output of trim galore to observe new and possibly aberrant size distribution of read sequence

Hide

Permalink

Molly Davis added a comment - 16/Feb/23 1:12 PM - edited

validateMatePairs.pl code:

#!/usr/bin/perl

use strict;
use warnings;

open FH1,$ARGV[0] or die "\n can not open file $ARGV[0]\n";  ## first pair
open FH2,$ARGV[1] or die "\n can not open file $ARGV[1]\n";  ## second pair
my($str1,$str2,$tempStr);
my($n1,$n2);
$n1 = 0;
$n2 = 0;
print " Validating $ARGV[0] and $ARGV[1] \n";
my(@a1,@a2);

	while($str1 = <FH1>){
        $tempStr = <FH1>;
        $tempStr = <FH1>;
        $tempStr = <FH1>;
        ++$n1;

        $str2 = <FH2>;
        $tempStr = <FH2>;
        $tempStr = <FH2>;
        $tempStr = <FH2>;
        ++$n2;

        $str1 =~ s/\n//;
        $str1 =~ s/\r//;
        $str2 =~ s/\n//;
        $str2 =~ s/\r//;


        @a1 = split(/\s+/,$str1);
        @a2 = split(/\s+/,$str2);

        $str1 = $a1[0];
        $str2 = $a2[0];

        $str1 =~ s/(\/\d)$//;
        $str2 =~ s/(\/\d)$//;

                if($str1 ne $str2){
                die "Read pairs not found for $str1 and $str2, mate-pair files are not ordered\n";
                }
        }  ## while(<FH1>) ends
close FH1;
close FH2;

print "Total validated mates: $n1 and $n2: Read-pairs are properly ordered\n";

Next step: Run the code with unzipped fastq files.

gzip -d *.gz

Comment: script finished running and output files say Read-pairs are properly ordered. Decompressing the files helped fix the validate script error.

Show

Molly Davis added a comment - 16/Feb/23 1:12 PM - edited validateMatePairs.pl code: #!/usr/bin/perl use strict; use warnings; open FH1,$ARGV[0] or die "\n can not open file $ARGV[0]\n" ; ## first pair open FH2,$ARGV[1] or die "\n can not open file $ARGV[1]\n" ; ## second pair my($str1,$str2,$tempStr); my($n1,$n2); $n1 = 0; $n2 = 0; print " Validating $ARGV[0] and $ARGV[1] \n" ; my(@a1,@a2); while ($str1 = <FH1>){ $tempStr = <FH1>; $tempStr = <FH1>; $tempStr = <FH1>; ++$n1; $str2 = <FH2>; $tempStr = <FH2>; $tempStr = <FH2>; $tempStr = <FH2>; ++$n2; $str1 =~ s/\n //; $str1 =~ s/\r //; $str2 =~ s/\n //; $str2 =~ s/\r //; @a1 = split(/\s+/,$str1); @a2 = split(/\s+/,$str2); $str1 = $a1[0]; $str2 = $a2[0]; $str1 =~ s/(\/\d)$ //; $str2 =~ s/(\/\d)$ //; if ($str1 ne $str2){ die "Read pairs not found for $str1 and $str2, mate-pair files are not ordered\n" ; } } ## while (<FH1>) ends close FH1; close FH2; print "Total validated mates: $n1 and $n2: Read-pairs are properly ordered\n" ; Next step: Run the code with unzipped fastq files. gzip -d *.gz Comment: script finished running and output files say Read-pairs are properly ordered. Decompressing the files helped fix the validate script error.

Hide

Permalink

Robert Reid added a comment - 17/Feb/23 9:48 AM

I ran trimmomatic on the raw data to see what would happen.

Resulting files are located here:
/nobackup/tomato_genome/30-804059537-KP/trimmoTest

Script to run it is here:
/projects/tomato_genome/scripts/rob/trimmomatic-temp.slurm

The script will validate pairs afterwards. (using option -validate pairs)
It appears that 99.95% of all reads are great.
About 2-8 reads are bad per pairing.
Those reads are saved as unpaired.fastq files.

We could now run nextflow with these read files instead.

Show

Robert Reid added a comment - 17/Feb/23 9:48 AM I ran trimmomatic on the raw data to see what would happen. Resulting files are located here: /nobackup/tomato_genome/30-804059537-KP/trimmoTest Script to run it is here: /projects/tomato_genome/scripts/rob/trimmomatic-temp.slurm The script will validate pairs afterwards. (using option -validate pairs) It appears that 99.95% of all reads are great. About 2-8 reads are bad per pairing. Those reads are saved as unpaired.fastq files. We could now run nextflow with these read files instead.

Hide

Permalink

Ann Loraine added a comment - 17/Feb/23 10:14 AM

[~RobertReid] suggests counting number of aligned versus unaligned reads, for each BAM file we have.

Show

Ann Loraine added a comment - 17/Feb/23 10:14 AM [~RobertReid] suggests counting number of aligned versus unaligned reads, for each BAM file we have.

Hide

Permalink

Robert Reid added a comment - 17/Feb/23 10:21 AM

To use samtools to view the aligned and unaligned.

READS MAPPED:
module load samtools
samtools view -c -F 4 nagcarlang-sorted.bam

For Unmapped:
samtools view -c -f 4 nagcarlang-sorted.bam

Let's calculate coverage:
samtools depth nagcarlang-sorted.bam | awk '

{sum+=$3}

END

{ print "Average = ",sum/NR}

'

Put these lines into a slurm script. Should run very quickly.

Show

Robert Reid added a comment - 17/Feb/23 10:21 AM To use samtools to view the aligned and unaligned. READS MAPPED: module load samtools samtools view -c -F 4 nagcarlang-sorted.bam For Unmapped: samtools view -c -f 4 nagcarlang-sorted.bam Let's calculate coverage: samtools depth nagcarlang-sorted.bam | awk ' {sum+=$3} END { print "Average = ",sum/NR} ' Put these lines into a slurm script. Should run very quickly.

Hide

Permalink

Molly Davis added a comment - 17/Feb/23 2:19 PM

Created pull request to add src folder and perl validation file to bitbucket:

https://bitbucket.org/hotpollen/pistil-rna-seq/pull-requests/1

[~aloraine]

Show

Molly Davis added a comment - 17/Feb/23 2:19 PM Created pull request to add src folder and perl validation file to bitbucket: https://bitbucket.org/hotpollen/pistil-rna-seq/pull-requests/1 [~aloraine]

Hide

Permalink

Molly Davis added a comment - 17/Feb/23 4:20 PM - edited

Created script to use samtools to view the aligned and unaligned:

#!/bin/bash


#SBATCH --job-name=samtools_view        #job name after submission
#SBATCH -p Orion                        #partition being used
#SBATCH -N 1                            #number of nodes to use
#SBATCH --ntasks-per-node=8             #max number of tasks per node
#SBATCH --mem=900gb                     #memory required per node
#SBATCH -t 14-00:00                     #time (D-HH:MM)
#SBATCH -o samtools_view.%j.out         #standard output file
#SBATCH --mail-type=END,FAIL            #Notifications for job complete/failure
#SBATCH --mail-user=mdavi258@uncc.edu   #Send to user email
#SBATCH --array=1-63


file=$(sed -n -e "${SLURM_ARRAY_TASK_ID}p"  /nobackup/tomato_genome/30-804059537-KP/kp_runlist.txt)

module load samtools
echo "Mapped:" ${file}
samtools view -c -F 4 ${file}.bam
echo  "Unmapped:" ${file}
samtools view -c -f 4 ${file}.bam

echo "Calculate Coverage" 
samtools depth ${file}.bam | awk '{sum+=$3} END { print "Average = ",sum/NR}'

echo "done"
echo "---------------------------------------------------------"

Directory: /nobackup/tomato_genome/30-804059537-KP/results/star_salmon

Combined output files into one:

cat *.out > ./mergedsamtoolsOut.txt

Output File:

mergedsamtoolsOut.txt

Show

Molly Davis added a comment - 17/Feb/23 4:20 PM - edited Created script to use samtools to view the aligned and unaligned: #!/bin/bash #SBATCH --job-name=samtools_view #job name after submission #SBATCH -p Orion #partition being used #SBATCH -N 1 #number of nodes to use #SBATCH --ntasks-per-node=8 #max number of tasks per node #SBATCH --mem=900gb #memory required per node #SBATCH -t 14-00:00 #time (D-HH:MM) #SBATCH -o samtools_view.%j.out #standard output file #SBATCH --mail-type=END,FAIL #Notifications for job complete/failure #SBATCH --mail-user=mdavi258@uncc.edu #Send to user email #SBATCH --array=1-63 file=$(sed -n -e "${SLURM_ARRAY_TASK_ID}p" /nobackup/tomato_genome/30-804059537-KP/kp_runlist.txt) module load samtools echo "Mapped:" ${file} samtools view -c -F 4 ${file}.bam echo "Unmapped:" ${file} samtools view -c -f 4 ${file}.bam echo "Calculate Coverage" samtools depth ${file}.bam | awk '{sum+=$3} END { print "Average = " ,sum/NR}' echo "done" echo "---------------------------------------------------------" Directory: /nobackup/tomato_genome/30-804059537-KP/results/star_salmon Combined output files into one: cat *.out > ./mergedsamtoolsOut.txt Output File: mergedsamtoolsOut.txt

Molly Davis made changes - 17/Feb/23 4:45 PM

Attachment

mergedsamtoolsOut.txt [ 17707 ]

Molly Davis made changes - 17/Feb/23 4:46 PM

Status

In Progress [ 3 ]

Needs 1st Level Review [ 10005 ]

Ann Loraine made changes - 19/Feb/23 2:49 PM

Description

For this task, process new data set from the Palanivelu lab. These data are from Kelsey

RR has downloaded the data onto the UNCC cluster and saved it here: /projects/tomato_genome/rnaseq/30-804059537-kelsie.
Please do the data processing in this directory:

* /nobackup/tomato_genome/30-804059537-KP (kp for "Kelsey Pryze")

Bitbucket repo: https://bitbucket.org/hotpollen/pistil-rna-seq

To-do:

* Run nf-core/rnaseq pipeline with SL5/2022 target genome assembly using "reverse" strandedness parameter.
* Check the multi-qc report (attack to this ticket). Re-run processing as necessary.
* Rename BAM files to not included "sorted" in the name.
* Create scaled coverage graphs.
* Create junction files.
* Migrate data to an on-line location for IGB visualization
* Create annots.xml metadata file for visualization

Attached:
* Azenta (sequencing provider) data report, with numbers of sequences produced
* Quote from Azenta indicating strand-specific RNA-Seq, 2x150 bp paired end sequencing

Contact:
* Kelsey Pryze - kelseypryze@email.arizona.edu

For this task, process new data set from the Palanivelu lab. These data are from Kelsey

RR has downloaded the data onto the UNCC cluster and saved it here: /projects/tomato_genome/rnaseq/30-804059537-kelsie.
Please do the data processing in this directory:

* /nobackup/tomato_genome/30-804059537-KP (kp for "Kelsey Pryze")

Bitbucket repo: https://bitbucket.org/hotpollen/pistil-rna-seq

To-do:

* Run nf-core/rnaseq pipeline with SL5/2022 target genome assembly using "reverse" strandedness parameter.
* Check the multi-qc report (attach to this ticket). Re-run processing as necessary.
* Rename BAM files to not included "sorted" in the name.
* Create scaled coverage graphs.
* Create junction files.
* Migrate data to an on-line location for IGB visualization.
* Create annots.xml metadata file with visualization parameters.

Attached:
* Azenta (sequencing provider) data report, with numbers of sequences produced
* Quote from Azenta indicating strand-specific RNA-Seq, 2x150 bp paired end sequencing

Contact:
* Kelsey Pryze - kelseypryze@email.arizona.edu

Ann Loraine made changes - 19/Feb/23 2:52 PM

Link

This issue relates to IGBF-3261 [ IGBF-3261 ]

Nowlan Freese made changes - 21/Feb/23 3:03 PM

Sprint

Spring 3 2023 Feb 1 [ 163 ]

Spring 3 2023 Feb 1, Spring 4 2023 Feb 21 [ 163, 164 ]

Nowlan Freese made changes - 21/Feb/23 3:03 PM

Rank

Ranked higher

Ann Loraine made changes - 23/Feb/23 10:23 AM

Story Points

2

4

Ann Loraine made changes - 23/Feb/23 10:29 AM

Assignee

Molly Davis [ molly ]

Hide

Permalink

Ann Loraine added a comment - 23/Feb/23 12:21 PM - edited

Ann's comments:

Based on output above:

The bam files do not contain any unmapped reads, only mapped reads
The samtools "depth" command computes the number of alignments per base pair position - see http://www.htslib.org/doc/samtools-depth.html
For transcriptome data, the "depth" command does not make a lot of sense because the depth of read alignments at any given position depends on whether or not that position is inside an exon, and also on the level of expression of that exon
I don't know what "NR" means and where this is coming from in the "sum/NR" statement at the end of the script

Conclusion: This output of this script script does not explain the QC result.

We do not know why some of the samples did not perform well. Let's proceed with the pipeline and visualize the data in a genome browser as this visualization step may reveal more information about the problematic samples.

Show

Ann Loraine added a comment - 23/Feb/23 12:21 PM - edited Ann's comments: Based on output above: The bam files do not contain any unmapped reads, only mapped reads The samtools "depth" command computes the number of alignments per base pair position - see http://www.htslib.org/doc/samtools-depth.html For transcriptome data, the "depth" command does not make a lot of sense because the depth of read alignments at any given position depends on whether or not that position is inside an exon, and also on the level of expression of that exon I don't know what "NR" means and where this is coming from in the "sum/NR" statement at the end of the script Conclusion: This output of this script script does not explain the QC result. We do not know why some of the samples did not perform well. Let's proceed with the pipeline and visualize the data in a genome browser as this visualization step may reveal more information about the problematic samples.

Ann Loraine made changes - 23/Feb/23 12:23 PM

Status

Needs 1st Level Review [ 10005 ]

First Level Review in Progress [ 10301 ]

Ann Loraine made changes - 23/Feb/23 12:23 PM

Status

First Level Review in Progress [ 10301 ]

To-Do [ 10305 ]

Ann Loraine made changes - 23/Feb/23 12:23 PM

Assignee

Molly Davis [ molly ]

Ann Loraine made changes - 23/Feb/23 12:25 PM

Sprint

Spring 3 2023 Feb 1, Spring 4 2023 Feb 21 [ 163, 164 ]

Spring 3 2023 Feb 1, Spring 5 2023 Mar 6 [ 163, 165 ]

Ann Loraine made changes - 07/Mar/23 8:02 AM

Sprint

Spring 3 2023 Feb 1, Spring 5 2023 Mar 6 [ 163, 165 ]

Spring 3 2023 Feb 1, Spring 6 2023 Mar 20 [ 163, 166 ]

Ann Loraine made changes - 20/Mar/23 7:08 AM

Sprint

Spring 3 2023 Feb 1, Spring 6 2023 Mar 20 [ 163, 166 ]

Spring 3 2023 Feb 1, Spring 7 2023 Apr 10 [ 163, 167 ]

Ann Loraine made changes - 13/Apr/23 2:09 PM

Rank

Ranked higher

Ann Loraine made changes - 17/Apr/23 10:30 AM

Sprint

Spring 3 2023 Feb 1, Spring 7 2023 Apr 10 [ 163, 167 ]

Spring 3 2023 Feb 1 [ 163 ]

Molly Davis made changes - 24/Apr/23 10:40 AM

Issue Type	Task [ 3 ]	Epic [ 10000 ]
Sprint	Spring 3 2023 Feb 1 [ 163 ]	Spring 3 2023 Feb 1, Spring 8 2023 Apr 24 [ 163, 168 ]

Molly Davis made changes - 24/Apr/23 10:40 AM

Epic Link

IGBF-2993 [ 21429 ]

Molly Davis made changes - 24/Apr/23 10:42 AM

Epic Name

Process Kelsey's Palanivelu Lab data

Molly Davis made changes - 24/Apr/23 10:50 AM

Epic Child

~~IGBF-3323~~ [ 22345 ]

Molly Davis made changes - 24/Apr/23 10:53 AM

Epic Child

~~IGBF-3324~~ [ 22346 ]

Molly Davis made changes - 24/Apr/23 10:53 AM

Epic Color

ghx-label-6

Molly Davis made changes - 24/Apr/23 10:56 AM

Epic Child

~~IGBF-3325~~ [ 22347 ]

Molly Davis made changes - 24/Apr/23 10:58 AM

Epic Child

~~IGBF-3326~~ [ 22348 ]

Molly Davis made changes - 24/Apr/23 12:08 PM

Comment

[ Update:
* Created sample sheet:
[^KP_samples.csv]

* Started Nextflow pipeline. ]

Molly Davis made changes - 24/Apr/23 12:30 PM

Epic Child

~~IGBF-3328~~ [ 22350 ]

Molly Davis made changes - 24/Apr/23 12:33 PM

Link

This issue relates to ~~IGBF-3328~~ [ ~~IGBF-3328~~ ]

Molly Davis made changes - 24/Apr/23 12:36 PM

Link

This issue relates to ~~IGBF-3325~~ [ ~~IGBF-3325~~ ]

Molly Davis made changes - 24/Apr/23 12:36 PM

Link

This issue relates to ~~IGBF-3326~~ [ ~~IGBF-3326~~ ]

Ann Loraine made changes - 01/May/23 10:48 AM

Sprint

Spring 3 2023 Feb 1, Spring 8 2023 Apr 24 [ 163, 168 ]

Spring 3 2023 Feb 1, Spring 8 2023 Apr 24, Spring 9 2023 May 8 [ 163, 168, 169 ]

Ann Loraine made changes - 01/May/23 10:48 AM

Rank

Ranked higher

Molly Davis made changes - 02/May/23 4:03 PM

Attachment

Heinz-Ovary-Rep1-0hr-25C-unpol_R1_001_fastqc.html [ 17689 ]

Molly Davis made changes - 02/May/23 4:03 PM

Attachment

Heinz-Ovary-Rep2-0hr-25C-unpol_R1_001_fastqc.html [ 17690 ]